Skip to content

ForgeOps Brand Page: Harness Engineering

Status: Active Updated: 2026-03-01

Build Fast. Keep Entropy Under Control.

ForgeOps is not just another Agent CLI wrapper. ForgeOps is an AI R&D control plane that turns Issue -> Run -> Step -> PR into an observable, recoverable, governable delivery system.

ForgeOps Is Not a Skill. It Is a Control Plane for an Agent Team.

ForgeOps itself is the control plane, not a single skill. To make it easy to enter the control-plane workflow from any agent runtime, we ship a Meta Skill as the entrypoint (best experience with Codex today):

  • The skill guides the agent to call the forgeops CLI and operate managed projects
  • ForgeOps orchestrates a multi-role team (architect, issue manager, developer, tester, reviewer, governance/GC) with Codex as the default runtime
  • Every action is persisted as Run -> Step -> Session state for observability, replay, and recovery

From a user perspective, the prerequisites are intentionally small:

  • working network access
  • GitHub + Codex accounts/tokens

When throughput grows, attention becomes the real bottleneck

In agent-driven engineering, the challenge is not only writing faster code. The challenge is reducing repeated failures, architectural drift, and stale documentation.

Common failure modes:

  • Context distortion: critical project knowledge is not consistently injected.
  • False quality signals: CI passes while real runtime behavior fails.
  • Poor traceability: root causes are scattered across chat and log fragments.
  • Entropy accumulation: local patches compound into systemic tech debt.

How ForgeOps addresses this:

  • Structured workflow: project-level workflow.yaml governs execution.
  • Dual runtime gates: both CI Gate and Platform Gate are enforced.
  • Session recovery: resume context first, avoid costly restarts.
  • Scheduled governance: cleanup and automation loops reduce entropy continuously.

CLI Status Cards (SVG)

When you need to share “what is the system doing right now” as evidence, ForgeOps CLI can generate an SVG status card and store it under the runtime config directory (default ~/.forgeops/charts/). Agents can run the command, pick up the file path, and attach it to a chat/issue.

bash
forgeops chart system
forgeops chart project <projectId>

Harness is an executable discipline, not a slogan

1. Context Engineering

Use a short map (AGENTS.md), deep document index (docs/00-index.md), and skill assembly rules to control context size and improve re-entry.

2. Architectural Constraints

Turn boundaries, invariants, and dependency rules into machine-checkable constraints.

3. Observability

Track run / step / session / events / artifacts end-to-end for diagnosis and replay.

4. Garbage Collection

Move debt cleanup from ad-hoc incidents into a regular system loop.

Dual Loop Model: Delivery Loop + Harness Loop

Delivery loop (default 6 steps)

  1. Architect: define boundaries and design constraints.
  2. Issue: create a structured requirement entry.
  3. Implement: build in isolated worktrees and commit safely.
  4. Test: run tests and platform acceptance checks.
  5. Review: converge on quality and risk.
  6. Cleanup: reduce entropy and capture reusable capabilities.

Harness loop (anti-regression)

  1. Observe recurring failures.
  2. Locate missing capabilities (tooling/rules/context).
  3. Encode experience into mechanisms (docs/scripts/invariants/skills).
  4. Verify recurrence drops in real execution.

Core capabilities

  • Runtime Adapter: stable boundary, Codex-first runtime today.
  • GitHub strong process: Issue-Only intake and PR-based closure loop.
  • Session Recovery: continue from interrupted context whenever possible.
  • Quality Gates: invariants + platform acceptance.
  • Scheduler Automation: cleanup / issue auto-run / skill promotion.
  • Skill Governance: candidate promotion path decoupled from delivery DAG.

Standard flow (How It Works)

  1. Create or receive a GitHub issue.
  2. Create a run and bind an isolated worktree.
  3. Schedule and execute steps via DAG.
  4. Enforce gates with budgeted self-healing rounds.
  5. Merge PR, clean up, and write back final states.

Verifiable capabilities (current)

  • Stable runtime baseline on Node.js 22+.
  • Structured documentation and process governance checks.
  • Quick-by-default mode with explicit standard escalation.
  • GitHub Pages automated publishing pipeline.

Next evidence to publish

Before scaling the public narrative, prioritize three evidence types:

  • Real runtime metrics (success rate, recovery rate, recurrence rate).
  • User stories (team size, context, measurable outcomes).
  • Public demos (docs site, sample repos, short walkthrough videos).

Released under the MIT License.