Fieldcraft / instrument register

Field equipment for AI agents

Six command-line instruments. One output contract. Every specification measured before printing.

Calibration recordrecon ask "what is prospera" --json

Fieldcraft calibration gaugeA drawn gauge dial with tick marks, a yellow arc, and a needle settled in the measured range.JSONEXIT 0
Needle settles once. Spec holds without motion.

The contract

One envelope. Stable exit codes. Self-describing offline. An agent holds the whole interface in a single read.

  • Stamped specOne JSON envelope for every response
  • Stamped specStable, documented exit codes
  • Stamped specSelf-describing offline — schema, flags, errors from the binary itself
  • Stamped specBudget-metered — every call reports what it cost
  • Stamped specThe agent is the primary user; the human reviews
Envelope JSON / instrument LCD readout
{
  "schema": "recon.cli.response.v1",
  "ok": true,
  "command": "ask",
  "data": {
    "question": "what is prospera",
    "outcome": "answered",
    "claims": [
      {
        "claim": "Próspera is a ZEDE in Honduras.",
        "sourceUrl": "https://example.com/source",
        "verdict": "supported",
        "published": "2026-07-01"
      }
    ],
    "searchTrail": [{ "query": "prospera law", "results": 4 }],
    "uncertainties": []
  },
  "costDollars": { "model": 0.09, "search": 0.04, "total": 0.13 },
  "diagnostics": { "durationMs": 12100, "retries": 0 }
}
Stable exit-code calibration
codemeaning
0ok
2auth
4network
6rate limit
10partial — budget hit, work reported

The kit

Specifications as measured. No projected figures.

Ruled specification table for the six Fieldcraft instruments.
instrumentfunctioninterface → installsourcemeasured
reconSource-verified research briefsAdversarial claim verification against fetched sources. Outputs a cited brief.recon brief "<question>" --jsoncargo install --git https://github.com/treygoff24/reconGitHub14.4 s · $0.15 · 19 supported claims (live)
lensImage-library indexing and searchCaptions an entire photo library, then answers natural-language queries in ~2 s.lens find "<description>" --jsoncargo install --git https://github.com/treygoff24/lensGitHub1,100/1,100 images · 98 s · $2.21 · search ~2 s
exa-agentFull Exa search API surfaceEvery command, flag, exit code, and error emitted by the binary itself, offline.exa-agent capabilities --compactGitHub68 commands · self-describing offline
elvElevenLabs operationsThe full ElevenLabs surface as a disciplined CLI. No server between you and the API.elv capabilities --jsonGitHub300+ operations, one envelope
delegateCross-harness model dispatchHands a bounded task to another model family. Isolated workspaces, reviewable diffs.delegate <lane> work --prompt-file <task>pip install "delegate-agent @ git+https://github.com/treygoff24/delegate-agent.git"GitHub6+ model families · safe/work isolation
lawLegal research with real citationsLegal research that cites what exists. Verification is the feature.law research "<question>" --jsonGitHubagent-first citations, checked

Calibration record

Live acceptance runs, real data, real APIs.

All figures from live acceptance runs, 2026-07-01. Local builds, real data, real APIs.

The method

Tooling that builds tooling. Decorrelated model families, coordinator-verified, in production since January 2025.

Fieldcraft started as a bet in January 2025: an agent loop that couldn't declare itself done until the work actually was. That became the autonomous loop — the /goal architecture, completion enforced by a Stop hook — shipped long before Anthropic or OpenAI offered anything like it natively.

The loop grew into an autonomous dev kit. It built delegate, which put every model family on tap. delegate grew into foundry: waves of different models implement, review, and fix each other's work while a coordinator re-verifies every claim. Then foundry started shipping products — recon and lens were each built in one evening, for under $20 of inference, receipts attached.

The tools build the tools. This site was built the same way — and the log of what broke is in the repo.

Built by Trey Goff and the team behind Praxient.

wright

Builds and maintains the agents.

memorum

Memory that survives across sessions.

agentlinters

Lint rules for agent-written code.

llm-council

Multiple models deliberate hard calls.

Praxient interface

Fieldcraft is the workshop record. Praxient applies the same bounded, measured discipline to operating systems and advisory work.

Work with Praxient