No description
Find a file
Kevoun b1dc0f3af5
Some checks failed
fences / fences (push) Failing after 7s
v0.7.0 peer-review round 2: address Codex pass-2 findings
Pass 2 confirmed all six pass-1 findings resolved. New findings, all fixed:
- Untrack the stray peeragent run log accidentally swept into 75baa61 by
  `git add -A`, and gitignore .peeragent/ so review logs never get committed again.
- README.md: bump v0.6.0 -> v0.7.0 (license/versioning + status lines) and add
  the v0.7.0 entry to the running version-history summary.
- Defensibility positions: the §9-mapped store-pole + record-provenance positions
  said "nine fields"; the registration grew to ten in v0.7.0 (decision_relevance),
  corrected to ten (the store-pole posture they argue is count-independent).

gen-contract in sync · meta-fences 4/4 · conformance 57/57.
2026-06-24 17:42:43 +00:00
.forgejo/workflows ci: add Forgejo Actions workflow running the three fences 2026-06-17 20:23:15 +00:00
.research v0.7.0 peer-review round 2: address Codex pass-2 findings 2026-06-24 17:42:43 +00:00
example v0.7.0 peer-review round 1: address Codex findings 2026-06-24 17:30:24 +00:00
kernel v0.7.0 peer-review round 1: address Codex findings 2026-06-24 17:30:24 +00:00
tools v0.7.0 peer-review round 1: address Codex findings 2026-06-24 17:30:24 +00:00
.gitignore v0.7.0 peer-review round 2: address Codex pass-2 findings 2026-06-24 17:42:43 +00:00
ADOPTING.md docs: surface kernel-vs-deployment coverage + networking behavior; pre-1.0 quickstart caveat 2026-06-17 20:25:09 +00:00
AGENTS.md docs: add Codex + harness-adapter affordances; drop standalone adversarial-review doc 2026-06-17 20:22:31 +00:00
ard.json v0.7.0 — engagement economics & acquisition realism 2026-06-24 17:11:57 +00:00
CATALOGS.md v0.7.0 peer-review round 1: address Codex findings 2026-06-24 17:30:24 +00:00
CLAUDE.md ARD v0.1 — agent-agnostic research-discipline framework for AI agents 2026-06-02 15:21:04 +00:00
CODEX.md docs: add Codex + harness-adapter affordances; drop standalone adversarial-review doc 2026-06-17 20:22:31 +00:00
GEMINI.md ARD v0.1 — agent-agnostic research-discipline framework for AI agents 2026-06-02 15:21:04 +00:00
HARNESS_ADAPTERS.md docs: add Codex + harness-adapter affordances; drop standalone adversarial-review doc 2026-06-17 20:22:31 +00:00
LICENSE ARD v0.1 — agent-agnostic research-discipline framework for AI agents 2026-06-02 15:21:04 +00:00
README.md v0.7.0 peer-review round 2: address Codex pass-2 findings 2026-06-24 17:42:43 +00:00
SPEC.md v0.7.0 peer-review round 1: address Codex findings 2026-06-24 17:30:24 +00:00
VERSIONING.md v0.7.0 peer-review round 1: address Codex findings 2026-06-24 17:30:24 +00:00

Agentic Research Discipline (ARD)

ARD is an agent-agnostic framework for grounded, verifiable research conducted by AI agents. It codifies a research discipline — an anti-fabrication core, a multi-stage verification stack, and a per-engagement control-space navigated against a non-erodable floor — that addresses a documented inventory of agentic research-failure shapes. It is designed to be adopted by any agent system, not tied to one.

ARD structures a research engagement as a control-space — a floor of always-on checks plus selectable gates, dialed to the engagement and walked as a decision-graph — and measures it against a catalogue of documented ways agent research goes wrong. The catalogue's canonical entry, and the shape ARD fences most tightly, is anchor-and-drift fabrication (GR.1): an agent fetches a real source, uses it correctly for a verifiable detail (a figure, a version, a name), then composes the surrounding attribution (which dataset, which baseline, which version) from training-recall rather than from the source. The fetch makes it look grounded; the context drifts. Anti-fabrication is the always-on floor; the verification stack and the per-engagement controls extend outward from it.

Where to start

  • To understand the framework — read SPEC.md (the architecture) alongside CATALOGS.md (the baseline catalogs you populate and extend). These two are self-contained; everything else is concrete support, not a prerequisite.
  • To adopt itADOPTING.md is a four-tier guide with a minimum-viable quickstart. Partial adoption is first-class — stop at any tier.
  • To adapt it to another harnessHARNESS_ADAPTERS.md shows the thin-adapter pattern for Codex, Claude, Gemini, and other agent systems.
  • To vendor the reference surfacekernel/ is what every adopter copies as-is: the runnable lint and the artifact templates (reference implementations of the spec, assuming only its §4.2 contract). Re-point the paths and it runs.
  • To see it wired upexample/ is one concrete instantiation on the Claude agent system, built around the kernel.
  • To check the reasoning.research/ traces why each commitment is shaped the way it is.

What's here

Surface Path What it is
Overview (this file) README.md the front door
Framework specification — the architecture SPEC.md what is invariant across adopters
Baseline catalogs — grow by extension CATALOGS.md the baseline inventory you populate
Adoption guide ADOPTING.md four tiers, stop at any
Harness adapter guide HARNESS_ADAPTERS.md thin adapters for Codex, Claude, Gemini, and others
Release manifest — pin + diff this ard.json version + the enumerated vendorable surface
Vendorable reference surface kernel/ discipline bundle, lint, templates, schema, catalog data, conformance
Worked example example/ one Claude-agent-system instantiation
Defensibility trace .research/ curated reasoning behind each commitment
License LICENSE MIT
Versioning policy VERSIONING.md Semantic Versioning

License & versioning

MIT — permissive; free to use, modify, and redistribute with attribution. Versioned by Semantic Versioning: currently v0.7.0, pre-1.0, so no stability promise yet — anything may change before 1.0.

Status

v0.7.0, pre-1.0. The architecture is proposed; this repository extracts and neutralizes it into a standalone, deployment-neutral form. v0.2 grew the baseline catalogs (the WM failure-locus, per-class IP profiles); v0.3.0 added a derived consumption contract over the canonical prose — a release manifest, generated catalog data, a verbatim discipline bundle, and a conformance set — so adopters consume updates as data instead of re-deriving from prose; v0.4.0 adds two optional, additive contracts harvested from independent deployments — typed cross-references (the related: frontmatter + a twelve-predicate vocabulary) and a sixth citation-chain check, handle uniqueness; v0.4.1 hardens the reference lint's source_url liveness probe against SSRF (a PATCH — no commitment change); v0.5.0 promotes the decomposition-rationale artifact to a canonical data contract (SPEC §10.6) and adds a proactive-enrichment acquisition discipline that fences candidate-suggestion against recall-fabrication (the AQ.3 failure shape); v0.5.1 adds false-positive suppression (frontmatter, code blocks, inline code, URLs, blockquotes, attestation files) and a --stats audit mode to the reference lint (a PATCH — no commitment change); v0.6.0 is the largest inventory bump — re-engagement & correction-propagation (a refresh change-mode, the reconcile verb + a spot-check job catalog, the PR.3 non-propagated-correction fence, GR.8) and catalog/tooling growth (the source-code source class + the vendored-source acquisition pattern, reference-lint hardening + CI meta-fences, and the one migration item — the substrate_confidence fail-open deprecation); v0.7.0 adds engagement economics & acquisition realism — the AQ.4 inadequate-attempt-blocking shape + acquisition-mode-exhaustion, the forward-looking GR.9 metadata-recall-sourcing shape + metadata-tier source-binding, a model-diversity verification property (partial decorrelation of the correlated-verifier blind-spot), and the decision_relevance value-of-information yield gate (with one migration item — decision_relevance joins the always-present registration shape). See VERSIONING.md §Version history. Pre-1.0 means the catalogs and mechanism details may still move.