How agent-ready is Anthropic?

Independent agentability audit of Anthropic, scored across the 8 principles of Agent Factors Engineering — how well AI agents can parse, navigate, and operate it.

Audited June 12, 2026 · Rubric v0 · 3 page(s) evaluated

Audit summary

Anthropic's public website scored 25/100 on its homepage and 26/100 on pricing pages, indicating limited agentability for autonomous AI systems attempting to parse and interact with these surfaces. Documentation fared better at 48/100, though still below the threshold for smooth agent operation. These scores reflect significant gaps in how the site exposes structure, state, and control mechanisms that agents rely on.

The audit identified three principles with zero scores—status reporting, clean handoffs, and transparency—meaning the site provides no machine-readable feedback on system state, minimal structured error context, and no audit trails or confidence metadata. Control mechanisms scored 19/100, lacking explicit confirmation patterns and cancel affordances for destructive or long-running actions. Machine readability (65/100) and defaults (59/100) were relative strengths, while chunking (41/100) and shadow UI avoidance (15/100) fell into the middle range.

For a company building agent-capable AI models, these scores suggest that Anthropic's own digital properties have not yet been optimized for the autonomous agent workflows its products enable. Addressing the transparency, control, and error-handling gaps would materially improve agent integration outcomes.

Score by principle

Machine Readability65 / 100

Chunking41 / 100

Control19 / 100

Status0 / 100

Defaults59 / 100

Clean Handoffs0 / 100

No Shadow UI15 / 100

Transparency0 / 100

Key findings

Clean Handoffs

On errors return structured context (what failed, error code, offending field) in both UI and API — not an opaque 'Something went wrong'.

Transparency

Surface confidence/certainty on key outputs (a confidence field in API responses, or a 'verified vs best-guess' label in the UI).

Transparency

Offer a one-line summary plus an expandable drill-down for important results.

Transparency

Attach source references or input citations (link or id) to generated outputs.

Control

Gate destructive actions behind an explicit, distinctly-labelled confirmation step — not a same-styled button.

Control

Provide a labelled pause/cancel/stop control for long-running actions; give it an accessible name like 'Cancel import', not a bare icon.

Machine Readability

Among the stronger areas for Anthropic, scored 65/100.

Defaults

Among the stronger areas for Anthropic, scored 59/100.

How Anthropic could improve its score

Anthropic can improve agentability by implementing the following changes:

Return structured error context in both UI and API responses, including specific failure reasons, error codes, and the fields or inputs that triggered the problem, rather than generic messages like 'Something went wrong'.
Add an activity log or audit trail that records system actions in machine-readable format (such as a JSON event feed), and surface a user-facing version to explain results and decisions made by the platform.
Gate destructive actions—such as deletions or irreversible changes—behind explicit, visually distinct confirmation steps, and provide undo or override capabilities for completed actions where feasible.
Include confidence or certainty indicators on key outputs in API responses and UI, such as 'verified' versus 'best-guess' labels, to help agents assess reliability.
Provide labeled pause, cancel, or stop controls for long-running operations with accessible names (e.g., 'Cancel upload'), not icon-only buttons.
Attach source references, input citations, or identifiers to generated outputs, and offer expandable drill-downs on important results—a summary line with optional detail view.

Work at Anthropic? Re-audit any page free.

Scores refresh automatically when we re-crawl — or run an instant audit on any URL now.

Run a free audit

Embed this score

Show your agent-readiness in your docs or README. The badge links back to this live report.