Skip to content

ReadMe Has an AI Linter Now. It's Solving the Wrong Problem.

ReadMe launched an AI Linter. Mintlify has Agent Score. The documentation industry is racing to automate quality checks. They're all measuring structure. None of them measure truth.

boringdocs
AI linter ReadMe Mintlify documentation quality doc scoring validation competitive analysis

ReadMe Has an AI Linter Now. It’s Solving the Wrong Problem.

ReadMe just launched an AI Linter that scans your documentation for quality issues. Mintlify has Agent Score. GitBook is building AI-assisted editing into its core. The documentation platform industry has decided that AI-powered quality is the next battleground.

They’re all measuring the wrong thing.

What ReadMe’s AI Linter Does

ReadMe’s AI Linter scans your docs and flags issues: unclear descriptions, missing examples, inconsistent formatting, style violations. It’s a grammar and structure checker for documentation. Think Grammarly for API docs.

This is useful. If your docs have typos, inconsistent terminology, or missing code examples, the linter will catch them. It’s a quality gate that runs before publish.

Mintlify’s Agent Score does something similar from the outside. You give it a URL, it scores your docs on structure, navigability, and AI-readability. It tells you whether an AI agent can efficiently parse your documentation.

Both tools measure structural quality. Is the doc well-formed? Is it complete? Is it consistent? Is it navigable?

These are real problems. And neither tool answers the question that matters most: Is any of it true?

The Quality Trap

Here’s the trap: a document can be perfectly structured and completely wrong.

An API endpoint description can have a clear title, a well-formatted parameter table, a properly syntax-highlighted code example, and a complete response schema — and every single field type can be wrong.

ReadMe’s linter will give it a 95. Mintlify’s Agent Score will give it a 90. The docs look great. The AI agent reads them efficiently. The generated code compiles, deploys, and fails at runtime because the amount field is documented as an integer and the API expects a string.

Structural quality is a necessary condition. It’s not a sufficient one.

This is the food-safety analogy again, but it bears repeating: you can inspect a restaurant’s cleanliness, service speed, and menu design. A restaurant can ace all three and still serve contaminated food. If the food is unsafe, nothing else matters.

Documentation is the same. If the content is wrong, structural quality just means your mistakes are well-organized.

What the Industry Is Building

The documentation platform industry is in an AI arms race. Here’s what’s launching:

ReadMe: AI Linter (quality scoring), Docs Audit (completeness checking), MCP server (AI agent access), GitHub AI Writer (AI-assisted authoring).

Mintlify: Agent Score (AI-readability scoring), collaborative editor (human + AI co-authoring), Help Center Starter Kit (templates).

GitBook: AI-assisted editing, “knowledge that never stands still” (continuous updates).

Every one of these tools is solving the authoring problem or the structure problem. None of them are solving the accuracy problem.

This is the gap. The entire industry is building better ways to write, score, and organize documentation. Nobody is building the validation layer that checks whether the documentation matches the code it describes.

Why Accuracy Is Harder Than Quality

Structural quality is easy to measure because it’s visible. You can parse a document, check its structure, and score it. No external reference needed. The document is its own input.

Accuracy is harder because it requires an external reference. To know if a document is accurate, you need to compare it to the thing it describes. For API documentation, that means the actual codebase. For product documentation, that means the actual product. For process documentation, that means the actual process.

This requires a system that can:

  1. Read the code — extract endpoints, schemas, types, error paths, behaviors.
  2. Read the docs — extract what the docs claim about those same endpoints, schemas, types, error paths, behaviors.
  3. Compare them — identify discrepancies, not as suggestions, but as errors.
  4. Run continuously — because the code-doc gap widens with every commit.

This is a harder engineering problem than scoring document structure. It’s also a less marketable problem. “AI checks your docs for typos” is an easier sell than “AI compares your docs to your code and tells you where they disagree.”

The easier sell is winning. The harder problem is unsolved.

The MCP Factor

There’s a reason accuracy matters more now: MCP (Model Context Protocol) is becoming the standard interface between AI agents and documentation.

ReadMe launched an MCP server. Mintlify has one. The idea is simple: instead of an AI agent scraping your docs site, it queries your docs through a structured protocol. Your documentation becomes a real-time data source for AI agents.

This is great for developer experience. It’s terrible for documentation accuracy.

Here’s why: when an AI agent queries your docs through MCP, it gets the current version. If the current version is wrong, the agent generates wrong code. The MCP server doesn’t validate the docs — it serves them. It’s a high-speed delivery mechanism for whatever content you have, accurate or not.

MCP amplifies the impact of documentation errors. Every AI agent that connects to your docs through MCP is a new consumer that will generate code based on what it reads. If the docs are wrong, the MCP server efficiently delivers those wrong docs to every connected agent.

The MCP trend makes validation more urgent, not less. The more agents that read your docs, the more important it is that the docs are right.

What Validation Actually Looks Like

The documentation industry needs a different kind of tool. Not a linter. Not a scorer. A validation layer.

Here’s what that looks like:

1. Extract ground truth from code. Parse the actual codebase — OpenAPI specs, function signatures, type definitions, error handlers — and build a model of what the code actually does.

2. Extract claims from docs. Parse the documentation and build a model of what the docs claim the code does.

3. Compare and report discrepancies. Not as suggestions. As errors. “The docs say amount is an integer. The code expects a string. Fix one or the other.”

4. Run on every commit. Not at publish time. Not during a documentation sprint. On every code change. Because drift starts the moment code ships.

5. Block the pipeline if accuracy drops below threshold. Just like failing tests block a merge. If the docs don’t match the code, the change doesn’t ship.

This is validation. It’s not writing assistance. It’s not quality scoring. It’s a continuous check that the documentation matches reality.

The Competitive Landscape

The documentation platform market is heating up. Mintlify raised $45M. ReadMe is shipping AI features at a rapid pace. GitBook is repositioning around AI-native knowledge.

All of them are building authoring tools with AI features layered on top. None of them are building validation infrastructure.

This is the gap in the market. The teams that need documentation accuracy — API companies, platform teams, infrastructure providers — don’t need a better editor. They need a system that keeps their existing docs accurate as their code changes.

The authoring tools are solving a problem that’s already solved. You can write docs. You can even write them with AI assistance. The unsolved problem is keeping those docs accurate over time, at scale, as the codebase evolves.

What This Means for Teams

If you’re evaluating documentation tools in 2026, here’s the question to ask:

“Does this tool check whether my docs are right, or just whether they’re well-written?”

If the answer is “well-written,” you’re getting a structural quality tool. It will help you write better-organized, more consistent, more navigable docs. This is valuable. Do it.

But also ask: “What happens when the code changes?”

If the answer is “you update the docs” or “the AI will help you update them,” you have a tool that creates docs and a process that requires humans to keep them accurate. This doesn’t scale. It doesn’t work with AI-generated code. It doesn’t work when your API surface grows faster than your documentation team.

What you need is a validation layer. A system that sits between your code and your docs and checks, continuously, that they agree. Not a linter. Not a scorer. A validator.

The Bottom Line

The documentation industry is having an AI moment. AI linters, AI writers, AI scores. The tools are getting smarter. The docs are getting prettier.

But prettier wrong docs are still wrong docs. And in the AI era, wrong docs don’t just confuse humans — they mislead AI agents that generate code at machine speed.

The next wave of documentation infrastructure won’t be about writing. It will be about validating. The teams that understand this will build (or adopt) validation layers. The teams that don’t will keep scoring high on Agent Score while their docs quietly diverge from their code.

ReadMe’s AI Linter is a good tool. Use it to clean up your structure. Then ask the question it can’t answer: Is any of it true?


Structural quality is not accuracy. Start validating. Join the waitlist — boringdocs is the validation layer that checks whether your docs match your code, continuously. Because a high score on wrong docs just means your mistakes are well-organized.