Self-Improving Agent Documentation Through Log Analysis

A technique for automating the maintenance of Claude/Codex project documentation by analyzing agent session logs to identify recurring pain points and improvement opportunities.

Maintaining accurate project documentation for AI agents like Claude Code and Codex presents a significant challenge as projects evolve. The CLAUDE.md file (or AGENTS.md equivalent) serves as critical context for these agents, yet manually keeping it updated becomes increasingly burdensome as project complexity grows. This documentation drift creates friction during development, forcing users to repeatedly clarify context or re-explain project specifics across sessions.

A practical solution leverages the detailed session logs automatically generated during agent interactions. Both Claude Code and Codex store these logs locally:

Claude Code: ~/.claude/projects
Codex: ~/.codex/sessions

These JSONL files contain comprehensive session records including user prompts, agent responses, and execution details. While the schema differs between tools, both formats capture valuable interaction patterns that reveal documentation gaps.

The optimization process involves directing your agent to analyze these historical logs against your current CLAUDE.md file using prompts like:

"Review my Claude JSONL history files for this project and suggest improvements to CLAUDE.md. Identify instances of user frustration and recurring requests across sessions."

This approach transforms documentation maintenance from a chore into an efficient refinement process. In practice, agents successfully identify:

Repeated requests for clarification about project structure
Common misunderstandings about domain-specific terminology
Frequently referenced files missing from context
Outdated configuration instructions

However, parsing raw JSONL logs poses efficiency challenges. Agents often generate complex jq commands that struggle with larger log volumes. To solve this, I developed a purpose-built CLI tool that abstracts log parsing complexities:

claude-log CLI help output showing commands for parsing and analysing Claude Code chat logs

The CLI enables rapid log analysis that previously took minutes to complete in seconds. Key capabilities include:

Filtering sessions by date range
Searching for specific error patterns
Identifying repeated command sequences
Aggregating user correction patterns

This efficiency opens possibilities for automated documentation refinement. A scheduled task could run weekly analysis, propose CLAUDE.md updates, and even implement approved changes - creating truly self-improving project documentation.

For teams adopting this approach:

Start with targeted log reviews after noticeable friction points
Gradually increase analysis scope as trust in the process builds
Curate initial suggestions before full automation

The technique demonstrates how leveraging existing telemetry unlocks significant workflow optimizations. As agent ecosystems mature, standardized approaches like AGENTS.md could further streamline this process across tools and teams.

The CLI tool referenced may be published on GitHub based on community interest. Implementation details vary between Claude and Codex due to their different log schemas.

#Documentation #Log Analysis #Automation #CLI #AI_Agents

Self-Improving Agent Documentation Through Log Analysis

Comments