MR
Mayur Rathi
@muratcankoylan
⭐ 5 GitHub stars

Context Compression

Design and evaluate compression strategies for long-running sessions

mkdir -p ./skills/context-compression && curl -sfL https://raw.githubusercontent.com/mayurrathi/awesome-agent-skills/main/skills/context-compression/SKILL.md -o ./skills/context-compression/SKILL.md

Run in terminal / PowerShell. Requires curl (Unix) or PowerShell 5+ (Windows).

Skill Content

# Context Compression Strategies


When agent sessions generate millions of tokens of conversation history, compression becomes mandatory. The naive approach is aggressive compression to minimize tokens per request. The correct optimization target is tokens per task: total tokens consumed to complete a task, including re-fetching costs when compression loses critical information.


When to Use


Use this skill when designing or evaluating context compression for long-running agent sessions, when codebases exceed context windows, or when debugging agent memory/forgetting issues.


When to Activate


Activate this skill when:

- Agent sessions exceed context window limits

- Codebases exceed context windows (5M+ token systems)

- Designing conversation summarization strategies

- Debugging cases where agents "forget" what files they modified

- Building evaluation frameworks for compression quality


Core Concepts


Context compression trades token savings against information loss. Three production-ready approaches exist:


1. **Anchored Iterative Summarization**: Maintain structured, persistent summaries with explicit sections for session intent, file modifications, decisions, and next steps. When compression triggers, summarize only the newly-truncated span and merge with the existing summary. Structure forces preservation by dedicating sections to specific information types.


2. **Opaque Compression**: Produce compressed representations optimized for reconstruction fidelity. Achieves highest compression ratios (99%+) but sacrifices interpretability. Cannot verify what was preserved.


3. **Regenerative Full Summary**: Generate detailed structured summaries on each compression. Produces readable output but may lose details across repeated compression cycles due to full regeneration rather than incremental merging.


The critical insight: structure forces preservation. Dedicated sections act as checklists that the summarizer must populate, preventing silent information drift.


Detailed Topics


Why Tokens-Per-Task Matters


Traditional compression metrics target tokens-per-request. This is the wrong optimization. When compression loses critical details like file paths or error messages, the agent must re-fetch information, re-explore approaches, and waste tokens recovering context.


The right metric is tokens-per-task: total tokens consumed from task start to completion. A compression strategy saving 0.5% more tokens but causing 20% more re-fetching costs more overall.


The Artifact Trail Problem


Artifact trail integrity is the weakest dimension across all compression methods, scoring 2.2-2.5 out of 5.0 in evaluations. Even structured summarization with explicit file sections struggles to maintain complete file tracking across long sessions.


Coding agents need to know:

- Which files were created

- Which files were modified and what changed

- Which files were read but not changed

- Function names, variable names, error messages


This problem likely requires specialized handling beyond general summarization: a separate artifact index or explicit file-state tracking in agent scaffolding.


Structured Summary Sections


Effective structured summaries include explicit sections:


```markdown

Session Intent

[What the user is trying to accomplish]


Files Modified

- auth.controller.ts: Fixed JWT token generation

- config/redis.ts: Updated connection pooling

- tests/auth.test.ts: Added mock setup for new config


Decisions Made

- Using Redis connection pool instead of per-request connections

- Retry logic with exponential backoff for transient failures


Current State

- 14 tests passing, 2 failing

- Remaining: mock setup for session service tests


Next Steps

1. Fix remaining test failures

2. Run full test suite

3. Update documentation

```


This structure prevents silent loss of file paths or decisions because each section must be explicitly addressed.


Compression Trigger Strategies


When to trigger compression ma

🎯 Best For

  • Claude users
  • Designers
  • Creative professionals
  • Product teams

💡 Use Cases

  • Design system documentation
  • Component specification creation

📖 How to Use This Skill

  1. 1

    Install the Skill

    Copy the install command from the Terminal tab and run it. The SKILL.md file downloads to your local skills directory.

  2. 2

    Load into Your AI Assistant

    Open Claude and reference the skill. Paste the SKILL.md content or use the system prompt tab.

  3. 3

    Apply Context Compression to Your Work

    Provide context for your task — paste source material, describe your audience, or share existing work to guide the AI.

  4. 4

    Review and Refine

    Edit the AI output for accuracy, tone, and completeness. Add human insight where the AI lacks context.

❓ Frequently Asked Questions

Does Context Compression generate production-ready design specs?

It generates detailed specifications that developers can use directly. Review and adjust for your specific design system.

How do I install Context Compression?

Copy the install command from the Terminal tab and run it. The skill downloads to ./skills/context-compression/SKILL.md, ready to use.

Can I customize this skill for my team?

Absolutely. Edit the SKILL.md file to add team-specific instructions, examples, or workflows.

⚠️ Common Mistakes to Avoid

Not reading the full skill

Skills contain important context and edge cases beyond the quick start.

🔗 Related Skills