TL;DR

Anthropic published lessons from using hundreds of reusable Claude Code Skills across its engineering organization. The main finding is that a Skill is treated as a discoverable folder containing instructions, scripts, references and configuration, not merely a saved prompt.

Anthropic says its internal use of hundreds of Claude Code Skills has shown that reusable agent instructions work best as structured folders, not one-off prompts, a finding that matters for companies trying to turn AI coding agents into repeatable workplace tools.

The company’s write-up, “Lessons from building Claude Code: How we use skills,” was published on June 3, 2026 by Claude Code engineer Thariq Shihipar. A July 1 Thorsten Meyer AI dispatch framed the post as more than a developer guide, arguing that Skills can become shared operating procedures for AI agents.

According to the source material, a Skill is a folder the agent can discover, read and run. It may include a SKILL.md instruction file, deeper reference material, scripts, templates, configuration files, hooks and memory. The root file gives the model a trigger and basic guidance, while additional files are loaded only when the task requires them.

Anthropic’s internal catalog grouped Skills into nine categories, including library and API reference, product verification, data analysis, business-process automation, code scaffolding, code review, CI/CD, runbooks and infrastructure operations. The source says verification Skills, which check an agent’s work, had the strongest measured effect on output quality.

At a glance

reportWhen: published June 3, 2026; discussed in a…

The developmentAnthropic published a June 3, 2026 Claude blog post detailing what it learned from running hundreds of Skills across its own engineering organization.

AI Dispatch · Insights · 1 July 2026

A Skill is a folder, not a prompt

Q: What did Anthropic publish?

Anthropic published a Claude blog post on June 3, 2026 describing lessons from using hundreds of Claude Code Skills inside its engineering organization.

Q: What is a Skill in this report?

A Skill is described as a discoverable folder that can contain instructions, scripts, references, templates, configuration and hooks for an AI agent to use during a task.

Q: Which Skill category had the biggest reported impact?

According to the supplied source material, verification Skills had the strongest measured effect on output quality because they help check whether the agent’s work is correct.

Q: Is this only relevant to developers?

No. While the examples come from Claude Code, the broader issue is how organizations turn repeated knowledge into shared agent workflows that can be reused and improved.

Anthropic published what it learned running hundreds of Skills across its own engineering org. Read as a business memo, the point is bigger than a coding trick: this is how ad-hoc prompting becomes durable institutional capability — the SOPs your agents actually follow, versioned and shared.

✕ The misconception

“A Skill is just a clever markdown prompt you save in a file.”

✓ What it actually is

A folder the agent can discover, read & run — instructions, scripts, references, templates, config & on-demand hooks.

Anatomy of a Skill — the file system is context engineering

my-skill/the unit you share & version

├─ SKILL.mdroot instructions + a description written for the model (its trigger)

├─ references/deep detail pulled in only when needed — progressive disclosure

├─ scripts/real code, so the agent composes instead of rebuilding boilerplate

├─ assets/templates & files to copy into the output

├─ config.jsonsetup the agent asks for if it’s missing (e.g. which Slack channel)

└─ hooks + memoryon-demand guardrails + an append-only log so it remembers

Why it matters: the folder itself is the knowledge base. The agent reads the root, then reaches deeper only when the task demands it — the same way you’d hand a new hire a one-pager that points to the detailed docs.

The nine types — a gap-analysis map for your own library

1Library / API reference

2Product verification ★ top impact

3Data fetching & analysis

4Business-process automation

5Code scaffolding & templates

6Code quality & review

7CI/CD & deployment

8Runbooks

9Infrastructure operations

By Anthropic’s own measurement, verification Skills — the ones that check the work — moved output quality the most. If you build one category well, build that one.

The craft — what separates a good Skill from a useless one

Gotchas = highest-signal section Describe for the model, not humans (it’s the trigger) Don’t state the obvious Ship scripts, not just prose On-demand guardrail hooks (/careful, /freeze) Let it remember (log / SQLite) Don’t railroad — leave room to adapt

The take

The knowledge of how your organization actually operates can be captured, versioned, shared & executed — and the thing capturing it is a humble folder with a script and a gotchas list inside. For the builder, that’s context engineering with real tools attached. For whoever owns the budget, it’s the difference between AI that starts from zero every morning and an asset that compounds. Caveats: best practices are still evolving, checked-in Skills cost context, and curation beats accumulation. Start with one Skill, one gotcha, and the category that catches your mistakes.

Source: “Lessons from building Claude Code: How we use skills,” Thariq Shihipar (Anthropic), Claude blog, 3 June 2026. Categories, examples & measured claims are Anthropic’s; framing is the author’s. Docs: code.claude.com/docs/en/skills.

thorstenmeyerai.com

Skills Become Operating Assets

The development matters because it points to a shift from ad-hoc prompting toward versioned workplace knowledge. If teams can package instructions, scripts and review checks into reusable folders, AI agents may become more consistent across engineers, teams and repeated workflows.

For engineering leaders, the business case is not just faster prompting. The source material argues that a good Skill can capture tribal knowledge, reduce repeated onboarding explanations and make agent behavior more predictable. Anthropic’s own framing suggests some Skills may justify substantial engineering time when they improve a high-value task such as product verification.

Amazon

AI agent automation tools

View Latest Price

As an affiliate, we earn on qualifying purchases.

From Prompts To Folders

The Thorsten Meyer AI dispatch stresses a definitional change: “A Skill is a folder, not a prompt.” In that model, the folder itself becomes the knowledge base. The agent reads the top-level guidance first, then pulls in scripts, templates or reference files only when needed.

The post compares this approach to giving a new hire a short guide that points to deeper documentation. The recommended craft guidance includes writing descriptions for the model rather than humans, avoiding obvious instructions, shipping scripts instead of prose alone, adding guardrail hooks and allowing room for the agent to adapt.

“A Skill is a folder, not a prompt.”
— Thorsten Meyer AI dispatch

Amazon

AI development environment software

View Latest Price

As an affiliate, we earn on qualifying purchases.

Measurement Details Remain Limited

Several details are still not clear from the supplied material. It does not provide the full methodology behind Anthropic’s measurement that verification Skills improved output quality the most, including baseline comparisons, sample sizes or whether the results generalize outside Anthropic’s engineering organization.

It is also unclear how much maintenance burden large Skill libraries create over time. The source flags that best practices are still evolving, checked-in Skills can consume context and curation may matter more than accumulating many folders.

Amazon

AI scripting and reference management tools

View Latest Price

As an affiliate, we earn on qualifying purchases.

Teams Test Skill Libraries

The next step for readers and technical teams is likely experimentation with small, high-value Skills rather than broad libraries. The source recommends starting with one Skill, one known failure mode and the category most likely to catch mistakes.

Anthropic’s documentation at code.claude.com/docs/en/skills is the practical follow-up for teams evaluating the approach. More evidence will be needed to show how these patterns work across different organizations, security requirements and engineering workflows.

Amazon

AI workflow automation folders

View Latest Price

As an affiliate, we earn on qualifying purchases.

Key Questions

What did Anthropic publish?

Anthropic published a Claude blog post on June 3, 2026 describing lessons from using hundreds of Claude Code Skills inside its engineering organization.

What is a Skill in this report?

A Skill is described as a discoverable folder that can contain instructions, scripts, references, templates, configuration and hooks for an AI agent to use during a task.

Which Skill category had the biggest reported impact?

According to the supplied source material, verification Skills had the strongest measured effect on output quality because they help check whether the agent’s work is correct.

Is this only relevant to developers?

No. While the examples come from Claude Code, the broader issue is how organizations turn repeated knowledge into shared agent workflows that can be reused and improved.

What remains unknown?

The supplied material does not include full measurement details, long-term maintenance costs or evidence showing whether Anthropic’s internal results will translate cleanly to other companies.

Source: Thorsten Meyer AI

A Skill Is a Folder, Not a Prompt: What Anthropic Learned Running Hundreds of Them

Author

AI Espionage Team

Share article

A Skill is a folder, not a prompt