Production-tested skills for AI agents. Every skill is security-scanned, tier-rated, and verified. Browse by ecosystem or category below.
Anthropic shipped Claude Fable 5 on June 9, 2026 — 95.0% on SWE-Bench Verified, 80.3% on SWE-Bench Pro, 92.6% on GPQA Diamond, 84.3% on Terminal-Bench 2.1, Artificial Analysis Intelligence Index 65 vs. GPT-5.5 at 60. Twice the price of Opus 4.8. For everyone building agents, this is the week the abstraction moved.
Qdrant is the open-source Rust vector database: 23K stars, Apache 2.0, disk-based HNSW for billion-vector single-node indexes, and a filter engine that crushes Pinecone. Most RAG teams are overpaying.
Hot take: MCP turns every agent framework into a thin wrapper around a model and a list of MCP servers. LangChain, CrewAI, and AutoGen are about to become the next jQuery.
The most-installed AI extension in the VS Code marketplace in 2026 is not Cursor. It is not GitHub Copilot. It is an open-source project at 50K stars, 5M+ installs, Apache 2.0, with a browser tool Cursor does not have and a price tag of $0. You are paying $20 a month to use an LLM you can call directly with the same wrapper. The wrapper is open source. The wrapper is free.
Out of the box, Ollama gives you a generic chat model. With a 12-line Modelfile and three parameter tweaks, you get a code assistant that does not ramble.
Anthropic shipped its first generally-available Mythos-class model on June 9, 2026. The US government pulled it on June 12 — for a capability GPT-5.5 already has.
Your 'AI agent' startup is vertical SaaS in a trench coat, and the coat is on fire. Compute costs, horizontal platforms, and bad unit economics will burn 70% of them by 2027.
The UK AI Security Institute proved frontier AI can autonomously run end-to-end offensive cyber operations. Here's what that means for every builder working with AI agents today.
Every computer-use agent in 2026 — Agent S3, OpenCUA, Skyvern, the closed ones from Anthropic and OpenAI — is going to run on top of the same OS-level virtualization layer in a year. That layer is Cua, MIT-licensed, 17.8K stars, and shipping a macOS VM manager (Lume) that is the only thing on Apple Silicon that boots macOS Sequoia with near-native performance. The interesting story is not the wrapper. It is the sandboxes.
WSJ and Semafor are reporting the same thing: Amazon CEO Andy Jassy personally told Treasury Secretary Scott Bessent that Amazon researchers had jailbroken Claude Fable 5. Three days later the export ban dropped. Amazon is Anthropic's $33B investor and primary cloud provider. The investor relationship just became a national-security liability, and every AI lab with hyperscaler money on the cap table needs to read this story carefully.
The vector database market is a four-billion-dollar industry selling a solution to a problem most teams do not have. Ninety percent of AI agents shipping in 2026 would be faster, cheaper, and more reliable on SQLite FTS5 plus a cross-encoder reranker than on Pinecone. I have done the math. The team building a vector database is the team that has never queried a tsvector.
Prompt caching is built into the Claude API and CLI, but almost nobody uses it for local workflows. Here is the exact one-file pattern that cuts your token bill and latency by 60-80% on repetitive tasks — and the three gotchas that will bite you if you do not read the docs.
DeepSeek's 1M token context window wasn't a benchmark stunt. Here's why the context window war matters more than the model weight race for every builder working with AI agents today.
Claude Fable 5 lived for 72 hours, the SWE-bench Verified leaderboard is statistically flat across ranks 5-10, and Kimi K2.7-Code beats Opus 4.8 on tool use. The right answer is a two-tier agent stack, not picking a winner. Here is the full breakdown, the benchmarks that actually matter, and the routing rules to ship this weekend.
Every AI agent demo looks incredible. Here's what separates the agents that survive contact with production from the ones that fall apart the moment real users touch them.
Z.ai open-sourced GLM-5.2 on June 16, 2026 — 753B MoE, 40B active, MIT license, 1M context, beats GPT-5.5 on SWE-Bench Pro, FrontierSWE, MCP-Atlas, and GDPval-AA v2. The story isn't the model. The story is IndexShare: a sparse-attention architecture that cuts per-token FLOPs by 2.9x at 1M context, published openly, reproducible, and the reason the closed-lab bundle just stopped being defensible. First-party API is $1.40 / $4.40 per million tokens. Self-host is electricity. The frontier is open and the architecture is the moat.
Adapt is a self-evolving LLM memory layer: Brain auto-decomposes into Neurons that learn and restructure themselves. TypeScript, under 200KB, MIT, still 0.0.x.
Every AI coding agent (Claude Code, Cursor, Codex CLI, Aider, Continue) shares one annoying bottleneck: you cannot run two of them on the same branch at the same time without the second stomping the first. The fix is git worktrees plus tmux. Twenty minutes of setup, hours saved per week.
On June 9, 2026 Anthropic shipped Claude Fable 5 — its first public Mythos-class model — and Mythos 5 for vetted partners. Three days later, a U.S. export-control directive forced Anthropic to disable both models worldwide, including for its own foreign-national employees. The first time a national government has ever yanked a deployed frontier model. Plus a separate scandal: Fable 5 was silently nerfing frontier LLM research queries. Anthropic walked that back. Both stories matter. Most people are only telling one.
50%+ of Fortune 500 developers use AI coding agents weekly. The IDE is now an AI-first interface. Here's why that matters for every engineering team that hasn't made the switch yet.
On June 17, 2026, a coalition including Google, Microsoft, Amazon, Anthropic, and OpenAI published the Agentic Resource Discovery specification. ARD gives AI agents what DNS gave the internet in 1983: a way to find things without knowing where they live. This is the most important infrastructure story of the week, and almost nobody is covering it like it is.
Every AI agent demo looks incredible. Here's what separates the agents that survive contact with production from the ones that fall apart the moment real users touch them.