Developers say Claude got worse. Some of the evidence did, too.

Real changes happened. Whether they explain what developers are seeing is another question. <https://venturebeat.com> Good morning! One claim driving the “Claude is nerfed” story — that Opu...

**TL;DR** - Developers report that Claude's coding performance has degraded over the past six months. This dispatch examines the evidence and Anthropic's response.

The 10-Second Pitch

- A significant number of developers report that Claude's code output quality has declined on complex tasks
- Anthropic attributes the change to model updates and changed defaults; critics argue it is cost optimization that cuts into capability
- The evidence is mostly anecdotal, but consistent enough to warrant attention

Setup in 3 Steps

1. Run your own longitudinal benchmark on Claude coding tasks; do not rely on a Twitter sample size of one

2. If you have seen degradation, document the specific task types affected and send Anthropic structured feedback

3. The model that is best for simple coding tasks may not be best for complex ones; use the right model for each task
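Steps 1 and 2 can be sketched as a small harness that runs a fixed task set on a schedule and logs one structured record per task. This is a minimal sketch, not a definitive implementation: `generate` is a hypothetical callable standing in for whatever model client you use, and the task fields (`id`, `type`, `prompt`, `check`) and the JSONL log layout are assumptions made for illustration.

```python
import json
from dataclasses import dataclass, asdict
from datetime import date
from typing import Callable, Dict, List, Optional


@dataclass
class TaskResult:
    run_date: str   # ISO date of the benchmark run
    model: str      # label for the model under test (your own naming)
    task_id: str
    task_type: str  # e.g. "simple" or "complex", so results can be split by type
    passed: bool


def run_benchmark(
    tasks: List[dict],
    generate: Callable[[str], str],  # hypothetical: wraps your model client
    model: str,
    run_date: Optional[str] = None,
) -> List[TaskResult]:
    """Run every task once and record whether its check accepted the output."""
    run_date = run_date or date.today().isoformat()
    results = []
    for task in tasks:
        output = generate(task["prompt"])
        passed = bool(task["check"](output))
        results.append(
            TaskResult(run_date, model, task["id"], task["type"], passed)
        )
    return results


def append_results(path: str, results: List[TaskResult]) -> None:
    """Append results as JSON Lines so runs accumulate over months."""
    with open(path, "a", encoding="utf-8") as fh:
        for result in results:
            fh.write(json.dumps(asdict(result)) + "\n")


def pass_rate_by_type(results: List[TaskResult]) -> Dict[str, float]:
    """Aggregate the pass rate per task type for one run."""
    counts: Dict[str, List[int]] = {}
    for result in results:
        passed_total = counts.setdefault(result.task_type, [0, 0])
        passed_total[0] += int(result.passed)
        passed_total[1] += 1
    return {t: passed / total for t, (passed, total) in counts.items()}
```

Keeping the task set and checks fixed between runs is what makes the log longitudinal. The per-type breakdown also gives you the structured detail that step 2 asks for, and shows whether simple and complex tasks behave differently, as step 3 suggests.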

**Example Prompt:**

Design a longitudinal coding benchmark that would detect Claude model degradation over a 6-month period.
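Detecting degradation means deciding whether a drop in pass rate between two periods is larger than chance would explain. One common way to check this is a two-proportion z-test, sketched below with only the standard library. The function name and the choice of test are assumptions made for illustration, not a method taken from the reporting. The normal approximation is also rough for small samples.

```python
import math
from typing import Tuple


def two_proportion_z_test(
    passed_a: int, total_a: int, passed_b: int, total_b: int
) -> Tuple[float, float]:
    """Compare pass rates of period A and period B.

    Returns (z, two_sided_p_value) using the pooled-variance z-test.
    """
    if total_a <= 0 or total_b <= 0:
        raise ValueError("both periods need at least one task run")
    rate_a = passed_a / total_a
    rate_b = passed_b / total_b
    pooled = (passed_a + passed_b) / (total_a + total_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    if std_err == 0:
        # Every run passed or every run failed in both periods: no difference.
        return 0.0, 1.0
    z = (rate_a - rate_b) / std_err
    # Two-sided p-value under the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# 90 of 100 tasks passed six months ago, 60 of 100 pass now.
z, p = two_proportion_z_test(90, 100, 60, 100)
print(f"z={z:.2f}, p={p:.2g}")

# One pass then, one failure now: a "sample size of one" comparison.
z_one, p_one = two_proportion_z_test(1, 1, 0, 1)
print(f"z={z_one:.2f}, p={p_one:.2g}")
```

The second call shows why a single anecdote settles little: one pass followed by one failure does not reach conventional significance levels, while the same direction of change across a hundred tasks does.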

Verdict

| Pros | Cons |
| --- | --- |
| Anecdotal evidence consistent | Anecdotes are not data |
| Anthropic response transparent | Model updates always have winners and losers |
| Self-benchmarking is the right approach | Most teams do not have infra for proper benchmarks |

Run your own benchmark before drawing conclusions. Signal-to-noise on social media AI discourse is very low.