Blog
Notes on backend, AI agents, and shipping things that hold up in production.
48 posts
Together AI Just Raised $800M at an $8.3B Valuation — Aramco Ventures Is Betting Sovereign Capital on Open-Source AI Infrastructure
On July 1, 2026, Together AI closed an $800 million Series C at an $8.3 billion valuation — more than 2.5x what it was worth 16 months ago — led by Aramco Ventures, the venture arm of Saudi Aramco, with Nvidia, Vista Equity Partners, and General Catalyst also participating. The company says annual bookings now exceed $1.15 billion and that enterprises are cutting inference costs up to 60x versus closed-model APIs by running open-weight models like Llama, DeepSeek, and Qwen on its platform. Here is what the round actually signals about where AI infrastructure value is concentrating, and what to check before your next model-hosting decision.
GitHub Copilot CLI Went Multi-Model in One Week — And Nobody Noticed the Chinese Coding Model It Just Added
In three GitHub Copilot CLI releases between June 30 and July 1, 2026, GitHub added Claude Opus 4.8 Fast, Claude Sonnet 5, and Moonshot AI's open-weight Kimi K2.7-Code — a roughly 1-trillion-parameter model that beats Opus 4.8 on MCP Mark Verified while burning about 30% fewer reasoning tokens. Each changelog entry is a single line. The implications for how enterprise engineering teams vet, budget for, and secure their AI tooling are not. Here is what actually changed, and what to check before your next sprint planning.
Claude Fable 5 Returns After 19 Days Offline — But Its New Safety Classifier Will Flag Your Routine Code Too
On July 1, 2026, Anthropic restored global access to Claude Fable 5, ending a 19-day suspension triggered by a US export control order after Amazon researchers found the model would produce exploit code once a security-review request was rephrased as "fix this code." The fix is a new cybersecurity classifier that blocks the technique in over 99% of cases — but flags routine coding and debugging more often too, silently falling back to Opus 4.8 when it fires. More than 100 security experts, led by Katie Moussouris, dispute this was ever a real "jailbreak." Here is what changed, what's still disputed, and what to check before July 7.
Claude Sonnet 5 Just Became the Default in Claude Code — Near-Opus Agentic Coding at Less Than Half the Price
Anthropic shipped Claude Sonnet 5 on June 30, 2026, and made it the default model in both Claude Code and claude.ai from day one. The headline number: 63.2% on SWE-bench Pro, up from Sonnet 4.6's 58.1% and closing most of the gap to Opus 4.8's 69.2% — at introductory pricing of $2/$10 per million tokens through August 31, less than half of what Opus 4.8 costs. A full 1M-token context window ships with no long-context premium, and the timing lines up with Anthropic's confidential SEC filing for an IPO targeting a $1 trillion valuation. Here is what actually changed, and what to do about it.
Context Engineering Is the New Load-Bearing Skill of AI Development — Anthropic's Engineering Blog Just Made It Official
Anthropic's engineering blog published the definitive technical guide to context engineering for AI agents this week — the discipline Gartner named the breakout AI capability of 2026. The 2026 Agentic Coding Trends Report found that teams mastering context engineering complete tasks 55% faster with 40% fewer errors. A peer-reviewed study of 9,649 experiments confirmed it: context quality outweighs prompt quality. Most agent failures are not model failures. They are context failures. Here is what that means for how you build.
GitHub Copilot's Billing Shock Is Fueling the Open-Source AI Coding Revolution — OpenCode Just Crossed 160K Stars
GitHub Copilot's shift to usage-based billing on June 1, 2026 triggered immediate developer backlash — users on $39/month plans burning through 8% of their monthly AI credits in two hours. In the same window, OpenCode, the open-source terminal-native coding agent, crossed 160,000 GitHub stars with 7.5 million monthly active developers. These are not two separate stories. They are the same story: when proprietary AI tools start charging per-token, the open-source model-agnostic alternative wins on economics.
The US Just Suspended the World's Most Capable Coding AI — And Every Engineering Team Needs a Geopolitical Contingency Plan
On June 12, 2026, a US export control directive forced Anthropic to suspend Claude Fable 5 and Claude Mythos 5 globally — and within 48 hours, engineering teams across the EU, UK, and Southeast Asia discovered their AI-powered production pipelines were broken. This is not a story about one model being unavailable. It is a story about a new class of infrastructure risk that no existing runbook covers: when the model is fine, the API is up, but government policy makes it unreachable for your users.
Figma Config 2026: Code Layers Put the Codebase on the Design Canvas — And That Changes Everything for Dev Teams
At Config 2026, Figma CEO Dylan Field declared 'code is material for design' and backed it up with Code Layers — a feature that brings a live, GitHub-connected codebase directly onto the collaborative Figma canvas. Combined with custom AI Skills, native animation, and GPU shader effects, Figma's 2026 release inverts the traditional design-to-code handoff in a way that touches every product engineer, front-end developer, and UX team.
MCP Is Now the USB-C of AI: How the Model Context Protocol Became the Universal Standard Every Developer Needs to Know
In December 2025, Anthropic donated the Model Context Protocol to the Linux Foundation's Agentic AI Foundation — and six months later, MCP has become the fastest-adopted open standard in AI history. With 9,652 registered servers, 97 million monthly SDK downloads, and 41% of software organizations now in production, MCP is no longer a developer experiment. It is the infrastructure layer that connects every AI agent to every tool on the planet.