Gen α AI · Field notes for AI builders

Depth over hype, for people who bet on AI.

Evidence-first analysis of agentic systems, model evaluation, and the economics of AI software. We read the system card, find the primary source, and tell you what actually changed — and what didn't.

Evidence over vibes · Depth over volume · Honest about uncertainty
157 · Deep dives published
9 · Evergreen pillar guides
Biweekly · The field briefing
Editor’s picks

New here? These are the pieces we’d hand you first.

The latest

Fresh analysis, published continuously. The full archive lives in the rail and the pillars below.

Jailbreak Evaluation Frameworks for the Reasoning-Model Era · Model Evaluation

Reasoning Models Break Guardrails 97% of the Time. Score It Like CVSS.

A practitioner framework for scoring jailbreak severity, choosing benchmarks, and assuming reasoning-model attackers in your red team.

11 min · June 26, 2026

C2PA Watermarking for Model Outputs: The 2026 Engineering Ship Plan · AI Frontiers

C2PA Watermarking for Model Outputs: A 2026 Ship Plan

Article 50 takes effect in August. Here's the layered provenance stack that actually satisfies it.

10 min · June 26, 2026

vLLM vs TensorRT-LLM vs SGLang: The 2026 Same-Hardware Serving Benchmark · Model Evaluation

vLLM vs TensorRT-LLM vs SGLang: 2026 Serving Benchmark, Same Hardware

Tokens-per-second-per-dollar on identical GPUs decides more deployments than peak throughput, and tail latency plus cold start decide the rest.

11 min · June 26, 2026

Model Evaluation

Multimodal Evals Are Now the Hardest Part of the Stack

Text benchmarks have saturated, so differentiation moved to vision, audio, video, and real-time duplex tasks where evaluation is still immature and gameable.

10 min · June 26, 2026

Facebook AI Mode Grounds Answers in the Social Graph. Publishers Need a New GEO Play. · Search & GEO

Facebook AI Mode Cites Your Friends' Posts. Here's the GEO Play

Meta's billion-user Muse Spark rollout makes public Facebook posts the citation layer for AI search, opening a social-graph GEO surface no one has optimized for yet.

10 min · June 26, 2026

Generative UI: The Third Interface Pattern Beyond Chat and Copilot · AI Tools

Generative UI Quietly Became the Third Interface Pattern

AI is no longer just answering in chat or suggesting in your editor; it is composing the interface itself, and the protocols to do it portably just shipped.

14 min · June 26, 2026

AI Economics

Neocloud GPU Economics Are Cheap, Fragile, and Winning Anyway

GPU rental prices have collapsed 64-85% below hyperscalers, but the debt and utilization math underneath is brutal.

13 min · June 26, 2026

Praxis and the Architecture of AI Sovereignty · Models & Releases

Fable 5 Went Dark. Praxis Is the City Built to Outrun It.

A single Commerce Department letter switched off Anthropic's Fable 5 worldwide; Praxis wants to make sure the next frontier model can't be turned off the same way.

18 min · June 26, 2026

AI Frontiers

John Jumper's Move Says AI Life Sciences Is Now a Platform War

When the AlphaFold architect leaves DeepMind for Anthropic, the message is clear: foundation models are commoditizing, and the moat moved upstream.

12 min · June 26, 2026

Model Evaluation

Multimodal Evaluation Broke. Here's How Teams Fix It

Benchmark scores don't predict production vision AI failures. Here's the evaluation stack teams actually ship.

10 min · June 26, 2026

The AI Hallucination Tariff: How 2026 Court Sanctions Became a Cross-Domain Liability Template · AI Frontiers

The AI Hallucination Tariff: 2026 Lawyer Sanctions Decoded

Courts are now pricing fabricated AI citations at $500 each, and the verification workflow that stops them is cheaper than a single sanction.

13 min · June 26, 2026

Security & Safety

AI Data Center Permitting Met a New Bypass: National Security

The DOJ's xAI intervention turns 'national security' into a legal lever to skip Clean Air Act permits, and founders building AI infrastructure should track every move.

9 min · June 26, 2026

Explore the pillars

Nine durable guides that organize everything we publish.

AI Tools · 16 pieces

AI Coding Tools in 2026: The Power-User Field Guide

The gap between demo and production is the harness you build around the model, not the…

Search & GEO · 11 pieces

Generative Engine Optimization: How to Earn AI Citations

Search is becoming synthesis. If ChatGPT, Perplexity, and Google's AI Overviews don't cite…

Agents & Harnesses · 19 pieces

Agent Harness Engineering and Agentic Loops: 2026 Field Guide

Execution loops, externalized state, and verification gates now matter more than raw model…

AI Economics · 14 pieces

AI Coding Agent Economics: Real ROI and Cost per Pull Request

Frontier labs now ship more AI-written code than human-written code, but the viral ROI…

Model Evaluation · 22 pieces

Evaluating AI Models and Agents: The 2026 Field Guide

Why static leaderboards lost authority, and how to build an eval program that survives…

Memory & Context · 15 pieces

Context Engineering for AI Agents: Memory, RAG & MCP

Why the context window, not the prompt, is the real bottleneck, and how to engineer…

Security & Safety · 9 pieces

Securing AI Agents and LLM Apps: The 2026 Threat Model

Why indirect prompt injection, tool-mediated exfiltration, and rogue agents now define LLM…

Models & Releases · 14 pieces

AI Models 2026: The Mid-Year Frontier and Open-Weight Map

How the open-weight cluster closed the gap, why reasoning became the default, and which of…

AI Frontiers · 37 pieces

AI Frontiers 2026: Diffusion Models, Multimodal AI & More

A practitioner's map of frontier AI in mid-2026, where independent measurement finally…
