Gen α AI · Field notes for AI builders

Depth over hype, for people who bet on AI.

Evidence-first analysis of agentic systems, model evaluation, and the economics of AI software. We read the system card, find the primary source, and tell you what actually changed — and what didn't.

Evidence over vibes · Depth over volume · Honest about uncertainty
187 · Deep dives published
9 · Evergreen pillar guides
Biweekly · The field briefing
Editor’s picks
New here? These are the pieces we’d hand you first.

The latest
Fresh analysis, published continuously. The full archive lives in the rail and the pillars below.
The Fable 5 Vigil: Superpowers, Doomers, and the China Gap · Models & Releases

Claude Fable 5 Returns With a Government Key in the Lock

Eighteen days of outage turned a coding model into an eschatology. Here's what the saga actually revealed about who controls frontier AI.

15 min · July 1, 2026
Search & GEO

Cloudflare Is Rewiring GEO: Block, Charge, or Allow AI Crawlers

Your robots.txt says allow, but Cloudflare's edge decides first. Here's how to audit the config that quietly governs your AI citations.

11 min · July 1, 2026
Fable 5 Is Back: The Long-Horizon Agent Playbook Power Users Need · AI Tools

Fable 5 Is Back: The Power-User Playbook for Long-Horizon Agents

After an 18-day government suspension, Anthropic's Mythos-class model returns July 1. Here's exactly what it enables over Opus 4.8 and GPT-5.5, and how to wield it.

16 min · July 1, 2026
Claude Science: Anthropic's Workflow Bet on AI for Science · AI Frontiers

Claude Science Is a Workflow Bet, Not a Model Bet

Anthropic shipped a science workbench with no new model. The product is the reviewer agent that audits its own citations, numbers, and figures.

9 min · June 30, 2026
Claude Sonnet 5 Makes Opus 4.8 Hard to Justify for Coding · AI Frontiers

Claude Sonnet 5 Makes Opus Hard to Justify

Anthropic priced Opus-class agentic coding roughly 60% lower, but a new tokenizer and a strict cyber filter quietly shrink the discount.

11 min · June 30, 2026
LongCat-2.0: Meituan's 1.6T Coding Model Trained Without Nvidia · AI Frontiers

A Delivery Company Trained a 1.6T Coding Model, No Nvidia

Meituan open-sourced a near-frontier agentic coding model trained on a 50,000-chip domestic cluster. Here's what's verified and what to do with it.

8 min · June 30, 2026
Meta Restricted Claude Code and Codex. The Real Reason Is Distillation · AI Tools

Meta Curbed Claude Code and Codex. Distillation Is Why

Meta's curb on Claude Code and Codex is a model-distillation and training-data-contamination problem, and it changes how any fine-tuning team must govern AI coding tools.

11 min · June 29, 2026
Meta's Brain2Qwerty v2: Real-Time Brain-to-Text, and the Room It Can't Leave · AI Frontiers

Brain2Qwerty v2 Hits Real-Time Typing, Still Stuck in a Room

Meta's non-invasive decoder jumped to 61% word accuracy by treating an LLM as the denoiser. The hardware is still a half-ton lab.

12 min · June 29, 2026
ChatGPT Logs Are Now Evidence. The Palisades Fire Trial Rewrites AI Governance. · AI Frontiers

ChatGPT Logs as Evidence: What the Palisades Fire Trial Means for AI

A federal arson case just made AI conversation transcripts admissible in court, and that changes what every company must do about AI governance.

12 min · June 29, 2026
Cascaded vs End-to-End Voice Agents: Which Architecture Ships in Healthcare? · Agents & Harnesses

Cascaded vs End-to-End Voice Agents: Which Ships in Healthcare?

The latency gap is narrowing, but the workflow, not the benchmark, picks the architecture.

13 min · June 29, 2026
Pentagon Agent Network: What PSP-2 Actually Tells Us About Defense Multi-Agent AI · Agents & Harnesses

Pentagon Agent Network: The Multi-Agent Architecture No One Is Parsing

The DoW's second Pace-Setting Project names vendors, latency targets, and a hard human-in-the-loop line. The engineering questions are the interesting part.

10 min · June 29, 2026
Agents & Harnesses

Agent Reliability Needs a Score, Not a Gut Feeling

A five-metric scoring framework that turns production agent reliability from vibes into a number you can alert on.

11 min · June 29, 2026
Explore the pillars
Nine durable guides that organize everything we publish.
AI Tools · 18 pieces

AI Coding Tools in 2026: The Power-User Field Guide

The gap between demo and production is the harness you build around the model, not the…

Search & GEO · 12 pieces

Generative Engine Optimization: How to Earn AI Citations

Search is becoming synthesis. If ChatGPT, Perplexity, and Google's AI Overviews don't cite…

Agents & Harnesses · 24 pieces

Agent Harness Engineering and Agentic Loops: 2026 Field Guide

Execution loops, externalized state, and verification gates now matter more than raw model…

AI Economics · 17 pieces

AI Coding Agent Economics: Real ROI and Cost per Pull Request

Frontier labs now ship more AI-written code than human-written code, but the viral ROI…

Model Evaluation · 25 pieces

Evaluating AI Models and Agents: The 2026 Field Guide

Why static leaderboards lost authority, and how to build an eval program that survives…

Memory & Context · 16 pieces

Context Engineering for AI Agents: Memory, RAG & MCP

Why the context window, not the prompt, is the real bottleneck, and how to engineer…

Security & Safety · 11 pieces

Securing AI Agents and LLM Apps: The 2026 Threat Model

Why indirect prompt injection, tool-mediated exfiltration, and rogue agents now define LLM…

Models & Releases · 17 pieces

AI Models 2026: The Mid-Year Frontier and Open-Weight Map

How the open-weight cluster closed the gap, why reasoning became the default, and which of…

AI Frontiers · 47 pieces

AI Frontiers 2026: Diffusion Models, Multimodal AI & More

A practitioner's map of frontier AI in mid-2026, where independent measurement finally…
