Fable Without Fable: Sakana Fugu Ultra's Big Bet
Sakana's most interesting move is selling learned orchestration as a frontier-model substitute, with Fable-class pressure and very different production risks.
Evidence-first analysis of agentic systems, model evaluation, and the economics of AI software. We read the system card, find the primary source, and tell you what actually changed — and what didn't.
The gap between demo and production is the harness you build around the model, not the model you license.
Pillar guide → 02 · Search & GEOSearch is becoming synthesis. If ChatGPT, Perplexity, and Google's AI Overviews don't cite you, you're invisible, and…
Pillar guide → 03 · Agents & HarnessesExecution loops, externalized state, and verification gates now matter more than raw model IQ. Here's how the agents…
Pillar guide → 04 · AI ToolsFrontier labs now ship more AI-written code than human-written code, but the viral ROI numbers are wrong. Here is the…
Pillar guide →
Model EvaluationBenchmarks can tell you whether a model is capable; production evals tell you whether your text, image, OCR, video, and tool pipeline will survive contact with real inputs.
AI FrontiersSynthetic data works when the target distribution is narrow, the answers are verifiable, and real data stays in the loop.
Agents & HarnessesA practical reference architecture for turning biological foundation models, docking, ADMET, LIMS, and lab automation into a measurable closed-loop discovery system.
AI FrontiersThe shift that matters now runs through assays, clinics, model access terms, and the governance layer around frontier biology.
AI FrontiersCandidate generation is getting cheaper. The limiting work is now target biology, safety evidence, biomarkers, and clinical proof.
Models & ReleasesThe fallback to Opus 4.8 is best understood as a frontier-access control system, with real consequences for biotech teams and AI drug discovery workflows.
Search & GEOA complete operating manual for earning citations in ChatGPT, Perplexity, Google AI Overviews, and Gemini without confusing GEO with old SEO theater.
Models & ReleasesHow a defensive cybersecurity preview became the most powerful public AI model, triggered an export-control emergency, vanished worldwide, and returned under restrictions.
The right response to fragmented AI law is a small control plane that produces evidence across states, sectors, and buyers.
AI EconomicsThe Nvidia question is now a workload-matching problem: memory bandwidth, utilization, and latency SLOs decide the real inference bill.
Model EvaluationA production eval program needs offline gates, calibrated human judgment, and live monitoring tied to the failures that cost you money.
AI FrontiersA legal-looking Code becomes a release checklist once you map each Article 53 duty to artifacts, owners, and audit trails.
The gap between demo and production is the harness you build around the model, not the…
Explore →Search is becoming synthesis. If ChatGPT, Perplexity, and Google's AI Overviews don't cite…
Explore →Execution loops, externalized state, and verification gates now matter more than raw model…
Explore →Frontier labs now ship more AI-written code than human-written code, but the viral ROI…
Explore →Why static leaderboards lost authority, and how to build an eval program that survives…
Explore →Why the context window, not the prompt, is the real bottleneck, and how to engineer…
Explore →Why indirect prompt injection, tool-mediated exfiltration, and rogue agents now define LLM…
Explore →How the open-weight cluster closed the gap, why reasoning became the default, and which of…
Explore →A practitioner's map of frontier AI in mid-2026, where independent measurement finally…
Explore →