Across five units, the course assigns something like thirty readings. They disagree about almost everything on the surface — timelines, mechanisms, who should hold the wheel. But disagreement is the visible layer. Underneath it, the readings rest on a small number of shared assumptions that almost no author argues for, because almost no author notices they are assuming it. This essay names six. For each one I do three things: name the readings that depend on it, name the readings that quietly violate it, and ask what changes if the assumption breaks. This is a map, not a verdict.
§01Capability is scalar
The dominant grammar of the course treats intelligence as a quantity — a dial you turn up. Amodei's Machines of Loving Grace runs on "marginal returns to intelligence," which only makes sense if intelligence is the kind of thing you can have more or less of. Aschenbrenner counts "OOMs" — orders of magnitude — and draws a straight line from GPT-2 to superintelligence. AI 2027 plots capability as a curve that bends upward. The METR time-horizon work literally measures capability as a scalar: the length of task (in human-time) an agent can complete at 50% reliability. As of METR's live time-horizons page, frontier time horizons are around twelve hours with a doubling time of roughly four months; the Time Horizon 1.1 update (29 January 2026) revised the post-2023 doubling to about 131 days, roughly 20% faster than the original 7-month trend.
Exactly one reading objects at the root. François Chollet's On the Measure of Intelligence argues that intelligence is not skill but skill-acquisition efficiency — how well you handle novelty you weren't prepared for. On this view a system can post superhuman benchmark numbers and still be near-zero at the thing the word is supposed to mean. The empirical wedge is striking: in ARC Prize Foundation's verified February 2026 evaluation, Gemini 3 Deep Think scored 84.6% on ARC-AGI-2 — above the ~60% human average — yet on the interactive ARC-AGI-3 (released 24 March 2026), the same model scores 0.37%, where humans score 100%. Same systems, opposite verdicts, depending on whether you think capability is one number or many.
| ASSUMPTION | Aschenbrenner | Chollet | Erdil | AI 2027 | RAND | Buterin | Davidson | Amodei |
|---|---|---|---|---|---|---|---|---|
| Capability is scalar | ● | ✗ | ◐ | ● | ◐ | ◐ | ◐ | ● |
| Compute is the binding lever | ● | ◐ | ✗ | ● | ◐ | ◐ | ◐ | ● |
| The developer is well-intentioned | ◐ | ◐ | ◐ | ◐ | ● | ✗ | ✗ | ● |
| Timelines are short (3–10 years) | ● | ✗ | ✗ | ● | ◐ | ◐ | ◐ | ● |
| Western actors write the rules | ● | ◐ | ◐ | ◐ | ● | ✗ | ◐ | ● |
| Defence is the goal | ● | ◐ | ◐ | ● | ● | ◐ | ● | ● |
// pullIf capability is a vector, "more intelligent" stops being a destination and becomes a direction — and you have to say which way.
What changes if the assumption breaks? Almost everything downstream. The intelligence-explosion arguments need scalar capability to feed back on itself. If Chollet is right that capability is situated and jagged, then "AGI by 2027" is not so much wrong as ill-typed — you cannot reach a point on an axis that doesn't exist.11. Chollet himself moved. In his June 2025 conversation with Dwarkesh Patel he shortened his AGI estimate from ~10 years to ~5, citing test-time adaptation and "fluid intelligence" — without conceding the scalar framing. The objector updated his timeline while keeping his ontology.
§02Compute is the binding lever
The second shared assumption is that compute is the control surface — that whoever governs chips, datacentres and training runs governs the outcome. Aschenbrenner's entire strategy is downstream of compute scaling. The AI Treaty letter proposes compute thresholds and a compliance commission. RAND's Securing AI Model Weights builds a five-tier security framework (SL1–SL5) on the premise that the crown jewels are concentrated artefacts you can wall off.
The quiet violator is Ege Erdil at Epoch. His case for multi-decade AI timelines points at a number almost nobody else cites: revenue per H100-equivalent has stayed roughly flat at about $10K/year since the ChatGPT moment. All the revenue growth has come from scaling up the quantity of inference compute, not from each unit becoming more economically productive. If that holds, compute is not a lever that multiplies value — it is a commodity input whose returns are linear, and the binding constraint on transformative AI is something else entirely.
The mid-2026 financial picture complicates things further. OpenAI crossed $25B in annualised revenue at end-February 2026 — extraordinary growth. But Stargate was revised down hard: from ~$1.4 trillion to a roughly $600 billion compute-spend-through-2030 target, explicitly tied to expected revenue. And METR's July 2025 RCT found 16 experienced open-source developers were 19% slower with early-2025 AI tools — against a self-predicted 24% speedup. If compute buys capability but capability doesn't yet buy productivity, the "binding lever" is gripping something that isn't quite the thing you wanted to move.
// pullEveryone agrees compute is a lever. Erdil's flat $10K/H100 quietly asks whether it is the lever — or just the one we can see.
What breaks: the chip-control theory of governance. Export controls, SL5 weight-security, compute treaties — all assume that gating compute gates outcomes. If returns per unit of compute are flat, the most legible policy instrument in the course is aimed slightly off-target.
§03The developer is at least somewhat well-intentioned
Most of the defensive apparatus in Units 3–5 quietly presupposes that the lab building the system is, on balance, trying to do the right thing. Sarah's Introduction to AI Control is explicitly a method for getting safety out of a possibly-misaligned model — but it assumes the humans running the control protocol want it to work. RAND's weight-security tiers protect the developer's assets from outside theft. Even the Redwood control numbers — 62% safety from trusted monitoring, rising toward ~92% with defer-to-trusted — describe a blue team that is on humanity's side by construction.
Two readings problematise the developer directly. Buterin's d/acc states flatly that the centre is often the source of risk. Tom Davidson, Lukas Finnveden and Rose Hadshar's AI-Enabled Coups inverts the assumption completely: its three risk factors — an AI workforce made "singularly loyal" to institutional leaders, "secret loyalties" hidden in systems, and a few people gaining "exclusive access" to coup-enabling capabilities — are precisely the failure modes you get when the developer is the threat.
What breaks: the layered defences mostly stop being defences. A control protocol run by a bad-faith operator is theatre. This is the assumption whose failure is least studied in the course: every layer is designed to stop bad behaviour, but none of them makes good behaviour the attractive choice.
§04Timelines are short
The course's centre of gravity sits at 3–10 years. Amodei's "powerful AI" arrives in the late 2020s. Aschenbrenner dates AGI to 2027 and superintelligence to ~2030. AI 2027 is a year in its title. The AI Treaty letter and the Global Call for AI Red Lines (launched at the 80th UN General Assembly, 22 September 2025, initially signed by 200+ individuals and 70+ organisations including economists Stiglitz and Acemoglu, biochemist Jennifer Doudna, physicist Giorgio Parisi, and AI researchers Bengio and Hinton) both demand action "by the end of 2026." Urgency is the premise.
The dissenters are Erdil (median ~20 years to full automation of remote work) and Narayanan and Kapoor's AI as Normal Technology, which argues diffusion, not capability, is the rate-limiter — and diffusion runs on the timescale of institutions, not training runs.
What breaks: if timelines are long, the emergency framing inverts. The "we must act by 2026" instruments risk locking in rules built for a world that arrives in 2045 — exactly the stasism Toner warns against. Short-timeline readings rarely cost out the harm of premature lock-in, and long-timeline readings rarely cost out the harm of being caught flat-footed.
§05Western actors get to write the rules
The governance readings almost all assume the pen is held in Washington, London or Brussels. Aschenbrenner is explicit: a "coalition of democracies" runs The Project, with Western labs expected to "voluntarily" merge under the USG. The Treaty letter is implicitly Western-led — a CERN-for-AI-Safety modelled on European institutions. Amodei's "entente strategy" frames a democratic coalition setting terms for everyone else.
The mid-2026 record makes this look less like a description and more like a contested bet. The US and UK declined to sign the Paris AI Action Summit declaration (February 2025). The US AI Safety Institute was rebranded the Center for AI Standards and Innovation (CAISI) on 3 June 2025, with "safety" dropped and the remit narrowed toward competitiveness. Meanwhile the Global Call for AI Red Lines drew civil-society signatories including the Beijing Institute of AI Safety and Governance alongside Western names.
RAND's structural critique applies to the genus: any framework that uses AI advantage as leverage generates first-mover incentives and pushes rivals to race or defect. If the pen is shared, "Western actors write the rules" stops being a strategy and becomes a negotiating position.
§06Defence is the goal
The deepest shared assumption is structural and almost invisible: every layer in the course is designed to stop bad behaviour. Prevent dangerous training. Constrain dangerous capabilities. Withstand dangerous actions. Control catches the scheming model. Weight-security stops the theft. Red lines prohibit the unacceptable.
What almost no reading does is make good behaviour the attractive choice — design incentives so the cooperative path is also the profitable one. Buterin's d/acc is the partial exception: the "build the beneficial thing first" move is at least an attempt to make the good outcome materially attractive rather than merely mandated. IFP's selective-acceleration playbook (Human Genome Project, Operation Warp Speed) is another. But these are a minority.
// pullA civilisation that only knows how to say "don't" has not yet decided what it wants to say "yes" to.
What breaks if defence is not enough: you can win every defensive engagement and still lose, because nothing pulls the system toward a good equilibrium. The "withstand" layer feels suspiciously agreeable precisely because it is the most open to interpretation, and therefore the least binding. That is the signature of a corpus that has worked hard on the brakes and barely touched the steering.
The six assumptions are not errors. They are the scaffolding that lets the readings say anything specific at all. The contribution of reading them as a set is that you can see which load-bearing walls are shared, and therefore which single failure would bring down the most rooms at once. If I had to bet on the wall most likely to crack, it is not timelines and it is not compute. It is the sixth one — that we have built a magnificent set of brakes and forgotten to ask where we are trying to go.
This essay was written as part of BlueDot Impact's AGI Strategy cohort (May 2026). The synthesis is my own; any errors in characterising the readings are mine.
— written as part of BlueDot Impact's AGI Strategy cohort (May 2026), in London.