Underlying Assumptions on the AGI Strategy Course

Across five units, the AGI Strategy course assigns something like thirty readings that rest on a small number of shared assumptions, this essay names six. For each one I map three things: name the readings that depend on it, name the readings that may violate it, and ask what changes if the assumption breaks.

§01Capability is scalar

The dominant grammar of the course treats intelligence as a quantity — a dial you turn up. Amodei's Machines of Loving Grace runs on "marginal returns to intelligence," which only makes sense if intelligence is the kind of thing you can have more or less of. Aschenbrenner counts "OOMs" — orders of magnitude — and draws a straight line from GPT-2 to superintelligence. AI 2027 plots capability as a curve that bends upward. The METR time-horizon work literally measures capability as a scalar: the length of task (in human-time) an agent can complete at 50% reliability. As of METR's live time-horizons page, frontier time horizons are around twelve hours with a doubling time of roughly four months; the Time Horizon 1.1 update (29 January 2026) revised the post-2023 doubling to about 131 days, roughly 20% faster than the original 7-month trend.

Exactly one reading objects at the root. François Chollet's On the Measure of Intelligence argues that intelligence is not skill but skill-acquisition efficiency — how well you handle novelty you weren't prepared for. On this view a system can post superhuman benchmark numbers and still be near-zero at this version of intelligence. The empirical wedge is striking: in ARC Prize Foundation's verified February 2026 evaluation, Gemini 3 Deep Think scored 84.6% on ARC-AGI-2 — above the ~60% human average — yet on the interactive ARC-AGI-3 (released 24 March 2026), the same model scores 0.37%, where humans score 100%. Same systems, opposite verdicts, depending on whether you think capability is one number or many.

~/shubzsharma.com/six.py

ASSUMPTION	Aschenbrenner	Chollet	Erdil	AI 2027	RAND	Buterin	Davidson	Amodei
Capability is scalar	●	✗	◐	●	◐	◐	◐	●
Compute is the binding lever	●	◐	✗	●	◐	◐	◐	●
The developer is well-intentioned	◐	◐	◐	◐	●	✗	✗	●
Timelines are short (3–10 years)	●	✗	✗	●	◐	◐	◐	●
Western actors write the rules	●	◐	◐	◐	●	✗	◐	●
Defence is the goal	●	◐	◐	●	●	◐	●	●

● depends on◐ neutral✗ violates

Hover a row to see what breaks if that assumption is false.

→Six assumptions mapped against eight course readings. Each cell shows whether a reading depends on, is neutral to, or quietly violates the assumption. Hover a row to see what breaks if that assumption is false.

// pullIf capability is a vector, "more intelligent" stops being a destination and becomes a direction — and you have to say which way.

What changes if the assumption breaks? Almost everything downstream. The intelligence-explosion arguments need scalar capability to feed back on itself. If Chollet is right that capability is situated and jagged, then "AGI by 2027" is not so much wrong as ill-typed.11. Chollet himself moved. In his June 2025 conversation with Dwarkesh Patel he shortened his AGI estimate from ~10 years to ~5, citing test-time adaptation and "fluid intelligence" — without conceding the scalar framing. The objector updated his timeline while keeping his ontology.

§02Compute is the binding lever

The second shared assumption is that compute is the control surface — that whoever governs chips, datacentres and training runs governs the outcome. Aschenbrenner's entire strategy is downstream of compute scaling. The AI Treaty letter proposes compute thresholds and a compliance commission. RAND's Securing AI Model Weights builds a five-tier security framework (SL1–SL5) on the premise that the crown jewels are concentrated artefacts you can wall off.

According to Ege Erdil at Epoch's case for multi-decade AI timelines, revenue per H100-equivalent has stayed roughly flat at about $10K/year since the ChatGPT moment. All the revenue growth has come from scaling up the quantity of inference compute, not from each unit becoming more economically productive. If that holds, compute is not a lever that multiplies value — it is a commodity input whose returns are linear.

The mid-2026 financial picture complicates things further. OpenAI crossed $25B in annualised revenue at end-February 2026 — extraordinary growth. But Stargate was revised down hard: from ~$1.4 trillion to a roughly $600 billion compute-spend-through-2030 target, explicitly tied to expected revenue. And METR's July 2025 RCT found 16 experienced open-source developers were 19% slower with early-2025 AI tools — against a self-predicted 24% speedup. If compute buys capability but capability doesn't yet buy productivity, the "binding lever" is gripping something that isn't quite intentional.

~/shubzsharma.com/predictions.py

Hover a dot to read the mid-2026 data point.

→Predictions versus mid-2026 anchors. Three forecast lines plotted against the real data points — the gap between ARC-AGI-2 (84.6%) and ARC-AGI-3 (0.37%) on the same model captures the scalar-vs-vector tension in a single season.

// pullEveryone agrees compute is a lever. Erdil's flat $10K/H100 quietly asks whether it is the lever — or just the one we can see.

What breaks: the chip-control theory of governance. Export controls, SL5 weight-security, compute treaties — all assume that gating compute gates outcomes.

§03The developer is at least somewhat well-intentioned

Most of the defensive apparatus in Units 3–5 quietly presupposes that the lab building the system is, on balance, trying to do the right thing. Sarah's Introduction to AI Control is explicitly a method for getting safety out of a possibly-misaligned model — but it assumes the humans running the control protocol want it to work. RAND's weight-security tiers protect the developer's assets from outside theft. Even the Redwood control numbers — 62% safety from trusted monitoring, rising toward ~92% with defer-to-trusted — describe a blue team that is on humanity's side by construction.

Two readings problematise the developer directly. Buterin's d/acc states flatly that the centre is often the source of risk. Tom Davidson, Lukas Finnveden and Rose Hadshar's AI-Enabled Coups inverts the assumption completely: its three risk factors — an AI workforce made "singularly loyal" to institutional leaders, "secret loyalties" hidden in systems, and a few people gaining "exclusive access" to coup-enabling capabilities — are precisely the failure modes you get when the developer is the threat.

What breaks: the layered defences mostly stop being defences. A control protocol run by a bad-faith operator is theatre. This is the assumption whose failure is least studied in the course: every layer is designed to stop bad behaviour, but none of them makes good behaviour the attractive choice.

§04Timelines are short

The centre of gravity sits at 3–10 years. Amodei's "powerful AI" arrives in the late 2020s. Aschenbrenner dates AGI to 2027 and superintelligence to ~2030. AI 2027 is a year in its title. The AI Treaty letter and the Global Call for AI Red Lines (launched at the 80th UN General Assembly, 22 September 2025, initially signed by 200+ individuals and 70+ organisations including economists Stiglitz and Acemoglu, biochemist Jennifer Doudna, physicist Giorgio Parisi, and AI researchers Bengio and Hinton) both demand action "by the end of 2026." Urgency is the premise.

The dissenters are Erdil (median ~20 years to full automation of remote work) and Narayanan and Kapoor's AI as Normal Technology, which argues diffusion, not capability, is the rate-limiter — and diffusion runs on the timescale of institutions, not training runs.

What breaks: if timelines are long, the emergency framing inverts. The "we must act by 2026" instruments risk locking in rules built for a world that arrives in 2045 — exactly the stasism Toner warns against. Short-timeline readings rarely cost out the harm of premature lock-in, and long-timeline readings rarely cost out the harm of being caught flat-footed.

§05Western actors get to write the rules

The governance readings almost all assume the pen is held in Washington, London or Brussels. Aschenbrenner is explicit: a "coalition of democracies" runs The Project, with Western labs expected to "voluntarily" merge under the USG. The Treaty letter is implicitly Western-led — a CERN-for-AI-Safety modelled on European institutions. Amodei's "entente strategy" frames a democratic coalition setting terms for everyone else.

The mid-2026 record makes this look less like a description and more like a contested bet. The US and UK declined to sign the Paris AI Action Summit declaration (February 2025). The US AI Safety Institute was rebranded the Center for AI Standards and Innovation (CAISI) on 3 June 2025, with "safety" dropped and the remit narrowed toward competitiveness. Meanwhile the Global Call for AI Red Lines drew civil-society signatories including the Beijing Institute of AI Safety and Governance alongside Western names.

RAND's structural critique applies to the genus: any framework that uses AI advantage as leverage generates first-mover incentives and pushes rivals to race or defect. If the pen is shared, "Western actors write the rules" stops being a strategy and becomes a negotiating position.

§06Defence is the goal

The deepest shared assumption is structural and almost invisible: every layer in the course is designed to stop bad behaviour. Prevent dangerous training. Constrain dangerous capabilities. Withstand dangerous actions. Control catches the scheming model. Weight-security stops the theft. Red lines prohibit the unacceptable.

What almost no reading does is make good behaviour the attractive choice — design incentives so the cooperative path is also the profitable one. Buterin's d/acc is the partial exception: the "build the beneficial thing first" move is at least an attempt to make the good outcome materially attractive rather than merely mandated. IFP's selective-acceleration playbook (Human Genome Project, Operation Warp Speed) is another. But these are a minority.

// pullA civilisation that only knows how to say "don't" has not yet decided what it wants to say "yes" to.

What breaks if defence is not enough: you can win every defensive engagement and still lose, because nothing pulls the system toward a good equilibrium. The "withstand" layer feels suspiciously agreeable precisely because it is the most open to interpretation, and therefore the least binding.

The six assumptions are not errors. They are the scaffolding that lets the readings say anything specific at all. The contribution of reading them as a set is that you can see which walls are shared, and therefore which single failure would bring down the most rooms at once.

— written as part of BlueDot Impact's AGI Strategy cohort (May 2026).

shubz@torus~/writing/bluedot-assumptions%ls ../related/# 3 essays

ai strategy & policy

A unit on bending the curve

Who shapes artificial intelligence and what 'shape' means.

22 min · ↗ read

ai strategy & policy

The same word, five meanings

How most AI debates dissolve once you say whose dictionary you're using.

14 min · ↗ read

ai strategy & policy

The civilisation frame

Four readings, four ideologies, one shared unit of analysis.

16 min · ↗ read

← PREVIOUS

A unit on bending the curve

The same word, five meanings

Underlying Assumptions on the AGI Strategy Course.

§01Capability is scalar

§02Compute is the binding lever

§03The developer is at least somewhat well-intentioned

§04Timelines are short

§05Western actors get to write the rules

§06Defence is the goal

Related essays