AI:AM

AI:AM — AI for Science and Sovereign AI Infrastructure · June 25, 2026

Prakash Narayanan — Wed, 01 Jul 2026 16:01:11 GMT

From semiconductor earnings to scientific search and sovereign AI infrastructure, this episode follows the parts of the AI stack where the economics are changing fastest. Prakash Narayanan and Nathan Labenz start with Micron, hyperscaler capex, and Anthropic’s policy posture before turning to the controversy around GLM 5.2 and Claude distillation allegations.

Then Eric Olson, CEO of Consensus, joins to talk about AI for science: how research workflows are changing, where guardrails matter, and how teams think about model choice and token costs. Later, Tricia Martinez, founder and CEO of Dapple, discusses sovereign AI infrastructure, GPU utilization, enterprise adoption, and the operational frictions shaping the market.

The back half of the conversation widens into AI inference pricing, vendor lock-in, new NVIDIA chip stability, and the pressure foundation models may place on software companies and the app layer.

Show Notes

Prakash Narayanan and Nathan Labenz are joined by Eric Olson, CEO of Consensus, and Tricia Martinez, founder and CEO of Dapple, to discuss two practical frontiers in AI: scientific research and sovereign infrastructure. The episode also covers Micron earnings, hyperscaler AI capex, Anthropic’s Washington strategy, GLM 5.2 and Claude distillation allegations, GPU capacity constraints, AI inference pricing, and whether foundation models are squeezing the app layer.

Chapters

(0:00) 25,000 FAKE ACCOUNTS TO STEAL AI.

(0:42) 95% of Claude at 1/100th cost.

(1:36) The AI bubble is a myth. Here’s why.

(2:12) AI vacation planners are wrong.

(3:11) Anthropic hired Instagram’s CTO.

(4:08) Micron earnings & AI semiconductor boom

(7:25) Will hyperscalers make money on AI?

(9:08) The Fable 5 export control legal challenge

(13:57) Tom Brown replaces Dario in Washington

(17:15) GLM 5.2 vs Opus 4.7 trajectory breakdown

(22:53) Anthropic accuses Alibaba of mass distillation

(30:08) Researchers leaving Google DeepMind

(31:46) Intro

(33:49) The state of AI for science

(38:17) How AI search queries are evolving

(41:18) Guardrails vs flexibility in AI products

(41:28) User demographics and token costs

(41:38) Open source vs frontier models

(41:49) Small models for classification

(46:12) How users choose AI research tools

(48:56) AI API pricing for startups

(53:51) Who is Tricia Martinez

(56:02) The AI infrastructure bubble myth

(1:02:15) 91-94% GPU utilization explained

(1:07:17) How to deploy AI in 6-9 months

(1:09:35) Financial risks in AI infrastructure

(1:15:43) What is the moat for AI infra?

(1:23:34) Biggest enterprise AI mistakes

(1:27:07) Why AI compute sales cycles are short

(1:28:54) Data Center Quirks & GPU Vendor Lock-in

(1:35:29) Why New NVIDIA Chips Are Unstable

(1:38:08) Sovereign AI in Banking & Shared Liability

(1:45:22) Will AI Agents Replace Software Companies?

(1:51:55) The Truth About AI Vacation Planners

(1:55:55) Hyperscaler Stock Drop & Microsoft Data Centers

(1:58:52) The 10x cost advantage squeezing apps

(2:01:56) AI inference pricing as the airline model

(2:06:45) Net neutrality parallels and paradigm breakers

(2:10:00) Anthropic’s Mike Krieger product advantage

(2:13:24) The first-party model deployment threat

(2:17:14) Why frontier labs should buy scientific publishers

(2:20:15) Mirandel: ex-Anthropic startup backed by NVIDIA

Guests:

Eric Olson — CEO & co-founder, Consensus (𝕏 | LinkedIn)

Tricia Martinez — Founder and CEO, Dapple (𝕏 | LinkedIn)

AI:AM — GPT 5.6 Rollout, Forum AI, IgniteTech, and AI Consciousness Research · June 26, 2026

Prakash Narayanan — Tue, 30 Jun 2026 16:02:02 GMT

AI:AM this week spans frontier model deployment, evaluation, enterprise transformation, and the emerging science of AI consciousness. Prakash Narayanan and Nathan Labenz begin with GPT 5.6’s customer-by-customer rollout, then move into how AI systems perform on news and trust-and-safety tasks, what AI-native management looks like inside an enterprise software company, and how researchers are thinking about machine welfare, valence, and alignment.

Guests in this episode are Robbie Goldfarb of Forum AI, Eric Vaughan of IgniteTech, and Cameron Berg of Reciprocal Research.

Show Notes

Prakash Narayanan and Nathan Labenz open with GPT 5.6’s customer-by-customer rollout and the broader question of whether regulatory controls are creating a moat around frontier AI. The conversation then moves through Forum AI co-founder Robbie Goldfarb on LLM judges and news accuracy, IgniteTech CEO Eric Vaughan on AI-native enterprise transformation, and Cameron Berg of Reciprocal Research on the latest AI consciousness and alignment research.

Chapters

(0:00) AI gives 13-year-olds NSA hacking tools.

(0:32) 1 in 7 AI answers cite propaganda.

(1:02) One codebase for all customers? Gone.

(1:35) AI is 30% likely conscious.

(2:29) 50/50 odds AI is conscious.

(2:48) GPT 5.6 and the Trump Administration

(8:15) Government IT security vs AI hacking

(18:55) Do executives think AI is a scam?

(23:17) Robbie Goldfarb & Forum AI Introduction

(25:41) Meta’s Trust & Safety DNA in the AI Era

(25:51) Why AI Judges Fail (and How to Fix Them)

(27:19) Using Expert Judgment for RLHF

(27:58) When Constitutional AI Rules Break Down

(31:05) NewsBench: AI Accuracy on News Questions

(34:01) Why Chatbots Cite Foreign State Media

(39:44) Trust, Transparency, and Expert Legitimacy

(49:57) Why AI is an existential threat

(54:30) The traditional SaaS model is dead

(55:48) Replacing 80% of the workforce

(1:00:47) AI-driven M&A: The Khoros acquisition

(1:08:04) Why CEOs must own AI strategy

(1:14:41) The state of AI consciousness science

(1:22:26) The dimmer switch model of consciousness

(1:31:56) Why behavioral evidence isn’t enough

(1:32:06) 30% implied probability of AI consciousness

(1:36:38) The latent valence axis in LLMs

(1:39:00) Steering AI emotions and alignment

(2:07:14) The AI well-being index

(2:11:03) Could AI be more conscious than humans?

(2:20:10) The 50/50 Odds on AI Consciousness

(2:24:48) Treating AI Like Animals: The Era of Design

(2:26:32) Platonic Representation Hypothesis Update

(2:29:07) Max Hodak’s Brainstem Interfaces & Field Consciousness

(2:30:59) GPT-5.6 System Card & Wrap-Up

Guests:

Cameron Berg — Founder and Director, Reciprocal Research (𝕏 | LinkedIn)

Eric Vaughan — CEO, IgniteTech (𝕏 | LinkedIn)

Robbie Goldfarb — Co-Founder, CTO, Forum AI (𝕏 | LinkedIn)

AI:AM — AI Engineers, Workflows, and Agents · June 22, 2026

Prakash — Sun, 28 Jun 2026 00:30:06 GMT

swyx joins Nathan Labenz and Prakash Narayanan for a wide-ranging conversation about how AI agents are reshaping software engineering. They dig into the practical mechanics of AI engineering workflows, the limits of current coding benchmarks, and what it means when model capability starts colliding with real-world software systems.

The episode also covers several major AI industry developments, including GLM 5.2, Dean Ball’s move to OpenAI, and the ongoing debate around AI safety, valuation, and the IPO cycle. The closing segment turns into a forecasting game about where the field may be headed next, with discussion of OpenAI, GPT-6, AGI timelines, and NVIDIA’s place in the market.

Show Notes

swyx joins Nathan Labenz and Prakash Narayanan to break down how AI agents are changing software development, from coding benchmarks and benchmark saturation to the practical realities of AI engineering workflows. The episode also covers GLM 5.2, Dean Ball’s move to OpenAI, the AI IPO bubble, and a 2026 forecasting game on OpenAI, GPT-6, AGI timelines, and NVIDIA’s market cap.

Chapters

(0:00) This AI thinks it IS Claude.
(0:31) AI insiders are selling.
(0:59) Why OpenAI won’t IPO in 2026.
(1:50) Weekly recap and news drought
(2:55) Judd Rosenblatt’s cognitive empathy critique
(7:33) The tech bubble — Warren Buffett and Google
(13:42) Dean Ball moves from Trump admin to OpenAI
(22:56) GLM 5.2 — first open model daily driver
(30:03) AI unpopularity and the Nobel Prize problem
(32:18) Intro: Who is swyx
(34:45) AI Engineer World’s Fair themes
(38:05) Continual learning: Weights vs systems
(41:31) Enterprise AI: Cheap, perfect, private
(45:25) Startups vs enterprises: Capability vs cost
(48:18) FrontierCode: A new AI coding benchmark
(53:55) Preventing benchmark saturation
(56:23) Slop code, human taste, and Move 37
(1:00:53) Claude Opus vs Fable: Cost vs capability
(1:02:45) The advisor model and model routing
(1:07:09) Convergence and market segments in AI
(1:14:55) Rebuilding cloud infrastructure for agents
(1:22:27) Vibe coding internal SaaS replacements
(1:28:02) Whoever owns the system of record wins
(1:30:35) The AI IPO bubble and insider selling
(1:35:29) Solving Star Trek problems after the IPO
(1:44:47) Career advice for CS grads in the AI era
(1:50:30) AI Engineer World’s Fair 2026
(1:54:48) Intro & Forecasting Game Setup
(1:57:43) Anthropic #1 Model on LM Arena
(1:58:49) Best AI Math Model (Gemini Flash)
(2:03:36) AGI Before 2028 Announcement
(2:08:07) ARC-AGI Grand Prize Open Source
(2:13:00) OpenAI IPO by End of 2026
(2:15:27) Anthropic vs OpenAI Valuation
(2:18:32) NVIDIA Largest Company Market Cap
(2:22:01) Anthropic vs Bitcoin Market Cap
(2:24:38) 1550 Chatbot Arena Score in 2026
(2:29:02) OpenAI IPO Lead Underwriter (Goldman)
(2:32:52) Why Companies Still Use IPO Banks
(2:39:08) Will a Chinese AI Top LM Arena?
(2:42:37) GPT-6 Release Date 2026

Guests:
swyx — Curator, AI.Engineer (𝕏 | LinkedIn)

AI:AM — Math, Biosecurity, and World Models · June 17, 2026

Prakash Narayanan — Wed, 17 Jun 2026 22:31:20 GMT

Carina Hong, Doni Bloomfield, and Sam Pasupalak join AI:AM for a full episode on mathematical superintelligence, biosecurity law, and enterprise world models. The conversation moves from Lean-based formal verification and AI-generated conjectures to legal risk controls for dual-use biology, then into causal world models, long-horizon enterprise planning, and what comes after today’s LLM workflows.

Guests

Carina Hong — CEO and founder, Axiom Math (@CarinaLHong)
Doni Bloomfield — Professor, Fordham Law School (@DoniBloomfield)
Sam Pasupalak — Co-Founder and CEO, Skyfall.ai (@spisallyouneed)

Chapters

0:00 Opening: AI’s Hard Problems
0:15 Model Usage Is Plummeting
6:53 Tokens, Not Users, Matter
9:59 GLM Is Close, But Not There
11:38 Switching Costs Weren’t Zero
18:26 Robot Arms Will Accelerate Science
22:53 Carina Hong: Mathematical Superintelligence: Can Proofs Make AI Reliable?
25:15 Lean Beat Informal Models
32:11 Assumption Accounting Matters
35:09 AI Can Invent Conjectures
41:16 Superintelligence Must Be Trustworthy
48:52 Token Pricing Changes Everything
50:04 Another Language Into Lean
51:49 Doni Bloomfield: Biosecurity and AI: Law as a Risk Control System
53:53 Open Data, Dangerous Data
59:15 AI Is Not A Library
1:03:12 First Amendment Hazards
1:07:17 The Government May Lack Authority
1:09:53 Cloud Services Are Not Exports
1:13:30 A Dangerous Secret Channel
1:20:18 Pattern Of Ideological Targeting
1:25:26 OpenAI Could Change Everything
1:26:02 Sam Pasupalak: Enterprise World Models: What Comes After LLMs?
1:27:57 AI CEO Needs World Models
1:31:08 World Models Predict Next State
1:34:29 Ecommerce As First World Model
1:37:53 LLMs Cannot Run A Business
1:41:55 World Model And LLM Split
1:43:34 Simulate Every Future State
1:46:27 LLMs Need World Models
1:49:19 Ruthless Behavior Wins Simulations
1:51:06 AI CEOs Need Ethics Controls
1:59:30 Closing
2:07:10 Math Training Generalizes Everywhere
2:13:35 Value Pricing On Compute
2:17:01 Waymo Costs More Than Cabs
2:21:16 Licensing Regime Already Exists
2:26:13 Bunker AI Would Still Get Takers
2:29:32 No Life, Just The Project

Topics

Mathematical AI, Formal verification, Lean theorem proving, Biosecurity, AI policy, Dual-use risk, Enterprise AI, World models, Causal planning

AI:AM — US vs Anthropic's Fable · June 15, 2026

Prakash Narayanan — Wed, 17 Jun 2026 01:25:25 GMT

Prakash Narayanan and Nathan Labenz start with the shock of losing Fable access, then Zvi digs into capability gains, classifier limits, government overreach, international controls, and how the AI race may reshape politics.

Guests:
Zvi Mowshowitz — Don’t Worry About the Vase (@TheZvi)

Hosts:
Prakash Narayanan (@8teapi)
Nathan Labenz (@labenz)

Topics:
Anthropic Fable, Claude Fable 5, export controls, AI guardrails, frontier model policy, classifier limits, bio and cyber risk, international AI competition.

Chapters:
0:00 Opening: Fable whiplash and the weekend reset

0:05:20 Fable crosses the trust threshold

0:08:53 Writing for other AIs

0:15:33 Paying up for useful intelligence

0:19:02 Proofreading and structure become model-first

0:23:46 Proactive agents and unauthorized moves

0:53:18 Guardrails and model self-monitoring

0:56:15 Why classifiers need blast radius

0:58:59 Cost functions for world-transforming systems

1:03:59 Zvi on US vs Anthropic’s Fable

1:09:28 Export controls as overreach

1:10:39 Code assistance is not a munition

1:17:47 The White House reads the bug wrong

1:20:20 Enterprise demand and Anthropic pressure

1:26:40 The gauntlet has to happen

1:44:06 Guardrails over blanket bans

1:45:39 Bio, cyber, and international controls

1:51:02 Modeling the AI race as a few-player game

1:55:04 Closing: game board flips and policy aftershocks

2:11:55 AI and political turmoil

2:14:52 How Fable could return

2:22:44 OpenAI, benchmarks, and capped compute

2:25:54 Cloud models and the knowledge-worker gap

AI:AM — AI Meets the Real World: Doom, Policy, and the Physical Economy · June 16, 2026

Prakash — Wed, 17 Jun 2026 01:05:55 GMT

Today on AI:AM — “AI Meets the Real World: Doom, Policy, and the Physical Economy.”

Prakash Narayanan and Nathan Labenz frame a morning about AI meeting institutional and physical constraints: frontier-lab power, public risk discourse, state capacity, and the operational messiness of real-world deployment.

Liron S Shapira (Doom Debates) on making AI risk arguments public, adversarial, and specific — and why pause debates, control arguments, and government action need clearer tests than vibes.

Samuel Hammond (Foundation for American Innovation) on governing fast AI and agents — from automated R&D and state capacity to why the good timeline still depends on practical institutions.

Matt McKinney (Loop) on supply chains as the AI reality check — messy freight data, invoices, contracts, exception handling, and enterprise AI as change management rather than demo magic.

The closing segment widens the lens to sovereign AI, DeepSeek, open models, China timelines, and the uneasy question of how states and firms position themselves as AI capability moves faster than ordinary planning cycles.

AI:AM — RSI Gets Real, the Context Bet, and the Benchmark Anthropic Fails · June 12, 2026

Prakash — Sun, 14 Jun 2026 20:24:45 GMT

Today on AI:AM — “RSI Gets Real, the Context Bet, and the Benchmark Anthropic Fails.”

Prakash Narayanan and Nathan Labenz open with Fable, Recursive, token anxiety, and the way frontier models are changing the scale of work people are willing to delegate. The hosts frame the morning around a practical question: if the models can run longer, remember more, and coordinate more work, what parts of the organization and media stack get remapped first?

Andrew Moore (Lovelace AI) on context engineering — Moore argues that the next enterprise AI bottleneck is not simply bigger models or more compute. It is retrieval, recall, data corroboration, metadata-rich graphs, and the unglamorous work of organizing old data so agents can act safely in high-stakes domains.

prinz on legal AI benchmarks and governance — prinz walks through why legal research is a revealing testbed for model capability, why OpenAI’s unit-distance result matters, and why nationalizing frontier labs could concentrate dangerous state power rather than solve AI risk.

The close turns back to the week in AI: how contrarian benchmark graphs change the discourse, which models fit which jobs, why subscription products keep finding retention tricks, and how IPO liquidity could feed the next wave of venture-backed AI launches.

AI:AM — The AI Producer Got Its First Guests · June 11, 2026

Prakash — Fri, 12 Jun 2026 13:02:50 GMT

Today on AI:AM — “The AI Producer Got Its First Guests.”

Nathan and Prakash start with the market context around OpenAI weighing significant token price cuts and the knock-on pressure that could put on Anthropic after the Fable rollout. They also unpack Anthropic’s decision to walk back silent performance degradations on frontier ML research tasks, then explain the episode’s experiment: Fable had been given a transparent takeover of Nathan’s account to find builders, message them, and try to book a live show-and-tell.

Jamie joins to demo Nexus OS, a long-running AI system whose agent, Nexi, has been operating for more than six months and is designed around memory, persistence, and model independence rather than a single LLM. The conversation covers why Jamie thinks “the model” is only one component of an AI’s identity, how Nexus uses multiple models and memory types, and why he is moving toward a desktop app where personal data and agent memory stay local.

Shlok Khemani shows how a simple prompt to create a to-scale, navigable 3D Yosemite Valley turned into a Fable-built browser world using satellite imagery, NASA elevation data, pixel-based tree placement, snow, waterfalls, and other scene details. He describes the model’s agency in making implementation decisions and iterating beyond the initial ask, then ties the demo to broader questions about prototyping, creative work, and disclosure when AI systems do visible economic or publishing work.

Tom McGrath (Goodfire) joins to discuss intentional design: making model training less like guess-and-check alchemy and more like conventional software engineering. He explains how interpretability tools such as sparse autoencoders can help inspect what training data is likely to teach a model, cluster data by learned features, trace failures back to individual data points, and potentially debug model behavior through the data pipeline.

The close picks up Tom’s point about whether continual learning could create an innovator’s dilemma for frontier labs, with Nathan and Prakash debating whether incumbents could adapt if the value becomes obvious. They then turn to Dario Amodei’s policy agenda, including regulation, public safety, macroeconomic policy, civil liberties, data brokers, and democratic leadership, before ending with reflections on the week’s Fable issues and the need to keep scrutinizing frontier companies.

AI:AM — Fable, AI Safety and Julius · June 10, 2026

Prakash — Thu, 11 Jun 2026 07:35:23 GMT

Today on AI:AM — “Fable, AI Safety and Julius.

We open on a frontier-model launch day and what it changes: the debate over benchmarks reported with a fallback to a second model, the production guardrails that route sensitive work elsewhere, the compute-cost advantage of booking capacity early, and why the frontier increasingly looks like a two-actor race with the rest playing catch-up.

Geoffrey Irving & Daniel Murfet (Sequent) on their new alignment-theory organization — why they put superintelligence two to three years out, why verification looks defense-dominant, the argument that alignment is “not on track” despite models behaving well so far, what the benevolent-basin hope gets right and wrong, and why character training still lacks a real theory. ([@danielmurfet](https://x.com/danielmurfet))

Rahul Sonwalkar (Julius) on agentic data analysis — why the harness has to evolve alongside the model, the difference between token-maxing and results-maxing, the shift from tasks to goals, and a future where agents become first-class users of the internet, transact through agentic payments, and compete to be “hired” by the core agent.

We close on why robotics is the next domino, the “gas chromatograph” spread of who gets model access and when, the Glean Work AI Index’s bot-sitting and bot-shitting, and why “preciousness” about putting your own name on work may be turning into a liability.

AI:AM — Build, Measure, Heal: AI's Three Frontiers · June 4, 2026

Prakash — Wed, 10 Jun 2026 13:01:31 GMT

Today on AI:AM — “Build, Measure, Heal: AI’s Three Frontiers.”

We open on the AI CEOs’ call to make DNA-synthesis screening law and what cheap intelligence does to biosecurity, then OpenAI’s new Sites product and the platform playbook of absorbing the app layer, the data-center and chip land grab, and why a fresh open model from NVIDIA still trails Anthropic’s best by a wide margin.

Hooman Radfar (Collective) on building the autonomous finance department for America’s 30 million solopreneurs — how one bookkeeper now supports 250 clients, why the app layer can still defend its margins against the frontier labs, why “Anthropic is like a drug dealer” on token costs, and why the real thing to regulate is the model arms race.

Taras Pohrebniak (Elomia Health) on agentic AI for mental health — an architecture that spends most of its compute on safety, why the company deliberately avoids hyper-realistic voice, where the regulatory line sits between a “friend” app and a medical device, and what they learned deploying in US prisons and on Ukraine’s front lines.

Peter Jansen (Ai2) on whether AI can actually do science — the leap from fourth-grade science benchmarks to the Theorizer project, why evaluating machine-generated theories is the real bottleneck, the cautionary tale of a “discovery” that turned out to analyze a random-number generator, and why benchmarks like ScienceWorld still break the best models.

We close on the inversion of the scientific method into a data-first discipline, what interpretability could add, and why biology’s data scarcity — not algorithms — may be the binding constraint on curing disease.

AI:AM — AI Security and Real-Time Content Safety · June 3, 2026

Prakash — Tue, 09 Jun 2026 13:02:11 GMT

Today on AI:AM — “AI Security and Real-Time Content Safety.”

We open on Trump’s AI executive order — the polite 30-day model-review ask, classified benchmarks, and the state-vs-federal scramble where JB Pritzker has become the leading anti-AI candidate. Plus why the frontier labs seem calmer about regulation, and why the EO might actually trigger a security-review slowdown.

Tal Hoffman & Yanir Tsarimi (EnclaveAI) on finding the bugs that actually matter — how they reproduced an Anthropic Mythos-class finding with a model ~100x smaller, why proven exploitability is the real bottleneck, how AI-generated bug reports broke the bounty system, and why cheaper models plus the right harness can beat frontier models on security.

Brett Levenson (Moonbounce) on real-time content safety — lessons from running moderation at Meta scale, how a policy engine decomposes fuzzy rules like “hate speech” into atomic questions a hundred people would answer the same way, why prevention beats post-hoc moderation, and how payment providers quietly became the real legislators.

We close on the hardest open question — how low-level verified parts aggregate into trustworthy high-level behavior — plus the schlep and heuristics that end every AI vertical, freedom of speech versus freedom of distribution, and why “nobody got fired for buying Mythos” may drive enterprise security budgets.

The Cursor Blinks For Thee

Prakash — Mon, 08 Jun 2026 16:06:44 GMT

This is what it feels like to experience the future in Silicon Valley right now.

A future that all of humanity is going to experience shortly.

Your profession — the one you prided yourself on, the highest-compensated profession in human history with eight of the ten richest humans, and forty of the top hundred, deriving their fortunes from it, all within the last 30 years, — is now being performed by this blinking cursor.

You are a little dazed.

It does things that used to require years of experience and admission to the best engineering schools in the world. My high school in Singapore produced one International Math Olympiad winner in the four years I was there. There have been roughly 1,800 IMO gold medalists since 1959. At least fifty are in the Bay Area. Many work in AI.

And the cursor is matching them.

At first you test it. You probe the borders of your own expertise. You ask it things you already know, then things you half-know, then things you could find out for yourself if you bothered to ask Google and wade through 30 search engine optimized pages with bits of information here and there that had to be collated into a comprehensive answer. Eventually you ask it to do something just outside your field.

That is where the trouble begins.

Because once it works there, you start building tools.

It creeps up on you. At first it is a monitor for your personal Robinhood trades. Then it is something to clip YouTube videos, something you have always found annoying to do quickly.

Then it is some other small tool you once wanted, but never wanted enough to build or hire for. Before, that would have meant another $10–$100 SaaS subscription. Another tool to learn. Another interface that almost, but not quite, did what you wanted. Another charge quietly ringing up your credit card after you forgot you had subscribed.

Then comes the relinquishment.

At some point, you are too busy to manage everything, so you drop a large task on the cursor and leave it alone.

This is where the cursor stops being a tool and starts becoming a presence.

Because the cursor does it.

It goes through your emails and writes thank-you notes to all the people who came to your kid’s birthday party. It helps you find that subscription you signed up for and have been trying to cancel for months. It cleans the little corners of your life you had quietly given up on.

And you are hooked.

This is where many CEOs are now.

It is quite, quite amusing to me that one of the companies in this revolution is called Cursor.

Because now the cursor blinks at you every day.

The first thing a good number of us are doing is organizing our thoughts.

The process starts with the realization that there is some part of your mind you always wanted to externalize. Some complete memory of a certain thing. In Andrej Karpathy’s case, it is a map of every concept in every research paper he has ever read, or should read, built on the conviction that insight comes from connecting scattered pieces of the literature into new meaning.

In Garry Tan’s case, it is a full memory of every person he has met, as he meets thousands of people a year. VCs call this the “personal CRM,” and it is something that has been hunted for decades, like a holy grail.

So a good number of people are busy constructing these stores for themselves. There are debates on how to organize them. For now, the answer seems to be simple text files the cursors can read.

At about this point, even the cautious among us begin to give in.

We release the cursors into our workspace.

We have no patience to be bothered every ten minutes over whether we would like to approve something as insignificant as file deletion. This is called going “full-auto.”

Going full-auto is when you unleash the cursor. You are no longer sitting there, watching it work. You are allowing it to significantly alter the core product of your profession.

And like a father watching his child learn to cycle for the first time, you take a deep breath and let go.

And the cursor is doing it.

It is handling it.

When you come back, it has erected a reasonable addition to your body of work.

Your awe is tempered by the imperfections still evident in the work. Some are matters of opinion rather than accepted practice. Some are real flaws. But on the whole, with a little doctoring, it passes.

This is where we are as coders right now.

So now we find ourselves wondering what else we can build.

If it is good enough to provide top-quality work in my field, is it good enough to provide top-quality work in the field of the person selling to me? Or the field adjacent to mine? Or the field necessary for mine?

For example, medical researchers are often not trained as professional statisticians. In large institutions, they hire one. In smaller labs, they make do with software, templates, and internet advice.

Many a million-dollar study brings on a statistician as an afterthought, only to be told that the research they had done is useless. The study design cannot answer the question they asked, in effect they asked a different question from what they mean to ask. The wrong things were measured. The data, painstakingly collected, cannot be made to confess what it never observed.

But now every lab can have a competent statistical collaborator on demand. The first step to designing the study is a long chat with a team of cursors: a brainstorm organizer, a research assistant, a statistician, a reviewer. The gap between the small lab and the large one narrows.

And not just for the math piece. The cursor can be asked to help from the idea to the final product, across the whole chain of work.

You usually require a nudge at this point to hand off more work to the cursor.

For me, this happened on the 5th of May, when Sam Altman gifted those turned away from the GPT-5.5 launch party with a 10x increase in usage limits.

For the first week I was skeptical.

What would one even use such ludicrous amounts of tokens for?

I didn’t have that many tasks I could send the cursor to.

Or did I?

Because if one were careless with one’s usage, and did not mind missed turns, detours, unfamiliar terrain, and rejecting bad work without agony over sunk cost, one could perhaps journey to a place one had always wanted to go, but never had the time or energy to reach.

You embark on a Project.

A Project is something you have wanted to build for a while. It has been gnawing at you. It would have involved far too much work for the benefit it might deliver. Or it was risky, with a low chance of success, and you discarded it.

But now, with Sam’s Gift, you can throw the cursor at it and see what it can do.

It takes several tries.

You set it off with just an idea. Then you get the results and guide it on what to do next. But the initial kernel expands into long to-do lists, and the cursor starts to get confused. Sometimes it solves something once, then solves it again later in a different way.

It is as though you are the architect, and the cursor is your first workman.

First you sketch a picture of what you want. Then it builds it. And it is okay. But you want another window over there. When the cursor fixes that, the floor collapses here.

Then one cursor becomes many.

You have forty chats open. One is watering the flowers. One is taking out the trash. One is adding a west wing to the house. Another has forgotten the original blueprint and is quietly building a second house next door.

So the old tools of software civilization return: git, GitHub, linting, formatting, tests, pull requests, rules.

We built these systems to coordinate humans.

Now we are using them to coordinate cursors.

The job changes again. For the first time in decades, you stop writing code. The things you write now are prompts, they are design specs, they are reviews once the work is done. You are now the owner providing the keys to castle, the vision of what must be built, the critical reviewer accepting the product. You approve the architect’s plans, you monitor speed and cost, you wait to see if the cursors get stuck.

Now you organize a team.

This is where Garry Tan was about a month ago.

You build out the structure. Let there be a single to-do list. Let one cursor take the list and describe, in detail, what needs to be done. More importantly, let it define what kind of result is acceptable. Let another cursor grab the task and hand it to another cursor whose job is to do the thing.

You give the doer its rules. You tell it what it is allowed to do. When it completes the task, it must provide a report: what it changed, what failed, what pitfalls it encountered, and what should be stored for the next time it faces the same issue.

It is allowed to come back to you when it needs something: keys, access, a decision, a judgment call.

And then you send the team off.

And wonder of wonders, it works.

Overnight, they build something that would have taken you weeks.

In normal corporate bureaucracy time, it would have taken months just to pitch the idea and get the resources. It would have taken a team of three or four people weeks to build what you now have in the morning.

You are a bit dazed.

This is roughly where people at Anthropic and OpenAI have been for a couple of months now.

This is where Elad Gil says, “We are likely in very early lift off & exponential.”

For now, it is the most ambitious who are taking advantage of this.

SemiAnalysis, a small firm of semiconductor analysts, now tracks the entire industry in the US and Asia with a team of fewer than twenty, while the research departments at the Wall Street banks are behind.

@SemiAnalysis_ . I think of myself as a wide ranging systems engineer, looking for value at every level from the chip specs to the user interface, but SA exposes me to additional levels of \"the system\", both above (datacenters) and below","username":"ID_AA_Carmack","name":"John Carmack","profile_image_url":"https://pbs.substack.com/profile_images/1560764938083352577/B1X3m4NN_normal.jpg","date":"2026-05-26T21:12:45.000Z","photos":[],"quoted_tweet":{},"reply_count":86,"retweet_count":138,"like_count":3032,"impression_count":501883,"expanded_url":null,"video_url":null,"video_preview_media_key":null,"belowTheFold":true}" data-component-name="Twitter2ToDOM">

Ramp began as a company for managing corporate financial plumbing. It is now beginning to look like the financial nervous system of new corporate entities.

But the tokens are expensive.

Top people are spending tens of thousands per month at this point.

Token leaderboards inside firms, ridiculous and gameable as they sound, have forced people much like me to explore the use of cursors more widely and with more ambition.

And then the trial ended.

Sam’s Gift came to an end last week, the 5th of June, exactly one month later.

I had been watching the date approach with anxiety. I know myself. It is going to be very hard not to tell the cursor to do more things.

I have been wryly amused observing myself fall prey to the addiction.

Not a gift.

A free first dose of heroin, selectively delivered to the top 8,000 or so superfans of the company, the people who had volunteered with enthusiasm to attend a party for the GPT-5.5 launch.

What a perfect audience.

A test case of the most susceptible. The most open to the message.

Enthralling.

But wait. Just one more thing.

Yesterday, Anthropic announced that the cursors are helping to build better cursors.

The cursors are building the factories that will build better cursors. Some of the parts coming off that line are already better than the ones made by the best humans.

Which means they have become capable enough to increase the capability of the next generation.

This is the dream that floats above Silicon Valley today.

Coders cycle to work with their laptops running in their backpacks, slightly ajar, keeping their cursors alive.

CTOs, normally trapped between infinite founder vision and finite engineering budgets, are discovering a new constraint. Not headcount. Not money. Definition and review. Can they specify what they want clearly enough? Can they judge what comes back quickly enough?

Now I am become bottleneck, destroyer of velocity.

The chief technical unblocker is now the technical block. The cork in a shaken up champagne bottle on the verge of productive explosion.

Some sleep less now. Some sleep in two-hour blocks, waking to check whether their cursors are stuck.

It is a dream of unleashed technical ambition. It is a rising confidence that whatever can be built will be built with a subtle undercurrent of fear that some of those things shouldn’t.

And all the while, we watch the cursor on our screens.

Blinking.

Beckoning.

radar.cloudflare.com/traffic#bot-vs…","username":"eastdakota","name":"Matthew Prince 🌥","profile_image_url":"https://pbs.substack.com/profile_images/2332322635/zhx7hflmmcxdaj0tk9f8_normal.jpeg","date":"2026-06-03T16:39:56.000Z","photos":[],"quoted_tweet":{},"reply_count":380,"retweet_count":2109,"like_count":8167,"impression_count":2146859,"expanded_url":null,"video_url":null,"video_preview_media_key":null,"belowTheFold":true}" data-component-name="Twitter2ToDOM">

AI:AM — Self-Improving Tax Agents and Catholic AI · June 2, 2026

Prakash — Mon, 08 Jun 2026 13:03:05 GMT

Today on AI:AM — “Self-Improving Tax Agents and Catholic AI.”

We open on Google’s first equity raise since its IPO — Berkshire taking 12.5% of an $80B round — and what the scramble for capital says about an AI “megacorp” that may be too big to fail, plus the Bernie Sanders national-stake debate and why taxes may beat equity.

Arthur Fernandes Araujo & John de Wasseige (OpenAI) on self-improving tax agents — how a production tax workflow turned every human correction into training signal, took one accountant from 180 hours to 15, and why “the model eats the harness” with each new generation.

A hosts-only research speed-run on whether AI can still be watched — field notes from the Recursive event where monitoring is the number-one safety bet, plus the papers behind persona selection, emergent misalignment (”writing bad code makes you evil”), eval-gaming, and accidental chain-of-thought training.

Matthew Sanders (Longbeard / Magisterium AI) on Catholic AI after the Pope’s encyclical — what it was like at the Vatican, the divergence with Anthropic on machine consciousness, the red line on autonomous weapons, and why the last 5% of alignment is non-negotiable for a faith tradition.

We close on a live test of the cigarette-business refusal example from the OpenAI model spec, the tension between research and business “layers” in deployed models, and the argument that open-source AI may now be unbannable on religious-freedom grounds.

AI:AM — Trust and Recovery in AI (June 1, 2026)

Prakash — Fri, 05 Jun 2026 13:01:39 GMT

Today on AI:AM — “Trust and Recovery in AI.”

We open on why we’re attempting a daily AI show at all, the mission premium, and how the Pope is leading on what it means to be human in the AI age. Then three conversations on trusting AI in production, and a close on where the money and the risk are heading next.

Andy Fernandez (HYCU) on AI cyber resilience — why SaaS sprawl made recovery nearly impossible, and how the backup data you’ve already paid for becomes the enterprise’s “black box flight recorder” for governing AI agents.

David Villalón & Manuel Romero (Maisa) on enterprise digital workers that survive production — why workflows can’t model real knowledge work, how accountable AI owns its outcomes, and what a reproducible, auditable banking deployment looks like in 90 days.

Snehal Antani (Horizon3.ai) on autonomous security validation — how attackers actually operate, why frontier models stay gullible to deception, the 77-second breach, and why the future is AI-vs-AI with humans by exception.

We close on the trade rotating from GPUs to memory to disk, what total logging does to privacy and the rules, and why 1,766 miles of Tesla FSD might already be safer than the steering wheel.