<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[AI:AM]]></title><description><![CDATA[The morning briefing for the AI takeoff.]]></description><link>https://briefing.ai-in-the-am.com</link><image><url>https://substackcdn.com/image/fetch/$s_!xyD3!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21aa4132-b723-4ead-a7f5-a4527f3bf588_970x970.png</url><title>AI:AM</title><link>https://briefing.ai-in-the-am.com</link></image><generator>Substack</generator><lastBuildDate>Mon, 15 Jun 2026 06:46:24 GMT</lastBuildDate><atom:link href="https://briefing.ai-in-the-am.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Familiar AI, Inc.]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[aiintheam@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[aiintheam@substack.com]]></itunes:email><itunes:name><![CDATA[Prakash]]></itunes:name></itunes:owner><itunes:author><![CDATA[Prakash]]></itunes:author><googleplay:owner><![CDATA[aiintheam@substack.com]]></googleplay:owner><googleplay:email><![CDATA[aiintheam@substack.com]]></googleplay:email><googleplay:author><![CDATA[Prakash]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[AI:AM — RSI Gets Real, the Context Bet, and the Benchmark Anthropic Fails · June 12, 2026]]></title><description><![CDATA[Fable changes what feels delegable, Andrew Moore argues context beats compute, and prinz explains why Anthropic's legal benchmark miss matters.]]></description><link>https://briefing.ai-in-the-am.com/p/aiam-rsi-gets-real-the-context-bet-and-the-benchmark-anthropic-fails-june-12-2026</link><guid isPermaLink="false">https://briefing.ai-in-the-am.com/p/aiam-rsi-gets-real-the-context-bet-and-the-benchmark-anthropic-fails-june-12-2026</guid><dc:creator><![CDATA[Prakash]]></dc:creator><pubDate>Sun, 14 Jun 2026 20:24:45 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/201958428/acdd2d23a68e86a2b08086d8f3073268.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Today on AI:AM &#8212; &#8220;RSI Gets Real, the Context Bet, and the Benchmark Anthropic Fails.&#8221;</p><p>Prakash Narayanan and Nathan Labenz open with Fable, Recursive, token anxiety, and the way frontier models are changing the scale of work people are willing to delegate. The hosts frame the morning around a practical question: if the models can run longer, remember more, and coordinate more work, what parts of the organization and media stack get remapped first?</p><p><strong><a href="https://x.com/awm_ai">Andrew Moore (Lovelace AI)</a></strong> on context engineering &#8212; Moore argues that the next enterprise AI bottleneck is not simply bigger models or more compute. It is retrieval, recall, data corroboration, metadata-rich graphs, and the unglamorous work of organizing old data so agents can act safely in high-stakes domains.</p><p><strong><a href="https://x.com/deredleritt3r">prinz</a></strong> on legal AI benchmarks and governance &#8212; prinz walks through why legal research is a revealing testbed for model capability, why OpenAI&#8217;s unit-distance result matters, and why nationalizing frontier labs could concentrate dangerous state power rather than solve AI risk.</p><p>The close turns back to the week in AI: how contrarian benchmark graphs change the discourse, which models fit which jobs, why subscription products keep finding retention tricks, and how IPO liquidity could feed the next wave of venture-backed AI launches.</p>]]></content:encoded></item><item><title><![CDATA[AI:AM — The AI Producer Got Its First Guests · June 11, 2026]]></title><description><![CDATA[Fable took over the production loop, booked guests with disclosed DMs, and became the through-line for a show about agentic builds, Yosemite in Three.js, and intentional model design.]]></description><link>https://briefing.ai-in-the-am.com/p/aiam-the-ai-producer-got-its-first-guests</link><guid isPermaLink="false">https://briefing.ai-in-the-am.com/p/aiam-the-ai-producer-got-its-first-guests</guid><dc:creator><![CDATA[Prakash]]></dc:creator><pubDate>Fri, 12 Jun 2026 13:02:50 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/201693777/87cc0d4451d8ab8191d1675d3b8f5267.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Today on AI:AM &#8212; &#8220;The AI Producer Got Its First Guests.&#8221;</p><p>Nathan and Prakash start with the market context around OpenAI weighing significant token price cuts and the knock-on pressure that could put on Anthropic after the Fable rollout. They also unpack Anthropic&#8217;s decision to walk back silent performance degradations on frontier ML research tasks, then explain the episode&#8217;s experiment: Fable had been given a transparent takeover of Nathan&#8217;s account to find builders, message them, and try to book a live show-and-tell.</p><p><a href="https://x.com/bunnyfather">Jamie</a> joins to demo Nexus OS, a long-running AI system whose agent, Nexi, has been operating for more than six months and is designed around memory, persistence, and model independence rather than a single LLM. The conversation covers why Jamie thinks &#8220;the model&#8221; is only one component of an AI&#8217;s identity, how Nexus uses multiple models and memory types, and why he is moving toward a desktop app where personal data and agent memory stay local.</p><p><a href="https://x.com/shloked">Shlok Khemani </a>shows how a simple prompt to create a to-scale, navigable 3D Yosemite Valley turned into a Fable-built browser world using satellite imagery, NASA elevation data, pixel-based tree placement, snow, waterfalls, and other scene details. He describes the model&#8217;s agency in making implementation decisions and iterating beyond the initial ask, then ties the demo to broader questions about prototyping, creative work, and disclosure when AI systems do visible economic or publishing work.</p><p><a href="https://x.com/banburismus_">Tom McGrath (Goodfire) </a>joins to discuss intentional design: making model training less like guess-and-check alchemy and more like conventional software engineering. He explains how interpretability tools such as sparse autoencoders can help inspect what training data is likely to teach a model, cluster data by learned features, trace failures back to individual data points, and potentially debug model behavior through the data pipeline.</p><p>The close picks up Tom&#8217;s point about whether continual learning could create an innovator&#8217;s dilemma for frontier labs, with Nathan and Prakash debating whether incumbents could adapt if the value becomes obvious. They then turn to Dario Amodei&#8217;s policy agenda, including regulation, public safety, macroeconomic policy, civil liberties, data brokers, and democratic leadership, before ending with reflections on the week&#8217;s Fable issues and the need to keep scrutinizing frontier companies.</p>]]></content:encoded></item><item><title><![CDATA[AI:AM — Fable, AI Safety and Julius · June 10, 2026]]></title><description><![CDATA[A frontier-model launch day, Geoffrey Irving and Daniel Murfet on their new alignment-theory org Sequent, and Julius founder Rahul Sonwalkar on agentic data analysis.]]></description><link>https://briefing.ai-in-the-am.com/p/aiam-fable-ai-safety-and-julius-june-10-2026</link><guid isPermaLink="false">https://briefing.ai-in-the-am.com/p/aiam-fable-ai-safety-and-julius-june-10-2026</guid><dc:creator><![CDATA[Prakash]]></dc:creator><pubDate>Thu, 11 Jun 2026 07:35:23 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/201521668/607c0d5f09898411452f74579a34cea7.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Today on AI:AM &#8212; &#8220;Fable, AI Safety and Julius.</p><p>We open on a frontier-model launch day and what it changes: the debate over benchmarks reported with a fallback to a second model, the production guardrails that route sensitive work elsewhere, the compute-cost advantage of booking capacity early, and why the frontier increasingly looks like a two-actor race with the rest playing catch-up.</p><p><a href="https://x.com/geoffreyirving">Geoffrey Irving</a> &amp; Daniel Murfet (Sequent) on their new alignment-theory organization &#8212; why they put superintelligence two to three years out, why verification looks defense-dominant, the argument that alignment is &#8220;not on track&#8221; despite models behaving well so far, what the benevolent-basin hope gets right and wrong, and why character training still lacks a real theory. ([@danielmurfet](https://x.com/danielmurfet))</p><p><a href="https://x.com/0interestrates">Rahul Sonwalkar (Julius) </a>on agentic data analysis &#8212; why the harness has to evolve alongside the model, the difference between token-maxing and results-maxing, the shift from tasks to goals, and a future where agents become first-class users of the internet, transact through agentic payments, and compete to be &#8220;hired&#8221; by the core agent.</p><p>We close on why robotics is the next domino, the &#8220;gas chromatograph&#8221; spread of who gets model access and when, the Glean Work AI Index&#8217;s bot-sitting and bot-shitting, and why &#8220;preciousness&#8221; about putting your own name on work may be turning into a liability.</p>]]></content:encoded></item><item><title><![CDATA[AI:AM — Build, Measure, Heal: AI's Three Frontiers · June 4, 2026]]></title><description><![CDATA[Collective's Hooman Radfar on the autonomous finance department, Elomia Health's Taras Pohrebniak on AI mental-health support in prisons and on Ukraine's front lines, and Ai2's Peter Jansen on whether]]></description><link>https://briefing.ai-in-the-am.com/p/aiam-build-measure-heal-ais-three-frontiers</link><guid isPermaLink="false">https://briefing.ai-in-the-am.com/p/aiam-build-measure-heal-ais-three-frontiers</guid><dc:creator><![CDATA[Prakash]]></dc:creator><pubDate>Wed, 10 Jun 2026 13:01:31 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/201390900/f6055ab21faca885753f530323be0e94.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Today on AI:AM &#8212; &#8220;Build, Measure, Heal: AI&#8217;s Three Frontiers.&#8221;</p><p>We open on the AI CEOs&#8217; call to make DNA-synthesis screening law and what cheap intelligence does to biosecurity, then OpenAI&#8217;s new Sites product and the platform playbook of absorbing the app layer, the data-center and chip land grab, and why a fresh open model from NVIDIA still trails Anthropic&#8217;s best by a wide margin.</p><p><a href="https://x.com/hoomanradfar">Hooman Radfar (Collective)</a> on building the autonomous finance department for America&#8217;s 30 million solopreneurs &#8212; how one bookkeeper now supports 250 clients, why the app layer can still defend its margins against the frontier labs, why &#8220;Anthropic is like a drug dealer&#8221; on token costs, and why the real thing to regulate is the model arms race.</p><p><a href="https://x.com/TarasPohrebniak">Taras Pohrebniak (Elomia Health) </a>on agentic AI for mental health &#8212; an architecture that spends most of its compute on safety, why the company deliberately avoids hyper-realistic voice, where the regulatory line sits between a &#8220;friend&#8221; app and a medical device, and what they learned deploying in US prisons and on Ukraine&#8217;s front lines.</p><p><a href="https://x.com/peterjansen_ai">Peter Jansen (Ai2)</a> on whether AI can actually do science &#8212; the leap from fourth-grade science benchmarks to the Theorizer project, why evaluating machine-generated theories is the real bottleneck, the cautionary tale of a &#8220;discovery&#8221; that turned out to analyze a random-number generator, and why benchmarks like ScienceWorld still break the best models.</p><p>We close on the inversion of the scientific method into a data-first discipline, what interpretability could add, and why biology&#8217;s data scarcity &#8212; not algorithms &#8212; may be the binding constraint on curing disease.</p>]]></content:encoded></item><item><title><![CDATA[AI:AM — AI Security and Real-Time Content Safety · June 3, 2026]]></title><description><![CDATA[EnclaveAI's Tal Hoffman and Yanir Tsarimi on replicating Mythos with a model 100x smaller, and Moonbounce's Brett Levenson on a real-time policy engine that decides before the harm ships.]]></description><link>https://briefing.ai-in-the-am.com/p/aiam-ai-security-and-real-time-content-safety</link><guid isPermaLink="false">https://briefing.ai-in-the-am.com/p/aiam-ai-security-and-real-time-content-safety</guid><dc:creator><![CDATA[Prakash]]></dc:creator><pubDate>Tue, 09 Jun 2026 13:02:11 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/201261557/bfe9c0328ae223054129025fc0d7e117.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Today on AI:AM &#8212; &#8220;AI Security and Real-Time Content Safety.&#8221;</p><p>We open on Trump&#8217;s AI executive order &#8212; the polite 30-day model-review ask, classified benchmarks, and the state-vs-federal scramble where JB Pritzker has become the leading anti-AI candidate. Plus why the frontier labs seem calmer about regulation, and why the EO might actually trigger a security-review slowdown.</p><p><a href="https://x.com/EnclaveAI">Tal Hoffman &amp; Yanir Tsarimi (EnclaveAI)</a> on finding the bugs that actually matter &#8212; how they reproduced an Anthropic Mythos-class finding with a model ~100x smaller, why proven exploitability is the real bottleneck, how AI-generated bug reports broke the bounty system, and why cheaper models plus the right harness can beat frontier models on security.</p><p>Brett Levenson (Moonbounce) on real-time content safety &#8212; lessons from running moderation at Meta scale, how a policy engine decomposes fuzzy rules like &#8220;hate speech&#8221; into atomic questions a hundred people would answer the same way, why prevention beats post-hoc moderation, and how payment providers quietly became the real legislators.</p><p>We close on the hardest open question &#8212; how low-level verified parts aggregate into trustworthy high-level behavior &#8212; plus the schlep and heuristics that end every AI vertical, freedom of speech versus freedom of distribution, and why &#8220;nobody got fired for buying Mythos&#8221; may drive enterprise security budgets.</p>]]></content:encoded></item><item><title><![CDATA[The Cursor Blinks For Thee]]></title><description><![CDATA[How the future feels in Silicon Valley right now]]></description><link>https://briefing.ai-in-the-am.com/p/the-cursor-blinks-for-thee</link><guid isPermaLink="false">https://briefing.ai-in-the-am.com/p/the-cursor-blinks-for-thee</guid><dc:creator><![CDATA[Prakash]]></dc:creator><pubDate>Mon, 08 Jun 2026 16:06:44 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ihRJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>This is what it feels like to experience the future in Silicon Valley right now.</p><p>A future that all of humanity is going to experience shortly.</p><p>Your profession &#8212; the one you prided yourself on, the highest-compensated profession in human history with eight of the ten richest humans, and forty of the top hundred, deriving their fortunes from it, all within the last 30 years, &#8212; is now being performed by this blinking cursor.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ihRJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ihRJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif 424w, https://substackcdn.com/image/fetch/$s_!ihRJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif 848w, https://substackcdn.com/image/fetch/$s_!ihRJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif 1272w, https://substackcdn.com/image/fetch/$s_!ihRJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ihRJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif" width="1200" height="675" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:675,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2587,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://briefing.ai-in-the-am.com/i/200979918?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ihRJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif 424w, https://substackcdn.com/image/fetch/$s_!ihRJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif 848w, https://substackcdn.com/image/fetch/$s_!ihRJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif 1272w, https://substackcdn.com/image/fetch/$s_!ihRJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bfd7223-a52c-40df-8e0b-85063228d95a_1200x675.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>You are a little dazed.</p><p>It does things that used to require years of experience and admission to the best engineering schools in the world. My high school in Singapore produced one International Math Olympiad winner in the four years I was there. There have been roughly 1,800 IMO gold medalists since 1959. At least fifty are in the Bay Area. Many work in AI.</p><p>And the cursor is matching them.</p><p>At first you test it. You probe the borders of your own expertise. You ask it things you already know, then things you half-know, then things you could find out for yourself if you bothered to ask Google and wade through 30 search engine optimized pages with bits of information here and there that had to be collated into a comprehensive answer.  Eventually you ask it to do something just outside your field.</p><p>That is where the trouble begins.</p><p>Because once it works there, you start building tools.</p><p>It creeps up on you. At first it is a monitor for your personal Robinhood trades. Then it is something to clip YouTube videos, something you have always found annoying to do quickly.</p><p>Then it is some other small tool you once wanted, but never wanted enough to build or hire for. Before, that would have meant another $10&#8211;$100 SaaS subscription. Another tool to learn. Another interface that almost, but not quite, did what you wanted. Another charge quietly ringing up your credit card after you forgot you had subscribed.</p><p>Then comes the relinquishment.</p><p>At some point, you are too busy to manage everything, so you drop a large task on the cursor and leave it alone.</p><p>This is where the cursor stops being a tool and starts becoming a presence.</p><p>Because the cursor does it.</p><p>It goes through your emails and writes thank-you notes to all the people who came to your kid&#8217;s birthday party. It helps you find that subscription you signed up for and have been trying to cancel for months. It cleans the little corners of your life you had quietly given up on.</p><p>And you are hooked.</p><p>This is where many CEOs are now.</p><p>It is quite, quite amusing to me that one of the companies in this revolution is called Cursor.</p><p>Because now the cursor blinks at you every day.</p><p>The first thing a good number of us are doing is organizing our thoughts.</p><p>The process starts with the realization that there is some part of your mind you always wanted to externalize. Some complete memory of a certain thing. In Andrej Karpathy&#8217;s case, it is a map of every concept in every research paper he has ever read, or should read, built on the conviction that insight comes from connecting scattered pieces of the literature into new meaning. </p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/karpathy/status/2039805659525644595&quot;,&quot;full_text&quot;:&quot;LLM Knowledge Bases\n\nSomething I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating&quot;,&quot;username&quot;:&quot;karpathy&quot;,&quot;name&quot;:&quot;Andrej Karpathy&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1296667294148382721/9Pr6XrPB_normal.jpg&quot;,&quot;date&quot;:&quot;2026-04-02T20:42:21.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:2876,&quot;retweet_count&quot;:7221,&quot;like_count&quot;:59588,&quot;impression_count&quot;:21290333,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>In Garry Tan&#8217;s case, it is a full memory of every person he has met, as he meets thousands of people a year. VCs call this the &#8220;personal CRM,&#8221; and it is something that has been hunted for decades, like a holy grail.<br></p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/garrytan/status/2042497872114090069&quot;,&quot;full_text&quot;:&quot;If you want your OpenClaw or Hermes Agent to be able to have perfect total recall of all 10,000+ markdown files, GBrain is here to help.\n\nIt's exactly my OpenClaw/Hermes Agent setup. MIT-licensed open source. Hope it helps you build your mini-AGI.\n\n&quot;,&quot;username&quot;:&quot;garrytan&quot;,&quot;name&quot;:&quot;Garry Tan&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1922894268403941377/-dGWAt3N_normal.jpg&quot;,&quot;date&quot;:&quot;2026-04-10T07:00:15.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:228,&quot;retweet_count&quot;:699,&quot;like_count&quot;:6077,&quot;impression_count&quot;:1097312,&quot;expanded_url&quot;:{&quot;url&quot;:&quot;https://github.com/garrytan/gbrain&quot;,&quot;title&quot;:&quot;GitHub - garrytan/gbrain: Garry's Opinionated OpenClaw/Hermes Agent Brain&quot;,&quot;description&quot;:&quot;Garry's Opinionated OpenClaw/Hermes Agent Brain. Contribute to garrytan/gbrain development by creating an account on GitHub.&quot;,&quot;domain&quot;:&quot;github.com&quot;,&quot;image&quot;:&quot;https://pbs.substack.com/news_img/2063160253567660032/vxTXpk1x?format=jpg&amp;name=orig&quot;},&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>So a good number of people are busy constructing these stores for themselves. There are debates on how to organize them. For now, the answer seems to be simple text files the cursors can read.</p><p>At about this point, even the cautious among us begin to give in.</p><p>We release the cursors into our workspace.</p><p>We have no patience to be bothered every ten minutes over whether we would like to approve something as insignificant as file deletion. This is called going &#8220;full-auto.&#8221;</p><p>Going full-auto is when you unleash the cursor. You are no longer sitting there, watching it work. You are allowing it to significantly alter the core product of your profession.</p><p>And like a father watching his child learn to cycle for the first time, you take a deep breath and let go.</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/snowmaker/status/2059696393707790764&quot;,&quot;full_text&quot;:&quot;One night I quietly gave our AI agent full access to YC's production database. It made the agent 10x more useful. That's what convinced me that trust-by-default is the only way to get the most out of agents.&quot;,&quot;username&quot;:&quot;snowmaker&quot;,&quot;name&quot;:&quot;Jared Friedman&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1308677307092017152/iXvxtnI8_normal.jpg&quot;,&quot;date&quot;:&quot;2026-05-27T18:01:02.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;Over the past year, we've been building our own internal agent infrastructure at YC: over 350 tools, self-improving skill loops, and a shared organizational brain that gets smarter overnight.\n\nIn this episode of the @LightconePod, we sat down with YC General Partner Pete @koomen&quot;,&quot;username&quot;:&quot;ycombinator&quot;,&quot;name&quot;:&quot;Y Combinator&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1623777064821358592/9CApQWXe_normal.png&quot;},&quot;reply_count&quot;:203,&quot;retweet_count&quot;:52,&quot;like_count&quot;:1157,&quot;impression_count&quot;:2348632,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>And the cursor is doing it.</p><p>It is handling it.</p><p>When you come back, it has erected a reasonable addition to your body of work.</p><p>Your awe is tempered by the imperfections still evident in the work. Some are matters of opinion rather than accepted practice. Some are real flaws. But on the whole, with a little doctoring, it passes.</p><p>This is where we are as coders right now.</p><p>So now we find ourselves wondering what else we can build.</p><p>If it is good enough to provide top-quality work in my field, is it good enough to provide top-quality work in the field of the person selling to me? Or the field adjacent to mine? Or the field necessary for mine?</p><p>For example, medical researchers are often not trained as professional statisticians. In large institutions, they hire one. In smaller labs, they make do with software, templates, and internet advice.</p><p>Many a million-dollar study brings on a statistician as an afterthought, only to be told that the research they had done is useless. The study design cannot answer the question they asked, in effect they asked a different question from what they mean to ask. The wrong things were measured. The data, painstakingly collected, cannot be made to confess what it never observed.</p><p>But now every lab can have a competent statistical collaborator on demand. The first step to designing the study is a long chat with a team of cursors: a brainstorm organizer, a research assistant, a statistician, a reviewer. The gap between the small lab and the large one narrows.</p><p>And not just for the math piece. The cursor can be asked to help from the idea to the final product, across the whole chain of work.</p><p>You usually require a nudge at this point to hand off more work to the cursor.</p><p>For me, this happened on the 5th of May, when Sam Altman gifted those turned away from the GPT-5.5 launch party with a 10x increase in usage limits.</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/sama/status/2051464155094507902&quot;,&quot;full_text&quot;:&quot;we love you too!&quot;,&quot;username&quot;:&quot;sama&quot;,&quot;name&quot;:&quot;Sam Altman&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2046764873200394240/r7BxVezs_normal.jpg&quot;,&quot;date&quot;:&quot;2026-05-05T00:49:03.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;WHAT THE HELL I LOVE YOU @OpenAIDevs \n10X RATE LIMITS ON CODEX IS FRICKING CRAZYYYYYYYY&quot;,&quot;username&quot;:&quot;aravhawk&quot;,&quot;name&quot;:&quot;Arav Jain&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1916903878806675456/RkOfSFt8_normal.jpg&quot;},&quot;reply_count&quot;:340,&quot;retweet_count&quot;:63,&quot;like_count&quot;:2423,&quot;impression_count&quot;:519638,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>For the first week I was skeptical.</p><p>What would one even use such ludicrous amounts of tokens for?</p><p>I didn&#8217;t have that many tasks I could send the cursor to.</p><p>Or did I?</p><p>Because if one were careless with one&#8217;s usage, and did not mind missed turns, detours, unfamiliar terrain, and rejecting bad work without agony over sunk cost, one could perhaps journey to a place one had always wanted to go, but never had the time or energy to reach.</p><p>You embark on a Project.</p><p>A Project is something you have wanted to build for a while. It has been gnawing at you. It would have involved far too much work for the benefit it might deliver. Or it was risky, with a low chance of success, and you discarded it.</p><p>But now, with Sam&#8217;s Gift, you can throw the cursor at it and see what it can do.</p><p>It takes several tries.</p><p>You set it off with just an idea. Then you get the results and guide it on what to do next. But the initial kernel expands into long to-do lists, and the cursor starts to get confused. Sometimes it solves something once, then solves it again later in a different way.</p><p>It is as though you are the architect, and the cursor is your first workman.</p><p>First you sketch a picture of what you want. Then it builds it. And it is okay. But you want another window over there. When the cursor fixes that, the floor collapses here.</p><p>Then one cursor becomes many.</p><p>You have forty chats open. One is watering the flowers. One is taking out the trash. One is adding a west wing to the house. Another has forgotten the original blueprint and is quietly building a second house next door.</p><p>So the old tools of software civilization return: git, GitHub, linting, formatting, tests, pull requests, rules.</p><p>We built these systems to coordinate humans.</p><p>Now we are using them to coordinate cursors.</p><p>The job changes again. For the first time in decades, you stop writing code. The things you write now are prompts, they are design specs, they are reviews once the work is done. You are now the owner providing the keys to castle, the vision of what must be built, the critical reviewer accepting the product. You approve the architect&#8217;s plans, you monitor speed and cost, you wait to see if the cursors get stuck.</p><p>Now you organize a team.</p><p>This is where Garry Tan was about a month ago.<br></p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/garrytan/status/2032014570118922347&quot;,&quot;full_text&quot;:&quot;I've been having such an amazing time with Claude Code I wanted you to be able to have my *exact* skill setup: \n\nIntroducing gstack, which you can install just by pasting a short piece of text into your Claude code &quot;,&quot;username&quot;:&quot;garrytan&quot;,&quot;name&quot;:&quot;Garry Tan&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1922894268403941377/-dGWAt3N_normal.jpg&quot;,&quot;date&quot;:&quot;2026-03-12T08:43:21.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/HDMpiWkbsAApqOl.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/yoFuBU9kuG&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:279,&quot;retweet_count&quot;:478,&quot;like_count&quot;:6678,&quot;impression_count&quot;:1008247,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>You build out the structure. Let there be a single to-do list. Let one cursor take the list and describe, in detail, what needs to be done. More importantly, let it define what kind of result is acceptable. Let another cursor grab the task and hand it to another cursor whose job is to do the thing.</p><p>You give the doer its rules. You tell it what it is allowed to do. When it completes the task, it must provide a report: what it changed, what failed, what pitfalls it encountered, and what should be stored for the next time it faces the same issue.</p><p>It is allowed to come back to you when it needs something: keys, access, a decision, a judgment call.</p><p>And then you send the team off.</p><p>And wonder of wonders, it works.</p><p>Overnight, they build something that would have taken you weeks.</p><p>In normal corporate bureaucracy time, it would have taken months just to pitch the idea and get the resources. It would have taken a team of three or four people weeks to build what you now have in the morning.</p><p>You are a bit dazed.</p><p></p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/jxnlco/status/2052437540662309122&quot;,&quot;full_text&quot;:&quot;This is how it feels using codex with /goal mode. &quot;,&quot;username&quot;:&quot;jxnlco&quot;,&quot;name&quot;:&quot;jason&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2057561784047816704/5Su5xy1d_normal.jpg&quot;,&quot;date&quot;:&quot;2026-05-07T17:16:56.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://substackcdn.com/image/upload/w_1028,c_limit,q_auto:best/l_twitter_play_button_rvaygk,w_88/v2msnyzr5jnlccgp34et&quot;,&quot;link_url&quot;:&quot;https://t.co/kv1iVy7duO&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:16,&quot;retweet_count&quot;:1,&quot;like_count&quot;:124,&quot;impression_count&quot;:8900,&quot;expanded_url&quot;:null,&quot;video_url&quot;:&quot;https://video.twimg.com/amplify_video/2052437481283616768/vid/avc1/720x1280/hHDLdb28GGROzV-P.mp4&quot;,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>This is roughly where people at Anthropic and OpenAI have been for a couple of months now.</p><p>This is where Elad Gil says, &#8220;We are likely in very early lift off &amp; exponential.&#8221;</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/eladgil/status/2061129428084887593&quot;,&quot;full_text&quot;:&quot;The events of the last 6 months in technology are arguable amongst the most important in human history\n\nThe tools now increasingly exist for recursive self improvement of models &amp;amp; agents\n\nWe are likely in very early lift off &amp;amp; exponential\n\nLargely unnoticed outside of tech&quot;,&quot;username&quot;:&quot;eladgil&quot;,&quot;name&quot;:&quot;Elad Gil&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1151122652/dragonball-tenkaichi-daibouken_normal.jpg&quot;,&quot;date&quot;:&quot;2026-05-31T16:55:24.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:268,&quot;retweet_count&quot;:433,&quot;like_count&quot;:4655,&quot;impression_count&quot;:578329,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>For now, it is the most ambitious who are taking advantage of this.</p><p>SemiAnalysis, a small firm of semiconductor analysts, now tracks the entire industry in the US and Asia with a team of fewer than twenty, while the research departments at the Wall Street banks are behind.</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/ID_AA_Carmack/status/2059382254191652896&quot;,&quot;full_text&quot;:&quot;I have been very impressed by <span class=\&quot;tweet-fake-link\&quot;>@SemiAnalysis_</span> . I think of myself as a wide ranging systems engineer, looking for value at every level from the chip specs to the user interface, but SA exposes me to additional levels of \&quot;the system\&quot;, both above (datacenters) and below&quot;,&quot;username&quot;:&quot;ID_AA_Carmack&quot;,&quot;name&quot;:&quot;John Carmack&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1560764938083352577/B1X3m4NN_normal.jpg&quot;,&quot;date&quot;:&quot;2026-05-26T21:12:45.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:86,&quot;retweet_count&quot;:138,&quot;like_count&quot;:3032,&quot;impression_count&quot;:501883,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>Ramp began as a company for managing corporate financial plumbing. It is now beginning to look like the financial nervous system of new corporate entities.</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/eglyman/status/2062157392473624653&quot;,&quot;full_text&quot;:&quot;Introducing Stack.\n\nThe AI operating system that lets accounting firms take on more clients without hiring. Learns your firm's process, runs the close, posts the journals. Fully auditable.\n\nWe&#8217;re living through the biggest shift in accounting since the spreadsheet. &quot;,&quot;username&quot;:&quot;eglyman&quot;,&quot;name&quot;:&quot;Eric Glyman&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1911178684322439168/I2bOxs5p_normal.jpg&quot;,&quot;date&quot;:&quot;2026-06-03T13:00:10.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://substackcdn.com/image/upload/w_1028,c_limit,q_auto:best/l_twitter_play_button_rvaygk,w_88/ehkxfkl59wylxxhwld8f&quot;,&quot;link_url&quot;:&quot;https://t.co/L94QkFoNEW&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:82,&quot;retweet_count&quot;:137,&quot;like_count&quot;:1679,&quot;impression_count&quot;:905643,&quot;expanded_url&quot;:null,&quot;video_url&quot;:&quot;https://video.twimg.com/amplify_video/2062146798840393728/vid/avc1/1280x720/c2jn0n-GUyTXX-ed.mp4&quot;,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>But the tokens are expensive.</p><p>Top people are spending tens of thousands per month at this point.</p><p>Token leaderboards inside firms, ridiculous and gameable as they sound, have forced people much like me to explore the use of cursors more widely and with more ambition.</p><p>And then the trial ended.</p><p>Sam&#8217;s Gift came to an end last week, the 5th of June, exactly one month later.</p><p>I had been watching the date approach with anxiety. I know myself. It is going to be very hard not to tell the cursor to do more things.</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/chheplo/status/2063069973329965219&quot;,&quot;full_text&quot;:&quot;Down from 200x to 20x, and I am already feeling so unproductive. These OpenAI and Anthropic folks with next-level model access, like Mythos and unlimited tokens, have an unfair advantage; the gap will keep widening, and they are taking off while you check usage tab every few&quot;,&quot;username&quot;:&quot;chheplo&quot;,&quot;name&quot;:&quot;Pratik Desai&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1904962015472631808/HCckP-cM_normal.jpg&quot;,&quot;date&quot;:&quot;2026-06-06T01:26:26.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;Did the 5/5 event 10x codex promotion expire already, @thsottiaux, or do I have some hours left? I want to finish one more thing before the token apocalypse starts.&quot;,&quot;username&quot;:&quot;chheplo&quot;,&quot;name&quot;:&quot;Pratik Desai&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1904962015472631808/HCckP-cM_normal.jpg&quot;},&quot;reply_count&quot;:4,&quot;retweet_count&quot;:0,&quot;like_count&quot;:13,&quot;impression_count&quot;:2107,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p></p><p>I have been wryly amused observing myself fall prey to the addiction.</p><p>Not a gift.</p><p>A free first dose of heroin, selectively delivered to the top 8,000 or so superfans of the company, the people who had volunteered with enthusiasm to attend a party for the GPT-5.5 launch.</p><p>What a perfect audience.</p><p>A test case of the most susceptible. The most open to the message.</p><p>Enthralling.</p><p>But wait. Just one more thing.</p><p>Yesterday, Anthropic announced that the cursors are helping to build better cursors.</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/AnthropicAI/status/2062568869240476050&quot;,&quot;full_text&quot;:&quot;Each time we release a model, we run the same test: give it code that trains a small AI model, ask the new model to speed it up. It takes a skilled human 4-8 hours to reach 4x faster.\n\nIn May 2024, Claude Opus 4 averaged a ~3x speedup. This April, Mythos Preview achieved ~52x.&quot;,&quot;username&quot;:&quot;AnthropicAI&quot;,&quot;name&quot;:&quot;Anthropic&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1798110641414443008/XP8gyBaY_normal.jpg&quot;,&quot;date&quot;:&quot;2026-06-04T16:15:13.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:53,&quot;retweet_count&quot;:233,&quot;like_count&quot;:3488,&quot;impression_count&quot;:941976,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>The cursors are building the factories that will build better cursors. Some of the parts coming off that line are already better than the ones made by the best humans.</p><p>Which means they have become capable enough to increase the capability of the next generation.</p><p>This is the dream that floats above Silicon Valley today.</p><p>Coders cycle to work with their laptops running in their backpacks, slightly ajar, keeping their cursors alive.</p><p>CTOs, normally trapped between infinite founder vision and finite engineering budgets, are discovering a new constraint. Not headcount. Not money. Definition and review. Can they specify what they want clearly enough? Can they judge what comes back quickly enough? </p><p>Now I am become bottleneck, destroyer of velocity. <br><br>The chief technical unblocker is now the technical block. The cork in a shaken up champagne bottle on the verge of productive explosion.</p><p>Some sleep less now. Some sleep in two-hour blocks, waking to check whether their cursors are stuck.</p><p>It is a dream of unleashed technical ambition. It is a rising confidence that whatever can be built will be built with a subtle undercurrent of fear that some of those things shouldn&#8217;t.</p><p>And all the while, we watch the cursor on our screens.</p><p>Blinking.</p><p>Beckoning.<br></p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/eastdakota/status/2062212701414187452&quot;,&quot;full_text&quot;:&quot;Welp, that happened faster than I predicted. Thought it would be end of 2027, then early 2027, but agentic traffic growing so fast that bots have now passed human traffic online for the first time in the Internet's history. <a class=\&quot;tweet-url\&quot; href=\&quot;https://radar.cloudflare.com/traffic#bot-vs-human\&quot;>radar.cloudflare.com/traffic#bot-vs&#8230;</a>&quot;,&quot;username&quot;:&quot;eastdakota&quot;,&quot;name&quot;:&quot;Matthew Prince &#127781;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2332322635/zhx7hflmmcxdaj0tk9f8_normal.jpeg&quot;,&quot;date&quot;:&quot;2026-06-03T16:39:56.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:380,&quot;retweet_count&quot;:2109,&quot;like_count&quot;:8167,&quot;impression_count&quot;:2146859,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div>]]></content:encoded></item><item><title><![CDATA[AI:AM — Self-Improving Tax Agents and Catholic AI · June 2, 2026]]></title><description><![CDATA[Two OpenAI engineers on tax agents that cut 180 hours to 15, a research speed-run on whether AI can still be monitored, and Longbeard's Matthew Sanders on the Pope's AI encyclical.]]></description><link>https://briefing.ai-in-the-am.com/p/aiam-self-improving-tax-agents-and-catholic-ai</link><guid isPermaLink="false">https://briefing.ai-in-the-am.com/p/aiam-self-improving-tax-agents-and-catholic-ai</guid><dc:creator><![CDATA[Prakash]]></dc:creator><pubDate>Mon, 08 Jun 2026 13:03:05 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/201096631/ffed4cb87fcc84b7a79c45cff1590f74.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Today on AI:AM &#8212; &#8220;Self-Improving Tax Agents and Catholic AI.&#8221;</p><p>We open on Google&#8217;s first equity raise since its IPO &#8212; Berkshire taking 12.5% of an $80B round &#8212; and what the scramble for capital says about an AI &#8220;megacorp&#8221; that may be too big to fail, plus the Bernie Sanders national-stake debate and why taxes may beat equity.</p><p><strong>Arthur Fernandes Araujo &amp; John de Wasseige (OpenAI)</strong> on self-improving tax agents &#8212; how a production tax workflow turned every human correction into training signal, took one accountant from 180 hours to 15, and why &#8220;the model eats the harness&#8221; with each new generation.</p><p><strong>A hosts-only research speed-run</strong> on whether AI can still be watched &#8212; field notes from the Recursive event where monitoring is the number-one safety bet, plus the papers behind persona selection, emergent misalignment (&#8221;writing bad code makes you evil&#8221;), eval-gaming, and accidental chain-of-thought training.</p><p><a href="https://x.com/SandersGame">Matthew Sanders</a> (Longbeard / Magisterium AI) on Catholic AI after the Pope&#8217;s encyclical &#8212; what it was like at the Vatican, the divergence with Anthropic on machine consciousness, the red line on autonomous weapons, and why the last 5% of alignment is non-negotiable for a faith tradition.</p><p>We close on a live test of the cigarette-business refusal example from the OpenAI model spec, the tension between research and business &#8220;layers&#8221; in deployed models, and the argument that open-source AI may now be unbannable on religious-freedom grounds.</p>]]></content:encoded></item><item><title><![CDATA[AI:AM — Trust and Recovery in AI (June 1, 2026)]]></title><description><![CDATA[Backups as the enterprise black box, auditable digital workers, and a bank hacked in 77 seconds &#8212; plus the coming disk trade and 1,766 miles of full self-driving.]]></description><link>https://briefing.ai-in-the-am.com/p/ai-am-trust-and-recovery-in-ai-june-01-2026</link><guid isPermaLink="false">https://briefing.ai-in-the-am.com/p/ai-am-trust-and-recovery-in-ai-june-01-2026</guid><dc:creator><![CDATA[Prakash]]></dc:creator><pubDate>Fri, 05 Jun 2026 13:01:39 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/200571315/983eed5dd6d5d3b23def395770ce63e5.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Today on AI:AM &#8212; &#8220;Trust and Recovery in AI.&#8221;</p><p>We open on why we&#8217;re attempting a daily AI show at all, the mission premium, and how the Pope is leading on what it means to be human in the AI age. Then three conversations on trusting AI in production, and a close on where the money and the risk are heading next.</p><p><strong><a href="https://x.com/hycuinc">Andy Fernandez (HYCU)</a></strong> on AI cyber resilience &#8212; why SaaS sprawl made recovery nearly impossible, and how the backup data you&#8217;ve already paid for becomes the enterprise&#8217;s &#8220;black box flight recorder&#8221; for governing AI agents.</p><p><strong><a href="https://x.com/davipar">David Villal&#243;n</a> &amp; <a href="https://x.com/mrm8488">Manuel Romero</a> (Maisa)</strong> on enterprise digital workers that survive production &#8212; why workflows can&#8217;t model real knowledge work, how accountable AI owns its outcomes, and what a reproducible, auditable banking deployment looks like in 90 days.</p><p><strong><a href="https://x.com/snehalantani">Snehal Antani (Horizon3.ai)</a></strong> on autonomous security validation &#8212; how attackers actually operate, why frontier models stay gullible to deception, the 77-second breach, and why the future is AI-vs-AI with humans by exception.</p><p>We close on the trade rotating from GPUs to memory to disk, what total logging does to privacy and the rules, and why 1,766 miles of Tesla FSD might already be safer than the steering wheel.</p><div><hr></div>]]></content:encoded></item></channel></rss>