0:00
/
Transcript

AI:AM — AI Security and Real-Time Content Safety · June 3, 2026

EnclaveAI's Tal Hoffman and Yanir Tsarimi on replicating Mythos with a model 100x smaller, and Moonbounce's Brett Levenson on a real-time policy engine that decides before the harm ships.

Today on AI:AM — “AI Security and Real-Time Content Safety.”

We open on Trump’s AI executive order — the polite 30-day model-review ask, classified benchmarks, and the state-vs-federal scramble where JB Pritzker has become the leading anti-AI candidate. Plus why the frontier labs seem calmer about regulation, and why the EO might actually trigger a security-review slowdown.

Tal Hoffman & Yanir Tsarimi (EnclaveAI) on finding the bugs that actually matter — how they reproduced an Anthropic Mythos-class finding with a model ~100x smaller, why proven exploitability is the real bottleneck, how AI-generated bug reports broke the bounty system, and why cheaper models plus the right harness can beat frontier models on security.

Brett Levenson (Moonbounce) on real-time content safety — lessons from running moderation at Meta scale, how a policy engine decomposes fuzzy rules like “hate speech” into atomic questions a hundred people would answer the same way, why prevention beats post-hoc moderation, and how payment providers quietly became the real legislators.

We close on the hardest open question — how low-level verified parts aggregate into trustworthy high-level behavior — plus the schlep and heuristics that end every AI vertical, freedom of speech versus freedom of distribution, and why “nobody got fired for buying Mythos” may drive enterprise security budgets.

Discussion about this video

User's avatar

Ready for more?