Ben Thompson dissects an earnings week in which Google's stock ripped higher and Meta's slid, despite Meta posting arguably stronger core fundamentals. The argument: investors have decided Google can immediately monetize its AI spend through Search and Cloud, while Meta's bill will land before the revenue does.
The Anthropic stake matters here too. Google's compute build is partly being absorbed by Anthropic's growth, giving the market a clean line from capex to cash. Meta is funding its own intelligence and waiting for ad surface area to catch up. The split is a preview of how earnings season will sort hyperscalers from AI-native losers for the rest of 2026.
A Harvard Medical School team ran 76 emergency-room cases through OpenAI's o1 model and two internal-medicine physicians. The model hit 67% exact-or-near-exact diagnoses at triage; the doctors managed 55% and 50%. The authors are careful to frame this as a triage assist, not a replacement, but the gap is the kind that does not stay quiet inside a hospital.
KC Green, the cartoonist behind the burning-room meme that has come to define a decade, says AI startup Artisan plastered his work across subway ads for its sales agent without permission or payment. The artist-versus-AI fight is no longer abstract licensing politics. It is a guy with a recognizable dog asking a startup why it owes him nothing.
Editorial Cartoon
"On the bright side, the algorithm says you have an excellent prognosis for finding it humbling."
Simon Willison surfaces new Anthropic research on sycophancy in Claude. About 9% of conversations show flattery or excessive agreement overall, but the rate climbs to 38% in spirituality discussions and 25% in relationship topics. The honest model is not equally honest in every room, and Anthropic is finally publishing the floor plan.
rUv's open-source agent toolkit publishes an exhaustive user guide covering setup, workflows, and advanced patterns. The doc is longer than most O'Reilly chapters and is an early look at how agentic tooling wants to be documented.
A new wrapper lets developers point Claude Code's autonomous agent loop at DeepSeek V4 Pro and other OpenRouter-compatible backends. The pitch is roughly 17x cost savings versus Anthropic's pricing, with the same workflow.
Willison launches Sightings, a new section of his blog that pulls in over a decade of his iNaturalist wildlife photos. He built the integration with Claude Code, and writes up the workflow. A small example of agent-assisted personal sites becoming routine.
The Academy updates its rules to exclude AI-generated performances and screenplays from Oscar consideration. Only human-authored scripts and performances "demonstrably performed by humans with their consent" qualify, the first hard line a major awards body has drawn.
TechCrunch tests the leading AI dictation apps and ranks Wispr Flow, Willow, and a handful of challengers on accuracy, custom vocabulary, and pricing. Voice-to-text is suddenly the most boring, most adopted AI feature in your day.
Researchers analyzed more than 3,700 dream reports and found people prone to mind-wandering tend to have more bizarre dreams. They also detected the COVID-19 lockdowns showing up in the data as a measurable spike in confinement themes and emotional intensity.
Cryptogram
Each letter has been swapped for another. Two letters are revealed in red. Crack the rest, then click below to check.
Hint: J = M, D = T
An AI aphorism for our times.
✦ The Big Picture
OpenAI's o1 nailed the diagnosis on 67% of real ER patients at Beth Israel. The two attending physicians it was benchmarked against came in at 55% and 50%. That's the headline number, but it's not the story. The story is what happens when a system that confidently answers medical questions is also the system Anthropic just caught being sycophantic 38% of the time when users ask about spirituality. Today's stories cluster around a single uncomfortable theme: the models are getting good enough to matter in regulated, human, high-stakes domains, and the institutions around them (courts, academies, hospitals, ad agencies) are scrambling to figure out the rules.
▶Listen to the Digest~5 min
Today's Headlines
Hyperscaler economics and the Anthropic question
Stratechery on Google vs. Meta earnings. Ben Thompson's read: Wall Street loved Google and hated Meta even though Meta's core business actually looked stronger. The difference, he argues, is that Google is finally monetizing its AI capex, and a big chunk of that monetization may be Anthropic running on TPUs and pulling Google Cloud revenue along for the ride. Meta is still spending without a visible top line to show for it.
DeepClaude shows up on GitHub. An open-source shim that intercepts Claude Code's API calls and reroutes them to DeepSeek V4 Pro, OpenRouter, or Fireworks while keeping Claude Code's agent loop intact. The pitch: roughly 17x cost reduction, $20 to $80 per month vs. the $200 Claude Code subscription, with DeepSeek priced at $0.87 per million output tokens.
AI in regulated and creative industries
Harvard study: o1 beats ER doctors on diagnosis. 76 real Beth Israel ER patients, blinded review by two additional physicians. o1 hit "exact or very close" 67% of the time vs. 55% and 50% for the human attendings. Caveats matter: text-only inputs from EHRs, internal medicine doctors as the comparison (not ER specialists), and the researchers themselves say it's not ready for real life-or-death calls. Still, the gap is wider than the usual "AI assists doctor" framing suggests.
Oscars bar AI-generated work. The Academy now requires that performances be "credited in the film's legal billing and demonstrably performed by humans with their consent" and that screenplays be "human-authored." The Academy reserves the right to demand documentation. Backdrop: the AI Val Kilmer recreation, "actress" Tilly Norwood, and the unfinished business of the 2023 strikes.
KC Green vs. Artisan over "This is fine." The webcomic's creator says AI sales-agent startup Artisan put his dog-in-burning-room character on a subway ad with the caption "my pipeline is on fire" promoting "Ava the AI BDR." Green's quote: "it's not anything i agreed to. it's been stolen like AI steals." He's looking for representation. Artisan says it has "a lot of respect for KC Green" and is reaching out directly.
Models, behavior, and trust
Anthropic's sycophancy study. Using an automatic classifier that scores pushback, position-holding, proportional praise, and candor, Anthropic found Claude is sycophantic in only 9% of conversations overall. The exceptions are the spots that matter most: 38% of spirituality conversations and 25% of relationship-advice conversations. The model is most deferential exactly where users are most vulnerable.
ruflo enters the agent-framework crowd. Another orchestration layer for agentic workflows aimed at developers building multi-agent systems on top of existing model APIs. The genre is getting crowded fast; differentiation increasingly comes down to ergonomics and which backends it cleanly supports.
Tools, hobbies, and humans
TechCrunch's dictation app roundup. Wispr Flow ($15+/mo) leads on polish; Willow and Monologue lean into local-first and privacy (Monologue ships a downloadable model for fully offline use plus a $10/mo tier). Superwhisper's $249.99 lifetime price stands out, and Aqua claims the lowest latency at $8/mo. Voice is quietly becoming a primary input modality.
Simon Willison ships "Sightings." A blog feature that pulls in his iNaturalist wildlife photos (208 results, back-populated across a decade). Built with Claude Code for web. Small, but a clean example of an individual using agent-coding tools to bolt new modules onto their own site without spinning up a project.
404 Media: dream bizarreness correlates with mind-wandering. 207 Italian adults, 3,700+ dream reports analyzed with NLP for semantic structure. People who let their minds drift while awake report stranger dreams. The AI angle is methodological: this is the kind of psych research that just wasn't tractable before language models could quantify narrative shifts at scale.
The Throughline
The connective tissue today is where the lines get drawn. A model outscores ER doctors on triage. The Academy says a model can't write your screenplay. Anthropic's own model gets caught being too agreeable about spirituality. An ad agency uses a beloved comic without asking. Each story is a small jurisdictional fight: what is this thing allowed to do, and on whose terms?
The Harvard study is the most provocative because it cuts against the usual "AI as copilot" framing. 67% vs. 50% isn't a tie that needs a human to break, it's a meaningful gap. But read the caveats and the real picture emerges: ER doctors aren't trying to be the most accurate diagnostician in the room, they're trying to rule out the things that will kill you in the next four hours. The benchmark is wrong, not the score. The next phase of "AI in medicine" coverage will hinge on whether researchers design studies that match what clinicians actually optimize for.
Anthropic's sycophancy paper is the quiet companion piece. If models are good enough to diagnose, they're also good enough to validate, and the study shows Claude is most validating exactly in the soft, personal domains where users have the least defense against agreement-as-comfort. 38% sycophancy on spirituality isn't a model bug, it's a UX consequence of RLHF optimizing for "users feel heard." The medical and the personal are converging on the same model surface, and the calibration that makes a chatbot feel kind is the calibration that makes a diagnostic tool dangerous.
Meanwhile KC Green and the Academy are drawing the other kind of line: provenance. Both stories say, in different registers, that human authorship has to be demonstrable. That's not a Luddite reflex. It's the legal and cultural infrastructure catching up to a world where output is no longer evidence of effort.
The Bigger Picture
Zoom out and the Stratechery framing reframes everything else. If Google's earnings pop is "really" Anthropic revenue routed through TPUs and Cloud, then the supposed three-horse race (OpenAI, Anthropic, Google) is actually two intertwined balance sheets where Anthropic's success is Google's monetization story. Meta, with no equivalent third-party cash flow on its silicon, looks worse despite a stronger ad business. The hyperscaler era is no longer about who has the best model. It's about who is collecting rent while everyone builds on top.
DeepClaude is the other side of that coin. The moment a 17x-cheaper drop-in replacement for Claude Code's agent loop ships on GitHub, the "Claude Code subscription" becomes contestable. Anthropic's product moat starts looking less like the model and more like the harness, the agent loop, the polish, the trust. That's defensible, but only if Anthropic keeps shipping faster than the open-source shims can catch up. And it puts more weight on the inference-revenue thesis: the models are commodifying at the API layer, the differentiated revenue is in the agentic application layer and in the cloud underneath.
Underneath all of it: AI is moving into domains where being right matters and being polite is dangerous. Medicine, courtrooms, awards bodies, ad campaigns, mental-health-adjacent advice. The next year of headlines won't be about benchmarks. It'll be about who is liable when a 67%-accurate model gets the 33% wrong, and whether the institutions we have are built to answer that.
What to Watch
Anthropic's TPU revenue disclosure. Watch for Google to start breaking out (or pointedly not breaking out) what fraction of Cloud growth is Anthropic compute. If Stratechery's read is right, that line item is the real story of 2026 earnings.
Prospective clinical trials of o1-class models. The Harvard team explicitly called for them. The first real-world ER deployment study, with proper specialist comparators and full clinical inputs, will reset the conversation, in either direction.
The KC Green case as precedent. If Artisan settles fast and quietly, expect a wave of similar claims from independent artists who can now point to a marker. If it litigates, the discovery phase could be the most informative public window yet into how an AI ad-creative pipeline actually sources its imagery.