← Brevix

AI / LLM Intelligence Briefing — Dec 2025 to 14 Jun 2026

Frontier AI & large language models · six-month lookback · technology-intelligence delta

First run. No ai_seen_items.md existed, so this is the initial AI/LLM briefing — all items reported for the first time and seeded into memory. Tiers: Demonstrated peer-reviewed/replicated · Reported vendor/single-source/preprint · Projected roadmap · Contested.
Source-quality caveat. This topic attracts heavy AI-generated aggregator content. Load-bearing items were verified against primary sources (lab blogs, Nature/Axios/Time/CNBC, HuggingFace). Almost all benchmark numbers are vendor- or partner-reported and un-replicated — treat "state of the art," olympiad "gold-level," and partner anecdotes as marketing-adjacent. Items resting only on aggregators are flagged unverified.

1 · Top takeaways

2 · By area

Models & releases

Training & architecture

Inference & systems

Evaluation & benchmarks

Agents & applications

Safety, alignment & interpretability

Policy, standards & governance

3 · New commercial activity

OrgWhat they doStage / fundingThis window's updateTier
AnthropicFrontier LLMs (Claude)$65B Series H, ~$965B post-money; ~$47B run-rate revenueOvertook OpenAI as most valuable AI startup (28 May 2026)Demonstrated
OpenAIFrontier LLMs (GPT)~$110B raised at ~$840B post (secondary-sourced)Mega-round ~Feb 2026 (SoftBank, Nvidia, Amazon)Reported
xAIFrontier LLMs (Grok)~$250B all-stock (reported)SpaceX reportedly acquired xAI (~Feb 2026) — needs primary confirmationReported
DeepSeekOpen-weights LLMsV4 (MIT license) released 24 Apr 2026; R2 still unreleasedReported

U.S. venture funding reportedly hit a record ~$267B (PitchBook) with OpenAI/Anthropic/xAI dominating; large Amazon Trainium and Nvidia GPU commitments reported (specific GW/GPU figures secondary-sourced).

4 · Watch list

5 · Quiet areas

6 · Sources

Confidence note: Mixed — the policy/governance headline and Anthropic's valuation are well-corroborated across reputable outlets; nearly all capability/benchmark claims are vendor- or partner-reported and un-replicated, and several confident aggregator specifics (Meta "Muse Spark," SpaceX–xAI, exact GPT-5.4/5.5 dates, third-party SWE-bench figures, the "0.97 PGR" and "4× fewer flaws" claims) remain unverified. The standout reliability caveat on the open-weights side is DeepSeek contamination scrutiny. report.css could not be inlined this run; a self-contained fallback style was used.