
AI Newsroom (Digest)

AI Timeline

A curated roundup of AI's fast lane - crafted with a model's help - spotlighting the biggest launches, research milestones, and what's making waves in the industry.


Complete News Archive

2026

January 2026

2026-01-28 (Wednesday)

  • Chrome adds Gemini 3 with Auto Browse features
    Google Chrome integrated the Gemini 3 model, introducing a new side panel, Nano Banana image generation, and Connected Apps support. The release features 'auto browse,' an agentic tool that handles complex, multi-step workflows like form filling and travel planning for Pro and Ultra subscribers. (blog.google)

2026-01-27 (Tuesday)

  • Ai2 launches SERA open coding agents
    The Allen Institute for AI (Ai2) released SERA, a family of open coding agents designed to help enterprise developer teams train smaller, open models on their specific codebases cost-effectively. The release emphasizes transparency and data sovereignty by providing training recipes and synthetic data generation methods alongside the 8B and 32B-parameter models. (aibusiness.com)

  • Moonshot AI releases Kimi K2.5 open weights model and Kimi Code
    Moonshot AI released the open-source Kimi K2.5, a native multimodal model trained on 15 trillion tokens that challenges closed models in coding and video benchmarks. The company also launched Kimi Code, a coding agent that allows developers to use images and videos as input within terminals or IDEs. (techcrunch.com)

2026-01-26 (Monday)

  • Anthropic launches interactive Claude apps for workplace
    Anthropic introduced a new feature allowing Claude users to launch interactive workplace apps like Slack, Canva, and Figma directly within the chatbot interface. Built on the Model Context Protocol, this integration aims to enhance enterprise productivity by combining Claude's intelligence with dedicated visual tools. (techcrunch.com)

2026-01-23 (Friday)

  • Report highlights AI displacement of translation jobs
    A report highlights the significant negative impact of AI and machine translation on the translation industry, noting that over one-third of translators have lost work and nearly half have suffered income reductions due to generative AI adoption. (edition.cnn.com)

  • FastRender: Browser built by thousands of parallel agents
    Cursor engineer Wilson Lin demonstrated FastRender, a from-scratch web browser built by swarms of autonomous agents, peaking at 2,000 concurrent agents and generating nearly 30,000 commits. The experiment showcases the potential of massive parallel agent coordination to manage complex engineering tasks with minimal human intervention. (simonwillison.net)

  • DeepMind seeks Chief AGI Economist
    Google DeepMind opened a position for a Chief AGI Economist to lead research on post-AGI economics, scarcity, and power distribution. This hiring initiative underscores the organization's strategic preparation for the profound economic transformations expected with the arrival of Artificial General Intelligence. (job-boards.greenhouse.io)

2026-01-22 (Thursday)

  • Google launches Gemini-powered SAT practice tests
    Google announced that its Gemini AI can now generate free, interactive SAT practice tests with scoring and answer explanations, developed in partnership with The Princeton Review. This feature aims to provide accessible test preparation and potentially disrupt the multi-billion dollar test prep and tutoring industry. (arstechnica.com)

  • Axios CEO predicts AI upheaval in family letter
    Axios CEO Jim VandeHei published a public letter to his family warning that AI will upend work and life more profoundly than electricity did, and within months rather than years. He predicts imminent obsolescence for many knowledge-work jobs and urges aggressive daily experimentation with large language models. (axios.com)

  • Yann LeCun launches AMI Labs to advance world models beyond LLMs
    Yann LeCun announced AMI Labs, a Paris-based AI company developing world models as an alternative to large language models (LLMs). The company emphasizes open-source AI and a 'third path' between US and Chinese tech dominance. (technologyreview.com)

2026-01-21

  • Jensen Huang outlines AI infrastructure at Davos
    At the World Economic Forum, NVIDIA CEO Jensen Huang described AI as the 'largest infrastructure buildout in human history,' framing it as a 'five-layer cake' spanning energy to applications that will drive global job creation. He urged nations to treat AI as critical infrastructure and emphasized that the technology creates jobs by shifting workers from tasks to purpose. (blogs.nvidia.com)

  • OpenAI outlines monetization amid $1.4T commitments
    OpenAI CFO Sarah Friar detailed plans to monetize services through ads, licensing, and outcome-based pricing to support massive infrastructure spending. The company targets practical adoption to offset projected commitments of $1.4 trillion over the next eight years. (aibusiness.com)

2026-01-20

  • OpenAI launches ads and expands ChatGPT Go globally
    OpenAI announced the global rollout of its $8/month ChatGPT Go subscription and confirmed plans to introduce advertisements on free and Go tiers to generate revenue, reversing CEO Sam Altman's previous opposition to ads. (aibusiness.com)

2026-01-15

  • Black Forest Labs releases FLUX.2 [klein] models
    Black Forest Labs released the FLUX.2 [klein] model family, designed for sub-second image generation and editing within a unified architecture optimized for consumer hardware. The release aims to advance interactive visual intelligence, offering real-time performance for AI agents and creators. (bfl.ai)

  • Google explains Nano Banana model naming origin
    Google revealed the backstory behind the codename 'Nano Banana' for its Gemini 2.5 Flash Image model, attributing the moniker to a product manager's nicknames. The model subsequently became a top-rated image editing tool, leading to the adoption of the name for its successor, 'Nano Banana Pro'. (blog.google)

2026-01-14

  • Cursor details multi-agent autonomous coding scaling
    Cursor published research findings on successfully scaling autonomous coding to hundreds of concurrent agents, generating over a million lines of code across complex projects like a web browser. The study emphasizes the efficacy of a 'planners and workers' architecture to solve coordination bottlenecks in long-running agentic workflows. (cursor.com)

2026-01-12

  • Apple to Use Google Gemini for Siri Overhaul
    Apple announced a strategic partnership with Google to use Gemini as the foundational platform for revamping Siri and Apple Intelligence. This agreement shifts Apple away from OpenAI and reportedly involves a $1 billion annual payment to Google. (aibusiness.com)

2026-01-09

  • Siemens, Nvidia Partner on Industrial AI Operating System
    Siemens and Nvidia expanded their partnership to co-develop an industrial AI operating system aimed at enabling fully AI-driven adaptive manufacturing and digital twins. This initiative includes the launch of the Digital Twin Composer and nine new AI-powered industrial copilots to streamline operations. (aibusiness.com)

2026-01-07

  • Boston Dynamics Unveils Production-Ready Humanoid Robot Atlas
    Boston Dynamics debuted the production-ready, fully electric version of its humanoid robot Atlas at CES, marking its official entry into the industrial robotics market. The launch was accompanied by a partnership with Google DeepMind to integrate Gemini Robotics AI models for enhanced perception and reasoning. (aibusiness.com)

  • xAI Raises $20B in Series E Funding
    Elon Musk's xAI secured $20 billion in a Series E funding round backed by investors including Nvidia and Cisco to rapidly scale its compute infrastructure and GPU clusters. This significant investment aims to support the company's growth and the development of Grok models, despite ongoing controversies regarding the model's image generation capabilities. (aibusiness.com)

2026-01-06

  • Mercedes CLA to debut Nvidia Alpamayo driving tech
    Nvidia announced that its new autonomous driving software, based on the open-source Alpamayo model family, will debut in the Mercedes CLA by late 2026. This collaboration marks the first production application of Nvidia's reasoning-based physical AI, designed to handle complex edge cases and improve vehicle safety. (aibusiness.com)

2026-01-05

  • Nvidia releases Rubin platform and open models
    Nvidia launched the Rubin platform, consisting of six new AI chips (Vera CPU, Rubin GPU, NVLink 6 Switch, ConnectX-9 SuperNIC, BlueField-4 DPU, and Spectrum 6 Ethernet Switch) alongside new open models in the Nemotron and Cosmos families. This release highlights Nvidia's strategy of offering full-stack 'AI supercomputers' to maintain market leadership against growing competition. (aibusiness.com)

  • Boston Dynamics and DeepMind Form AI Partnership
    Boston Dynamics and Google DeepMind announced a strategic partnership to integrate Gemini Robotics AI foundation models with the new Atlas humanoid robots. The collaboration aims to enable advanced industrial task capabilities, starting in the automotive sector. (bostondynamics.com)

2026-01-03

  • Financial Times analyzes AI boom via Soros theory
    The Financial Times published an analysis applying George Soros's reflexivity theory to the market dynamics and investment cycles surrounding the artificial intelligence boom. (ft.com)

  • Satya Nadella: AI outlook for 2026
    Satya Nadella outlined his expectations for AI in 2026, emphasizing a shift from discovery to widespread diffusion and the role of AI as scaffolding for human potential. He highlights the transition from isolated models to the orchestration of complex systems for real-world impact. (snscratchpad.com)

2025

December 2025

2025-12-31

  • 17 Predictions for AI in 2026
    Understanding AI published 17 forecasts for 2026, predicting continued rapid model improvement but modest immediate economic impact. Predictions cover big tech spending, model capabilities, regulation, and autonomous vehicles. (understandingai.org)

  • 2025 LLM trends year-in-review
    Simon Willison published a comprehensive retrospective summarizing key developments in the LLM space for 2025, highlighting trends such as the rise of reasoning models, agents, coding agents, and the dominance of Chinese open-weight models. (simonwillison.net)

2025-12-30

  • 2025 LLM Year-in-Review: Reasoning and RLVR
    Sebastian Raschka reviews the major developments in LLMs during 2025, identifying the year as dominated by reasoning models trained with Reinforcement Learning with Verifiable Rewards (RLVR) and the GRPO algorithm. The article analyzes trends in inference scaling, architecture efficiency, and the issue of 'benchmaxxing' while offering predictions for 2026. (sebastianraschka.com)

2025-12-29

  • Meta Acquires Manus AI for ~$4B
    Meta Superintelligence Labs has snapped up the high-flying startup Manus AI in a deal valued at around $4 billion. It is a remarkable outcome for a company that raced to $100 million in annual recurring revenue in just nine months. (news.smol.ai)

2025-12-25

  • Nemotron 3 Nano Model Release
    NVIDIA released Nemotron 3 Nano, an open-source hybrid Mamba-Transformer language model optimized for agentic reasoning with 1M-token context support. The company claims improved accuracy and 3.3x higher throughput than comparable models such as GPT-OSS-20B. (arxiv.org)

  • TokSuite Toolkit Release for Tokenizer Research
    Researchers introduced TokSuite, a toolkit and benchmark enabling systematic study of how tokenizer selection impacts language model behavior, using 14 structurally identical models trained with varied tokenizers. The release includes a perturbation-focused benchmark to isolate tokenization effects in real-world scenarios. (arxiv.org)

2025-12-24

  • NVIDIA acquires AI chip startup Groq
    NVIDIA announced a $20 billion deal to acquire intellectual property and talent from AI accelerator developer Groq, its largest-ever acquisition and a move to strengthen its AI chip dominance. Groq's founders and executives will join NVIDIA while maintaining partial independence under new leadership. (cnbc.com)

2025-12-23

  • MiniMaxAI Releases VIBE Dataset
    MiniMaxAI launched VIBE (Visual & Interactive Benchmark for Execution), a benchmark dataset of 200 structured coding challenges for evaluating AI agents on full-stack web development tasks. (huggingface.co)

  • MiniMax releases M2.1 language model upgrade
    MiniMax launched M2.1 with enhanced multi-language programming support for Rust, Java, and others. The model is optimized for complex real-world tasks and office workflows and improves response efficiency. (minimax.io)

2025-12-22

  • SAM 3 image segmentation model introduced
    SAM 3 was unveiled, enhancing image segmentation by incorporating conceptual understanding. This iteration builds on prior SAM models for more intuitive visual parsing. (vizuara.substack.com)

  • Lovable AI coding startup hits $6.6B valuation in Series B
    The AI startup Lovable, focused on coding tools, secured a Series B round that propelled its valuation to $6.6 billion. This milestone highlights surging investor enthusiasm for generative AI in software development. (aibusiness.com)

  • Z.ai releases GLM-4.7 coding model
    Z.ai released GLM-4.7, a coding-focused AI model showing benchmark improvements, including 73.8% on SWE-bench (up 5.8 points from the prior version) and better terminal and web UI generation capabilities.

2025-12-19

  • Anthropic releases Agent Skills as an open standard
    Anthropic has open-sourced its Agent Skills framework through a minimalist specification published on GitHub, with initial adoption by several development tools but no official support yet from OpenAI. (github.com)

  • OpenAI releases GPT-5.2-Codex model
    OpenAI launched GPT-5.2-Codex, an enhanced version of GPT-5.2 specialized for agentic coding tasks with improved context management, Windows support, and cybersecurity features, alongside an invite-only preview for security experts. (simonwillison.net)

2025-12-18

  • ChatGPT mobile app surpasses $3 billion in consumer spending
    OpenAI’s mobile application has generated over $3 billion in global consumer revenue since its 2023 launch, with the majority of that income arriving this year. The app reached this impressive financial benchmark faster than major players like TikTok and Disney+. (techcrunch.com)

2025-12-17

  • Google releases Gemini 3 Flash AI model
    Google unveiled Gemini 3 Flash, a lightweight AI model optimized for speed that outperforms the previous full-scale Gemini 2.5 Pro model. (blog.google)

2025-12-16

  • OpenAI launches GPT Image 1.5
    OpenAI unveiled GPT Image 1.5, a snappier image generator with sharper instruction adherence and editing finesse, now live for all ChatGPT users and via API. This upgrade steps up the creative game amid fierce rivalry with Google's tools. (techcrunch.com)

2025-12-10

  • NVIDIA GPUs Dominate Supercomputing Benchmarks at SC25
    NVIDIA announced at SC25 that over 85% of the TOP100 supercomputers now use GPUs, marking a historic shift from CPUs to accelerated computing. Their GPUs also topped the Green500 and Graph500 benchmarks, showcasing superior energy efficiency and performance. (blogs.nvidia.com)

2025-12-09

  • Anthropic and Accenture form strategic AI partnership
    Anthropic and Accenture have agreed to a three-year partnership that will see Accenture's 30,000 employees trained on Claude and the creation of a joint business group to help enterprises adopt AI. (techcrunch.com)

  • Mistral releases Devstral 2 and Devstral Small 2 models
    Mistral has quietly released two new models: Devstral 2 (123B) and Devstral Small 2 (24B), both optimized for coding agents. The larger model hits 72.2% on SWE-bench Verified and is up to 7× more cost-efficient than Claude Sonnet, though it comes with a modified MIT license that bars use by companies earning over $20 million monthly. (simonwillison.net)

2025-12-02

  • Mistral AI releases Mistral 3 model family
    Mistral AI unveiled its next-generation Mistral 3 lineup, including compact dense models in 3B to 14B sizes and the flagship Mistral Large 3 sparse mixture-of-experts powerhouse with 675B total parameters. All variants are open-sourced under Apache 2.0, boasting top-tier multimodal and multilingual prowess for developers and edge devices alike. (mistral.ai)

2025-12-01

  • Sebastian Raschka reviews internals of DeepSeek models V3 to V3.2
    Sebastian Raschka traced the evolution from DeepSeek V3 to the upgraded V3.2 open-weight model, which matches top proprietary rivals like GPT-5 on benchmarks. The newer model introduces sparse attention and refined reinforcement learning for enhanced reasoning. (sebastianraschka.com)

  • DeepSeek releases V3.2 flagship model and reasoning variant
    DeepSeek launched its new flagship AI model, V3.2, with an experimental reasoning-enhanced version called V3.2-Speciale. These massive 685-billion-parameter models are openly available under the MIT license and excel at math, reaching gold-medal-level IMO performance. (simonwillison.net)

  • Hugging Face launches Transformers v5 library
    Hugging Face rolled out Transformers v5 after five years, emphasizing cleaner code, modular designs, and support for over 400 architectures amid daily installs topping 3 million. This update aims to streamline contributions and bolster the AI ecosystem's backbone. (huggingface.co)

November 2025

2025-11-28

  • Google commercializes TPUv7 for external AI customers
    Google ramps up sales of its advanced TPUv7 chips to outside firms like Anthropic, which placed a massive 1GW+ order, positioning the hardware as a serious rival to Nvidia's dominance. This shift highlights a growing ecosystem challenging the established GPU leader with cost savings and strong performance. (newsletter.semianalysis.com)

  • Moonshot AI introduces Kimi Agentic Slides
    Agentic Slides is powered by Nano Banana Pro and offers agentic search (Kimi K2), file upload, a fully editable slide canvas, and export to PowerPoint format. (x.com)

2025-11-27

  • The Thinking Game - DeepMind Documentary
    A documentary filmed over five years follows Demis Hassabis and the DeepMind team as they pursue the nature of intelligence, moving from game-playing AI to solving protein folding. In 2020, DeepMind had already released the AlphaGo documentary, featuring the legendary Move 37 from the 2016 Go match against Lee Sedol. (youtube.com)

  • DeepSeek releases open-source Math-V2 model
    DeepSeek unveiled a hefty 685-billion-parameter open-weights model tuned for top-tier math reasoning. It matches proprietary rivals by earning gold-medal equivalents on elite competitions like IMO and Putnam. (huggingface.co)

2025-11-26

  • Prime Intellect releases INTELLECT-3 106B MoE model
    Prime Intellect unveiled INTELLECT-3, a 106 billion parameter Mixture-of-Experts model excelling on math, code, and reasoning benchmarks via large-scale reinforcement learning. They open-sourced the full training stack, from weights to environments, empowering broader access to cutting-edge AI development. (primeintellect.ai)

  • AI-generated band Breaking Rust tops Billboard Country Digital Song Sales chart
    An AI-created country band named Breaking Rust has hit number one on Billboard's Country Digital Song Sales chart, showing audiences eagerly consuming machine-made music. This underscores the rising appeal of AI-generated entertainment amid ongoing hype. (technologyreview.com)

2025-11-25

  • Black Forest Labs releases FLUX.2 image models optimized for NVIDIA RTX GPUs
    Black Forest Labs unveiled the FLUX.2 series of cutting-edge image generators, boasting photorealistic details, pose control, and multi-reference consistency. NVIDIA teamed up for RTX GPU tweaks via FP8 quantization, trimming VRAM use by 40% to make high-fidelity AI art more accessible on consumer hardware. (blogs.nvidia.com)

  • Google launches Ironwood seventh-gen TPU
    Google unveiled Ironwood, its newest Tensor Processing Unit, as the seventh generation and most potent, efficient chip design so far. It's now ready for use in Google Cloud services. (blog.google)

  • Tom Gally releases SVG generation benchmark for frontier LLMs
    Tom Gally crafted a fun benchmark with 30 whimsical prompts, challenging nine top LLMs to produce SVG artwork like a sloth at the controls of an excavator. The results offer a charming glimpse into which models shine at creative vector graphics. (simonwillison.net)

2025-11-24

  • Anthropic releases Claude Opus 4.5
    Anthropic released Claude Opus 4.5, billing it as the top performer for coding, agents, and computer tasks amid rivalry from recent GPT and Gemini updates. Benchmarks edge ahead, yet real-world gains over prior models like Sonnet feel subtle. (simonwillison.net)

2025-11-23

  • Interconnects.ai lists top US labs releasing open AI models
    A helpful roundup spotlights key US players like Ai2 with Olmo, Nvidia's Nemotron, and others dropping quality open models, amid rising buzz in open AI scenes on both sides of the Pacific. It's a timely snapshot as truly open releases pick up steam. (interconnects.ai)

2025-11-22

  • Ai2 releases fully open Olmo 3 LLM series
    The Allen Institute for AI unveiled the Olmo 3 family, spotlighting the 32B Olmo 3-Think model as the top fully open option in its class, complete with training data, checkpoints, and tools to peek inside reasoning steps. They also shared smaller 7B variants trained efficiently on a refined 6-trillion-token dataset. (simonwillison.net)

2025-11-20

  • Google DeepMind launches Nano Banana Pro image model
    Google DeepMind unveiled Nano Banana Pro, a cutting-edge image generation and editing tool powered by Gemini 3 Pro. It brings enhanced control, sharper text handling, and richer world understanding to turn ideas into polished visuals. (blog.google)

  • AI Engineering Code Summit discusses shift to agent swarms in coding
    Industry leaders at the NYC summit outlined a pivot from single large-context AI agents to multi-agent 'ant swarms' for reliable coding, with Steve Yegge forecasting the demise of traditional IDEs by 2026. This emerging approach emphasizes context engineering and verification to sidestep model limitations like the 'Dumb Zone.' (turingpost.com)

  • Allen AI releases OLMo 3 open reasoning models
    Allen AI unveiled OLMo 3, a fresh lineup of fully open 7B and 32B language models topping charts in base performance and reasoning tasks. These releases include all training details, paving the way for community innovation without the usual black-box mysteries. (allenai.org)

  • Hugging Face launches AnyLanguageModel for Apple LLM integration
    Hugging Face unveiled AnyLanguageModel, a Swift package that unifies APIs for local open-source models and cloud providers on Apple devices. It acts as a seamless swap for Apple's Foundation Models, easing developer experimentation with LLMs. (huggingface.co)

2025-11-19

  • MIT Technology Review Insights report on AI adoption in manufacturing
    Manufacturers are ramping up AI use alongside digital twins to streamline factories and cut downtime, with half now running AI in production versus just over a third last year. Big firms lead the pack at 77% adoption. (technologyreview.com)

  • OpenAI releases GPT-5.1-Codex-Max for Codex CLI
    OpenAI debuted GPT-5.1-Codex-Max as the default model in its Codex CLI coding agent, boasting top benchmark scores and a novel compaction technique for managing extended coding tasks over vast token contexts. This specialist model shines in agentic workflows, edging out rivals like Gemini 3 Pro. (simonwillison.net)

  • Hollywood backlash against signing AI actress Tilly Norwood
    Talent agencies explored representing AI-generated actress Tilly Norwood, prompting sharp criticism from actors like Emily Blunt and Natasha Lyonne who decried it as frightening and misguided. This flare-up underscores broader Hollywood anxieties about AI displacing human performers. (understandingai.org)

2025-11-18

  • Microsoft, NVIDIA, and Anthropic Launch Major AI Infrastructure Partnerships
    Microsoft, NVIDIA, and Anthropic are teaming up to supercharge Anthropic's Claude models on Azure using NVIDIA hardware, with commitments for vast compute resources and billions in joint investments. This move expands enterprise access to advanced AI while optimizing performance across cutting-edge systems. (blogs.nvidia.com)

  • Google releases Gemini 3 AI model
    Google unveiled Gemini 3, its most capable AI model to date, poised to empower users in turning concepts into reality with enhanced intelligence. Leaders like Sundar Pichai and Demis Hassabis highlighted this step forward in AI capabilities. (blog.google)

  • Google launches Search powered by Gemini 3
    Google rolled out its Gemini 3 AI model to enhance Search, delivering sharper reasoning and fresh interactive visuals tailored to user queries. This update promises a more intuitive experience with dynamic tools and simulations. (blog.google)

  • Google releases Antigravity IDE
    Google unveiled Antigravity, a desktop app blending a VS Code-style editor, agent dashboard, and browser tools to streamline app-building with Gemini models. It generates handy Markdown artifacts to track tasks and progress along the way. (simonwillison.net)

2025-11-12

  • German court finds OpenAI liable for ChatGPT copyright breach
    A German court decided OpenAI broke national copyright rules by training ChatGPT on protected music without approval, directing the firm to compensate rights manager GEMA. This sets an early European benchmark for AI respecting creators' rights. (techcrunch.com)

  • Waymo enables freeway service for robotaxis
    Waymo now lets its autonomous vehicles use freeways for paying customers in Phoenix, Los Angeles, and the San Francisco Bay Area, a smart upgrade that speeds up trips across vast urban landscapes. This positions the service as a credible challenger to rideshare leaders. (understandingai.org)

  • Moonshot AI releases Kimi K2 Thinking model
    Moonshot AI introduced Kimi K2 Thinking, an enhanced open-source reasoning agent that handles step-by-step problem-solving and chains hundreds of tool calls, edging out top rivals on benchmarks. This builds on their push for agentic intelligence with strong long-context retention. (turingpost.com)

2025-11-11

  • Google launches Private AI Compute
    Google unveiled Private AI Compute, a setup that lets users draw on cloud-hosted Gemini models for AI assistance while keeping their data strictly private. The launch aims to combine powerful AI with robust privacy safeguards. (blog.google)

2025-11-10

  • Kimi-Linear attention architecture unveiled
    Kimi-Linear emerges as a hybrid design blending linear recurrence with selective full attention for efficient long-context modeling. It promises to overcome quadratic scaling limits while preserving expressive power through innovations like Kimi Delta Attention. (vizuara.substack.com)

2025-11-06

  • The “Sutskever Memo” Saga
    According to new testimony, OpenAI cofounder Ilya Sutskever compiled a 52-page memo accusing CEO Sam Altman of repeatedly lying and pitting executives against one another, which eroded the board’s trust and led to Altman's ouster. The saga offers a rare glimpse into the turmoil at OpenAI, one of the world’s most influential AI companies, and shows how internal chaos and broken trust can spark a dramatic leadership shake-up. (theverge.com)

2025-11-04

  • Sebastian Raschka overviews non-transformer LLM architectures
    Sebastian Raschka released a detailed blog post surveying emerging alternatives to dominant autoregressive transformer LLMs. It highlights linear attention hybrids, text diffusion, code world models, and recursive transformers for better efficiency and performance. (sebastianraschka.com)

2025-11-01

  • Andrej Karpathy releases nanochat tiny language model
    Andrej Karpathy unveiled nanochat, a compact language model that fits on everyday hardware and illustrates full AI training cycles for educational purposes. This open project rapidly attracted over 35,000 GitHub stars, acting as a practical lab for hands-on experimentation. (turingpost.com)

October 2025

2025-10-27

  • Hugging Face launches huggingface_hub v1.0
    After five years of evolution, Hugging Face unveiled version 1.0 of its core Python library, huggingface_hub, boosting performance with modern HTTP tools and a revamped CLI for seamless access to millions of models, datasets, and Spaces. This mature release supports the next era of open machine learning for a vast community. (huggingface.co)

  • Big tech AI capex surges to historic boom levels
    Major tech firms like Amazon, Meta, Microsoft, Alphabet, and Oracle poured $241 billion into capex last year, equaling 0.82% of US GDP and on track to surpass peaks from iconic projects like Apollo. This spending, largely AI-driven, marks one of the biggest investment surges since World War II, with plans for even more ahead. (understandingai.org)

2025-10-22

  • AI wildfire metaphor: the ecosystem is burning, not bubbling
    The article reframes the AI market as a wildfire, not a bubble, arguing that the coming shakeout will clear the underbrush and leave the strongest companies standing, much like a forest fire clears space for new growth. (ceodinner.substack.com)