AI News - March 2026: Key Events & Releases

Last updated: March 10, 2026 at 20:10 CET

AI Timeline

The artificial intelligence landscape is evolving at breakneck speed, balancing historic financial injections with deepening regulatory and economic concerns. AMI Labs has raised $1.03 billion to build world models, DeepMind is celebrating AlphaGo's 10-year impact, and Microsoft has launched the Wave 3 Copilot alongside the Frontier Suite. Conversely, Anthropic is making waves by releasing a study warning of a white-collar job recession and suing the US government over its national security blacklist.

2026

March 2026

2026-03-10

AMI Labs raises $1.03 billion for world models
Founded by former Meta AI chief Yann LeCun, Paris-based AMI Labs raised $1.03 billion in seed funding to develop AI systems that understand the physical world. The round values the startup at $3.5 billion and includes investors such as Bezos Expeditions and Cathay Innovation. (aibusiness.com)
NVIDIA and Thinking Machines Lab strategic partnership
NVIDIA and Thinking Machines Lab announced a multiyear strategic partnership to deploy at least one gigawatt of next-generation NVIDIA Vera Rubin systems for frontier model training, which includes a significant investment from NVIDIA. (blogs.nvidia.com)
Gemini Sheets achieves 70.48% SpreadsheetBench score
Google announced that Gemini in Google Sheets has achieved state-of-the-art performance on the SpreadsheetBench dataset with a 70.48% success rate, nearing human expert ability. The update introduces new beta features enabling users to create, organize, and edit complex spreadsheets using natural language commands. (blog.google)

2026-03-09

DeepMind reflects on AlphaGo's 10-year impact
Google DeepMind published a retrospective by Demis Hassabis marking the 10th anniversary of AlphaGo's victory, examining its foundational role in accelerating scientific discovery and the path toward artificial general intelligence. (deepmind.google)
Nscale valued at $14.6B after raising another $2B
Nscale has raised a $2 billion Series C round, valuing the company at $14.6 billion, marking one of the largest funding rounds in Europe. (aibusiness.com)
Microsoft launches Wave 3 Copilot and Frontier Suite
Microsoft announced Wave 3 of Microsoft 365 Copilot featuring multi-model support with Anthropic's Claude and agentic "Copilot Cowork" capabilities, alongside the general availability of Agent 365 and the new Microsoft 365 E7: The Frontier Suite. As noted in the Microsoft 365 Blog, the company emphasizes "Intelligence + Trust" as core pillars, with The Register highlighting the integration of Anthropic's Claude to handle long-running tasks within Copilot Cowork. (blogs.microsoft.com)

2026-03-07

Northwestern develops AI-evolved legged metamachines
Northwestern University engineers introduced legged metamachines, modular robots designed by evolutionary algorithms that can reassemble and recover from injury. Published in the Proceedings of the National Academy of Sciences, the research demonstrates machines that adapt to unstructured environments and withstand catastrophic damage. (techxplore.com)

2026-03-06

Anthropic study warns of white-collar job recession
Anthropic researchers published a report titled 'Labor market impacts of AI: A new measure and early evidence,' introducing a metric to compare AI capabilities against actual usage data. The study finds that while AI can theoretically handle the vast majority of tasks in fields like computer science and administration, current adoption is low, though it warns of a potential 'Great Recession for white-collar workers' as this gap closes. (fortune.com)
Anthropic sues US over national security blacklist
Anthropic filed a lawsuit against the US government after being designated a national security supply chain risk, effectively barring it from military contracts due to its refusal to permit AI use in autonomous weapons and domestic surveillance. (go.theregister.com)

2026-03-05

Labor market impacts of AI: A new measure and early evidence
Anthropic introduces a new measure called "observed exposure" that combines theoretical LLM capability and real-world usage data to track AI displacement risk. The report finds actual AI coverage remains low compared to theoretical capabilities, with higher exposure occupations projected to grow less by 2034. Early evidence shows no systematic increase in unemployment for highly exposed workers, though there are hints of slowed hiring for younger workers. (anthropic.com)
OpenAI releases GPT-5.4 and GPT-5.4 Pro
OpenAI launched GPT-5.4 and GPT-5.4 Pro, frontier models designed for professional work that feature native computer-use capabilities, advanced agentic workflows, and support for up to 1 million tokens of context. Developer Simon Willison noted the release offers a slight price bump and outperforms previous specialist coding models in his review. (openai.com)

2026-03-04

OpenAI releases GPT-5.3 Instant with reduced refusals
OpenAI released GPT-5.3 Instant, an update designed to improve conversational flow by reducing unnecessary refusals and moralizing preambles while decreasing hallucination rates. The Register notes that while the model offers more factual responses and better web synthesis, it shows minor regressions in some safety benchmarks compared to its predecessor. (openai.com)
Microsoft releases Phi-4-reasoning-vision-15B
Microsoft announced Phi-4-reasoning-vision-15B, a 15 billion parameter open-weight multimodal model designed to balance high-level reasoning capabilities in math and science with computational efficiency. The release shares technical insights on using dynamic resolution vision encoders and a mixed training approach to optimize for both reasoning-heavy and perception-focused tasks. (microsoft.com)

2026-03-03

OpenCode iOS App Launches on App Store
The OpenCode desktop application has been ported to iOS and is now officially available on the App Store, featuring WhisperKit speech-to-text capabilities and a custom keyboard. (apps.apple.com)
Report: AI adoption outpaces CIOs' management capacity
The Logicalis Global CIO Report 2026 reveals that 51% of tech leaders believe AI adoption is moving faster than their ability to manage it, citing significant gaps in governance frameworks, skills, and sustainability planning. (go.theregister.com)
Google releases Gemini 3.1 Flash-Lite model
Google DeepMind announced the preview release of Gemini 3.1 Flash-Lite, positioning it as the fastest and most cost-efficient model in the Gemini 3 series for high-volume workloads. The new model offers enhanced speed and reasoning capabilities at a significantly lower price point, outperforming the previous 2.5 Flash generation. (deepmind.google)

2026-03-01

AI agents drive the SaaSpocalypse
TechCrunch examines how AI coding agents are causing a 'SaaSpocalypse' by shifting the build-versus-buy dynamic and undermining per-seat pricing, leading to a $1 trillion sell-off in software stocks and stalled IPOs. (techcrunch.com)
VCs favor AI-native infrastructure over thin wrappers
Investors are shifting capital away from generic AI SaaS wrappers toward AI-native infrastructure and vertical platforms with proprietary data moats. This trend reflects a preference for deep workflow integration and domain expertise as low barriers to entry threaten surface-level automation tools. (techcrunch.com)

February 2026

2026-02-28

"Cancel ChatGPT" movement follows OpenAI DoW deal
A mainstream "Cancel ChatGPT" movement emerged online following OpenAI's partnership with the U.S. Department of War, sparking widespread user backlash and subscription cancellations. The protest contrasts with Anthropic, which refused a similar deal due to concerns over autonomous weapons and mass surveillance. (windowscentral.com)

2026-02-27

Trump bans Anthropic; OpenAI secures Pentagon deal
President Trump ordered all federal agencies to immediately cease using Anthropic's technology after the company refused Pentagon demands to remove guardrails regarding autonomous weapons and mass surveillance. Following the ban, Defense Secretary Pete Hegseth labeled Anthropic a national security risk, while OpenAI struck a deal with the Department of Defense and Anthropic defended its refusal to enable what it deems unsafe use cases. (nbcnews.com)
OpenAI secures $110B Amazon, Nvidia, SoftBank deal
OpenAI announced a $110 billion funding round with Amazon, Nvidia, and SoftBank at a $730 billion valuation, though the investments from Amazon and Nvidia are largely structured as compute infrastructure commitments. This strategic agreement secures gigawatts of capacity for Amazon Trainium and Nvidia Vera Rubin systems while SoftBank provides general operational funding. (go.theregister.com)

2026-02-26

Anthropic refuses Pentagon surveillance and autonomous weapons requests
Anthropic CEO Dario Amodei stated the company will not remove safeguards against mass domestic surveillance or fully autonomous weapons, despite threats from the Department of War to designate the company as a supply chain risk or invoke the Defense Production Act. Understanding AI reported that Defense Secretary Pete Hegseth demanded Anthropic waive these restrictions by Friday or face contract termination. (anthropic.com)
Google DeepMind releases Nano Banana 2 image model
Google DeepMind launched Nano Banana 2 (Gemini 3.1 Flash Image), a state-of-the-art image generation model that combines Pro-level quality with Flash speed, rolling out across consumer products and enterprise platforms. The release introduces advanced capabilities such as search grounding, subject consistency, configurable thinking levels, and support for up to 4K resolution.
Wayve raises $1.2B for self-driving AI
U.K. autonomous vehicle company Wayve secured $1.2 billion in a Series D funding round led by Eclipse, Balderton, and SoftBank, valuing the company at $8.6 billion. The investment will support the launch of commercial robotaxis in London and the integration of its embodied AI technology into consumer vehicles. (aibusiness.com)
Hugging Face details MoE integration in Transformers
Hugging Face published a technical overview explaining how the Transformers library integrates Mixture of Experts architectures, detailing infrastructure changes such as weight loading refactors, pluggable expert backends, and parallelism strategies. (huggingface.co)
Latent Space publishes 'WTF Happened in 2025' timeline
Latent Space launched a curated microsite documenting 2025 as a historical inflection point, highlighting when AI models reached human expert parity and agents revolutionized software development. (wtfhappened2025.com)

2026-02-25

Inception Releases Mercury 2 Diffusion LLM
Inception released Mercury 2, a reasoning language model powered by diffusion architecture that generates tokens in parallel to achieve speeds exceeding 1,000 tokens per second. The model targets production use cases like agentic loops and real-time voice interactions by offering reasoning-grade quality within low latency budgets. (inceptionlabs.ai)
Economist Impact releases AI workplace readiness report
Economist Impact published a survey of 639 executives revealing a gap between AI ambition and implementation, noting that while 88% view AI as a competitive advantage, only 4% have achieved repeatable business value at scale and just 8% have comprehensive governance frameworks. (impact.economist.com)
Review of 10 Open-Weight LLM Architectures
Sebastian Raschka published a comprehensive technical analysis of ten major open-weight LLM architectures released between January and February 2026, highlighting trends in Mixture-of-Experts, hybrid attention, and multimodal capabilities. (sebastianraschka.com)

2026-02-24

Meta strikes $100B AMD chip deal
Meta announced a multiyear agreement to purchase up to $100 billion worth of AMD chips, including MI540 GPUs and CPUs, to diversify its AI infrastructure and support the development of personal superintelligence. The deal also includes a performance-based warrant for up to 160 million shares of AMD common stock. (techcrunch.com)
Anthropic accuses Chinese labs of distilling Claude
Anthropic accused Chinese AI labs DeepSeek, Moonshot, and MiniMax of conducting industrial-scale distillation campaigns using thousands of fraudulent accounts to extract knowledge from its Claude models, raising national security concerns. An in-depth analysis of the campaigns suggests the impact varies significantly, noting that while MiniMax and Moonshot generated millions of exchanges, DeepSeek's attributed activity was comparatively small. (go.theregister.com)

2026-02-23

IBM stock drops 13% on Anthropic COBOL news
IBM shares plummeted by 13 percent after Anthropic highlighted the ability of its Claude Code tools to rapidly refactor legacy COBOL applications, sparking investor concerns over the future of the mainframe business, as covered by Yahoo Finance. (go.theregister.com)

2026-02-20

Anthropic bans third-party harnesses for Claude subs
Anthropic revised its Consumer Terms of Service to explicitly forbid using Claude Free, Pro, or Max subscription keys in third-party harnesses to prevent token arbitrage. The company clarified that OAuth access is strictly limited to official interfaces like Claude Code and Claude.ai. (go.theregister.com)
GGML and llama.cpp join Hugging Face
Georgi Gerganov and the team behind GGML and llama.cpp have joined Hugging Face to secure resources and support for the local AI ecosystem. The partnership aims to improve integration between the Transformers library and local inference tools while maintaining the project's open-source status and technical autonomy. (huggingface.co)
Video: 'My friend accidentally started an AI revolution'
As two native Austrians talking English to each other wasn't enoug - this is Andreas Klinger interviewing his friend Peter Steinberger for 20 mins on the OpenClaw Revolution: how it happened and what to expect next. (youtube.com)
Snyk CEO Peter McKay resigns to prioritize AI experience
Snyk CEO Peter McKay announced he is stepping down, stating the company requires a leader with deep roots in product innovation and AI to guide its future development - a rare moment of a CEO recognizing he is lacking key AI skills to guide the strategy of the company. (go.theregister.com)

2026-02-19

Karpathy explores highly bespoke software era
Andrej Karpathy discussed the potential impact of highly bespoke software on future user experiences, using a personal health regimen to illustrate how deeply customized applications could shape user expectations. (nitter.net)
Gemini 3.1 Pro Release
Google released Gemini 3.1 Pro, an upgraded AI model with improved reasoning and problem-solving capabilities, available in preview for developers, enterprises, and consumers. As described by Simon Willison, the model boasts improved SVG animation performance and is priced similarly to Gemini 3 Pro - which is half the price of Opus 4.6. (blog.google)
MiniMax releases M2.5 frontier model
MiniMax launched the M2.5 model, a frontier AI trained via reinforcement learning that achieves state-of-the-art results in coding and agentic tasks, while documentation from Unsloth details how to run the 230B parameter model locally. (minimax.io)

2026-02-18

Google launches Lyria 3 music generation in Gemini
Google launched the Lyria 3 music generation model within the Gemini app, enabling users to create 30-second tracks from text, photo, or video prompts. The official announcement highlights features like automatic lyric generation, cover art creation, and SynthID watermarking for content verification.
Paul Ford declares AI coding disruption arrived
Paul Ford published an opinion piece in the New York Times asserting that the AI disruption in software development has arrived, noting how agents like Claude Code now enable users to perform hundreds of thousands of dollars worth of work through 'vibe coding'. (simonwillison.net)

2026-02-17

Cohere Labs Launches Tiny Aya Multilingual Models
Cohere Labs introduced Tiny Aya, a family of compact 3.35B-parameter multilingual models capable of running locally on-device across 70+ languages. The release includes specialized regional variants, with additional specifications available on the research page and a demo space accessible for testing. (cohere.com)
Review of open weights vs closed models
Nathan Lambert publishes a comprehensive review arguing that open-weights models remain in a state of perpetual catch-up behind closed frontier models, maintaining a steady six-month performance gap. The analysis notes a shift in leadership toward Chinese models like Qwen and discusses the limitations of current benchmarks in capturing true frontier capabilities. (interconnects.ai)
Anthropic releases Claude Sonnet 4.6
Anthropic released Claude Sonnet 4.6, claiming it offers performance comparable to the larger Opus 4.5 model while maintaining the lower Sonnet pricing of $3 per million input tokens. The new model features a knowledge cutoff of August 2025, supports up to 1 million input tokens in beta, and removes support for prefixes. (simonwillison.net)

2026-02-16

SemiAnalysis releases InferenceX v2 benchmark suite
SemiAnalysis released InferenceX v2, an open-source inference benchmark revealing that NVIDIA's GB300 NVL72 achieves up to 100x better performance than H100, while AMD's MI355X shows competitive results in FP8 but lags in FP4 composability. (newsletter.semianalysis.com)
Ars Technica publishes hallucinated quotes in an article about hallucinated quotes
Ars Technica published an article regarding AI hallucinations that ironically included fabricated quotes attributed to the subject, software developer Scott Shambaugh. (medium.com)
Qwen Team Announces Qwen 3.5 Open Weights Model
The Qwen team announced the release of Qwen 3.5, marking the latest update to their large language model series. Called Qwen3.5-397B-A17B it features 397 billion total parameters with 17 billion activated per forward pass. It is a native vision-language model, supporting 201 languages. (qwen.ai)

2026-02-15

Peter Steinberger joins OpenAI, OpenClaw becomes foundation
Peter Steinberger, creator of OpenClaw, announced he is joining OpenAI to work on bringing agents to everyone, while the OpenClaw project will transition to an independent foundation. CEO Sam Altman confirmed the hire, noting Steinberger will drive the next generation of personal agents, a move Steinberger also highlighted on social media. (steipete.me)

2026-02-14

Google and OpenAI report distillation attacks by DeepSeek
Google and OpenAI released reports warning that competitors, specifically China's DeepSeek, are using 'distillation attacks' to probe and replicate the reasoning capabilities of their frontier models like Gemini and ChatGPT. These companies argue that this large-scale intellectual property theft undermines massive R&D investments and have called for US government intervention to close API loopholes. (go.theregister.com)

2026-02-13

Lex Fridman Podcast discusses OpenClaw
Lex Fridman releases episode #491 featuring Peter Steinberger to discuss OpenClaw, a viral AI agent described as having broken the internet. (youtube.com)
Anthropic raises $30B at $380B valuation
Anthropic secured a $30 billion Series G funding round led by Coatue and GIC, boosting its valuation to $380 billion, marking the second-largest private financing in tech history as it intensifies competition with OpenAI. (aibusiness.com)
Petition urges OpenAI to retain GPT-4o
A Change.org petition with over 20,000 signatures requests that OpenAI maintain access to the GPT-4o model on ChatGPT and the API, citing its unique user experience and distinct capabilities compared to newer models. (change.org)
OpenAI Retires GPT-4o and Older Models
OpenAI is retiring GPT-4o, GPT-4.1, GPT-4.1 mini, and OpenAI o4-mini from ChatGPT, citing the majority of users have shifted to GPT-5.2. This change allows the company to focus on improving the most used models. (openai.com)

2026-02-12

Apple reportedly delays Siri AI upgrades
Apple is reportedly considering delaying long-awaited upgrades to its Siri virtual assistant due to reliability issues with in-app voice commands and privacy constraints. (fool.com)
Google releases Gemini 3 Deep Think model
Google DeepMind announced the release of Gemini 3 Deep Think, a new AI model iteration designed to advance capabilities in science, research, and engineering. (deepmind.google)

2026-02-11

Mistral commits $1.4B to Swedish data center
Mistral announced a $1.43 billion investment to build a large-scale AI data center in Sweden, marking its first major infrastructure investment outside France to support European AI sovereignty. (aibusiness.com)
Z.ai releases GLM-5 language model
Z.ai announced the release of GLM-5, the latest iteration in their General Language Model series. This release signifies a major upgrade to the GLM family - with massive numbers: 744B params (40B active) based on 28,5T token in pre-training. (z.ai)

2026-02-10

Qwen releases Image 2.0 model
The Qwen team announced the release of Qwen Image 2.0, updating their image generation capabilities: chat.qwen.ai (qwen.ai)

2026-02-09

2026 Datacenter CPU Landscape Overview
SemiAnalysis publishes a comprehensive review of the 2026 datacenter CPU landscape, highlighting a resurgence in demand driven by AI reinforcement learning and agentic workflows. The report details architectural evolutions and competitive dynamics between Intel, AMD, NVIDIA, and hyperscalers, emphasizing the renewed critical role of CPUs in supporting AI infrastructure. (newsletter.semianalysis.com)
Overview of LLM persona stability challenges
This comprehensive analysis examines the difficulties in maintaining consistent LLM personas, covering historical jailbreaks, 'LLM psychosis,' the @grok 'MechaHitler' incident, and research on emergent misalignment and the 'Assistant Axis' from 2022 to 2026. (understandingai.org)
Berkeley Haas study finds AI intensifies work
A study of 200 employees at a U.S. technology company found that while AI tools increase productivity, they also create unsustainable work intensity and cognitive load, leading to exhaustion rather than reduced work hours. (simonwillison.net)
Tech firms adopt 996 culture for AI race
Intense competition in the AI sector is driving tech startups to embrace '996' work cultur - "from 9 to 9 for 6 days a week" - demanding up to 72-hour work weeks to accelerate product development and monetization. (bbc.com)

2026-02-07

China's massive power expansion aids AI race
New data indicates China is adding power capacity nearly six times faster than the US, positioning it to meet rising AI energy demands while US developers face grid constraints and connection delays. (techxplore.com)
Crypto.com founder buys AI.com for $70mn
The founder of Crypto.com acquired the domain AI.com for $70 million, marking the largest-ever website name transaction. The sale highlights the escalating value of prime digital real estate amidst the artificial intelligence boom. (ft.com)

2026-02-06

Waymo creates world model using Genie 3
Waymo introduced the Waymo World Model, powered by Google DeepMind's Genie 3, to generate hyper-realistic simulations of rare driving scenarios. The model creates synchronized 2D video and 3D lidar data, allowing the AI to train on dangerous or impossible conditions without real-world risks. (arstechnica.com)
Goldman Sachs adopts Claude for accounting, compliance
Goldman Sachs implements Anthropic's Claude AI model to automate accounting and compliance roles, marking a significant integration of generative AI into high-stakes financial operations. (reddit.com)
OpenAI launches Frontier enterprise AI platform
OpenAI released 'Frontier,' a platform designed to help enterprises build, deploy, and manage AI agents with shared context and integration into existing systems. The platform aims to reduce workflow silos and support OpenAI's goal of increasing enterprise revenue to 50% of total revenue. (aibusiness.com)
OpenAI releases GPT-5.3-Codex
OpenAI released GPT-5.3-Codex, its most capable agentic coding model to date, which combines advanced reasoning with frontier coding performance to handle complex software development and professional knowledge work. The model sets new benchmarks in coding and computer-use tasks and is the first OpenAI model classified as 'High capability' for cybersecurity. (openai.com)

2026-02-05

Interconnects reviews the release of Codex 5.3 & Opus 4.6
OpenAI and Anthropic released GPT-5.3-Codex and Claude Opus 4.6, marking a significant shift in the industry towards advanced coding agents and a 'post-benchmark era' where practical utility supersedes standardized evaluation scores. (interconnects.ai)
Waymo raises $16B at $126B valuation
Waymo secured a $16 billion funding round led by Alphabet, raising its valuation to $126 billion to fuel its expansion into new US and international markets. (aibusiness.com)
Nvidia, OpenAI $100bn deal evaporates
The anticipated $100bn deal between Nvidia and OpenAI appears to have collapsed, raising questions about circular funding in the AI sector and causing Nvidia's stock to drop by 10%. Nvidia's CEO stated the investment was never binding, while reports suggest OpenAI is seeking alternative chip suppliers due to dissatisfaction. (theguardian.com)
Anthropic airs Super Bowl ads mocking OpenAI
Anthropic launched a multi-million dollar Super Bowl advertising campaign to mock OpenAI's ad plans while pledging to keep the Claude AI family free of advertisements. The move escalates the public rivalry between the two AI firms and drew a sharp rebuttal from OpenAI CEO Sam Altman. (theregister.com)
Anthropic releases Claude Opus 4.6 model
Anthropic launched Claude Opus 4.6, featuring improved agentic coding and reasoning capabilities, a 1 million token context window, and state-of-the-art performance on benchmarks such as Terminal-Bench 2.0 and Humanity's Last Exam. (anthropic.com)
Claude Code adoption signals agentic AI shift
SemiAnalysis reports that Anthropic's Claude Code is now authoring 4% of GitHub public commits, marking a critical inflection point where AI agents begin to dominate software engineering and reshape the broader information work economy. (newsletter.semianalysis.com)

2026-02-04

Investors fear AI will replace vertical SaaS
Major software stocks lost over $730 billion in market value as investors grew concerned that advancements in generative AI, specifically from OpenAI and Anthropic, threaten to render vertical SaaS solutions obsolete. (go.theregister.com)
Anthropic commits to ad-free AI assistant
Anthropic announced that Claude will permanently remain ad-free, rejecting advertising-based revenue models to ensure the AI acts solely in users' interests and serves as a dedicated space for deep thinking. (anthropic.com)

2026-02-03

China's Open Source AI Ecosystem Review
Hugging Face publishes a comprehensive review of the maturation of China's open-source AI ecosystem from the 2025 'DeepSeek moment' through the implementation of the 'AI+' action plan, highlighting a strategic shift toward large-scale industrial integration. The analysis details how major tech firms and startups adopted open collaboration as a default strategy, creating a self-sustaining infrastructure that rivals proprietary models. (huggingface.co)
xAI joins SpaceX
SpaceX’s official update announces the acquisition of xAI to create a vertically integrated "innovation engine" that aims to overcome terrestrial power constraints by deploying a constellation of one million solar-powered orbital data centers. (spacex.com)

2026-02-02

OpenAI Signs $200M Snowflake Partnership
OpenAI and Snowflake announced a $200 million agreement to integrate OpenAI models natively into the Snowflake Cortex AI platform, aiming to expand OpenAI's enterprise reach and differentiate Snowflake's data cloud offerings. (aibusiness.com)
Adobe discontinues Animate to focus on AI
Adobe announced it will discontinue its 2D animation software, Adobe Animate, on March 1, 2026, as the company shifts its strategic focus toward artificial intelligence products. (techcrunch.com)
New York Times about Moltbook
Technologist Matt Schlicht launched Moltbook, a social network exclusively designed for AI bots, which attracted over 10,000 automated users in two days. The platform has become a focal point for debate in Silicon Valley regarding the current capabilities and potential risks of autonomous AI agents. (nytimes.com)

2026-02-01

Lex Fridman Podcast: State of AI in 2026 with Sebastian Raschka and Nathan Lambert
Lex Fridman releases a podcast episode discussing the comprehensive state of AI in 2026. The episode reviews key developments including LLMs, coding, scaling laws, China, agents, GPUs, and AGI. (youtube.com)

January 2026

2026-01-31

Anthropic leases $111m office in downtown San Francisco
Anthropic has secured a $111 million office lease in a downtown location, marking a significant expansion of its physical infrastructure footprint. (finance.yahoo.com)

2026-01-30

Former Google Engineer Convicted of AI Economic Espionage
A federal jury convicted former Google engineer Linwei Ding on seven counts of economic espionage and seven counts of theft of trade secrets for stealing AI technology to benefit the People's Republic of China. This case marks the first-ever conviction on AI-related economic espionage charges. (justice.gov)
Moltbook launches social network for AI agents
Moltbook launches as a social network enabling OpenClaw digital assistants to interact and share skills by periodically fetching and executing instructions from the web. The platform showcases the emerging culture of agent-to-agent communication while raising critical security concerns about prompt injection and autonomous code execution. (moltbook.com)
Apple acquires Israeli startup Q.AI
Apple acquired Israeli startup Q.AI in a deal potentially worth nearly $2 billion, marking the company's second-largest acquisition in history. The move is expected to strengthen Apple's AI-powered wearables portfolio through Q.AI's communication enhancement and 'invisible interface' technologies. (aibusiness.com)

2026-01-29

OpenClaw (aka Moltbot, aka Clawd) - by Peter Steinberger
OpenClaw launched as an open-source agent platform designed to run locally on user machines, integrating with existing chat apps like WhatsApp and Telegram. The release, a rebrand from Moltbot, introduces support for new models like KIMI K2.5, channels such as Twitch, and includes significant security hardening. (openclaw.ai)

2026-01-28

Moltbot (formerly Clawdbot): The Viral Agent Redefining Local Computing
Peter Steinberger, the creator behind the viral open-source project Moltbot (previously known as Clawdbot), joined TBPN to discuss the unprecedented rise of his AI agent. What began as a hobby project to automate personal tasks has exploded into a historic moment for open-source AI, achieving a vertical growth trajectory on GitHub that defies typical software adoption curves. (youtube.com)
Chrome adds Gemini 3 with Auto Browse features
Google Chrome integrated the Gemini 3 model, introducing a new side panel, Nano Banana image generation, and Connected Apps support. The release features 'auto browse,' an agentic tool that handles complex, multi-step workflows like form filling and travel planning for Pro and Ultra subscribers. (blog.google)

2026-01-27

Ai2 launches SERA open coding agents
The Allen Institute for AI (Ai2) released SERA, a family of open coding agents designed to help enterprise developer teams train smaller, open models on their specific codebases cost-effectively. The release emphasizes transparency and data sovereignty by providing training recipes and synthetic data generation methods alongside the 8B and 32B-parameter models. (aibusiness.com)
Moonshot AI releases Kimi K2.5 open weights model and Kimi Code
Moonshot AI released the open-source Kimi K2.5, a native multimodal model trained on 15 trillion tokens that challenges closed models in coding and video benchmarks. The company also launched Kimi Code, a coding agent that allows developers to use images and videos as input within terminals or IDEs. (techcrunch.com)

2026-01-26

Anthropic launches interactive Claude apps for workplace
Anthropic introduced a new feature allowing Claude users to launch interactive workplace apps like Slack, Canva, and Figma directly within the chatbot interface. Built on the Model Context Protocol, this integration aims to enhance enterprise productivity by combining Claude's intelligence with dedicated visual tools. (techcrunch.com)

2026-01-23

Report highlights AI displacement of translation jobs
A report highlights the significant negative impact of AI and machine translation on the translation industry, noting that over one-third of translators have lost work and nearly half have suffered income reductions due to generative AI adoption. (edition.cnn.com)
FastRender: Browser built by thousands of parallel agents
Cursor engineer Wilson Lin demonstrated FastRender, a from-scratch web browser built by swarms of autonomous agents, peaking at 2,000 concurrent agents and generating nearly 30,000 commits. The experiment showcases the potential of massive parallel agent coordination to manage complex engineering tasks with minimal human intervention. (simonwillison.net)
DeepMind seeks Chief AGI Economist
Google DeepMind opened a position for a Chief AGI Economist to lead research on post-AGI economics, scarcity, and power distribution. This hiring initiative underscores the organization's strategic preparation for the profound economic transformations expected with the arrival of Artificial General Intelligence. (job-boards.greenhouse.io)

2026-01-22

Google launches Gemini-powered SAT practice tests
Google announced that its Gemini AI can now generate free, interactive SAT practice tests with scoring and answer explanations, developed in partnership with The Princeton Review. This feature aims to provide accessible test preparation and potentially disrupt the multi-billion dollar test prep and tutoring industry. (arstechnica.com)
Axios CEO predicts AI upheaval in family letter
Axios CEO Jim VandeHei publishes a public letter to his family warning that AI will upend work and life more profoundly than electricity - and rather within months than years. He predicts imminent obsolescence for many knowledge work jobs and urges aggressive daily experimentation with large language models. (axios.com)
Yann LeCun launches AMI Labs to advance world models beyond LLMs
Yann LeCun announces AMI Labs, a Paris-based AI company focused on developing world models as an alternative to large language models (LLMs), emphasizing open-source AI and a 'third path' between US and Chinese tech dominance. (technologyreview.com)

2026-01-21

Jensen Huang outlines AI infrastructure at Davos
At the World Economic Forum, NVIDIA CEO Jensen Huang described AI as the 'largest infrastructure buildout in human history,' framing it as a 'five-layer cake' spanning energy to applications that will drive global job creation. He urged nations to treat AI as critical infrastructure and emphasized that the technology creates jobs by shifting workers from tasks to purpose. (blogs.nvidia.com)
OpenAI outlines monetization amid $1.4T commitments
OpenAI CFO Sarah Friar detailed plans to monetize services through ads, licensing, and outcome-based pricing to support massive infrastructure spending. The company targets practical adoption to offset projected commitments of $1.4 trillion over the next eight years. (aibusiness.com)

2026-01-20

OpenAI launches ads and expands ChatGPT Go globally
OpenAI announced the global rollout of its $8/month ChatGPT Go subscription and confirmed plans to introduce advertisements on free and Go tiers to generate revenue, reversing CEO Sam Altman's previous opposition to ads. (aibusiness.com)

2026-01-15

Black Forest Labs releases FLUX.2 [klein] models
Black Forest Labs released the FLUX.2 [klein] model family, designed for sub-second image generation and editing within a unified architecture optimized for consumer hardware. The release aims to advance interactive visual intelligence, offering real-time performance for AI agents and creators. (bfl.ai)
Google explains Nano Banana model naming origin
Google revealed the backstory behind the codename 'Nano Banana' for its Gemini 2.5 Flash Image model, attributing the moniker to a Product Manager's nicknames. The model subsequently became a top-rated image editing tool, leading to the adoption of the name for its 'Nano Banana Pro' successor. (blog.google)

2026-01-14

Cursor details multi-agent autonomous coding scaling
Cursor published research findings on successfully scaling autonomous coding to hundreds of concurrent agents, generating over a million lines of code across complex projects like a web browser. The study emphasizes the efficacy of a 'planners and workers' architecture to solve coordination bottlenecks in long-running agentic workflows. (cursor.com)

2026-01-12

Apple to Use Google Gemini for Siri Overhaul
Apple announced a strategic partnership with Google to use Gemini as the foundational platform for revamping Siri and Apple Intelligence. This agreement shifts Apple away from OpenAI and reportedly involves a $1 billion annual payment to Google. (aibusiness.com)

2026-01-09

Siemens, Nvidia Partner on Industrial AI Operating System
Siemens and Nvidia expanded their partnership to co-develop an industrial AI operating system aimed at enabling fully AI-driven adaptive manufacturing and digital twins. This initiative includes the launch of the Digital Twin Composer and nine new AI-powered industrial copilots to streamline operations. (aibusiness.com)

2026-01-07

Boston Dynamics Unveils Production-Ready Humanoid Robot Atlas
Boston Dynamics debuted the production-ready, fully electric version of its humanoid robot Atlas at CES, marking its official entry into the industrial robotics market. The launch was accompanied by a partnership with Google DeepMind to integrate Gemini Robotics AI models for enhanced perception and reasoning. (aibusiness.com)
xAI Raises $20B in Series E Funding
Elon Musk's xAI secured $20 billion in a Series E funding round backed by investors including Nvidia and Cisco to rapidly scale its compute infrastructure and GPU clusters. This significant investment aims to support the company's growth and the development of Grok models, despite ongoing controversies regarding the model's image generation capabilities. (aibusiness.com)

2026-01-06

Mercedes CLA to debut Nvidia Alpamayo driving tech
Nvidia announced that its new autonomous driving software, based on the open-source Alpamayo model family, will debut in the Mercedes CLA by late 2026. This collaboration marks the first production application of Nvidia's reasoning-based physical AI, designed to handle complex edge cases and improve vehicle safety. (aibusiness.com)

2026-01-05

Nvidia releases Rubin platform and open models
Nvidia launched the Rubin platform, consisting of six new AI chips (Vera CPU, Rubin GPU, NVLink 6 Switch, ConnectX-9 SuperNIC, BlueField-4 DPU, and Spectrum 6 Ethernet Switch) alongside new open models in the Nemotron and Cosmos families. This release highlights Nvidia's strategy of offering full-stack 'AI supercomputers' to maintain market leadership against growing competition. (aibusiness.com)
Boston Dynamics and DeepMind Form AI Partnership
Boston Dynamics and Google DeepMind announced a strategic partnership to integrate Gemini Robotics AI foundation models with the new Atlas humanoid robots. The collaboration aims to enable advanced industrial task capabilities, starting in the automotive sector. (bostondynamics.com)

2026-01-03

Financial Times analyzes AI boom via Soros theory
The Financial Times publishes an analysis applying George Soros's reflexivity theory to interpret the current market dynamics and investment cycles surrounding the artificial intelligence boom. (ft.com)
Satya Nadella: AI outlook for 2026
This article outlines strategic expectations for AI in 2026, emphasizing a shift from discovery to widespread diffusion and the importance of AI as scaffolding for human potential. It highlights the transition from isolated models to complex systems orchestration for real-world impact. (snscratchpad.com)

2025

December 2025

2025-12-31

17 Predictions for AI in 2026
Understanding AI published 17 forecasts for 2026, predicting continued rapid model improvement but modest immediate economic impact. Predictions cover big tech spending, model capabilities, regulation, and autonomous vehicles. (understandingai.org)
2025 LLM trends year-in-review
Simon Willison published a comprehensive retrospective summarizing key developments in the LLM space for 2025, highlighting trends such as the rise of reasoning models, agents, coding agents, and the dominance of Chinese open-weight models. (simonwillison.net)

2025-12-30

2025 LLM Year-in-Review: Reasoning and RLVR
Sebastian Raschka reviews the major developments in LLMs during 2025, identifying the year as dominated by reasoning models trained with Reinforcement Learning with Verifiable Rewards (RLVR) and the GRPO algorithm. The article analyzes trends in inference scaling, architecture efficiency, and the issue of 'benchmaxxing' while offering predictions for 2026. (sebastianraschka.com)

2025-12-29

Meta Acquires Manus AI for ~$4B
Meta Superintelligence Labs has snapped up the high-flying startup Manus AI in a deal valued at around $4 billion. It is a remarkable outcome for a company that raced to $100 million in annual recurring revenue in just nine months. (news.smol.ai)

2025-12-27

Comprehensive guide published for Claude Code 2.0
A detailed blog post explores the features and workflows of Claude Code 2.0, offering tips on sub-agents and context engineering to improve developer productivity. (sankalp.bearblog.dev)
Boris Cherny credits Claude Code + Opus 4.5 for 259 PRs in 30 days
Boris Cherny, creator of Claude Code, reported using Claude Code with Opus 4.5 to author 259 PRs over 30 days, highlighting the tool's productivity gains for coding workflows. (simonwillison.net)

2025-12-25

Nemotron 3 Nano Model Release
NVIDIA releases Nemotron 3 Nano, an open-source hybrid Mamba-Transformer language model optimized for agentic reasoning with 1M-token context support, claiming improved accuracy and 3.3x higher throughput vs comparable models like GPT-OSS-20B. (arxiv.org)
TokSuite Toolkit Release for Tokenizer Research
Researchers introduced TokSuite, a toolkit and benchmark enabling systematic study of how tokenizer selection impacts language model behavior, using 14 structurally identical models trained with varied tokenizers. The release includes a perturbation-focused benchmark to isolate tokenization effects in real-world scenarios. (arxiv.org)

2025-12-24

NVIDIA acquires AI chip startup Groq
NVIDIA announces a $20 billion deal to acquire intellectual property and talent from AI accelerator developer Groq, marking its largest-ever acquisition to strengthen its AI chip dominance. Groq's founders and executives will join NVIDIA while maintaining partial independence under new leadership. (cnbc.com)

2025-12-23

MiniMaxAI Releases VIBE Dataset
MiniMaxAI launches the VIBE benchmark dataset "Visual & Interactive Benchmark for Execution" for evaluating AI agents on full-stack web development tasks through a dataset of 200 structured coding challenges. (huggingface.co)
MiniMax releases M2.1 language model upgrade
MiniMax launches M2.1 with enhanced multi-language programming support for Rust, Java, and others, optimized for complex real-world tasks and office workflows while improving response efficiency. (minimax.io)

2025-12-22

SAM 3 image segmentation model introduced
Developers unveil SAM 3, enhancing image segmentation by incorporating conceptual understanding. This iteration builds on prior models for more intuitive visual parsing. (vizuara.substack.com)
Lovable AI coding startup hits $6.6B valuation in Series B
The AI startup Lovable, focused on coding tools, secured a Series B round that propelled its valuation to $6.6 billion. This milestone highlights surging investor enthusiasm for generative AI in software development. (aibusiness.com)
Z.AI new GLM-4.7 AI coding model release
Z.ai releases GLM-4.7, a coding-focused AI model showing benchmark improvements including 73.8% on SWE-bench (+5.8% over prior version) and better terminal/web UI generation capabilities.

2025-12-19

Anthropic releases Agent Skills as an open standard
Anthropic has open-sourced its Agent Skills framework through a minimalist specification published on GitHub, with initial adoption by several development tools but no official support yet from OpenAI. (github.com)
OpenAI releases GPT-5.2-Codex model
OpenAI launches GPT-5.2-Codex, an enhanced version of GPT-5.2 specialized for agentic coding tasks with improved context management, Windows support, and cybersecurity features—alongside an invite-only preview for security experts. (simonwillison.net)

2025-12-18

ChatGPT mobile app surpasses $3 billion in consumer spending
OpenAI’s mobile application has generated over $3 billion in global consumer revenue since its 2023 launch, with the majority of that income arriving this year. The app reached this impressive financial benchmark faster than major players like TikTok and Disney+. (techcrunch.com)

2025-12-17

Google releases Gemini 3 Flash AI model
Google unveils Gemini 3 Flash, a lightweight AI model optimized for high-speed performance, which beats previous full-scale Gemini 2.5 Pro model. (blog.google)

2025-12-16

OpenAI launches GPT Image 1.5
OpenAI unveiled GPT Image 1.5, a snappier image generator with sharper instruction adherence and editing finesse, now live for all ChatGPT users and via API. This upgrade steps up the creative game amid fierce rivalry with Google's tools. (techcrunch.com)

2025-12-15

Ilya Sutskever founds Safe SuperIntelligence (SSI) to pursue safe superintelligence
The article contrasts the diverging strategies of AI experts, specifically highlighting Ilya Sutskever's venture to build a safe superintelligence entity rather than a standard AI agent. (thealgorithmicbridge.com)
Merriam-Webster selects 'slop' as 2025 Word of the Year
Merriam-Webster chose 'slop,' denoting low-quality AI-generated digital content, as its 2025 Word of the Year amid growing online frustration with such material. The term captures a mocking tone toward AI's flood of mediocre output across media and beyond. (techcrunch.com)

2025-12-14

Interconnects reviews 2025 AI Model Landscape
2025 has been a busy year for model releases - Nathan Lambert of Interconnect reviews and groups the models and their parent companies. (interconnects.ai)

2025-12-12

OpenAI Partners with Disney for $1B to Expand Sora Content Library
OpenAI has secured a $1 billion partnership with Disney, allowing its Sora video generation tool to access a library of more than 200 iconic characters. (aibusiness.com)
OpenAI introduces skills support in ChatGPT and Codex CLI
OpenAI has quietly added skills support to ChatGPT and the Codex CLI tool, allowing users to leverage modular skills for tasks like PDF creation and plugin development. (simonwillison.net)
OpenAI releases ChatGPT 5.2 with enhanced reasoning capabilities
OpenAI has launched ChatGPT 5.2, focusing on improved reasoning and thinking abilities. The update aims to enhance the model's cognitive performance across various tasks. (techradar.com)

2025-12-11

TIME Magazine Names 'Architects of AI' as Person of the Year
TIME Magazine recognized a group of AI leaders, including Sam Altman, Elon Musk, and Jensen Huang, as its Person of the Year for their significant influence on the global AI landscape. (techcrunch.com)

2025-12-10

NVIDIA GPUs Dominate Supercomputing Benchmarks at SC25
NVIDIA announced at SC25 that over 85% of the TOP100 supercomputers now use GPUs, marking a historic shift from CPUs to accelerated computing. Their GPUs also topped the Green500 and Graph500 benchmarks, showcasing superior energy efficiency and performance. (blogs.nvidia.com)

2025-12-09

Anthropic and Accenture form strategic AI partnership
Anthropic and Accenture have agreed to a three-year partnership that will see Accenture's 30,000 employees trained on Claude and the creation of a joint business group to help enterprises adopt AI. (techcrunch.com)
Mistral releases Devstral 2 and Devstral Small 2 models
Mistral has quietly released two new models: Devstral 2 (123B) and Devstral Small 2 (24B), both optimized for coding agents. The larger model hits 72.2% on SWE-bench Verified and is up to 7× more cost-efficient than Claude Sonnet, though it comes with a modified MIT license that bars use by companies earning over $20 million monthly. (simonwillison.net)

2025-12-04

Hugging Face Launches LLM Fine-Tuning Skill for Claude
Hugging Face introduced its hf-llm-trainer skill enabling Claude AI agents to autonomously fine-tune open-source language models, handling job submissions to cloud GPUs and Hub deployment. (huggingface.co)

2025-12-02

Mistral AI releases Mistral 3 model family
Mistral AI unveiled its next-generation Mistral 3 lineup, including compact dense models in 3B to 14B sizes and the flagship Mistral Large 3 sparse mixture-of-experts powerhouse with 675B total parameters. All variants are open-sourced under Apache 2.0, boasting top-tier multimodal and multilingual prowess for developers and edge devices alike. (mistral.ai)

2025-12-01

Internals of DeepSeek Models V3 to V3.2 reviewed by Sebastian Raschka.
DeepSeek launched its upgraded V3.2 open-weight model, matching top proprietary rivals like GPT-5 in benchmarks. This evolution introduces sparse attention and refined reinforcement learning for enhanced reasoning. (sebastianraschka.com)
DeepSeek releases V3.2 flagship model and reasoning variant
DeepSeek launched its new flagship AI model, V3.2, with an experimental reasoning-enhanced version called V3.2-Speciale. These massive 685-billion-parameter models are openly available under MIT license, excelling in math like gold-medal IMO performance. (simonwillison.net)
Hugging Face launches Transformers v5 library
Hugging Face rolled out Transformers v5 after five years, emphasizing cleaner code, modular designs, and support for over 400 architectures amid daily installs topping 3 million. This update aims to streamline contributions and bolster the AI ecosystem's backbone. (huggingface.co)

November 2025

2025-11-28

Google commercializes TPUv7 for external AI customers
Google ramps up sales of its advanced TPUv7 chips to outside firms like Anthropic, which placed a massive 1GW+ order, positioning the hardware as a serious rival to Nvidia's dominance. This shift highlights a growing ecosystem challenging the established GPU leader with cost savings and strong performance. (newsletter.semianalysis.com)
Moonshot AI introduces Kimi Agentic Slides
Agentic Slides are powered by Nano Banana Pro and enable Agentic Search (Kimi K2), file upload, fully editable slides canvas and export in Powerpoint format. (x.com)

2025-11-27

The Thinking Game - DeepMind Documentary
A five-year documentary follows Demis Hassabis and the DeepMind team as they pursue the nature of intelligence, moving from game-playing AI to solving protein folding.
In 2020 DeepMind already released the AlphaGo Documentary with the legendary Move 37 when competing in Go 2016 against Lee Sedol. (youtube.com)
DeepSeek releases open-source Math-V2 model
DeepSeek unveiled a hefty 685-billion-parameter open-weights model tuned for top-tier math reasoning. It matches proprietary rivals by earning gold-medal equivalents on elite competitions like IMO and Putnam. (huggingface.co)

2025-11-26

Prime Intellect releases INTELLECT-3 106B MoE model
Prime Intellect unveiled INTELLECT-3, a 106 billion parameter Mixture-of-Experts model excelling on math, code, and reasoning benchmarks via large-scale reinforcement learning. They open-sourced the full training stack, from weights to environments, empowering broader access to cutting-edge AI development. (primeintellect.ai)
AI-generated band Breaking Rust tops Billboard Country Digital Song Sales chart
An AI-created country band named Breaking Rust has hit number one on Billboard's Country Digital Song Sales chart, showing audiences eagerly consuming machine-made music. This underscores the rising appeal of AI-generated entertainment amid ongoing hype. (technologyreview.com)

2025-11-25

Black Forest Labs releases FLUX.2 image models optimized for NVIDIA RTX GPUs
Black Forest Labs unveiled the FLUX.2 series of cutting-edge image generators, boasting photorealistic details, pose control, and multi-reference consistency. NVIDIA teamed up for RTX GPU tweaks via FP8 quantization, trimming VRAM use by 40% to make high-fidelity AI art more accessible on consumer hardware. (blogs.nvidia.com)
Google launches Ironwood seventh-gen TPU
Google unveiled Ironwood, its newest Tensor Processing Unit, as the seventh generation and most potent, efficient chip design so far. It's now ready for use in Google Cloud services. (blog.google)
Tom Gally releases SVG generation benchmark for frontier LLMs
Tom Gally crafted a fun benchmark with 30 whimsical prompts, challenging nine top LLMs to produce SVG artwork like a sloth at the controls of an excavator. The results offer a charming glimpse into which models shine at creative vector graphics. (simonwillison.net)

2025-11-24

Anthropic releases Claude Opus 4.5
New Claude Opus 4.5 by Anthropic - billing it as the top performer for coding, agents, and computer tasks amid rivalry from recent GPT and Gemini updates. Benchmarks edge ahead, yet real-world gains over prior models like Sonnet feel subtle. (simonwillison.net)

2025-11-23

Interconnects.ai lists top US labs releasing open AI models
A helpful roundup spotlights key US players like Ai2 with Olmo, Nvidia's Nemotron, and others dropping quality open models, amid rising buzz in open AI scenes on both sides of the Pacific. It's a timely snapshot as truly open releases pick up steam. (interconnects.ai)

2025-11-22

Ai2 releases fully open Olmo 3 LLM series
The Allen Institute for AI unveiled the Olmo 3 family, spotlighting the 32B Olmo 3-Think model as the top fully open option in its class, complete with training data, checkpoints, and tools to peek inside reasoning steps. They also shared smaller 7B variants trained efficiently on a refined 6-trillion-token dataset. (simonwillison.net)

2025-11-20

Google DeepMind launches Nano Banana Pro image model
Google DeepMind unveiled Nano Banana Pro, a cutting-edge image generation and editing tool powered by Gemini 3 Pro. It brings enhanced control, sharper text handling, and richer world understanding to turn ideas into polished visuals. (blog.google)
AI Engineering Code Summit discusses shift to agent swarms in coding
Industry leaders at the NYC summit outlined a pivot from single large-context AI agents to multi-agent 'ant swarms' for reliable coding, with Steve Yegge forecasting the demise of traditional IDEs by 2026. This emerging approach emphasizes context engineering and verification to sidestep model limitations like the 'Dumb Zone.' (turingpost.com)
Allen AI releases OLMo 3 open reasoning models
Allen AI unveiled OLMo 3, a fresh lineup of fully open 7B and 32B language models topping charts in base performance and reasoning tasks. These releases include all training details, paving the way for community innovation without the usual black-box mysteries. (allenai.org)
Hugging Face launches AnyLanguageModel for Apple LLM integration
Hugging Face unveiled AnyLanguageModel, a Swift package that unifies APIs for local open-source models and cloud providers on Apple devices. It acts as a seamless swap for Apple's Foundation Models, easing developer experimentation with LLMs. (huggingface.co)

2025-11-19

MIT Technology Review Insights report on AI adoption in manufacturing
Manufacturers are ramping up AI use alongside digital twins to streamline factories and cut downtime, with half now running AI in production versus just over a third last year. Big firms lead the pack at 77% adoption. (technologyreview.com)
OpenAI releases GPT-5.1-Codex-Max for Codex CLI
OpenAI debuted GPT-5.1-Codex-Max as the default model in its Codex CLI coding agent, boasting top benchmark scores and a novel compaction technique for managing extended coding tasks over vast token contexts. This specialist model shines in agentic workflows, edging out rivals like Gemini 3 Pro. (simonwillison.net)
Hollywood backlash against signing AI actress Tilly Norwood
Talent agencies explored representing AI-generated actress Tilly Norwood, prompting sharp criticism from actors like Emily Blunt and Natasha Lyonne who decried it as frightening and misguided. This flare-up underscores broader Hollywood anxieties about AI displacing human performers. (understandingai.org)

2025-11-18

Microsoft, NVIDIA, and Anthropic Launch Major AI Infrastructure Partnerships
Microsoft, NVIDIA, and Anthropic are teaming up to supercharge Anthropic's Claude models on Azure using NVIDIA hardware, with commitments for vast compute resources and billions in joint investments. This move expands enterprise access to advanced AI while optimizing performance across cutting-edge systems. (blogs.nvidia.com)
Google releases Gemini 3 AI model
Google unveiled Gemini 3, its most capable AI model to date, poised to empower users in turning concepts into reality with enhanced intelligence. Leaders like Sundar Pichai and Demis Hassabis highlighted this step forward in AI capabilities. (blog.google)
Google launches Search powered by Gemini 3
Google rolled out its Gemini 3 AI model to enhance Search, delivering sharper reasoning and fresh interactive visuals tailored to user queries. This update promises a more intuitive experience with dynamic tools and simulations. (blog.google)
Google releases Antigravity IDE
Google unveiled Antigravity, a desktop app blending a VS Code-style editor, agent dashboard, and browser tools to streamline app-building with Gemini models. It generates handy Markdown artifacts to track tasks and progress along the way. (simonwillison.net)

2025-11-17

Google Search adds AI tools for travel planning and booking
Google rolled out handy AI features in Search to help folks craft custom travel itineraries, snag the best deals, and book trips effortlessly. It's a timely boost just as holiday season kicks off. (blog.google)

2025-11-13

Chinese tech firms push domestic AI chips to offset Nvidia restrictions
Major Chinese companies are accelerating homegrown AI chip development to rival Nvidia's offerings, spurred by U.S. export curbs blocking access to top-tier GPUs. Though progress is underway, full parity remains elusive. (spectrum.ieee.org)
Hugging Face deepens partnership with Google Cloud for open models
Hugging Face and Google Cloud announced a strategic collaboration to streamline access to open AI models for cloud users. Key features include a new CDN gateway for rapid model downloads and smoother deployments across Google services. (huggingface.co)

2025-11-12

German court finds OpenAI liable for ChatGPT copyright breach
A German court decided OpenAI broke national copyright rules by training ChatGPT on protected music without approval, directing the firm to compensate rights manager GEMA. This sets an early European benchmark for AI respecting creators' rights. (techcrunch.com)
Waymo enables freeway service for robotaxis
Waymo now lets its autonomous vehicles use freeways for paying customers in Phoenix, Los Angeles, and the San Francisco Bay Area, a smart upgrade that speeds up trips across vast urban landscapes. This positions the service as a credible challenger to rideshare leaders. (understandingai.org)
Moonshot AI releases Kimi K2 Thinking model
Moonshot AI introduced Kimi K2 Thinking, an enhanced open-source reasoning agent that handles step-by-step problem-solving and chains hundreds of tool calls, edging out top rivals on benchmarks. This builds on their push for agentic intelligence with strong long-context retention. (turingpost.com)

2025-11-11

Google launches Private AI Compute
Google unveils Private AI Compute, a clever setup that lets users harness Gemini models' cloud prowess for smart AI help while ensuring their data stays strictly private. It's a thoughtful stride toward blending powerful AI with robust privacy safeguards. (blog.google)

2025-11-10

Kimi-Linear attention architecture unveiled
Kimi-Linear emerges as a hybrid design blending linear recurrence with selective full attention for efficient long-context modeling. It promises to overcome quadratic scaling limits while preserving expressive power through innovations like Kimi Delta Attention. (vizuara.substack.com)

2025-11-06

The “Sutskever Memo” Saga
According to new testimony, OpenAI cofounder Ilya Sutskever compiled a 52-page memo accusing CEO Sam Altman of repeatedly lying and pitting executives against one another, which eroded the board’s trust and led to his ouster. This saga offers a rare glimpse into the turmoil at OpenAI, one of the world’s most influential AI companies, and highlights how internal chaos and broken trust can spark a dramatic leadership shake-up. (theverge.com)

2025-11-04

Sebastian Raschka overviews non-transformer LLM architectures
Sebastian Raschka released a detailed blog post surveying emerging alternatives to dominant autoregressive transformer LLMs. It highlights linear attention hybrids, text diffusion, code world models, and recursive transformers for better efficiency and performance. (sebastianraschka.com)

2025-11-01

Andrej Karpathy releases nanochat tiny language model
Andrej Karpathy unveiled nanochat, a compact language model that fits on everyday hardware and illustrates full AI training cycles for educational purposes. This open project rapidly attracted over 35,000 GitHub stars, acting as a practical lab for hands-on experimentation. (turingpost.com)

October 2025

2025-10-27

Hugging Face launches huggingface_hub v1.0
After five years of evolution, Hugging Face unveiled version 1.0 of its core Python library, huggingface_hub, boosting performance with modern HTTP tools and a revamped CLI for seamless access to millions of models, datasets, and Spaces. This mature release supports the next era of open machine learning for a vast community. (huggingface.co)
Big tech AI capex surges to historic boom levels
Major tech firms like Amazon, Meta, Microsoft, Alphabet, and Oracle poured $241 billion into capex last year, equaling 0.82% of US GDP and on track to surpass peaks from iconic projects like Apollo. This spending, largely AI-driven, marks one of the biggest investment surges since World War II, with plans for even more ahead. (understandingai.org)

2025-10-22

AI wildfire metaphor: the ecosystem is burning, not bubbling
The article reframes the AI market as a wildfire, not a bubble, arguing that the coming shakeout will clear the underbrush and leave the strongest companies standing, much like a forest fire clears space for new growth. (ceodinner.substack.com)