OpenAI and Broadcom unveil AI inference chip for large-scale data centres
OpenAI and Broadcom have announced Jalapeno, a custom AI inference accelerator designed for large language model (LLM) workloads, marking OpenAI's first... telecompaper.com
OpenAI and Broadcom unveiled a new AI inference chip targeting large-scale data centres, while Google's Gemini 3.5 Flash advanced the AI agent frontier by enabling human-like computer interaction; separately, a Cambridge University report issued stark warnings that AI could be weaponised by criminals and rogue states, highlighting growing safety concerns. The week also saw major AI investment momentum, with healthcare-focused Trase securing $107M and AI continuing to drive a spree of venture megadeals.
The agent revolution is accelerating on multiple fronts, and we're now seeing the infrastructure race catch up with the capability race. That matters because it suggests the bottleneck is shifting from "can we build this?" to "can we deploy this at scale?"
Start with Google's move. Gemini 3.5 Flash now has native computer use, meaning it can see screens, click buttons, and navigate interfaces the way a human would. This isn't theoretical anymore. What strikes me is the practical implication: agents can now operate across legacy systems without requiring API integrations or custom bridges. That's a massive force multiplier for enterprise adoption, especially in industries drowning in poorly connected software. Google's decision to bake this directly into the model rather than bolt it on afterward also suggests it learned from earlier missteps.
But capability means nothing without infrastructure. OpenAI and Broadcom's announcement of Jalapeno, a custom inference chip built for LLM workloads at scale, addresses what is now the real constraint: cost per inference at massive throughput. The economics of running hundreds of thousands of simultaneous agent tasks only work if hardware efficiency goes far beyond what off-the-shelf chips allow. I find it telling that OpenAI is vertically integrating here. It suggests confidence that inference margins, not just model capability, will define competitive advantage.
Trase's $107 million funding round lands in this context too. A company building an operating system for agents in healthcare and high-stakes industries wouldn't attract that kind of capital unless investors believed the unit economics were real. The shift from "chatbot interaction" to "delegated long-horizon task" is profound—it changes what we're asking AI to do and how we measure whether it's working.
Then there's the safety question. Cambridge's warning about AI falling into criminal and state hands isn't new in substance, but the urgency feels different now. Once agents are operating autonomously across business systems, the surface area for misuse expands dramatically. A compromised agent running procurement workflows or medical diagnostics can do far more damage than a compromised chatbot. I'm not convinced the safeguards are keeping pace, and the Cambridge report suggests they aren't.
Microsoft's move to ship MAI-Code-1-Flash more broadly across Copilot tiers shows commoditization accelerating too. Better models are becoming a baseline feature, not a differentiator. The race is on to embed agents into workflows, and whoever connects them most seamlessly to actual work will win.
We're at an inflection point where the architecture for agentic AI is crystallizing: better chips, better models with native computer use, and operating systems to orchestrate them. The question now is how quickly safety, governance, and business models catch up.
When a standard large language model (LLM) is confronted with a problem, it tries to solve it by matching it to similar information it has seen before,... techxplore.com
You know what's cheaper than large language models? Small language models, which are designed for specialized tasks and can reduce latency. adexchanger.com
Looped language model training cannot control hidden-state norm growth because RMSNorm normalizes scale away before the loss sees it. A paper posted today... techtimes.com
YTL AI Labs launched its proprietary large language model (LLM), ILMUchat, marking a milestone in Malaysia's sovereign artificial intelligence (AI)... theedgemalaysia.com
The company will use the funds to expand its workforce and enhance its AI agent operating system. mobihealthnews.com
Google has officially changed the AI agent race. In a major upgrade launched today, native “computer use” capabilities are now live within the flagship... nokiapoweruser.com
Agentic AI changes the unit of knowledge work from single interactions to delegated, long-horizon tasks. Chatbot interactions are often short and... openai.com
You have just deployed a local LLM. Nice. But after the first few chats, you might be wondering: what else can I do with it? towardsdatascience.com
Up to 90% of an AI agent's runtime is CPU-side tool calling, not GPU inference; Akamai is pitching its 4,000-plus edge sites and AI Grid Orchestrator as the... fierce-network.com
When AI agents are applied in public institutions, the 'authorization' gap becomes a democratic accountability gap, writes Kida Chung-Ta Huang. techpolicy.press
University of Cambridge report warns AI advancing faster than safety measures, opening doors to criminal and state misuse. turkiyetoday.com
Landmark University of Cambridge report says rapidly advancing frontier AI models are outpacing safeguards, raising risks of cyberattacks and disinformation... aa.com.tr
VladTV published an interview with AI safety researcher Dr. Roman Yampolskiy in which VladTV reports he stated that artificial intelligence has a "99.9%... letsdatascience.com
The Computer Weekly Security Think Tank considers if Anthropic's Claude Mythos frontier AI model is a benefit or barrier to achieving resilient enterprise... computerweekly.com
©Photo by Leon Neal/Getty Images. BLETCHLEY, ENGLAND - NOVEMBER 1: A general view during the first plenary session on Day 1 of the AI Safety Summit at... msn.com
MAI-Code-1-Flash, Microsoft AI's in-house coding model, is now generally available for GitHub Copilot Business and Copilot Enterprise, building on its... github.blog
Microsoft on Thursday announced new Copilot features aimed at boosting the artificial intelligence tool's ability to handle a range of financial tasks in... cfodive.com
AI assistants are repeating a common Git mistake: committing fixes that remove secrets only from the latest code, not from repository history. blog.gitguardian.com
Copilot in Excel now supports reusable Skills, six new financial data connectors, and full change tracking for professional finance teams. techmymoney.com
The service converts PDFs or CAD files into coordinated 3D models, clash-reviewed documents and construction-ready drawing sets. engineering.com
With Microsoft 365 Copilot, KARL STORZ scaled governed AI, reaching 97% active usage across licensed users. microsoft.com
Ferroelectric memory enables simultaneous probabilistic sampling and deterministic computation in generative AI hardware / Findings published in the... eurekalert.org
Renoise and Contra have launched a co-branded short-film challenge, open June 22-30, 2026, with a $10,500 total prize pool. Contra, with over one million... theglobeandmail.com
Neural4D adds a built-in Community feed and AI 3D Agent to Studio, letting users discover, remix, and generate 3D models, images, and videos in one place. tennessean.com
Photonic quantum processing unit for quantum and classical machine learning tasks. A collaborative research group consisting of quantum information... quantumcomputingreport.com
Want to keep track of the largest startup funding deals in 2026 with our curated list of $100 million-plus venture deals to U.S.-based companies? news.crunchbase.com
General Intuition, an AI lab that uses gaming content for training, raised $320 million in Series A funding at a $2.3 billion post-money valuation and... axios.com