Understanding LLM Distillation Techniques
Learn how understanding LLM distillation techniques improves model training through innovative teacher-student approaches.
30 articles
Chinese AI firms are accelerating their pivot away from Nvidia by adopting Huawei chips, signaling a major shift in AI hardware infrastructure amid ongoing export restrictions, while Microsoft has integrated the new GPT-5.5 Instant model into Microsoft 365 Copilot, expanding enterprise AI capabilities. Additionally, Anthropic announced that its Claude tool can now surface hidden reasoning during AI safety evaluations, marking a notable advance in AI transparency and interpretability.
Learn how understanding LLM distillation techniques improves model training through innovative teacher-student approaches.
A new method of applying quantum computing to large language models has been achieved by Borja Aizpurua of University of Navarra, and colleagues from...
In the past two years, if you've been following the research on the interpretability of large models, you'll notice a phenomenon: in this field,...
Agents are powerful because they do more than answer questions. They call tools, retrieve context, and act across multiple steps.
As organizations scale their use of AI agents, IT teams face a familiar challenge: how do you expand automation without losing control?
The coffee might be poured by a human hand, but behind the counter something far less traditional is calling the shots at an experimental cafe in Stockholm.
Nearly half of in-house legal professionals say they would not detect an unauthorized or incorrect action taken by an AI agent until after it had already...
The new AI-powered tool is designed to help "non-technical users" craft prediction market trading strategies.
According to a recent LinkedIn post from Dataiku, the company is promoting its Dataiku E2A (Expert-to-Agent) framework as a way to convert internal business...
CIOs face rising risk as agentic AI moves into production faster than most data platforms can govern, retrieve and act on reliably.
The cybersecurity firm released a new protocol for AI agent identity, a real-time zero-day tracker, and secured a key partnership with Anthropic.
AI safety and AI interpretability research advances as Anthropic says Claude Natural Language Autoencoders exposed hidden test awareness in model...
Charity Clark will be one of two state Attorneys General leading efforts to examine issues related to internet security and AI, while Sen.
Policies to ensure public benefits from the adoption of artificial intelligence bear resemblance to policies designed to protect communities from climate...
In recent years, protein language models (pLMs) have revolutionized the field of protein engineering, opening new horizons that were previously unattainable...
Looking back over the past period, even as technological competition between China and the U.S. has intensified, the two sides have also made some...
Meta announced a major update to its open-source Immersive Web SDK (IWSDK) framework, which lets developers build VR experiences on the web using WebXR—now...
New model integration aims to improve response speed and AI-assisted productivity across Microsoft 365 applications.
MoonPay's new Dawn CLI AI trading product lets users turn plain-English prompts into automated crypto trading strategies.
New feature rollout includes support for tapping -- or building -- reusable skills that Cowork can invoke to complete a task or workflow on a user's behalf.
See what's new in Copilot Studio, April 2026: updates to workflows, more control over agent operations, and an expanded agent usage estimator.
Today, Google's native video model, Gemini Omni, was unexpectedly revealed. Stunning demos went viral, like showing a professor deriving math formulas on...
Google's upcoming Gemini Omni video model briefly surfaced, revealing new editing features ahead of Google I/O 2026.
Elastic (NYSE: ESTC), the Search AI Company, today announced jina-embeddings-v5-omni, a new family of multimodal embedding models with the ability to...
An Appfigures report shows AI image and video tools are driving massive app download growth, while concerns over deepfakes, misinformation, and unreliable...
We developed a Geneformer model with an expanded pretraining dataset of more than 100 million single-cell human transcriptomes. The increased data diversity...
Apple has published four recordings and a research recap from its 2026 Workshop on Privacy-Preserving Machine Learning & AI.
Before this week's U.S.-Chinese summit, Beijing reached a milestone in its quest for technological self-sufficiency.
Beijing and Washington are locked in an era-defining contest – and China's rapid technological progress raises questions about limits of containment.
The Philadelphia Stock Exchange Semiconductor Index has leaped 60% in six weeks, and Micron—a memory chipmaker—surged 38% last week alone, its best week...