OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support
OpenCV 5.0 released today as a major update to this widely-used, open-source computer vision (CV) library. phoronix.com
23 articles
Alibaba's Qwen3.7-Plus pushes multimodal AI toward full autonomy, while Anthropic's call for an AI pause raises eyebrows as the company approaches a major IPO; meanwhile, World Labs secured a significant $1B Series B, signaling continued aggressive investment in the AI sector. Microsoft's newly released AI models face mixed reviews, with analysts questioning whether the tech giant is losing its competitive edge in the rapidly evolving AI landscape.
The gap between what AI companies claim and what actually works is widening fast, and I think we're starting to see the consequences in real time.
Take Microsoft's situation. They've released new MAI models at Build 2026 with considerable fanfare, but early testing suggests they're not living up to the hype. GitHub's troubles compound the problem. You can have the best AI research in the world, but if your products don't deliver tangible value to developers and enterprises, it doesn't matter much. Microsoft has the resources to iterate, of course, but there's a pattern here worth noting: the company that once dominated developer tooling is now scrambling to prove its AI strategy is more than marketing momentum.
By the way, contrast that with what Alibaba is attempting. Qwen3.7-Plus represents a genuinely interesting direction—taking multimodal AI and actually turning it into an agent capable of visual perception, GUI operation, and coding within a single loop. That's not flashy, but it's practical. The question isn't whether it's "revolutionary" but whether it actually solves problems for people building applications. That's where my skepticism kicks in: the real test isn't the benchmark, it's whether businesses adopt it at scale.
What's catching my attention, though, is the tension surfacing between capability claims and risk management. Anthropic is being unusually candid as it approaches a reported trillion-dollar IPO—warning that AI systems could soon achieve recursive self-improvement and that we risk losing control. I respect the honesty, but there's an uncomfortable contradiction embedded here. You're raising capital at unprecedented valuations while simultaneously raising alarms about existential risks. Markets and caution usually don't coexist well. Either the risk is real and priced in, or it's being downplayed for investor appetite. I'm not sure which concerns me more.
The robotics and video generation spaces tell a different story. Chinese humanoid robots are proliferating but remain largely performative—the demand and scale needed for real mass production just isn't there yet. Video generation, meanwhile, is maturing in the opposite direction. Kling, Gemini, and others have moved beyond unpredictable outputs; director-level control is becoming standard. That's meaningful progress.
What strikes me is that the winners in AI aren't always the ones with the biggest announcements. Google's work on research assistants—tools that help scientists generate hypotheses and analyze data—feels understated but genuinely useful. World Labs raising $1B on a spatial-intelligence platform suggests investors still believe in infrastructure plays, even if consumer-facing AI products are hitting friction.
The real question going forward: which of these narratives actually matter in two years? That's what I'll be watching.
OpenCV 5.0 released today as a major update to this widely-used, open-source computer vision (CV) library. phoronix.com
Princeton University's Language and Intelligence Lab (PLI) published a paper introducing **Goedel-Architect**, an agent framework for formal theorem proving... letsdatascience.com
PANO - Viettel AI has developed VT-Super-120B-A12B, a 120-billion-parameter Vietnamese large language model. The initiative aims to build artificial... en.qdnd.vn
Our system did one thing, and it did it well: It turned natural-language questions into API calls. The users were analysts, account managers, and operations... venturebeat.com
Alibaba's Qwen team has released Qwen3.7-Plus, a multimodal agent model that combines visual perception, GUI operation, and coding in a single agent loop. the-decoder.com
Alibaba's Qwen3.7-Plus targets screen, coding, and cloud-console automation as computer-use AI rivals push beyond browsers into app and terminal tasks. winbuzzer.com
You might already use AI to suggest the most comfortable sneakers under $150, or compare the specs on a new vacuum cleaner before you buy. cbc.ca
CrewAI outlines strategies to combat rising AI agent costs by optimizing token spend through orchestration and infrastructure controls. startuphub.ai
Microsoft's MAI-Thinking-1, MAI-Code-1-Flash, MAI-Image-2.5, Scout AI agent. Nemotron 3 Ultra & 3.5 ASR, RTX Spark, Minimax M3, Gemma 4 12B, Reve 2,... patmcguinness.substack.com
What's the threat?: Anthropic says AI could soon achieve recursive self-improvement, raising risks of humans losing control over powerful systems. msn.com
As the race to build ever more powerful artificial intelligence systems accelerates, one of the industry's leading players is urging the world to consider a... tradingview.com
Anthropic warns advanced AI may outpace human oversight even as the company accelerates growth and competition. americanbazaaronline.com
Microsoft's AI products aren't selling, and Github's been plagued with troubles. WIRED spoke with VP Scott Hanselman about whether the company is in... wired.com
Microsoft says its new MAI models revealed at Build 2026 are the future. After testing them, I'm not convinced they're ready for that spotlight. uk.pcmag.com
GitHub Copilot's support for custom endpoints gives enterprises more control over model choice, billing and security. It also opens a new distribution. startupfortune.com
Video generation is no longer a matter of luck at last. eu.36kr.com
Becoming a film director used to require significant resources. For many aspiring directors, the gap between imagination and execution was simply too wide. tynmagazine.com
Having trouble getting the right image out of ChatGPT, Gemini, or another AI tool? Follow this prompt for higher-impact results. zdnet.com
Nano Banana Pro (Gemini 3) delivered the strongest overall performance in our testing, combining excellent image quality and text rendering. memeburn.com
Without the demand and without that scale from the market, these companies are not able to really go into mass production.” fortune.com
Google's Co-Scientist and Futurehouse's Robin can help scientists generate hypotheses, design experiments and analyse data. chemistryworld.com
World Labs raises $1B Series B led by Autodesk at a $5B valuation to scale its spatial-intelligence world model platform Marble. thesaasnews.com
Apple is preparing a major Siri overhaul that will use Nvidia (NasdaqGS:NVDA) Blackwell B200 chips, with workloads expected to run on Google Cloud. finance.yahoo.com