About582 PostsMachine Learning ResearchAI Mammogram Diagnosis Under Real-World Conditions: Two studies test Google's breast cancer detection models in clinicsIntroduced in 2020, Google’s AI system for detecting breast cancer in mammograms still hasn't been used to diagnose current patients.Machine Learning ResearchOpenAI Challenges Speech-to-Speech Leaders: RealTime API updates audio models that reason, transcribe, and translateAn update of OpenAI’s speech-to-speech model lets developers tune the tradeoff between speed and reasoning.Machine Learning ResearchU.S. to Evaluate Upcoming Models: U.S. Government Will Test AI Models for National Security Risks, Other Hazards Prior to ReleaseThe U.S. government said it will evaluate cutting-edge models before they’re available to the public, a sharp reversal of the White House’s earlier hands-off policy.Machine Learning ResearchRobots That Adapt to New Tasks: Sony and university researchers train robots on new tasks without catastrophic forgettingNeural networks can forget how to perform earlier tasks as they learn new ones. Machine Learning ResearchHow Nvidia Uses AI to Design Chips: Chipmaker's models design circuits, verify designs, and test new layoutsNvidia’s chief scientist dreams of telling an AI model to design a new GPU, then skiing for a couple days while the system does the job.Machine Learning ResearchByteDance Bids for Video Leadership: ByteDance adds state-of-the-art Seedance 2.0 video to Capcut, while OpenAI retreatsAs OpenAI prepares to shut down Sora, ByteDance made its own video generation model available to hundreds of millions of users.Machine Learning ResearchStrategic Thinking in LLMs vs. Humans: Researchers at UT-Austin and Google model human decision-making in Rock-Paper-ScissorsWhile large language models can behave in human-like ways, the similarities are superficial. A simple strategy game revealed clear differences in their strategic approaches.Machine Learning ResearchKimi K2.6 Challenges Open-Weights Champs: Kimi K2.6 matches open Qwen3.6 Max andDeepSeek V4, falls just behind top closed models.Moonshot AI’s updated Kimi model handles longer autonomous coding sessions and scales up its multi-agent orchestration relative to its predecessor.Machine Learning ResearchGPT-5.5 Outperforms, Hallucinates: OpenAI’s latest model tops leaderboards for coding, visual puzzles, and overall intelligenceThe latest update of OpenAI’s flagship model sets new states of the art in important benchmarks but has difficulty distinguishing between what it does and doesn't know.Machine Learning ResearchAssistants That Assist Consistently: Large language models can drift drift from helpful personas to harmful ones, but new research aims to stabilize themTypically, large language models are trained to act as helpful, harmless, honest assistants. However, during long or emotionally charged conversations, traits can emerge that are less beneficial. Researchers devised a way to steady the assistant personas of LLMs.Machine Learning ResearchHumanoid Robots Work Factory Floors: Agiliy Digits humanoid robots fetch and carry bins at a Schaeffler auto-parts factory, displacing humans into higher-level jobsA small number of humanoid robots have made their way into industrial settings, where they’re roughly matching the cost of human labor and propelling some workers into higher-level roles.Machine Learning ResearchGLM 5.1 Aims for Long-Running Tasks: Z.ai’s GLM 5.1 evaluates interim results and may change its approach hundreds of times before it delivers final outputZ.ai updated its flagship open-weights large language model to work autonomously on single tasks for up to eight hours.Machine Learning ResearchSimulating Diverse Human Cohorts: Persona generation simulates human characters across a controllable range of points of viewIf you want to understand how the public will respond to your offerings, large language models can simulate users who answer questions about capabilities, features, promotions, or prices.Machine Learning ResearchUS States Move Forward With AI Laws: Most states are regulating AI despite President Trump’s opposition to state-level lawsU.S. states are continuing to enact laws that regulate AI, despite President Trump’s efforts to discourage state-by-state legislation in favor of national laws.Machine Learning ResearchBig Pharma Bets Big on AI: Pharmaceutical kingpin Eli Lilly gave Insilico $2.75 billion for AI-driven drug developmentGenerative AI has proven that it can produce text, images, audio, video, and code. The world’s most valuable pharmaceutical company is betting billions that it can produce drugs as well.Load MoreSubscribe to The BatchStay updated with weekly AI News and Insights delivered to your inbox
Machine Learning Research | The Batch
Weekly AI news for engineers, executives, and enthusiasts.
653 words~3 min read








