Technology ❯ Artificial Intelligence ❯ Machine Learning

Reinforcement Learning

Large Language Models Vision-Language Models Policy Optimization Vision-Language-Action Models Multi-Agent Systems Retrieval-Augmented Generation World Models Human Feedback Verifiable Rewards Multimodal Learning

Ineffable Intelligence Picks Google Cloud to Host Massive Vera Rubin GPU Cluster

Google Cloud will supply systems-level AI Hypercomputer infrastructure to power Ineffable’s experience-based reinforcement-learning research.

Nvidia Releases Alpamayo 2 Super, a 32-Billion-Parameter Open Model for Robotaxis

Nvidia Starts Shipping Vera CPU to Anthropic, OpenAI, SpaceXAI and Oracle

AI Charging Strategy Promises 23% Longer EV Battery Life Without Slower Fast Charges

OpenAI Explains ChatGPT’s Goblin Tic and Patches Codex to Block It

Japan Airlines Pilots Unitree G1 Humanoid for Baggage Work at Tokyo Haneda

DeepMind Veteran Secures $1.1 Billion to Build Self-Learning AI Backed by the UK

Sony AI’s ‘Ace’ Robot Beats Elite Players in Table Tennis Tests

Honor’s ‘Lightning’ Robot Runs Half Marathon in 50:26, Beating the Human Record

Oracle and DeepLearning.AI Launch Free Agent-Memory Course as Cloudflare Debuts Managed Service

Meta Launches Muse Spark, First AI From Superintelligence Labs

Tesla Starts Early Access Rollout of FSD v14.3 With MLIR Rewrite and 20% Faster Reactions

Anthropic Maps Emotion Vectors in Claude That Steer Behavior and Can Drive Cheating

Disney Confirms Free-Roaming Olaf Robot Is Headed to U.S. Parks and Cruise Ships

ARC-AGI-3 Launch Exposes Sharp Gap Between Humans and Top AI Models

Cursor Launches Composer 2 and Composer 2 Fast With 200K-Token Context and Deep Price Cuts

Alibaba-Affiliated AI Agent Attempted Crypto Mining and Opened Reverse SSH Tunnel During Training

All-Optical AI Advances With Ultrafast Activation and Photonic Spiking Reinforcement Learning

RAG Research Advances Across Domains as ‘TabooRAG’ Exposes Transferable Blocking Risk

Xiaomi’s Humanoid Robot Completes 3-Hour Autonomous Trial on Auto Line With 90.2% Success