12th September - AI News Daily - OpenAI Launches Real-Time Voice API as Mastercard Rolls Out Agentic Checkout
🌍 INAI • The Open AI Hub
The Intelligence Atlas → the world’s most comprehensive, open hub of AI knowledge. 2 Million+ tools, models, agents, tutorials & daily news—free for all, updated every day.
https://github.com/inai-sandy/inAI-wiki
AI News Daily — 12 Sept 2025: Comprehensive Summary
Industry Shifts: OpenAI's $300B Oracle cloud deal (4.5GW capacity) reshapes the AI infrastructure landscape. NVIDIA's new Rubin CPX GPU supports 1M+ token context with SMART infrastructure for enterprise workloads. The FTC intensifies scrutiny of AI platforms regarding child safety and exaggerated AI claims. Microsoft both deepens its OpenAI partnership and develops custom silicon, while OpenAI joins Broadcom's program as the industry seeks Nvidia alternatives. Mastercard launches agentic AI checkout in the US.
New Tools: OpenAI introduces gpt-realtime and the Realtime API for lower-latency voice agents. Google Gemini adds audio transcription and a Creation Library for 10-minute files. ChatGPT adds support for MCP tools, while Anthropic launches an MCP server registry. Claude now offers document editing for Word, Excel, and PDF files. Replit debuts an autonomous coding agent. DSPy integrates with KùzuDB for improved retrieval.
LLM Advances: Alibaba's Qwen3-Next-80B-A3B uses a mixture-of-experts (MoE) architecture for efficient training. Baidu open-sources ERNIE-4.5-21B-A3B-Thinking. The mmBERT encoder supports 1,800+ languages. OpenAI integrates GPT-OSS with Transformers. Unsloth delivers 1- to 3-bit quantized LLMs that outperform larger models. Baichuan introduces DCPO RLHF for better alignment.
Research Breakthroughs: Mathematics Inc's Gauss agent tackles the Strong Prime Number Theorem. ByteDance creates AgentGym-RL for standardized agent training. DeepMind partners with Imperial on antibiotic resistance. AQCat25 provides 11M+ reactions for catalyst discovery. DCQCN wins a SIGCOMM award. A new survey explores 3D/4D world modeling.
Tutorials & Demos: Anthropic offers agent tool optimization guidance. Jurafsky & Martin release SLP3. AWS Builder Loft shares AI infrastructure scaling lessons. Context engineering studies show quality beats quantity. RAG remains vital even with long contexts. ByteDance's Seedream 4.0 competes with Gemini 2.5. New creative tools include Delphi AI, Kling Avatars, and Veo 3. Design tools Mood Font and Glif enhance creativity.
Industry Discussions: Debates continue on open vs. closed AI ecosystems. AI text detection faces feasibility challenges. The industry trends toward model plurality rather than dominance. AI task autonomy reportedly doubles every ~7 months. Local LLMs offer cost advantages. Agent security concerns grow as operations scale.