All tags
Person: "sundar_pichai"
not much happened today
gemini-3.1-flash voxtral-tts cohere-transcribe gpt-5.4-mini gpt-5.4-nano glm-5-turbo reka-edge reka-flash-3 google-deepmind mistral-ai cohere openai zai reka-ai voice vision function-calling context-windows multimodality text-to-speech low-latency human-preference automatic-speech-recognition model-benchmarking cost-efficiency hallucination-detection multi-agent-systems open-source git-worktrees logan_kilpatrick sundar_pichai guillaume_lample aidan_gomez jay_alammar giffmana andrew_curran
Google launched Gemini 3.1 Flash Live, a realtime voice and vision agent model with 2x longer conversation memory, supporting 70 languages and 128k context. Mistral AI released Voxtral TTS, a low-latency, open-weight text-to-speech model supporting 9 languages and competitive with ElevenLabs. Cohere introduced Cohere Transcribe, an audio model with 14-language support and top English ASR leaderboard performance at 5.42 WER. OpenAI released smaller multimodal variants GPT-5.4 mini and GPT-5.4 nano with 400k context, noted for cost-competitiveness but high verbosity and hallucination rates. Other releases include GLM-5-Turbo by Zai, Reka Edge and Flash 3 on OpenRouter, and new multi-agent UX tooling Cline Kanban for orchestrating CLI coding agents.
Gemini 3.0 Flash Preview: 1/4 cost of Pro, but ~as smart, retakes Pareto Frontier
gemini-3-flash gemini-3 gpt-5.2 gemini-3-pro google google-deepmind tool-calling multimodality benchmarking reasoning cost-efficiency model-performance context-window agentic-ai model-deployment sundar_pichai jeffdean demishassabis
Google launched Gemini 3 Flash, a pro-grade reasoning model with flash latency, supporting tool calling and multimodal IO, available via multiple platforms including Google AI Studio and Vertex AI. It offers competitive pricing at $0.50 per 1M input tokens and $3.00 per 1M output tokens, with context windows up to 1M tokens. Benchmarks show Gemini 3 Flash rivals or outperforms larger models like GPT-5.2 and Gemini 3 Pro in agentic, coding, and reasoning tasks, validated by ARC-AGI-2, SWE-bench, LMArena, and Arena benchmarks. Despite some tradeoffs like high token use and hallucination rates, it is cost-effective overall. Key figures include Sundar Pichai, Jeff Dean, and Demis Hassabis who publicly celebrated this achievement. The model's tool calling capabilities were demonstrated with 100 tools in a live demo.
OpenAI buys Jony Ive's io for $6.5b, LMArena lands $100m seed from a16z
gemini-2.5-pro gemini-diffusion openai lmarena a16z mistral-ai google google-deepmind multimodality reasoning code-generation math model-fine-tuning ai-assistants voice memory-optimization sundar_pichai
OpenAI confirmed a partnership with Jony Ive to develop consumer hardware. LMArena secured a $100 million seed round from a16z. Mistral launched a new code model fine-tune. Google DeepMind announced multiple updates at Google I/O 2024, including over a dozen new models and 20 AI products. Key highlights include the release of Gemini 2.5 Pro and Gemini Diffusion, featuring advanced multimodal reasoning, coding, and math capabilities, and integration of Gemini in Google Chrome as an AI browsing assistant. Deep Think enhanced reasoning mode and Project Astra improvements were also introduced, focusing on voice output, memory, and computer control for a universal AI assistant.