All tags

Topic: "long-context"

    not much happened today
    not much happened today
    Anthropic raises $65B in Series H at a $965B post-money valuation, releases Opus 4.8 and Dynamic Workflows
    not much happened today
    not much happened today
    not much happened today
    DeepSeek v4
    not much happened today
    not much happened today
    Anthropic's Claude Opus 4.7
    Gemma 4
    not much happened today
    GPT 5.4: SOTA Knowledge Work -and- Coding -and- CUA Model, OpenAI is so very back
    not much happened today
    Claude Sonnet 4.6: clean upgrade of 4.5, mostly better with some caveats
    Qwen3.5-397B-A17B: the smallest Open-Opus class, very efficient model
    Z.ai GLM-5: New SOTA Open Weights LLM
    not much happened today
    OpenAI and Anthropic go to war: Claude Opus 4.6 vs GPT 5.3 Codex
    Context Graphs: Hype or actually Trillion-dollar opportunity?
    OpenEvidence, the โ€˜ChatGPT for doctors,โ€™ raises $250m at $12B valuation, 12x from $1b last Feb
    not much happened today
    Apple picks Google's Gemini to power Siri's next generation
    not much happened today
    Claude Skills grows: Open Standard, Directory, Org Admin
    OpenAI GPT Image-1.5 claims to beat Nano Banana Pro, #1 across all Arenas, but completely fails Vibe Checks
    NVIDIA Nemotron 3: hybrid Mamba-Transformer completely open source models from 30B to 500B
    not much happened today
    GPT-5.2 (Instant/Thinking/Pro): 74% on GDPVal, 1.4x cost of GPT 5.1, on 10 Year OpenAI Anniversary
    not much happened today
    OpenRouter's State of AI - An Empirical 100 Trillion Token Study
    Mistral 3: Mistral Large 3 + Ministral 3B/8B/14B open weights models
    not much happened today
    not much happened today
    not much happened today
    not much happened today
    DeepSeek-OCR finds vision models can decode 10x more efficiently with ~97% accuracy of text-only, 33/200k pages/day/A100
    The Karpathy-Dwarkesh Interview delays AGI timelines
    Claude Agent Skills - glorified AGENTS.md? or MCP killer?
    not much happened today
    Anthropic Claude Sonnet 4.5, Claude Code 2.0, new VS Code Extensions
    GDPVal finding: Claude Opus 4.1 within 95% of AGI (human experts in top 44 white collar jobs)
    not much happened today
    GPT-5 Codex launch and OpenAI's quiet rise in Agentic Coding
    not much happened today
    Oracle jumps +36% in a day after winning $300B OpenAI contract
    Kimi K2โ€‘0905 and Qwen3โ€‘Max preview: two 1T open weights models launched
    Cohere Command A Reasoning beats GPT-OSS-120B and DeepSeek R1 0528
    DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost
    not much happened today
    OpenAI rolls out GPT-5 and GPT-5 Thinking to >1B users worldwide; -mini and -nano help claim Pareto Frontier
    not much happened today
    ChatGPT Agent: new o* model + unified Deep Research browser + Operator computer use + Code Interpreter terminal
    Voxtral - Mistral's SOTA ASR model in 3B (mini) and 24B ("small") sizes beats OpenAI Whisper large-v3
    Kimi K2 - SOTA Open MoE proves that Muon can scale to 15T tokens/1T params
    Grok 4: xAI succeeds in going from 0 to new SOTA LLM in 2 years
    not much happened today
    Zuck goes Superintelligence Founder Mode: $100M bonuses + $100M+ salaries + NFDG Buyout?
    Gemini 2.5 Pro/Flash GA, 2.5 Flash-Lite in Preview
    not much happened today
    not much happened today
    Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1
    gpt-image-1 - ChatGPT's imagegen model, confusingly NOT 4o, now available in API
    not much happened today
    GPT 4.1: The New OpenAI Workhorse
    not much happened today
    LLaDA: Large Language Diffusion Models
    Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2
    Titans: Learning to Memorize at Test Time
    ModernBert: small new Retriever/Classifier workhorse, 8k context, 2T tokens,
    not much happened today
    Not much (in AI) happened this weekend
    not much happened today
    a calm before the storm
    not much happened today
    Everybody shipped small things this holiday weekend
    not much happened today
    Summer of Code AI: $1.6b raised, 1 usable product
    CogVideoX: Zhipu's Open Source Sora
    not much happened this weekend
    Nvidia Minitron: LLM Pruning and Distillation updated for Llama 3.1
    super quiet day
    Gemini Live
    Llama 3.1: The Synthetic Data Model
    Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)
    Not much happened today.
    Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata
    5 small news items
    1 TRILLION token context, real time, on device?
    Skyfall
    Not much happened today
    Evals: The Next Generation
    Mergestral, Meta MTIAv2, Cohere Rerank 3, Google Infini-Attention
    Claude 3 is officially America's Next Top Model
    Claude 3 just destroyed GPT 4 (see for yourself)
    Ring Attention for >1M Context
    Google AI: Win some (Gemma, 1.5 Pro), Lose some (Image gen)
    Sora pushes SOTA
    1/8/2024: The Four Wars of the AI Stack
    12/8/2023 - Mamba v Mistral v Hyena