Model: "qwen-2.5-max"

    Gemma 3 beats DeepSeek V3 in Elo, 2.0 Flash beats GPT4o with Native Image Gen
    Mistral Small 3 24B and Tulu 3 405B
    not much happened today