Topic: "architecture"

    MiniMax 2.7: GLM-5 at 1/3 cost SOTA Open Model
    not much happened today
    Shazeer et al (2024): you are overpaying for inference >13x