Topic: "model-security"

    Anthropic accuses DeepSeek, Moonshot, and MiniMax of "industrial-scale distillation attacks".
    1/12/2024: Anthropic coins the term "Sleeper Agents".