METR

Model Evaluation and Threat Research (METR)

METR is a research nonprofit that works on assessing whether cutting-edge AI systems could pose catastrophic risks to society.

We build the science of accurately assessing risks, so that humanity is informed before developing transformative AI systems.

Read more about our work here.

Our Software

Popular repositories

  1. eval-analysis-public Public

    Public repository containing METR's DVC pipeline for eval data analysis

    Python 184 37

  2. task-standard Public

    METR Task Standard

    TypeScript 170 36

  3. RE-Bench Public

    Python 130 18

  4. vivaria Public

    Vivaria is METR's tool for running evaluations and conducting agent elicitation research.

    TypeScript 130 39

  5. public-tasks Public

    HTML 116 17

  6. hcast-public Public

    HTML 18 4

Repositories

Showing 10 of 54 repositories
  • hcast-public Public
    HTML 18 4 2 1 Updated Jan 19, 2026
  • public-tasks Public
    HTML 116 17 1 2 Updated Jan 19, 2026
  • inspect-agents Public

    A collection of METR wrappers around Inspect agents and of METR scanners for Inspect Scout. Intended to allow consistent usage and customization.

    Python 4 1 4 3 Updated Jan 20, 2026
  • inspect_ai Public Forked from UKGovernmentBEIS/inspect_ai

    Inspect: A framework for large language model evaluations

    Python 4 MIT 377 1 0 Updated Jan 20, 2026
  • inspect-action Public

    Running UK AISI's Inspect in the Cloud

    Python 10 MIT 5 36 21 Updated Jan 19, 2026
  • eval-analysis-public Public

    Public repository containing METR's DVC pipeline for eval data analysis

    Python 184 37 6 3 Updated Jan 20, 2026
  • inspect-tinker-bridge Public

    Inspect tasks <> Tinker RL envs

    Python 0 MIT 0 0 0 Updated Jan 19, 2026
  • inspect-gepa-bridge Public

    Generic bridge for Inspect AI tasks with GEPA optimization

    Python 0 0 0 0 Updated Jan 19, 2026
  • inspect_scout Public
    Python 1 MIT 6 0 0 Updated Jan 18, 2026
  • prime-rl Public Forked from PrimeIntellect-ai/prime-rl

    Decentralized RL Training at Scale

    Python 0 Apache-2.0 175 0 0 Updated Jan 15, 2026