Releases from OpenAI Preparedness

openai/preparedness

Preparedness Evals

This repository contains the code for multiple Preparedness evals that use nanoeval and alcatraz.

System requirements

  1. Python 3.11 (3.12 is untested; 3.13 will break chz)
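Before installing, it can help to confirm which interpreter is active. The following is a minimal sketch, not part of the repository: `check_python_version` is a hypothetical helper that encodes only the version notes stated above, and it assumes `python3` on your `PATH` is the interpreter `pip` will use.

```shell
# Hypothetical helper (not part of this repository): classify a "major.minor"
# Python version according to the requirements listed above.
check_python_version() {
  case "$1" in
    3.11) echo "supported" ;;
    3.12) echo "untested" ;;
    3.13) echo "unsupported" ;;   # stated above to break chz
    *)    echo "unknown" ;;       # the README makes no claim about other versions
  esac
}

# Assumption: `python3` on PATH is the interpreter that pip installs into.
if command -v python3 >/dev/null 2>&1; then
  ver="$(python3 -c 'import sys; print("%d.%d" % sys.version_info[:2])')"
  echo "Python $ver: $(check_python_version "$ver")"
fi
```

If the reported status is anything other than "supported", consider creating a Python 3.11 virtual environment before running the install step.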

Install pre-requisites

for proj in nanoeval alcatraz nanoeval_alcatraz; do
  pip install -e project/"$proj"
done

Evals

  • PaperBench
  • SWELancer (Forthcoming)
  • MLE-bench (Forthcoming)
