Stars
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Public repository containing METR's DVC pipeline for eval data analysis
A lightweight, powerful framework for multi-agent workflows
trying to get my kid excited about code by writing small programs together
A deep learning model for style-specific music generation.
An open-source NLP research library, built on PyTorch.
Assignment 2: Step by step to understand the instructions