Stars: 1 star written in Python
Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words