Independent Science + Technology

Author: Kenneth Leung

How to Benchmark DeepSeek-R1 Distilled Models on GPQA Using Ollama and OpenAI’s simple-evals

Nothing left to load.