Why LLMs Struggle with Arithmetic Puzzles
Post date: August 23, 2025. Post author: Extrapolate. Post categories: data-synthesis-pipeline, fine-tuning-on-synthetic-data, mathematical-reasoning-ai, out-of-domain-benchmarking, reasoning-verification, symbolic-reasoning-ai, synthetic-data-generation, zero-shot-learning

Testing Large Language Models on Math Puzzles
Post date: August 23, 2025. Post author: Extrapolate. Post categories: data-synthesis-pipeline, fine-tuning-on-synthetic-data, mathematical-reasoning-ai, out-of-domain-benchmarking, reasoning-verification, symbolic-reasoning-ai, synthetic-data-generation, zero-shot-learning

Evaluating Fine-Tuned LLMs on Reasoning Puzzles
Post date: August 23, 2025. Post author: Extrapolate. Post categories: data-synthesis-pipeline, fine-tuning-on-synthetic-data, mathematical-reasoning-ai, out-of-domain-benchmarking, reasoning-verification, symbolic-reasoning-ai, synthetic-data-generation, zero-shot-learning

A Framework for Synthesizing Arithmetical Puzzle Datasets for Large Language Models
Post date: August 23, 2025. Post author: Extrapolate. Post categories: data-synthesis-pipeline, fine-tuning-on-synthetic-data, mathematical-reasoning-ai, out-of-domain-benchmarking, reasoning-verification, symbolic-reasoning-ai, synthetic-data-generation, zero-shot-learning