Multilingual Isn’t Cross-Lingual: Inside My Benchmark of 11 LLMs on Mid- & Low-Resource Languages



This content originally appeared on HackerNoon and was authored by Niko D

I built an evaluation pipeline for multilingual and cross-lingual LLM performance on 11 mid- and low-resource languages (e.g., Basque, Kazakh, Amharic, Hausa, Sundanese). It combines native-language datasets (KazMMLU, BertaQA, BLEnD), zero-shot chain-of-thought prompts, and a new metric, LASS (Language-Aware Semantic Score), which rewards both semantic correctness and answering in the requested language. Findings: (1) scale helps, but with diminishing returns; (2) reasoning-optimized models often beat larger non-reasoning models; (3) the best open-weight model is ~7% behind the best closed model; (4) "multilingual" models underperform on culturally specific cross-lingual tasks once evaluation moves beyond translated English content. Code & data: see the GitHub link in Reproducibility.
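The summary above does not give the LASS formula, so the following is only a hypothetical sketch of what a language-aware score of this kind could look like, not the author's implementation. It assumes a semantic-correctness score in [0, 1] has already been computed elsewhere (for example by an embedding similarity or a judge model) and that a separate language detector has decided whether the answer is in the requested language. The function name, the weighted-blend form, and the default `language_weight` of 0.2 are all assumptions made for illustration.

```python
def language_aware_score(
    semantic_score: float,
    answered_in_target: bool,
    language_weight: float = 0.2,
) -> float:
    """Blend semantic correctness with a requested-language check.

    Hypothetical illustration only: the real LASS definition is not
    stated in the text, and the weighting scheme here is an assumption.
    """
    if not 0.0 <= semantic_score <= 1.0:
        raise ValueError("semantic_score must be in [0, 1]")
    if not 0.0 <= language_weight <= 1.0:
        raise ValueError("language_weight must be in [0, 1]")

    # 1.0 if the answer is in the requested language, otherwise 0.0.
    language_score = 1.0 if answered_in_target else 0.0

    # Weighted average: a correct answer in the wrong language is
    # penalized, but still scores higher than a wrong answer.
    return (1.0 - language_weight) * semantic_score + language_weight * language_score


if __name__ == "__main__":
    # A correct answer given in the wrong language loses the language share.
    print(language_aware_score(1.0, answered_in_target=False))
    # A correct answer in the requested language gets full credit.
    print(language_aware_score(1.0, answered_in_target=True))
```

A blend like this keeps the two signals separable: setting `language_weight` to 0 reduces the score to plain semantic correctness, which makes it easy to see how much of a model's ranking depends on following the language instruction.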




APA

Niko D | Sciencx (2025-11-20T15:41:53+00:00) Multilingual Isn’t Cross-Lingual: Inside My Benchmark of 11 LLMs on Mid- & Low-Resource Languages. Retrieved from https://www.scien.cx/2025/11/20/multilingual-isnt-cross-lingual-inside-my-benchmark-of-11-llms-on-mid-low-resource-languages/
