Background image for aesthetic purposes

Research papers

Work by Cohere Labs and Technical Staff at Cohere

Filter papers

Remove All Filters

Apr 30, 2025

The Leaderboard Illusion

Evaluation

Language Models

multilingual

Evaluation

Language Models

Apr 10, 2025

Kaleidoscope: Exams for Multilingual Vision Evaluation

Evaluation

Open Source

multilingual

Generative Models

Multimodal

Feb 19, 2025

Code

Collaboration

Evaluation

Reasoning

Tooling

Dec 05, 2024

Global MMLU

Evaluation

Open Source

multilingual

Generative Models

Nov 29, 2024

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Data

Evaluation

Generative Models

multilingual

Open Source

Language Models

Nov 05, 2024

M-RewardBench: Evaluating Reward Models in Multilingual Settings

multilingual

Data

Evaluation

Open Release

Collaboration

Nov 29, 2023

Elo Uncovered: Robustness and Best Practices in Language Model Evaluation

Evaluation

Reproducibility

Language

Generative Models

Evaluation

Efficiency

Language

Generative Models

Responsible AI

Evaluation