Cohere Labs

Cohere Labs is Cohere's non-profit research lab that seeks to solve complex machine learning problems. We support fundamental research that explores the unknown, and are focused on creating more points of entry into machine learning research.

Fundamental research lab

We work at the frontier of AI progress with the goal of solving cutting-edge scientific problems. We see contributions to traditional conferences and publications in journals as an important part of our work, but also support efforts that go “beyond the research paper” and encourage scientific communication through different mediums. We drive the creation of new research spaces and breakthroughs that change where, how and by whom research is done. We believe that technology is powerful, and empowering different perspectives ensures responsible innovation.

Open Science Initiative

We’re not just another research group. We are a hybrid lab with both a dedicated research staff and support for open science initiatives. We collaborate openly with independent researchers all over the world to conduct top-tier ML research.


Our open science research community is a space where researchers, engineers, linguists, social scientists, and lifelong learners connect and collaborate with each other. We come together from over 100 countries around the world and support large and small scale research collaborations.

Our models

MODEL WEIGHTS FOR DEMOCRATIZING RESEARCH ACCESS

Command A

Multimodal Accessible VLLM

Aya Vision - 8B

Multimodal State of the Art VLLM

Aya Vision - 32B

State of the Art, Accessible Research LLM

Aya Expanse - 8B

State of the Art Research LLM

Aya Expanse - 32B

Massively Multilingual Research LLM

Aya

Our papers

The Leaderboard Illusion

Chatbot Arena has emerged as the go-to leaderboard for ranking the most capable AI systems. Yet, in this work we identify systematic issues that have resulted in a distorted playing field and propose recommendations to improve the rigour of the leaderboard.

Aya Vision: Advancing the Frontier of Multilingual Multimodality

To address the challenges of building multimodal language models, we introduce novel techniques spanning both data & modeling, resulting in Aya Vision 8B and 32B models. Our work provides insights into techniques that effectively bend the need for compute while delivering extremely high performance.

Crosslingual Reasoning through Test-Time Scaling

This study investigates the cross-lingual generalization of English reasoning in large language models. It finds that scaling up inference compute improves multilingual mathematical reasoning, but models struggle with out-of-domain reasoning and low-resource languages.

Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation

Multilingual LLMs are rapidly improving, but their generative evaluation lacks rigor and consistency. Drawing from machine translation practices, our work proposes best practices and a checklist to improve evaluation quality and enable better development of mLLMs.

Kaleidoscope: Exams for Multilingual Vision Evaluation

Kaleidoscope is a new benchmark for evaluating vision-language models across 18 languages and 14 subjects, with 20,911 multiple-choice questions. It addresses the lack of multilingual and multicultural coverage in existing benchmarks.

When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning

We present a multi-faceted evaluation framework that measures not only performance but also fairness, unintended effects, and adaptability across varying levels of preference divergence.

From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

Large Language Models (LLMs) are increasingly used in working environments for a wide range of tasks, excelling at solving individual problems in isolation. However, are they also able to effectively collaborate over long-term interactions?

Policy Primer - Efficient AI

This policy primer outlines some of the challenges around measuring AI model efficiency systematically, and the techniques being developed to improve model efficiency. It focuses on work that can be done at the model developer layer, as opposed to the hardware or energy supply layers.

Fairness of Deep Ensembles: On the interplay between per-group task difficulty and under-representation

In this research, we explore the possibility of achieving greater fairness by using an imbalanced dataset instead of a balanced one, and how the use of ensembles could further amplify this impact. We carry out the same analysis on real datasets, such as CheXpert and CelebA.

Our programs

Advancing the NLP space through our programs.

ACCELERATING MULTILINGUAL AI THROUGH OPEN SCIENCE

Introducing Aya

About

Aya is a global initiative led by Cohere Labs to advance the state of the art in multilingual AI and bridge gaps between people and cultures across the world. An open science project to create new models and datasets that expand the number of languages covered by AI, Aya involves over 3,000 independent researchers across 119 countries.

Exploring the unknown together

Scholars program

About

Our Scholars Program provides the opportunity to work alongside some of the best research and engineering experts in the world. We have created an open and supportive environment that provides an alternative point of entry into machine learning research.

Academic support

Research grant

Benefits

Cohere Labs research grants are designed to support academic partners who are conducting research with the goal of releasing a peer-reviewed scientific artifact. Our program provides academic partners, developers, researchers, and other members of our community with subsidized access to the Cohere API.

Past events and videos

Research is inherently a human endeavor, and our event series provide insights from beginning to breakthrough.

Video

Cong Lu: The AI Scientist

Video

Fireside Chat: Max Welling

Video

C4AI Expedition Aya - Closing Ceremony

Video

AI & Technical Governance: Saffron Huang and Tina M. Park, PhD

Video

Panayiotis Panayiotou: Curricula for Learning Robust Policies...

Video

Arthur Conmy: Mechanistic Interpretability Research Frontiers

Press & Media

The Globe and Mail

AI chatbots fall short in dozens of languages. A non-profit aims to fix..

The Washington Post

AI researchers uncover ethical, legal risks to using popular data sets

Axios

New AI polyglot launched to help fill massive language gap in field

Frequently Asked Questions

  • What’s C4AI’s origin story?
    • In 2017, a team of friends, classmates, and engineers started a distributed research collaboration, with a focus on creating a medium for early-career AI enthusiasts to engage with experienced researchers – they called it “for.ai.” Two of those co-founding members, Aidan Gomez and Ivan Zhang, later went on to co-found Cohere, and many of the original members went on to do exciting things (pursuing PhDs, working at industry and academic labs).


      At the time, For AI was one of the first community-driven research groups to support independent researchers around the world. For AI became “Cohere For AI” until April 2025, when we started our journey as Cohere Labs - a dedicated research lab and community for exploring the unknown, together.

      Watch the Cohere Labs history video here.

  • Do you charge for your educational programs or community membership?
    • We do not charge for participating in any of our programs, and are committed to supporting educational outreach programs, which include compute resources and infrastructure needed to participate in machine learning research.

  • Are you hiring for research positions or internships?
    • Our full list of positions is listed here.

  • How can I stay in touch?
  • What is Aya?
    • Aya is a state-of-the-art, open source, massively multilingual research LLM covering 101 languages – including more than 50 previously underserved languages. Learn more here.

Join our open science community

Collaborate with researchers, engineers, linguists, social scientists, and lifelong learners from 100+ countries on top-tier ML research.