Cohere For AI
Cohere For AI is Cohere's research lab that seeks to solve complex machine learning problems. We support fundamental research that explores the unknown, and are focused on creating more points of entry into machine learning research.
Fundamental research lab
We work at the frontier of AI progress with the goal of solving cutting-edge scientific problems. We see contributions to traditional conferences and publications in journals as an important part of our work, but also support efforts that go “beyond the research paper” and encourage scientific communication through different mediums. We drive the creation of new research spaces and breakthroughs that change where, how, and by whom research is done. We believe that technology is powerful, and empowering different perspectives ensures responsible innovation.
Open Science Initiative
We’re not just another research group. We are a hybrid lab with both a dedicated research staff and support for open science initiatives. We collaborate openly with independent researchers all over the world to conduct top-tier ML research.
Our open science research community is a space where researchers, engineers, linguists, social scientists, and lifelong learners connect and collaborate with each other. We come together from over 100 countries around the world and support large- and small-scale research collaborations.
Our models
State of the Art, Accessible Research LLM
Aya Expanse - 8B
State of the Art Research LLM
Aya Expanse - 32B
Massively Multilingual Research LLM
Aya
MODEL WEIGHTS FOR DEMOCRATIZING RESEARCH ACCESS
C4AI Command R+ - 104B
MODEL WEIGHTS FOR DEMOCRATIZING RESEARCH ACCESS
C4AI Command R - 35B
MODEL WEIGHTS FOR DEMOCRATIZING RESEARCH ACCESS
Command R7B
Our papers
Bridging the Data Provenance Gap Across Text, Speech, and Video
Progress in AI is driven largely by the scale and quality of training data. Despite this, there is a deficit of empirical analysis examining the attributes of well-established datasets beyond text. In this work we conduct the largest and first-of-its-kind longitudinal audit across modalities.
Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier
We introduce the Aya Expanse model family, a new generation of 8B and 32B parameter multilingual language models, aiming to address the critical challenge of developing highly performant multilingual models that match or surpass the capabilities of monolingual models.
Policy Primer - Translating Safety
This Policy Primer summarises several promising avenues to addressing the language gap in AI safety and identifies five recommendations for researchers and policymakers to consider in their efforts to improve AI safety for everyone.
If You Can't Use Them, Recycle Them
Optimizing Merging at Scale Mitigates Performance Tradeoffs
Global MMLU
Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
The Reality of AI and Biorisk
This paper provides an analysis of existing available research surrounding two AI and biorisk threat models: 1) access to information and planning via large language models (LLMs), and 2) the use of AI-enabled biological tools (BTs) in synthesizing novel biological artifacts.
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
In this work, we construct an evaluation suite of 197,243 QA pairs from local exam sources to measure the capabilities of multilingual LLMs in a variety of regional contexts.
M-RewardBench: Evaluating Reward Models in Multilingual Settings
In this work, we conduct a systematic evaluation of several reward models in multilingual settings.
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
In this work, we explore model merging in a diverse multi-task setting, combining safety and general-purpose tasks within a multilingual context. Overall, our comprehensive study of merging approaches provides a useful framework for building strong and safe multilingual models.
Our programs
Advancing the NLP space through our programs.
ACCELERATING MULTILINGUAL AI THROUGH OPEN SCIENCE
Introducing Aya
About
Aya is a global initiative led by Cohere For AI to advance the state of the art in multilingual AI and bridge gaps between people and cultures across the world. An open science project to create new models and datasets that expand the number of languages covered by AI, Aya involves over 3,000 independent researchers across 119 countries.
Exploring the unknown together
Scholars program
About
Our Scholars Program provides the opportunity to work alongside some of the best research and engineering experts in the world. We have created an open and supportive environment that provides an alternative point of entry into machine learning research.
Academic support
Research grant
Benefits
Cohere For AI research grants are designed to support academic partners who are conducting research with the goal of releasing a peer-reviewed scientific artifact. Our program provides academic partners, developers, researchers, and other members of our community with subsidized access to the Cohere API.
Past events and videos
Research is inherently a human endeavor, and our event series provide insights from beginning to breakthrough.
Video
Cong Lu: The AI Scientist
Video
Fireside Chat: Max Welling
Video
C4AI Expedition Aya - Closing Ceremony
Video
AI & Technical Governance: Saffron Huang and Tina M. Park, PhD
Video
Panayiotis Panayiotou: Curricula for Learning Robust Policies...
Video
Arthur Conmy: Mechanistic Interpretability Research Frontiers
Meet our research team
Our staff brings together machine learning experts to contribute to progress in machine learning through fundamental research. We are committed to open collaboration and to creating more points of entry into machine learning research through our Scholars Program.
Sara Hooker
Head, Cohere For AI
Marzieh Fadaee
Senior Research Scientist
Julia Kreutzer
Senior Research Scientist
Ahmet Üstün
Senior Research Scientist
Beyza Ermis
Senior Research Scientist
Madeline Smith
Operations and Community Lead
Aidan Peppin
Policy & Responsible AI Lead
Brittawnya Prince
Operations Associate
Arielle Salman Bailey
Operations Associate
Saurabh Dash
Research Engineer
Daniel D'souza
Research Engineer
Alejandro Salamanca
Open Science Research Engineer
Shivalika Singh
Open Science Research Engineer
Aakanksha
Research Scholar
Viraat Aryabumi
Research Scholar
John Dang
Research Scholar
Oliver Nan
Research Scholar
Luísa Shimabucoro
Research Scholar
Arash Ahmadian Dehkordi
Research Scholar
Frequently Asked Questions
What’s C4AI’s origin story?
In 2017, a team of friends, classmates, and engineers started a distributed research collaboration, with a focus on creating a medium for early-career AI enthusiasts to engage with experienced researchers – they called it “for.ai.” Two of those co-founding members, Aidan Gomez and Ivan Zhang, later went on to co-found Cohere, and many of the original members went on to do exciting things (pursuing PhDs, working at industry and academic labs).
At the time, For AI was one of the first community-driven research groups to support independent researchers around the world. Today, Cohere is proud to reintroduce For AI as Cohere For AI, a dedicated research lab and community for exploring the unknown, together. Watch the C4AI history video here.
Do you charge for your educational programs or community membership?
We do not charge for participating in any of our programs, and are committed to supporting educational outreach programs, which include compute resources and infrastructure needed to participate in machine learning research.
Are you hiring for research positions or interns?
Our full list of open positions is available here.
How can I stay in touch?
To stay up to date on upcoming talks, sign up for our mailing list.
You can also apply to join our open science community or follow us on LinkedIn and Twitter.
What is Aya?
Aya is a state-of-the-art, open source, massively multilingual research LLM covering 101 languages – including more than 50 previously underserved languages. Learn more here.
Join our open science community