Research papers

Work by Cohere Labs and Technical Staff at Cohere

Feb 26, 2025

When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning

Dec 18, 2024

Bridging the Data Provenance Gap Across Text, Speech, and Video

Oct 15, 2024

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Multilingual

Safety

Supervised Learning

Language

Human Feedback

Efficiency

Aug 21, 2024

Light bulbs have energy ratings — so why can’t AI chatbots?

Language Models

Efficiency

Aug 15, 2024

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

Mixture of Experts

Language Models

Efficiency

Jul 09, 2024

On the Limitations of Compute Thresholds as a Governance Strategy

Apr 29, 2024

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Apr 24, 2024

The PRISM Alignment Project: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models

Feb 29, 2024

Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge

Oct 22, 2023

Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation

Evaluation

Efficiency

Language

Generative Models

Sep 11, 2023

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning

Mixture of Experts

Efficiency

Transfer Learning

Language

Generative Models

Compute

Aug 31, 2023

Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models

Apr 11, 2023

PASHA: Efficient HPO and NAS with Progressive Resource Allocation

Efficiency

Optimization

Neural Architecture Search

Mar 24, 2023

Efficient Methods for Natural Language Processing: A Survey

Efficiency

Generative Models

Mar 02, 2023

Associative Memory Augmented Asynchronous Spatiotemporal Representation Learning for Event-based Perception

Transformers

Representation Learning

Efficiency

Computer Vision

Nov 26, 2022

Intriguing Properties of Compression on Multilingual Models

Efficiency

Compute

Language

Generative Models

Sep 27, 2022

Exploring Low Rank Training of Deep Neural Networks

Model Compression

Transformers

Efficiency

Sep 20, 2022

Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics

Efficiency

Interpretability

Safety

Responsible AI

Computer Vision

Jun 13, 2022

Robust Distillation for Worst-class Performance

Efficiency

Interpretability

Safety

Responsible AI

Apr 13, 2022

Scalable Training of Language Models using PAX pjit and TPUv4

Efficiency

Frameworks

Tooling
