Research papers

Work by Cohere For AI and Technical Staff at Cohere

Filter papers

Nov 29, 2023

Elo Uncovered: Robustness and Best Practices in Language Model Evaluation

Evaluation

Reproducibility

Language

Generative Models

Authors: Meriem Boubdir, Edward Kim, Beyza Ermis, Sara Hooker, Marzieh Fadaee

Continue Reading

Oct 25, 2023

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

Responsible AI

Safety

AI Policy

Data

Authors: Shayne Longpre, Robert Mahari, Anthony Chen, Naana Obeng-Marnu, Damien Sileo, William Brannon, Niklas Muennighoff, Nathan Khazam, Jad Kabbara, Kartik Perisetla, Xinyi Wu, Enrico Shippole Kurt Bollacker, Tongshuang Wu, Luis Villa, Sandy Pentland, Deb Roy, Sara Hooker

Continue Reading

Oct 24, 2023

Locally Differentially Private Document Generation Using Zero Shot Prompting

Safety

Privacy

Authors: Saiteja Utpala, Sara Hooker, Pin Yu Chen

Continue Reading

Oct 22, 2023

Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation

Evaluation

Efficiency

Language

Generative Models

Authors: Meriem Boubdir, Edward Kim, Beyza Ermis, Marzieh Fadaee, Sara Hooker

Continue Reading

Oct 11, 2023

Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models

Safety

Generative Models

Language Models

Authors: Luiza Pozzobon, Beyza Ermis, Patrick Lewis, Sara Hooker

Continue Reading

Sep 12, 2023

The Grand Illusion: The Myth of Software Portability and Implications for ML Progress

AI Policy

Compute

Open Source

Tooling

Hardware

Authors: Fraser Mince, Dzung Dinh, Jonas Kgomo, Neil Thompson, Sara Hooker

Continue Reading

Sep 11, 2023

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning

Mixture of Experts

Efficiency

Transfer Learning

Language

Generative Models

Compute

Authors: Ted Zadouri, Ahmet Üstün, Arash Ahmadian, Beyza Ermiş, Acyr Locatelli, Sara Hooker

Continue Reading

Sep 08, 2023

When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale

Transformers

Generative Models

Data Pruning

Large-Scale Pretraining

Data Efficiency

Authors: Max Marion, Ahmet Üstün, Luiza Pozzobon, Alex Wang, Marzieh Fadaee, Sara Hooker

Continue Reading

Jun 14, 2023

The Presidio Recommendations on Responsible Generative AI - World Economic Forum

Interpretability

Safety

Responsible AI

AI Policy

Authors: Sara Hooker, and over 100 other thought leaders.

Continue Reading

Jun 12, 2023

Evaluating the Social Impact of Generative AI Systems in Systems and Society

Responsible AI

Authors: Irene Solaiman, Zeerak Talat, William Agnew, Lama Ahmad, Dylan Baker, Su Lin Blodgett, Hal Daumé III, Jesse Dodge, Ellie Evans, Sara Hooker, Yacine Jernite, Alexandra Sasha Luccioni, Alberto Lusoli, Margaret Mitchell, Jessica Newman, Marie-Therese Png, Andrew Strait, Apostol Vassilev

Continue Reading