
Filter papers
Remove All Filters
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
multilingual
Safety
Supervised Learning
Language
Human Feedback
Efficiency
multilingual
Safety
Supervised Learning
Language
Human Feedback
Efficiency
Light bulbs have energy ratings — so why can’t AI chatbots?
Language Models
Efficiency
Language Models
Efficiency
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
Mixture of Experts
Language Models
Efficiency
Mixture of Experts
Language Models
Efficiency
Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation
Evaluation
Efficiency
Language
Generative Models
Evaluation
Efficiency
Language
Generative Models
Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning
Mixture of Experts
Efficiency
Transfer Learning
Language
Generative Models
Compute
Mixture of Experts
Efficiency
Transfer Learning
Language
Generative Models
Compute
PASHA: Efficient HPO and NAS with Progressive Resource Allocation
Efficiency
Optimization
Neural Architecture Search
Efficiency
Optimization
Neural Architecture Search
Efficient Methods for Natural Language Processing: A Survey
Efficiency
Generative Models
Efficiency
Generative Models
Associative Memory Augmented Asynchronous Spatiotemporal Representation Learning for Event-based Perception
Transformers
Representation Learning
Efficiency
Computer Vision
Transformers
Representation Learning
Efficiency
Computer Vision
Intriguing Properties of Compression on Multilingual Models
Efficiency
Compute
Language
Generative Models
Efficiency
Compute
Language
Generative Models
Exploring Low Rank Training of Deep Neural Networks
Model Compression
Transformers
Efficiency
Model Compression
Transformers
Efficiency
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics
Efficiency
Interpretability
Safety
Responsible AI
Computer Vision
Efficiency
Interpretability
Safety
Responsible AI
Computer Vision
Robust Distillation for Worst-class Performance
Efficiency
Interpretability
Safety
Responsible AI
Efficiency
Interpretability
Safety
Responsible AI
Scalable Training of Language Models using PAX pjit and TPUv4
Efficiency
Frameworks
Tooling
Efficiency
Frameworks
Tooling