< Back to authors

Conway Zhu
Member of Technical Staff, Foundations
Conway Zhu is a Member of Technical Staff in the Foundations team at Cohere, where his work focuses on efficient inference of Large Language Models. He obtained his Bachelor's degree from Northwestern University.
Multiple Authors - Apr 22, 2026
Production-Ready W4A8: vLLM Integration and Quality Recovery Techniques Explained