Date: Jan 03, 2024
Time: 12:00 PM - 1:00 PM
Join the Geo Regional Asia group as they welcome Haihao Shen to present "Efficient LLM Inference on CPUs."
Bio: Haihao is a senior AI architect in DCAI/AISE at Intel, leading model quantization and efficient inference for LLMs on Intel platforms.
Description: the talk will give an overview of LLM model quantization and efficient inference based on Intel Extension for Transformers.
Add event to calendar