No. 38 Xueyuan Road
Haidian District
Beijing, 100871
China
Peking University
Large Language ModelsTernary QuantizationPost-Training QuantizationEfficient Inference