Xunyu Zhu

Chinese Academy of Sciences

Scholarly Papers: 2

Downloads: 150

Total Citations: 2

Scholarly Papers (2)

1.

Distilling Mathematical Reasoning Capabilities into Small Language Models

Number of pages: 27 Posted: 03 Apr 2024
Xunyu Zhu, Jian Li, Yong Liu, Can Ma and Weiping Wang
Chinese Academy of Sciences, Chinese Academy of Sciences, Renmin University of China, affiliation not provided to SSRN and Chinese Academy of Sciences
Downloads 105 (664,543)
Citation 2

Keywords: Large Language Models, Knowledge Distillation, Mathematical Reasoning, Chain-of-Thought, Program-of-Thought

2.

Improving Differentiable Architecture Search Via Self-Distillation

Number of pages: 27 Posted: 28 Feb 2023
Xunyu Zhu, Jian Li, Yong Liu and Weiping Wang
Chinese Academy of Sciences, Chinese Academy of Sciences, Renmin University of China and Chinese Academy of Sciences
Downloads 45 (1,121,190)

Keywords: neural architecture search, neural networks, smoothness, knowledge distillation, sharpness-aware minimization