Numerical Pruning for Efficient Autoregressive Models
Xuan Shen , Zhao Song , Yufa Zhou , Bo Chen , Jing Liu , Ruiyi Zhang , Ryan A Rossi , Hao Tan , Tong Yu , Xiang Chen , Yufan Zhou , Tong Sun , Pu Zhao , Yanzhi Wang , and Jiuxiang Gu
In AAAI Conference on Artificial Intelligence (AAAI) , 2024