Compressing Large Language Models by Joint Sparsification and Quantization

Published in ICML, 2024

Jinyang Guo, Jianyu Wu, Zining Wang, Jiaheng Liu, Ge Yang, Yifu Ding, Ruihao Gong, Haotong Qin, Xianglong Liu