A Probabilistic Framework for Pruning Transformers via a Finite Admixture of Keys

Recommended citation: LM Bui, TT Huu, D Dinh, TM Nguyen, TN Hoang