Cite

HARVARD Citation

    Sajjad, H. et al. (2023). On the effect of dropping layers of pre-trained transformer models. Computer speech & language. p. . [Online]. 
  
Back to record