Do Efficient Transformers Really Save Computation?

Leo Migdal