https://arxiv.org/abs/2311.10770

“UltraFastBERT”, apparently a variant of BERT, that uses only 0.3% of it’s neurons during inference, is performing on par with similar BERT models.

I hope that’s going to be available for all kinds of models in the near future!

    • koehr@alien.topOPB
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      I don’t think so (unfortunately). The model size doesn’t change, only the way it is traversed.

      • obwohl@alien.topB
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        Can this technique be combined with lora with a not so low rank? Lora increases the learning time (I heard) but this should be no problem then anymore :)