40x or more speedup by selecting important neurons

koehr@alien.top · 2 years ago

40x or more speedup by selecting important neurons

obwohl@alien.top · 2 years ago

Does this technique affect the required RAM-size for inference?

koehr@alien.top · 2 years ago

I don’t think so (unfortunately). The model size doesn’t change, only the way it is traversed.

obwohl@alien.top · 2 years ago

Can this technique be combined with lora with a not so low rank? Lora increases the learning time (I heard) but this should be no problem then anymore :)