I don't understand Mistral and context size, honestly.

anti-lucas-throwaway@alien.top · 2 years ago

Very interesting. Well, in hindsight I should’ve noticed: performance did decrease after 8k tokens and became completely unusable after 10k. I’m actually pretty disappointed that I still know nothing. No one documents what does and doesn’t work, or when and how. I can barely find anything about SWA (I know what it is in essence); no one documents how it works, whether and where you can set the window size, or whether it’s in Ooba’s app.

And then there is the problem that I don’t know if it’s supported on AMD cards, like you said. Try looking it up: searching Google for sliding window attention by itself just gives endless pages of “tutorials” and “guides” that don’t tell you anything. And combining it with ROCm just gives random results that don’t lead anywhere useful.

4onen@alien.top · 2 years ago

I did my best to explain Sliding Window Attention briefly there, so do let me know where my explanation is deficient.
No, you cannot set the window size and no, it’s not in Oobabooga/text-generation-webui. It’s trained in.
Well, good luck. AMD doesn’t even support their own cards properly for AI (ROCm support skipped my last card’s generation, and the generation before it only ever had beta support), which is why I finally gave up and switched to team green last year.
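
Since the question of how SWA works came up, here is a minimal sketch of the attention mask it implies. This is a toy illustration, not Mistral's or text-generation-webui's actual code: the function name `sliding_window_mask` and the tiny sizes are made up for the example, and the exact off-by-one convention for the window differs between implementations. The Mistral 7B paper reports a window of 4096 tokens; that value is part of how the model was trained, which is why it isn't a setting you tune in the UI.

```python
def sliding_window_mask(seq_len, window):
    """Toy causal sliding-window mask (hypothetical helper, not a library API).

    mask[i][j] is True when query position i may attend to key position j:
    j must not be in the future (j <= i) and must lie within the last
    `window` positions, counting position i itself (i - j < window).
    """
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Tiny sizes so the pattern is visible; real models use thousands of tokens.
mask = sliding_window_mask(seq_len=6, window=3)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

Each row is one token and each `x` is an earlier token it can look at directly. Once the sequence is longer than the window, the oldest tokens drop out of direct view, and older information only survives indirectly by being passed along through the stacked layers.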