- I did my best to explain Sliding Window Attention briefly there, so do let me know where my explanation is deficient.
- No, you cannot set the window size, and no, it’s not in Oobabooga/text-generation-webui. The window size is trained into the model.
- Well, good luck. AMD doesn’t even support its own cards properly for AI: ROCm support skipped my last card’s generation, and the generation before it only ever had beta support. That’s why I finally gave up and switched to team green last year.
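
On the Sliding Window Attention point above, here is a rough sketch of what the window means in practice. It only builds the attention mask: each token can look at itself and the few tokens before it, and nothing older. The function name `sliding_window_mask` and the tiny sizes are made up for illustration; real models use a much larger window that is fixed at training time, which is why it isn't something you can change in a UI.

```python
def sliding_window_mask(seq_len: int, window: int) -> list[list[bool]]:
    """Return mask[i][j], True when query position i may attend to key position j.

    Illustrative only: causal attention restricted to the last `window` tokens
    (the current token counts as one of them).
    """
    return [
        [0 <= i - j < window for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Toy sizes so the pattern is visible; these are not real model settings.
mask = sliding_window_mask(seq_len=6, window=3)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

Each printed row is one token and each `x` is a position it can attend to. Once the sequence is longer than the window, the oldest positions drop out of view for the newest tokens.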
Hiya! Seen a few of your analyses but please pardon me because I haven’t seen an answer to this.
Why are you testing models on Q4_0? Isn’t Q4_K_S the same size but with a speedup and quality improvement?