Hello, this has probably been asked a bazillion times, but I can’t find an example. I have installed Stable Diffusion and LLaMA on my new PC. However, neither appears to be utilising my new RTX 4080 for generation. Generation of text or images is very slow, and GPU utilisation stays at 0-4% throughout. Any idea how this could be addressed? I am no expert, so I have no idea what I would need to change.
This is on a laptop, by the way: an NVIDIA RTX 4080 (Laptop) and a 12th Gen Intel CPU.
Thanks in advance!
I am not sure what "installing LLaMA" means here, since there are different ways of running LLaMA. But if the program you installed is supposed to utilize the GPU, it could be a CUDA issue.
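One way to narrow down a CUDA issue is to ask Python directly whether it can reach the GPU. The sketch below assumes your tools run on PyTorch, which is common for Stable Diffusion front ends but may not apply to your LLaMA setup (llama.cpp, for instance, does not use PyTorch). The function name gpu_report is made up for illustration. Run it inside the same Python environment your tool uses.

```python
import shutil


def gpu_report():
    """Collect basic facts about whether this Python environment can reach an NVIDIA GPU."""
    report = {
        # True if the NVIDIA driver's command-line tool is on PATH.
        "nvidia_smi_on_path": shutil.which("nvidia-smi") is not None,
        "torch_installed": False,
        "torch_cuda_build": None,
        "cuda_available": False,
        "device_name": None,
    }
    try:
        import torch  # Assumption: the tool being debugged is built on PyTorch.
    except (ImportError, OSError):
        return report

    report["torch_installed"] = True
    # torch.version.cuda is None when a CPU-only build of PyTorch is installed.
    report["torch_cuda_build"] = torch.version.cuda
    report["cuda_available"] = torch.cuda.is_available()
    if report["cuda_available"]:
        report["device_name"] = torch.cuda.get_device_name(0)
    return report


if __name__ == "__main__":
    for key, value in gpu_report().items():
        print(f"{key}: {value}")
```

If torch_cuda_build comes back as None, the installed PyTorch is a CPU-only build, which would be consistent with the GPU sitting near 0%. If it shows a CUDA version but cuda_available is False, the driver side is the more likely place to look.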