davidmezzetti@alien.top to LocalLLaMA · 2 years ago
RAG in a couple lines of code with txtai-wikipedia embeddings database + Mistral
DaniyarQQQ@alien.top · 2 years ago
Looks like it can work with AWQ models. Can it work with GPTQ (Exllama2) and GGUF models?

davidmezzetti@alien.top (OP) · 2 years ago
It works with GPTQ models as well; you just need to install AutoGPTQ. For GGUF models, you would need to replace the LLM pipeline with llama.cpp. See this page for more: https://huggingface.co/docs/transformers/main_classes/quantization
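The GPTQ route can be sketched as below. This is a minimal sketch, not the author's exact code: the GPTQ model id and the prompt template are assumptions, while the `neuml/txtai-wikipedia` index and the `Embeddings`/`LLM` classes come from the txtai project referenced in the post.

```python
# Sketch of the RAG flow from the post, adapted for a GPTQ model.
# Model id and prompt template are illustrative assumptions;
# assumes: pip install txtai auto-gptq

def build_prompt(question, context):
    """Assemble a simple RAG prompt from retrieved context."""
    return (
        "Answer the following question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )

def rag_answer(question):
    """Retrieve context from the txtai-wikipedia index, then generate."""
    from txtai import Embeddings, LLM  # deferred import: heavy, optional deps

    # Load the prebuilt Wikipedia embeddings index from the Hugging Face Hub
    embeddings = Embeddings()
    embeddings.load(provider="huggingface-hub", container="neuml/txtai-wikipedia")

    # With AutoGPTQ installed, a GPTQ checkpoint loads through the same
    # transformers-backed LLM pipeline (this model id is an assumption)
    llm = LLM("TheBloke/Mistral-7B-OpenOrca-GPTQ")

    # Build context from the top search hits and generate an answer
    context = "\n".join(x["text"] for x in embeddings.search(question, 3))
    return llm(build_prompt(question, context))
```

The heavy imports are deferred inside the function so the prompt-building helper can be used without txtai installed.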
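For the GGUF case, the generation step can be swapped for llama.cpp via the llama-cpp-python bindings. A minimal sketch, where the model path and generation parameters are placeholders:

```python
# Sketch: replacing the transformers LLM pipeline with llama.cpp for GGUF.
# The model path is a placeholder; assumes: pip install llama-cpp-python

def extract_text(result):
    """Pull the generated text out of a llama.cpp completion result."""
    return result["choices"][0]["text"]

def generate_gguf(prompt, model_path="mistral-7b-instruct.Q4_K_M.gguf"):
    """Run a local GGUF model with llama.cpp instead of transformers."""
    from llama_cpp import Llama  # deferred import: optional dependency

    llm = Llama(model_path=model_path, n_ctx=2048, verbose=False)
    result = llm(prompt, max_tokens=256)
    return extract_text(result)
```

The retrieval side stays the same; only the `LLM` pipeline call is replaced by `generate_gguf`.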