Yi is a series of LLMs trained from scratch at 01.AI. The models have the same architecture of Llama, making them compatible with all the llama-based ecosystems. Just in November, they released
- Base 6B and 34B models
- Models with extended context of up to 200k tokens
- Today, the Chat models
With the release, they are also releasing 4-bit quantized by AWQ and 8-bit quantized by GPTQ
- Chat model - https://huggingface.co/01-ai/Yi-34B-Chat
- Demo to try it out - https://huggingface.co/spaces/01-ai/Yi-34B-Chat
Things to consider:
- Llama compatible format, so you can use across a bunch of tools
- License is not commercial unfortunately, but you can request commercial use and they are quite responsive
- 34B is an amazing model size for consumer GPUs
- Yi-34B is at the top of the OS Leaderboard, making it a very strong base model for a chat one
Interesting. I often test these models by writing a detective story in the style of Agatha Christie because chatgpt sucks at that. This one seems to do a fair bit better. I don’t think any model rivals a human at that but this one is significantly better. A much more coherent tale. ChatGPT will eventually start going bonkers as the story continues whereas this one I feel matches at least a good high school level writer.