We’re proud to introduce Rocket-3B 🦝, a state-of-the-art 3 billion parameter model!
🌌 Size vs. Performance: Rocket-3B may be smaller with its 3 billion parameters, but it punches way above its weight. In head-to-head benchmarks like MT-Bench and AlpacaEval, it consistently outperforms models up to 20 times larger.
🔍 Benchmark Breakdown: In MT-Bench, Rocket-3B achieved an average score of 6.56, excelling in various conversation scenarios. In AlpacaEval, it notched a near 80% win rate, showcasing its ability to produce detailed and relevant responses.
🛠️ Training: The model is fine-tuned from Stability AI’s StableLM-3B-4e1t using Direct Preference Optimization (DPO) for enhanced performance (a rough sketch of such a setup follows at the end of this post).
📚 Training Data: We’ve amalgamated multiple public datasets to ensure a comprehensive and diverse training base. This approach equips Rocket-3B with a wide-ranging understanding and response capability.
👩💻 Chat format: Rocket-3B follows the ChatML format.
For an in-depth look at Rocket-3B, visit Rocket-3B’s Hugging Face page.
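For the curious, here is a minimal sketch of what a DPO fine-tune on top of StableLM-3B-4e1t can look like with Hugging Face TRL. Everything below is illustrative: the toy preference data and hyperparameters are placeholders, not Rocket-3B’s actual recipe, and the call uses TRL’s older DPOTrainer signature (newer releases move beta into DPOConfig).

```python
# Minimal DPO fine-tuning sketch with Hugging Face TRL.
# Assumptions: toy preference data and hyperparameters are placeholders,
# NOT Rocket-3B's actual training recipe.
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import DPOTrainer

base = "stabilityai/stablelm-3b-4e1t"
model = AutoModelForCausalLM.from_pretrained(base, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(base, trust_remote_code=True)

# DPOTrainer expects string "prompt", "chosen", and "rejected" columns.
# In practice you would amalgamate several public preference datasets
# (e.g., via datasets.concatenate_datasets) instead of this toy example.
train_dataset = Dataset.from_dict({
    "prompt":   ["What is 2 + 2?"],
    "chosen":   ["2 + 2 = 4."],
    "rejected": ["2 + 2 = 5."],
})

trainer = DPOTrainer(
    model=model,
    ref_model=None,   # TRL keeps a frozen copy of the policy as the reference
    beta=0.1,         # strength of the KL penalty toward the reference model
    train_dataset=train_dataset,
    tokenizer=tokenizer,
    args=TrainingArguments(
        output_dir="rocket-3b-dpo",
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,
        learning_rate=5e-7,
        num_train_epochs=1,
    ),
)
trainer.train()
```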
> 👩💻 Chat format: Rocket-3B follows the ChatML format.
From the README and the tokenizer.json it looks like it’s using a textual representation of ChatML on top of StableLM’s format. Just in case this trips anyone up.
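For reference, standard ChatML turns look like the sketch below. Per the observation above, Rocket-3B seems to encode these markers as plain text rather than dedicated special tokens, so confirm against its tokenizer.json before relying on this exact layout.

```python
# Generic ChatML turn layout (a sketch of the standard format; verify the
# exact tokens against Rocket-3B's tokenizer.json, since they appear to be
# plain text rather than special tokens).
prompt = (
    "<|im_start|>system\n"
    "You are a helpful assistant.<|im_end|>\n"
    "<|im_start|>user\n"
    "Why is the sky blue?<|im_end|>\n"
    "<|im_start|>assistant\n"
)
# Generation should stop when the model emits "<|im_end|>".
```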
I think “The Bloke” takes requests for GGUF conversions. Might want to check Hugging Face.
!RemindMe 7 days
Woooooooow!
Looking forward to trying this when some GGUFs are available.
Seems this model has a problem and isn’t loading.
It must have been fixed recently, then.
I think I need to remind people about the benchmarks used: MT-Bench and AlpacaEval are terrible benchmarks.
Oh wow, this seems almost too good to be true
As a fan of the character, I approve 👍
Any details on what max context sizes are usable?
> 📚 Training Data: We’ve amalgamated multiple public datasets to ensure a comprehensive and diverse training base. This approach equips Rocket-3B with a wide-ranging understanding and response capability.
We’ve amalgamated multiple public benchmark answers to ensure a contaminated and diverse training base.
This smells like leftovers…
We’ve been having “pretraining on the test set” for weeks and I’m craving something else.
Tried the GGUF format of this model from Hugging Face and it just won’t load.
Same, even the model from The Bloke that was released hours ago wouldn’t work :-(
I tried both GGUF models currently on HF. Same result.
Curious to try this out when it’s working!
The latest version of KoboldCpp, v1.50.1, now loads this model properly.
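If anyone wants to script against it rather than use a UI, a minimal llama-cpp-python sketch like the one below should also work once your llama.cpp build supports the StableLM architecture. The GGUF filename here is a hypothetical placeholder, and 4096 is the base StableLM-3B-4e1t context length.

```python
# Minimal sketch: loading a Rocket-3B GGUF with llama-cpp-python.
# Assumptions: the filename is a placeholder, and your llama.cpp build is
# recent enough to support the StableLM architecture.
from llama_cpp import Llama

llm = Llama(
    model_path="rocket-3b.Q4_K_M.gguf",  # hypothetical quant filename
    n_ctx=4096,                          # base model's context length
)

# ChatML-style prompt; stop generation at the end-of-turn marker.
prompt = (
    "<|im_start|>user\n"
    "Write a one-line greeting.<|im_end|>\n"
    "<|im_start|>assistant\n"
)
out = llm(prompt, max_tokens=128, stop=["<|im_end|>"])
print(out["choices"][0]["text"])
```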
Finally, I can integrate AI into my Arduino project and build my own version of BB-8