Hi. Before anyone asks: I am not behind this model in any capacity, nor was I asked by anyone behind it to post this.
I am just a normal LLM enjoyer who wants better 13B models in the near future, because at the moment they’re being run into the ground by the many Mistral 7B finetunes, and we don’t have any Mistral 13B base model…
The Model in question is this one right here, which seems to be flying under the radar for some reason:
https://huggingface.co/sequelbox/DaringFortitude
TheBloke has already done his magic on it; just search his profile on Hugging Face with Ctrl+F.
The reason I’m posting this: I honestly think it’s a really, really good and useful base model for further finetuning/merging, etc. (I did a little testing, but my machine is too weak to go any further.)
There is very little info.
It seems to be instruction-finetuned, but with what template? ChatML? There’s no mention of anything. Releasing it this way is pretty bad.
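For what it’s worth, a quick way to check whether they at least bundled a chat template with the tokenizer (just a sketch; assumes a reasonably recent transformers release that exposes `chat_template`):

```python
# Check whether the repo ships a chat template with its tokenizer.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("sequelbox/DaringFortitude")
print(tok.chat_template)  # None means nothing is bundled and you're left guessing

if tok.chat_template is not None:
    messages = [{"role": "user", "content": "Hello!"}]
    # Render the prompt as text so you can see which format the template actually produces.
    print(tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True))
```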
53 GB?
It’s in FP32 rather than FP16.
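That’s where the 53 GB comes from (13B params at 4 bytes each). If anyone wants to halve it locally, something like this should do it (untested sketch; assumes enough disk/RAM for a 13B model and a standard transformers install):

```python
# Re-save the FP32 checkpoint in FP16 to roughly halve its footprint (~53 GB -> ~26 GB).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "sequelbox/DaringFortitude"

model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.float16)  # cast weights to FP16 on load
tokenizer = AutoTokenizer.from_pretrained(repo)

model.save_pretrained("DaringFortitude-fp16")
tokenizer.save_pretrained("DaringFortitude-fp16")
```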
The model card says: “This model is primarily recommended as a superior-to-Llama-2 baseline for additional finetuning, [...]”
According to that, it’s not really supposed to compete with something like Vicuna. Sounds like they’re going for an upgraded foundational model.
What have you found it useful for? The model card is pretty vague.
Really nice. I have a dream: we need to find a way to iterate on base models so every finetune ends up closer to SOTA :D
Its average score on the Open LLM Leaderboard is 51.
I really wonder who this TheBloke is. What a legend.
I can’t speak to the quality of sequelbox/DaringFortitude, but I can wholeheartedly recommend sequelbox/StellarBright. I have been using StellarBright in some experimental 70B model merges and it’s phenomenal. I imagine 13B merges using DaringFortitude, or finetunes on top of it, would be quite good.
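If anyone wants to try 13B merges along those lines, the simplest possible version is a plain linear weight average. Treat this as a minimal sketch (a dedicated tool like mergekit gives you SLERP/TIES and more, and the second model name here is just a hypothetical placeholder):

```python
# Naive linear merge of two same-architecture Llama-2-13B checkpoints.
# Memory-hungry: both models sit in RAM at once.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_a_id = "sequelbox/DaringFortitude"        # the base discussed above
model_b_id = "some-org/llama-2-13b-finetune"    # hypothetical merge partner, same architecture
alpha = 0.5                                     # 0.5 = plain average of the two weight sets

model_a = AutoModelForCausalLM.from_pretrained(model_a_id, torch_dtype=torch.float16)
model_b = AutoModelForCausalLM.from_pretrained(model_b_id, torch_dtype=torch.float16)

state_a, state_b = model_a.state_dict(), model_b.state_dict()
# Blend each parameter tensor; do the math in FP32, then cast back to FP16.
merged = {k: (alpha * state_a[k].float() + (1 - alpha) * state_b[k].float()).half()
          for k in state_a}

model_a.load_state_dict(merged)
model_a.save_pretrained("daringfortitude-linear-merge")
AutoTokenizer.from_pretrained(model_a_id).save_pretrained("daringfortitude-linear-merge")
```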