paradigm11235@alien.top to LocalLLaMA · 1 year ago
When training an LLM, how do you decide to use a 7b, 30b, 120b, etc. model (assuming you can run them all)?
paradigm11235@alien.top (OP) · 1 year ago
I'm glad I goofed in my question, because your response was super helpful, but I now realize I was missing the terminology when I posted. I was talking about fine-tuning an existing model with a specific goal in mind (re: poetry).
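For context, "fine-tuning an existing model" in this sense usually looks something like the sketch below. This is a minimal, hedged example assuming the Hugging Face transformers/peft/datasets stack with a LoRA adapter (a common low-budget fine-tuning route); the base model name and the poems.txt dataset are hypothetical placeholders, not anything from this thread.

```python
# Minimal LoRA fine-tuning sketch (assumes transformers, peft, datasets installed).
# Base model and dataset file are hypothetical placeholders.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base = "mistralai/Mistral-7B-v0.1"  # hypothetical 7b base model
tokenizer = AutoTokenizer.from_pretrained(base)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(base)

# Wrap the base model with low-rank adapters so only a small fraction of
# parameters is trained, rather than training a model from scratch.
model = get_peft_model(model, LoraConfig(r=8, lora_alpha=16,
                                         task_type="CAUSAL_LM"))

# Hypothetical dataset: one poem per example in poems.txt.
data = load_dataset("text", data_files="poems.txt")["train"]
data = data.map(lambda x: tokenizer(x["text"], truncation=True,
                                    max_length=512), batched=True)

Trainer(
    model=model,
    args=TrainingArguments(output_dir="poetry-lora", num_train_epochs=3,
                           per_device_train_batch_size=1),
    train_dataset=data,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
).train()
```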