Using simple tree-search techniques for LLM token sampling can give better results

its_just_andy@alien.top · 1 year ago

if you’re interested in running your own models for any reason, you really should build your own evaluation dataset for the scenarios you care about.

at this point, the public benchmarks are such a mess. Do you really care whether the model you pick has the highest MMLU score? Or do you only care that it's the best-performing model for the scenarios you actually need?
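
for anyone wondering what "build your own evaluation dataset" could look like in practice, here is a minimal sketch, not a real harness. Everything in it is an assumption for illustration: the `EVAL_SET` prompts, the substring-match scoring, and the two stand-in "models" (plain functions you would swap for calls into whatever runtime you actually use).

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical eval set: (prompt, expected substring) pairs.
# Replace these with prompts taken from the scenarios you care about.
EVAL_SET: List[Tuple[str, str]] = [
    ("What is 2 + 2? Answer with a number.", "4"),
    ("Name the capital of France.", "Paris"),
]


def score_model(generate: Callable[[str], str],
                eval_set: List[Tuple[str, str]]) -> float:
    """Return the fraction of prompts whose output contains the expected text."""
    hits = sum(
        expected.lower() in generate(prompt).lower()
        for prompt, expected in eval_set
    )
    return hits / len(eval_set)


def rank_models(models: Dict[str, Callable[[str], str]],
                eval_set: List[Tuple[str, str]]) -> List[Tuple[str, float]]:
    """Score every model on the same eval set and sort best-first."""
    scores = {name: score_model(fn, eval_set) for name, fn in models.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Stand-in "models": swap these for real calls to your local models.
def model_a(prompt: str) -> str:
    return "4" if "2 + 2" in prompt else "I think it is Lyon."


def model_b(prompt: str) -> str:
    return "4" if "2 + 2" in prompt else "Paris"


ranking = rank_models({"model_a": model_a, "model_b": model_b}, EVAL_SET)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

substring matching is the crudest possible scorer. For anything open-ended you would probably want a stricter check (exact match, a regex, or a rubric), but even a few dozen prompts like this tells you more about your use case than a leaderboard number.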