• 1 Post
  • 12 Comments
Joined 1 year ago
cake
Cake day: October 28th, 2023

help-circle


  • so it sounds like for the 600b they just finetuned llama2 again with the same stuff Llama2 was trained with, just more of it…

    RefinedWeb

    Opensource code from GitHub

    Common Crawl we fine-tuned the model on a huge dataset (generated manually and with automation) for logical understanding and reasoning. We also trained the model for function calling capabilities.