← Back to context

Comment by dr_kiszonka

8 days ago

Could you explain what is your use case for training 1B models? Learning or perhaps fine tuning?

Learning, prototype and then scale it in to cloud. Also can be used as inference engine to train another model if you are using model as a judge for RL.

  • Very neat. Thanks. Since you know more about it than I, how do the lessons learned in training small models translate to scaling them up? If that is too much to explain, would you be able to recommend something to read about it?