Comment by dr_kiszonka

9 months ago

Could you explain what is your use case for training 1B models? Learning or perhaps fine tuning?

2 comments

dr_kiszonka

Learning, prototype and then scale it in to cloud. Also can be used as inference engine to train another model if you are using model as a judge for RL.

dr_kiszonka 9 months ago

Very neat. Thanks. Since you know more about it than I, how do the lessons learned in training small models translate to scaling them up? If that is too much to explain, would you be able to recommend something to read about it?