Comment by arthurcolle
1 year ago
yeah but they aren't letting you see the useful chain of thought reasoning that is crucial to train a good model. Everyone will replicate this over next 6 months
1 year ago
yeah but they aren't letting you see the useful chain of thought reasoning that is crucial to train a good model. Everyone will replicate this over next 6 months
>Everyone will replicate this over next 6 months
Not without a billion dollars worth of compute, they won't.
Are you sure its a billion? Helps with estimating the training run