← Back to context Comment by 7e 5 days ago 2 PB? They will not come close to training in on that amount. Maybe years from now. 4 comments 7e Reply sgt 5 days ago Think they will not train on the dull 2TB but use that as the data lake to start and then apply a more targeted approach. winddude 5 days ago if you read the article 2pb is available as flash storage in the data pipeline, used to dedupe, clean, normalize, etc, for training from 60pb of raw data. Den_VR 5 days ago Could probably LoRA with that huflungdung 5 days ago [dead]
sgt 5 days ago Think they will not train on the dull 2TB but use that as the data lake to start and then apply a more targeted approach. winddude 5 days ago if you read the article 2pb is available as flash storage in the data pipeline, used to dedupe, clean, normalize, etc, for training from 60pb of raw data.
winddude 5 days ago if you read the article 2pb is available as flash storage in the data pipeline, used to dedupe, clean, normalize, etc, for training from 60pb of raw data.
Think they will not train on the dull 2TB but use that as the data lake to start and then apply a more targeted approach.
if you read the article 2pb is available as flash storage in the data pipeline, used to dedupe, clean, normalize, etc, for training from 60pb of raw data.
Could probably LoRA with that
[dead]