Comment by kevinis
1 day ago
Just noticed that Tinfoil runs DeepSeek-R1 "70b". Technically, this is not the original 671B DeepSeek-R1; it's a Llama 70B model fine-tuned on outputs generated by DeepSeek-R1 (a process called "distillation").