Comment by swyx
16 hours ago
chill out, ofir does not work for anthropic. he's just saying there's inherent variability in LLMs and you need to at least 30x the samples that OP is doing in order to make any form of statistically significant conclusions.
No comments yet
Contribute on Hacker News ↗