Comment by kristopolous

15 hours ago

That M versus B is way too subtle. 0.026B is my suggestion

13 comments

kristopolous

The "M" nomenclature has been around since at least BERT and T5/FLAN. It's valid to use it even if today's LLM devs are more familiar with billion-scale models.

DrammBA 10 hours ago

I was so confused by many comments in this post but thanks to you I realized that some people are apparently reading it as 26B and that's why their comments make no sense.

HenryNdubuaku 15 hours ago

Haha, we were trying to not be hand-wavy too much :)

kristopolous 9 hours ago

Oh hey it's Henry. I met you a couple weeks ago at an event in SF. Nice to see you on here.

dymk 13 hours ago

[flagged]

dang 10 hours ago
Can you please make your substantive points without sharp elbows? We're trying for something different here, and would appreciate it if you'd post in the intended spirit.
https://news.ycombinator.com/newsguidelines.html
- dymk 9 hours ago
  
  I’d edit it if I could, but it seems to be past the timeout.
  As the other poster noted, the post wasn’t meant to be read as a personal attack
  
  1 reply →
kristopolous 13 hours ago
Pardon me, do I know you?
Why are you attacking me?
- osrec 12 hours ago
  
  I don't think they're attacking you, but suggesting you read more carefully. The information provided is correct and clear, but you need to let go of your own biases when consuming it.
  I personally prefer the M to the B. I guess as an engineer, noticing the units comes pretty naturally.
  
  1 reply →
f33d5173 13 hours ago

I read it as 26B as well.