Comment by vlovich123
21 hours ago
Took me a while to find what you were referring to by gram. Arxiv paper from 9 days ago that's not properly indexed by search engines.
(G)enerative (R)ecursive re(A)soning (M)odels. They really wanted the acronym.
21 hours ago
Took me a while to find what you were referring to by gram. Arxiv paper from 9 days ago that's not properly indexed by search engines.
(G)enerative (R)ecursive re(A)soning (M)odels. They really wanted the acronym.
I prefer GRRM but then that would imply a habit of not actually getting a final result
And then every time I ask it to hurry along it kills a Stark.
Version 8 had serious flaws and wasn't recieved well by users.
1 reply →
Thank you for the gold kind stranger.
Ouch.
As a fellow reader-in-waiting, I applaud that. GMTA :)
Claude Opus 4.8 suggests "ReGRAM", which is less bad than GRAM.
writing… (17 years)
Let's not forget about Yann LeCun's current area of research that's completely different from LLMs: Joint Embedding Predictive Architecture (JEPA)
If he gets that style to be more efficient (they're already competitive) it'll completely kill off LLMs
https://openreview.net/pdf?id=BZ5a1r-kVsf
That acronym is unacceptable. It's going to impede discussion and cause confusion for a long time if it doesn't die off immediately.
You think that's bad? I introduce you to LION, (evoLved sIgn mOmeNtum) [1]
[1] https://arxiv.org/pdf/2302.06675
Now I just hear the Voltron intro riff in my head
1 reply →
not bad although archived. have any info why?
We're still talking about "zero-shot prompt" when the saying "X-shotted" ["One-shotted the difficult maze"] was already a well-established thing in daily vernacular. So now you constantly have to readjust your brain because whenever you read "zero-shot prompt" your mind goes "uh.. a zero-try attempt is a paradox and cannot exist".
Zero-shot, one-shot, few-shot etc. refers to how many examples you have to give.
It comes about from machine learning algorithms that could pick up on patterns from a small number of examples. Few shot means only a handful of examples to recognize something. One shot means only a single example. And zero shot means no examples. Of course, you have to indicate what you want somehow, but in the case of an LLM that's the prompt. Once LLMs were trained for instruction following, you didn't have to give any examples, you could just give a prompt describing what you want, and that was a zero-shot.
1 reply →
> a zero-try attempt is a paradox and cannot exist
Have you tried applying L'Hôpital's Rule?
Zero shotting: there wasn't even an attempt.
Minus one shotting: you have to make one attempt for there to have been no attempt, and two attempts for there to have been one attempt.
2 replies →
confusing indeed. I wondered "which RAM? nvram? dram? vram? dram? now what's g-ram?"
GPU RAM, clearly. At least that's where my mind went.
3 replies →
It's great if they also introduce KILOGRAM
Yeah, look what happened to GNU
Is this the right place to do everyone's favorite copypasta?:D
It's just an acronym. It's not gonna impede anything. Think of it as just a name - you either know what it refers to or you don't, you don't understand something from it's name, or it's acronym.
It's an acronym that matches an extremely common word, making it not easily searchable.
1 reply →
I propose GRIM: Generative Recursive Indeterministic Impression Machine.
And to think, we could have had George RR Martins instead.
Speaking of things that never finish.
[flagged]
2 replies →
Just spell it GRRM but pronounce it “gram” if you have to reference it in spoken conversation.
Which will be pretty rare.
Grrm with a rolling r sounds better.
Pronounced like “groom” makes for a nice analogy with slimming down the model size too.
Random plug for Kagi, which got it for 'GRAM model llm' on the first try ;)
It is the 3rd list on Kagi when searching "gram models"
G return G