
Comment by saaaaaam

6 days ago

“Time-locked models don't roleplay; they embody their training data. Ranke-4B-1913 doesn't know about WWI because WWI hasn't happened in its textual universe. It can be surprised by your questions in ways modern LLMs cannot.”

“Modern LLMs suffer from hindsight contamination. GPT-5 knows how the story ends—WWI, the League's failure, the Spanish flu.”

This is really fascinating. As someone who reads a lot of history and historical fiction I think this is really intriguing. Imagine having a conversation with someone genuinely from the period, where they don’t know the “end of the story”.

When you put it that way, it reminds me of the Severn/Keats character in the Hyperion Cantos, where far-future AIs reconstruct historical figures from their writings in an attempt to gain philosophical insights.

  • The Hyperion Cantos is such an incredible work of fiction. I'm currently re-reading it and am midway through the fourth book, The Rise of Endymion; this series captivates my imagination, and I would often find myself idly reflecting on it and the characters within it more than a decade after reading. Like all works, it has its shortcomings, but I can give no higher recommendation than the first two books.

    • I really should re-read the series. I enjoyed it when I read it back in 2000 but it's a faded memory now.

      Without saying anything specific to spoil plot points, I will say that I ended up having a kidney stone while I was reading the last two books of the series. It was fucking eerie.

  • This isn’t science fiction anymore. The CIA is using chatbot simulations of world leaders to inform analysts. https://archive.ph/9KxkJ

  • This is such a ridiculously good series. If you haven't read it yet, I thoroughly recommend it.

I used to follow this blog (I believe it was somehow associated with Slate Star Codex?). Anyway, I remember the author used to run experiments on themselves where they spent a week or two reading only newspapers/media from a specific point in time and then wrote a post about their experiences/takeaways.

On that same note, there was this great YouTube series called The Great War. It ran from 2014 to 2018 (100 years after WW1) and followed WW1 developments week by week.

This is why the impersonation stuff is so interesting with LLMs -- if you ask ChatGPT a question without a 'right' answer, and then tell it to embody someone you really want to ask that question to, you'll get a better answer with the impersonation. Now, is this the same phenomenon that causes people to lose their minds with LLMs? Possibly. Is it really cool asking follow-up philosophy questions to the LLM Dalai Lama after reading his book? Yes.

  • Why is that cool?

    Imagine you are a billionaire, so money is no object, and you are really interested in the Dalai Lama.

    Would you read the book then hire someone to pretend to be the author and ask questions that are not covered by the book? Then be enraptured by whatever the roleplayer invents?

    Probably not? At least this isn't a phenomenon I've heard of?

This might just be the closest we get to a time machine for some time. Or maybe ever.

Every "King Arthur travels to the year 2000" kinda script is now something that writes itself.

> Imagine having a conversation with someone genuinely from the period,

Imagine not just someone, but Aristotle or Leonardo or Kant!

> This is really fascinating. As someone who reads a lot of history and historical fiction I think this is really intriguing. Imagine having a conversation with someone genuinely from the period, where they don’t know the “end of the story”.

Having the facts from the era is one thing; drawing conclusions about things it doesn't know would require intelligence.

> Imagine having a conversation with someone genuinely from the period, where they don’t know the “end of the story”.

Isn't this a basic feature of the human condition? Not only are we all unaware of the coming historical outcome (though we can guess some of the big points more or less well), but, to a varying extent, we are also largely unaware of past and present history.

LLMs are not aware, but they can be trained on a larger body of historical accounts than any human could read and regurgitate a syntactically correct summary of any point within it. A very different kind of utterer.

  • Captain Hindsight

    • Actually, this made me discover the character, thanks. I see your point and can laugh at myself. On the other hand, at least in this case I'm not claiming to have foreseen some catastrophic outcome. :)

This is definitely fascinating - if you could do AI brain surgery and selectively tune a model's knowledge and priors, you'd be able to create awesome and terrifying simulations.

  • You can't. To use your terms, you have to "grow" a new LLM. "Brain surgery" would be modifying an existing model and that's exactly what they're trying to avoid.

  • Activation steering can do that to some degree, although normally it's just one or two specific things rather than a whole set of knowledge.
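
    A minimal sketch of what that looks like in practice, assuming a small Hugging Face GPT-2 model (the layer index and steering direction below are placeholders; a real steering vector usually comes from contrasting activations on paired prompts):

      import torch
      from transformers import AutoModelForCausalLM, AutoTokenizer

      tok = AutoTokenizer.from_pretrained("gpt2")           # stand-in model
      model = AutoModelForCausalLM.from_pretrained("gpt2")

      steer = torch.randn(model.config.n_embd) * 0.1        # placeholder direction

      def add_steering(module, inputs, output):
          # GPT-2 blocks return a tuple; hidden states are the first element.
          return (output[0] + steer,) + output[1:]

      # Nudge every forward pass through one middle block during generation.
      handle = model.transformer.h[6].register_forward_hook(add_steering)
      ids = tok("In the year 1913,", return_tensors="pt")
      out = model.generate(**ids, max_new_tokens=20)
      print(tok.decode(out[0], skip_special_tokens=True))
      handle.remove()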

  • Respectfully, LLMs are nothing like a brain, and I discourage comparisons between the two, because beyond a complete difference in the way they operate, a brain can innovate, and as of this moment, an LLM cannot because it relies on previously available information.

    LLMs are just seemingly intelligent autocomplete engines, and until they figure out a way to stop the hallucinations, they aren't great either.

    Every piece of code a developer churns out using LLMs will be built from previous code that other developers have written (including both its strengths and weaknesses, btw). Every paragraph you ask it to write in a summary? Same. Every single other problem? Same. Ask it to generate a summary of a document? Don't trust it here either. [Note: expect cyber-attacks exploiting this scenario later on; it is beginning to happen -- documents made intentionally obtuse to fool an LLM into hallucinating about their contents, which leads to someone signing a contract and being conned out of millions.]

    If you ask an LLM to solve something no human has, you'll get a fabrication, which has fooled quite a few folks and caused them to jeopardize their careers (lawyers, etc.), which is why I am posting this.

    • This is the 2023 take on LLMs. It still gets repeated a lot. But it doesn’t really hold up anymore - it’s more complicated than that. Don’t let some factoid about how they are pretrained on autocomplete-like next token prediction fool you into thinking you understand what is going on in that trillion parameter neural network.

      Sure, LLMs do not think like humans and they may not have human-level creativity. Sometimes they hallucinate. But they can absolutely solve new problems that aren’t in their training set, e.g. some rather difficult problems on the last Mathematical Olympiad. They don’t just regurgitate remixes of their training data. If you don’t believe this, you really need to spend more time with the latest SotA models like Opus 4.5 or Gemini 3.

      Nontrivial emergent behavior is a thing. It will only get more impressive. That doesn’t make LLMs like humans (and we shouldn’t anthropomorphize them) but they are not “autocomplete on steroids” anymore either.


    • > LLMs are just seemingly intelligent autocomplete engines

      Well, no, they are training set statistical predictors, not individual training sample predictors (autocomplete).

      The best mental model of what they are doing might be that you are talking to a football stadium full of people, where everyone in the stadium gets to vote on the next word of the response being generated. You are not getting an "autocomplete" answer from any one coherent source, but instead a strange composite response where each word is the result of different people trying to steer the response in different directions.

      An LLM will naturally generate responses that were not in the training set, even if ultimately limited by what was in the training set. The best way to think of this is perhaps that they are limited to the "generative closure" (cf. mathematical set closure) of the training data: they can generate combinations of words and partial samples that are novel with respect to the training set, by combining statistical patterns from sources that never occurred together in the training data.
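
      To make that "stadium vote" picture concrete, here's a toy sketch (the vocabulary and scores are made up, not taken from any real model): each next word is sampled from a probability distribution shaped by the whole training set, rather than copied from any single source.

        import numpy as np

        # Hypothetical scores for candidate next words after
        # "The cat sat on the". No single training document "wrote"
        # these numbers; they are a composite over everything seen.
        vocab = ["mat", "sofa", "roof", "moon"]
        logits = np.array([3.1, 2.4, 1.0, -2.0])

        probs = np.exp(logits) / np.exp(logits).sum()   # softmax: tally the "votes"
        print(np.random.choice(vocab, p=probs))         # sample one next word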

    • Are you sure about this?

      LLMs are like a topographic map of language.

      If you have 2 known mountains (domains of knowledge) you can likely predict there is a valley between them, even if you haven’t been there.

      I think LLMs can approximate language topography based on known surrounding features so to speak, and that can produce novel information that would be similar to insight or innovation.

      I’ve seen this in our lab, or at least, I think I have.

      Curious how you see it.

    • > a brain can innovate, and as of this moment, an LLM cannot because it relies on previously available information.

      Source needed RE brain.

      Define "innovate" in a way that an LLM can't do and that we can definitively prove a human can.

    • Respectfully, you're not completely wrong, but you are making some mistaken assumptions about the operation of LLMs.

      Transformers allow for the mapping of a complex manifold representation of causal phenomena present in the data they're trained on. When they're trained on a vast corpus of human generated text, they model a lot of the underlying phenomena that resulted in that text.

      In some cases, shortcuts and hacks and entirely inhuman features and functions are learned. In other cases, the functions and features are learned to an astonishingly superhuman level. There's a depth of recursion and complexity to some things that escape the capability of modern architectures to model, and there are subtle things that don't get picked up on. LLMs do not have a coherent self, or subjective central perspective, even within constraints of context modifications for run-time constructs. They're fundamentally many-minded, or no-minded, depending on the way they're used, and without that subjective anchor, they lack the principle by which to effectively model a self over many of the long horizon and complex features that human brains basically live in.

      Confabulation isn't unique to LLMs. Everything you're saying about how LLMs operate can be said about human brains, too. Our intelligence and capabilities don't emerge from nothing, and human cognition isn't magical. And what humans do can also be considered "intelligent autocomplete" at a functional level.

      What cortical columns do is next-activation predictions at an optimally sparse, embarrassingly parallel scale - it's not tokens being predicted but "what does the brain think is the next neuron/column that will fire", and where it's successful, synapses are reinforced, and where it fails, signals are suppressed.

      Neocortical processing does the task of learning, modeling, and predicting across a wide multimodal, arbitrary-depth, long-horizon domain that allows us to learn words and writing and language and coding and rationalism and everything it is that we do. We're profoundly more data-efficient learners, and massively parallel, amazingly sparse processing allows us to pick up on subtle nuance and amazingly wide and deep contextual cues in ways that LLMs are structurally incapable of, for now.

      You use the word hallucinations as a pejorative, but everything you do, your every memory, experience, thought, plan, all of your existence is a hallucination. You are, at a deep and fundamental level, a construct built by your brain, from the processing of millions of electrochemical signals, bundled together, parsed, compressed, interpreted, and finally joined together in the wonderfully diverse and rich and deep fabric of your subjective experience.

      LLMs don't have that, or at best, only have disparate flashes of incoherent subjective experience, because nothing is persisted or temporally coherent at the levels that matter. That could very well be a very important mechanism and crucial to overcoming many of the flaws in current models.

      That said, you don't want to get rid of hallucinations. You want the hallucinations to be valid. You want them to correspond to reality as closely as possible, coupled tightly to correctly modeled features of things that are real.

      LLMs have created, at superhuman speeds, vast troves of things that humans have not. They've even done things that most humans could not. I don't think they've done things that any human could not, yet, but the jagged frontier of capabilities is pushing many domains very close to the degree of competence at which they'll be superhuman in quality, outperforming any possible human for certain tasks.

      There are architecture issues that don't look like they can be resolved with scaling alone. That doesn't mean shortcuts, hacks, and useful capabilities won't produce good results in the meantime, and if they can get us to the point of useful, replicable, and automated AI research and recursive self improvement, then we don't necessarily need to change course. LLMs will eventually be used to find the next big breakthrough architecture, and we can enjoy these wonderful, downright magical tools in the meantime.

      And of course, human experts in the loop are a must, and everything must be held to a high standard of evidence and review. The more important the problem being worked on, like a law case, the more scrutiny and human intervention will be required. Judges, lawyers, and politicians are all using AI for things that they probably shouldn't, but that's a human failure mode. It doesn't imply that the tools aren't useful, nor that they can't be used skillfully.

    • > LLMs are just seemingly intelligent autocomplete engines

      BINGO!

      (I just won a stuffed animal prize with my AI Skeptic Thought-Terminating Cliché BINGO Card!)

      Sorry. Carry on.

This is the point - a modern LLM "role playing" pre-1913 would only reflect our view today of what someone from that era would say. It would not be accurate.

Yeah, whenever we figure out time travel that will be really cool. In the meantime we have autocorrect trained on internet facts and modern textbooks that can never truly understand anything, let alone what it was like to live hundreds of years ago.

  • I get what you're saying, but the post is specifically about models that were not trained on the internet or modern textbooks.

Perhaps I'm overly sensitive to this and terminally online, but that first quote reads as a textbook LLM-generated sentence.

"<Thing> doesn't <action>, it <shallow description that's slightly off from how you would expect a human to choose>"

Later parts of the readme (a whole section of bullets enumerating what it is and what it isn't, another LLM favorite) make me more confident that significant parts of the readme are generated.

I'm generally pro-AI, but if you spend hundreds of hours making a thing, I'd rather hear your explanation of it, not an LLM's.

"...what do you mean, 'World War One?'"

  • I remember reading a children's book when I was young and the fact that people used the phrase "World War One" rather than "The Great War" was a clue to the reader that events were taking place in a certain time period. Never forgot that for some reason.

    I failed to catch the clue, btw.

    • It wouldn’t be totally implausible to use that phrase between the wars. The name “the First World War” was used as early as 1920, although not very common.

    • I seem to recall reading that as a kid too, but I can't find it now. I keep finding references to "Encyclopedia Brown, Boy Detective" about a Civil War sword being fake (instead of a Great War one), but with the same plot I'd remembered.


    • I remember that my grandmother's brother, who fought in WW1, called it simply "the war" ("sa gherra" in his dialect/language).

Reminds me of this scene from a Doctor Who episode

https://youtu.be/eg4mcdhIsvU

I’m not a Doctor Who fan, haven’t seen the rest of the episode, and don’t even know what it was about, but I thought this scene was excellent.

I was going to say the same thing. It's really hard to explain the concept of "convincing but undoubtedly pretending", yet they captured that concept so beautifully here.