Comment by xiomrze

5 hours ago

Honest question, how do you know if it's pulling from context vs from memory?

If I use Opus 4.6 with Extended Thinking (Web Search disabled, no books attached), it answers with 130 spells.

10 comments

xiomrze

petercooper 5 hours ago

One possible trick could be to search and replace them all with nonsense alternatives then see if it extracts those.

andai 5 hours ago

That might actually boost performance since attention pays attention to stuff that stands out. If I make a typo, the models often hyperfixate on it.

ozim 5 hours ago

Exactly there was this study where they were trying to make LLM reproduce HP book word for word like giving first sentences and letting it cook.

Basically they managed with some tricks make 99% word for word - tricks were needed to bypass security measures that are there in place for exactly reason to stop people to retrieve training material.

pron 4 hours ago

This reminds me of https://en.wikipedia.org/wiki/Pierre_Menard,_Author_of_the_Q... :
> Borges's "review" describes Menard's efforts to go beyond a mere "translation" of Don Quixote by immersing himself so thoroughly in the work as to be able to actually "re-create" it, line for line, in the original 17th-century Spanish. Thus, Pierre Menard is often used to raise questions and discussion about the nature of authorship, appropriation, and interpretation.
ck_one 5 hours ago
Do you remember how to get around those tricks?
- djhn 5 hours ago
  
  This is the paper: https://arxiv.org/abs/2601.02671
  Grok and Deepmind IIRC didn’t require tricks.
  
  2 replies →

ck_one 5 hours ago

When I tried it without web search so only internal knowledge it missed ~15 spells.

clanker_fluffer 5 hours ago

What was your prompt?