Comment by akomtu

6 days ago

Easy benchmark that's hard to fake: data compression. Intelligence is largely about creating compact predictive models, and so is data compression. The output should be a program that regenerates the sequence or dataset, given an entry ID or nearby data points. Typical LLM bullshit won't work here because the output isn't English prose that can fool a human: the program either reproduces the data or it doesn't.
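
A minimal sketch of how such a benchmark could be scored, under a few assumptions: the submitted program defines a function `generate(i)` that returns entry `i`, the score is the program's size in bytes, and a submission that fails to reproduce the data exactly gets no score. The names `score` and `generate`, the toy dataset, and the zlib baseline are all hypothetical, picked only for illustration.

```python
import zlib
from typing import List, Optional


def score(program_source: str, data: List[int]) -> Optional[int]:
    """Return the program's size in bytes if it regenerates `data` exactly.

    Returns None when the output differs anywhere, so a plausible-looking
    but wrong program earns nothing.
    """
    namespace: dict = {}
    # Assumption: submissions are trusted. A real harness would sandbox this.
    exec(program_source, namespace)
    generate = namespace["generate"]
    reproduced = [generate(i) for i in range(len(data))]
    if reproduced != data:
        return None
    return len(program_source.encode("utf-8"))


# Toy dataset with hidden structure: entry i is i^2 mod 97.
data = [i * i % 97 for i in range(10_000)]

# A submission that found the rule, and one that only looks reasonable.
good = "def generate(i):\n    return i * i % 97\n"
bad = "def generate(i):\n    return i * i % 96\n"

# Generic compressor as a baseline that has no model of the data.
baseline = len(zlib.compress(repr(data).encode("utf-8"), 9))

print("good program:", score(good, data), "bytes")
print("bad program: ", score(bad, data))
print("zlib baseline:", baseline, "bytes")
```

The check is mechanical: the only way to beat the generic compressor is to find the actual rule behind the data, and a near miss scores the same as nonsense.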