Comment by protocolture

4 months ago

I have always had concerns about physical robots making my work less safe in the real world.

But had never considered that a programming language might be created thats less human readable/auditable to enable LLMs.

Scares me a bit.

LLMs in their current form rely heavily on the vast amount of human data that's available, to learn from it as a first step (the second step is RL).

We're not building a language for LLMs just yet.

  • > We're not building a language for LLMs just yet.

    Working on it, actually! I think it's a really interesting problem space - being efficient on tokens, readable by humans for review, strongly typed and static for reasoning purposes, and having extremely regular syntax. One of the biggest issues with symbols is that, to a human, matching parentheses is relatively easy, but the models struggle with it.

    I expect a language like the one I'm playing with will mature enough over the next couple years that models with a knowledge cutoff around 1/2027 will probably know how to program it well enough for it to start being more viable.

    One of the things I plan to do is build evals so that I can validate the performance of various models on my as yet only partially baked language. I'm also using only LLMs to build out the entire infrastructure, mostly to see if it's possible.

    • do you expect the model to train on synthetic data or do you expect to grow a userbase that will generate organic training data?

      > One of the biggest issues with symbols is that, to a human, matching parentheses is relatively easy, but the models struggle with it.

      Great point. I find it near trivial to close parens but llms seem to struggle with the lisps ive played with because of this counting issue. To the point where ive not been working with them as much. typescript and functional js as other commentors note is usually smooth sailing.

      1 reply →

    • There's no way there's enough data for you to get a model that is anywhere as strong as mainstream languages.

      If your model is struggling with parentheses, that means it's not even the level of GPT-3 for a mainstream language.

      It's not completely impossible with in-context learning I guess, but it will still be much weaker than .. eg all of GitHub and more on Python

    • Working on it too. It's actually more like a meta language that is very token efficient.

  • It's worth asking why we haven't had the AlphaZero moment for general learning yet, where no human data is needed.

    • That's easy, AlphaZero had a perfect simulator of the world it existed in (chess, super easy), so it was insanely easy to run simulations of that world ad infinitum, and learn from it.

      It's simply not the case for the real world, you can't simulate the world perfectly and see what happens when you do things.

    • I think the issue is that for games and other closed-ended systems the criteria are very easy, so self-referential training is effective.