Comment by 100721

15 hours ago

Does anyone know why they are using language models instead of a more purpose-built statistical model? My intuition is that a language model would either be overfit, or its training data would have a lot of noise unrelated to the application and significantly drive up costs.

It's not an LLM, it is a purpose built model. https://arxiv.org/html/2411.19506v1

5 years ago we would've called it a Machine Learning algorithm. 5 years before that, a Big Data algorithm.

  • We’ve been calling neural nets AI for decades.

    > 5 years before that, a Big Data algorithm.

    The DNN part? Absolutely not.

    I don’t know why people feel the need for such revisionism but AI has been a field encompassing things far more basic than this for longer than most commenters have been alive.

    • > AI has been a field encompassing things far more basic than this for longer than most commenters have been alive.

      When I was 13, having just started programming, I picked up a book from a "junk bin" at a book store on Artificial Intelligence. It must have been from the mid-80s if not older.

      It had an entire chapter on syllogism[1] and how to implement a program to spit them out based on user input. As I recall it basically amounted to some string exteaction assuming user followed a template and string concatenation to generate the result. I distinctly recall not being impressed about such a trivial thing being part of a book on AI.

      [1]: https://en.wikipedia.org/wiki/Syllogism

      1 reply →

  • i hate that we're in this linguistic soup when it comes to algorithmic intelligence now.