Comment by optimalsolver

2 years ago

The best thing Cycorp could do now is open source its accumulated database of logical relations so it can be ingested by some monster LLM.

What's the point of all that data collecting dust and accomplishing not much of anything?

It seems the direction of flow would be the opposite: LLMs are a great source of logical data for Cyc-like things. Distill your LLM into logical statements, then run your Cyc algorithms on it.

  • > It seems the direction of flow would be the opposite: LLMs are a great source of logical data for Cyc-like things. Distill your LLM into logical statements, then run your Cyc algorithms on it.

    This is hugely problematic. If you get the premises wrong, many fallacies will follow.

    LLMs can play many roles around this area, but their output cannot be trusted with significant verification and validation.

  • LLM statements (distilled into logical statements) would not be logically sound. That's (one of) the main issues of LLMs. And that would make logical inference on these logical statements impossible with current systems.

    That's one of the principal features of Cyc. It's carefully built by humans to be (essentially) logically sound. - so that inference can then be run through the fact base. Making that stuff logically sound made for a very detailed and fussy knowledge base. And that in turn made it difficult to expand or even understand for mere civilians. Cyc is NOT simple.

    • Cyc is built to be locally consistent but global KB consistency is an impossible task. Lenat stressed that in his videos over and over.

      1 reply →

> The best thing Cycorp could do now is open source its accumulated database of logical relations...

This is unpersuasive without laying out your assumptions and reasoning.

Counter points:

(a) It would be unethical for such a knowledge base to be put out in the open without considerable guardrails and appropriate licensing. The details matter.

(b) Cycorp gets some funding from the U.S. Government; this changes both the set of options available and the calculus of weighing them.

(c) Not all nations have equivalent values. Unless one is a moral relativist, these differences should not be deemed equivalent nor irrelevant. As such, despite the flaws of U.S. values and some horrific decision-making throughout history, there are known worse actors and states. Such parties would make worse use of an extensive human-curated knowledge base.

  • An older version of the database is already available for download, but that's not the approach you want for common sense anyway, no one needs to remember that a "dog is not a cat".

    • You are probably referring to OpenCyc. It provides much more value than your comment suggests.

      I'd recommend that more people take a look and compare its approach against others. https://en.wikipedia.org/wiki/CycL is compact and worth a read, especially the concept of "microtheories".

OpenCyc is already a thing and there's been very little interest in it. These days we also have general-purpose semantic KB's like Wikidata, that are available for free and go way beyond what Cyc or OpenCyc was trying to do.

I think military will take over his work.Snowden documents reveled the cyc was been used to come up with Terror attack scenarios.