← Back to context

Comment by jwpapi

10 hours ago

Has someone verified this was an actual bug?

One of AI’s strengths is definitely exploration, f.e. in finding bugs, but it still has a high false positive rate. Depending on context that matters or it wont.

Also one has to be aware that there are a lot of bugs that AI won’t find but humans would

I don’t have the expertise to verify this bug actually happened, but I’m curious.

It's not even clear if AI was used to find the bug: they mention modeling the software with an "ai native" language, whatever that means. What is not clear is how they found themselves modeling the gyros software of the apollo code to begin with.

But, I do think their explanation of the lock acquisition and the failure scenario is quite clear and compelling.

  • > It's not even clear if AI was used to find the bug: they mention modeling the software with an "ai native" language, whatever that means.

    Could the "AI native language" they used be Apache Drools? The "when" syntax reminded me of it...

    https://kie.apache.org/docs/10.0.x/drools/drools/language-re...

    (Apache Drools is an open source rule language and interpreter to declaratively formulate and execute rule-based specifications; it easily integrates with Java code.)

  • How did you pick out AI native and miss the rest of the SAME sentence?

    > We found this defect by distilling a behavioural specification of the IMU subsystem using Allium, an AI-native behavioural specification language.

    • That does not answer my confusion, especially when static analysis could reveal the same conclusion with that language. It's not clear what role ai played at all.

      1 reply →

  • >It's not even clear if AI was used to find the bug

    It's not even clear you read the article

    • Where do you think my confusion came from? All it says is that ai assists in resolving the gyroscope lock path, not why they decided to model the gyroscope lock path to begin with.

      Please, keep your offensive comments to yourself when a clarifying comment might have sufficed.

    • Even worse, the other child comments are speculating (and didn't RTFA either) when the answer is clear in the article.

      > We found this defect by distilling a behavioural specification of the IMU subsystem using Allium, an AI-native behavioural specification language.

      3 replies →

  • > It's not even clear if AI was used to find the bug

    The intro says “We used Claude and Allium”. Allium looks like a tool they’ve built for Claude.

    So the article is about how they used their AI tooling and workflow to find the bug.

    • The article does not explain anything about how they used AI—it just has some relation with the behavioral model a human seems to have written (and an AI does not seem necessary to use!)

      1 reply →