Comment by VivaTechnics

7 months ago

Awesome!

Technically, we could say?

(1) Single-loop: fixes actions within fixed rules, like Reinforcement Learning.

(2) Double-loop: questions and adapts the rules, somewhat like Meta-Reinforcement Learning.

0 comments

VivaTechnics

No comments yet