Comment by gkolli

2 days ago

Hi! Looks pretty interesting - a few questions/thoughts:

1. Could you talk a bit more about your behavioral-training? If ace-control is trained on behavioral recordings, would it choose the most efficient path for the agent to take to complete a task? I'm guessing humans naturally take less-optimal steps.

2. What causes the huge speed increase? I'm guessing there were a lot of optimizations made, especially since this behavioral-training seems very different from vision models. I'm also guessing the model is smaller, so it's interesting that accuracy is highest. I'd be interested to see a comparison vs. 4o-mini.

3. Would be neat for it to handle instructions offline/locally - like "connect me to wifi" ;)

4. Would be cool if the agent could work in the background so I can do something else in the meantime. ;)