Comment by ilaksh

3 years ago

I assume they will release this API publicly at some point?

It's amazing the extreme levels of advantage that groups have depending on funding and connections.

The multi-modal vision support? Yes. It's just temporarily available only to BeMyEyes.

For now I'm using models like Salesforce/blip2 and OVF and Meta's Segment Anything for visual questioning.

> It's amazing the extreme levels of advantage that groups have depending on funding and connections.

It's actually not.