← Back to context

Comment by cdrini

8 months ago

Hmm, I think the hype is mainly for image editing, not generating. Although note I haven't used it! How are you testing it?

I tested it with two prompts:

// In this one, Gemini doesn't understand what "cinematic" is

"A cinematic underwater shot of a turtle gracefully swimming in crystal-clear water [...]"

// In this one, the reflection in the water in the background has different buildings

"A modern city where raindrops fall upward into the clouds instead of down, pedestrians calmly walking [...]"

Midjourney created both perfectly.

  • As others have said, this is an image editing model.

    Editing models do not excel at aesthetic, but they can take your Midjourney image, adjust the composition, and make it perfect.

    These types of models are the Adobe killer.

    • Noted that! The editing capabilities are impressive. I was excited for image gen because of the API (Midjourney doesn't have it yet).

      1 reply →

It actually has impressive image generating ability, IMO. I think the two things go hand-in-hand. Its prompt adherence can be weaker than other models, though.