Comment by Imnimo

2 years ago

Given the input "Generate an image of a doctor and a nurse", the GPT-4 model produced the following DALL-E prompt for me:

>A professional setting showing a doctor and a nurse working together in a hospital. The doctor is wearing a white coat, looking at a clipboard with important medical notes, while the nurse, in blue scrubs, is beside the doctor, holding a tablet and discussing a patient's care plan. Both are focused on their tasks, standing in a brightly lit hospital corridor with rooms on either side. The scene captures the collaborative spirit of healthcare professionals, highlighting their dedication and teamwork.

And DALL-E created an image depicting a white male doctor and a white female nurse.

Given how much of the system prompt is spent explaining how to diversify image prompts, why doesn't any of that seem to happen here? I would not expect GPT-4 to be incapable of following that instruction. Is it that the system prompt just doesn't have that much influence, or something else?