OpenAI Image Generation Tweaks #440

jeffpaul · 2023-04-25T02:35:24Z

Is your enhancement related to a problem? Please describe.

The following tweaks to our OpenAI image generation functionality would help surface the feature a bit better in the editor as well as make the user flow a bit more akin to the existing Media Library experience within the editor.

1.) Like the Media Library tab aligns the Filter media and All dates items with the Upload files horizontal border, it would be great to do the same in the Generate images tab with the Enter a prompt..., Once images..., and Enter prompt items.

2.) Change the header text of Select or Upload Media to Select, Upload, or Generate Media:

3.) Like the Select or Upload Media modal has options for Upload files, Media Library, and Generate images, let's update the core image block to have Generate image alongside the current Upload, Media Library, Insert from URL options that would automatically deep-link into the Generate images modal tab.

4.) Update the Enter prompt text field to be a larger text area so that lengthy prompt inputs can display more (most?) of the prompt before a user clicks the Generate images button.

5.) After an image(s) are generated, maintain the prompt input in the Enter prompt text area in case a user wants to generate different images by tweaking the prompt (e.g. they don't like the options generated and want more to select from). [additionally, if there's an ability in the API to create variants from a specific image result or upscale a certain image, then we should explore that via a separate feature enhancement issue]

6.) Regardless of what size image the user has set ClassifAI to generate in the plugin settings, let's only render smaller thumbnail sizes for the results so that however many are returned from OpenAI will easily display alongside eachother (and respective action buttons/links) versus now showing very large images and having to scroll significantly to view all those options.

Designs

Screenshots of existing areas for iteration are above, if any suggestions are unclear then let me know and I can hack together samples from those screenshots to try and visually express updates.

Describe alternatives you've considered

n/a

Code of Conduct

I agree to follow this project's Code of Conduct

The text was updated successfully, but these errors were encountered:

dkotter · 2023-04-26T18:52:49Z

1.) Like the Media Library tab aligns the Filter media and All dates items with the Upload files horizontal border, it would be great to do the same in the Generate images tab with the Enter a prompt..., Once images..., and Enter prompt items.

This was already done but found a bug where the CSS we rely on was only included if the IBM Watson feature was on. I've fixed that now in #441.

2.) Change the header text of Select or Upload Media to Select, Upload, or Generate Media:

I agree this would be nice but from what I can tell, there's not a clean way to modify this text. It appears this text comes from the Gutenberg MediaUpload component itself (see https://github.com/WordPress/gutenberg/blob/trunk/packages/media-utils/src/components/media-upload/index.js#L233) and I don't see any filter in place that we can use to change that text. I know we can target that with JS and modify it, though the downside there is that flash of text changing from one thing to the other. Maybe someone else will have more knowledge on if this component can easily be modified to change that title text

3.) Like the Select or Upload Media modal has options for Upload files, Media Library, and Generate images, let's update the core image block to have Generate image alongside the current Upload, Media Library, Insert from URL options that would automatically deep-link into the Generate images modal tab.

I also really like this idea, just not sure how hard it will be to achieve. I'd suggest we open this as a separate issue to investigate (perfect opportunity for someone wanting to dive more into Gutenberg).

4.) Update the Enter prompt text field to be a larger text area so that lengthy prompt inputs can display more (most?) of the prompt before a user clicks the Generate images button.

This has been changed to a textarea in #441

5.) After an image(s) are generated, maintain the prompt input in the Enter prompt text area in case a user wants to generate different images by tweaking the prompt (e.g. they don't like the options generated and want more to select from)

This is also taken care of in #441

[additionally, if there's an ability in the API to create variants from a specific image result or upscale a certain image, then we should explore that via a separate feature enhancement issue]

There is both an image edit API and image variation API that would be awesome to figure out the best way to integrate those here. I'd suggest those as a separate issue and would be ideal to get some design/UX feedback on how best to trigger that. As far as upscaling goes, no API for that and images have to be either 1024x1024, 512x512 or 256x256 and that size is currently chosen in the settings. We could look to add an inline option allowing you to upscale or downscale to one of those options if we think that would be useful

6.) Regardless of what size image the user has set ClassifAI to generate in the plugin settings, let's only render smaller thumbnail sizes for the results so that however many are returned from OpenAI will easily display alongside eachother (and respective action buttons/links) versus now showing very large images and having to scroll significantly to view all those options.

This is the same as the first point here. We had styling in place for this but the CSS wasn't loading if IBM Watson wasn't enabled. I have tweaked the styling a bit in #441 but I think this is fairly decent now. I wanted the images to be big enough that you can easily see what the image looks like but without them taking up the entire screen. Right now it's typically 4 images in a row, though does depend on your screen size.

jeffpaul · 2023-05-01T22:50:07Z

@fabiankagy could use your insight on some of the Gutenberg-related tweaks above ^ and what might be feasible already vs. what we might need to open as an issue upstream in Gutenberg first

dkotter · 2025-01-29T17:29:24Z

Reading through this issue, I think everything has either been taken care of, can't be changed due to limitations or new issues have been opened (like #723).

That said, I do think there are some additional changes we should look to make that I thought we had documented elsewhere but am not finding any issue with these details, so capturing those here.

Right now we have four options that can be set when configuring the Image Generation Feature:

Number of images to generate
Quality of generated images (standard or HD)
Size of generated images (1024x1024, 1792x1024, 1024x1792)
Style of generated images (vivid, natural)

Once set, these options are used anytime an image is generated. But there's almost certainly situations where someone would want to change these for an individual generation. For instance, maybe they normally want natural style images but in one scenario, they want a vivid image. Right now they'd have to change the global settings, generate the image, then change the settings back (and this is assuming they have access to change settings).

I propose we bring the last three settings into the generate image modal (don't have a mockup for what that would look like but feel like we can iterate on that as this is built). I don't think we need the number of images to be modified, as this has a higher impact on the cost of the request (and technically someone can just make multiple requests if needed).

So within the media modal, let's look to add options to set the quality, size and style of the image being generated, defaulting to what is set at the global level. May also make sense to add a new global setting to disable this, in case a site doesn't want individual editors having the ability to change these settings each time they generate an image (noting that image quality and size can impact the cost).

faisal-alvi · 2025-01-30T18:22:11Z

@dkotter Here is the new mockup based on the request. How does this look?

Also, let me know if I should proceed with adding the dropdowns and implementing a new global setting to enable/disable them.

dkotter · 2025-01-30T18:41:35Z

@faisal-alvi I think that's good enough to start. I could see a UI where these are hidden and we have a button or link to click to View Additional Settings or something along those lines, but showing these by default works for now. Just want to make sure the values there default to whatever is set at the global level, so someone can just leave those as-is if they don't want to change anything.

Also, let me know if I should proceed with adding the dropdowns and implementing a new global setting to enable/disable them.

Yes, I think this is good to start work on

jeffpaul added the type:enhancement label Apr 25, 2023

jeffpaul added this to the 2.2.0 milestone Apr 25, 2023

jeffpaul added this to Open Source Practice Apr 25, 2023

github-project-automation bot moved this to Incoming in Open Source Practice Apr 25, 2023

dkotter self-assigned this Apr 26, 2023

dkotter mentioned this issue Apr 26, 2023

OpenAI image generation tweaks #441

Merged

4 tasks

jeffpaul modified the milestones: 2.2.0, 2.1.0 Apr 26, 2023

dkotter closed this as completed in #441 Apr 27, 2023

github-project-automation bot moved this from Incoming to Merged in Open Source Practice Apr 27, 2023

dkotter reopened this Apr 27, 2023

github-project-automation bot moved this from Merged to In Progress in Open Source Practice Apr 27, 2023

dkotter modified the milestones: 2.1.0, 2.2.0 May 1, 2023

dkotter modified the milestones: 2.2.0, 2.3.0 May 18, 2023

dkotter removed their assignment Jun 12, 2023

dkotter modified the milestones: 2.3.0, Future Release Aug 17, 2023

jeffpaul removed the type:enhancement label Jan 7, 2025

dkotter assigned faisal-alvi Jan 29, 2025

dkotter mentioned this issue Jan 29, 2025

Add ability to modify DALL·E 3 generated images #723

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenAI Image Generation Tweaks #440

OpenAI Image Generation Tweaks #440

jeffpaul commented Apr 25, 2023

dkotter commented Apr 26, 2023

jeffpaul commented May 1, 2023

dkotter commented Jan 29, 2025

faisal-alvi commented Jan 30, 2025

dkotter commented Jan 30, 2025

OpenAI Image Generation Tweaks #440

OpenAI Image Generation Tweaks #440

Comments

jeffpaul commented Apr 25, 2023

Is your enhancement related to a problem? Please describe.

Designs

Describe alternatives you've considered

Code of Conduct

dkotter commented Apr 26, 2023

jeffpaul commented May 1, 2023

dkotter commented Jan 29, 2025

faisal-alvi commented Jan 30, 2025

dkotter commented Jan 30, 2025