text-to-image: replace nested dict by `height` and `width` properties in the input schema #1158

hanouticelina · 2025-01-31T08:58:28Z

Flattening height and width parameters for text-to-image, making the API simpler for users and making provider-specific transformations (dict/enum) easier to handle for us.

yes, It's a breaking change but I expect the usage of target_size to be really minimal so far.

Wauplin

Thanks!

Just to be sure, it means we need to do the height/width => target_size conversion for "hf-inference" provider, right? Maybe can be done in the same PR for the hf.js client (here).

hanouticelina · 2025-01-31T09:20:25Z

Just to be sure, it means we need to do the height/width => target_size conversion for "hf-inference" provider, right? Maybe can be done in the same PR for the hf.js client (here).

Not sure about that, if the pipeline we're using for hf-inference is this one : https://huggingface.co/docs/diffusers/main/en/api/pipelines/stable_diffusion/text2img, we don't need to do the conversion.

It seems that it's indeed this one that is used for text-to-image : huggingface/api-inference-community/docker_images/diffusers/app/pipelines/text_to_image.py

Wauplin · 2025-01-31T09:22:28Z

Even better! Then, there shouldn't have been a target_size in the first place?

hanouticelina · 2025-01-31T09:24:26Z

Even better! Then, there shouldn't have been a target_size in the first place?

I guess so 😄

packages/tasks/src/tasks/text-to-image/inference.ts

hanouticelina · 2025-02-03T10:03:48Z

failing CI is unrelated, it seems like there is a security check failure when trying to install node.js packages

use height and width instead of dict

69b85da

hanouticelina requested review from SBrandeis, gary149, Wauplin, julien-c, pcuenca and ngxson as code owners January 31, 2025 08:58

hanouticelina mentioned this pull request Jan 31, 2025

[InferenceClient] Better handling of task parameters huggingface/huggingface_hub#2812

Merged

Wauplin approved these changes Jan 31, 2025

View reviewed changes

coyotte508 reviewed Jan 31, 2025

View reviewed changes

packages/tasks/src/tasks/text-to-image/inference.ts Show resolved Hide resolved

keep and deprecate target_size

23888c7

coyotte508 approved these changes Jan 31, 2025

View reviewed changes

hanouticelina added 2 commits February 3, 2025 09:07

remove completely target_size

790d65e

Merge branch 'main' into update-text-to-image-input-specs

73663da

Merge branch 'main' into update-text-to-image-input-specs

efe8546

hanouticelina merged commit 48cd514 into main Feb 4, 2025
5 checks passed

hanouticelina deleted the update-text-to-image-input-specs branch February 4, 2025 10:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

text-to-image: replace nested dict by `height` and `width` properties in the input schema #1158

text-to-image: replace nested dict by `height` and `width` properties in the input schema #1158

hanouticelina commented Jan 31, 2025

Wauplin left a comment

hanouticelina commented Jan 31, 2025

Wauplin commented Jan 31, 2025

hanouticelina commented Jan 31, 2025

hanouticelina commented Feb 3, 2025

text-to-image: replace nested dict by height and width properties in the input schema #1158

text-to-image: replace nested dict by height and width properties in the input schema #1158

Conversation

hanouticelina commented Jan 31, 2025

Wauplin left a comment

Choose a reason for hiding this comment

hanouticelina commented Jan 31, 2025

Wauplin commented Jan 31, 2025

hanouticelina commented Jan 31, 2025

hanouticelina commented Feb 3, 2025

text-to-image: replace nested dict by `height` and `width` properties in the input schema #1158

text-to-image: replace nested dict by `height` and `width` properties in the input schema #1158