[InferenceClient] Better handling of task parameters #2812

hanouticelina · 2025-01-30T17:11:24Z

This PR:

Fixes some discrepancies in text-to-image parameters (particularly for Together AI).
Adds a new extra_parameters argument to text_to_image, text_to_speech and text_to_video to be able to pass a provider's unique parameters for these tasks.
Improves the test suite of inference providers.

…rs argument

src/huggingface_hub/inference/_providers/fal_ai.py

HuggingFaceDocBuilderDev · 2025-01-30T17:15:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Wauplin

Haven't tested anything locally but it looks good to me. Thanks for handling these discrepancies!

Wauplin · 2025-01-30T17:14:43Z

src/huggingface_hub/inference/_client.py

@@ -2407,7 +2405,7 @@ def text_to_image(
        scheduler: Optional[str] = None,
        target_size: Optional[TextToImageTargetSize] = None,
        seed: Optional[int] = None,
-        **kwargs,


This is a breaking change but hopefully totally fine

i'll mention this in the next release notes! but it should be fine, if users were previously using text_to_image with the HF Inference API, this shouldn't be an issue since all API parameters were exposed as explicit method arguments

this shouldn't be an issue since all API parameters were exposed as explicit method arguments

yes exactly

Wauplin · 2025-01-30T17:16:18Z

src/huggingface_hub/inference/_client.py

@@ -2443,6 +2441,9 @@ def text_to_image(
                The size in pixel of the output image
            seed (`int`, *optional*):
                Seed for the random number generator.
+            extra_parameters (`Dict[str, Any]`, *optional*):


Do we have a good example of how to use extra_parameters for a specific model on a specific provider? Would be good to add a least one example (either in text-to-image or text-to-video depending on what's best)

yep, added in 305c720

src/huggingface_hub/inference/_providers/fal_ai.py

Wauplin · 2025-01-30T17:22:31Z

tests/test_inference_providers.py

+    @pytest.mark.parametrize(
+        "helper,inputs,parameters,expected_data,expected_json",
+        [


thanks for adding these!

Wauplin

Looks good to me!

Wauplin · 2025-01-31T09:18:05Z

src/huggingface_hub/inference/_providers/fal_ai.py

@@ -134,7 +134,7 @@ def __init__(self):

    def _prepare_payload(self, inputs: Any, parameters: Dict[str, Any]) -> Dict[str, Any]:
        parameters = {k: v for k, v in parameters.items() if v is not None}
-        if "image_size" not in parameters and "width" in parameters and "height" in parameters:
+        if "width" in parameters and "height" in parameters:


what if only one if passed btw?

we should be able to send only one if specified, the other one would be set to the default value. I'll fix that

no actually for fal-ai, you either send both, or neither.

there is no default values for each one of them, according to their documentation

ok thanks for checking 👍

…sk-parameters

Wauplin

approving, though I'm not sure to understand what's happening in utils/generate_inference_types.py. I trust it we result is correct :)

hanouticelina · 2025-01-31T11:05:14Z

should be good now, let's merge!

julien-c

(nit) i think OpenAI calls this param extra_body (https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#extra-parameters or from the Python openai sdk), but i think our name is probably fine too 🤷

Wauplin · 2025-01-31T20:23:55Z

Then I'm down to go with extra_body before making a release for consistency

julien-c · 2025-02-01T12:53:12Z

up to you (i'm not sure it's exactly the same use case, but my brain is fried)

hanouticelina · 2025-02-01T13:36:05Z

It's actually the same use case, you can send an extra parameter using openai client, but i'm not sure if it's extra_body or extra_query (they expose both). Let me double check and open a PR to use the same naming on our side.

julien-c · 2025-02-01T13:37:48Z

extra_query goes into the URL query params no? (just guessing)

hanouticelina · 2025-02-01T13:54:28Z

yep, just wanted to double check in their docstrings, since it's not really documented.
Indeed, extra_body -> adds parameters to the JSON body and extra_query -> adds parameters as query string to the URL.
Let's rename extra_parameters to extra_body then!

hanouticelina added 4 commits January 30, 2025 17:57

fix discrepancies for text-to-image parameters and add extra_paramete…

ea3245e

…rs argument

revamp inference providers tests

1aa5558

nit

f8c673b

fix test

169205c

hanouticelina requested a review from Wauplin January 30, 2025 17:11

hanouticelina commented Jan 30, 2025

View reviewed changes

src/huggingface_hub/inference/_providers/fal_ai.py Outdated Show resolved Hide resolved

Wauplin approved these changes Jan 30, 2025

View reviewed changes

hanouticelina mentioned this pull request Jan 30, 2025

Add YuE (music gen) from fal.ai #2801

Merged

hanouticelina added 2 commits January 31, 2025 09:36

add examples with extra parameters

305c720

remove nested dict image size

d927057

Wauplin approved these changes Jan 31, 2025

View reviewed changes

hanouticelina added 4 commits January 31, 2025 11:39

filter out @deprecated params

57d2ae2

fix

bf96fde

Merge branch 'main' of github.com:huggingface/huggingface_hub into ta…

9817c12

…sk-parameters

fix test

958a426

Wauplin approved these changes Jan 31, 2025

View reviewed changes

fixing bugs introduced by the LLM

c537251

hanouticelina merged commit 07e1adb into main Jan 31, 2025
17 checks passed

hanouticelina deleted the task-parameters branch January 31, 2025 11:05

julien-c reviewed Jan 31, 2025

View reviewed changes

hanouticelina mentioned this pull request Feb 1, 2025

[InferenceClient] Renaming extra_parameters to extra_body #2821

Merged

julien-c mentioned this pull request Feb 3, 2025

[InferenceClient] Provide a way to deal with content-type header when sending raw bytes #2706

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[InferenceClient] Better handling of task parameters #2812

[InferenceClient] Better handling of task parameters #2812

hanouticelina commented Jan 30, 2025

HuggingFaceDocBuilderDev commented Jan 30, 2025

Wauplin left a comment •

edited

Loading

Wauplin Jan 30, 2025

hanouticelina Jan 31, 2025

Wauplin Jan 31, 2025

Wauplin Jan 30, 2025

hanouticelina Jan 31, 2025

Wauplin Jan 30, 2025

Wauplin left a comment

Wauplin Jan 31, 2025

hanouticelina Jan 31, 2025

hanouticelina Jan 31, 2025

hanouticelina Jan 31, 2025

Wauplin Jan 31, 2025

Wauplin left a comment

hanouticelina commented Jan 31, 2025

julien-c left a comment

Wauplin commented Jan 31, 2025

julien-c commented Feb 1, 2025

hanouticelina commented Feb 1, 2025

julien-c commented Feb 1, 2025

hanouticelina commented Feb 1, 2025

[InferenceClient] Better handling of task parameters #2812

[InferenceClient] Better handling of task parameters #2812

Conversation

hanouticelina commented Jan 30, 2025

HuggingFaceDocBuilderDev commented Jan 30, 2025

Wauplin left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Wauplin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Wauplin left a comment

Choose a reason for hiding this comment

hanouticelina commented Jan 31, 2025

julien-c left a comment

Choose a reason for hiding this comment

Wauplin commented Jan 31, 2025

julien-c commented Feb 1, 2025

hanouticelina commented Feb 1, 2025

julien-c commented Feb 1, 2025

hanouticelina commented Feb 1, 2025

Wauplin left a comment •

edited

Loading