(WIP) Dynamic inference provider mapping #1173

Merged 25 commits into main from dynamic-inference-provider-mapping on Feb 6, 2025

Conversation

Wauplin (Contributor) commented Feb 4, 2025

"it should work" 😄

Still a draft while https://github.com/huggingface-internal/moon-landing/pull/12398 (internal) is being merged/deployed.

The goal is to use the dynamic mapping and fall back to hardcoded model IDs if necessary (for backward compatibility).

I haven't tested anything yet, and I left some TODOs to address:

  • what to do with status: "live" | "staging"? => raise a warning
  • we need to cache the modelInfo call (only do it once at runtime)
  • how to handle a model that supports both text-generation and conversational
  • how to deal with "hf-inference" if there are no taskHints? (for now, I kept it as before)
  • tests are flaky (requires a server-side update)

EDIT: made an update to preserve the previous behavior with the hardcoded mapping. The dynamic mapping from the Hub takes precedence.
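
As a rough illustration of that intended behavior, here is a minimal sketch. All names below (resolveProviderModelId, fetchHubMapping, the cache, the endpoint shape) are hypothetical assumptions for illustration, not the actual implementation in this PR:

// Hypothetical sketch: resolve the provider-specific model id by preferring the
// dynamic mapping exposed by the Hub and falling back to the hardcoded maps.
type InferenceProvider = "fal-ai" | "replicate" | "sambanova" | "together" | "hf-inference";

interface ProviderMappingEntry {
  providerId: string;
  status: "live" | "staging";
  task: string;
}

// Cache the per-model mapping so the Hub is only queried once at runtime.
const mappingCache = new Map<string, Partial<Record<InferenceProvider, ProviderMappingEntry>>>();

async function fetchHubMapping(modelId: string): Promise<Partial<Record<InferenceProvider, ProviderMappingEntry>>> {
  const cached = mappingCache.get(modelId);
  if (cached) {
    return cached;
  }
  // Illustrative request; the real code would go through the model info API.
  const res = await fetch(`https://huggingface.co/api/models/${modelId}?expand[]=inferenceProviderMapping`);
  const info = await res.json();
  const mapping = info.inferenceProviderMapping ?? {};
  mappingCache.set(modelId, mapping);
  return mapping;
}

async function resolveProviderModelId(
  modelId: string,
  provider: InferenceProvider,
  hardcodedMappings: Partial<Record<InferenceProvider, Record<string, string>>>
): Promise<string> {
  const hubMapping = await fetchHubMapping(modelId);
  const entry = hubMapping[provider];
  if (entry) {
    if (entry.status === "staging") {
      console.warn(
        `Model ${modelId} is in staging mode for provider ${provider}. Meant for test purposes only.`
      );
    }
    // Dynamic mapping from the Hub takes precedence.
    return entry.providerId;
  }
  // Fall back to the hardcoded mapping for backward compatibility.
  const fallback = hardcodedMappings[provider]?.[modelId];
  if (fallback) {
    return fallback;
  }
  throw new Error(`Model ${modelId} is not supported for provider ${provider}.`);
}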


Review thread on the Fal AI provider mapping (diff excerpt):

type FalAiId = string;

export const FAL_AI_SUPPORTED_MODEL_IDS: ProviderMapping<FalAiId> = {
julien-c (Member):

Same comment as in https://github.com/huggingface/huggingface_hub/pull/2836/files#r1942755169: I would for now keep the mappings (but make them empty) and still support them in code.

That way we have slightly fewer breaking changes too, potentially.
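
For illustration, "keep the mappings but make them empty" could look something like the sketch below; the ProviderMapping shape shown is a guess for the sake of the example, not the library's actual type:

// Kept exported so existing imports keep working, but shipped empty:
// the dynamic mapping from the Hub becomes the source of truth.
type ModelId = string;
type FalAiId = string;
type ProviderMapping<ProviderId> = Partial<Record<string, Partial<Record<ModelId, ProviderId>>>>; // illustrative shape

export const FAL_AI_SUPPORTED_MODEL_IDS: ProviderMapping<FalAiId> = {};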

Wauplin (Contributor, Author) replied:

@julien-c I reverted to keep both the hard-coded logic and the new dynamic mapping. Mapping from the Hub takes precedence over the hardcoded one.

SBrandeis (Contributor) commented:

Depends on https://github.com/huggingface-internal/moon-landing/pull/12453 (internal)

Review thread on the staging-status warning (diff excerpt):

`Model ${params.model} is in staging mode for provider ${params.provider}. Meant for test purposes only.`
);
}
// TODO: how is it handled server-side if model has multiple tasks (e.g. `text-generation` + `conversational`)?
julien-c (Member):

Suggested change (remove this line):
// TODO: how is it handled server-side if model has multiple tasks (e.g. `text-generation` + `conversational`)?

I think this is ok @Wauplin

julien-c (Member) left a comment:

IMO this should be ready to review and to merge.

Let's try to merge quickly, because otherwise PRs will be opened to add stuff to the mappings.

SBrandeis self-assigned this on Feb 6, 2025
julien-c (Member) commented Feb 6, 2025

BG @SBrandeis


SBrandeis marked this pull request as ready for review on February 6, 2025 at 11:51
SBrandeis merged commit 38d13dd into main on Feb 6, 2025
5 checks passed
SBrandeis deleted the dynamic-inference-provider-mapping branch on February 6, 2025 at 11:52