[Inference Providers] `provider="auto"` #1390

hanouticelina · 2025-04-25T16:12:20Z

Same as huggingface/huggingface_hub#3011.

This PR adds support for auto selection of the provider. Previously the default value was hf-inference (HF Inference API provider), now we default to "auto", meaning we will select the first of the providers available for the model, sorted by the user's order in https://hf.co/settings/inference-providers.

you can test with:

import { chatCompletion } from "../src";

const res = await chatCompletion({
	// provider="auto",
	model: "deepseek-ai/DeepSeek-V3-0324",
	messages: [
		{
			role: "user",
			content: "What is the capital of France?",
		},
	],
	accessToken: process.env.HF_TOKEN,
});
console.log(res.choices[0].message.content);

Defaulting to 'auto' which will select the first provider available for the model, sorted by the user's order in https://hf.co/settings/inference-providers.
Auto-selected provider: sambanova
The capital of France is **Paris**. It is known for its iconic landmarks such as the Eiffel Tower...blabla

the selected provider should be be the first in inferenceProviderMapping mapping here: https://huggingface.co/api/models/deepseek-ai/DeepSeek-V3-0324?expand=inferenceProviderMapping

hanouticelina · 2025-04-25T16:13:23Z

packages/inference/src/lib/getInferenceProviderMapping.ts

 	return null;
 }
+
+export async function resolveProvider(


this could be done in getProviderHelper as we did for the python client, but that would make the function async and we would have to update the snippets generation as well

hanouticelina · 2025-04-25T16:17:25Z

packages/inference/src/lib/getInferenceProviderMapping.ts

 			.catch(() => null);
+
+		if (inferenceProviderMapping) {
+			inferenceProviderMappingCache.set(modelId, inferenceProviderMapping);


if provider="auto", we call fetchInferenceProviderMappingForModel 2 times: once to resolve the provider and a second time in makeRequestOptions. this cache avoids the extra HTTP call.

…face.js into auto-select-provider

julien-c

Apart from a question about the type name, lgtm!

packages/inference/src/types.ts

SBrandeis

Clean!

Wauplin

Very nice!

packages/inference/src/types.ts

julien-c · 2025-04-29T14:53:23Z

Are we sure we're not keeping the name InferenceProvider? Feels better to me honestly (and we mention in the docstring that it can be a "policy" or a "virtual provider")

Will keep diff smaller as @SBrandeis mentions too

…face.js into auto-select-provider

hanouticelina · 2025-04-30T09:17:56Z

Discussed internally, let's keep InferenceProviderOrPolicy and merge the PR!

Same as huggingface/huggingface_hub#3011. This PR adds support for auto selection of the provider. Previously the default value was `hf-inference` (HF Inference API provider), now we default to "auto", meaning we will select the first of the providers available for the model, sorted by the user's order in https://hf.co/settings/inference-providers. you can test with: ```ts import { chatCompletion } from "../src"; const res = await chatCompletion({ // provider="auto", model: "deepseek-ai/DeepSeek-V3-0324", messages: [ { role: "user", content: "What is the capital of France?", }, ], accessToken: process.env.HF_TOKEN, }); console.log(res.choices[0].message.content); ``` ``` Defaulting to 'auto' which will select the first provider available for the model, sorted by the user's order in https://hf.co/settings/inference-providers. Auto-selected provider: sambanova The capital of France is **Paris**. It is known for its iconic landmarks such as the Eiffel Tower...blabla ``` the selected provider should be be the first in `inferenceProviderMapping` mapping here: https://huggingface.co/api/models/deepseek-ai/DeepSeek-V3-0324?expand=inferenceProviderMapping

hanouticelina added 3 commits April 25, 2025 17:57

auto select provider

6ddcd18

nit

bab28f9

logging

c7383d3

hanouticelina requested a review from Wauplin April 25, 2025 16:12

hanouticelina requested review from SBrandeis and julien-c as code owners April 25, 2025 16:12

hanouticelina commented Apr 25, 2025

View reviewed changes

hanouticelina added 3 commits April 25, 2025 18:17

Merge branch 'main' into auto-select-provider

2d8e16e

fix

4a10479

Merge branch 'auto-select-provider' of github.com:huggingface/hugging…

591cdb6

…face.js into auto-select-provider

julien-c reviewed Apr 28, 2025

View reviewed changes

packages/inference/src/types.ts Outdated Show resolved Hide resolved

SBrandeis approved these changes Apr 29, 2025

View reviewed changes

Wauplin approved these changes Apr 29, 2025

View reviewed changes

packages/inference/src/types.ts Outdated Show resolved Hide resolved

hanouticelina added 2 commits April 29, 2025 15:39

rename InferenceProviderPolicy -> InferenceProviderOrPolicy

38530d0

Merge branch 'main' into auto-select-provider

83c809b

hanouticelina added 3 commits April 30, 2025 11:12

remove unnecessary log

a4bce4f

Merge branch 'auto-select-provider' of github.com:huggingface/hugging…

38a4ab4

…face.js into auto-select-provider

Merge branch 'main' into auto-select-provider

29e8706

hanouticelina merged commit eaa1b9c into main Apr 30, 2025
4 of 5 checks passed

hanouticelina deleted the auto-select-provider branch April 30, 2025 09:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Inference Providers] `provider="auto"` #1390

[Inference Providers] `provider="auto"` #1390

Uh oh!

hanouticelina commented Apr 25, 2025

Uh oh!

hanouticelina Apr 25, 2025 •

edited

Loading

Uh oh!

hanouticelina Apr 25, 2025

Uh oh!

julien-c left a comment

Uh oh!

Uh oh!

SBrandeis left a comment

Uh oh!

Wauplin left a comment

Uh oh!

Uh oh!

julien-c commented Apr 29, 2025

Uh oh!

hanouticelina commented Apr 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[Inference Providers] provider="auto" #1390

[Inference Providers] provider="auto" #1390

Uh oh!

Conversation

hanouticelina commented Apr 25, 2025

Uh oh!

hanouticelina Apr 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hanouticelina Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

julien-c left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

SBrandeis left a comment

Choose a reason for hiding this comment

Uh oh!

Wauplin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

julien-c commented Apr 29, 2025

Uh oh!

hanouticelina commented Apr 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[Inference Providers] `provider="auto"` #1390

[Inference Providers] `provider="auto"` #1390

hanouticelina Apr 25, 2025 •

edited

Loading