Added Azure AI Chat Completion Client #4723

rohanthacker · 2024-12-16T17:38:31Z

Related issue number

#4683 Adds initial support for Azure AI Chat Completion Client

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://microsoft.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

ekzhu · 2024-12-16T18:40:26Z

@yanivvak can you review this?

yanivvak · 2024-12-17T17:28:33Z

@ekzhu @rohanthacker
Great work, I tried to deploy with 3 different options offered by Azure AI inference SDK

Azure open AI - works good, I tried it with magnetic one, the websurfer got stucked, can you take a look?
Serverless - it works, but I didn't got the full answer it was phi 3.5
Managed compute - it didn't run for me, I assume it's an issue with the endpoint and it is not related to your code

yanivvak · 2024-12-17T17:38:48Z

I used this code
https://github.com/Azure-Samples/dream-team/blob/main/Magenticone_example.py

python/packages/autogen-ext/pyproject.toml

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py

ekzhu · 2024-12-18T00:00:08Z

@lspinheiro could you help reviewing this PR?

lspinheiro · 2024-12-18T03:57:46Z

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py

+
+class AzureAIChatCompletionClient(ChatCompletionClient):
+    def __init__(self, **kwargs: Unpack[AzureAIChatCompletionClientConfig]):
+        if "endpoint" not in kwargs:


I think this part could benefit from some better separation of concerns between config validation and instantiation. e.g.

class AzureAIChatCompletionClient(ChatCompletionClient): def __init__(self, **kwargs: Unpack[AzureAIClientConfiguration]): config = self._validate_config(kwargs) self._client = self._create_client(config) self._create_args = self._prepare_create_args(config) # ... @staticmethod def _validate_config(config: Mapping[str, Any]) -> AzureAIClientConfiguration: # Validation logic here return config

lspinheiro · 2024-12-18T04:50:43Z

@lspinheiro could you help reviewing this PR?

Looks quite good and is consistent with the openai client. I have a minor comment about the config validation. @jackgerrits may have more options since a lot of the design decisions here are driven by his original implementation of the openai client. If anything doesn't make since in this context he would be the best person to evaluate.

* Added: object-level usage data * Added: doc string * Added: check existing response_format value * Added: _validate_config and _create_client

ekzhu · 2025-01-13T23:20:41Z

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py

+            content = choice.message.content or ""
+
+        response = CreateResult(
+            finish_reason=finish_reason,  # type: ignore


Let's use the latest update to canonicalize the finish_reason. See OpenAIChatCompletionClient

ekzhu

@rohanthacker Is this PR ready to review? It looks good from code perspective. @srjoglekar246 could you help to use it in an assistant agent and running some teams on it to test it out?

ekzhu · 2025-01-13T23:31:24Z

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py

+
+    def __init__(self, **kwargs: Unpack[AzureAIChatCompletionClientConfig]):
+        config = self._validate_config(kwargs)
+        self._model_capabilities = config["model_capabilities"]


Model Capabilities are deprecated, use ModelInfo. See OpenAIChatCompletionClient: https://github.com/microsoft/autogen/blob/main/python/packages/autogen-ext/src/autogen_ext/models/openai/_openai_client.py#L912-L913

ekzhu · 2025-01-13T23:32:38Z

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py

+            client = AzureAIChatCompletionClient(
+                endpoint="endpoint",
+                credential=AzureKeyCredential("api_key"),
+                model_capabilities={


Use model_info as model_capabilities is deprecated.

ekzhu · 2025-01-13T23:34:09Z

@rohanthacker I made this PR ready for review.

lspinheiro · 2025-01-20T07:05:58Z

@ekzhu For context, this works for Azure AI Inference. I tested on a Phi-4 deployment I created.

from semantic_kernel import Kernel
from semantic_kernel.memory.null_memory import NullMemory
from semantic_kernel.connectors.ai.azure_ai_inference import AzureAIInferenceChatCompletion
from semantic_kernel.connectors.ai.azure_ai_inference import AzureAIInferenceChatPromptExecutionSettings
from autogen_core.models import SystemMessage, UserMessage, LLMMessage
from autogen_ext.models.semantic_kernel import SKChatCompletionAdapter


kernel = Kernel(memory=NullMemory())

execution_settings = AzureAIInferenceChatPromptExecutionSettings(
    max_tokens=100,
    temperature=0.5,
    top_p=0.9,
)

chat_completion_service = AzureAIInferenceChatCompletion(ai_model_id="Phi-4")
model_adapter = SKChatCompletionAdapter(sk_client=chat_completion_service)

messages: list[LLMMessage] = [
    SystemMessage(content="You are a helpful assistant."),
    UserMessage(content="What is 2 + 2?", source="user"),
]

azure_result = await model_adapter.create(
    messages=messages,
    extra_create_args={"kernel": kernel, "prompt_execution_settings": execution_settings},
)
print("Azure result:", azure_result.content)

ekzhu · 2025-01-22T18:55:59Z

python/packages/autogen-ext/pyproject.toml

@@ -56,6 +56,11 @@ redis = [
 grpc = [
    "grpcio~=1.62.0", # TODO: update this once we have a stable version.
 ]
+
+azure-ai-inference = [
+    "azure-ai-inference>=1.0.0b6",


This should be merged with the azure extra, because if you want to use this you need azure-identity anyway.

ekzhu linked an issue Dec 17, 2024 that may be closed by this pull request

Adding Azure AI inference #4683

Open

ekzhu reviewed Dec 17, 2024

View reviewed changes

python/packages/autogen-ext/pyproject.toml Show resolved Hide resolved

ekzhu reviewed Dec 17, 2024

View reviewed changes

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py Show resolved Hide resolved

ekzhu reviewed Dec 17, 2024

View reviewed changes

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py Show resolved Hide resolved

ekzhu reviewed Dec 17, 2024

View reviewed changes

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py Show resolved Hide resolved

ekzhu reviewed Dec 17, 2024

View reviewed changes

python/packages/autogen-ext/src/autogen_ext/models/azure/_azure_ai_client.py Show resolved Hide resolved

lspinheiro reviewed Dec 18, 2024

View reviewed changes

rohanthacker requested review from ekzhu and lspinheiro December 24, 2024 08:40

rohanthacker added 7 commits December 30, 2024 19:49

Rebase to latest main branch

ee2fe26

Moved _azure module to azure

b135d9a

Validate extra_create_args in and json response

09c071e

Added Support for Github Models

a24901c

Added normalize_name and assert_valid name

bacab86

Added Tests for AzureAIChatCompletionClient

06d3f95

WIP: Azure AI Client

daf43de

* Added: object-level usage data * Added: doc string * Added: check existing response_format value * Added: _validate_config and _create_client

rohanthacker force-pushed the feature/azure-ai-inference-client branch from d53421b to daf43de Compare December 30, 2024 14:19

Merge branch 'main' into feature/azure-ai-inference-client

5d645f1

ekzhu reviewed Jan 13, 2025

View reviewed changes

ekzhu marked this pull request as ready for review January 13, 2025 23:33

ekzhu added this to the 0.4.1 milestone Jan 13, 2025

ekzhu modified the milestones: 0.4.1, 0.4.2, 0.4.x Jan 13, 2025

ekzhu mentioned this pull request Jan 17, 2025

With current source code getting error openai.NotFoundError: Error code: 404 - {'error': {'code': '404', 'message': 'Resource not found'}} on running the example_userproxy.py #4936

Open

lspinheiro added 2 commits January 18, 2025 08:46

Merge branch 'main' into feature/azure-ai-inference-client

c35191b

Merge branch 'main' into feature/azure-ai-inference-client

a5fc2b7

lspinheiro self-assigned this Jan 20, 2025

lspinheiro added models Pertains to using alternate, non-GPT, models (e.g., local models, llama, etc.) proj-extensions labels Jan 20, 2025

ekzhu mentioned this pull request Jan 20, 2025

doc: A page in extension to show the recommended ways to integrate with non-openai models #5118

Open

Merge branch 'main' into feature/azure-ai-inference-client

a92c62e

ekzhu reviewed Jan 22, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added Azure AI Chat Completion Client #4723

Added Azure AI Chat Completion Client #4723

rohanthacker commented Dec 16, 2024 •

edited

Loading

ekzhu commented Dec 16, 2024

yanivvak commented Dec 17, 2024 •

edited

Loading

yanivvak commented Dec 17, 2024

ekzhu commented Dec 18, 2024

lspinheiro Dec 18, 2024

lspinheiro commented Dec 18, 2024

ekzhu Jan 13, 2025

ekzhu left a comment

ekzhu Jan 13, 2025

ekzhu Jan 13, 2025

ekzhu commented Jan 13, 2025

lspinheiro commented Jan 20, 2025

ekzhu Jan 22, 2025

Added Azure AI Chat Completion Client #4723

Are you sure you want to change the base?

Added Azure AI Chat Completion Client #4723

Conversation

rohanthacker commented Dec 16, 2024 • edited Loading

Related issue number

Checks

ekzhu commented Dec 16, 2024

yanivvak commented Dec 17, 2024 • edited Loading

yanivvak commented Dec 17, 2024

ekzhu commented Dec 18, 2024

lspinheiro Dec 18, 2024

Choose a reason for hiding this comment

lspinheiro commented Dec 18, 2024

ekzhu Jan 13, 2025

Choose a reason for hiding this comment

ekzhu left a comment

Choose a reason for hiding this comment

ekzhu Jan 13, 2025

Choose a reason for hiding this comment

ekzhu Jan 13, 2025

Choose a reason for hiding this comment

ekzhu commented Jan 13, 2025

lspinheiro commented Jan 20, 2025

ekzhu Jan 22, 2025

Choose a reason for hiding this comment

rohanthacker commented Dec 16, 2024 •

edited

Loading

yanivvak commented Dec 17, 2024 •

edited

Loading