From c50d0cc6056febb1f972053a5631efed5ea46da1 Mon Sep 17 00:00:00 2001 From: kthui <18255193+kthui@users.noreply.github.com> Date: Thu, 4 Apr 2024 14:55:29 -0700 Subject: [PATCH] Improve docs wording --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index c7e6e6af..d057d793 100644 --- a/README.md +++ b/README.md @@ -624,8 +624,8 @@ for more details on how to host a decoupled model. Starting from 24.04, `async def execute(self, requests):` is supported for decoupled Python models. Its coroutine will be executed by an AsyncIO event loop -shared with requests executing in a model instance. The next request for the -model instance can start executing while the current request is waiting. +shared with requests executing in the same model instance. The next request for +the model instance can start executing while the current request is waiting. This is useful for minimizing the number of model instances for models that spend the majority of its time waiting, given requests can be executed