
[bug fix][AIC-py] add missing global model settings on init #765

Merged
merged 1 commit into from
Jan 5, 2024

Conversation

jonathanlastmileai
Contributor

@jonathanlastmileai jonathanlastmileai commented Jan 5, 2024

[bug fix][AIC-py] add missing global model settings on init

Adding a prompt with a model name to a new AIConfig makes the config invalid,
because the name is effectively a dangling foreign key into the AIConfig's
global model settings mapping.

Immediate fix:

When we add prompt_1, also add a default global mapping entry for each
registered model.

Better fix: Reject updates to AIConfig that would make it invalid
(inside add_prompt). However, this could break existing code
that adds a prompt and then immediately adds the missing model, as we're doing
in this PR.

In the future, we should make AIConfig more constrained by definition
so that this kind of invalid object can't exist to begin with.

Test: run the server, type into prompt_1, and it runs.
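The dangling-reference invariant described above can be sketched with plain dicts (an illustrative model of the serialized config shape, not the actual AIConfig classes):

```python
# Plain-dict sketch of the invariant: every prompt.metadata.model string
# must resolve to a key in the aiconfig-level metadata.models mapping.

def dangling_model_refs(aiconfig: dict) -> list:
    """Return prompt model names that have no global model settings entry."""
    models = (aiconfig.get("metadata") or {}).get("models") or {}
    dangling = []
    for prompt in aiconfig.get("prompts", []):
        model = (prompt.get("metadata") or {}).get("model")
        if isinstance(model, str) and model not in models:
            dangling.append(model)
    return dangling

config = {
    "metadata": {"models": {}},
    "prompts": [{"name": "prompt_1", "metadata": {"model": "gpt-4"}}],
}
print(dangling_model_refs(config))  # the "gpt-4" reference is dangling

# The immediate fix in this PR: register a default global entry for the model.
config["metadata"]["models"]["gpt-4"] = {"model": "gpt-4"}
print(dangling_model_refs(config))  # no dangling references remain
```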

@rossdanlm
Contributor

> Better fix: Reject updates to AIConfig that would make it invalid (inside add_prompt). However, this could break existing code that adds a prompt and then immediately adds the missing model, like we're doing in this PR.

Forcing the user to explicitly define the model at the aiconfig level is kind of unintuitive and not easy to do if you aren't familiar with the AIConfig schema.

> In the future, we should make AIConfig more constrained by definition so that this kind of invalid object can't exist to begin with.

This isn't strictly possible. Ex: if in the editor we want to choose a new model for a prompt that isn't defined in the aiconfig-level metadata.models field (which this diff fixed), there's no way of having it pre-defined, other than forcing the user to enter it manually, which as I mentioned earlier isn't a good UI experience.

For me the best solution would be to modify this in update_model: when the model name changes, we should:

  1. Add the new model info to the aiconfig-level settings IF it's not already defined
  2. Remove the old model from the aiconfig-level settings IF no other prompts use it

1 is necessary; 2 is nice to have. Either way, this diff unblocks us for the editor (except for custom model parsers), but I will mark this as P1. Do you want me to own this followup (post-MVP) or do you want to do it?

Extra, beyond scope of this diff:

There are also a lot of "initial state" considerations, but for now I think this diff is good to get us unblocked. Ex: instead of initializing every model, I think we should go through each initial prompt, make a list of all used models, and add those if they're not defined (and if they're among the valid core models we support). We could also handle the case where no prompt has been created yet by defaulting the first prompt to gpt-3 and adding that model as well (so yes, you're right that we'd probably need some custom functionality in add_prompt to check for this edge case).
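The two update_model steps proposed above could be sketched with plain dicts (a hypothetical helper, not the SDK's actual update_model implementation):

```python
def update_model(aiconfig: dict, prompt_name: str, new_model: str) -> None:
    """Sketch of the proposed behavior: repoint a prompt at new_model,
    adding/removing aiconfig-level model entries as needed."""
    models = aiconfig["metadata"].setdefault("models", {})
    prompts = aiconfig["prompts"]
    target = next(p for p in prompts if p["name"] == prompt_name)
    old_model = target["metadata"].get("model")

    # Step 1 (necessary): add the new model's global settings if missing.
    if new_model not in models:
        models[new_model] = {"model": new_model}
    target["metadata"]["model"] = new_model

    # Step 2 (nice to have): drop the old entry if no other prompt uses it.
    still_used = any(p["metadata"].get("model") == old_model for p in prompts)
    if old_model is not None and not still_used:
        models.pop(old_model, None)

config = {
    "metadata": {"models": {"gpt-4": {"model": "gpt-4"}}},
    "prompts": [{"name": "prompt_1", "metadata": {"model": "gpt-4"}}],
}
update_model(config, "prompt_1", "gpt-3.5-turbo")
print(config["metadata"]["models"])  # gpt-4 removed, gpt-3.5-turbo added
```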

Contributor

@rossdanlm rossdanlm left a comment
Nice, big unblocker!

@jonathanlastmileai jonathanlastmileai merged commit 1625d10 into main Jan 5, 2024
2 checks passed
if aiconfig_runtime.metadata.models is None:
    aiconfig_runtime.metadata.models = {}
if model_id not in aiconfig_runtime.metadata.models:
    aiconfig_runtime.add_model(model_id, {"model": model_id})
Contributor

Isn't this going to put a "model" key into the completion params for all models by default now? For example, for HuggingFaceTextGenerationParser the settings would contain {"model": "HuggingFaceTextGenerationParser"} in this case. I don't think that's what we want, right?

Isn't this issue only a problem for those model parsers that actually support / need the 'model' in their completion params? If so, I think we should resolve this at the model parser level.

e.g. for openai deserialize:

model_settings = self.get_model_settings(prompt, aiconfig) or {"model": aiconfig.get_model_name(prompt)}

cc @jonathanlastmileai , @rossdanlm
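The parser-level idea can be sketched generically (hypothetical helper and flag, not the actual model parser API): only parsers whose completion params actually need a "model" key would inject it.

```python
# Sketch of resolving completion params at the model parser level:
# the "model" key is injected only for parsers that need it.

def get_completion_params(prompt_settings, model_name, needs_model_key):
    """Merge per-prompt settings, falling back to the model name only
    when this parser's underlying API expects a "model" key."""
    settings = dict(prompt_settings or {})
    if needs_model_key and "model" not in settings:
        settings["model"] = model_name  # e.g. OpenAI-style parsers
    return settings

print(get_completion_params(None, "gpt-4", needs_model_key=True))
# A HuggingFaceTextGenerationParser-style parser would pass
# needs_model_key=False and never get a {"model": ...} entry injected.
```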

Contributor Author

Right, good point. I'm a bit confused again. Let's start from here: let's write a minimal example of a valid serialized AIConfig for GPT-4, and another valid serialized AIConfig for a model that doesn't put a "model" key into the underlying API call - maybe HF text generation. Let's get crystal clear on why each one is valid and why the alternatives would be invalid.

GPT4

{
  "metadata": {
    "models": {
      "gpt-4": {
        "model": "gpt-4"
      }
    }
  },
  "prompts": [
    {
      "name": "get_activities",
      "input": "Tell me 10 fun attractions to do in NYC.",
      "metadata": {
        "model": "gpt-4"
      }
    }
  ]
}

I think this is valid because the string-valued prompt.metadata.model points to an existing metadata.models key whose value is a dict.

This is an invalid variant of the config above, and is what we were seeing before:

{
  "metadata": {
    "models": {
    }
  },
  "prompts": [
    {
      "name": "get_activities",
      "input": "Tell me 10 fun attractions to do in NYC.",
      "metadata": {
        "model": "gpt-4"
      }
    }
  ]
}

Right so far?

Contributor Author

Assuming that's correct, here's something I'm not clear on: what is a valid and invalid equivalent for another model e.g. HuggingFaceTextGenerationParser ?

By the design of AIConfig, shouldn't it just be the same as for GPT4 but with the name changed?
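Presumably yes. As a hedged illustration (the entry contents here are assumptions, not this parser's actual defaults), the equivalent config could keep the global settings entry empty, since this parser doesn't pass a "model" key to the underlying API; validity only requires that the referenced key exist:

```json
{
  "metadata": {
    "models": {
      "HuggingFaceTextGenerationParser": {}
    }
  },
  "prompts": [
    {
      "name": "get_activities",
      "input": "Tell me 10 fun attractions to do in NYC.",
      "metadata": {
        "model": "HuggingFaceTextGenerationParser"
      }
    }
  ]
}
```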

@jonathanlastmileai jonathanlastmileai deleted the pr765 branch January 5, 2024 15:30
@jonathanlastmileai
Contributor Author

> > Better fix: Reject updates to AIConfig that would make it invalid (inside add_prompt). However, this could break existing code that adds a prompt and then immediately adds the missing model, like we're doing in this PR.
>
> Forcing the user to explicitly define the model at the aiconfig level is kind of unintuitive and not easy to do if you aren't familiar with the AIConfig schema.
>
> > In the future, we should make AIConfig more constrained by definition so that this kind of invalid object can't exist to begin with.
>
> This isn't strictly possible. Ex: if in the editor we want to choose a new model for a prompt that isn't defined in the aiconfig-level metadata.models field (which this diff fixed), there's no way of having it pre-defined, other than forcing the user to enter it manually, which as I mentioned earlier isn't a good UI experience.
>
> For me the best solution would be to modify this in update_model: when the model name changes, we should:
>
>   1. Add the new model info to the aiconfig-level settings IF it's not already defined
>   2. Remove the old model from the aiconfig-level settings IF no other prompts use it
>
> 1 is necessary; 2 is nice to have. Either way, this diff unblocks us for the editor (except for custom model parsers), but I will mark this as P1. Do you want me to own this followup (post-MVP) or do you want to do it?
>
> Extra, beyond scope of this diff:
>
> There are also a lot of "initial state" considerations, but for now I think this diff is good to get us unblocked. Ex: instead of initializing every model, I think we should go through each initial prompt, make a list of all used models, and add those if they're not defined (and if they're among the valid core models we support). We could also handle the case where no prompt has been created yet by defaulting the first prompt to gpt-3 and adding that model as well (so yes, you're right that we'd probably need some custom functionality in add_prompt to check for this edge case).

Let's P1 this discussion here https://docs.google.com/document/d/1YwUlIQZX8CiuvtbVR4OvpFQ2nphFITRD_XZR3CDNtoE/edit#bookmark=id.ipu45nan09yk

saqadri added a commit that referenced this pull request Jan 5, 2024
Revert #765 to fix properly in the AIConfig SDK
@saqadri saqadri mentioned this pull request Jan 5, 2024
saqadri added a commit that referenced this pull request Jan 5, 2024
Revert #765

Revert #765 to fix properly in the AIConfig SDK