
Improve input mismatch error for inference requests (DLIS-6165) #330

Merged: 25 commits into main on Mar 9, 2024

Conversation

indrajit96
Contributor

Created 2 new strings to improve the verbosity of the error response.

Scenario: When an inference request has fewer input parameters than the model expects, the error response is insufficient.
Fix: Added 2 new strings to the error response.

Old Behavior:
[request id: <id_unknown>] expected 5 inputs but got 1 inputs for model 'llama2-7b'

New Behavior:
[request id: <id_unknown>] expected 5 inputs but got 1 inputs for model 'llama2-7b'. Got inputs ['prompt'], but missing ['max_output_token', 'top_k', 'top_p', 'temperature']"

We have added text to the existing response so that existing tests do not break.
A test case for this response is in progress and will be added later.
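The new detail lists which required inputs were present on the request and which are absent. A minimal sketch of that logic in C++ (hypothetical helper, not the actual infer_request.cc implementation; name ordering here is alphabetical via std::set for simplicity, which may differ from the server's output):

```cpp
#include <set>
#include <sstream>
#include <string>

// Hypothetical helper (not the actual Triton code): format the extra
// detail appended to the input-count mismatch error, listing which
// inputs were received and which required inputs are missing.
std::string
InputMismatchDetails(
    const std::set<std::string>& expected_inputs,
    const std::set<std::string>& received_inputs)
{
  // Render a set of input names as ['a', 'b', ...].
  auto join = [](const std::set<std::string>& names) {
    std::ostringstream out;
    out << "[";
    bool first = true;
    for (const auto& name : names) {
      if (!first) {
        out << ", ";
      }
      out << "'" << name << "'";
      first = false;
    }
    out << "]";
    return out.str();
  };

  // Any expected input not present on the request is missing.
  std::set<std::string> missing;
  for (const auto& name : expected_inputs) {
    if (received_inputs.find(name) == received_inputs.end()) {
      missing.insert(name);
    }
  }

  return "Got inputs " + join(received_inputs) + ", but missing " +
         join(missing);
}
```

This string would be appended to the existing "expected N inputs but got M inputs" message, which is why current tests matching on the original prefix keep passing.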

@indrajit96 indrajit96 requested a review from rmccorm4 February 28, 2024 00:06
@rmccorm4 rmccorm4 requested review from kthui and lkomali March 1, 2024 23:36
@rmccorm4 rmccorm4 changed the title DLIS-6165 Improve input mismatch error for inference requests Improve input mismatch error for inference requests (DLIS-6165) Mar 1, 2024
@lkomali
Contributor

lkomali commented Mar 2, 2024

The message could be improved for clarity and readability, to something like the one below.

[request ID: <id_unknown>] Error: Expected 5 inputs for model 'llama2-7b', but received only 1 input(s).
Received inputs: ['prompt']
Missing inputs: ['max_output_token', 'top_k', 'top_p', 'temperature']
Please provide all required inputs.

@indrajit96
Contributor Author

The message can be improved for better clarity and readability to something similar to the one below.

[request ID: <id_unknown>] Error: Expected 5 inputs for model 'llama2-7b', but received only 1 input(s). Received inputs: ['prompt'] Missing inputs: ['max_output_token', 'top_k', 'top_p', 'temperature'] Please provide all required inputs.

Fixed; made the message more verbose.

@kthui
Contributor

kthui commented Mar 8, 2024

Please update the copyright year in both files, because this change is more than fixing a typo. E.g., in infer_request.h, the copyright should start with

// Copyright 2020-2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.

same for infer_request.cc

@indrajit96
Contributor Author

Please update the copyright year in both files, because this change is more than fixing a typo. E.g., in infer_request.h, the copyright should start with

// Copyright 2020-2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.

same for infer_request.cc

Fixed for all files.

Contributor

@kthui kthui left a comment


LGTM!

Contributor

@rmccorm4 rmccorm4 left a comment


Looks great!

@indrajit96 indrajit96 merged commit 2bd46a4 into main Mar 9, 2024
1 check passed
Labels: None yet
4 participants