Update utils.py #1012

ZihanWang314 · 2023-11-20T11:41:03Z

Update compute_accuracy to handle the case where str_chosen and str_rej receive the same score, which is probably not what the developers want.

HuggingFaceDocBuilderDev · 2023-11-20T12:56:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

lvwerra · 2023-11-20T15:50:13Z

Thanks for the PR! Can you elaborate a bit more on why this is bad and what side effects it has?

ZihanWang314 · 2023-11-20T15:58:25Z

When a model gives exactly the same score to the sentence to accept and the sentence to reject, the current evaluation metric counts this pair as "correct". However, this scenario is usually caused either by the model having problems encoding sentences (e.g. being over-quantized) or by the two sentences being exactly the same, and we expect neither. I think emitting a warning here and defining this pair as "wrong" or "random" would help developers notice these problems.
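To illustrate the tie case, here is a minimal sketch. It assumes the metric picks the preferred sentence with `np.argmax` over a two-column score array (column 0 for the sentence to accept, column 1 for the sentence to reject) and compares the result against all-zero labels. The function name `argmax_accuracy` and that layout are assumptions for illustration, not taken from `utils.py`.

```python
import numpy as np


def argmax_accuracy(scores, labels):
    # Hypothetical stand-in for the metric: pick the higher-scoring column per row.
    predicted = np.argmax(scores, axis=1)
    return float((predicted == labels).mean())


# Row 0: accepted sentence wins. Row 1: exact tie. Row 2: rejected sentence wins.
scores = np.array([[2.0, 1.0], [0.5, 0.5], [0.1, 0.9]])
labels = np.zeros(len(scores), dtype=int)

# np.argmax returns the first index when values are equal,
# so the tied row is counted as "correct" under this layout.
tie_accuracy = argmax_accuracy(scores, labels)
print(tie_accuracy)
```

Under these assumptions, two of the three pairs count as correct even though only one of them was actually ranked correctly.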

lvwerra · 2023-11-21T07:55:04Z

I see, thanks! In that case I'd prefer to just add a warning but leave the computation logic the same. Does that make sense?

ZihanWang314 · 2023-11-21T07:57:17Z

Sure, it makes sense. Thanks for your suggestions!

Updated so that only the warning is kept.
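A warning-only version along the lines agreed in this thread might look like the following sketch. The `(predictions, labels)` input layout, the message wording, and the name `compute_accuracy_with_tie_warning` are assumptions made for illustration; the code merged into `trl/trainer/utils.py` may differ.

```python
import warnings

import numpy as np


def compute_accuracy_with_tie_warning(eval_pred):
    # Assumed layout: predictions has shape (n, 2), one column per sentence.
    predictions, labels = eval_pred
    predictions = np.asarray(predictions)

    # Count pairs where both sentences received exactly the same score.
    num_ties = int((predictions[:, 0] == predictions[:, 1]).sum())
    if num_ties > 0:
        warnings.warn(
            f"{num_ties} of {len(predictions)} pairs have equal scores for both "
            "sentences; the reported accuracy may be misleading."
        )

    # The computation itself is unchanged: ties still resolve to column 0.
    predicted = np.argmax(predictions, axis=1)
    accuracy = float((predicted == np.asarray(labels)).mean())
    return {"accuracy": accuracy}
```

This keeps the reported number identical to the version without the warning, so existing results stay comparable, while still surfacing the tie problem to the developer.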

* Update utils.py

  update compute_accuracy to deal with the cases where str_chosen and str_rej got the same scores, which is probably what the developers don't want

* Update utils.py

  updated so only warning is reserved

* Update trl/trainer/utils.py

  Co-authored-by: Leandro von Werra <[email protected]>

---------

Co-authored-by: Leandro von Werra <[email protected]>

Update utils.py

c29e00c

update compute_accuracy to deal with the cases where str_chosen and str_rej got the same scores, which is probably what the developers don't want

Update utils.py

c2f0015

updated so only warning is reserved

lvwerra reviewed Nov 29, 2023

trl/trainer/utils.py (outdated, resolved)

Update trl/trainer/utils.py

7c5c29d

Co-authored-by: Leandro von Werra <[email protected]>

lvwerra approved these changes Nov 29, 2023

lvwerra merged commit 4b67af3 into huggingface:main Nov 29, 2023
9 checks passed
