For some unpublished work, I have decomposed improvements into this waterfall. Please let me know if this would make sense to include.
Suppose:
Model M, trained on X_train, classifies X_test with 90% top-1 accuracy over k=[10] classes.
Attack A on M generates X_test_adv.
M classifies X_test_adv with 1% top-1 accuracy.
Model M_defended applies a defense (e.g. adversarial retraining) and achieves 30% top-1 accuracy on X_test_adv.
Model M_seed is identical to M except for a different random seed during training.
M_seed achieves nearly identical performance to M on X_test.
M_seed achieves 15% accuracy on X_test_adv due to imperfect transferability of attacks.
Decomposition:
Random Guess: A trivial model that randomly guesses on k=[10] classes would achieve 1/k = [10%] accuracy. The first 9% of the improvement (from 1% to 10%) is attributable to a TRIVIAL RANDOM-GUESSING MODEL. This is not a very impressive part of the improvement in accuracy.
Different training seed: M_seed achieves 15% on X_test_adv. The next 5% of improvement (from 10% to 15%) is attributable to NON-TRANSFERABILITY. This is also unimpressive.
Defense: M_defended achieves 30% on X_test_adv. Only the final 15% of improvement (from 15% to 30%) can be attributed to the defense; the sketch below illustrates the arithmetic.
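For concreteness, here is a minimal sketch of the waterfall arithmetic. The accuracy values are the hypothetical numbers from the example above, not measurements; in practice they would come from evaluating M, M_seed, and M_defended on X_test_adv.

```python
# Hypothetical accuracies from the example above (illustrative only).
k = 10                      # number of classes
acc_attacked = 0.01         # M on X_test_adv
acc_random = 1.0 / k        # trivial random-guessing baseline
acc_seed = 0.15             # M_seed on X_test_adv (non-transferability)
acc_defended = 0.30         # M_defended on X_test_adv

# Each step of the waterfall credits only the accuracy gained beyond the
# previous, weaker baseline.
waterfall = [
    ("random guessing",     acc_random   - acc_attacked),  # +9%
    ("non-transferability", acc_seed     - acc_random),    # +5%
    ("defense",             acc_defended - acc_seed),      # +15%
]

for source, delta in waterfall:
    print(f"{source:>20}: +{delta:.0%}")
```

Under these numbers, the defense itself accounts for only half of the 29-point gap between the attacked model and the defended model.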