Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Decomposing improvements in accuracy #24

Open
yaoshiang opened this issue Jan 8, 2020 · 1 comment
Open

Decomposing improvements in accuracy #24

yaoshiang opened this issue Jan 8, 2020 · 1 comment

Comments

@yaoshiang
Copy link

yaoshiang commented Jan 8, 2020

For some unpublished work, I have decomposed improvements into this waterfall. Please let me know if this would make sense to include.

Suppose:

  • Model M trained on X_train classifies data X_test with 90% top-1 accuracy with k=[10] classes.

  • Attack A on M generates X_test_adv.

  • M classifies X_test_adv with 1% top-1 accuracy.

  • Model M_defended applies a defense (e.g. adversarial retraining) and achieve 30% top-1 accuracy.

  • Model M_seed is identical to M other than a different random seed during training

  • M_seed achieves nearly identical performance to M on x_test.

  • M_seed achieves 15% accuracy on X_test_adv due to imperfect transferability of attacks.

Decomposition:

Random Guess: A trivial model that randomly guesses on k=[10] classes would achieve 1/k = [10%] accuracy. The first 9% of accuracy is attributable to a TRIVIAL RANDOM-GUESSING MODEL. This is not a very impressive part of the improvement in accuracy.

Different training seed: M_seed achieves 15% on x_test_adv. The next 5% of improved accuracy is attributable to NON-TRANSFERABILITY. This is also unimpressive.

Defense: M_defended achieves 30% on x_test_adv. Only the final 15% of improvement can be attributed to the defense.

Screen Shot 2020-01-08 at 10 52 27 AM

@carlini
Copy link
Member

carlini commented Feb 25, 2021

This is an excellent idea. I've seen a few papers do this in the past year or two and we should definitely include something like this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants