This is joint work by Binglin Zhou, Linhao Zhong, and Wentao Chen. More details can be found in our paper and project pages.
This work is a course project for CS348: Computer Vision (Fall 2023) at SJTU. Our paper has also been accepted by ICMSSP 2024.
Dataset distillation, a pragmatic approach in machine learning, aims to create a smaller synthetic dataset from a larger existing dataset. However, existing distillation methods primarily adopt a model-based paradigm, in which the synthetic dataset inherits model-specific biases, limiting its generalizability to other models. To address this constraint, we propose a novel methodology termed the "model pool": during the distillation process, models are selected from a diverse pool according to a specific probability distribution. We further integrate the model pool with the established knowledge distillation approach, applying knowledge distillation when evaluating the distilled dataset. Our experimental results validate the effectiveness of the model pool approach across a range of existing models at test time, demonstrating superior performance compared to existing methods.
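To illustrate the core idea, the snippet below sketches the per-iteration model sampling used during distillation. This is a minimal sketch assuming PyTorch-style training code; the pool entries and probability weights are illustrative assumptions, not the exact settings from our experiments.

```python
import random

# Candidate architecture settings: the main ConvNet plus randomized variants
# (see the configuration tables below).  The entries and weights here are
# illustrative assumptions.
MODEL_POOL = [
    {"act": "relu",      "norm": "instancenorm", "pool": "avgpooling"},  # main model
    {"act": "sigmoid",   "norm": "batchnorm",    "pool": "maxpooling"},
    {"act": "leakyrelu", "norm": "layernorm",    "pool": "avgpooling"},
    {"act": "swish",     "norm": "groupnorm",    "pool": "maxpooling"},
]
POOL_PROBS = [0.7, 0.1, 0.1, 0.1]  # the main model is drawn most often

def sample_pool_config():
    """Pick one architecture configuration for the current distillation step."""
    return random.choices(MODEL_POOL, weights=POOL_PROBS, k=1)[0]

# Each outer iteration of the distillation loop instantiates a fresh network
# from sample_pool_config(), so the synthetic images are matched against many
# architectures instead of inheriting the biases of a single fixed model.
```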
We test two methods for dataset distillation on CIFAR-10: the model pool and knowledge distillation. The configuration of the main model is shown below:
| Hyperparameter | Setting |
| --- | --- |
| net width | 128 |
| net depth | 3 |
| net activation layer | ReLU |
| net normalization layer | InstanceNorm |
| net pooling layer | AvgPooling |
The other random ConvNets in the model pool are sampled from the following candidate settings:
| Hyperparameter | Candidate settings |
| --- | --- |
| net width | 128 |
| net depth | 3 |
| net activation layer | ReLU, Sigmoid, LeakyReLU, Swish |
| net normalization layer | InstanceNorm, BatchNorm, LayerNorm, GroupNorm |
| net pooling layer | AvgPooling, MaxPooling |
Each random ConvNet in the pool keeps the same width and depth as the main model but draws its activation, normalization, and pooling layers from the candidates above; a minimal sketch of this sampling follows.
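For concreteness, here is a minimal sketch, assuming PyTorch, of how one random ConvNet could be assembled from the options above. The kernel size, the GroupNorm group count, and the channel-wise GroupNorm standing in for layer norm are assumptions for illustration, not necessarily our exact implementation.

```python
import random

import torch
import torch.nn as nn

ACTS  = {"relu": nn.ReLU, "sigmoid": nn.Sigmoid,
         "leakyrelu": nn.LeakyReLU, "swish": nn.SiLU}
POOLS = {"avgpooling": nn.AvgPool2d, "maxpooling": nn.MaxPool2d}

def make_norm(kind: str, channels: int) -> nn.Module:
    if kind == "instancenorm":
        return nn.InstanceNorm2d(channels, affine=True)
    if kind == "batchnorm":
        return nn.BatchNorm2d(channels)
    if kind == "groupnorm":
        return nn.GroupNorm(4, channels)   # group count of 4 is an assumption
    return nn.GroupNorm(1, channels)       # channel-wise norm standing in for layernorm

def random_convnet(width: int = 128, depth: int = 3,
                   in_channels: int = 3, num_classes: int = 10) -> nn.Module:
    """Build one ConvNet whose activation, normalization, and pooling layers
    are sampled uniformly from the candidate lists in the table above."""
    act  = random.choice(list(ACTS))
    norm = random.choice(["instancenorm", "batchnorm", "layernorm", "groupnorm"])
    pool = random.choice(list(POOLS))

    layers, c_in = [], in_channels
    for _ in range(depth):
        layers += [nn.Conv2d(c_in, width, kernel_size=3, padding=1),
                   make_norm(norm, width), ACTS[act](), POOLS[pool](2)]
        c_in = width
    # For 32x32 CIFAR-10 inputs, three 2x2 poolings leave a 4x4 feature map.
    layers += [nn.Flatten(), nn.Linear(width * 4 * 4, num_classes)]
    return nn.Sequential(*layers)

net = random_convnet()
print(net(torch.randn(2, 3, 32, 32)).shape)  # torch.Size([2, 10])
```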