
[RL agent] Study the impact of model scaling and model architecture on overall performance #25

Open
3rdCore opened this issue Feb 15, 2022 · 0 comments
Labels
help wanted Extra attention is needed

Comments

@3rdCore
Collaborator

3rdCore commented Feb 15, 2022

The current RL agent uses a very simple GNN + MLP architecture. From the supremacy of ResNet to the advent of autoregressive Transformers, recent work in both natural language processing and image processing has shown the benefits of extremely large architectures, sometimes with more than a billion parameters.

To what extent do these conclusions apply to SeaPearl? Why does it need so little CPU computing power to achieve good results? It would be very interesting to study, for a given CP problem, the performance of different deep-NN architectures, and more broadly the impact of different scaling parameters: the input data size, the model size, and the available computing power.
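As a starting point for such a scaling study, it helps to see how the parameter count of a GNN + MLP stack grows with its hyperparameters. The sketch below is a hypothetical back-of-the-envelope model (not SeaPearl's actual architecture, which is written in Julia): each GNN layer is counted as one dense transform plus bias, as in a basic graph convolution, followed by an MLP head.

```python
# Hypothetical sketch: parameter count of a simple GNN + MLP policy,
# to reason about model-scaling sweeps. The function name and the
# dimensions used below are assumptions for illustration only.

def gnn_mlp_param_count(in_dim, hidden, gnn_layers, mlp_layers, out_dim):
    """Count weights + biases for a graph-conv stack plus an MLP head.

    Each GNN layer is modelled as one dense transform (d_in x d_out)
    with a bias, as in a basic graph convolution; the MLP head is a
    stack of dense layers ending in an output layer of size out_dim.
    """
    total = 0
    d = in_dim
    for _ in range(gnn_layers):
        total += d * hidden + hidden   # graph-conv weight matrix + bias
        d = hidden
    for _ in range(mlp_layers - 1):
        total += d * hidden + hidden   # hidden MLP layer
        d = hidden
    total += d * out_dim + out_dim     # output layer
    return total

# Most terms scale with hidden**2, so doubling the hidden width
# roughly quadruples the parameter count:
small = gnn_mlp_param_count(16, 32, gnn_layers=3, mlp_layers=2, out_dim=4)
large = gnn_mlp_param_count(16, 64, gnn_layers=3, mlp_layers=2, out_dim=4)
```

A sweep over `hidden`, `gnn_layers`, and the instance size of the CP problem would then give the scaling curves the issue asks about.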

@3rdCore 3rdCore added the help wanted Extra attention is needed label Feb 15, 2022