DQN agent good for stochastic game? What other technique would be better? #684

Open
Spiegel-Leser opened this issue Dec 6, 2021 · 0 comments

Spiegel-Leser commented Dec 6, 2021

Browsing through freely available sources, I find both statements: DQN is good for stochastic environments, and DQN is not good for them.

As far as I understand it, the Q-network predicts the expected return of an action in a state, which can then be used to choose an action, e.g. greedily, and training makes that prediction better. If the environment is stochastic, repeated learning should nudge the prediction toward the center of the return distribution (the mean, for a squared-error loss), since that is where the loss is minimized.
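To make that intuition concrete, here is a minimal sketch, not a DQN: a single scalar estimate trained by gradient steps on a squared-error loss against noisy returns. The payoff distribution (`coin_flip_return`), the learning rate, and the step count are made-up assumptions for illustration only.

```python
import random

def estimate_q(sample_return, steps=20000, lr=0.001, seed=0):
    """Fit one scalar Q-estimate to noisy returns by SGD on squared error."""
    rng = random.Random(seed)
    q = 0.0
    for _ in range(steps):
        target = sample_return(rng)
        # Gradient of 0.5 * (q - target)**2 with respect to q is (q - target).
        q -= lr * (q - target)
    return q

def coin_flip_return(rng):
    # Hypothetical stochastic payoff: +10 with probability 0.3, else -2.
    return 10.0 if rng.random() < 0.3 else -2.0

expected = 0.3 * 10.0 + 0.7 * (-2.0)  # analytic mean of the payoff
q_hat = estimate_q(coin_flip_return)
print(f"true expected return: {expected:.2f}, learned estimate: {q_hat:.2f}")
```

The estimate should settle near the mean of the payoff rather than near either individual outcome, and with a constant learning rate it keeps fluctuating around that mean instead of converging exactly.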

So in general it should work, but it might need a lot of time to get there (law of large numbers), especially since the game is played by two agents that both suffer from the same problem, and each agent's behaviour is part of the stochastic "environment" for its opponent!
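A rough way to see the "lot of time" part is to check how slowly the noise in an averaged return shrinks with the number of samples. The sketch below covers only the sampling aspect, not the moving-opponent aspect, and the payoff function is a hypothetical stand-in for a noisy game outcome.

```python
import random
import statistics

def sample_return(rng):
    # Hypothetical noisy payoff: +10 with probability 0.3, else -2.
    return 10.0 if rng.random() < 0.3 else -2.0

def spread_of_mean(n_samples, n_trials=200, seed=0):
    """Standard deviation of the sample-mean estimate across independent trials."""
    rng = random.Random(seed)
    means = [
        statistics.fmean(sample_return(rng) for _ in range(n_samples))
        for _ in range(n_trials)
    ]
    return statistics.stdev(means)

# Spread of the averaged return for increasing sample counts.
spreads = {n: spread_of_mean(n) for n in (10, 100, 1000)}
for n, s in spreads.items():
    print(f"{n:>5} samples per estimate -> spread {s:.3f}")
```

The spread should fall roughly with the square root of the sample count, so each tenfold increase in samples buys only about a threefold reduction in noise.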

Maybe there is another technique in Deep Learning / Reinforcement Learning much better suited for such a strongly stochastic environment?
