Skip to content

Defining Exo-MDPs with External Data #564

Discussion options

You must be logged in to vote

For reference, I got a minimal example working with closures, heap allocated data, and @eval (with MCTS). It is straightforward. Basically, you share data using closures for each part of the interface (eg initialstate, transition, ...) that requires exogenous state, using heap allocated data for communication between methods if necessary. I'm not using any multithreading right now for simulations or solvers so communicating this way should be fine.

The MDP I implemented was not actually a pure Exo-MDP because there was some other external data also needed during the reward calculation. This made it a little more complicated. If you just needed to sample the initial state from external dat…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by kpa28-git
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
1 participant