Skip to content

State-dependent actions and gen function #382

Answered by lassepe
Cbroomer asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @clrescobar,

Unfortunately, TabularTDLearning does not currently support state-dependent actions since that would make the table "non square". That said, it would be possible to make that extension by using another data structure for the Q-values in TabularTDLearning. That, however, would require minor changes to the solver/code over there.

The gen function implements the (PO)MDP transition (and observation) model in a generative representation. That is, rather than having to provide probability densities over states, observations, and rewards via T, Z, and R, you just have to implement gen as a function to sample from the joint distribution of the three. Therefore, when you call gen(s…

Replies: 1 comment 23 replies

Comment options

You must be logged in to vote
23 replies
@Cbroomer
Comment options

@zsunberg
Comment options

@Cbroomer
Comment options

@zsunberg
Comment options

@Cbroomer
Comment options

Answer selected by zsunberg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants