Using `tf-agents` for Bandits with sparse data #778

ujjwal95 · 2022-09-30T17:12:37Z

Hi,
I am looking to use tf-agents to develop a multi-armed bandit for advertising.

For each observation, I only have the reward for the arm that was actually shown. Rewards for the other arms are never observed, because only that single arm is shown for the observation.

Is tf-agents able to handle such situations? I went through all the Environments, and all of them seem to assume that rewards are available for every observation-arm combination. The MovieLens example handles sparsity using SVD.

Will I need to use similar methods to estimate the reward for the other arms, or is there something in tf-agents that I am missing?


ujjwal95 · 2022-09-30T17:19:23Z

Is tf-agents able to train a bandit where we just provide each observation-feature, the arm picked and the reward?
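
To make the data shape concrete, here is a minimal sketch of the setting described above: each logged row holds only the observation features, the arm that was shown, and the reward for that arm. This is plain NumPy, not tf-agents API. The class name `LoggedLinUCB`, its parameters, and the sample rows are hypothetical and only for illustration. It uses a per-arm linear UCB model as one example of a learner that needs nothing beyond the reward of the chosen arm.

```python
import numpy as np


class LoggedLinUCB:
    """Hypothetical per-arm linear UCB, updated from (context, arm, reward) rows only."""

    def __init__(self, num_arms, dim, alpha=1.0):
        self.alpha = alpha
        # One ridge-regression design matrix and response vector per arm.
        self.A = np.stack([np.eye(dim) for _ in range(num_arms)])
        self.b = np.zeros((num_arms, dim))

    def update(self, context, arm, reward):
        # Only the arm that was actually shown is updated.
        # No reward is needed (or assumed) for the other arms.
        self.A[arm] += np.outer(context, context)
        self.b[arm] += reward * context

    def choose(self, context):
        scores = []
        for A, b in zip(self.A, self.b):
            theta = np.linalg.solve(A, b)  # estimated weights for this arm
            bonus = self.alpha * np.sqrt(context @ np.linalg.solve(A, context))
            scores.append(context @ theta + bonus)
        return int(np.argmax(scores))


# Made-up logged data: (observation features, arm shown, observed reward).
logged = [
    (np.array([1.0, 0.0]), 0, 1.0),
    (np.array([0.0, 1.0]), 1, 0.0),
    (np.array([1.0, 0.0]), 0, 1.0),
]

policy = LoggedLinUCB(num_arms=2, dim=2)
for context, arm, reward in logged:
    policy.update(context, arm, reward)

next_arm = policy.choose(np.array([1.0, 0.0]))
```

The question is whether tf-agents can be trained from rows of this form directly, without first filling in rewards for the arms that were not shown.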
