mulit-arm-bandit

Here are 3 public repositories matching this topic...

Multi-Armed Bandit (MAB) algorithm implementation in go

go ucb1 mulit-arm-bandit greedy-epsilon

A multi armed bandit Reinforcement learning problem using Policy Gradient.

Multi-Arm-Bandit-Problem solved using Tensorflow framework, tf.train.GradientDescentOptimizer

python jupyter algorithms tensorflow ipynb mulit-arm-bandit

Add a description, image, and links to the mulit-arm-bandit topic page so that developers can more easily learn about it.

To associate your repository with the mulit-arm-bandit topic, visit your repo's landing page and select "manage topics."