A | |
AlphaPhiUCBParam [Obandit] | Use to instanciate a |
AlphaUCBParam [Obandit] | Use to instanciate a |
B | |
Bandit [Obandit] | A bandit algorithm. |
D | |
DecayingEpsilonGreedyParam [Obandit] | Use to instanciate a |
E | |
EpsilonGreedyParam [Obandit] | Use to instanciate a |
F | |
FixedExp3Param [Obandit] | Use to instanciate a |
H | |
HorizonExp3Param [Obandit] | Use to instanciate a |
K | |
KBanditParam [Obandit] | Use to instanciate a |
R | |
RangeParam [Obandit] | A Reward range. |
RangedBandit [Obandit] | The type of a bandit with reward scaling. |
RateBanditParam [Obandit] | Use to instanciate algorithms that need a parametrizable rate. |