B | |
| bandit [Obandit.RangedBandit] | |
| bandit [Obandit.Bandit] | The internal data structure of the bandit algorithm. |
| banditEstimates [Obandit] | The inner state of a bandit that maintains estimates of arm means. |
| banditPolicy [Obandit] | The internal state of an Exp3 bandit. |
R | |
| rangedAction [Obandit] | A ranged action: Action a in the normal course of action, Reset a in case the bandit was just restarted. |
| rangedBandit [Obandit] | The type of a bandit with a range. |
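
The rangedAction entry above describes a two-constructor variant, and a consumer would typically pattern-match on it. The following is only a minimal sketch: `ranged_action`, `arm_of`, and `was_reset` are hypothetical local names, and the integer payload (an arm index) is an assumption rather than something this index states. Check the Obandit signature before relying on it.

```ocaml
(* Hypothetical local mirror of the indexed rangedAction type.
   Assumption: the payload is an integer arm index. *)
type ranged_action =
  | Action of int  (* normal course of action *)
  | Reset of int   (* the bandit was just restarted *)

(* Extract the arm to play, whichever constructor carries it. *)
let arm_of = function
  | Action a | Reset a -> a

(* Report whether the bandit was restarted before this arm was chosen. *)
let was_reset = function
  | Action _ -> false
  | Reset _ -> true

let () =
  List.iter
    (fun act ->
      Printf.printf "arm %d%s\n" (arm_of act)
        (if was_reset act then " (after restart)" else ""))
    [ Action 2; Reset 0 ]
```

Matching on both constructors keeps the restart case explicit, so a caller can discard statistics gathered under the old range when it sees Reset.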