kumulant

EpsilonGreedySpec

@Serializable
@SerialName(value = "EpsilonGreedy")
data class EpsilonGreedySpec(val epsilon: Double = 0.1, val priorMean: Double = 0.0, val priorWeight: Double = 0.02, val priorSquaredDeviations: Double = 0.02) : BanditPolicySpec<WeightedVarianceResult> (source)

Spec for EpsilonGreedy.

Constructors

Link copied to clipboard
constructor(epsilon: Double = 0.1, priorMean: Double = 0.0, priorWeight: Double = 0.02, priorSquaredDeviations: Double = 0.02)

Properties

Link copied to clipboard

Probability of exploring uniformly.

Link copied to clipboard

Per-arm prior on the running reward mean.

Link copied to clipboard

Prior on Sum (x - mean)^2 * w.

Link copied to clipboard

Per-arm prior pseudo-count.

Functions

Link copied to clipboard

Build a live BanditPolicy from its spec.

EpsilonGreedySpec

constructor(epsilon: Double = 0.1, priorMean: Double = 0.0, priorWeight: Double = 0.02, priorSquaredDeviations: Double = 0.02)(source)

epsilon

Probability of exploring uniformly.

priorMean

Per-arm prior on the running reward mean.

priorSquaredDeviations

Prior on Sum (x - mean)^2 * w.

priorWeight

Per-arm prior pseudo-count.