kumulant

EpsilonDecreasingSpec

@Serializable
@SerialName(value = "EpsilonDecreasing")
data class EpsilonDecreasingSpec(val epsilon: Double = 2.0, val decay: Double = 0.5, val priorMean: Double = 0.0, val priorWeight: Double = 0.02, val priorSquaredDeviations: Double = 0.02) : BanditPolicySpec<WeightedVarianceResult>

Serializable spec for the EpsilonDecreasing bandit policy.

Constructors

constructor(epsilon: Double = 2.0, decay: Double = 0.5, priorMean: Double = 0.0, priorWeight: Double = 0.02, priorSquaredDeviations: Double = 0.02)
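
Because every constructor parameter has a default, a spec can be created with no arguments, with named overrides, or derived from another spec through the data-class `copy`. The sketch below uses a local stand-in class that mirrors the documented signature so it compiles without the library; the real class is additionally `@Serializable` and implements `BanditPolicySpec<WeightedVarianceResult>`. The variable names are illustrative only.

```kotlin
// Local stand-in mirroring the documented constructor signature (an assumption
// made so the example is self-contained; use the library class in real code).
data class EpsilonDecreasingSpec(
    val epsilon: Double = 2.0,
    val decay: Double = 0.5,
    val priorMean: Double = 0.0,
    val priorWeight: Double = 0.02,
    val priorSquaredDeviations: Double = 0.02,
)

fun main() {
    // All defaults, exactly as listed in the signature.
    val defaults = EpsilonDecreasingSpec()

    // Named arguments override only the selected parameters.
    val tuned = EpsilonDecreasingSpec(epsilon = 1.0, decay = 1.0)

    // copy() derives a variant without mutating the original spec.
    val optimistic = tuned.copy(priorMean = 1.0)

    println(defaults)
    println(optimistic)
}
```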

Properties

decay

Decay exponent applied to the running sample count.

epsilon

Initial exploration scale.

priorMean

Per-arm prior on the running reward mean.

priorSquaredDeviations

Prior on the weighted sum of squared deviations, Sum w * (x - mean)^2.

priorWeight

Per-arm prior pseudo-count.
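
To make the roles of the exploration scale and the decay exponent concrete, here is a minimal sketch of one common epsilon-decreasing schedule. It assumes the exploration probability is `epsilon / n^decay`, clamped to [0, 1], where `n` is the running sample count. This page does not state the exact formula kumulant uses, so the formula and the helper names `explorationProbability` and `chooseArm` are hypothetical.

```kotlin
import kotlin.math.pow
import kotlin.random.Random

// Assumed schedule: epsilon / n^decay, clamped to a valid probability.
// Not confirmed by the library documentation.
fun explorationProbability(epsilon: Double, decay: Double, sampleCount: Long): Double {
    val n = maxOf(sampleCount, 1L).toDouble() // avoid division by zero before any samples
    return (epsilon / n.pow(decay)).coerceIn(0.0, 1.0)
}

// Explore a uniformly random arm with the probability above,
// otherwise exploit the arm with the highest estimated mean.
fun chooseArm(
    means: DoubleArray,
    epsilon: Double,
    decay: Double,
    sampleCount: Long,
    rng: Random,
): Int {
    require(means.isNotEmpty()) { "at least one arm is required" }
    val p = explorationProbability(epsilon, decay, sampleCount)
    if (rng.nextDouble() < p) return rng.nextInt(means.size)
    return means.indices.maxByOrNull { means[it] }!!
}

fun main() {
    // With the defaults (epsilon = 2.0, decay = 0.5) the probability stays at 1
    // for the first few samples and then shrinks as the count grows.
    for (n in listOf(1L, 4L, 16L, 100L)) {
        println("n=$n p=${explorationProbability(2.0, 0.5, n)}")
    }
}
```

Under this assumed schedule, an `epsilon` above 1 yields a pure-exploration warm-up phase, and a larger `decay` shortens the time spent exploring.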

Functions

Build a live BanditPolicy from its spec.

EpsilonDecreasingSpec

constructor(epsilon: Double = 2.0, decay: Double = 0.5, priorMean: Double = 0.0, priorWeight: Double = 0.02, priorSquaredDeviations: Double = 0.02)

decay

Decay exponent applied to the running sample count.

epsilon

Initial exploration scale.

priorMean

Per-arm prior on the running reward mean.

priorSquaredDeviations

Prior on the weighted sum of squared deviations, Sum w * (x - mean)^2.

priorWeight

Per-arm prior pseudo-count.
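
The three prior parameters can be read as the starting state of a weighted running mean and variance for each arm. The sketch below seeds a standard weighted incremental update with them. It is an assumption about how such priors are typically used, suggested by the `WeightedVarianceResult` type parameter, not a description of kumulant's internals; the class `RunningArmStats` and its members are hypothetical.

```kotlin
// Hypothetical per-arm accumulator seeded with the spec's prior values.
class RunningArmStats(
    priorMean: Double = 0.0,
    priorWeight: Double = 0.02,
    priorSquaredDeviations: Double = 0.02,
) {
    var weight: Double = priorWeight
        private set
    var mean: Double = priorMean
        private set
    var squaredDeviations: Double = priorSquaredDeviations
        private set

    // Weighted incremental update of the mean and of Sum w * (x - mean)^2.
    fun update(x: Double, w: Double = 1.0) {
        require(w > 0.0) { "weight must be positive" }
        val newWeight = weight + w
        val delta = x - mean
        mean += (w / newWeight) * delta
        squaredDeviations += w * delta * (x - mean) // uses the updated mean
        weight = newWeight
    }

    // Weighted population variance implied by the accumulated state.
    val variance: Double
        get() = squaredDeviations / weight
}

fun main() {
    val arm = RunningArmStats()
    listOf(1.0, 0.0, 1.0, 1.0).forEach { arm.update(it) }
    println("mean=${arm.mean} variance=${arm.variance} weight=${arm.weight}")
}
```

With a small pseudo-count such as the default 0.02, the prior is overwhelmed after the first real observation; a larger `priorWeight` keeps an arm's estimate closer to `priorMean` for longer.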