kumulant

Ucb1NormalSpec

@Serializable
@SerialName(value = "UCB1Normal")
data class Ucb1NormalSpec(val alpha: Double = 1.0, val priorMean: Double = 0.0, val priorWeight: Double = 0.02) : BanditPolicySpec<MomentsResult> (source)

Spec for UCB1Normal.

Constructors

Link copied to clipboard
constructor(alpha: Double = 1.0, priorMean: Double = 0.0, priorWeight: Double = 0.02)

Properties

Link copied to clipboard

Exploration scale on the confidence-bound term.

Link copied to clipboard

Per-arm prior on the running reward mean.

Link copied to clipboard

Per-arm prior pseudo-count.

Functions

Link copied to clipboard

Build a live BanditPolicy from its spec.

Ucb1NormalSpec

constructor(alpha: Double = 1.0, priorMean: Double = 0.0, priorWeight: Double = 0.02)(source)

alpha

Exploration scale on the confidence-bound term.

priorMean

Per-arm prior on the running reward mean.

priorWeight

Per-arm prior pseudo-count.