kumulant

RmspropOptimizer

class RmspropOptimizer(val featureSize: Int, val learningRate: ScalarExpr, val rho: Double = 0.9, val epsilon: Double = 1.0E-8, concurrency: Concurrency = Concurrency.None) : Optimizer(source)

RMSProp: exponential moving average of squared gradients with decay rho; effective per-coord learning rate is lr / sqrt(emaG2[i] + epsilon).

Constructors

Link copied to clipboard
constructor(featureSize: Int, learningRate: ScalarExpr, rho: Double = 0.9, epsilon: Double = 1.0E-8, concurrency: Concurrency = Concurrency.None)

Properties

Link copied to clipboard

Numerical-stability epsilon.

Link copied to clipboard
open override val featureSize: Int

Number of weight coordinates this optimizer manages.

Link copied to clipboard

Base learning-rate schedule.

Link copied to clipboard
val rho: Double

EMA decay for the squared gradient.

Functions

Link copied to clipboard
open override fun advance()

Advance per-update counters (Adam step t, etc.). Called once per stat update.

Link copied to clipboard
open override fun computeDelta(coordIndex: Int, gradient: Double, observationWeight: Double): Double

Per-coordinate update. Reads/writes any auxiliary state owned by the optimizer, then returns the delta the caller should add to w[coordIndex].

Link copied to clipboard
open override fun reset()

Reset all internal state to its initial values.

RmspropOptimizer

constructor(featureSize: Int, learningRate: ScalarExpr, rho: Double = 0.9, epsilon: Double = 1.0E-8, concurrency: Concurrency = Concurrency.None)(source)

advance

open override fun advance()(source)

Advance per-update counters (Adam step t, etc.). Called once per stat update.

computeDelta

open override fun computeDelta(coordIndex: Int, gradient: Double, observationWeight: Double): Double(source)

Per-coordinate update. Reads/writes any auxiliary state owned by the optimizer, then returns the delta the caller should add to w[coordIndex].

epsilon

Numerical-stability epsilon.

featureSize

open override val featureSize: Int(source)

Number of weight coordinates this optimizer manages.

learningRate

Base learning-rate schedule.

reset

open override fun reset()(source)

Reset all internal state to its initial values.

rho

EMA decay for the squared gradient.