com.eignex.kumulant/stat/regression

stat.regression

Family root for the regression-modality stats and the cross-cutting infrastructure they share. The single-output linear-model family lives in glm, the decision-tree and random-forest family in tree. What sits directly in this package is the small set of stats that don't fit either subfamily, plus the strategy types both rely on.

What's directly in this package

Stat	Modality	Role
CovarianceStat	Paired	Weighted covariance and Pearson correlation between two streams. The building block for both univariate and multivariate regressions.
SoftmaxRegressionStat	Regression	Online multinomial logistic regression; K-way classification via softmax cross-entropy. Generalises the GLM with `Link.Logit` from binary to K-way.
GaussianNaiveBayesStat	Regression	Per-class running Welford mean / variance of each feature plus class priors. The cheap non-parametric multiclass classifier.

Softmax and Naive Bayes implement com.eignex.kumulant.core.RegressionStat because that interface gives them update(VectorView, Double); the scalar y is the class index. Strictly they are classifiers, not regressors, but they share the input shape and the result is consumed the same way (a posterior over classes plus calibration / accuracy metrics).

Cross-cutting types

Optimizer strategy

Optimizer is the per-coordinate update rule that single-output and multinomial linear models share. The runtime API has four implementations:

SgdOptimizer: stateless; delta = -lr * weight * gradient.
AdagradOptimizer: accumulates squared gradients per coordinate.
RmspropOptimizer: exponential moving average of squared gradients.
AdamOptimizer: bias-corrected first and second moments (Kingma & Ba 2015).

Wire-portable counterparts live in com.eignex.kumulant.schema.OptimizerSpec: com.eignex.kumulant.schema.Sgd, com.eignex.kumulant.schema.Adagrad, com.eignex.kumulant.schema.Rmsprop, com.eignex.kumulant.schema.Adam. Stats accept the spec and materialise their own optimizer instances per-feature-set; per-coordinate aux state honours the stat's com.eignex.kumulant.core.Concurrency level.

Posteriors

RegressionPosterior is the scoring interface shared by every regression family. A posterior projects a com.eignex.kumulant.core.Result and a query com.eignex.kumulant.math.VectorView to a scalar score, parametrised by an exploration knob and a Random. The contextual bandits (com.eignex.kumulant.bandit.contextual.RegressionContextualBandit) consume posteriors at choose time.

Concrete posteriors live with their model families:

com.eignex.kumulant.stat.regression.glm.LinearPosterior for linear models: PointPosterior, FactorisedGaussian, MultivariateGaussian, LinUcb.
TreePosterior and ForestPosterior for trees: see com.eignex.kumulant.stat.regression.tree.

The posterior interface is intentionally minimal so a downstream contextual bandit can mix-and-match: one bandit might score arms with a GLM under LinUcb and another with a forest under ThompsonForestPosterior: same code path, different model and scoring rule.

When to reach into which subfamily

Need a linear model? Use glm. Pick by required output:

Point estimates only, fastest path → StochasticRegressionStat with com.eignex.kumulant.schema.Sgd or com.eignex.kumulant.schema.Adam.
Per-coordinate uncertainty → DiagonalRegressionStat.
Full posterior covariance for Thompson sampling / LinUCB → BayesianRegressionStat.
Pooled estimation across many parallel regressors → HierarchicalBayesianRegression.

Need a non-linear regressor? Use tree. Single tree for cheap; forest for ensembled variance estimates (the natural backbone for a tree-based contextual bandit).
Need K-way classification? SoftmaxRegressionStat (parametric, online SGD, scales to high dimensions) or GaussianNaiveBayesStat (non-parametric, cheap, no convergence concerns).
Need calibrated probabilities? Wrap any of the above's output through com.eignex.kumulant.stat.calibration.

Merge

Of the stats living directly here, CovarianceStat and GaussianNaiveBayesStat merge exactly (Chan-style parallel Welford on the running covariance and on the per-class moments); SoftmaxRegressionStat merges approximately via a sample-weighted blend of the per-class weight vectors, the usual SGD limitation. The glm and tree subpackages document their own merge stories.

Concurrency

CovarianceStat inherits com.eignex.kumulant.stat.regression.glm.UnivariateRegressionStat's Welford-coupled model: locked under com.eignex.kumulant.core.Concurrency.Strict / com.eignex.kumulant.core.Concurrency.HighWrite, racing under com.eignex.kumulant.core.Concurrency.Relaxed. SoftmaxRegressionStat and GaussianNaiveBayesStat serialise the update body under com.eignex.kumulant.core.Concurrency.Strict / com.eignex.kumulant.core.Concurrency.HighWrite; under com.eignex.kumulant.core.Concurrency.Relaxed the per-class cells race with bounded drift. See the subpackages for the glm and tree concurrency designs.

Types

AdagradOptimizer

class AdagradOptimizer(val featureSize: Int, val learningRate: ScalarExpr, val epsilon: Double = 1.0E-10, concurrency: Concurrency = Concurrency.None) : Optimizer

Adagrad: accumulates squared gradients per coordinate; the effective per-coord learning rate is lr / sqrt(sumG2[i] + epsilon). Adapts faster on rare features.

AdamOptimizer

class AdamOptimizer(val featureSize: Int, val learningRate: ScalarExpr, val beta1: Double = 0.9, val beta2: Double = 0.999, val epsilon: Double = 1.0E-8, concurrency: Concurrency = Concurrency.None) : Optimizer

Adam with bias-corrected first and second moments. Default hyperparameters beta1=0.9, beta2=0.999, epsilon=1e-8 follow Kingma & Ba 2015.

CovarianceResult

@Serializable

@SerialName(value = "CovarianceResult")

data class CovarianceResult(val totalWeights: Double, val meanX: Double, val meanY: Double, val sxy: Double, val sxx: Double, val syy: Double) : Result

Weighted covariance snapshot with second-moment sums usable for merging.

CovarianceStat

class CovarianceStat(concurrency: Concurrency = Concurrency.None) : PairedStat<CovarianceResult>

Online covariance and Pearson correlation between two streams.

GaussianNaiveBayesResult

@Serializable

@SerialName(value = "GaussianNaiveBayesResult")

data class GaussianNaiveBayesResult(val featureSize: Int, val numClasses: Int, val means: DenseMatrix, val variances: DenseMatrix, val classWeights: DenseVector, val totalWeights: Double, val varianceFloor: Double) : Result

Snapshot from GaussianNaiveBayesStat: per-class feature statistics and class priors. Each row of means and variances holds the running mean / variance of every feature conditioned on a given class.

GaussianNaiveBayesStat

class GaussianNaiveBayesStat(val featureSize: Int, val numClasses: Int, val varianceFloor: Double = 1.0E-9, val concurrency: Concurrency = Concurrency.None) : RegressionStat<GaussianNaiveBayesResult>

Online Gaussian Naive Bayes classifier. Tracks per-class, per-feature running mean and variance via weighted Welford, plus per-class accumulated weight as the prior. Predict-time log-likelihoods assume features are conditionally independent within each class.

Optimizer

sealed interface Optimizer

Online optimizer strategy. Given a per-coordinate raw gradient, returns the delta to add to that weight cell. Owns any per-coordinate auxiliary state (Adam's first/second moments, Adagrad's running squared gradient, etc.).

RegressionPosterior

interface RegressionPosterior<R : Result>

Stateless scorer over a regression snapshot at a query point x. Generalises the "score this arm under the current model and context" loop across linear regressors, trees, and any future regressor type:

RmspropOptimizer

class RmspropOptimizer(val featureSize: Int, val learningRate: ScalarExpr, val rho: Double = 0.9, val epsilon: Double = 1.0E-8, concurrency: Concurrency = Concurrency.None) : Optimizer

RMSProp: exponential moving average of squared gradients with decay rho; effective per-coord learning rate is lr / sqrt(emaG2[i] + epsilon).

SgdOptimizer

class SgdOptimizer(val featureSize: Int, val learningRate: ScalarExpr, concurrency: Concurrency = Concurrency.None) : Optimizer

Plain SGD: delta = -learningRate(step) * weight * gradient. Stateless apart from the global step counter feeding the schedule.

SoftmaxRegressionResult

@Serializable

@SerialName(value = "SoftmaxRegressionResult")

data class SoftmaxRegressionResult(val featureSize: Int, val numClasses: Int, val weights: DenseMatrix, val biases: DenseVector, val totalWeights: Double, val step: Long, val crossEntropy: Double) : Result

Snapshot from SoftmaxRegressionStat: per-class linear-model parameters plus cumulative bookkeeping. The K-by-p weights matrix and length-K biases vector define the linear predictors eta[k] = biases[k] + weights[k] . x; the predicted class probability is the softmax over the K logits.

SoftmaxRegressionStat

class SoftmaxRegressionStat(val featureSize: Int, val numClasses: Int, val optimizer: OptimizerSpec = Sgd(), val biasOptimizer: OptimizerSpec = optimizer, val concurrency: Concurrency = Concurrency.None) : RegressionStat<SoftmaxRegressionResult>

Online multinomial logistic regression by stochastic gradient descent on the softmax cross-entropy loss. Generalises com.eignex.kumulant.stat.regression.glm.StochasticRegressionStat with Link.Logit from binary to K-way classification.