A Predictive View on Streaming Hidden Markov Models

venue: arXiv
date: April 2026
link: https://arxiv.org/abs/2604.09208
Authors: Gerardo Duran-Martin

Abstract

We develop a predictive-first optimisation framework for streaming hidden Markov models. Unlike classical approaches that prioritise full posterior recovery under a fully specified generative model, we assume access to regime-specific predictive models whose parameters are learned online while maintaining a fixed transition prior over regimes. Our objective is to sequentially identify latent regimes while maintaining accurate step-ahead predictive distributions. Because the number of possible regime paths grows exponentially, exact filtering is infeasible. We therefore formulate streaming inference as a constrained projection problem in predictive-distribution space: under a fixed hypothesis budget, we approximate the full posterior predictive by the forward-KL optimal mixture supported on S paths. The solution is the renormalised top-S posterior-weighted mixture, providing a principled derivation of beam search for HMMs. The resulting algorithm is fully recursive and deterministic, performing beam-style truncation with closed-form predictive updates and requiring neither EM nor sampling. Empirical comparisons against Online EM and Sequential Monte Carlo under matched computational budgets demonstrate competitive prequential performance.

Citation

1
2
3
4
5
6
7
8
9
@misc{duranmartin2026predictiveviewstreaminghidden,
  title={A Predictive View on Streaming Hidden Markov Models},
  author={Gerardo Duran-Martin},
  year={2026},
  eprint={2604.09208},
  archivePrefix={arXiv},
  primaryClass={stat.ML},
  url={https://arxiv.org/abs/2604.09208},
}