Comment by anshumankmr

2 months ago

How does next-token prediction work for time series data?

1 comment

anshumankmr

There is no single answer, because there are multiple architectures for foundation time-series models, such as T5, decoder-only models, and state-space models (SSMs).

For Chronos-2 (the current state of the art in time-series modeling), the setup is almost identical to that of LLMs because it is based on the T5 architecture. The main difference is that, in time-series models, tokens correspond to subintervals in the real-valued (ℝ) space. You can check the details here: https://arxiv.org/pdf/2510.15821