Back to Search Start Over

Chronos: Learning the Language of Time Series

Authors :
Ansari, Abdul Fatir
Stella, Lorenzo
Turkmen, Caner
Zhang, Xiyuan
Mercado, Pedro
Shen, Huibin
Shchur, Oleksandr
Rangapuram, Syama Sundar
Arango, Sebastian Pineda
Kapoor, Shubham
Zschiegner, Jasper
Maddix, Danielle C.
Wang, Hao
Mahoney, Michael W.
Torkkola, Kari
Wilson, Andrew Gordon
Bohlke-Schneider, Michael
Wang, Yuyang
Publication Year :
2024

Abstract

We introduce Chronos, a simple yet effective framework for pretrained probabilistic time series models. Chronos tokenizes time series values using scaling and quantization into a fixed vocabulary and trains existing transformer-based language model architectures on these tokenized time series via the cross-entropy loss. We pretrained Chronos models based on the T5 family (ranging from 20M to 710M parameters) on a large collection of publicly available datasets, complemented by a synthetic dataset that we generated via Gaussian processes to improve generalization. In a comprehensive benchmark consisting of 42 datasets, and comprising both classical local models and deep learning methods, we show that Chronos models: (a) significantly outperform other methods on datasets that were part of the training corpus; and (b) have comparable and occasionally superior zero-shot performance on new datasets, relative to methods that were trained specifically on them. Our results demonstrate that Chronos models can leverage time series data from diverse domains to improve zero-shot accuracy on unseen forecasting tasks, positioning pretrained models as a viable tool to greatly simplify forecasting pipelines.<br />Comment: Code and model checkpoints available at https://github.com/amazon-science/chronos-forecasting

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2403.07815
Document Type :
Working Paper