T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models
Yunfeng Ge, Jiawei Li, Yiji Zhao, Haomin Wen, Zhao Li, Meikang Qiu, Hongyan Li, Ming Jin, Shirui Pan
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
Main Track. Pages 5208-5216.
https://doi.org/10.24963/ijcai.2025/580
Text-to-Time Series generation holds significant potential for addressing challenges such as data sparsity, imbalance, and the limited availability of multimodal time series data across domains. While diffusion models have achieved remarkable success in Text-to-X generation (e.g., vision and audio data), their use in time series generation remains limited. Existing approaches face two critical limitations: (1) reliance on domain-specific captions that generalize poorly, and (2) an inability to generate time series of arbitrary length, which restricts real-world applicability. In this work, we first introduce a new multimodal dataset containing over 600,000 high-resolution text-time series pairs. Second, we propose Text-to-Series (T2S), a diffusion-based framework that bridges the gap between natural language and time series in a domain-agnostic manner. It employs a length-adaptive variational autoencoder (VAE) to encode time series of varying lengths into consistent latent embeddings. On top of this, T2S aligns textual representations with the latent embeddings by using Flow Matching with a Diffusion Transformer (DiT) as the denoiser. We train T2S in an interleaved paradigm across multiple lengths, enabling it to generate sequences of arbitrary length. Extensive evaluations demonstrate that T2S achieves state-of-the-art performance across 13 datasets spanning 12 domains.
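To make the two core ideas in the abstract concrete, below is a minimal, hypothetical sketch: a length-adaptive encoder that maps series of any length to a fixed-size latent (here via adaptive pooling, an assumption; the paper's VAE would also carry a variance/KL term), and a rectified-flow-style Flow Matching training step with a stand-in for the DiT denoiser conditioned on a text embedding. All module names, dimensions, and the pooling strategy are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of (1) length-adaptive latent encoding and
# (2) a text-conditioned Flow Matching training step. Not the paper's code.
import torch
import torch.nn as nn
import torch.nn.functional as F

LATENT_LEN, LATENT_DIM, TEXT_DIM = 32, 64, 128  # assumed sizes

class LengthAdaptiveEncoder(nn.Module):
    """Maps a (B, L, 1) series of any length L to (B, LATENT_LEN, LATENT_DIM)."""
    def __init__(self):
        super().__init__()
        self.proj = nn.Linear(1, LATENT_DIM)

    def forward(self, x):                          # x: (B, L, 1)
        h = self.proj(x).transpose(1, 2)           # (B, LATENT_DIM, L)
        h = F.adaptive_avg_pool1d(h, LATENT_LEN)   # fixed temporal size
        return h.transpose(1, 2)                   # (B, LATENT_LEN, LATENT_DIM)

class Denoiser(nn.Module):
    """Stand-in for a DiT-style denoiser: predicts the flow velocity."""
    def __init__(self):
        super().__init__()
        self.text_proj = nn.Linear(TEXT_DIM, LATENT_DIM)
        self.time_proj = nn.Linear(1, LATENT_DIM)
        self.net = nn.Sequential(
            nn.Linear(LATENT_DIM, 256), nn.GELU(), nn.Linear(256, LATENT_DIM)
        )

    def forward(self, z_t, t, text_emb):
        # Broadcast time and text conditioning over the latent sequence.
        cond = (self.time_proj(t[:, None]).unsqueeze(1)
                + self.text_proj(text_emb).unsqueeze(1))
        return self.net(z_t + cond)

def flow_matching_step(encoder, denoiser, series, text_emb):
    """One rectified-flow training step: regress the velocity z1 - z0."""
    z1 = encoder(series)                           # data latent
    z0 = torch.randn_like(z1)                      # noise latent
    t = torch.rand(z1.shape[0])                    # t ~ U(0, 1)
    zt = (1 - t)[:, None, None] * z0 + t[:, None, None] * z1
    v_pred = denoiser(zt, t, text_emb)
    return F.mse_loss(v_pred, z1 - z0)

# Toy usage: series of different lengths map to the same latent shape,
# mimicking the interleaved multi-length training paradigm.
enc, den = LengthAdaptiveEncoder(), Denoiser()
for L in (96, 512):                                # arbitrary input lengths
    loss = flow_matching_step(enc, den, torch.randn(4, L, 1),
                              torch.randn(4, TEXT_DIM))
    print(L, loss.item())
```

The fixed latent shape is what lets a single denoiser train across interleaved lengths; at sampling time, integrating the learned velocity field from noise to data and decoding (with a decoder not sketched here) would yield a series at the requested resolution.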
Keywords:
Machine Learning: ML: Generative models
Machine Learning: ML: Multi-modal learning
Machine Learning: ML: Time series and data streams
Natural Language Processing: NLP: Language grounding
