TimeOmni-VL: Unified Models for Time Series Understanding and Generation
About
Recent time series modeling faces a sharp divide between numerical generation and semantic understanding, with research showing that generation models often rely on superficial pattern matching, while understanding-oriented models struggle with high-fidelity numerical output. Although unified multimodal models (UMMs) have bridged this gap in vision, their potential for time series remains untapped. We propose TimeOmni-VL, the first vision-centric framework that unifies time series understanding and generation through two key innovations: (1) Fidelity-preserving bidirectional mapping between time series and images (Bi-TSI), which advances Time Series-to-Image (TS2I) and Image-to-Time Series (I2TS) conversions to ensure near-lossless transformations. (2) Understanding-guided generation. We introduce TSUMM-Suite, a novel dataset consists of six understanding tasks rooted in time series analytics that are coupled with two generation tasks. With a calibrated Chain-of-Thought, TimeOmni-VL is the first to leverage time series understanding as an explicit control signal for high-fidelity generation. Experiments confirm that this unified approach significantly improves both semantic understanding and numerical precision, establishing a new frontier for multimodal time series modeling.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Forecasting | GIFT-Eval Short-term | nMASE0.878 | 9 | |
| Decision Making | TSR-Suite Task 4 | Accuracy61.4 | 8 | |
| Time Series Imputation | GIFT-Eval (test) | nMASE Count [0.1, 0.2)0.713 | 8 | |
| Perception | TSR-Suite Task 1 | Accuracy84 | 8 | |
| Perception | TSR-Suite Task 2 | Accuracy61.3 | 8 | |
| Time Series Forecasting | GIFT-Eval Long-term | nMASE0.784 | 6 | |
| Time Series Forecasting | GIFT-Eval Med-term | nMASE0.816 | 6 | |
| Extrapolation | TSR-Suite Task 3 | MAE163.8 | 6 | |
| Anomaly Detection | TS-image Understanding | QA5 (Weighted Accuracy)66.7 | 4 | |
| Cycle Bounding Box Localization | TS-image | QA3 (IoU)93.1 | 4 |