TrackGPT -- A generative pre-trained transformer for cross-domain entity trajectory forecasting
About
The forecasting of entity trajectories at future points in time is a critical capability gap in applications across both Commercial and Defense sectors. Transformers, and specifically Generative Pre-trained Transformer (GPT) networks have recently revolutionized several fields of Artificial Intelligence, most notably Natural Language Processing (NLP) with the advent of Large Language Models (LLM) like OpenAI's ChatGPT. In this research paper, we introduce TrackGPT, a GPT-based model for entity trajectory forecasting that has shown utility across both maritime and air domains, and we expect to perform well in others. TrackGPT stands as a pioneering GPT model capable of producing accurate predictions across diverse entity time series datasets, demonstrating proficiency in generating both long-term forecasts with sustained accuracy and short-term forecasts with high precision. We present benchmarks against state-of-the-art deep learning techniques, showing that TrackGPT's forecasting capability excels in terms of accuracy, reliability, and modularity. Importantly, TrackGPT achieves these results while remaining domain-agnostic and requiring minimal data features (only location and time) compared to models achieving similar performance. In conclusion, our findings underscore the immense potential of applying GPT architectures to the task of entity trajectory forecasting, exemplified by the innovative TrackGPT model.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Referring Video Object Segmentation | Ref-DAVIS 17 | J&F Score66.5 | 131 | |
| Referring Video Segmentation | MeViS | J&F Score41.2 | 50 | |
| Referring Video Object Segmentation | Ref-Youtube-VOS v1.0 (test) | J&F Score49.5 | 33 | |
| Reasoning Video Object Segmentation | ReVOS 1.0 (test) | Jaccard (J)0.381 | 22 | |
| Video Referring Segmentation | ReVOS Referring | J Score48.3 | 19 | |
| Referring Video Segmentation | MeViS (test) | J&F Score40.1 | 18 | |
| Video Object Segmentation | ReVOS 1.0 (test) | Jaccard Index43.2 | 15 | |
| Reasoning Video Object Segmentation | ReVOS Overall (Entire Dataset) | J&F Score45 | 14 | |
| Reasoning Video Object Segmentation | ReVOS Reasoning | Jaccard (J)38.1 | 12 | |
| Referring Video Object Segmentation | Ref-YT-VOS | J Score58.1 | 11 |