Spatiotemporal Pre-Trained Large Language Model for Forecasting with Missing Values

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Fang, L
Xiang, W
Pan, S
Salim, FD
Chen, YPP
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2025
Size
File type(s)
Location
License
Abstract

Spatiotemporal data collected by sensors within an urban Internet of Things (IoT) system inevitably contains some missing values, which significantly affects the accuracy of spatiotemporal data forecasting. However, existing techniques, including those based on Large Language Models (LLMs), show limited effectiveness in forecasting with missing values, especially in scenarios involving high-dimensional sensor data. In this article, we propose a novel spatiotemporal pre-trained large language model dubbed SPLLM for forecasting with missing values. In this network, we seamlessly integrate a specialized spatiotemporal fusion Graph Convolutional Network (GCN) module that extracts intricate spatiotemporal and graph-based information, for generating suitable inputs to the SPLLM. Furthermore, we propose a Feed-Forward Network (FFN) fine-tuning strategy within the LLM and a final fusion layer to enable the model to leverage the pre-trained foundational knowledge of the LLM and adapt to new incomplete data simultaneously. The experimental results indicate that SPLLM outperforms state-of-the-art models on real-world public datasets. Notably, SPLLM exhibits a superior performance in tackling incomplete sensory data with a variety of missing rates. A comprehensive ablation study of key components is conducted to demonstrate their efficiency.

Journal Title

IEEE Internet of Things Journal

Conference Title
Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

This work is covered by copyright. You must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a specified licence, refer to the licence for details of permitted re-use. If you believe that this work infringes copyright please make a copyright takedown request using the form at https://www.griffith.edu.au/copyright-matters.

Item Access Status
Note

This publication has been entered in Griffith Research Online as an advance online version.

Access the data
Related item(s)
Subject

Engineering

Information and computing sciences

Persistent link to this record
Citation

Fang, L; Xiang, W; Pan, S; Salim, FD; Chen, YPP, Spatiotemporal Pre-Trained Large Language Model for Forecasting with Missing Values, IEEE Internet of Things Journal, 2025

Collections