Short-term inbound passenger flow forecasting for urban rail transit based on phase space reconstruction and deep learning

Jing Wu; Jie Xu; Dengyu Xu

doi:10.48130/dts-0025-0015

Accurate and real-time passenger flow forecasting is of significant importance for the operation of urban rail transit (URT). However, the chaotic and nonlinear nature of short-term passenger flow limits the predictive performance of traditional time series and deep learning models. Integrating chaos theory and deep learning, this paper proposes an efficient short-term passenger flow forecasting model to address these challenges. First, the Lyapunov exponent of the passenger flow time series is computed to quantify its chaotic nature. Then, Phase Space Reconstruction (PSR) is used to map the one-dimensional passenger flow time series to a higher-dimensional space, uncovering its inherent chaotic properties. Using the reconstructed high-dimensional data, a Convolutional Neural Network (CNN) extracts features of passenger flow across different dimensions, while a Long Short-Term Memory (LSTM) network captures temporal features. The combined CNN and LSTM architecture, termed PSR-CNN-LSTM, enhances predictive efficiency and stability, with the Grey Wolf Optimizer (GWO) optimizing the model's hyperparameters. Experiments on real-world Automatic Fare Collection (AFC) datasets from five representative metro stations in Shanghai (China) validate the model's generalization capability across diverse station types and passenger flow patterns. Compared with five benchmark models, the PSR-CNN-LSTM model achieves higher predictive accuracy, faster convergence speed, and improved computational efficiency. Ablation studies confirm that each component plays a critical role in enhancing forecasting performance. This research provides subway operators with real-time, reliable insights into short-term passenger flow, supporting better passenger flow management and scheduling.

Introduction

With the acceleration of urbanization, the urban rail transit systems in megacities are becoming increasingly crowded during peak periods, presenting significant challenges in passenger flow management^[1]. Short-term passenger flow forecasting (STPFF) can provide real-time traffic information to assist urban rail transit authorities in managing passenger inflows, optimizing train scheduling, handling emergencies, and helping passengers choose optimal travel times and routes. It plays a crucial role in ensuring the efficiency and safety of urban rail transit. However, due to the complex impacts of factors such as passenger behavior, weather, and unexpected events^[2], passenger flow entering stations exhibits significant nonlinearity and randomness^[3], posing substantial challenges for STPFF. STPFF has attracted widespread attention from researchers due to its practical significance.

In the early stages, traditional statistical methods provided basic forecasting approaches, such as historical data regression analysis^[4], exponential smoothing^[5], and ARIMA models^[6]. These methods rely on fixed model structures and parameters, making it difficult to capture dynamic changes in passenger flow, limiting forecasting accuracy^[7]. With the development of machine learning, models based on machine learning techniques have been introduced for short-term passenger flow forecasting, such as Bayesian Networks^[8], Random Forests (RF)^[9], and Support Vector Machines (SVM)^[10]. These models have some ability to model nonlinear relationships, but they face limitations when considering more complex spatiotemporal correlations^[11].

In recent years, deep learning methods, due to their powerful feature extraction capabilities, have been widely applied in the field of STPFF. Long Short-Term Memory (LSTM) networks, with their recurrent structure and parameter-sharing advantages^[12], are suitable for forecasting passenger flow at a single station^[13]. However, a standalone LSTM model relies on the one-dimensional passenger flow time series provided by Automatic Fare Collection (AFC) data, making it difficult to fully capture the nonlinear characteristics of passenger flow. Convolutional neural networks (CNNs) and LSTM can be combined to address the limitations of single networks. CNNs effectively handle spatial issues^[14], and hybrid models that integrate spatial, variability, and periodic features of passenger flow exhibit better prediction accuracy than baseline models^[15]. However, relying solely on the fusion of spatial and temporal features is inadequate for fully capturing the complex dynamic characteristics of passenger flow under varying external conditions. Emerging research has gradually shifted towards integrating multidimensional influencing factors into hybrid models^[16−20] to further enhance prediction accuracy and model applicability. For instance, hybrid models of Graph Convolutional Networks (GCN) and LSTM that consider station connectivity, weather conditions, and air quality demonstrate high prediction accuracy^[21], but challenges remain in obtaining and processing real-time data, including computation delays and high complexity, making it difficult to meet real-time prediction needs. Additionally, these models often rely on human experience to adjust parameters, which affects their generalization ability and adaptability. In practical applications, balancing prediction accuracy and real-time performance remains an unresolved problem.

Because of the complex impacts of factors such as passenger behavior, weather, and unexpected events^[2], passenger flow entering stations exhibits significant volatility, which becomes increasingly pronounced as the data collection interval is reduced^[22]. It has been demonstrated that short-term passenger flow data typically exhibit nonlinear and chaotic characteristics^[23]. Chaos is the unity of determinism and randomness, simultaneously embodying both global stability and local instability^[24]. In-depth study of chaotic phenomena can help reveal their internally complex yet orderly structure, thereby uncovering the underlying laws behind these seemingly irregular phenomena^[25]. On the one hand, chaotic systems are highly sensitive to initial conditions and small disturbances, resulting in long-term unpredictability. On the other hand, although the trajectories may seem to diverge, they are actually confined by strange attractors, allowing for the identification of their underlying patterns and short-term prediction^[26].
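
To make the notion of sensitivity to initial conditions concrete, the following minimal Python sketch estimates the Lyapunov exponent of the logistic map, a standard textbook chaotic system. It is an illustration only and not the procedure applied to the passenger flow series in this study; the function name `logistic_lyapunov` and all parameter values are assumptions chosen for the example. A positive exponent indicates that nearby trajectories diverge exponentially, whereas a negative exponent indicates convergence to a stable orbit.

```python
import math


def logistic_lyapunov(r: float, x0: float = 0.2,
                      n_transient: int = 1000, n_iter: int = 50_000) -> float:
    """Estimate the Lyapunov exponent of x -> r*x*(1-x).

    The exponent is the orbit average of ln|f'(x)|, with f'(x) = r*(1 - 2x).
    """
    x = x0
    for _ in range(n_transient):          # discard the initial transient
        x = r * x * (1.0 - x)

    total = 0.0
    for _ in range(n_iter):
        deriv = abs(r * (1.0 - 2.0 * x))
        total += math.log(max(deriv, 1e-300))  # guard against log(0)
        x = r * x * (1.0 - x)
    return total / n_iter


# Illustrative parameter values: r = 3.9 lies in the chaotic regime,
# r = 3.2 gives a stable period-2 orbit.
lam_chaotic = logistic_lyapunov(3.9)
lam_periodic = logistic_lyapunov(3.2)
print(f"r=3.9: {lam_chaotic:.3f}, r=3.2: {lam_periodic:.3f}")
```

For an observed series such as passenger counts, the governing map is unknown, so the exponent has to be estimated from the data themselves rather than from an analytical derivative as done here.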

According to Takens' theorem^[27], in chaotic systems, the future state of one dimension depends on interactions with other dimensions. This principle underlies Phase Space Reconstruction (PSR), which reconstructs chaotic attractors in a higher-dimensional space from one-dimensional time series data. PSR reveals the inherent irregularity and self-similarity of the data^[28]. Therefore, PSR serves as a prerequisite for the nonlinear time series analysis and forecasting of data from chaotic systems^[29].
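
The delay-coordinate embedding behind PSR can be sketched as follows. This is a minimal illustration, assuming that an embedding dimension `dim` and a time delay `tau` have already been selected; how they are chosen is not covered here, and the function name `delay_embed` and the sample values are hypothetical.

```python
from typing import List, Sequence


def delay_embed(series: Sequence[float], dim: int, tau: int) -> List[List[float]]:
    """Map a 1-D series to delay vectors [x(t), x(t+tau), ..., x(t+(dim-1)*tau)]."""
    if dim < 1 or tau < 1:
        raise ValueError("dim and tau must be positive integers")
    n_vectors = len(series) - (dim - 1) * tau
    if n_vectors <= 0:
        raise ValueError("series is too short for the requested embedding")
    return [[series[t + k * tau] for k in range(dim)] for t in range(n_vectors)]


# Made-up counts, used only to show the shape of the result.
flows = [10, 12, 15, 11, 9, 14, 18, 16]
vectors = delay_embed(flows, dim=3, tau=2)
for v in vectors:
    print(v)
```

Each row is one point of the reconstructed phase space, so a series of length N yields N − (dim − 1)·tau points; a matrix of this kind is what a downstream feature extractor would receive in place of the raw one-dimensional series.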

At the same time, short-term passenger flow data often exhibits periodicity, with patterns recurring on a daily or weekly basis. This time-dependent behavior indicates that historical data with similar periodic characteristics can assist in predicting future passenger flows. LSTM networks are particularly well-suited for this task, as they can capture both short-term and long-term temporal dependencies by learning the relationships between past and future values over time. Therefore, a forecasting model that integrates both chaotic features extracted by CNN from the reconstructed phase space and temporal dependencies captured by LSTM is expected to yield more accurate results.

This study proposes a novel forecasting model that combines Phase Space Reconstruction (PSR) with the deep learning model CNN-LSTM. The PSR method is employed to transform the original one-dimensional time series into a multi-dimensional phase space, unveiling the chaotic features inherent in the dynamical system. The CNN-LSTM model is utilized to learn both spatial features (phase space features) and temporal features of the passenger flow data. The model's hyperparameters are optimized using the Grey Wolf Optimizer (GWO), further enhancing its performance. Experimental results demonstrate that, compared to single models and other advanced hybrid models, the PSR-CNN-LSTM model exhibits significant advantages in prediction performance, convergence speed, and stability. It effectively uncovers passenger flow patterns in short-term nonlinear and chaotic AFC transaction data, enabling accurate forecasting of short-term station passenger flow.

The contributions and highlights of the study are as follows:

(1) The chaotic characteristics of the passenger flow time series are quantitatively identified by calculating its Lyapunov exponent.

(2) A Phase Space Reconstruction method expands the one-dimensional passenger flow time series into a high-dimensional space, enabling accurate capture of passenger flow volatility while significantly reducing data complexity.

(3) The integration of CNN and LSTM enables comprehensive capture of spatiotemporal features in phase space. CNN effectively extracts spatial features from the reconstructed high-dimensional phase space data, while LSTM handles temporal dependencies within the time series based on phase space features.

(4) The GWO algorithm automatically optimizes the hyperparameters of the proposed forecasting model, adapting the model to varying data in different situations and maintaining stable prediction performance under various uncertainties and dynamic changes.
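
As a rough illustration of the optimizer named in contribution (4), the sketch below implements the basic GWO position update^[37] on a toy two-dimensional objective. In the forecasting setting the objective would presumably be a validation error evaluated as a function of the hyperparameters; here it is replaced by a simple quadratic so that the block runs on its own. The names `grey_wolf_optimize` and `toy_loss`, the population size, and the iteration count are assumptions and not settings taken from this study.

```python
import random
from typing import Callable, List, Sequence, Tuple

Position = List[float]
Scored = Tuple[float, Position]


def _top_three(leaders: List[Scored], wolves: List[Position],
               objective: Callable[[Position], float]) -> List[Scored]:
    """Keep the three best positions seen so far (alpha, beta, delta)."""
    scored = leaders + [(objective(w), list(w)) for w in wolves]
    scored.sort(key=lambda pair: pair[0])
    return scored[:3]


def grey_wolf_optimize(objective: Callable[[Position], float],
                       bounds: Sequence[Tuple[float, float]],
                       n_wolves: int = 20, n_iter: int = 200,
                       seed: int = 0) -> Tuple[Position, float]:
    """Minimise `objective` inside box `bounds` with a basic Grey Wolf Optimizer."""
    rng = random.Random(seed)
    wolves = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_wolves)]
    leaders: List[Scored] = []

    for it in range(n_iter):
        leaders = _top_three(leaders, wolves, objective)
        a = 2.0 * (1.0 - it / n_iter)  # decreases linearly from 2 towards 0
        for wolf in wolves:
            for d, (lo, hi) in enumerate(bounds):
                estimate = 0.0
                for _, leader in leaders:
                    A = 2.0 * a * rng.random() - a      # step coefficient in [-a, a]
                    C = 2.0 * rng.random()              # leader weight in [0, 2]
                    distance = abs(C * leader[d] - wolf[d])
                    estimate += leader[d] - A * distance
                # Average the three leader-guided moves and clip to the bounds.
                wolf[d] = min(max(estimate / len(leaders), lo), hi)

    leaders = _top_three(leaders, wolves, objective)
    best_val, best_pos = leaders[0]
    return best_pos, best_val


def toy_loss(p: Position) -> float:
    """Hypothetical stand-in for a validation loss; minimum at (1, -2)."""
    return (p[0] - 1.0) ** 2 + (p[1] + 2.0) ** 2


best_pos, best_val = grey_wolf_optimize(toy_loss, bounds=[(-5.0, 5.0), (-5.0, 5.0)])
print(best_pos, best_val)
```

When the objective involves training a network, each evaluation is expensive, so the population size and iteration count directly determine the tuning cost.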

Discussion

To further enhance the applicability, robustness, and computational efficiency of the PSR-CNN-LSTM model, this section discusses its potential for generalization to other transit systems, its adaptability to extreme scenarios, and strategies for improving computational efficiency in large-scale applications.

(1) Expanding Applicability to Other Cities and Transit Systems

The model’s strong generalization across different station types in the Shanghai Metro system suggests its potential applicability to other urban transit networks. While the CNN-LSTM architecture and PSR framework remain applicable, future studies should validate the model in different metro systems to assess whether regional variations in passenger behavior, network structure, and operational policies impact predictive performance.

Moreover, its reliance on AFC entry data makes it adaptable to bus-metro transfer networks, requiring only minor modifications to incorporate multi-modal passenger flows.

(2) Future Work on Extreme Scenarios

Although the model performs well under normal conditions, it requires further enhancements to handle extreme events such as accidents, public events, or weather disruptions, which can cause sudden fluctuations in passenger flow.

To improve adaptability, anomaly detection mechanisms can be integrated directly into the forecasting pipeline. An LSTM-Autoencoder (LSTM-AE) can learn normal passenger flow patterns and detect anomalies based on reconstruction errors, allowing the forecasting model to dynamically adjust predictions in response to unexpected disruptions.
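
The decision step of such a detector can be sketched independently of the autoencoder itself. The example below assumes that reconstructions are already available from some model and only shows how reconstruction errors might be turned into anomaly flags; the mean-plus-k-standard-deviations rule, the function names, and all numbers are assumptions made for illustration, not part of the proposed model.

```python
from statistics import mean, stdev
from typing import List, Sequence


def fit_threshold(normal_errors: Sequence[float], k: float = 3.0) -> float:
    """Threshold = mean + k * sample standard deviation of errors on normal data."""
    return mean(normal_errors) + k * stdev(normal_errors)


def flag_anomalies(observed: Sequence[float], reconstructed: Sequence[float],
                   threshold: float) -> List[bool]:
    """Flag time steps whose absolute reconstruction error exceeds the threshold."""
    return [abs(o - r) > threshold for o, r in zip(observed, reconstructed)]


# Made-up absolute reconstruction errors collected on normal days.
normal_errors = [4, 6, 5, 7, 3, 5, 6, 4]
threshold = fit_threshold(normal_errors)

# Made-up observed counts and their (hypothetical) reconstructions.
observed = [100, 120, 300, 110]
reconstructed = [98, 125, 115, 112]
flags = flag_anomalies(observed, reconstructed, threshold)
print(threshold, flags)
```

A flagged interval could then trigger a fallback forecast or a model update, depending on how the operator chooses to respond.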

Additionally, incorporating real-time external data (e.g., weather conditions, social media activity, and transit incident reports) could further enhance the model’s accuracy under extreme conditions.

(3) Future Work on Computational Efficiency

Despite its strong predictive accuracy, the model’s computational demands can be challenging for large-scale, real-time applications. A hybrid architecture could improve efficiency by using lighter-weight models (without GWO optimization) for low-complexity, low-traffic stations while reserving the full PSR-CNN-LSTM model for critical hubs with volatile passenger flow.

Additionally, applying dimensionality reduction techniques to the reconstructed phase space could reduce computational costs while preserving essential chaotic features. By integrating these strategies, the system can dynamically balance efficiency and accuracy, supporting scalable, real-time forecasting across multiple transit stations.

[1]	Ke J, Feng S, Zhu Z, Yang H, Ye J. 2021. Joint predictions of multi-modal ride-hailing demands: a deep multi-task multi-graph learning-based approach. Transportation Research Part C: Emerging Technologies 127:103063 doi: 10.1016/j.trc.2021.103063
[2]	Li W, Zhou M, Dong H. 2020. CPT model-based prediction of the temporal and spatial distributions of passenger flow for urban rail transit under emergency conditions. Journal of Advanced Transportation 2020:8850541 doi: 10.1155/2020/8850541
[3]	Wang Y, Zheng D, Luo SM, Zhan DN, Nie P. 2013. The research of railway passenger flow prediction model based on BP neural network. Advanced Materials Research 605–607:2366−69 doi: 10.4028/www.scientific.net/amr.605-607.2366
[4]	Chien SIJ, Kuchipudi CM. 2003. Dynamic travel time prediction with real-time and historic data. Journal of Transportation Engineering 129(6):608−16 doi: 10.1061/(ASCE)0733-947X(2003)129:6(608)
[5]	Ge SY, Zheng CJ, Hou MM. 2013. Forecast of bus passenger traffic based on exponential smoothing and trend moving average method. Applied Mechanics and Materials 433:1374−78
[6]	Gu Y, Han Y, Fang XL. 2011. Method of hub station passenger flow forecasting based on ARMA model. Journal of Transport Information and Safety 29(2):5−9 doi: 10.3963/j.ISSN1674-4861.2011.02.002
[7]	Zhang J, Chen Y, Panchamy K, Jin G, Wang C, et al. 2023. Attention-based multi-step short-term passenger flow spatial-temporal integrated prediction model in URT systems. Journal of Geo-information Science 25(4):698−713 doi: 10.12082/dqxxkx.2023.220817
[8]	Roos J, Bonnevay S, Gavin G. 2018. Dynamic Bayesian networks with Gaussian mixture models for short-term passenger flow forecasting. In: Proceedings of the 2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE), Nanjing, China, 24-26 November 2017. USA: IEEE. pp. 1–8. doi: 10.1109/ISKE.2017.8258756
[9]	Zarei N, Ghayour MA, Hashemi S. 2013. Road traffic prediction using context-aware random forest based on volatility nature of traffic flows. Intelligent Information and Database Systems. ACIIDS 2013. Lecture Notes in Computer Science, vol 7802. Berlin, Heidelberg: Springer. pp. 196–205 doi: 10.1007/978-3-642-36546-1_21
[10]	Xue F, Yao E. 2022. Adopting a random forest approach to model household residential relocation behavior. Cities 125:103625 doi: 10.1016/j.cities.2022.103625
[11]	Long XQ, Li J, Chen YR. 2019. Metro short-term traffic flow prediction with deep learning. Control and Decision 34:1589−600 doi: 10.13195/j.kzyjc.2018.1393
[12]	Yang F, Song X, Xu F, Tsui KL. 2019. State-of-charge estimation of lithium-ion batteries via long short-term memory network. IEEE Access 7:53792−99 doi: 10.1109/ACCESS.2019.2912803
[13]	Shao H, Soong BH. 2016. Traffic flow prediction with Long Short-Term Memory Networks (LSTMs). 2016 IEEE Region 10 Conference (TENCON), Singapore, 22–25 November 2016. pp. 2986–89 doi: 10.1109/TENCON.2016.7848593
[14]	Ma X, Dai Z, He Z, Ma J, Wang Y, et al. 2017. Learning traffic as images: a deep convolutional neural network for large-scale transportation network speed prediction. Sensors 17(4):818 doi: 10.3390/s17040818
[15]	Wu Y, Tan H. 2016. Short-term traffic flow forecasting with spatial-temporal correlation in a hybrid deep learning framework. arXiv doi: 10.48550/arXiv.1612.01022
[16]	Qi Q, Cheng R, Ge H. 2023. Short-term inbound rail transit passenger flow prediction based on BILSTM model and influence factor analysis. Digital Transportation and Safety 2(1):12−22 doi: 10.48130/dts-2023-0002
[17]	Yu B, Lee Y, Sohn K. 2020. Forecasting road traffic speeds by considering area-wide spatio-temporal dependencies based on a graph convolutional neural network (GCN). Transportation Research Part C: Emerging Technologies 114:189−204 doi: 10.1016/j.trc.2020.02.013
[18]	Bogaerts T, Masegosa AD, Angarita-Zapata JS, Onieva E, Hellinckx P. 2020. A graph CNN-LSTM neural network for short and long-term traffic forecasting based on trajectory data. Transportation Research Part C: Emerging Technologies 112:62−77 doi: 10.1016/j.trc.2020.01.010
[19]	Zhao L, Song Y, Zhang C, Liu Y, Wang P, et al. 2020. T-GCN: a temporal graph convolutional network for traffic prediction. IEEE Transactions on Intelligent Transportation Systems 21(9):3848−58 doi: 10.1109/TITS.2019.2935152
[20]	Cui Z, Henrickson KC, Ke R, Wang Y. 2020. Traffic graph convolutional recurrent neural network: a deep learning framework for network-scale traffic learning and forecasting. IEEE Transactions on Intelligent Transportation Systems 21:4883−94 doi: 10.1109/TITS.2019.2950416
[21]	Zhang J, Chen F, Cui Z, Guo Y, Zhu Y. 2021. Deep learning architecture for short-term passenger flow forecasting in urban rail transit. IEEE Transactions on Intelligent Transportation Systems 22(11):1502−10 doi: 10.1109/TITS.2020.3000761
[22]	Xie Y, Zhang Y, Ye Z. 2007. Short-term traffic volume forecasting using Kalman filter with discrete wavelet decomposition. Computer-Aided Civil and Infrastructure Engineering 22(5):326−34 doi: 10.1111/j.1467-8667.2007.00489.x
[23]	Shang P, Li X, Kamae S. 2005. Chaotic analysis of traffic time series. Chaos, Solitons & Fractals 25(1):121−28 doi: 10.1016/j.chaos.2004.09.104
[24]	Famourzadeh V, Sefidkhosh M. 2019. Straddling between determinism and randomness: chaos theory vis-a-vis Leibniz. arXiv 1909.13635v1 doi: 10.48550/arXiv.1909.13635
[25]	Hsieh DA. 1991. Chaos and nonlinear dynamics: application to financial markets. The Journal of Finance 46(5):1839−77 doi: 10.1111/j.1540-6261.1991.tb04646.x
[26]	Kang Y, Li X, Lu Y, Yang C. 2008. Application of chaotic phase space reconstruction into nonlinear time series prediction in deep rock mass. Proc. 5th International Symposium on Knowledge Discovery and Data Mining (FSKD), Jinan, China, 18-20 October 2008. USA: IEEE. pp. 593–97 doi: 10.1109/FSKD.2008.423
[27]	Takens F. 1981. Detecting strange attractors in turbulence. Dynamical Systems and Turbulence, Warwick 1980. Lecture Notes in Mathematics. vol 898. Berlin, Heidelberg: Springer. pp. 366−81 doi: 10.1007/BFb0091924
[28]	Kugiumtzis D. 1996. State space reconstruction parameters in the analysis of chaotic time series-the role of the time window length. Physica D: Nonlinear Phenomena 95(1):13−28 doi: 10.1016/0167-2789(96)00054-1
[29]	Shi Z, Zhang N, Schonfeld PM, Zhang J. 2020. Short-term metro passenger flow forecasting using ensemble-chaos support vector regression. Transportmetrica A: Transport Science 16(2):194−212 doi: 10.1080/23249935.2019.1692956
[30]	Jin J, Xu Z, Li C, Miao W, Xiao J, et al. 2022. Rolling bearing fault diagnosis based on deep learning and chaotic feature fusion. Control Theory & Applications 39(1):109−16 doi: 10.7641/CTA.2021.10177
[31]	Zhang WC, Tan SC, Gao PZ. 2013. Chaotic forecasting of natural circulation flow instabilities under rolling motion based on Lyapunov exponents. Acta Physica Sinica 62(6):060502 doi: 10.7498/aps.62.060502
[32]	Smale S. 1967. Differentiable dynamical systems. Bulletin of the American Mathematical Society 73(6):747−817 doi: 10.1090/s0002-9904-1967-11798-1
[33]	Sterman JD. 1988. Deterministic chaos in models of human behavior: methodological issues and experimental results. System Dynamics Review 4(1):148−78 doi: 10.1002/sdr.4260040109
[34]	Packard N, Crutchfield JP, Shaw R. 1980. Deterministic chaos in dynamical systems. Physical Review Letters 45:712 doi: 10.1103/PhysRevLett.45.712
[35]	Martinerie JM, Albano AM, Mees AI, Rapp PE. 1992. Chaos and dynamics of a time-delayed system. Physical Review A 45:7058 doi: 10.1103/PhysRevA.45.7058
[36]	Liangyue C. 1997. Nonlinear dynamics of a time-delay system. Physica D: Nonlinear Phenomena 110:43 doi: 10.1016/S0167-2789(97)00118-8
[37]	Mirjalili S, Mirjalili SM, Lewis A. 2014. Grey wolf optimizer. Advances in Engineering Software 69:46−61 doi: 10.1016/j.advengsoft.2013.12.007
[38]	Gu R, Chen J, Hong R, Wang H, Wu W. 2020. Incipient fault diagnosis of rolling bearings based on adaptive variational mode decomposition and Teager energy operator. Measurement 149:106941 doi: 10.1016/j.measurement.2019.106941
[39]	Xiong J, Sun Y, Sun J, Wan Y, Yu G. 2024. Sparse temporal data-driven SSA-CNN-LSTM-based fault prediction of electromechanical equipment in rail transit stations. Applied Sciences 14(18):8156 doi: 10.3390/app14188156
[40]	Gottam S, Nanda SJ, Maddila RK. 2021. A CNN-LSTM model trained with grey wolf optimizer for prediction of household power consumption. 2021 IEEE International Symposium on Smart Electronic Systems (iSES), Jaipur, India, 18−22 December 2021. pp. 355 doi: 10.1109/iSES52644.2021.00089

Card no	Date	Time	Line and station	Mode	Cost (CNY)	Type
2201252167	04-01-15	19:20:33	Line 7 Changzhong Road	Subway	4.0	Full fare
2702155929	04-01-15	12:52:38	Songjiang Bus 43	Bus	1.0	Full fare
2201252167	04-01-15	08:55:44	Line 1 Baoshan Highway	Subway	3.0	Full fare
…	…	…	…	…	…	…
602141128	04-01-15	09:07:57	Songjiang Bus 43	Bus	0	Discount

Line	Subway station	Date	Time period index	Inbound passengers
1	Baoan Road	04-01-15	37	164
1	Baoan Road	04-01-15	38	397
1	…	…	…	…
1	Baoan Road	04-01-15	55	1,316

Line	Station	No.	Maximum Lyapunov exponent (MLE)
1	Bao'an Highway	1	0.025
1	Caobao Road	2	0.026
1	Changshu Road	3	0.035
1	Fujin Road	4	0.022
1	Gongfu Xincun	5	0.031
1	Gongkang Road	6	0.030
1	Shanghai Railway Station	7	0.040

Parameter	Value
Learning rate	0.00283
Convolutional kernels	35
Hidden neurons	25
Dropout rate	0.2
Kernel size	6 × 6
Pooling size	4 × 4
