An improved particle swarm optimization algorithm based urban rail passenger flow prediction model: a case study in Beijing, China

Song Hu; Meichen Ji; Zheng Chang; Haipeng Wang; Xiangfei Kong; Song Hu; Meichen Ji; Zheng Chang; Haipeng Wang; Xiangfei Kong

doi:10.48130/dts-0025-0005

To accurately predict the passenger flow of urban rail transit under different external conditions, the distribution characteristics of urban rail passenger flow were analyzed using the AFC data of rail transit, and three influencing factors of urban rail passenger flow were extracted. The typical support vector regression (SVR) algorithm was utilized to construct the passenger flow prediction model of the urban rail, then we proposed an improved particle swarm optimization (IPSO) algorithm for optimizing the prediction model. Finally, the prediction accuracy of the proposed model was verified by comparative analysis. The results show that the IPSO-SVR passenger flow prediction model has better prediction accuracy compared with the SVR and long short-term memory networks (LSTM) models, also the traditional grid optimization method, the mean square error (MSE), and relative accuracy (RA) are 1.54% and 92.37%. The validation of model performance is carried out compared with three other models in a previous study. There is a positive correlation between the model prediction errors and the scale of urban rail transit passenger flow. The three influencing variables of the time segments, working day type, and weather can also effectively characterize the coupling characteristics of urban rail passenger flow, and improve the model prediction accuracy. The results have important application value for evaluating passenger flow status, improving the operation quality, and operation organization of urban rail transit.

HTML

Introduction

Recently, unprecedented data availability and rapid development of machine learning techniques have led to tremendous progress in the intelligent transportation systems field^[1]. Accurate rail transit passenger flow prediction, as a prerequisite for real-time traffic signal control, traffic allocation, path guidance, automatic navigation, and determination of residential travel connection schemes in intelligent transportation systems, is currently a research hotspot in the transportation field^[2]. With the continuous development of urban rail transit systems in China, the diversification of passenger flow data sources and the massive scale of data, such as Automatic Fare Collection (AFC) system data, smart card data, and mobile phone signaling data, provide a data foundation for precise passenger flow prediction in rail transit. However, the passenger flow in urban rail transit is characterized by non-linearity, non-stationarity, and randomness, and is influenced by various internal and external factors, making accurate prediction challenging. Therefore, with the wide application of big data, artificial intelligence, cloud computing, and other emerging technologies in the field of rail transit, it is of great significance to improve the operation organization efficiency and service level of urban rail transit by using intelligent algorithms to carry out the short-term forecast of rail transit travel demand.

The current research on urban rail transit passenger flow prediction has achieved significant results in aspects such as influencing factors and prediction methods. In terms of influencing factors, Hui et al.^[3] analyzed the coupled spatio-temporal characteristics of subway passenger flow using Xi'an Metro Line 1 (Xi'an, China) as a case study. They identified five influencing factors: holidays, non-holidays, time periods, stations, and weather, and then analyzed their correlation coefficients with passenger flow. Pereira et al.^[4] examined the different impacts of various types of activity days on public transport passenger flow, classifying 59 special events and analyzing the passenger flow of surrounding subway and bus services in 30-min intervals. Liu et al.^[5] established three LSTM models to extract the hourly, daily, and weekly characteristics of subway passenger flow. By incorporating factors like weather, workdays, precipitation, subway operating times, and inter-station travel duration, they predicted passenger flow at transfer stations and regular stations. Wu et al.^[6] used the Pearson correlation coefficient to analyze the short-term influences on rail transit passenger flow, such as weather conditions, historical passenger volumes, peak periods, and workdays, thus forecasting short-term urban rail transit passenger flow. Hao et al.^[7] comprehensively analyzed the impact of external factors like weather, events, workdays, and time periods on urban rail transit passenger flow. Incorporating both weekday and daily external factors into the LSTM model, they demonstrated that these factors significantly enhance the model's predictive performance. Zhang et al.^[8] identified weather conditions, historical passenger flow, peak periods or not, and weekdays or not, as the factors influencing the short-time passenger flow of rail transit, then proposed a short-time passenger flow prediction method for urban rail transit based on long and short-time memory neural networks. It is evident that urban rail transit passenger flow varies with time and space and is influenced by various external factors such as weather conditions, holiday schedules, and major events. Xue et al.^[9] adopted the Pearson correlation coefficient to determine the influencing factors of short-term passenger flow of rail transit, such as the weather conditions, historical passenger flow, whether it is a peak time period, whether it is a working day, etc. However, too many influencing factors as input variables can lead to model overfitting, while too few may not fully reflect the relationship between passenger flow changes and influences.

Regarding prediction methods, some studies have used traditional methods like weighted regression models^[10], exponential smoothing, and autoregressive integrated moving average (ARIMA) models^[11] for passenger flow analysis. However, due to the non-linear, non-stationary, and random nature of urban rail transit passenger flow, traditional methods and models often struggle to capture the changing patterns and intrinsic relationships of passenger flow, leading to low prediction accuracy. Wang et al.^[12] proposed a short-term rail transit passenger flow prediction model that integrates attention mechanisms and spatiotemporal graph convolutional gated recurrent units, based on travel time and OD volume to construct adjacency matrices. The model's prediction accuracy surpassed that of ARIMA and Support Vector Regression models. Chen et al.^[13] combined GCN with LSTM/GRU to establish a road traffic speed/volume prediction model, finding that this hybrid model was more effective and performed better than standalone LSTM or GRU models. Du et al.^[14] embedded a 'time-feature' attention mechanism into an LSTM time series prediction model and compared its superiority against existing representative methods such as SVM, BPNN, ARIMA, and standard LSTM. Qi et al.^[15] uses analytic hierarchy processes (AHP) analysis to scientifically select the factor of time characteristics, then BILSTM based model considering the hourly travel characteristics factors is proposed to predict the inbound rail transit passenger flow. Lu et al.^[16] analyzed the passenger flow-land use mapping relationship between new stations and existing stations by the clustering algorithm, and established a real-time station passenger flow prediction model for the early stage of new station opening by an improved nonparametric regression algorithm. Zhang et al.^[17] developed a short-time prediction model of rail passenger flow based on the Light Gradient Boosting Machine Model, and the accuracy of the proposed algorithm is compared with algorithms such as support vector machine and BP neural network models. Weng et al.^[18] used the density peak clustering algorithm to identify the associated road chain set with strong spatial and temporal correlation of traffic flow, and developed a short-term traffic flow prediction method based on the long short-term memory neural network of the road chain groups division (RCGD-LSTMNN). Qi et al.^[19] proposed a deep learning approach based on a spatiotemporal graph convolutional network for long-term traffic flow prediction with multiple factors. However, the analysis and adjustment of the structure and parameters of the prediction model in many studies are not deep enough in the above-related studies, and the accuracy of the algorithm needs to be improved.

In general, this paper addresses the issue of urban rail passenger flow prediction by collecting and analyzing the characteristics of urban rail AFC card data, as well as information on time periods, types of workdays, and weather as influencing factors. Utilizing an improved Particle Swarm Optimization algorithm (IPSO) to optimize the Support Vector Regression (SVR) algorithm, this study proposes an IPSO-SVR-based urban rail passenger flow prediction model, achieving precise prediction of passenger flow. This research provides technical support for improving the passenger flow organization and operational dispatch of urban rail transit systems.

Construction of a rail transit passenger flow prediction model

[1]	Guo M, Sun Z, Pan J, Xu M. 2008. Research on short-time traffic flow forecasting method. Application Research of Computers 25(9):2676−78 doi: 10.3969/j.issn.1001-3695.2008.09.031 CrossRef Google Scholar
[2]	Xing Z, Huang M, Peng D. 2023. Overview of machine learning-based traffic flow prediction. Digital Transportation and Safety 2(3):164−75 doi: 10.48130/dts-2023-0013 CrossRef Google Scholar
[3]	Hui Y, Wang Y, Peng H, Hou S. 2021. Subway passenger flow prediction based on optimized PSO-BP algorithm with coupled spatial-temporal characteristics. Journal of Traffic and Transportation Engineering 21:210−22 doi: 10.19818/j.cnki.1671-1637.2021.04.016 CrossRef Google Scholar
[4]	Pereira FC, Rodrigues F, Ben-Akiva M. 2015. Using data from the web to predict public transport arrivals under special events scenarios. Journal of Intelligent Transportation Systems 19:273−88 doi: 10.1080/15472450.2013.868284 CrossRef Google Scholar
[5]	Liu Y, Liu Z, Jia R. 2019. DeepPF: a deep learning based architecture for metro passenger flow prediction. Transportation Research Part C: Emerging Technologies 101:18−34 doi: 10.1016/j.trc.2019.01.027 CrossRef Google Scholar
[6]	Meng P, Li X, Jia H, Li Y. 2018. Short-time rail transit passenger flow real-time prediction based on moving average. Journal of Jilin University (Engineering and Technology Edition) 48(2):448−53 doi: 10.13229/j.cnki.jdxbgxb20161256 CrossRef Google Scholar
[7]	Hao S, Lee DH, Zhao D. 2019. Sequence to sequence learning with attention mechanism for short-term passenger flow prediction in large-scale metro system. Transportation Research Part C: Emerging Technologies 107:287−300 doi: 10.1016/j.trc.2019.08.005 CrossRef Google Scholar
[8]	Zhang H, Gao Z, Li J, Wang C, Pan Y, et al. 2023. Short-term passenger flow forecast of urban rail transit based on recurrent neural network. Journal of Jilin University (Engineering and Technology Edition) 53(2):430−38 doi: 10.13229/j.cnki.jdxbgxb20210720 CrossRef Google Scholar
[9]	Xue Q, Zhang W, Ding M, Yang X, Wu J, et al. 2023. Passenger flow forecasting approaches for urban rail transit: a survey. International Journal of General Systems 52:919−47 doi: 10.1080/03081079.2023.2231133 CrossRef Google Scholar
[10]	Qi C, Hu H. 2021. Research on ridership forecast of urban rail transit station based on mixed geographic weighted regression. Journal of Railway Science and Engineering 18(7):1903−9 doi: 10.19713/j.cnki.43-1423/u.T20200800 CrossRef Google Scholar
[11]	Zhang G, Jin H. 2022. Research on the prediction of short-term passenger flow of urban rail transit based on improved ARIMA mode. Computer applications and software 39(1):339−44 doi: 10.3969/j.issn.1000-386x.2022.01.052 CrossRef Google Scholar
[12]	Wang X, Xu X, Wu Y, Liu J. 2022. Short term passenger flow forecasting of urban rail transit based on hybrid deep learning model. Journal of Railway Science and Engineering 19(12):3557−68 doi: 10.19713/j.cnki.43-1423/u.T20220158 CrossRef Google Scholar
[13]	Chen H, Shao Y, Ao G, Zhang H. 2021. Speed prediction by online map-based GCN-LSTM neural network. Journal of Traffic and Transportation Engineering 21(4):183−96 doi: 10.19818/j.cnki.1671-1637.2021.04.014 CrossRef Google Scholar
[14]	Du W, Shi W, Liao S, Zhu X. 2022. Passenger flow forecasting of airport express based on time and feature cooperative attention. Journal of Beijing University of Aeronautics and Astronautics 48(9):1605−12 doi: 10.13700/j.bh.1001-5965.2022.0321 CrossRef Google Scholar
[15]	Qi Q, Cheng R, Ge H. 2023. Short-term inbound rail transit passenger flow prediction based on BILSTM model and influence factor analysis. Digital Transportation and Safety 2(1):12−22 doi: 10.48130/dts-2023-0002 CrossRef Google Scholar
[16]	Lu T, Yao E, Liu S, Zhou W. 2020. Short-time forecast of entrance and exit passenger flow for new line of urban rail transit during growth period. Journal of the China Railway Society 2020:19−28 doi: 10.3969/j.issn.1001-8360.2020.05.003 CrossRef Google Scholar
[17]	Zhang Z, Wang C, Gao Y, Chen J, Zang Y. 2020. Short-term passenger flow forecast of rail transit station based on mic feature selection and ST-lightGBM considering transfer passenger flow. Scientific Programming 2020:3180628.1−3180628.15 doi: 10.1155/2020/3180628 CrossRef Google Scholar
[18]	Weng J, Wei R, He H, Xu H, Wang J. 2023. Urban road network short-term traffic flow prediction model based on associated road chain group. Journal of Jilin University (Engineering and Technology Edition) 53(11):3104−12 doi: 10.13229/j.cnki.jdxbgxb.20211391 CrossRef Google Scholar
[19]	Qi X, Mei G, Tu J, Xi N, Piccialli F. 2023. A deep learning approach for long-term traffic flow prediction with multifactor fusion using spatiotemporal graph convolutional network. IEEE transactions on intelligent transportation systems 24(8):8687−700 doi: 10.1109/TITS.2022.3201879 CrossRef Google Scholar
[20]	Zheng X L, Chen H L, Lai W H. 2022. Prediction of mobile network traffic by SVR with optimized parameter. Computer Applications and Software 39(9):278−84 doi: 10.3969/j.issn.1000-386x.2022.09.042 CrossRef Google Scholar
[21]	Weng J, Feng K, Fu Y, Wang J, Mao L. 2023. Extreme gradient boosting algorithm based urban daily traffic index prediction model: a case study of Beijing, China. Digital Transportation and Safety 2(3):220−28 doi: 10.48130/DTS-2023-0018 CrossRef Google Scholar

Field name	Field meaning	Data samples
CARD_ID	Card number	***180025
CARD_TYPE	Card type	2
ENTYR_TIME	Entry time	2019***568253
ENTYR_LINE_NUM	Code of entry time	4
ENTYR_STATION_NUM	Code of entry station	5
EXIT_TIME	Exit time	2019***315549
EXIT_LINE_NUM	Code of exit time	4
EXIT_STATION_NUM	Code of exit station	12

Field	Field name	Data example
DATE	Date	2019/9/11
PRCP	1-h precipitation (mm)	0
HUM	Relative humidity (%)	56.5
TEMP	Temperature	1.6
SPD	10-min wind speed (m/s)	4.8
PRE	Atmospheric pressure (HPa)	1014

{{lists.name}}

An improved particle swarm optimization algorithm based urban rail passenger flow prediction model: a case study in Beijing, China

Abstract

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors