Pedestrian Flow Prediction in Open Public Places Using Graph Convolutional Network

Liu, Menghang; Li, Luning; Li, Qiang; Bai, Yu; Hu, Cheng

doi:10.3390/ijgi10070455


¹ Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China

² State Key Laboratory of Earth Surface Processes & Resource Ecology, Beijing Normal University, Beijing 100875, China

³ Safety & Emergency Management Lab, Beijing Municipal Institute of Labor Protection, Beijing 100054, China

* Author to whom correspondence should be addressed.

ISPRS Int. J. Geo-Inf. 2021, 10(7), 455; https://doi.org/10.3390/ijgi10070455

Submission received: 12 May 2021 / Revised: 25 June 2021 / Accepted: 1 July 2021 / Published: 2 July 2021

(This article belongs to the Special Issue Spatial Data Science and Artificial Intelligence for Human Mobility Research)


Abstract:

Open public places, such as pedestrian streets, parks, and squares, become vulnerable when pedestrians throng the sidewalks. The crowd count changes dynamically over time under various external factors, such as the surroundings, weekends, and peak hours, so accurate and timely prediction of the crowd count is essential. To address this issue, this study introduces the graph convolutional network (GCN), a network-based model, to predict the crowd flow in a walking street. Compared with grid-based methods, the model can process road network graphs directly. Experiments show that the GCN model and its extension STGCN consistently and significantly outperform five other baseline models, namely HA, ARIMA, SVM, CNN and LSTM, in terms of RMSE, MAE and R². Considering computational efficiency, the standard GCN model was selected to predict the crowd. The results show that the model achieves higher prediction precision on weekends and during peak hours, with R² above 0.9, indicating that the GCN model captures pedestrian features in the road network effectively, especially during periods with massive crowds. The results provide practical references for city managers to alleviate road congestion and help pedestrians plan better and save travel time.

Keywords:

pedestrian flow prediction; graph convolutional network (GCN); open public places; model performance

1. Introduction

As city life grows ever richer, open public places, such as pedestrian streets, commercial streets, parks, and squares, have gradually become an important part of people’s lives [1]. These open places, which lack a definite spatial boundary, are prone to overcrowding when massive numbers of pedestrians rush in within a short period, which can give rise to evacuation problems and lead to stampedes [2,3]. In a stampede, people in a huge crowd obstruct each other and are crushed by the shock waves building up in the crowd, which may cause clogging at bottlenecks. People who lose their balance and fall become obstacles for others, which can produce piles of fallen people [4]. Understanding the dynamics of pedestrian flow in open public places is therefore of great concern. It can not only help city managers implement prevention strategies to alleviate road congestion, but also provide useful information for travelers to choose appropriate routes and improve travel efficiency.

Pedestrian flow prediction can be defined as follows. Given a sequence of observed flow data in the road network, the task is to predict the pedestrian flow in the next moments [5,6,7,8]. Pedestrian flow is affected by miscellaneous factors, which pose great challenges to prediction [9,10]. In terms of temporal dependencies, the crowd flow tends to repeat over time, for example across weekdays and weekends and across the rush hours of the day. In terms of spatial dependencies, the flow in a region is more similar and more relevant to that of its neighborhood than to that of distant regions. In addition, weather conditions influence pedestrian features [11].

Existing research uses three kinds of forecasting methods, namely traditional statistical models, machine learning-based models and deep learning-based models. In the early stages, classic forecasting approaches were mainly based on the assumptions of linearity and stationarity to infer future pedestrian trends [12,13,14,15,16]. These models consider the dynamic change of historical data and extract crowd features for the prediction task, but they require considerable effort on parameter inference, which may result in lower prediction accuracy and efficiency. Several non-linear prediction models from the artificial intelligence field, such as the Gaussian maximum likelihood model [17], Bayesian networks [18], decision trees [19,20], support vector machines (SVM) [21,22] and neural networks [23,24,25,26], have shown great application prospects and become widely popular in crowd flow prediction. The machine learning-based models discover complex non-linearities in data and outperform the traditional methods with lower error rates and higher accuracy. Although the above methods have produced compelling results, they remain unsatisfactory for the rapid development of intelligent transportation systems in practice. Recently, deep learning has made groundbreaking progress in classification tasks, pattern recognition, and natural language processing [27]. Owing to their high computational efficiency and interpretability, deep learning models, such as the convolutional neural network (CNN) [28,29,30,31,32] and the recurrent neural network (RNN) [33,34,35,36], have been widely applied to capture crowd flow features over a square tessellation of tiles.

However, the above methods assume a convolution filter in the form of a grid structure and ignore the topological structure of the physical road network. In the real world, the road network cannot be regarded as a regular gridded structure [37,38,39]. With their powerful ability to capture the spatio-temporal features of graph-structured data, the graph convolutional network (GCN) and its extensions have been widely applied to prediction tasks [40,41,42,43,44,45,46]. By constructing the road network as a graph and aggregating the features of each node’s neighborhood, the GCN model can capture deep characteristics of the crowd data across irregular regions. Many researchers have used bicycle-sharing trajectories [47,48], mobile phone positioning data [49,50], taxi GPS records [51,52], or passenger counts on public transportation systems (e.g., metro, railway or airport) [53,54,55,56,57] to predict the crowd flow in a city. Research on pedestrians in open public spaces, however, remains relatively scarce.

In this study, we introduce the GCN model to predict the pedestrian flow in a walking street in the Shenzhen metropolitan area. The GCN model addresses the problem that a CNN cannot be applied to the graph structure of a road network. The graph is built from the existing road structure, and the spatial dependency of the crowd is captured by calculating a weight coefficient between nodes. We further compare the GCN model with baseline methods to validate its performance in pedestrian flow prediction. The experiments show that the GCN model improves the prediction precision and decreases the prediction error. Accurate and timely pedestrian flow predictions can help managers take precautions in advance and ensure public safety, which is beneficial for building a smart city. Since the pedestrian flow can be affected by temporal dependencies and other factors, we conduct a sensitivity analysis of the GCN model that considers related factors such as weekends and weekdays, the rush hours of the day and the weather conditions. We find that the model can capture pedestrian congestion peaks with high accuracy. Understanding the dynamics of pedestrian flow can help crowd control systems manage crowds and direct optimal detours in advance.

The main contributions of this paper are as follows:

(1): We employ the GCN model to predict the pedestrian flow. This graph-structure-based deep learning model, in which the detectors are regarded as nodes and the edges represent the relationships in the road network, can capture the complex topological dependency. Moreover, we assign different weights to road segments to identify the influence of the road network structure and capture the spatial dependencies.
(2): We compare the GCN model with baseline methods selected from the existing methods to validate its performance in pedestrian flow prediction in terms of three evaluation metrics. The experimental results show that proper integration of the road topology can considerably improve the pedestrian flow prediction precision in real-world applications.
(3): We further conduct comparative experiments of pedestrian flow prediction between weekdays and weekends, and between different hours of the day, to capture the temporal dependencies. A sensitivity analysis of the effect of weather conditions is also conducted. The robustness of the GCN model in predicting the pedestrian flow can help practitioners and managers improve road efficiency.

The remainder of this paper is organized as follows. Section 2 details the study area and the data preprocessing procedures, and also formulates the GCN model. Section 3 presents the experimental results. Section 4 discusses the advantages and sensitivity of the model. The conclusions are presented in Section 5.

2. Materials and Methods

2.1. Study Area

This study predicts the pedestrian flow in an open public place in the city of Shenzhen. Shenzhen, a vice-provincial level city in the Pearl River Delta, was designated as the first special economic zone in China on 26 August 1980 and has gradually become a pioneering international metropolitan area following the Reform and Opening-up policy over the past 40 years. As one of the cities with the greatest economic vitality and the fastest population growth in China, the gross domestic product of Shenzhen was 2692.7 billion RMB in 2019 and the population was 13.44 million, ranking third among all cities in China. It includes 10 administrative districts with an area of 1997.47 km². In the downtown areas of Shenzhen, namely the Nanshan, Futian and Luohu districts, the population density is above 20 thousand people per square kilometer.

The Dongmen walking street, located in the center of Luohu district and covering an area of 176 thousand square meters, is a time-honored and large-scale commercial area in the city of Shenzhen. As an open place with various public service functions of shopping, leisure and tourism, the Dongmen walking street bustles with a large crowd of travelers during peak hours. On a typical holiday, the crowd count can easily reach 300 thousand per day. For example, on 25 May 2014, a man suspected of having a mental illness overturned display shelves in a shopping mall and ran away. The man ran and yelled, causing the pedestrians around him to flee in disorder, and dozens of people suffered minor injuries before security officers caught him. As a result, many retail shops in the prosperous commercial district were closed on weekends [58]. Thus, there has been considerable safety concern and a need for pedestrian flow prediction and management in this area.

As shown in Figure 1, the road network in this area is rather complex and contains many junctions. There are several office buildings and hotels in the south of the walking street, where the flow is steady and free of overcrowding risk. Compared with the southern areas, the northern regions contain a mass of shopping malls, recreation facilities, snack bars and tourist sites, which attract a large number of sightseeing visitors. The high crowd counts in these areas are likely to induce stampedes. Taking this into consideration, the landmarks along the main road and the crossroads in the north were considered the optimal locations to deploy the camera detectors. The metro stations and bus stops in the study area were also used as installation locations, such as detectors 1 and 25. Consequently, 25 camera detectors, denoted as 1, 2, …, 25, were installed along the roads to capture the dynamic distribution of pedestrians.

2.2. Pedestrian Flow Data

The pedestrian flow data describe how the pedestrian volume at each detector changes over time, expressed as the number of pedestrians within the defined monitoring area. During the study period from 1 July to 30 September 2020, we collected the pedestrian flow data at an interval of 1 min. Examples of the pedestrian flow records at the junctions (i.e., detectors 6, 8, 16, 21 and 24) are shown in Table 1.

Since the detectors covered different monitoring areas along the roads, the crowd count of each detector was normalized by dividing by the monitoring area and expressed as the number of pedestrians per 100 square meters. Figure 2 presents the variation of the crowd count at the junctions and the average count of all the detectors from 1 to 14 July 2020. The pedestrian flow shows a marked weekly periodicity. In addition, more pedestrians throng the walking street on weekends (4 July 2020, 5 July 2020, 11 July 2020, and 12 July 2020) than on weekdays.

To determine the optimal periodicity of the pedestrian flow, we further calculate the relationship between phase difference (lag) and autocorrelation coefficient. The results are shown in Figure 2f. The correlation is strongest at a cycle of one week, with a coefficient of 0.918. The pedestrian flow also shows a slight daily periodicity: the crowd follows a similar pattern from day to day when the difference between weekdays and weekends is ignored. The correlation within an hour is second only to the weekly periodicity, because the pedestrian flow has strong short-term temporal dependencies. Therefore, the input sequence length is set to 60 in our experiments, which means that we use the flow data of the past hour to predict the pedestrian flow in the next moments.
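The lag analysis described above can be illustrated with a minimal sketch. This is not the authors' code: the function name `autocorrelation`, the choice of the standard sample autocorrelation estimator, and the synthetic sine series are all assumptions made for illustration.

```python
import math


def autocorrelation(series, lag):
    """Sample autocorrelation coefficient of `series` at a positive `lag`."""
    n = len(series)
    if not 0 < lag < n:
        raise ValueError("lag must satisfy 0 < lag < len(series)")
    mean = sum(series) / n
    variance_sum = sum((x - mean) ** 2 for x in series)
    if variance_sum == 0:
        raise ValueError("autocorrelation is undefined for a constant series")
    # Covariance between the series and a copy of itself shifted by `lag`.
    shifted_sum = sum(
        (series[t] - mean) * (series[t + lag] - mean) for t in range(n - lag)
    )
    return shifted_sum / variance_sum


# Hypothetical toy signal: a sine wave that repeats every 24 steps.
PERIOD = 24
toy_series = [math.sin(2 * math.pi * t / PERIOD) for t in range(PERIOD * 20)]

r_full_period = autocorrelation(toy_series, PERIOD)       # strongly positive
r_half_period = autocorrelation(toy_series, PERIOD // 2)  # strongly negative
```

Scanning `lag` over candidate cycles (for 1 min data, 60 for an hour, 1440 for a day, 10080 for a week) and comparing the coefficients would reproduce the kind of comparison reported for Figure 2f.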

2.3. Methodology

The goal of pedestrian flow prediction is to forecast the pedestrian count at the next $T$ moments $(\hat{X}_{t+1}, \dots, \hat{X}_{t+T})$ given the data of the previous $T'$ moments $(X_{t-T'+1}, \dots, X_{t})$ collected from correlated detectors on the road network, as shown in Equation (1):

$$[(X_{t-T'+1}, \dots, X_{t}); A] \xrightarrow{f(\cdot)} [\hat{X}_{t+1}, \dots, \hat{X}_{t+T}] \tag{1}$$

where $A$ is a weighted adjacency matrix representing the road network structure, and $f(\cdot)$ is a mapping function from input to output to be learned.

2.3.1. The GCN Model for Pedestrian Flow Prediction

Before being fed into the GCN model, the pedestrian flow data were grouped into two sets, the feature set and the label set. The feature set is used to predict the pedestrian flow in the next moments, while the label set is reserved for validating the model results. In addition, we built the road network graph from the topological structure shown in Figure 1, using the road distance between each pair of nodes, where the nodes are the detectors in this study. The adjacency matrix is constructed using a thresholded Gaussian kernel weighting function [59,60,61]:

$$A_{i,j} = \exp\left(-\frac{[\mathrm{dist}(v_i, v_j)]^2}{\theta^2}\right) \tag{2}$$

$$A_{i,j} = \begin{cases} A_{i,j}, & A_{i,j} > \kappa \\ 0, & A_{i,j} \leq \kappa \end{cases} \tag{3}$$

where $A_{i,j}$ is the weight between detectors $v_i$ and $v_j$, $\mathrm{dist}(v_i, v_j)$ denotes the distance between detectors $v_i$ and $v_j$, $\theta$ represents the standard deviation of the distances, and $\kappa$ is the threshold that determines whether two detectors are connected.
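Equations (2) and (3) can be sketched in a few lines of plain Python. The sketch below is an illustration under stated assumptions, not the authors' implementation: the function name, the use of the population standard deviation over off-diagonal distances for θ, the zeroed diagonal (self-loops are added later through the identity matrix in Equation (8)), and the three-detector distance table are all hypothetical.

```python
import math


def gaussian_kernel_adjacency(dist, kappa):
    """Thresholded Gaussian kernel weights for a square distance matrix."""
    n = len(dist)
    # Assumption: theta is the population standard deviation of all
    # off-diagonal pairwise distances.
    pairs = [dist[i][j] for i in range(n) for j in range(n) if i != j]
    mean = sum(pairs) / len(pairs)
    theta = math.sqrt(sum((d - mean) ** 2 for d in pairs) / len(pairs))
    if theta == 0:
        raise ValueError("all distances are equal, so theta is zero")

    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # assumption: no self-loops at this stage
            weight = math.exp(-(dist[i][j] ** 2) / theta ** 2)  # Equation (2)
            adj[i][j] = weight if weight > kappa else 0.0        # Equation (3)
    return adj


# Hypothetical road distances in meters between three detectors.
toy_dist = [
    [0.0, 100.0, 400.0],
    [100.0, 0.0, 300.0],
    [400.0, 300.0, 0.0],
]
toy_adj = gaussian_kernel_adjacency(toy_dist, kappa=0.1)
```

With these toy distances only the closest pair (detectors 0 and 1) keeps a non-zero weight, which mirrors how the threshold removes links between detectors that are far apart.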

We use the adjacency matrix $A$ to represent the graph. The normalized graph Laplacian matrix $L^{sym}$ is defined as:

$$L^{sym} = D^{-\frac{1}{2}} L D^{-\frac{1}{2}} = I_N - D^{-\frac{1}{2}} A D^{-\frac{1}{2}} \tag{4}$$

where $L = D - A$ is the Laplacian matrix, $I_N$ is the identity matrix, and $D$ is the diagonal degree matrix with $D_{ii} = \sum_j A_{ij}$. The adjacency matrix is symmetrically normalized by the inverse square root of the degree matrix ($D^{-\frac{1}{2}} A D^{-\frac{1}{2}}$).
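As a concrete illustration of the normalized Laplacian in Equation (4), the following sketch computes it for a small graph. The function name and the three-node path graph are hypothetical, and treating isolated nodes (degree zero) with a zero scaling factor is an assumption made only to avoid division by zero.

```python
import math


def normalized_laplacian(adj):
    """Return I - D^{-1/2} A D^{-1/2} for a symmetric weighted adjacency matrix."""
    n = len(adj)
    degree = [sum(row) for row in adj]
    # Assumption: an isolated node contributes no normalized edge weight.
    inv_sqrt = [1.0 / math.sqrt(d) if d > 0 else 0.0 for d in degree]
    return [
        [
            (1.0 if i == j else 0.0) - inv_sqrt[i] * adj[i][j] * inv_sqrt[j]
            for j in range(n)
        ]
        for i in range(n)
    ]


# Hypothetical path graph 0 - 1 - 2 with unit weights.
toy_adj = [
    [0.0, 1.0, 0.0],
    [1.0, 0.0, 1.0],
    [0.0, 1.0, 0.0],
]
toy_laplacian = normalized_laplacian(toy_adj)
```

For a connected graph, the vector of square-rooted degrees lies in the null space of this matrix, which is a convenient sanity check on any implementation.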

The graph neural network was presented by Gori et al. [62] as an extension of the recursive neural network that captures neighboring spatial correlation; it needs to compute the eigenvectors of the Laplacian by repeated iteration until the nodes’ features converge to stable values. Hammond et al. [63] designed a framework with fast localized convolutions. This network captures the nodes’ features in the $k$th-order neighborhood by stacking multiple graph convolutional layers. The polynomial parametrization for localized filters can be written as:

$$g_{\theta} = g_{\theta}(\Lambda) \approx \sum_{k=0}^{K-1} \theta_{k} T_{k}(\tilde{\Lambda}) \tag{5}$$

$$\tilde{\Lambda} = \frac{2\Lambda}{\lambda_{max}} - I_{N} \tag{6}$$

where $g_{\theta}$ is a filter in the Fourier domain, $\theta_{k}$ are the polynomial coefficients, $\Lambda$ is the diagonal matrix of eigenvalues of the Laplacian matrix, and $\lambda_{max}$ represents the largest of these eigenvalues.

Defferrard et al. [64] further used Chebyshev polynomials to reduce the computational complexity, an approach named Chebyshev Graph Convolution. The Chebyshev polynomial $T_{k}(x)$ of order $k$ is defined recursively as follows:

$$T_{k}(x) = 2x T_{k-1}(x) - T_{k-2}(x) \tag{7}$$

where $T_{0}(x) = 1$ and $T_{1}(x) = x$.
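The recurrence in Equation (7), together with the eigenvalue rescaling of Equation (6) and the truncated expansion of Equation (5), can be sketched for a single scalar eigenvalue. The function names and the example coefficients are hypothetical; a real Chebyshev graph convolution applies the same recurrence to matrices rather than scalars.

```python
def chebyshev(k, x):
    """Evaluate the Chebyshev polynomial T_k(x) with the recurrence of Equation (7)."""
    if k < 0:
        raise ValueError("order k must be non-negative")
    t_prev, t_curr = 1.0, x  # T_0(x) and T_1(x)
    if k == 0:
        return t_prev
    for _ in range(2, k + 1):
        t_prev, t_curr = t_curr, 2 * x * t_curr - t_prev
    return t_curr


def rescale_eigenvalue(lam, lam_max):
    """Scalar form of Equation (6): map [0, lam_max] onto [-1, 1]."""
    return 2.0 * lam / lam_max - 1.0


def filter_response(lam, lam_max, thetas):
    """Scalar form of Equation (5): sum_k theta_k * T_k(rescaled eigenvalue)."""
    x = rescale_eigenvalue(lam, lam_max)
    return sum(theta * chebyshev(k, x) for k, theta in enumerate(thetas))


# Hypothetical coefficients for a filter with K = 3 terms.
example_response = filter_response(lam=1.5, lam_max=2.0, thetas=[0.5, 0.25, 0.25])
```

Here the rescaled eigenvalue is 0.5, so the response is 0.5·1 + 0.25·0.5 + 0.25·(−0.5) = 0.5.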

Kipf and Welling [65] simplified the Chebyshev Graph Convolution by limiting the layer-wise convolution operation to $k = 1$, which means that the central node is determined only by the nodes in its 1st-order neighborhood. This is the prototype of the graph convolutional network (GCN) model. The simple form of layer-wise propagation of the GCN model is shown in Equation (8):

$$H^{(l+1)} = \sigma\left(\tilde{D}^{-\frac{1}{2}} \tilde{A} \tilde{D}^{-\frac{1}{2}} H^{(l)} W^{(l)}\right) \tag{8}$$

where $\tilde{A} = A + I_{N}$ represents the graph structure, calculated by adding the identity matrix $I_{N}$ to the adjacency matrix $A$ so that each layer propagates both a node’s own features and the features of its neighbor nodes. $\tilde{D}$ is the degree matrix with $\tilde{D}_{ii} = \sum_{j} \tilde{A}_{ij}$, and the adjacency matrix is symmetrically normalized by the inverse square root of the degree matrix ($\tilde{D}^{-\frac{1}{2}} \tilde{A} \tilde{D}^{-\frac{1}{2}}$). $H^{(l)}$ represents the output matrix of the $l$th layer, and the initial layer is set to $H^{(0)} = X$. $W^{(l)}$ denotes the trainable weight matrix of the $l$th layer. $\sigma(\cdot)$ is an activation function such as the rectified linear unit (ReLU). A feature matrix $X$ and an adjacency matrix $A$ are fed to the GCN as input, and the model computes the node features through multiple hidden layers.

The segmentation of the pedestrian flow data and the architecture of the GCN model are depicted in Figure 3. The pedestrian flow data of all 25 detectors at time $t$ are denoted by $X_{t} = (x_{t}^{1}, x_{t}^{2}, \dots, x_{t}^{25})$, $t = 1, 2, \dots, N-1, N$, where $N$ is the length of the observed time interval. The data of $T'$ consecutive moments were grouped as the feature, and the data of the following $T$ moments as the label, in a continuous manner. The feature sets and the adjacency matrix were then input into the GCN architecture. The model performance can be evaluated by comparing the label sets with the model output, denoted as the prediction sets.
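The grouping of consecutive moments into feature and label sets can be sketched as a sliding window. The function name, the stride of one time step, and the toy snapshots below are assumptions for illustration; the text only states that the grouping is done continuously.

```python
def make_windows(snapshots, n_in, n_out):
    """Split time-ordered snapshots into (feature, label) pairs.

    Each feature holds `n_in` consecutive snapshots and each label holds
    the `n_out` snapshots that immediately follow it.
    """
    if n_in <= 0 or n_out <= 0:
        raise ValueError("window lengths must be positive")
    pairs = []
    for start in range(len(snapshots) - n_in - n_out + 1):
        feature = snapshots[start:start + n_in]
        label = snapshots[start + n_in:start + n_in + n_out]
        pairs.append((feature, label))
    return pairs


# Hypothetical data: 10 time steps, 3 detectors per snapshot.
toy_snapshots = [[t, t + 100, t + 200] for t in range(10)]
toy_pairs = make_windows(toy_snapshots, n_in=4, n_out=2)
first_feature, first_label = toy_pairs[0]
```

In the setting described in this paper, `n_in` would be 60 (one hour of 1 min data) and each snapshot would hold the counts of the 25 detectors.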

In our experiment, we stack two convolutional layers for pedestrian flow prediction given a weighted adjacency matrix $A$, as shown in Equation (9):

$$f(X, A) = \mathrm{ReLU}\left(\hat{A}\, \mathrm{ReLU}\left(\hat{A} X W^{(0)}\right) W^{(1)}\right) \tag{9}$$

where $\hat{A} = \tilde{D}^{-\frac{1}{2}} \tilde{A} \tilde{D}^{-\frac{1}{2}}$, and $\mathrm{ReLU}(\cdot) = \max(0, \cdot)$ denotes the activation function. $W^{(0)} \in \mathbb{R}^{T' \times H}$ is an input-to-hidden weight matrix, and $W^{(1)} \in \mathbb{R}^{H \times T}$ is a hidden-to-output weight matrix.
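The forward pass of Equation (9) can be sketched with plain Python lists so that the example has no dependencies. This is an illustrative sketch only: the helper names are hypothetical, the weights are random rather than trained, and the toy sizes (3 detectors, 4 input steps, 2 hidden units, 1 output step) stand in for the 25 detectors, 60 input steps and 32 hidden units reported in this paper.

```python
import math
import random


def matmul(a, b):
    """Multiply an n x m matrix by an m x p matrix (lists of lists)."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def relu(matrix):
    """Element-wise max(0, x)."""
    return [[max(0.0, value) for value in row] for row in matrix]


def normalize_adjacency(adj):
    """A_hat = D~^{-1/2} (A + I) D~^{-1/2}, assuming non-negative weights."""
    n = len(adj)
    a_tilde = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    degree = [sum(row) for row in a_tilde]  # at least 1 because of the self-loop
    return [
        [a_tilde[i][j] / math.sqrt(degree[i] * degree[j]) for j in range(n)]
        for i in range(n)
    ]


def gcn_forward(x, adj, w0, w1):
    """Two-layer GCN: ReLU(A_hat ReLU(A_hat X W0) W1)."""
    a_hat = normalize_adjacency(adj)
    hidden = relu(matmul(matmul(a_hat, x), w0))
    return relu(matmul(matmul(a_hat, hidden), w1))


# Hypothetical toy problem: 3 detectors, 4 input steps, 2 hidden units, 1 output step.
random.seed(0)
toy_x = [[random.random() for _ in range(4)] for _ in range(3)]
toy_adj = [[0.0, 0.6, 0.0], [0.6, 0.0, 0.3], [0.0, 0.3, 0.0]]
toy_w0 = [[random.uniform(-1, 1) for _ in range(2)] for _ in range(4)]
toy_w1 = [[random.uniform(-1, 1) for _ in range(1)] for _ in range(2)]
toy_out = gcn_forward(toy_x, toy_adj, toy_w0, toy_w1)  # shape: 3 detectors x 1 step
```

Note that the final ReLU keeps every predicted count non-negative, which suits density data but also means an untrained network can output exact zeros.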

The model is trained with the Adam optimizer. The mean square error function is employed as the loss function of the model for predicting pedestrian flow in open public places, which is formulated (up to the constant factor $1/N$) as:

$$Loss = \sum_{i=1}^{N} (y_{i} - \hat{y}_{i})^{2} \tag{10}$$

2.3.2. Evaluation Metrics

Three metrics are selected as basic measures to evaluate the model performance.

(1) Root mean square error:

$$RMSE = \sqrt{\frac{1}{N} \sum_{i=1}^{N} (y_{i} - \hat{y}_{i})^{2}} \tag{11}$$

(2) Mean absolute error:

$$MAE = \frac{1}{N} \sum_{i=1}^{N} |y_{i} - \hat{y}_{i}| \tag{12}$$

(3) Coefficient of determination:

$$R^{2} = 1 - \frac{\sum_{i=1}^{N} (y_{i} - \hat{y}_{i})^{2}}{\sum_{i=1}^{N} (y_{i} - \bar{y})^{2}} \tag{13}$$

where $Y = (y_{1}, \dots, y_{N})$ and $\hat{Y} = (\hat{y}_{1}, \dots, \hat{y}_{N})$ are the ground-truth and predicted pedestrian flow, respectively, $\bar{y}$ is the average of $Y$, and $N$ represents the number of samples for prediction. RMSE and MAE measure the difference between the ground-truth and the predicted pedestrian flow. The smaller the values of RMSE and MAE, the closer the predicted values are to the true ones. R² measures how well the collected data are explained by the forecasting model. The larger the value of R², the better the prediction.
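The three metrics in Equations (11) to (13) translate directly into code. The sketch below uses plain Python and hypothetical function names and toy values; it is meant only to make the definitions concrete.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error, Equation (11)."""
    n = len(y_true)
    return math.sqrt(sum((y - p) ** 2 for y, p in zip(y_true, y_pred)) / n)


def mae(y_true, y_pred):
    """Mean absolute error, Equation (12)."""
    n = len(y_true)
    return sum(abs(y - p) for y, p in zip(y_true, y_pred)) / n


def r_squared(y_true, y_pred):
    """Coefficient of determination, Equation (13)."""
    mean = sum(y_true) / len(y_true)
    residual = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    total = sum((y - mean) ** 2 for y in y_true)
    if total == 0:
        raise ValueError("R^2 is undefined when the ground truth is constant")
    return 1.0 - residual / total


# Hypothetical ground truth and predictions.
toy_truth = [1.0, 2.0, 3.0, 4.0]
toy_pred = [1.0, 2.0, 3.0, 6.0]
toy_scores = (rmse(toy_truth, toy_pred), mae(toy_truth, toy_pred), r_squared(toy_truth, toy_pred))
```

For these toy values a single error of 2 gives an RMSE of 1.0, an MAE of 0.5 and an R² of 0.2, which shows how RMSE penalizes one large miss more heavily than MAE does.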

In our experiment, 80% of the data were used for training, and 20% were reserved for testing and validation. The descriptive statistics of the two sub-groups, namely the training and testing data, are summarized in Table 2.

The mean values of the training and testing data are 6.179 and 6.138, respectively, while the standard deviations are 9.725 and 9.648, respectively. These values are almost the same. The minimum, lower quartile and median values also differ little, and the upper quartile and maximum values of the testing data are greater than those of the training data. From the mean and median values, we can see that the collected data are concentrated in the low-value region. The differences in the high-value indices, namely the upper quartile and maximum, are not pronounced. Therefore, we consider that there is no obvious difference between the training and testing data, indicating that the available data were sampled randomly. The model therefore avoids losing information when it is trained on only a subset of the pedestrian flow data, which supports the robustness of the model.

We use the prediction loss (Equation (10)) to measure the precision of the model. To illustrate the efficiency of the model, we also use the computational time, i.e., the run time required for 1000 epochs, to evaluate the model performance. All of our experiments were run in a computing environment with an Intel(R) Core(TM) i7 CPU 870 @ 2.93 GHz, 8 GB RAM and an NVIDIA GeForce GT 420 (1024 MB) GPU. The operating system and software platform were Windows 10 and Python 3.6. All the models were implemented with open source frameworks, including scikit-learn (version 0.23.2), Keras (version 2.1.5) and torch (version 1.5.0).

3. Results

3.1. Hyperparameter Settings of the GCN Model

The input and output sequence lengths were set to 60 min and 1 min, respectively, and the number of hidden units was 32. To compute the adjacency matrix and identify the effect of $\kappa$ (Equation (3)) on the GCN model, we designed validation experiments for estimation accuracy with $\kappa \in \{0, 0.05, 0.1, 0.15, 0.2, 0.25\}$ [66], and the results are summarized in Figure 4. The horizontal axis denotes the value of $\kappa$ and the vertical axis shows the value of the different metrics.

Figure 5a shows the changes in prediction loss and run time on the training set. We find that $\kappa$ does not significantly influence the computational time of the model: the training time fluctuates within a narrow range between 48 min and 50 min under the different thresholds. The prediction loss, unlike the run time, changes significantly. When $\kappa$ is small, the training loss gradually decreases as the threshold increases. When the value of $\kappa$ increases to 0.1, the sharp increase in prediction loss outweighs the slight decrease in run time. Figure 5b presents the changes of the evaluation metrics under different values of $\kappa$ on the training set. The values of RMSE clearly increase when the threshold is larger than 0.15. There are slight oscillations in MAE, and the values are all about 1.555. The threshold does not significantly influence the R² of the estimated pedestrian flow. Figure 5c shows the changes in prediction loss and computational time on the test set. The run time fluctuates only slightly under the different thresholds. The prediction loss behaves very differently from the computational time: when the threshold is larger than 0.15, the test loss increases sharply as $\kappa$ grows, indicating a decline in model performance. Figure 5d depicts the changes of the metrics on the test set, which show approximately the same trend as on the training set.

According to the findings in Figure 5, the accuracy of the model can be improved by a moderate adjustment of the parameter $\kappa$. More specifically, when $\kappa$ is set to 0, the adjacency matrix is computed from all neighboring detectors, and the prediction accuracy is constrained by the repetitive computation. As the threshold increases, some nodes located far away from the center are excluded, which improves prediction efficiency and performance. Nonetheless, when $\kappa$ is larger than 0.1, detectors with strong relationships are also ignored, so the model performance starts to decrease. Therefore, the results are optimal when $\kappa$ is 0.1.

Table 3 shows the adjacency matrix of the 25 detectors, along with the road distances used to calculate the strength of the connection between each pair of nodes. Strong correlations are obtained when two detectors are close to each other, such as detectors 2 and 3. When the connection is weaker than the threshold of 0.1, the corresponding entry of the adjacency matrix is set to 0 because the detectors are too far apart, such as detectors 4 and 8.

3.2. Comparative Experiments of Different Models

3.2.1. Baseline Models

Six baseline models selected from the three kinds of existing models are used for pedestrian flow prediction and compared with the standard GCN model.

(1): Historical average (HA) [12]: It simply uses the average of previous periods as the prediction.
(2): Autoregressive integrated moving average (ARIMA) [14]: It predicts the future trend of time series data.
(3): Support vector machine (SVM) [21]: It uses a kernel function for the prediction task.
(4): Convolutional neural network (CNN) [29]: It processes traffic data arranged in a grid structure.
(5): Long short-term memory (LSTM) [34]: It is a recurrent neural network (RNN) based model that captures temporal dependencies for traffic prediction.
(6): Spatio-temporal graph convolutional networks (STGCN) [40]: It is a deep learning framework for traffic forecasting that formulates the problem on graphs and builds the model with complete convolutional structures.
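To make the simplest baseline in the list above concrete, the historical average can be sketched as the mean of the most recent observations. The function name and the window length are assumptions; the text does not specify how many previous periods the HA baseline averages.

```python
def historical_average(history, window):
    """Predict the next value as the mean of the last `window` observations."""
    if window <= 0 or window > len(history):
        raise ValueError("window must be between 1 and len(history)")
    recent = history[-window:]
    return sum(recent) / len(recent)


# Hypothetical crowd densities (pedestrians per 100 square meters).
toy_history = [2.0, 4.0, 6.0, 8.0]
toy_forecast = historical_average(toy_history, window=2)
```

Because the forecast ignores both the graph structure and any trend inside the window, it serves as a lower reference point for the learned models.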

3.2.2. Experimental Results

We conduct a quantitative evaluation of the GCN and the six baseline models on the collected pedestrian flow data and compare the metric results to validate the performance of the model. We choose the output sequence length from {1, 5, 10, 15, 20, 25, 30} min [57,66,67,68,69], and keep the other configurations identical to the settings in the previous section for the comparative experiments. The results are shown in Table 4.

The best performance corresponds to the lowest prediction error (RMSE and MAE) and the highest prediction precision (R²). The STGCN and GCN models consistently and significantly outperform the other baseline models in terms of the three evaluation metrics for all output sequence lengths, and the prediction error increases and the effectiveness decreases as the output sequence length grows. For 1 min pedestrian flow prediction, the HA model has the poorest results, with an RMSE of 3.47 and an R² of 0.872, showing that it is not good at capturing the pedestrian volume over a short time. When the output sequence length increases, the HA model outperforms the ARIMA and SVM models. It is worth pointing out that the SVM model performs satisfactorily for short sequence lengths; for example, its R² is 0.887 at 1 min and 0.874 at 5 min. However, its performance drops rapidly as the prediction length grows, mainly because it has difficulty capturing irregular and long-term data. In particular, its R² is lower than 0.8 when the output sequence length is larger than 20 min. The ARIMA, CNN and LSTM models perform the crowd count prediction task with relatively good results, and the deep learning-based models, CNN and LSTM, slightly outperform the ARIMA model owing to their high efficiency and interpretability. The GCN model and its extension achieve the best performance and keep the value of R² above 0.9 at a 30 min output length, which demonstrates their superiority in capturing the pedestrian characteristics of road network structures.

The GCN and STGCN models can effectively utilize the graph structure to obtain accurate results. Regarding operational efficiency, we also compare the computation time of these two models for the 1 min prediction task. The standard GCN model consumes only 48.995 min, while STGCN takes 92.047 min. The nearly twofold speed-up arises because the GCN relies on graph features alone rather than on the two spatio-temporal convolutional blocks of STGCN. Considering both prediction accuracy and computational efficiency, we selected the GCN model as the more suitable one for forecasting the pedestrian flow in this study.
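As a quick check of the speed-up implied by the two reported run times (simple arithmetic on the figures quoted above, with $t_{GCN}$ and $t_{STGCN}$ introduced here only as shorthand for the two run times):

$$\frac{t_{STGCN}}{t_{GCN}} = \frac{92.047}{48.995} \approx 1.88, \qquad \frac{t_{STGCN} - t_{GCN}}{t_{STGCN}} = \frac{43.052}{92.047} \approx 0.468$$

That is, the GCN needs roughly 47% less run time than STGCN for the 1 min prediction task, which is close to, but slightly below, a twofold speed-up.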

4. Discussion

4.1. The Advantage of the GCN Model for Pedestrian Flow Prediction

Pedestrian flow naturally resides in the topological structure of the road network, and the network structure may fundamentally influence the distribution patterns of the crowd. The pedestrian flow on a road segment is geographically associated with the roads connected to it. A road network-based prediction task is determined not only by the features of a road segment, but also by the features of its neighbors. Normally, the stronger the relationship between road junctions, the more people move from one junction to another. The distances between pairs of detectors were employed to describe this relationship, since a crowd is most likely to move towards its neighborhood rather than to faraway places. The existing methods in pedestrian flow prediction lose sight of the topological structure of the road. Compared with methods that assume a grid-structured convolution filter, the GCN model is able to capture the graph structure. The model, in which the detectors are regarded as nodes and the edges represent the relationships in the road network, can capture the pedestrian flow characteristics hidden in the topological structure. To identify the influence of the road network structure, we built the adjacency matrix by assigning different weights to road segments. The effectiveness of the GCN model in capturing the spatial dependencies was validated experimentally in terms of three evaluation metrics. The lower error and higher accuracy of the model show that proper integration of the road topology can considerably improve the pedestrian flow prediction precision in real-world applications. To further illustrate the better and more robust performance of the GCN model, we conducted comparative experiments of pedestrian flow prediction between weekdays and weekends, and between different hours of the day, to capture the temporal dependencies. We also analyzed the effect of weather conditions on the pedestrian features.

4.1.1. Comparison between Weekdays and Weekends

As presented in Figure 2, the pedestrian flow has a marked weekly periodicity: the crowd count on weekends is larger than on weekdays. In this section, we measure the performance of the GCN model on weekends and weekdays separately. To give a better picture of the prediction performance, we take the first 200 rows of the dataset and plot the ground-truth and predicted crowd counts, as shown in Figure 5.

The values of $R^{2}$ on weekends and weekdays are all above 0.93, supporting the effectiveness of the GCN model for pedestrian flow forecasting. Moreover, the weekend dataset achieves a higher $R^{2}$ than the weekday dataset. Since the pedestrian flow on weekends is much heavier than on weekdays, we can conclude that the GCN model performs better under larger flow volumes. RMSE and MAE move in the opposite direction to $R^{2}$: the weekend dataset has the higher error. This is mainly because RMSE and MAE are absolute error metrics, so the greater crowd count on weekends yields higher values of both. The visualization shows that the GCN model accurately forecasts the pedestrian flow at multiple local peaks and troughs and closely follows the variation trend of the ground-truth data.
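The contrast between absolute and relative metrics can be made concrete with a small sketch. It assumes the usual definitions of RMSE, MAE and $R^{2}$ (one minus the ratio of residual to total sum of squares); the toy series below are illustrative values, not data from the study.

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Toy series (not data from the study).
truth = [2.0, 4.0, 6.0, 8.0]
pred = [2.5, 3.5, 6.5, 7.0]

# The same series with a threefold heavier flow and proportionally larger errors.
truth_x3 = [3 * v for v in truth]
pred_x3 = [3 * v for v in pred]

print("RMSE:", rmse(truth, pred), rmse(truth_x3, pred_x3))
print("MAE: ", mae(truth, pred), mae(truth_x3, pred_x3))
print("R2:  ", r2(truth, pred), r2(truth_x3, pred_x3))
```

Scaling the flow by a constant factor scales RMSE and MAE by the same factor but leaves $R^{2}$ unchanged, which is why the absolute metrics alone cannot be compared across periods with different crowd volumes.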

4.1.2. Comparison between Different Hours of the Day

The crowd in an open public place changes over the day, with high crowd counts in the afternoon and evening and low counts around midnight. To evaluate how high or low pedestrian flow influences model performance, we compare the evaluation metrics for different hours of the day; the results are listed in Table 5.

From the table, we find that RMSE and MAE are quite small in the early morning (00:00–05:59) and relatively large in the afternoon. Notably, the prediction accuracy ($R^{2}$) in each time interval does not differ significantly between weekdays and weekends. Specifically, on both weekends and weekdays, the model does not achieve satisfactory precision from 06:00 to 08:59. The values of $R^{2}$ during the evening peak (16:00–21:59) are all above 0.9, indicating that the GCN model can predict the pedestrian flow accurately when there are massive numbers of travelers, which helps road managers prevent or alleviate pedestrian congestion.

As previously mentioned, the crowd count has a significant impact on RMSE and MAE. We depict this relationship in Figure 6 using scatter plots and linear regression. A larger crowd throngs the walking street in the afternoon and evening on weekends: the peak number of pedestrians per 100 square meters is about 15 on weekdays and above 20 on weekends. On weekdays, the maximum RMSE and MAE occur from 17:00 to 17:59, with values of 4.452 and 2.727, respectively. On weekends, the peak error occurs from 15:00 to 17:59 and is larger than on weekdays. More importantly, the slopes of the linear regression equations are positive, indicating that the prediction error is positively correlated with crowd count. The intercept on weekends is larger than on weekdays, reflecting that the heavier pedestrian flow on weekends could lead to a higher prediction error. The goodness of fit of the four regression equations is 0.9628, 0.9551, 0.9517 and 0.9506, respectively, which shows that RMSE and MAE are strongly correlated with crowd count.
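The regression analysis above can be sketched with ordinary least squares. This is a minimal illustration, assuming a simple one-variable linear fit; the helper `fit_line` is hypothetical, and the input values are placeholders for illustration only, not the hourly values behind Figure 6.

```python
def fit_line(x, y):
    """Ordinary least squares fit y = slope * x + intercept.

    Returns (slope, intercept, goodness_of_fit), where the goodness of fit
    is the coefficient of determination of the fitted line.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot

# Placeholder values for illustration only (not data from the study):
# mean crowd count per hour and the RMSE obtained in that hour.
mean_count = [1.0, 3.0, 6.0, 10.0, 15.0]
hourly_rmse = [0.6, 1.1, 1.9, 3.0, 4.2]

slope, intercept, fit = fit_line(mean_count, hourly_rmse)
print(f"slope={slope:.4f}, intercept={intercept:.4f}, goodness of fit={fit:.4f}")
```

A positive slope with a goodness of fit close to 1 is the pattern the text reports: the error grows roughly linearly with the mean crowd count.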

Therefore, the relative metric ($R^{2}$) is adopted to measure model effectiveness for different hours. We further investigate why the model performance differs across hours by examining the interval distribution of the datasets. The datasets with $R^{2}$ below 0.8 (highlighted in bold) are selected as poor performance sets, and the remaining datasets, with $R^{2}$ of 0.8 or more, are grouped as good performance sets. Based on the distribution of the collected ground-truth data, we partition the crowd count into intervals of width 5 and calculate the proportion of the flow falling in each interval. The resulting distributions are shown in Figure 7.

From the figure, it can be seen that the good performance sets ($R^{2} \geq 0.8$) have a smaller proportion of samples in the interval from 0 to 5, where the pedestrian flow is quite low. On weekdays, about 60% of the crowd counts in the good performance sets are below 5, whereas the share in the poor performance sets is above 90%. The contrast is even more pronounced on weekends, where the two shares are 55% and 93%, respectively. More importantly, the poor performance sets reach the 99th percentile within the first two intervals on both weekdays and weekends; that is, only 1 percent of the crowd counts are larger than 10. In the good performance sets, the first two intervals account for 79% and 72% of the counts on weekdays and weekends, respectively, and the 99th percentile is reached only when the number of pedestrians per 100 square meters is above 50. We conclude that the GCN model achieves better predictions when the crowd count is high, which demonstrates its effectiveness in finding pedestrian flow peaks.
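The interval statistics discussed above can be computed as in the following sketch. It assumes half-open intervals of width 5, that is [0, 5), [5, 10) and so on; the function names and the toy counts are illustrative and are not taken from the study.

```python
def interval_proportions(counts, width=5):
    """Share of samples in each half-open interval [k*width, (k+1)*width)."""
    if not counts:
        return []
    n_bins = int(max(counts) // width) + 1
    tallies = [0] * n_bins
    for c in counts:
        tallies[int(c // width)] += 1
    return [t / len(counts) for t in tallies]

def cumulative(shares):
    """Running total of the interval shares."""
    total, out = 0.0, []
    for s in shares:
        total += s
        out.append(total)
    return out

# Toy crowd counts (pedestrians per 100 square meters), not study data.
counts = [0.5, 1.2, 3.0, 4.9, 5.0, 7.5, 9.9, 12.0, 18.0, 26.0]

shares = interval_proportions(counts)
running = cumulative(shares)
print("share per interval:", shares)
print("cumulative share:  ", [round(v, 2) for v in running])
```

The cumulative share of the first two intervals corresponds to the fraction of crowd counts below 10, the quantity used above to compare the good and poor performance sets.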

4.1.3. Comparison between Different Weather Conditions

Pedestrian flow is known to fluctuate with the weather. In this section, we therefore apply a sensitivity analysis using weather data for the study period to validate the model's effectiveness. We used daily weather and temperature data from the Meteorological Bureau of Shenzhen Municipality (http://weather.sz.gov.cn/ (accessed on 1 March 2021)). The temperature and weather conditions are summarized in Figure 8.

The temperature in the study area was stable during the third quarter (1 July 2020 to 30 September 2020). Overall, the daily highest and lowest temperatures were above 30 and 25 degrees Celsius, respectively, in most cases. From the statistical table, cloudy weather accounted for more than half (50/92) of the study period, and the temperature differed little across weather conditions. Cloudy days attract more visitors than sunny days, mainly because Shenzhen has a subtropical climate: cloudy days are suitable for walking along the street, while sunny days are too hot for sightseeing. Since the weather is recorded once per day, we embed the weather features into the pedestrian flow data. The time interval in our experiment is 1 min, so the 1440 intervals of a day share that day's weather record. We then conduct comparative experiments under different weather conditions; the results are shown in Table 6.
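Sharing one daily weather record across the 1440 one-minute intervals of that day can be sketched as follows. This is a minimal illustration; the helper names and the two weather labels are hypothetical and do not reproduce the actual records.

```python
from datetime import date, datetime, timedelta

def minute_timestamps(day):
    """Return the 1440 one-minute timestamps of the given date."""
    start = datetime(day.year, day.month, day.day)
    return [start + timedelta(minutes=m) for m in range(1440)]

def attach_daily_weather(timestamps, daily_weather):
    """Pair every timestamp with the weather label recorded for its date."""
    return [(ts, daily_weather[ts.date()]) for ts in timestamps]

# Hypothetical labels for two days (not the actual records).
daily_weather = {
    date(2020, 7, 1): "cloudy",
    date(2020, 7, 2): "sunny",
}

stamps = minute_timestamps(date(2020, 7, 1)) + minute_timestamps(date(2020, 7, 2))
labelled = attach_daily_weather(stamps, daily_weather)

print(labelled[0])     # first minute of 1 July
print(labelled[1439])  # last minute of 1 July, same label
print(labelled[1440])  # first minute of 2 July, next day's label
```

Because the label changes only at midnight, the weather feature is constant within a day and can only explain differences between days, not fluctuations within them.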

Taking the mean crowd count in Figure 8 into consideration, we found that all three metrics are positively correlated with the mean value. Specifically, heavy pedestrian flow may result in higher prediction error (RMSE and MAE), while the model attains greater prediction precision ($R^{2}$) when the flow is heavier. The model achieves the best accuracy on cloudy days, with an $R^{2}$ of 0.947, higher than under other weather conditions. $R^{2}$ is lowest on overcast days, but its value is still above 0.92. This validates the model's effectiveness in predicting pedestrian flow in open public places, especially when a large crowd throngs the walking street.

4.2. The Limitation and Prospects of the Study

This study has some limitations. The primary concern is that we use the number of pedestrians within the monitoring area. Because the detectors are installed on roadside poles, the monitoring area may be blocked by buildings or foliage, resulting in an underestimation of the crowd count. In addition, we deployed 25 detectors to monitor pedestrians in the walking street, covering the main roads and crossroads but ignoring the crowd in the side streets. Denser spatial sampling would capture pedestrian features more comprehensively. Finally, there are three dominant indices in the field of pedestrian traffic, i.e., density, speed and direction. This study concentrates on the crowd count within a defined area; considering the relationship between the three indices would help improve prediction precision.

Future work can go in three directions. First, unlike vehicle flow, which follows specified roadways, pedestrians can move freely and change direction at any time. It would be meaningful to incorporate features of pedestrian movement, in addition to the distances between pairs of detectors, when constructing the adjacency matrix. Second, the road topology is represented as a static graph in this study. The model could be improved by exploring dynamic graphs based on time-varying matrices, further enhancing its accuracy and robustness for crowd count prediction. Finally, we applied a standard GCN model in this study. Many GCN extensions have been proposed to improve prediction accuracy, and it would be worthwhile to apply these state-of-the-art methods in practice once their model complexity is reduced.

5. Conclusions

In this paper, we have introduced the GCN model to predict pedestrian flow in open public places. In contrast to methods based on traditional grid matrices, the model forecasts the crowd count from the spatial topology of the road network, with a graph constructed to describe the relationships among detectors. Experimental results show that the GCN model and its extension STGCN consistently and significantly outperform five baseline models, namely HA, ARIMA, SVM, CNN and LSTM. For 1, 5, 10, 15, 20, 25 and 30 min pedestrian flow prediction, the $R^{2}$ values of the GCN model are 0.937, 0.928, 0.921, 0.916, 0.911, 0.906 and 0.902, respectively. We further analyze the sensitivity of the GCN model in pedestrian flow prediction. The model obtains higher prediction precision on weekends, and its $R^{2}$ during the evening peak is above 0.9, demonstrating the strength of the model, especially when a large crowd throngs the walking street.

The proliferation of data mining technologies creates unprecedented opportunities to better understand crowd distribution patterns from collected data. Accurate predictions help road managers take flexible and effective measures to meet the security management requirements of open public places. More specifically, massive crowds behave like shock waves, and people may be crushed by the high pressure that builds up in the crowd. The elderly, children and women are especially at risk, as they tend to scream, which spreads psychological unease, and may even lose their balance and fall. Managers can shield these vulnerable individuals or divert them to the open square in advance to prevent stampedes. Besides, people often choose routes according to their own preferences without knowing the overall situation in the walking street during heavy crowding. Regulators can release road capacity information through public display screens and broadcast facilities, which is effective in avoiding crowd gathering caused by scarce or distorted information. Such information also has the potential to help pedestrians choose appropriate travel routes and reduce travel time.

Author Contributions

Menghang Liu and Luning Li designed and performed the experiments, analyzed the data and drafted the manuscript; Qiang Li and Cheng Hu designed the experiments and supervised the data analysis; Yu Bai assisted with adjusting the model. All authors contributed to the writing and revision of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 41977408.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

Figure 1. The geographical distribution of the equipped detectors in Dongmen walking street, Shenzhen, China. (The basemap is collected from Apple maps).

Figure 2. (a–d) Pedestrian flow patterns of some typical detectors at the junctions (i.e., detectors 6, 8, 13 and 16); (e) the mean pedestrian flow pattern of the study area; (f) the autocorrelation coefficient under various phase differences.

Figure 3. The framework of the GCN model for pedestrian flow prediction.

Figure 4. Comparison of model performance under different thresholds on the train and test sets. (a) Prediction loss and computational time on the train set; (b) Three evaluation metrics on the train set; (c) Prediction loss and computational time on the test set; (d) Three evaluation metrics on the test set.

Figure 5. The GCN model performance and the visualization results of the ground-truth and predicted data on (a) weekdays; (b) weekends.

Figure 6. The relationship between mean crowd count and (a) RMSE on weekdays; (b) MAE on weekdays; (c) RMSE on weekends; (d) MAE on weekends.

Figure 7. The interval distribution of datasets on (a) weekdays; (b) weekends.

Figure 8. The summary of the temperature and weather conditions.

Table 1. Examples of pedestrian flow data.

Capture Time	Detectors
Capture Time	6	8	16	21	24
1 July 2020 00:00	6	3	12	5	5
1 July 2020 00:01	5	2	5	5	1
1 July 2020 00:02	6	1	4	9	4
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
1 July 2020 23:59	8	1	4	4	3
2 July 2020 00:00	9	2	5	4	5
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
30 September 2020 23:58	6	11	4	8	5
30 September 2020 23:59	7	9	3	7	4

Table 2. Descriptive analysis of the sample data.

Index	Training Data	Testing Data
mean	6.179	6.138
standard deviation	9.725	9.648
minimum	0	0
lower quartile	0.775	0.758
median	2.415	2.532
upper quartile	6.595	7.089
maximum	105.82	111.13
count	105,984 × 25	26,496 × 25

Table 3. The adjacency matrix and the distance of detectors.

	1	2	3	4	5	6	7	8	9	10	11	12	13	14	15	16	17	18	19	20	21	22	23	24	25
1	1	0	0	0	0	0	0.210	0	0	0	0.137	0	0	0	0	0	0	0	0	0	0	0	0	0	0
2	326	1	0.951	0.541	0.508	0.735	0.503	0	0	0	0	0	0.128	0.362	0	0	0	0	0	0	0	0	0	0	0
3	291	35	1	0.730	0.698	0.896	0.693	0	0.130	0	0	0.116	0.230	0.541	0	0	0.172	0	0	0	0	0	0	0	0
4	275	123	88	1	0.781	0.949	0.776	0	0.172	0	0	0.155	0.292	0.951	0.313	0.210	0.557	0	0	0.106	0	0	0	0	0
5	281	129	94	78	1	0.931	0.745	0.276	0.503	0	0	0.471	0.687	0.928	0.280	0.184	0.514	0	0	0	0	0	0	0	0
6	239	87	52	36	42	1	0.928	0.140	0.300	0	0	0.276	0.461	0.815	0.181	0.112	0.372	0	0	0	0	0	0	0	0
7	196	130	95	79	85	43	1	0.245	0.153	0	0	0.137	0.264	0.589	0	0	0.200	0	0	0	0	0	0	0	0
8	282	307	272	256	178	220	186	1	0.911	0.358	0.551	0.546	0.761	0.137	0	0.216	0	0	0	0.110	0	0	0	0	0
9	330	259	224	208	130	172	215	48	1	0.175	0.313	0.800	0.954	0.296	0	0.420	0.135	0.104	0	0.249	0	0	0	0	0
10	259	418	383	367	289	331	345	159	207	1	0.943	0.386	0.220	0	0	0	0	0	0	0	0	0	0	0	0
11	221	428	393	377	299	341	307	121	169	38	1	0.227	0.187	0	0	0	0	0	0	0	0	0	0	0	0
12	404	265	230	214	136	178	221	122	74	153	191	1	0.937	0.272	0	0.391	0.121	0	0	0.557	0.145	0	0	0	0
13	364	225	190	174	96	138	181	82	34	193	203	40	1	0.456	0.102	0.600	0.238	0.190	0.121	0.396	0	0	0	0	0
14	310	158	123	35	43	71	114	221	173	332	342	179	139	1	0.482	0.348	0.745	0	0	0.197	0	0	0	0	0
15	444	292	257	169	177	205	248	319	271	430	440	277	237	134	1	0.530	0.907	0.276	0.184	0.335	0	0	0	0	0
16	471	319	284	196	204	232	275	194	146	305	315	152	112	161	125	1	0.791	0.719	0.579	0.940	0	0.116	0	0	0
17	395	243	208	120	128	156	199	270	222	381	391	228	188	85	49	76	1	0.326	0.223	0.584	0	0	0	0	0
18	566	409	374	286	294	322	365	284	236	395	405	242	202	251	178	90	166	1	0.451	0.508	0	0	0	0	0
19	592	435	400	312	320	348	391	310	262	421	431	268	228	277	204	116	192	140	1	0.376	0	0	0	0	0
20	515	358	323	235	243	271	314	233	185	273	311	120	151	200	164	39	115	129	155	1	0	0.227	0.164	0	0
21	622	483	448	480	354	396	439	340	292	251	289	218	258	445	409	284	360	374	400	245	1	0.888	0.954	0.606	0.203
22	676	537	502	426	408	450	493	394	346	305	343	272	312	391	355	230	306	320	346	191	54	1	0.984	0.876	0.430
23	656	517	482	446	388	430	473	374	326	285	323	252	292	411	375	250	326	340	366	211	34	20	1	0.786	0.335
24	733	594	559	483	465	507	550	451	403	362	400	329	369	448	412	287	363	377	403	248	111	57	77	1	0.735
25	820	681	646	570	578	594	637	538	490	449	487	416	456	535	499	374	450	464	490	335	198	144	164	87	1

Note: the upper triangular matrix is the adjacency matrix calculated by weighting function, and the lower triangular matrix is the distance matrix, of which the unit is the meter.

Table 4. Performance of the different models.

Output Sequence Length	Metrics	HA	ARIMA	SVM	CNN	LSTM	STGCN	GCN
1 min	RMSE	3.470	3.176	3.224	3.136	3.249	2.833	2.840
	MAE	1.851	1.679	1.979	1.655	1.613	1.495	1.488
	R²	0.872	0.884	0.887	0.895	0.881	0.933	0.937
5 min	RMSE	3.536	3.464	3.402	3.426	3.460	2.948	3.042
	MAE	1.886	1.831	2.113	1.833	1.720	1.523	1.585
	R²	0.867	0.863	0.874	0.875	0.865	0.931	0.928
10 min	RMSE	3.610	3.618	3.694	3.508	3.532	3.097	3.178
	MAE	1.925	1.909	2.303	1.883	1.765	1.630	1.655
	R²	0.861	0.850	0.851	0.869	0.860	0.924	0.921
15 min	RMSE	3.680	3.713	3.962	3.542	3.611	3.225	3.289
	MAE	1.962	1.959	2.464	1.889	1.818	1.758	1.715
	R²	0.856	0.842	0.828	0.866	0.853	0.918	0.916
20 min	RMSE	3.748	3.789	4.231	3.602	3.639	3.357	3.374
	MAE	1.999	1.999	2.632	1.932	1.825	1.758	1.763
	R²	0.850	0.836	0.803	0.862	0.851	0.911	0.911
25 min	RMSE	3.815	3.855	4.496	3.695	3.651	3.429	3.462
	MAE	2.035	2.034	2.793	1.975	1.841	1.804	1.812
	R²	0.845	0.830	0.777	0.855	0.850	0.907	0.906
30 min	RMSE	3.883	3.919	4.731	3.863	3.683	3.494	3.537
	MAE	2.07	2.066	2.939	2.068	1.857	1.851	1.854
	R²	0.84	0.824	0.753	0.841	0.847	0.903	0.902
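
For readers who want to reproduce the kind of figures reported in Table 4, the following minimal sketch computes the three metrics, assuming the usual textbook definitions of RMSE, MAE and the coefficient of determination R². The sample values in `observed` and `predicted` are made-up toy numbers, not data from the study.

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Toy pedestrian counts (illustrative only).
observed = [10.0, 12.0, 15.0, 11.0]
predicted = [11.0, 12.0, 13.0, 12.0]

print(rmse(observed, predicted), mae(observed, predicted), r2(observed, predicted))
```

Lower RMSE and MAE and higher R² indicate a better fit, which is the sense in which the STGCN and GCN columns lead the table.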

Table 5. Prediction precision of the GCN model at different hours of the day.

Time	Weekdays RMSE	Weekdays MAE	Weekdays R²	Weekends RMSE	Weekends MAE	Weekends R²
00:00–00:59	0.942	0.591	0.891	0.883	0.583	0.902
01:00–01:59	0.81	0.471	0.914	0.684	0.436	0.903
02:00–02:59	0.626	0.383	0.916	0.648	0.411	0.914
03:00–03:59	0.538	0.329	0.913	0.821	0.426	0.925
04:00–04:59	0.562	0.329	0.888	0.707	0.384	0.926
05:00–05:59	0.588	0.358	0.859	0.673	0.406	0.873
06:00–06:59	0.72	0.437	0.748	0.703	0.421	0.751
07:00–07:59	1.025	0.625	0.706	0.916	0.577	0.699
08:00–08:59	1.32	0.82	0.67	1.228	0.789	0.726
09:00–09:59	1.922	1.089	0.814	2.05	1.234	0.822
10:00–10:59	1.987	1.246	0.808	2.442	1.57	0.817
11:00–11:59	2.413	1.508	0.821	2.993	1.866	0.816
12:00–12:59	2.71	1.703	0.861	3.265	2.079	0.841
13:00–13:59	3.069	1.944	0.878	4.118	2.618	0.877
14:00–14:59	3.48	2.193	0.914	4.842	3.043	0.897
15:00–15:59	4.068	2.497	0.924	5.636	3.468	0.882
16:00–16:59	4.01	2.436	0.944	5.579	3.362	0.93
17:00–17:59	4.452	2.727	0.93	5.582	3.511	0.939
18:00–18:59	3.851	2.324	0.931	4.974	3.122	0.941
19:00–19:59	3.645	2.173	0.93	4.743	2.989	0.931
20:00–20:59	3.474	2.128	0.927	4.357	2.75	0.926
21:00–21:59	3.175	1.993	0.906	3.852	2.358	0.903
22:00–22:59	2.56	1.643	0.86	2.695	1.712	0.873
23:00–23:59	1.436	0.917	0.808	1.502	0.971	0.827
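
A breakdown like Table 5 can be produced by bucketing each prediction by day type and hour before computing the error. The sketch below shows this for RMSE only; the function name `hourly_rmse`, the record layout and the sample values are assumptions for illustration and do not come from the study.

```python
import math
from collections import defaultdict
from datetime import datetime

def hourly_rmse(records):
    """Group (timestamp, observed, predicted) records by day type and hour.

    Returns a dict mapping ("weekday" | "weekend", hour) to the RMSE
    of the records that fall into that bucket.
    """
    buckets = defaultdict(list)
    for ts, obs, pred in records:
        # datetime.weekday(): Monday is 0, Saturday is 5, Sunday is 6.
        day_type = "weekend" if ts.weekday() >= 5 else "weekday"
        buckets[(day_type, ts.hour)].append((obs - pred) ** 2)
    return {key: math.sqrt(sum(sq) / len(sq)) for key, sq in buckets.items()}

# Toy records (illustrative only).
records = [
    (datetime(2021, 3, 1, 17, 5), 40.0, 37.0),  # Monday
    (datetime(2021, 3, 1, 17, 6), 42.0, 46.0),  # Monday
    (datetime(2021, 3, 6, 17, 5), 60.0, 55.0),  # Saturday
]

result = hourly_rmse(records)
print(result)
```

The same grouping key could be swapped for a weather label to obtain a per-condition summary.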

Table 6. The GCN model performance under different weather conditions.

Metrics	Sunny	Cloudy	Overcast	Rainy
RMSE	2.855	3.159	2.715	2.798
MAE	1.508	1.662	1.416	1.47
R²	0.941	0.947	0.923	0.939

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

