Map Matching Based on Seq2Seq with Topology Information

Bai, Yulong; Li, Guolian; Lu, Tianxiu; Wu, Yadong; Zhang, Weihan; Feng, Yidan

doi:10.3390/app132312920

Open AccessArticle

Map Matching Based on Seq2Seq with Topology Information

¹

School of Automation and lnformation Engineering, Sichuan University of Science and Engineering, Yibin 644002, China

²

Sichuan Provincial Engineering Laboratory of Big Data Visual Analysis, Yibin 644002, China

³

Sichuan Key Provincial Research Base of Intelligent Tourism, Yibin 644002, China

⁴

School of Computer Science and Engineering, Sichuan University of Science and Engineering, Yibin 644002, China

⁵

School of Mathematics and Statistics, Sichuan University of Science and Engineering, Yibin 644002, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(23), 12920; https://doi.org/10.3390/app132312920

Submission received: 3 November 2023 / Revised: 22 November 2023 / Accepted: 29 November 2023 / Published: 2 December 2023

Download

Browse Figures

Versions Notes

Abstract

:

Most existing road network matching algorithms are designed based on previous rules and do not fully utilize the potential of big data and historical tracks. To solve this problem, we introduce a new road network matching algorithm based on deep learning and using the topology information of the road network. Taking inspiration from the sequence-to-sequence (seq2seq) model popular in natural language processing, our algorithm builds multiple grid-dependent dictionaries based on the topology of road networks. Then the Byte Pair Encoding (BPE) algorithm is used to compress the grid dictionary, which effectively restricts the output range. A Bidirectional gated loop unit (Bi-GRU) with attention mechanisms is used as a recurrent neural network to capture information from a sequence of trajectory points. The model output feedback obtained by training the road network on Yibin City and the empirical evidence of the comparison in this experiment prove the effectiveness of the algorithm. When juxtaposed with similar algorithms, it shows superior accuracy and faster training speeds in road networks matching different scenarios.

Keywords:

deep learning; map matching; GIS

1. Introduction

Road network matching refers to the precise positioning of discrete GPS (Global Positioning System) trajectory points onto an actual traffic road network [1], enabling the accurate depiction of vehicle locations. This process finds extensive applications in areas such as trajectory visualization, navigation implementation, intelligent transportation systems, and path planning.

Numerous scholars both domestically and internationally have put forth a plethora of algorithms for road network matching. For instance, in geometric-based matching algorithms, Bernstein [2] initially proposed the computation of distances between GPS points and each node within the road network, subsequently associating the GPS points with their nearest corresponding nodes. This algorithm is straightforward to implement and computationally efficient. However, its practicality is significantly influenced by the node density in the road network. In light of this limitation, Quddus [3] proposed a weighted topological matching algorithm that incorporates the topological relationship of the road network. This approach assigns weights to trajectory direction, geometric distance, and road relevance to determine the overall weight of each road segment. Subsequently, it selects the road segment with the highest weight as the matching counterpart. Wang Xiaohan [4] and colleagues introduced a method that combines a linear regression model for more accurate identification of changes in road shape at intersections. By referring to the matching situation of multiple subsequent points, the map matching at complex road sections reduced matching errors. Abbour [5] and Nadine [6] proposed the confidence interval matching method, which selected multiple road segments within the confidence region and added them to the candidate road segment. Then, a score operation was performed on each candidate road segment, and the road segment with the highest score was selected as the matching road segment. In the algorithm based on Kalman filtering [7,8,9,10], the road network matching was carried out by considering the characteristics of GPS system errors and analyzing whether the error characteristics of the Kalman filtering model meet the Gaussian white noise distribution. The algorithm based on the HMM (Hidden Markov Model) [11,12,13] model has become a widely used benchmark method due to its applicability in sequence modeling and road network connectivity. Among them, the IVVM [12] algorithm fully considers the topological structure of the road network and the mutual influence between GPS trajectory points. By weighted processing the relationship between these points, the matching accuracy is improved. Yan Shenglong [14] and others improved IVVM and proposed the interactive voting algorithm IIVVM, which fully considers the distance characteristics, topological structure of roads, and a series of road attributes. By filtering noisy road segments through constraint conditions, the matching efficiency is improved.

In recent years, the advancement of artificial intelligence and computer processing capabilities has prompted various fields to explore the potential of big data. In the domain of road network matching, Zhao et al. [15] introduced the DeepMM algorithm, which reevaluates road network matching from a data-centric perspective. This algorithm leverages all trajectory data for joint training and knowledge sharing. It employs a sequence learning model that incorporates embedding techniques and attention mechanisms to conduct road network matching in latent spaces while ensuring stability in noisy environments. Lu Jia-Pin [16] proposed a map matching approach based on ranking learning, wherein the scoring function of candidate roads is learned from known matching outcomes. This method selects the optimal matched road segment by assigning scores to candidate road segments.

Inspired by seq2seq models in the field of natural language processing, this paper proposes a novel approach to represent road network data. It leverages multiple dictionaries that capture topological relationships based on road network topology and subdivision into fine-grained grids. To compress these dictionaries, the BPE algorithm is employed. The scope of our results is focused on the road network using fine-grained grids exclusively. In order to provide robust data support for deep learning, we generate a substantial number of simulation standard paths through Gaussian noise path generation within the Frechet distance framework, combined with efficient path search algorithms. Bi-GRU (Bi-directional GRU) is employed as a bidirectional recurrent network to reduce parameters and assimilate information from both trajectory history and future sequences during model training and prediction. Additionally, an attention mechanism is incorporated to augment the model’s proficiency in extracting hidden value information from sequences. Experimental findings demonstrate the commendable performance of our algorithm in road network matching tasks, showcasing improved accuracy compared to similar algorithms while significantly reducing training time.

2. Related Work

Our work comprises three main components: the creation of a dataset featuring a topological network structure, the development of a deep learning model based on the seq2seq approach, and the training and experimental validation of this model through comparisons with other existing models.

2.1. Basic Concepts and Problem Description

A road network is composed of a series of GPS trajectory points. These points are connected based on the topology of the road network, forming a directed graph G(V,E), where V represents the set of GPS points on the roads in the graph, and each point is represented as P = [lng, lat]. E represents the set of road edges formed by these GPS points, and each road edge is represented as l = [P1, P2, P3, … PN]. Based on this concept, this article defines a series of related dictionaries to express the concepts in map matching problems.

2.1.1. Road Network Dictionary

A road network comprises a sequence of GPS trajectory points, which are interconnected based on the topological structure of the road network, thereby forming a directed graph denoted as G(V,E). Here, V represents the set of GPS points located on the roads within the graph, with each point being represented as P = [lng, lat]. E denotes the set of road edges formed by these GPS points and is represented as l = [P1, P2, P3, … PN]. Building upon this concept, this article establishes a series of associated dictionaries to effectively express concepts pertaining to map matching problems.

2.1.2. Trajectory Dictionary

The trajectory dictionary generates a series of standard trajectories that satisfy the road network topology rules based on the network topology of the road, using the road network and road segment dictionaries as the basis. The generated standard trajectories are added to the dictionary, where each standard trajectory is represented as a set of points ST = [P1, P2, P3, …]. The dictionary is in the form of [ST1, ST2, ST3, … STN].

2.1.3. Grid Dictionary

The grid dictionary partitions the map into an N by N grid, serving as the word vector dictionary for the model. Increasing N enhances division accuracy, albeit at the expense of augmenting model parameters. The grid is arranged in ascending order from the bottom left to the top right. The dictionary takes on a sequential form of [1, 2, …, N × N].

2.1.4. Network Grid Dictionary

The road network grid dictionary replaces all GPS trajectory points in the road network dictionary with their corresponding position ID in the grid dictionary, resulting in a dictionary comprising grid IDs. This dictionary is represented as [CELL1, CELL2, … CELLN], where its length matches the number of GPS points present in the road network.

2.1.5. Road Segment Grid Dictionary

The road segment grid dictionary converts all GPS points within the road segments in the road segment dictionary into their corresponding grid points within the grid dictionary. The index of the dictionary represents the unique identifier for each road segment, while its corresponding value denotes the set of grid-based edges associated with that particular road segment. The structure of this dictionary is represented as [L1, L2, L3, … LN], where L = [CELL1, CELL2, CELL3, … CELLN].

2.1.6. Trajectory Grid Dictionary

The trajectory grid dictionary corresponds to the grid representation of the trajectory dictionary by replacing GPS trajectory points with grid IDs. The resulting sequence also adheres to road network topology rules. Each trajectory is represented as CELL_ST = [CELL1, CELL2, CELL3, … CELLN]. The dictionary is structured as [CELL_ST1, CELL_ST2, CELL_ST3, … CELL_STN].

The final grid ID sequence, along with its topology relationship, is utilized during the model training process, while the remaining dictionaries serve as encoding and decoding references. The formats of the corresponding dependent dictionaries are illustrated in Table 1, whereas Figure 1 depicts the specific mapping logic from points to grids.

The task of road network matching involves inferring the accurate grid sequence from the GPS points’ sequence of a given mobile object, utilizing the algorithm model presented in this paper. Subsequently, the true road network trajectory sequence is decoded based on the inferred grid sequence, thereby completing the road network matching process. For instance, as depicted in Figure 2a, the line represents the GPS point trajectory, while in Figure 2b, it represents the corresponding authentic trajectory. The objective of road network matching is to employ the algorithm for inferring the genuine trajectory shown in Figure 2b from the GPS point trajectory illustrated in Figure 2a.

Described in the form of a grid diagram as shown in the following figure, the lines connected by green points in Figure 3a are the true tracks, the red points are the original measured latitude and longitude points, and the line segment connected by the red points is the wrong track. The purpose of map matching is to infer the wrong trajectory formed by the red dots to the true trajectory formed by the green dots. First, the two trajectory points need to be mapped on the grid, as shown in Figure 3b. The green grid represents the grid sequence of the true trajectory mapping, the red grid represents the grid sequence of the wrong trajectory mapping, and the yellow grid represents the overlap of red and green. The red grid sequence in Figure 3b is input into the trained model, and the model can output the red grid sequence in Figure 3c through calculation, and finally decode the grid sequence into the real geographical longitude and latitude point sequence connected by the green points in Figure 3c.

2.2. Dataset Preparation

The original data for this article was obtained from the OpenStreetMap download of the road network in Yibin City, which was used as the data source. Using QGIS software, special road sections were removed from the network, leaving only the main roads in Yibin City, as shown in the diagram below. The dataset includes 6429 GPS road points and 619 roads. The coordinates of the bottom left corner node are left_bottom = [104.5783146, 28.7090704], and the coordinates of the top right corner node are right_top = [104.6833462, 28.7962751]. The total area is 99,507,385 m².

Based on this information, a directed graph G(V,E) was generated using the NetworkX data processing package for complex graphic network analysis in Python, and all nodes and edges were calculated, totaling 5810.

For ease of description, this article divides the map into a grid of 1000 × 1000, with a precision of 9.95 m × 9.95 m, and assigns a grid code from 1 to 100,000. In actual environments, the grid precision can be adjusted according to demand.

2.2.1. Simulated Real Trajectory Generation

Obtaining large datasets to support deep learning work is a challenging task. Therefore, simulating real and noisy path generation is a crucial part of the work. In this study, we used the NetWorkX library’s simple path function to simulate real paths. All simulated standard trajectories will be used as the target sequence for the model. The format of some short path sequences is shown in Table 2. All paths follow the topological rules of the road network.

2.2.2. Noise Trajectory Generation

In real life, GPS systems often have some degree of deviation from the actual location, which is typically within a certain range known as the Fresnel distance as shown in Equation (1), where A and B are two continuous curves, α and β are two reparameterization functions of the unit interval, and d is a metric function. This formula represents the minimum of the maximum distances between A(α(t)) and B(β(t)) under all possible α and β values. This is the optimal case of the worst-case distance under all possible path pairings. This definition captures the similarity between two paths, taking into account both the shape and length of the paths, or there may be an overall error in a series of GPS points. Therefore, following this rule, Gaussian spatial noise is added to all standard simulated paths to generate a series of noisy trajectories as a corpus for the model’s input sequence, as described in Equation (2).

F (A, B) = \underset{α, β}{i n f} \max_{tϵ [0,1]} \{d (A (α (t)), B (β (t)))\} .

(1)

f (x | μ, σ^{2}) = \frac{1}{\sqrt{2 π σ^{2}}} e^{\frac{- {(x - μ)}^{2}}{2 σ^{2}}} .

(2)

The expected value of the μ distribution and σ is the standard deviation, both of which can be set from the actual accuracy requirements. Examples of noise tracks generated based on nodes 0 to 103 in Table 2 are shown in Table 3.

The generated noise trajectory points of the entire road network are shown in Figure 4. The yellow dots are noise points generated by the algorithm, representing inaccurate latitude and longitude points, which are used as materials for model training. The red lines are the road network of Yibin City.

2.2.3. Data Compression

During the dataset preparation stage, a grid of 1000 × 1000 was partitioned, resulting in a grid dictionary with a length of 1 million. However, such an extensive dictionary would significantly impact both the training time and accuracy of the model. Hence, it becomes imperative to compress the grid dictionary.

The BPE algorithm [17] segments all words in the training corpus into character sequences, calculates the frequency of consecutive byte occurrences, selects the most frequent byte pairs, and combines them to form new subwords. This process is iteratively performed to update the vocabulary. As this paper’s data is stored in dictionary mode, employing the BPE algorithm for customized compression becomes effortless. The initial dictionary length was 1,000,000; however, through utilizing the BPE algorithm, it was compressed to one-tenth of its original size. The relationship between the new and original dictionaries has been preserved as a decoding basis for model output.

3. Model Design

In Feng J’s work [15], a unidirectional LSTM (Long short-term memory) was used as the recurrent neural network for state transfer in seq2seq. Although LSTM has slightly higher accuracy in processing large-scale data, it is extremely time consuming. Related experiments have shown that in situations where the data volume is moderately large, the effect of GRU is comparable to that of LSTM, and it has the advantage of faster processing time. The structure of GRU is shown in Figure 5.

The GRU uses a reset gate to control the amount of data in the current and memory information. It uses an update gate to calculate the output of the current hidden state, generating new memory information to continue forward propagation. The relevant mathematical formulas are shown in Equations (3)–(6).

z_{t} = σ (W_{z} x_{t} + U_{z} h_{t - 1}) .

(3)

r_{t} = σ (W_{z} x_{t} + U_{z} h_{t - 1} + b_{z}) .

(4)

\tilde{h} = t a n h (W X_{t} + r_{t} U h_{t - 1} + b_{h}) .

(5)

h_{t} = (1 - z_{t}) h_{t} + z_{T} h_{t - 1},

(6)

where

h_{t - 1}

is the hidden layer of the previous time,

x_{t}

is the current input, and the output is the hidden layer information

h_{t}

of the current time.

r_{t}

is the reset gate, used to calculate the candidate hidden layer

\tilde{h}

, in order to retain how much

h_{t - 1}

information.

z_{t}

is the update gate, which is used to control how much

\tilde{h}

information is added to get the output

h_{t}

.

In this paper, Bi-GRU is used to transfer the state of model coding phase. Its main feature is that it can absorb historical and future information, so as to capture information of a complete trajectory sequence. The structure of Bi-GRU is shown in Figure 6. In the Figure 6, FD represents the last GRU unit of forward propagation, and BD represents the last GRU unit of backward propagation.

In addition, in order to enhance the ability of the model to capture the dependency between noisy trajectories and real road network trajectories, this article applies the Bahdanau [18] attention mechanism, as shown in Figure 7.

First, a weight coefficient α_tj is generated for each hidden state h_j. The weight coefficients are then subjected to a Softmax operation to obtain the attention weights α_tj for each hidden state. The attention vector context is obtained by calculating the weighted sum of the attention weights and each hidden state. Finally, the attention vector is integrated with a fully connected layer to form the output value. The calculation process is shown in Equations (7)–(9).

e_{t, j} = a (s_{t - 1}, h_{j}) .

(7)

α_{t, j} = s o f t m a x (e_{t, j}) = \frac{\exp (e_{t, j})}{\sum_{k = 1}^{n} \exp (e_{t k})} .

(8)

c o n t e x t = \sum_{t = 1}^{T} (α_{t, j} s_{t}) .

(9)

The final output value will be calculated using a Softmax function to obtain a probability matrix for the grid. The highest probability value for each grid will be outputted to form a grid sequence. By converting it to the real coordinates using the dictionary of the road network, the actual trajectory output can be obtained. The overall structure of the algorithmic model is shown in Figure 8.

4. Experiment

4.1. Experimental Environment

The models involved in the experiment were built using the TensorFlow-gpu (2.3.0) framework and trained on a NVIDIA Quadro RTX 6000 graphics card with a 24 GB memory. The model parameters are shown in Table 4. The dictionary compression refers to the proportion of the original mesh dictionary compressed using the BPE algorithm, and the data augmentation multiplier is the number of noise paths generated based on a single path.

4.2. Experimental Data

The training data used for the model consisted of the road network of Yibin City downloaded from OpenStreetMap, where standard trajectory data were generated through simple path search in NetWork X, and simulated noisy trajectory data. A total of 260,389 standard paths and 13,019,450 simulated noisy paths were used.

4.3. Evaluation Index

Since the problem of road network matching is the matching of noise track sequence to road segment sequence, the absolute accuracy of the model cannot be simply used as the evaluation criterion. In this paper, the accuracy evaluation method of road section mentioned in article [15] is followed, as shown in Equation (10).

a c c = \frac{l e n (t r u e_{t r a} \cap p r e_{t r a})}{\max (l e n (t r u e_{t r a}), l e n (p r e_{t r e}))} .

(10)

l e n (t r u e_{t r a})

is the total length of the standard trajectory of the section,

l e n (p r e_{t r e})

is the total length of the matched trajectory, and

l e n (t r u e_{t r a} \cap p r e_{t r a})

is the total length of the intersection between the correct trajectory and the matched trajectory. In this article, the Haversine formula is used to calculate the distance between latitude and longitude points, as shown in Equation (11).

d = 2 r \sin^{- 1} (\sqrt{\sin^{2} (\frac{θ_{2} - θ_{1}}{2}) + \cos (α_{1}) \cos (α_{2}) \sin^{2} (\frac{α_{2} - α_{1}}{2})}),

(11)

where

r

is the radius of the Earth (6371 km),

θ_{1}

and

θ_{2}

are the latitude between two points, and

α_{1}

and

α_{2}

are the longitude between two points.

4.4. Experimental Result

4.4.1. Visualization of Matching Effect

Two simulated noise tracks were selected to match the road network through the model, and the results were visualized as shown in Figure 9.

It can be seen that the results obtained by the model match the topology characteristics of the road network and are consistent with the real trajectory. Vehicle track data is strictly restricted to the road network, so the noise track is confined to the road network with a certain distribution law of data. The simulation trajectory generated by the data expansion method in this paper can almost look like the actual driving trajectory. However, in order to ensure the rigor of the experiment, it is still necessary to use the real driving track for actual verification. Therefore, this paper collected a section of Yibin City driving routes for test verification. The software that collects track information is the Android App (GPS racking Route Map). The track is approximately 31.31 km long and contains 1467 GPS track points.

The overall trajectory is shown in Figure 10 below. The green area represents the recorded track points, and the red is the route formed by the points.

The overall trajectory was decomposed and piecewise put into the model for matching. The total length of matching was 31.31 km. The correct trajectory length calculated by Equation (10) was 28.89 km, and the matching accuracy was 92.27%. The matching results of some conventional sections are shown in Figure 11. In the figure, the red path is the matching path, and the green path is the path formed by the direct connection of the collected original track points.

The matching results of conventional sections are shown in Figure 11.

According to Figure 12 depicting matching results, it can be seen that the matching effect is very good whether it is in the conventional road section or the complex intersection road section. There is no wrong lane matching, large deviation matching, or other types of wrong matching. Some mismatched sections are shown in Figure 13 below.

The left side of Figure 13 shows the turntable area, and the matching results produced an error loop. Our estimation is that there was a traffic jam when the trajectory was collected, which resulted in excessive positioning in a short time, resulting in a deviation of the model prediction. The right side of Figure 13 shows the intersecting multi-road section, and the correct route should be the route in the yellow area.

4.4.2. Model Performance Comparison

The overall model training accuracy rate and loss curve in this paper are shown in Figure 14.

The absolute accuracy of the model on the training set is 89.93%, and the absolute accuracy of the model on the verification set is 89.88%. Through Equation (10), the accuracy of the verification set is re-converted, and the accuracy of the converted model is 95.45%. It shows that the road network matching method proposed in this paper is effective.

Through the comparison of LSTM, Bi-LSTM, Bi-LSTM-ATTN, and other model structures under the experimental conditions of the same parameters in Table 4, the parameters are shown in Table 5, and all accuracy rates are absolute accuracy rates in the verification set. It can be seen that after 20 iterations, the accuracy of the model trained by the bidirectional LSTM is 1.81 points higher than that of the unidirectional LSTM, and the accuracy of the model with the addition of attention mechanisms on this basis is 86.71 points, and the accuracy of the model in this paper is 89.93 points. It shows that the accuracy of the model can be improved to some extent by using bidirectional recurrent neural network and attention mechanisms.

4.4.3. Optimized Controlled Experiment

In order to verify the validity of the data expansion method and the effect of the compressed dictionary on the training time and training accuracy, based on different data expansion conditions, the compression degree of dictionary compression is shown in Figure 15. Several groups of control experiments were set, and other experimental parameters remained unchanged as shown in Table 4. Accuracy is the absolute accuracy of the verification set.

From the data expansion control test in Figure 15, it can be seen that when the expansion multiple is between 10 and 50 times, the absolute accuracy of the model will be greatly improved, and when the expansion multiple is above 50 times, the accuracy tends to be stable and oscillates between 89.57% and 89.93%.

The experimental results show that the data expansion method is effective to improve the absolute accuracy of the model.

It can be seen from the controlled experiments in Figure 16 that when the compressed data range is between 1/5 and 1/10, the accuracy rate of the model is maintained between 88.11 points and 89.04 points, and the training speed is increased from 276 ms/Step to 27 ms/Step. When the compression degree was increased to 1/15, the absolute accuracy of the model began to decline significantly to 62.37 points, and when the compression was continued to 1/20, the absolute accuracy of the model dropped to 43.37 points.

The experimental results show that the data compression method is effective for improving the training speed, and has limited influence on the accuracy of the model within a certain compression range. Beyond a certain compression range, the absolute accuracy of the model will be seriously affected.

5. Conclusions

To solve the matching problem between GPS tracking and real road networks, this paper proposes a method based on seq2seq combined with topological information, modeling road network matching into seq2seq sequence model, creating multiple dependency dictionaries according to the topological relationship of a road network, fine grid division, and limiting the matching results to road network. Using the BPE algorithm to compress the grid dictionary, the length of the word vector is reduced and the training time is optimized. Standard trajectory data sets are established through simple path search through NetWork X, and a large number of noise trajectories are generated based on Gaussian noise within a certain Frechet distance to support large data volume in deep learning. Bi-GRU is used as a bidirectional recurrent neural network to extract the information of both historical trajectory sequence and future trajectory sequence. We added attention mechanisms to enhance the ability of extracting sequence hidden value information. The experimental results show that this road network matching method is effective and has higher matching accuracy and less model training time than other methods.

Author Contributions

Methodology, Y.B.; Software, Y.B. and G.L.; Formal analysis, W.Z.; Validation, Y.B. and T.L.; Resources, W.Z.; Data curation, Y.B.; Writing—original draft, Y.B.; Writing—review & editing, T.L.; Supervision, Y.W.; Project administration, Y.W.; Funding acquisition, Y.W. and Y.F. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Talent Introduction Project of Sichuan University of Science & Engineering (2020RC20), Innovation Fund of Postgraduate, Sichuan University of Science & Engineering (Y2022106) and the Sichuan Key Provincial Research Base of Intelligent Tourism (No. ZHYJ23-03).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy.

Conflicts of Interest

The authors declare no conflict of interest.

References

Gao, W.; Li, G.; Ta, N. Overview of road network matching algorithms. J. Softw. 2018, 29, 225–250. [Google Scholar]
Bernstein, D.; Kornhauser, A. An introduction to map matching for personal navigation assistants. Geom. Distrib. 1996, 122, 1082–1083. [Google Scholar]
Quddus, M.A.; Ochieng, W.Y.; Zhao, L.; Noland, R.B. A general map matching algorithm for transport telematics applications. GPS Solut. 2003, 7, 157–167. [Google Scholar] [CrossRef]
Xiaohan, W.; Zengyu, H.; Wangwu, H.; Pei, W.; Long, Y. Method of Angle difference and follow-up point map matching for complex Road sections. Appl. Res. Comput. 2022, 39, 379–384. [Google Scholar]
Abbour, M.; Bonnifait, P.; Cherfaoui, V. Map matching integrity using multi-sensor fusion and multi-hypothesis road tracking. J. Intell. Transp. Syst. Technol. Planllingand Oper. 2008, 6, 189–201. [Google Scholar] [CrossRef]
Nadine, S.; Kay, W.A. Map matching of GPS traces on high-resolution navigation networks using the multiple hypothesis techniique (MHT). Work. Pap. Transp. Spat. Plan. 2009, 568, 568–588. [Google Scholar]
Obradovic, D.; Lenz, H.; Schupfner, M. Fusion of map and sensor data in a modern car navigation system. J. Signal Process. 2006, 45, 111–122. [Google Scholar] [CrossRef]
Yu, H.; Liu, H.C.; Tan, C.W.; Bao, Y. Development and application of an enhanced kalman filter and global positioning system errorcorrection approach for improved map matching. J. Intell. Transp. Syst. Technol. Plan. Oper. 2010, 14, 27–36. [Google Scholar]
Kim, W.; Jee, G.I.; Lee, J.G. Efficient use of digital road map in various positioning for ITS. In Proceedings of the IEEE 2000. Position Location and Navigation Symposium (Cat. No.00CH37062), San Diego, CA, USA, 13–16 March 2000; pp. 170–176. [Google Scholar]
Li, Z.; Chen, W. A new approach to map-matching and parameter correcting for vehicle navigation system in the area of shadow of GPS signal. In Proceedings of the 2005 IEEE Intelligent Transportation Systems, Vienna, Austria, 16 September 2005; pp. 425–430. [Google Scholar]
Lou, Y.; Zhang, C.; Zheng, Y.; Wang, W.; Huang, Y. Map-Matching for low-sampling-rate GPS trajectories. In Proceedings of the GIS’09: Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 4–6 November 2009; pp. 352–361. [Google Scholar]
Yuan, J.; Zheng, Y.; Zhang, C.; Xie, X.; Sun, G.Z. An interactive-voting based map matching algorithm. In Proceedings of the 2010 Eleventh International Conference on Mobile Data Management, Kansas City, MO, USA, 23–26 May 2010; pp. 43–52. [Google Scholar]
Miwa, T.; Kiuchi, D.; Yamamoto, T.; Morikawa, T. Development of map matching algorithm for low frequency probe data. Transp. Res. Part C Emerg. Technol. 2012, 22, 132–145. [Google Scholar] [CrossRef]
Shenglong, Y.; Juan, Y.; Houpan, Z. IIVMM: An improved interactive voting matching algorithm for Low frequency GPS trajectory. Comput. Sci. 2019, 46, 325–332. [Google Scholar]
Feng, J.; Li, Y.; Zhao, K.; Xu, Z.; Xia, T.; Zhang, J.; Jin, D. DeepMM: Deep Learning Based Map Matching with Data Augmentation. IEEE Trans. Mob. Comput. 2020, 21, 2372–2384. [Google Scholar] [CrossRef]
Lu, J.; Luo, Y.; Huang, Z.; Zhang, Y.-K.; Chen, W. Map matching method based on ranked learning and multi-source information. J. Zhejiang Univ. (Sci. Ed.) 2019, 47, 27–35. [Google Scholar]
Sennrich, R.; Haddow, B.; Birch, A. Neural Machine Translation of Rare Words with Subword Units. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany, 7–12 August 2016; pp. 1715–1725. [Google Scholar]
Bahdanau, D.; Cho, K.; Bengio, Y. Neural Machine Translation by Jointly Learning to Align and Translate. arXiv 2014, arXiv:1409.0473. [Google Scholar]

Figure 1. Grid mapping logic.

Figure 2. (a) The wrong trajectory; (b) The corrected trajectory.

Figure 3. (a) The red ponits connect to the error trajectory, and the green points are connected to the real trajectory. (b) The red grid is connected to the wrong grid sequence, and the green grid is connected to the real grid sequence (c) The red grid is connected to form the correct grid sequence, and the green points are connected to the real trajectory.

Figure 4. Real trajectory and noise trajectory points.

Figure 5. GRU structure diagram.

Figure 6. Bi-GRU Structure Diagram.

Figure 7. Attention structure diagram.

Figure 8. Global network structure.

Figure 9. Matching result visualization.

Figure 10. The red line represents the actual trajectory and the green part represents the generated noise points.

Figure 11. Conventional road section matching results. The green line represents the wrong trajectory and the red line represents the corrected trajectory.

Figure 12. Matching results of complex intersection sections. The green line represents the wrong trajectory and the red line represents the corrected trajectory.

Figure 13. Wrong match. The green line represents the wrong trajectory and the red line represents the corrected trajectory.

Figure 14. Accuracy and Loss.

Figure 15. Data expansion controlled experiment.

Figure 16. Data compression controlled experiment.

Table 1. Related Dependency Dictionary Form Examples.

Dictionary	Data Structure Example
Road network dictionary	{P0:[104.61686, 28.74756]…}
Trajectory dictionary	[{L0:[P1, P21, P33, P41, P49]}…]
Grid dictionary	[{L0:[P11, P221, P333, P411, P491]}…]
Network grid dictionary	[{Ln:[P131, P2133, P3143, P4415, P4391]}…]
Road segment grid dictionary	[{P0:442367},{P1:442345}…]
Trajectory grid dictionary	[{L0:[432367, 425370,…439404]…}

Table 2. Some Shorter Path Sequences.

Starting Point and Endpoint	Path
0==>103	[0, 2970, 377, 592, 2510, 2220, 103]
1==>3923	[1, 5352, 1538, 4355, 2715, 3822, 3923]

Table 3. Noise Pathway Example.

Starting Point and Endpoint	Cell Path
0==>103	[757497, 759494, 764486, 768484, 776483, 811426]
1==>3923	[758500, 759499, 763494, 768488, 773487, 813422]

Table 4. Model Parameters.

Params	Value
Iteration count.	20
BatchSize	128
Number of nodes on the longest/shortest path.	22/5
Optimizer	Adam
Loss function.	crossentropy
Dictionary compression ratio.	0.2
Shuffer	True
Data set partition ratio.	1/10
Data expansion factor.	50

Table 5. Model Comparison Experiment.

	LSTM	BI-LSTM	BI-LSTM-Attn	Textual Model
Training set accuracy	83.51%	85.32%	86.71%	89.93%
Test set accuracy	80.23%	82.34%	84.74%	89.88%
Training set loss	0.4131	0.3691	0.3271	0.2311
Test set loss	0.5221	0.4561	0.3711	0.2298

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bai, Y.; Li, G.; Lu, T.; Wu, Y.; Zhang, W.; Feng, Y. Map Matching Based on Seq2Seq with Topology Information. Appl. Sci. 2023, 13, 12920. https://doi.org/10.3390/app132312920

AMA Style

Bai Y, Li G, Lu T, Wu Y, Zhang W, Feng Y. Map Matching Based on Seq2Seq with Topology Information. Applied Sciences. 2023; 13(23):12920. https://doi.org/10.3390/app132312920

Chicago/Turabian Style

Bai, Yulong, Guolian Li, Tianxiu Lu, Yadong Wu, Weihan Zhang, and Yidan Feng. 2023. "Map Matching Based on Seq2Seq with Topology Information" Applied Sciences 13, no. 23: 12920. https://doi.org/10.3390/app132312920

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Map Matching Based on Seq2Seq with Topology Information

Abstract

1. Introduction

2. Related Work

2.1. Basic Concepts and Problem Description

2.1.1. Road Network Dictionary

2.1.2. Trajectory Dictionary

2.1.3. Grid Dictionary

2.1.4. Network Grid Dictionary

2.1.5. Road Segment Grid Dictionary

2.1.6. Trajectory Grid Dictionary

2.2. Dataset Preparation

2.2.1. Simulated Real Trajectory Generation

2.2.2. Noise Trajectory Generation

2.2.3. Data Compression

3. Model Design

4. Experiment

4.1. Experimental Environment

4.2. Experimental Data

4.3. Evaluation Index

4.4. Experimental Result

4.4.1. Visualization of Matching Effect

4.4.2. Model Performance Comparison

4.4.3. Optimized Controlled Experiment

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI