Traffic Flow Prediction Model Based on Multivariate Time Series and Pattern Mining in Terminal Area

2023-11-22 09:17,,3*,,,

,,3*,,,

1.College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics,Nanjing 211106,P.R.China;

2.College of Civil Aviation, Nanjing University of Aeronautics and Astronautics, Nanjing 211106,P.R.China;

3.Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing 210023,P.R.China

Abstract: To improve the accuracy of traffic flow prediction under different weather scenarios in the terminal area, a terminal area traffic flow prediction model fusing multivariate time series and pattern mining(MTSPM) is proposed.Firstly, a multivariate time series-based traffic flow prediction model for terminal areas is presented where the traffic demand, weather, and strategy of terminal areas are fused to optimize the traffic flow prediction by a deep learning model CNN-GRUA, here CNN is the convolutional neural network and GRUA denotes the gated recurrent unit(GRU) model with attention mechanism.Secondly, a time series bag-of-pattern (BOP) representation based on trend segmentation symbolization, TSSBOP, is designed for univariate time series prediction model to mine the intrinsic patterns in the traffic flow series through trend-based segmentation, symbolization, and pattern representation.Finally, the final traffic flow prediction values are obtained by weighted fusion based on the prediction accuracy on the validation set of the two models.The comparison experiments on the historical data set of the Guangzhou terminal area show that the proposed time series representation TSSBOP can effectively mine the patterns in the original time series, and the proposed traffic flow prediction model MTSPM can significantly enhance the performance of traffic flow prediction under different weather scenarios in the terminal area.

Key words:traffic flow prediction; multivariate time series; time series representation; pattern mining; deep learning

0 Introduction

Terminal areas are the most crucial and complex parts of the air transportation system, which makes the study of traffic conditions in terminal areas particularly important.Traffic flow prediction in a terminal area aims to accurately predict the air traffic flow within the terminal area over a period of time in the future.Accurate traffic flow prediction is vital for the management of airlines and airports as it enables decision-makers to develop more reasonable flight plans, optimize resource allocation, and efficiently dispatch personnel, which can improve operational efficiency and cost control.However, since traffic flow in a terminal area is influenced by various factors, including flight schedules, seasonal changes, passenger behavior, and unexpected events such as weather changes and flight delays,the prediction faces several challenges.The interaction and variability of these factors make it even more difficult to achieve satisfactory predictions.Therefore, it is crucial to select appropriate models and algorithms for traffic flow prediction, considering the characteristics of flight operations and the effects of various factors.

In previous studies on traffic flow prediction,researchers primarily relied on non-machine learning methods, such as moving average and exponential smoothing, which are based on historical data for statistical prediction[1-2].However, these methods often incorporate subjective ideas and lack objectivity.With the advancement of machine learning,some researchers tried to use machine learning technologies for traffic flow prediction, including support vector machine (SVM), random forest (RF),contrast learning (CL), active learning(AL), and neural networks (NN)[3-4].These algorithms can effectively handle nonlinear problems but require substantial amounts of data and appropriate feature engineering.Researchers also predicted the traffic flow by treating traffic flow as an individual time series and disregarding the impacts of other factors on traffic flow.In recent years, a few researchers have started to study the traffic flow forecasting problem from a multivariate time series aspect and achieved good results[5].However, these methods lack the mining of the intrinsic patterns of the time series and thus fail to capture the trends information and deeper features of the time series.

In this paper, we attempt to construct a traffic flow prediction model based on multivariate time series and pattering mining (MTSPM) to tackle the complexity and high dimensionality of the terminal area traffic flow data.First, a multivariate time series fusion model is designed to use the related features for prediction.Meanwhile, a time series bagof-pattern (BOP) representation based on trend segmentation symbolization, TSSBOP, is designed to extract the intrinsic patterns from the original series and use them for prediction.Finally, the two prediction results are fused and output.By considering both the effects of multivariate time series features and the intrinsic patterns of the original series, our method can give more accurate traffic flow prediction in the terminal area.

The main contributions of this paper are summarized as follows:

(1) A multivariate time series fusion model is designed to predict the traffic flow in the terminal area based on multiple influencing factors.

(2) TSSBOP representation method is proposed to mine intrinsic patterns from the original series.

(3) A traffic flow prediction model fusing multivariate time series and pattern mining is proposed,and its effectiveness and superiority are validated on the historical traffic dataset of the Guangzhou terminal area.

1 Related Work

In the research field of traffic flow, many early researchers employed non-machine learning methods to make long-term or mid-long-term predictions under different weather scenarios.For example,Pan et al.[1]used a gray prediction algorithm to predict air traffic flow, taking into account the complexity, nonlinearity, and uncertainty of air traffic flow.Dmochowski et al.[2]analyzed the features of air traffic flow in terminal areas based on observational data.Liu et al.[6]combined a gray GM (2, 1) model,regression model, and system dynamics for strategic predicting of air traffic flow.However, most of these non-machine learning methods need to incorporate subjective ideas.Consequently, some researchers began to apply machine learning methods for traffic flow prediction.For example, Chen et al.[3]predicted traffic flow under similar scenarios by using active metric learning.Mao et al.[7]quantified the impact of convective weather on traffic capacity in the terminal area from the aspects of airspace, traffic,weather, and proposed a capacity prediction model based on random forest algorithm.Chen et al.[4]addressed different traffic flow prediction problems under different scenarios by comparative learning methods.Additionally, Hossain et al.[8]quantified the impact of convective weather on the terminal area and proposed a novel traffic flow prediction model based on the weather impact traffic index.Wang et al.[9]developed a stochastic dynamic model to predict traffic flow considering adverse weather conditions.

Machine learning methods always require large amounts of data and proper feature engineering.Dealing with high-dimensional large data sets can affect the training speed of the model, while unsuitable feature engineering can affect the accuracy of the prediction.Therefore, some researchers tried to use time series mining methods for traffic flow prediction.For example, Wang et al.[10]used an improved weighted first-order local method for air traffic flow prediction.Yang et al.[11]proposed a string segmentation algorithm based on regular expressions, combined with an aviation vertical profile model, and validated the traffic flow prediction results using real aeronautical fixed telecommunication network (AFTN) message data.After that,they applied a time series prediction model based on the echo state network to predict traffic flow.Song et al.[12]used the dynamic time warping (DTW) algorithm to measure traffic flow correlation between sectors, constructed a dataset based on correlation,and developed an long short-term memory(LSTM) network prediction model for traffic flow prediction under different input conditions.Although time series methods improved the prediction accuracy and efficiency, they primarily focused on individual traffic flow series without considering the influence of other factors on traffic flow.In recent years, researchers have attempted to address traffic flow prediction from the aspect of multivariate time series.They have achieved promising results by using deep learning models to incorporate multivariate features and predict traffic flow in the terminal area.For example, Lin et al.[13]proposed a multi-input deep learning model to predict traffic flow in the terminal area under convective weather.Yan et al.[14]developed a deep learning-based traffic flow prediction framework that can capture the spatial and temporal dependencies of historical traffic flows and predict inbound traffic flows.Peng et al.[15]proposed a multi-input deep learning model of traffic flow prediction for the terminal area under convective weather.They all aimed to improve the accuracy and stability of terminal area traffic flow prediction by extending the weather features that affect the traffic state.However, these multivariate time series methods ignore the learning of the intrinsic patterns of the series itself and therefore fail to capture the trends and deeper features of the time series.

In order to mine the intrinsic patterns of the time series, an effective representation of the time series is the most critical step.Zhang et al.[16]introduced a straightforward time series representation method called trend based symbolic aggregation approximation (TSAX), which can capture the trend features of each segment to enhance representation accuracy.Lin et al.[17]proposed a BOP representation that extracts substructures from the original series as highlevel features of the time series.These substructures are transformed using SAX enabling dimensionality reduction of the entire series while preserving local features.Ruan et al.[18]proposed an enhanced symbolic representation known as TrSAX.Building upon the BOP representation, they combined SAX with the least squares method to describe the mean and trend information of the time series, which can further refine the substructures.

Inspired by all these researches, we attempt to study traffic flow prediction methods in terms of both multivariate time series and univariate time series, respectively.By exploring the intrinsic patterns and dependencies existing in the traffic flow series, a traffic flow prediction model fusing multivariate time series and pattern mining is constructed for the terminal area.

2 MTSPM Model

In this section, we will propose the traffic flow prediction model MTSPM.The workflow of MTSPM is presented in Fig.1.As the figure shows, the MTSPM model has four parts: Data processing,multivariate time series fusing, univariate time series pattern mining and weighted fusion outputting.The details of each part are described in the following sections.

2.1 Data processing

Fig.1 Workflow of the traffic flow prediction model MTSPM

For accurate prediction of traffic flow in the terminal area under different weather scenarios, we first construct a traffic state dataset with weather,traffic demand, and control strategy as features and actual traffic flow as target values.Among them,the values of weather features can be calculated by ATMAP-CW from 3D weather avoidance area(WAF) data(containing both image and numerical forms) and regular weather report (METAR) data, such as convective weather impact index, visibility impact index, wind speed, gust, rainfall, thunderstorm, wind direction, accumulated rain, etc.The values of traffic demand feature can be calculated from the flight plan and flight log data for actual flow, planned flow, tolerance, inbound delay, outbound delay, cancelled flight volume, normal flight volume, cabin volume and normal flight volume,etc.The control policy feature can be obtained from mile-in-trail (MIT) data by calculating the average crossing interval limit, policy release intensity,crossing interval limit effectiveness, whether the policy is active or not, and the traffic flow affected by the policy.The actual traffic flow can be counted directly from the executed flight plans.In this dataset, both features and targets are time series and they are synchronized in time in any sample.

2.2 Multivariate time series fusing

In the part of multivariate time series fusion,we first use convolutional neural network (CNN)model to fuse the input multivariate traffic flow feature time series, including weather series, traffic demand series and control strategy series, to extract the deep data features of each time series in the highdimensional embedding space.Then, the computational results of CNN model are input to gated recurrent unit (GRU) model for traffic flow prediction,which can simplify the computational complexity and cost while maintaining accuracy.

However, GRU can only treat a time series as a unit during the computation, and cannot effectively handle long series or pay attention to the complex relationships existing in the time series.Therefore,GRU may not achieve the best results when dealing with sparse time series of traffic flow features.The attention mechanism is a technique widely used in deep learning to improve the performance of neural networks for processing time-series data.This mechanism helps the neural network to dynamically select and focus on the relevant parts of the input time series to extract useful information while reducing the reliance on irrelevant series, here we use GRUA to denote the GRU model with attention mechanism.Therefore, in this part, to improve the accuracy of traffic flow prediction, the attention mechanism is further utilized to focus on the traffic flow features and to form CNN-GRUA composite model for traffic flow prediction.

2.3 Univariate time series pattern mining

In the previous section, we used three traffic flow features weather, traffic demand and strategy for traffic flow prediction.However, the actual traffic flow itself is a time series with a strong cyclical trend.Therefore, in the univariate time series pattern mining part, we will study the traffic flow time series itself and mine the variation patterns and timedependent structures existing in it to further predict the dynamic changes of the traffic flow series.In order to analyze and predict a time series, we should first segment and represent it effectively to retain the maximum amount of useful information in the time series.This is the general step of time series mining.To this purpose, a time series BOP representation based on trend segmentation symbolization, TSSBOP, is proposed in this paper to segment, represent and extract patterns from traffic flow time series.The process of TSSBOP is presented in the following sections.

2.3.1 Segmentation based on trend and sliding window

In order to preserve the trend information of the time series as much as possible in the segmentation process, we propose a dichotomous iterative breakpoint search method, as shown in Fig.2, whereSx(x=1,i,j,m,q,n,k) are points of the time series,Di(i=j,m) are the distances betweenSxand the line joining the start and end points, the black dashed line represents the original series and the red solid line represents the obtained segment.

Fig.2 Dichotomous iterative search of breakpoints

First, we find the point with the largest trend change within the time series as the breakpoint, and divide the series into two subseries.By continuously dichotomizing and iterating in subseries, a number of breakpoints are obtained.The whole time series is traversed using the sliding window mechanism, and in each window, the segmentation strategy of breakpoints as the main and average segmentation as the secondary is adopted.ω-1 breakpoints with the most trend information are selected to divide the series intoωsegments.The adopted segmentation strategy makes the number of breakpoints in each subseriesω-1.Let the number of breakpoints in each subseries beγ.Ifγ≥ω-1,ω-1 breakpoints with the most trend information are selected from them to makeωsegments.Ifγ<ω-1, the segments with the series length in the first (ω-1-γ) are bisected to formωsegments.Ifthenωsegments are divided directly by average length, wheremeans rounding down to (ω-1)/2.

Therefore, when there are few breakpoints,the overall segmentation process approximates the average segmentation and when there are enough breakpoints, the segmentation is performed by the iterative dichotomous trend segmentation.It can be seen that the segmentation strategy balances effectiveness and efficiency.

2.3.2 Symbolic representation based on trend and mean

After segmenting, we need to represent the segments in another way.In this paper, we design a symbolic representation based on trend and mean of a segment, so that each segment can be represented by two symbols, the trend and the mean, respectively.

For the part of trend, we directly use the slope value of each segment, which takes a range of(-∞,+∞).This range can be divided into five non-overlapping regions and each region corresponds to a trend level and a capital letter, as shown in Table 1.

Table 1 Symbolic representation of trends

For the part of mean value, it simply describes the distribution of the values in a segment.For a given time seriesTand a subseriesx=[ti,…,t(i+l-1)]inT, then the mean value ofxcan be calculated as follows.

Since the normalized time series approximates a Gaussian distribution, it can be divided into multiple equal probability intervals, as shown in Table 2,whereαis the number of breakpoints andβthe interval boundary value of a Gaussian distribution.The values in the same interval are represented by the same symbol, so that the means of all the segments are mapped to different symbols according to the Gaussian distribution, as shown in Table 3.

Table 2 Breakpoints for equal probability intervals of a Gaussian distribution

Table 3 Symbolic representation of means

For each segment, it will finally be represented as multiple groups of words containing an uppercase letter and a lowercase letter in it.In this way, an original time series is symbolized as a string, retaining its key trend information and value distribution information, while the length has been greatly reduced.

2.3.3 Bagging of pattern

After symbolization, we can construct pattern“word lists” for the entire time series, where each word is a pattern.For example, for a symbolized time seriesS={AaBaAc,AaBaAc,BcDaEb,DcDaEb,DcDaEb,DcDaEb,BcEaEb,AcDaEd},by using digitally reduced counting,Swill be reduced toS' ={AaBaAc,BcDaEb,DcDaEb,BcEaEb,AcDaEd}.Then, we construct the word list for each series as a histogram representation in the form:H:AaBbAc=2,BaDaEb=1,DcDaEb=3,BcEaEb=1,AcDaEd=1.For a given segmentSi, it is likely to be very similar to its neighboring subseriesSi-1andSi+1, especially ifSiis in the smoothed region of the original time series.In this case, we may see many consecutive subseries mapping to the same word.To avoid storing these same words repeatedly, a counting method called number reduction is usually used.Specifically, for consecutive occurrences of the same word, only the first occurrence of the word is chosen to be recorded until a different word is encountered.The purpose of this method is to find out the patterns that occur in the series and their number of occurrences to make a histogram representation.After obtaining the histogram of each time series in the time series dataset,we constructed the word matrixwhereφ= 5,αdenotes the number of letters,ωthe number of segments, andsthe number of time series.Although the final number of matrices formed is huge, this matrix is sparse and most of the words are not mapped, so we can remove all the rows with value being of 0 to form the matrixM'[·] [s], thus ensuring that each word inside will appear once.

When we complete the above three steps, the various patterns present in an original time series are mined, which greatly reduces the computational complexity of subsequent time series prediction in terms of time and space while preserving the key information of the series.

2.3.4 Univariate time series predicting

To facilitate the input of the subsequent model, we first convert the obtained word matrix into an embedding vector encoding using the Word2Vec model.Word2Vec is a commonly used word embedding technique that maps words to a high-dimensional vector space and brings semantically similar words closer together in the vector space.Then, the vectors are fed into GRU model for prediction and the prediction results are converted into word matrix by the inverse operation of the Word2Vec model.Further, the word matrix of prediction results is converted into predicted values of the same form as the original series by the inverse operation of the TSSBOP method to understand and analyze the prediction results intuitively.

2.4 Weighted fusion outputting

In order to effectively fuse the two predictions from the multivariate time series prediction model and the univariate time series prediction model, in this section, we propose a weighted fusion method based on the performance of the two models on the validation set.The data we used in this paper is the actual traffic data from Guangzhou terminal area.From the data, we can find the convective weather has significantly impact on the traffic flow but it does not frequently occur.The overall regularity of the data will make the prediction results relatively stable.Therefore, to capture the diversity in predictive accuracy as effectively as possible, we choose mean square error (MSE), which has a wider range of values, as the evaluation indicator.Meanwhile,in order to reduce the impact of traffic flow outliers caused by severe convective weather on prediction performance, we choose mean absolute error(MAE), which has a lower penalty for outliers as a complementary indicator.In summary, based on prior knowledge of actual traffic data from the terminal area, we use the combination of MSE and MAE for calculating the weights in the fusion process for the final predictions.

Denoting MSE and MAE of the multivariate model as MSE1and MAE1, and those of the univariate model as MSE2and MAE2, the specific weight assignment formula is shown as

Finally, we can get the final predicted value of traffic flow as

whereY1andY2are the prediction results of the multivariate time series prediction model and the univariate time series prediction model, respectively.

2.5 MTSPM

The main flow of the proposed MTSPM model is shown in Algorithm 1.

Algorithm 1MTSPM

Input:Dataset TS, traffic flow TSi,epoch;

Output:PredictionY; // Multivariate time series fusing

(1) Initialize the hyper parameters of CNN, time step, hyper parameters of GRU, network weight,queryQ, keyKand value’s weight matricesV.

(2) For each epoch ∈{1,2,…,N} do

(3) Input dataset TS to multivariate TS prediction model.

(4) Call CNN to extract the multivariate features and save toOl.

(5) Call GRU to capture contextual information about a series and output hidden state seriesZl.

(6) Call attention mechanism to learn weight assignments and output predicted valueY2// Univariate time series pattern mining.

(7) Input traffic flow TSito univariate TS prediction model.

(8) For each sub in TSi:

(9) Call segmentation to calculate trend points and save to TP.

(10) Call symbolization of TP for symbolic representation.

(11) Feature pattern=TSSBOP(TP)

(12) Call Word2Vec to encode Pattern and save toX(13) Call GRU to predict and save toX'

(14)Y'1=Conversion(X')

(15)Y1=Reflexion(Y'1)// Weighted fusion outputting

(16)Y=W1*Y1+W2*Y2

3 Experiments and Analysis

3.1 Experimental setup

3.1.1 Design of experiments

In this section, the performance of the proposed MTSPM model is verified through several comparative experiments.The experiments are divided into four parts.Firstly, the ablation experiment is carried out to verify the effectiveness of the weighted fusion of the two models.Secondly, comparative experiments of five time series feature representations are carried out to verify the performance of the TSSBOP method.Then, the comparative experiment of seven deep learning models is carried out to verify the performance of CNNGRUA model.All of the above experiments are conducted to predict traffic flow in the terminal area under convective weather scenarios.The fourth comparative experiment focuses on the performance of the proposed MTSPM model under no convective weather scenario.Here, a convective weather scenario is one in which weather phenomena such as thunderstorms, rainfall or strong winds occur.A no convective weather scenario is a scenario without these weather phenomena.

3.1.2 Implementation

All experiments are run on the Guangzhou terminal area traffic flow dataset, where the experiments in Sections 3.2, 3.3, and 3.4 are carried out on the traffic flow dataset with convective weather scenarios, while the experiments in Section 3.5 are carried out on the traffic flow dataset without convective weather scenarios.The dataset has 23 different features of weather, traffic demand, and strategy.The time series of each feature has 7 344 timestamps, each timestamp is 1 h.Since in the pre-tactical stage, the lead time of flow control release is in hours, 1 h flow prediction is more relevant to the actual meaning.We use 80% of the data for training,10% for validation, and 10% for testing.We select five indicators, including MSE, MAE, root mean square error (RMSE), mean absolute percentage error (MAPE), andR2(Coefficient of determination), to evaluate the performance of the prediction models.

3.2 Ablation experiments

To simplify the representation, here we refer to the multivariate time series prediction model as Model 1 and the univariate time series prediction model as Model 2.To verify the validity of the weighted fusion of the prediction results of the two models, we compared the prediction results of Model 1, Model 2 and our model MTSPM (Model 1+Model 2), as shown in Table 4.

Table 4 Ablation experiment results

From Table 4, it can be seen that MTSPM model has the lowest MAE, MAPE, MSE,RMSE, and the highestR2compared to Model 1 and Model 2, which means MTSPM has the best performance.Fig.3 shows the traffic flow predictions of three consecutive days.It can be seen from Fig.3 that the prediction of our MTSPM model is closer to the actual traffic flow, which proves that the weighted fusion model MTSPM can greatly improve the prediction accuracy by taking into account the influence of multiple features on traffic flow and the trend pattern of the traffic flow time series itself.

Fig.3 Ablation experiment results of consecutive days

3.3 Performance verification of TSSBOP

To verify the effectiveness of the proposed TSSBOP method, we compared it with other four time series representation methods, including SAX, ESAX, TSAX, and TrSAX, in Model 1,while using the same Model 2 and weighted fusing model.The experimental results are shown in Table 5.

Table 5 Comparison of five representation methods

From Table 5, we can see that although the improvement of TSSBOP on MAPE andR2is relatively small, there is a significant improvement on the remaining three indicators.Compared with other methods, TSSBOP can significantly downscale the features at the data level, so TSSBOP has the best representation ability.We present the predictions of three consecutive days, using five different representation methods in Fig.4.As shown in Fig.4, the predictions using TSSBOP method is closer to the actual flow than the other four representation methods, which proves that the proposed TSSBOP method can extract the features of the traffic flow series in terms of both trend and mean, and use the word matrix to mine the patterns existing in the time series, thus effectively improving the prediction performance.

Fig.4 Prediction results of traffic flow by five representation methods

3.4 Performance verification of CNN-GRUA

To verify the effectiveness of Model 1 proposed in this paper, we compared the traffic flow prediction results of CNN-GRUA model with other six deep learning models, including CNN, LSTM,GRU, CNN-LSTM, CNN-GRU, and CNN-LSTMA, while using the same Model 2 and weighted fusing model.The experimental results are shown in Table 6.

Table 6 Comparison of seven deep learning models

From Table 6, it can be seen that CNNGRUA model has the lowest MAE, MAPE,MSE and RMSE and the highestR2compared with the other six prediction models, which indicates that CNN-GRUA model has achieved the best performance in traffic flow prediction.We also present the predictions of three consecutive days, using seven different models in Fig.5.As shown in Fig.5,the prediction of CNN-GRUA model is closer to the actual traffic flow than the other six models,which proves that the proposed model for traffic flow prediction improve the performance by considering the long-term dependence and paying attention to traffic flow features during the deep learning process.

Fig.5 Prediction results of traffic flow by seven deep learning models

3.5 Prediction under no convective weather scenarios

The above experiments are all carried out on the terminal area traffic data under convective weather scenarios.In order to verify the performance of the proposed MTSPM model under no convective weather scenarios, we constructed the traffic dataset of Guangzhou terminal without convective weather, and carried out two comparative experiments.The experimental results are shown in Table 7 and Table 8.From Table 7 and Table 8, it can be seenthat the prediction performances of all the representation methods and the deep learning models on the dataset without convective weather are better than those under convective weather scenarios, both in MAE, MAPE, MSE, RMSE andR2.This suggests that convective weather has an important impact on traffic flow in the terminal area.In a smooth weather scenario, the terminal area traffic flow has a stronger periodic pattern and thus is more easily predicted accurately.

Table 7 Comparison of five representation methods without convective weather

Table 8 Comparison of seven deep learning models without convective weather

We choose three consecutive days to demonstrate the prediction results of five different representation methods and seven different deep learning models, as shown in Fig.6 and Fig.7.It can still be concluded that the prediction results of the proposed TSSBOP method for pattern mining are closer to the real traffic flow for different time series representation methods.For different deep learning models, the prediction results of CNN-GRUA model are closer to the real traffic flow than the other models.

Fig.6 Prediction results of traffic flow by five representation methods without convective weather

Fig.7 Prediction results of traffic flow by seven deep learning models without convective weather

4 Conclusions

To improve the accuracy of traffic flow prediction in terminal areas under different weather scenarios, this paper proposes a traffic flow prediction model based on MTSPM model.The model analyzes traffic flow in the terminal area from two perspectives: Multivariate time series and univariate time series.To improve accuracy and reduce computational complexity, GRU model is used to incorporate fused multivariate time series features.Additionally, the attention mechanism is introduced to focus on key traffic flow features and reduce reliance on irrelevant information.From the univariate time series perspective, this paper explores the mining of feature patterns within traffic flow series to improve prediction accuracy.A BOP representation method of time series based on trend segmentation symbolization is proposed to identify trend points in each subseries and discretizes the subseries into binary representations.Then, the discretized subseries is counted into a word matrix, and each word represents a feature pattern in the series.The experimental results on the historical dataset of the Guangzhou terminal area demonstrate the effectiveness of the proposed representation in mining feature patterns.Moreover, the multivariate time-series-based traffic flow prediction model for the terminal area significantly improves the accuracy of traffic flow prediction under different weather scenarios.