COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

10
COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map Inference Arian Prabowo RMIT University, DATA61/CSIRO Melbourne, Australia [email protected] Piotr Koniusz DATA61/CSIRO, ANU Canberra, Australia [email protected] Wei Shao RMIT University Melbourne, Australia [email protected] Flora D. Salim RMIT University Melbourne, Australia [email protected] ABSTRACT The process of automatic generation of a road map from GPS trajec- tories, called map inference, remains a challenging task to perform on a geospatial data from a variety of domains as the majority of existing studies focus on road maps in cities. Inherently, existing algorithms are not guaranteed to work on unusual geospatial sites, such as an airport tarmac, pedestrianized paths and shortcuts, or animal migration routes, etc. Moreover, deep learning has not been explored well enough for such tasks. This paper introduces COLTRANE, ConvolutiOnaL TRAjectory NEtwork, a novel deep map inference framework which operates on GPS trajectories collected in various environments. This framework includes an Iterated Trajectory Mean Shift (ITMS) module to local- ize road centerlines, which copes with noisy GPS data points. Convo- lutional Neural Network trained on our novel trajectory descriptor is then introduced into our framework to detect and accurately classify junctions for refinement of the road maps. COLTRANE yields up to 37% improvement in F1 scores over existing methods on two distinct real-world datasets: city roads and airport tarmac. CCS CONCEPTS Information systems Spatial-temporal systems; Ap- plied computing Cartography. KEYWORDS Map Inference, GPS, Road Network, Spatial, Trajectory, Airport ACM Reference Format: Arian Prabowo, Piotr Koniusz, Wei Shao, and Flora D. Salim. 2019. COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map Inference. In The 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys ’19), November 13–14, 2019, New York, NY, USA. ACM, New York, NY, USA, 10 pages. https://doi.org/10.1145/3360322. 3360853 Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]. BuildSys ’19, November 13–14, 2019, New York, NY, USA © 2019 Association for Computing Machinery. ACM ISBN 978-1-4503-7005-9/19/11. https://doi.org/10.1145/3360322.3360853 1 INTRODUCTION Traditionally, building a digital map includes digitizing current paper maps, having surveyors to visit the grounds and manually edit the map, and using the aerial photography [54]. However, the combination of traditional methods can be costly and the physical access to sites may be restricted. Roads often suffer from conges- tion and downtime due to maintenance or ongoing constructions, making frequent updates by classical approaches costly. Recently, the number of datasets which consist of GPS datapoints has been growing [14, 41, 46]. Availability of such a data presents researchers with an opportunity to design new algorithms for map inference from the GPS data. Approach [21] calls this process as ‘data recycling’ while [7] calls it as map inference. With algorithms for high quality map inference, obtaining accurate digital maps and their updates becomes viable and cost-effective. The process of map inference poses a number of challenges. The GPS signal is noisy, especially in urban areas [8], making extraction of road segments, and the detection of junctions, a non-trivial pur- suit. The datasets are often unbalanced as some roads are travelled frequently while other roads (e.g., rural) are not. Thus, in some cases, it may be hard to determine if a collection of datapoints represent spurious noises or sparsely travelled routes. Further, the map inference is often addressed with a technique specific to a given site or only tested with geospatial data from a particular domain e.g., GPS trajectories from road vehicles or pre- defined route networks. Therefore, standard map-matching tech- niques could be used when the movement data comes from popular well mapped urban areas. However, the map inference becomes dif- ficult when the geospatial trajectory data comes from commercial non-public areas or precincts, such as from airport tarmac areas [45] and parking spaces [47]. Often, in these cases, a reliable up-to- date map is non available and existing map inference methods fail. Below, we familiarize the reader with existing approaches. A seminal paper on map inference [17] uses a modified k-means algorithm to estimate road centerlines. Others followed and im- proved upon their baseline [1, 43, 54]. Recently, approach [13] employed a modified mean shift instead of k-means. Subsequently, many computer vision based approaches, first pioneered by [15] and then followed by [11, 26, 48], convert GPS trajectories to an image, compute a 2D histogram, and use a variety of different image processing tools for post-processing. Following arXiv:1909.11048v1 [cs.CV] 24 Sep 2019

Transcript of COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

Page 1: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

COLTRANE: ConvolutiOnaL TRAjectory NEtwork for DeepMap Inference

Arian PrabowoRMIT University, DATA61/CSIRO

Melbourne, [email protected]

Piotr KoniuszDATA61/CSIRO, ANUCanberra, Australia

[email protected]

Wei ShaoRMIT University

Melbourne, [email protected]

Flora D. SalimRMIT University

Melbourne, [email protected]

ABSTRACTThe process of automatic generation of a road map from GPS trajec-tories, called map inference, remains a challenging task to performon a geospatial data from a variety of domains as the majority ofexisting studies focus on road maps in cities. Inherently, existingalgorithms are not guaranteed to work on unusual geospatial sites,such as an airport tarmac, pedestrianized paths and shortcuts, oranimal migration routes, etc. Moreover, deep learning has not beenexplored well enough for such tasks.

This paper introduces COLTRANE, ConvolutiOnaL TRAjectoryNEtwork, a novel deepmap inference framework which operates onGPS trajectories collected in various environments. This frameworkincludes an Iterated Trajectory Mean Shift (ITMS) module to local-ize road centerlines, which copeswith noisy GPS data points. Convo-lutional Neural Network trained on our novel trajectory descriptoris then introduced into our framework to detect and accuratelyclassify junctions for refinement of the road maps. COLTRANEyields up to 37% improvement in F1 scores over existing methodson two distinct real-world datasets: city roads and airport tarmac.

CCS CONCEPTS• Information systems → Spatial-temporal systems; • Ap-plied computing→ Cartography.

KEYWORDSMap Inference, GPS, Road Network, Spatial, Trajectory, AirportACM Reference Format:Arian Prabowo, Piotr Koniusz,Wei Shao, and Flora D. Salim. 2019. COLTRANE:ConvolutiOnaL TRAjectory NEtwork for Deep Map Inference. In The 6thACM International Conference on Systems for Energy-Efficient Buildings,Cities, and Transportation (BuildSys ’19), November 13–14, 2019, New York, NY,USA. ACM, New York, NY, USA, 10 pages. https://doi.org/10.1145/3360322.3360853

Permission to make digital or hard copies of all or part of this work for personal orclassroom use is granted without fee provided that copies are not made or distributedfor profit or commercial advantage and that copies bear this notice and the full citationon the first page. Copyrights for components of this work owned by others than ACMmust be honored. Abstracting with credit is permitted. To copy otherwise, or republish,to post on servers or to redistribute to lists, requires prior specific permission and/or afee. Request permissions from [email protected] ’19, November 13–14, 2019, New York, NY, USA© 2019 Association for Computing Machinery.ACM ISBN 978-1-4503-7005-9/19/11.https://doi.org/10.1145/3360322.3360853

1 INTRODUCTIONTraditionally, building a digital map includes digitizing currentpaper maps, having surveyors to visit the grounds and manuallyedit the map, and using the aerial photography [54]. However, thecombination of traditional methods can be costly and the physicalaccess to sites may be restricted. Roads often suffer from conges-tion and downtime due to maintenance or ongoing constructions,making frequent updates by classical approaches costly.

Recently, the number of datasets which consist of GPS datapointshas been growing [14, 41, 46]. Availability of such a data presentsresearchers with an opportunity to design new algorithms for mapinference from the GPS data. Approach [21] calls this process as‘data recycling’ while [7] calls it as map inference. With algorithmsfor high quality map inference, obtaining accurate digital maps andtheir updates becomes viable and cost-effective.

The process of map inference poses a number of challenges. TheGPS signal is noisy, especially in urban areas [8], making extractionof road segments, and the detection of junctions, a non-trivial pur-suit. The datasets are often unbalanced as some roads are travelledfrequently while other roads (e.g., rural) are not. Thus, in somecases, it may be hard to determine if a collection of datapointsrepresent spurious noises or sparsely travelled routes.

Further, the map inference is often addressed with a techniquespecific to a given site or only tested with geospatial data from aparticular domain e.g., GPS trajectories from road vehicles or pre-defined route networks. Therefore, standard map-matching tech-niques could be used when the movement data comes from popularwell mapped urban areas. However, the map inference becomes dif-ficult when the geospatial trajectory data comes from commercialnon-public areas or precincts, such as from airport tarmac areas[45] and parking spaces [47]. Often, in these cases, a reliable up-to-date map is non available and existing map inference methods fail.Below, we familiarize the reader with existing approaches.

A seminal paper on map inference [17] uses a modified k-meansalgorithm to estimate road centerlines. Others followed and im-proved upon their baseline [1, 43, 54]. Recently, approach [13]employed a modified mean shift instead of k-means.

Subsequently, many computer vision based approaches, firstpioneered by [15] and then followed by [11, 26, 48], convert GPStrajectories to an image, compute a 2D histogram, and use a varietyof different image processing tools for post-processing. Following

arX

iv:1

909.

1104

8v1

[cs

.CV

] 2

4 Se

p 20

19

Page 2: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

BuildSys ’19, November 13–14, 2019, New York, NY, USA

the above direction, approach [8] combined aspects of existingalgorithms and introduced so-called grey-scale skeletonization thatmodels uncertainty of the road centerlines as gray-scale imagerepresentation (the state of the art until recently). Subsequently,approach by Chen et al. [13] integrated the prior knowledge ofroads into the inference step. Moreover, Chen et al. [13] used ofa modified version of a popular local image descriptors SIFT [36],called Traj-SIFT, which works directly on a directed graph data.However, their algorithm has not been applied to datasets fromcommercial non-public precincts such as airports. We are not awareof any map inference approaches applied to areas which lack a well-established road-network.

Moreover, in the decade of AI celebrating Convolutional NeuralNetworks (CNN) [33], it appears CNNs have not yet been used forthe road map inference despite of their learning ability. In this paper,we introduce COLTRANE, ConvolutiOnaL TRAjectory NEtwork,a novel deep learning framework for the map inference from GPStrajectory, which produces an annotated directed graph.

In COLTRANE, we improve upon an existing variant of meanshift by appropriating it for complex trajectory data. We call itIterated Trajectory Mean Shift Sampling (ITMS), and we use it toapproximate the road centerlines by generating centerline pointswhich are then connected to form a road map, represented by adirected graph. As we treat the centerline points as nodes of thegraph, each node may constitute on a different kind of road junction(or segment). Thus, the number of road lanes coming in/out of itcorrelates with the node degree in the graph. We apply a CNNto predict the degree of each node to infer the road connectivityand we classify each node into a junction type (several kinds) ora straight road segment. We develop novel trajectory descriptorsas the input to the CNN. We evaluate our method on two distinctreal-world city and airport datasets.

In what follows, we detail our contributions, discuss relatedworks and our notations. Next, we discuss challenges of map in-ference as well as the uniqueness of the airport dataset. Then wedescribe our framework COLTRANE. Lastly, we present our resultson two datasets, using visual and quantitative evaluations.

1.1 ContributionsWe propose COLTRANE, a deep learning framework for map infer-ence from trajectories which is generic and adaptable to both cityand airport tarmac environments. COLTRANE contains proposedby us three components:

i. Iterated Traj-Mean Shift (ITMS) algorithm which incorpo-rates the orientation of the motion of GPS points duringtheir clustering process.

ii. Trajectory descriptors a.k.a. features or feature maps, whichcontain counts of GPS coordinates, average of x- and y-directional velocities.

iii. CNN applied by us for the first time to trajectory maps forthe purpose of the junction type classification and inferenceof the degree of centerline points (roads going in/out of ajunction) to aid the process of merging road segments.

We apply COLTRANE to regular datasets collected by road vehi-cles, and a much more complex GPS data collected at the airportfrom aircraft and ground vehicles traversing the airport tarmac. We

outperform a recent algorithm [13] on both datasets. In contrastto the regular road data, airport routes are weakly defined, morenoisy and poorly separated due to proximity of various lanes. Yet,we demonstrate state-of-the-art results on such a challenging data.

2 RELATEDWORKSurveys on the map inference problem a.k.a. map construction, mapgeneration, and map creation, can be found in [2, 7]. There is also amore recent, albeit non-technical, read [20]. In addition, paper [7]also introduced a directed spatial graph evaluation protocol for thepurpose of evaluation of the quality of maps.

2.1 Clustering-based approachesThe seminal paper on map inference [17], further improved by[43], uses a variant of k-means to detect the road centerlines byclustering GPS datapoints that are close to each other according tothe Euclidean distance and heading. A similar approach [54] usedan alternative way to infer road segments that connect the samplepoints. GPS was used in [17] with only 2D coordinates while in [54]a new metric was introduced to find the best adjacent sample point.In [1], the altitude data was used for inference of pit mining maps.More complicated clustering approach [18] applies clustering onvector fields rather then directly on trajectories.

Notably, Chen et al. [13] developed Traj-Meanshift clusteringas a better alternative to k-means, and leveraged prior knowledgeregarding city road network such as the smoothness of local roadsegments. Similarly, [24] exploited the prior knowledge regardingturning restrictions at junctions to detect them. Recently, algorithm[42] employed yet another clustering technique called DBSCAN.

2.2 2D histogram based approachesA common family of approaches to finding the road centerlinescan be summarized by approach [15] which uses a 2D histogramof interpolated GPS datapoints followed by binarization. The roadcenterlines were extracted by using so-called Voronoi graph. Fur-thermore, approach [8] used adaptive thresholding for binarization.

Approaches [11, 48] used morphological operations such as thedilation and closure in place of interpolations followed by so-calledskeletonization in place of the Voronoi algorithm. Furthermore,method [26] clustered neighboring pixels in place of morphologicaloperations, while approach [21] extracted the road centerlines byfitting spline curves. Traj-SIFT approach [13] proposed modifiedSIFT descriptors on trajectories and employed an SVM classifierfor junction detection. We note such descriptors are related to 3Dhuman skeleton descriptors for action recognition [32, 52, 53].

In contrast, we use CNN on feature maps representing trajecto-ries to infer node degrees in the road graph representing junctions.

2.3 Other approachesLess common approaches include [9] which simulates the physicalattraction/repulsion between the GPS points to extract the center-line. More recently, [50, 51] use similar idea and formulate the mapinference as a partial graph matching. Approach [22] puts the focuson identifying missing road segments from the map. Approach[38] uses the strength of mobile phone signal (beside of standardtechniques) to detect bridges and tunnels. Also using the cellular

Page 3: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map Inference BuildSys ’19, November 13–14, 2019, New York, NY, USA

network data, approach [58] is modelling the urban mobility in acity. Approach [3] combined both map and partial curve matchingwhile papers [28, 29] detect junctions and corners prior to formconnections between them. Paper [55] uses Delaunay triangulationand the Voronoi diagram. while approach [57] relies on NaturalLanguage Processing, clustering and 2D histograms. Paper [45]filters out GPS points with low confidence. Finally, for brevity, werefer readers to tutorials on deep learning by Jeff Hinton [23].

Following [51, 51], we use the approach of Chen et al. [13] asthe baseline to compare ourselves to due to conceptual similarities.

To the best of our knowledge, there is no map inference paper us-ing GPS data and deep learning. However, approach [6] performedoutlier detection in bus routes by CNN using GPS data.

3 MOTIVATIONMap inference is applicable to a variety of scenarios, verging fromthe traffic analysis in smart cities and rural areas to wild habitatsand restricted environments e.g., constructing and updating theroute networks of airport aircraft can help air traffic controllersmanage aircraft landings and take-offs. It can also help airporttraffic managers recognise patterns of aircraft movement to reducethe traffic congestion [5, 31] and detect anomalies in the routesof aircraft [40]. Moreover, it can prevent hazardous situations byensuring ground vehicles adhere to secure routes and procedures.

Currently, companies such as Google, Apple and OpenStreetMapprovide digital maps of the road networks of cities and urban areasbut they are excluded from restricted areas. Those companies alsospend tremendous amounts of money and human resources inmanually mapping road networks into digital maps [4] which posesnumerous practical issues. Airport runways and tarmac remainexcluded from manual mapping, however, the Federal AviationAdministrations (FAA) have recorded the GPS trajectory data foreach aircraft at United States airports, which offers an opportunityto generate maps of aircraft ground routes [12].

Many algorithms and frameworks have been proposed to con-struct the road network of cities or urban areas. None of them areapplicable to the aircraft and ground vehicle trajectory data. Theseworks can be roughly grouped into two classes: (i) constructingthe route networks of vehicles or pedestrians, where the route net-works exist on the real-world map, and are represented as roads,highways or trails [19, 34] and (ii) discovering the main trajectoriesfrom the GPS data where no fixed roads, road plans and maps exist[16, 27, 35] e.g., animal routes or pedestrianised shortcuts.

We note there exist practical difficulties for the map inference.The patterns of trajectory data and evaluation metrics differ acrossthe variety of methods for the route network construction. For ex-isting road network reconstruction methods, route networks oftenmatch the existing road networks on the map while for non-existentroad network such a ground truth does not exist. Researchers oftengroup such trails into a couple of main trajectories to establish anetwork of popular routes. The criteria for evaluating such routenetworks are thus somewhat subjective. Compared with traditionalroad network map construction, inferring an airport map using air-craft GPS trajectories is more challenging. Firstly, airport runwaysare different from other sources of GPS data such as taxi or cars.The trajectories of aircraft are more uncertain because the common

roads are much narrower than airport runways, and airport groundvehicles often follow unscripted routes. Secondly, the speeds andheadings of aircraft are more uncertain than those of road vehi-cles due to the traffic control. Thirdly, aircrafts encounter differentuncontrolled situations than city vehicles. Fourthly, airport trafficchanges according to criteria such as weather, scheduling, safetyetc. In summary, requirements for inferring maps of airport/cityroad networks differ which necessitates our investigations.

3.1 DatasetsBelow, we compare the GPS noise and mobility patterns of twodifferent real-world datasets summarized in Table 1. We discoverthat the map inference on airport tarmac poses a unique set ofchallenges absent from the typical city map inference.

Table 1: UIC and FAA datasets.

Description (unit) UIC FAAVolume (MB) 7 138

Number of points 118,364 1,057,688Number of trajectories 889 8,902

Total trajectory length (h) 118 526Total trajectory length (km) 2,867 6,258

Area coverage (km2) 9.4104 10.4791Time span 28 days 24 hours

Start time 2011-04-0115:15:20

2016-07-3114:00:01

End time 2011-04-2919:54:53

2016-08-0113:59:59

3.1.1 UIC. This dataset was collected by the University of Illi-nois at Chicago (UIC) and is available at https://www.cs.uic.edu/bin/view/Bits/Software [8]. It was generated by GPS sensors em-bedded in a fleet of 13 campus shuttle buses. The mobility patternof these buses could be grouped into 2 categories. The first andlarger group consists of 11 buses that traveled regular predefinedroutes, thus displaying routine mobility patterns. The frequency ofservice varied between routes, creating a high disparity of spatialdensity. The other group served chartered trips with non-routinemobility. Thus, this dataset displays a highly regular patterns withonly a few trajectories that deviate from predefined paths [7]. Wenote that localization and mobility datasets are typically noisy. ForGPS data, the sources of error include hardware, tectonic and seis-mic activities, seasonal cycles, and local geography [39]. Since thebuses travel between low, mid, and high rise buildings, this datasetcontains a varying level of GPS noise.

3.1.2 FAA. This is a private dataset collected by the FederalAviation Administration (FAA) of United State Department ofTransportation, made available to us through our industry part-ner [45, 56]. Unlike UIC, this dataset has not been cleansed. Thus,we add an additional data cleaning step. This dataset consists oftrajectories of both airplanes and ground vehicles on the tarmacof Los Angeles Airport (LAX), the 4th busiest airport in the world.It handled 87,534,384 passengers and 2,209,850 tonnes of cargo in

Page 4: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

BuildSys ’19, November 13–14, 2019, New York, NY, USA

year 2018 alone [25]. Due to the busyness of the airport, this datasetis much larger in every aspect (Table 1), despite the fact that thedata collection spans only 24 hours. The trajectories are generatedwith various sensors embedded in airplanes and ground vehicles,as well as by ground sensors in the airport, such as radars. A singleobject can be assigned to multiple trajectories ie., an airplane couldbe assigned to one trajectory during arrival, and a different oneduring departure due to of the change of flight number.

In LAX,most of the start and end points of trajectories are locatedin the runway and apron areas, the latter of which is where gatesand terminals can be found. This airport has 4 runways, 2 at thenorth and 2 at the south [44] which allow east and west approach.In addition, there are 132 gates spread over 9 terminals. Taxiwaysconnect the runways, aprons, and other facilities such as hangars.

3.2 GPS signal and noise

(a) An intersection from the UICdataset (area of low-built buildings).

(b) An intersection from the UICdataset (area of high-built build-ings).

Figure 1: Illustration of the GPS data and the underlyingnoise for low and high-built areas of the UIC dataset. Bothareas have the same dimension of 100 × 100 square meters.The hue is based on the heading (best viewed in color).

Due to the noise from GPS sensors, locating road centerlines isdifficult. Even in areas of low-built buildings, the noise from thesensors is at the same magnitude as the distance between adjacentroads. Thus, GPS locations recorded by the sensors might landoutside of the road or even on the lane with the opposing traffic(see Figure 1(a)). This effect is exacerbated in areas of high-builtbuildings, with errors reaching 50m as shown in Figure 1(b). Worthnoting is the imbalanced nature of the data with the west boundvolume traffic being magnitude larger than the other directions.Thus, locating road centerlines is a hard task for which we havedeveloped Iterated Trajectory Mean Shift (ITMS) in Section 5.1.1.

3.3 Airport spatial complexityAs the FAA dataset only spans 24 hours of data recordings, onlya few of trajectories have a complete spatial overlap of routes,that is, similar start and end points, as well as taking similar pathalong the taxiway while in transit. However, the spatial order ofpossible trajectory paths is still limited as airplanes have to followthe taxiway. Thus, although the mobility pattern within a day ishighly irregular, the routine path travelled is not. This is only truefor the airplane trajectories.

A significant portion of the trajectories are generated by theground vehicles, which deviate from taxiways and take shortcuts,resulting in very long trajectories as shown in the distribution(Figure 2). This makes the dataset irregular, and is one of the mainsources of noise which is unique to airports.

Furthermore, this dataset has a high spatial complexity for anumber of reasons. Firstly, it has a more complex geometry. TheUIC dataset has junctions with degree 3 (T-junctions) and 4 (cross-junctions). In contrast, the junction degrees in the FAA datasetrange between 3 and 6. For a quantitative overall comparison, themean degree for junctions in the FAA dataset is 3.49, which higherthan the value for UIC dataset, which is 3.37. Moreover, this datasethas nearly 7 times more junctions while occupying the similar areaas UIC. Thus, junctions are closer to each other. For a quantitativecomparison, the mean pairwise nearest junction distance for theairport is 33.7 m, which is much lower than 153 m for regular roads.The distance for the above analysis was computed as follows: foreach junction, we find its nearest neighbor, we compute the distance,and we average over all possible such pairs.

All of these factors combined highlight the complexity of thedataset e.g., reflected by the higher standard deviations of nearlyall of the attributes 2, as well as a greater variation in the trajectorylength (Fig. 2). In particular, the take-off and landing trajectorysegments correspond to very high speeds, giving a positive skewto the distribution of speed and spatial distance between pointswithin a trajectory.

4 DEFINITION AND PROBLEM STATEMENTIn what follows, we define a GPS point p ∈ R5 as a 5 dimensionalvector consisting of a latitude, longitude, speed, heading and a times-tamp. The timestamp is in the UNIX time format, which is thenumber of seconds elapsed since the midnight of 1 January 1970.

P(H ) is a set of H points p so that P(H ) ≡ {ph}Hh=1. A trajectoryT(I ) is an ordered sequence of I points p so that T(I ) ≡ {pi}Ii=1. Aset of trajectories T is a set of J trajectory sequences T such thatT ≡ {T(Ij)

j } Jj=1. As our notation suggests, a set of trajectories Tmay consist of trajectories of different lengths indicated by Ij .

In this paper, we are reducing a road into a sequence of linesegments (null thickness), defined by centerline points c ∈ C whichrepresent the road centerline. A road map M is represented as anannotated and directed graph with annotated centerline points Cand edges e ∈ E (from,to,weight) so thatM ≡ (C, E). In Table 3, we

Table 2: Details of UIC and FAA datasets.

Attribute (unit) UIC FAAJunction degrees 3,4 3,4,5,6,7

Number of Junctions 41 285

Speed (m/s) 8.9462± 3.5758

10.9225± 13.3174

Distance between pointsin a trajectory (m)

24.4113± 3.3066

19.3622± 16.8242

Distance between pointsin a trajectory (s)

3.6171± 3.6758

5.8594± 259.2738

Page 5: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map Inference BuildSys ’19, November 13–14, 2019, New York, NY, USA

(a) Histogram of trajectory lengths for the UIC dataset.

(b) Histogram of trajectory lengths for the FAA dataset.

Figure 2: Histograms of trajectory lengths for both datasets.

Table 3: Notations for various attributes we access.

Notation Descriptionp(x); c(x) coordinates (2D)p(h); c(h) heading (1D)p(s); c(s) speed (1D)

p(t ) timestamp (1D)p(a) altitude (1D)p(κ) is covered (boolean)c(w ) weight (1D)c(d ) degree (1D)c(d ) degree upper bound (1D)c(l ) label (1D)

|P |; |T |; |C|; |M|; |E | set cardinality| |a − b| |2 where a, b are vectors euclidean distance

α(p, q) heading difference(modulo operator)

defined notations to access specific data attributes from the objectswe have previously defined. The table also includes ‘intermediate’attributes required by our algorithm, such as covered and weight.All headings and angles are in radians.

Our problem can be formalized as (i) inferring the road map M(e.g., Fig. 3(b)) from the set of trajectories T (e.g., Fig. 3(a)), and (ii)detecting junctions to assign one of the 5 possible labels for everycenterline point c(l )∀c ∈ C , where c(l ) can take on labels such asnot-a-junction, Y or T junction, and cross or star intersection.

5 METHODOLOGYIn what follows, we explain our COLTRANE approach. We proceedby constructing an intermediate road map M ′ from the set oftrajectories T generated by the sensors.

As M ′ contains many false positive edges, it is not trivial toremove them. The difficulty of this step is shown in Figure 1(a).All of the east-to-west traffic joining the intersection is directedsouth bound, while all the east-to-west traffic past the intersectionis coming from the the north-to-south traffic. A less sophisticatedalgorithm would mistakenly form an east-to-west edge.

In order to trim false positives, we develop a novel trajectorydescriptor Φ, which we use as an input to a Convolutional NeuralNetwork (CNN), a highly discriminative model which aids complexedge pruning. The output of the edge pruning module forms thefinal route mapM. Figure 4 gives an overview of our pipeline.

5.1 Intermediate Map ConstructionThe process of generating the intermediate mapM ′ can be dividedinto two steps: (i) extracting centerline points c ∈ C and (ii) in-ferring the edges e ∈ E that connect those points. The centerlinepoints c ∈ C are extracted using ITMS. Then, our algorithm uses tra-jectories T to infer directional links between the centerline pointsc ∈ C to form a set of edges E. As the false positive edges will bepruned in the next step, our algorithm forms all the plausible edgesfirst. Centerline points and directed edges form a directed graphrepresenting the intermediate mapM ′ = (C, E).

5.1.1 Iterated Trajectory Mean Shift Sampling (ITMS). Traj-Mean Shift, proposed in [13], locates road centerlines. It buildson the mean shift clustering algorithm by shifting cluster centerstowards the mean location of their neighbouring points. The twomain properties of Traj-Mean Shift are: (i) cluster centers shift alongan axis perpendicular to the heading, and (ii) the histogram takesheading and speed into account. Since Traj-Mean Shift locates roadcenterlines, we will refer to the cluster centers as centerline points.

The algorithm proceeds as follows. Firstly, all points are set asuncovered ie., p(κ) = 0,∀p ∈ P. Then, a random point is chosen asan uncovered point p ∈ P : p(κ) = 0 that is used to initialize thecenterline point by setting c(x,s,h)=p(x,s,h). Then, neighbors N ofthe centerline point c ∈ C, defined asN ≡ {n ∈ P : | |c − n| |2 ≤ γ }

(a) Input: trajectories from sensors (b) Output: Road Map

Figure 3: Our COLTRANE takes trajectories generated fromsensors 3(a) and produces the map 3(b).

Page 6: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

BuildSys ’19, November 13–14, 2019, New York, NY, USA

GPS points

CenterlinePoints

IntermediateMap

ITMS Edges InferenceIntermediate Map Construction

Deep Edge Pruning

Final Map

Trajectory Features TrajectoryDescriptor

Figure 4: Our pipeline. Firstly, we generate an intermediate roadmap by clusteringGPS trajectories using ITMS to approximatethe centerline points and we connect them. In parallel, we construct a trajectory descriptor by merging trajectory featuresinferred from GPS datapoints. We then use the trajectory descriptor as an input to CNN to determine the degree and junctiontype for each centerline point. After pruning, we obtain the road map.

are formed (γ is a constant). Neighbours n ∈ N are projected ontoa weighted histogram that is centered at c(x ) and perpendicular toc(h). The weights for the GPS point p ∈ P in the histogram for thecenterline point c ∈ C are set as follows:

w(c,n;N) =exp

(−α2 (c,n)/2σ 2

h)

exp(−(c(s)−n(s)

)2/2σ 2s)∑

n′∈Nexp

(−α2 (c,n′)/2σ 2

h)

exp(−(c(s)−n′(s)

)2/2σ 2s) ,

(1)where σh and σs are RBF radii. The centerline point c ∈ C is thenshifted to the coordinate of the histogram bin with the largestvalue, denoted as c′ from here on, and assigned a weight equal tothe number of its new neighbours c′(w ) = |N ′ | : N ′ ≡ {n′ ∈ P :| |c′ − n′ | |2 ≤ γ }. Finally, all new neighbours are set as coveredn′

(κ)=1,∀n′ ∈ N ′.

(a) Traj-Mean Shift [13] (b) ITMS

Figure 5: An illustrated comparison of Traj-meanshift andITMS (best viewed in color). Green arrows are the initialpoints and red arrows are the final centerline points. Thethick black line is the axis of the histogram (see Section5.1.1). Black arrows are the neighbours, while gray arrowsare not. Blue arrows are the intermediate centerline points.Magenta arrows are the neighbours that fall into the meanbin. See Algorithm 1 for more details.

Figure 5(a) illustrates that Traj-Mean Shift [13] fails to correctlydetermine the location and heading of the road centerline if theinitial point p ∈ P is too far from the actual road centerline. Notethat the final centerline point has the wrong heading as well. This isproblematic for more complex/noisy GPS data e.g., from the airporttarmac as shown in Section 3.2.

In Algorithm 1, to address this shortcoming, we propose IteratedTrajectory Mean Shift (ITMS), an extension of Traj-Meanshift withimprovements described below.

Firstly, the sample point c ∈ C is shifted to the mean_bin (ifnon-empty) of the weighted histogram rather than the bin withthe maximum value. Otherwise, the bin with maximum value ischosen (note ‘*’ in Algorithm 1). Secondly, to approximate wellthe speed and heading of c, we adjust them by taking into accountthe speed and heading of other points in the selected bin: c(s) =

1|U |

∑u∈U u(s) ·υ(c, u), c(h) =

1|U |

∑u∈U u(h) ·υ(c, u),whereU ≡

{u ∈ N : u ∈ mean_bin}. Our weighting is defined as follows:

υ(c, u) = 1 −α(c(h), u(h))

π. (2)

Thirdly, only neighbors of p, n, and all intermediate n whose head-ing is close to their respective points, are considered as processed(black in Figure 5(b)) while neighbors with different headings arenot considered processed (gray in Figure 5(b)). Finally, Algorithm 1iterates the shifts untilm≤ϵ and the set cardinality |C′ | does notchange. We set the hyperparameters as follows: σh = 1.7, σs = 13,ϵ = 1.5m, γ = 15m, η = 0.3π , and β = 50 .

5.1.2 Edge Inference. This module infers the intermediate setof directed edges E ′ between the centerline points from set Cbased on the trajectories T . Because we use a powerful learner toprune the false positive edges in the next step, we form all plausibleedges in this step. Since the the centerline points c ∈ C wereobtained through clustering, we assume that every GPS point in Pcorresponds to some cluster in C. Thus, we map every single GPSpoint in P to a centerline point in C by finding the centerline pointc that minimizes expression s(c, p)= ∥c − p∥2 · α(c, p). Specifically,

Page 7: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map Inference BuildSys ’19, November 13–14, 2019, New York, NY, USA

input :P - a set of GPS pointsoutput :C - a set of centerline pointsset converдed = False;while ¬ converged do

set p(κ) = 0 : ∀p ∈ P; set C′ = ∅ ;// initialization

while ¬p(κ) = 1,∀p ∈ P do // while p ∈ P are

not covered// pick a random point and

initialize with it the roadcenterline point

pick randomly p ∈ P : p(κ) = 0, set c(x,s,h) = p(x,s,h),setm = ∞;

whilem > ϵ do // while movement of c > ϵ// set similar neighboring points

as coveredset n(κ) = 1,∀n ∈ N ≡ n ∈ P : | |c − n| |2 ≤ γ

∧ α(c,n) < η;// find the meanconstruct histogram at c and perpendicular to c(h)with β bins;project n to histogram weighted according tow(c,n;N),∀n ∈ N ;set mean_bin = the bin containing the mean valueof histogram*;set U ≡ {u ∈ N : u ∈ mean_bin};// shift the road centerline point

n to the mean

setm = | |mean_bin − c| |2; // how far c has

movedset c(x) = mean_bin(x);set c(s) =

1|U |

∑u∈U u(s) · υ(c, u);

set c(h) =1

|U |∑

u∈U u(h) · υ(c, u);// set similar neighboring points

as covered after the shiftsset n(κ) = 1,∀n ∈ N ≡ n ∈ P : | |c − n| |2 ≤ γ

∧ α(c,n) < η;endset C′ = C′ ∪ {c}; // append c to C′

endif |P | = |C′ | then set converдed = True else setP = C′ ;

endset C = C′;

Algorithm 1: Iterated Trajectory Mean Shift Sampling (ITMS).

for every pair of adjacent GPS points within a trajectory (pi, pi+1) :pi, pi+1 ∈ T ∈ T , we form a directed edge with the weight of 1,e= (cj, ck, 1), where cj=argmin

c∈Cs(c, pi ) and ck=argmin

c∈Cs(c, pi+1).

If cj=ck, no edge is made. If an edge e= (cj, ck,w) with weightwalready exist, we increment the weight of that edge by 1, that ise = (cj, ck,w + 1). The set of all intermediate edges E ′ combinedwith the centerline points C from the previous section form theintermediate road mapM ′.

Figure 6: The architecture of our CNN consist of four con-volutional units and a fully connected unit followed by theoutput layer. The convolutional unit consist of a convolu-tional layer, a Rectified Linear Unit (ReLU) activation func-tion [37], maxpooling and dropout.

5.2 Trajectory Features and Traj. DescriptorBelow, we introduce our novel trajectory features extracted fromP that capture spatial and velocity information. When combined,these trajectory features form the trajectory descriptor Φ, whichis an input to the CNN (described in the next section) which isable to learn spatial relationships between datapoints. Althoughtrajectories are sequences of GPS points, spatial information is moreimportant for the map inference than the the sequential ordering ofthe GPS points. As CNN is a natural choice for feature map inputs,we design our trajectory descriptor to be a feature map.

The trajectory features are binned into weighted 2D histogramsto form a feature map (array). The dimension of each bin corre-sponds to 1m×1m area. We then combine the histograms into adescriptor Φ (an image/feature map with three channels).

The first trajectory feature is simply a binarized 2D histogram[15]. For each bin in the histogram, it set the value equal 1 if thereis at least one GPS point that falls in it. Otherwise, the bin is setto 0. Then, we extrapolate the speed and heading of each GPSpoint p ∈ P and compute the x− and y−directional velocitiesfor each p ∈ P respectively. For the second and third trajectorymeans, each histogram bin is computed by aggregating over x−and y−directional velocities of p that fall into that bin, respectively.These last two channels are normalized within the [0; 1] range.

5.3 Deep Edge PruningThe intermediate map from Section 5.1 contains false positive edges.Thus, we propose a powerful CNN framework to prune edges.

Firstly, the trajectory descriptor Φ described in Section 5.2 isthe input to our CNN which simultaneously performs the junctiondetection and classification to determines the degree for each cen-terline point cd̄ : c ∈ C. Finally, for each centerline point c ∈ C, weprune the edges e ∈ E ′ in the intermediate mapM ′based on theweight of edges. The pruned map is the final inferred road map M.

5.3.1 Junction Detection and Classification using CNN. Foreach c ∈ C, we extract a patch from the trajectory descriptor Φ,centered at c(x ), and we feed it to CNN. The patch size is 100×100bins (×3). The CNN resembles the VGG pipeline [49] due to itssimplicity and success in computer vision tasks. However, we used

Page 8: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

BuildSys ’19, November 13–14, 2019, New York, NY, USA

fewer layers as the most salient features should already be capturedby our trajectory descriptor Φ. Thus, our CNNs consists of fourconvolutional units and a fully connected layer followed by theoutput layer. Convolutional units consist of a convolutional layerwith 32 filters of 3×3 size, the stride is 1×1. We use a Rectified LinearUnit (ReLU) as the activation function [37]. The convolutional layeris followed by a 2× 2 maxpooling and dropout. The fully connectedlayer has 512 filters. Figure 6 shows the architecture of our network.We use the Adam optimizer with parameters taken from the originalpaper [30]. We train the model for 100 epoch, with a batch sizeof 156. The output describes each centerline point by its degree(upper bound) c(d̄ ) and a label c(l ) to detect and classify junctions,as described in Section 4.

5.3.2 Edge Pruning. Based on the degree inferred by the CNN,we prune the false positives from the intermediate edges E ′, leavingthe true positives. For every centerline point c ∈ C, we prune theedges, starting from the edge with a lowest edge weight, until allcenterline points have degrees that are smaller or equal to the upperbound degree, that is c(d ) ≤ c(d̄ ). The resulting set of edges E andthe road centerlines C form the final road mapM = (C, E).

6 RESULTS AND ANALYSISBelow, we compare our COLTRANE to Chen et al. [13]. We describethe datasets we use, our experimental setup and the results.

6.1 Experimental SetupFor the junction detection and classification, we applied oversam-pling to account for the unbalanced classes (e.g., most instancesbelong to road segment classes), 8× data augmentation (left-rightflip and 4 rotations). The models were trained using 70% of the datawhile the remaining 30% was kept for testing [10, 13]. The accuracy,Macro–F1 score, and confusion matrices are evaluated on the testset. We do not fine-tune any hyperparameters of our algorithm. Forthe purpose of visual evaluation in Figure 8, the model was trainedon the training set and predictions were made on the entire dataset.

Since we perform the multiclass classification with C classes, weuse Macro–F1 score defined as:

Macro–F1 =C∑c=1

tpc2tpc + f pc + f nc

,

where tpc , f pc , f nc denote the true positive, false positive, andfalse negative, for class c , respectively.

6.2 Empirical Evaluation MetricFor evaluating the quality of our map inference algorithm, weused a graph matching method called ‘marble and holes’ [7]. Froma starting location, the evaluation algorithm will traverse the in-ferred/ground truth map, dropping a marble/hole at regular intervald , until it is r meters away from the starting point. If the distancebetween a marble and a hole is less than d , they are considered as amatch. Unmatched marbles and holes are considered false positivesand negatives, respectively. Thus, precision, recall and F1 scorescan be calculated. For the UIC dataset, we fixed the sampling rateto r =100m, and varied the matching distance d between 1m and30m [13].

Table 4: Evaluations of junction detection and classification.

Dataset Metric COLTRANE Chen et al. [13]

UIC Accuracy 99.21% 98.24%Macro–F1 0.9921 0.9822

FAA Accuracy 93.73% 92.5%Macro–F1 0.9345 0.9231

6.3 Map InferenceFigure 7 shows that COLTRANE attains the best Macro–F1 resultsacross all matching distances for both datasets. The improvementranges from 27% on the UIC dataset with the matching distance of5 m, to 37%, also on the UIC dataset given the matching distance of25 m. Moreover, these results also confirm our findings in Section3.3 that the FAA dataset (airport data) is spatially more complex,thus posing a challenge for the map inference.

6.3.1 Visual Evaluations. Figure 8 shows that COLTRANE pro-duces a smoother road map compared to the approach of Chenet al. [13]. Moreover, COLTRANE produces fewer spurious edges,which is a visible issue in Chen et al. [13], particularly for areaswith many junctions and urban areas that exacerbated the GPSnoise. Our improvements are attributed to the implementation ofedge pruning via CNN.

6.3.2 ITMS. Below, we analyze the improvements brought byITMS when compared to Traj-Meanshift. To do so, we use the algo-rithm by [13] and only replace the Trajectory-Meanshift by ITMS.The result in Figure 9 shows that although the performance is com-parable in the city environment, ITMS is performing consistentlybetter in the more spatially complex airport environment.

6.4 Junction Detection and ClassificationTable 4 shows similar trend as the previous results. As the airportdata is more spatially complex, it yields lower Macro–F1 scoresfor the case of junction detection and classification. Nevertheless,our proposed COLTRANE framework slightly outperforms theapproach of Chen et al. [13] which is already a strong performer.

10 20 30Matching Distance (m)

0.00.10.20.30.40.50.6

Mac

ro-F

1 sc

ore

COLTRANEChen et.al.

(a) UIC

10 20 30Matching Distance (m)

0.00.10.20.30.40.50.6

Mac

ro-F

1 sc

ore

COLTRANEChen et.al.

(b) FAA

Figure 7: Performance comparison (Macro–F1 score) of themap inference between COLTRANE and the approach ofChen et al. [13] for a varying matching distance [7].

Page 9: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map Inference BuildSys ’19, November 13–14, 2019, New York, NY, USA

(a) COLTRANE on UIC (b) Chen et al. [13] on UIC

(c) COLTRANE on FAA (d) Chen et al. [13] on FAA

Figure 8: Visual evaluations of the inferred map by ourCOLTRANE and the approach by Chen et al. [13]. Thezoomed area onUIC is a high-built area that exacerbated theGPS noise. The zoomed area on FAA contains a very highjunction density. Google map is used as the groundtruth.

10 20 30Matching Distance (m)

0.00

0.05

0.10

0.15

Mac

ro-F

1 sc

ore

ITMSChen et.al.

(a) UIC

10 20 30Matching Distance (m)

0.00

0.05

0.10

0.15

Mac

ro-F

1 sc

ore

ITMSChen et.al.

(b) FAA

Figure 9: Performance comparison (Macro–F1) of the mapinference between ITMS and the approach of Chen et al. [13]for a varying matching distance [7].

7 DISCUSSION AND LIMITATIONSThere exist many ways to improve on COLTRANE. An on-lineversion of our approach would provide a benefit of the real-time

updates. Highlighting dynamic changes could serve a way of anom-aly detection to improve the situational awareness; a useful featurefor both road users, traffic and aviation authorities.

Moreover, our approach paves an avenue for integration with amore sophisticated spatial data. For instance, we could feed aerial orsatellite images as an input to further improve the accuracy of theinferred map. Thus, COLTRANE introduces a novel deep learningapproach to processing increasingly ubiquitous trajectory data andother varieties of the spatio-temporal data.

8 CONCLUSIONSWe have proposed COLTRANE, a novel deep learning frameworkfor the map inference, junction detection and classification; the firstdeep learning approach that has been tested in multiple scenariossuch as city road network and airport tarmac. We have evaluatedour approach on two real-world datasets. COLTRANE has outper-formed the approach of [13] by up to 37% according to the accuracyand Macro–F1 scores for both the map inference and junction de-tection/classification tasks. We have also improved upon the roadcenterline localization algorithm Traj-Mean Shift by proposingITMS which is more robust to noisy GPS data. Moreover, we haveintroduced a novel trajectory descriptor for the GPS datapointswhich captures important statistics of GPS datapoints such as oc-currences and directional velocities of GPS datapoints. As resultsshow, utilizing CNN to predict the node degree in the graph/mapconstruction yields significant improvements. The degree predic-tion helps disambiguate the correct and erroneous predictions ofroad segments and junctions.

ACKNOWLEDGMENTSThis research is partially supported by Northrop Grumman Corpo-rations USA, RMIT University. We would like to also acknowledgethe support of the Investigative Analytics team (Data61/CSIRO)and the NVIDIA GPU grant.

REFERENCES[1] Gabriel Agamennoni, Juan I Nieto, and Eduardo M Nebot. 2011. Robust inference

of principal road paths for intelligent transportation systems. IEEE Transactionson Intelligent Transportation Systems 12, 1 (2011), 298–308.

[2] Mahmuda Ahmed, Sophia Karagiorgou, Dieter Pfoser, and Carola Wenk. 2015. Acomparison and evaluation of map construction algorithms using vehicle trackingdata. GeoInformatica 19, 3 (2015), 601–632.

[3] Mahmuda Ahmed and Carola Wenk. 2012. Constructing street networks fromGPS trajectories. In European Symposium on Algorithms.

[4] Heba Aly, Anas Basalamah, andMoustafa Youssef. 2014. Map++: A crowd-sensingsystem for automatic map semantics identification. In IEEE SECON.

[5] Dimitris Bertsimas and Sarah Stock Patterson. 1998. The air traffic flow manage-ment problem with enroute capacities. Operations research 46, 3 (1998), 406–422.

[6] Aline Bessa, Fernando de Mesentier Silva, Rodrigo Frassetto Nogueira, EnricoBertini, and Juliana Freire. 2016. Riobusdata: Outlier detection in bus routes ofRio de Janeiro. arXiv preprint arXiv:1601.06128 (2016).

[7] James Biagioni and Jakob Eriksson. 2012. Inferring road maps from globalpositioning system traces: Survey and comparative evaluation. Transportationresearch record 2291, 1 (2012), 61–71.

[8] James Biagioni and Jakob Eriksson. 2012. Map inference in the face of noise anddisparity. In ACM SIGSPATIAL.

[9] Lili Cao and John Krumm. 2009. From GPS traces to a routable road map. In ACMSIGSPATIAL.

[10] Chih-Chung Chang. 2011. " LIBSVM: a library for support vector machines,"ACM Transactions on Intelligent Systems and Technology, 2: 27: 1–27: 27, 2011.http://www. csie. ntu. edu. tw/˜ cjlin/libsvm 2 (2011).

[11] Chen Chen and Yinhang Cheng. 2008. Roads digital map generation with multi-track GPS data. In IEEE ETT and GRS.

Page 10: COLTRANE: ConvolutiOnaL TRAjectory NEtwork for Deep Map ...

BuildSys ’19, November 13–14, 2019, New York, NY, USA

[12] Chen Chen, Xiaomin Liu, Tie Qiu, and Arun Kumar Sangaiah. 2017. A short-term traffic prediction model in the vehicular cyber–physical systems. FutureGeneration Computer Systems (2017).

[13] Chen Chen, Cewu Lu, Qixing Huang, Qiang Yang, Dimitrios Gunopulos, andLeonidas Guibas. 2016. City-scale map creation and updating using GPS collec-tions. In ACM SIGKDD International Conference on Knowledge Discovery and DataMining. 1465–1474.

[14] Michael O Cruz, Hendrik Macedo, and Adolfo Guimaraes. 2015. Grouping similartrajectories for carpooling purposes. In IEEE BRACIS.

[15] Jonathan J Davies, Alastair R Beresford, and Andy Hopper. 2006. Scalable, dis-tributed, real-time map generation. IEEE Pervasive Computing 5, 4 (2006), 47–54.

[16] Somayeh Dodge, Gil Bohrer, Rolf Weinzierl, Sarah C Davidson, Roland Kays,David Douglas, Sebastian Cruz, Jiawei Han, David Brandes, and Martin Wikelski.2013. The environmental-data automated track annotation (Env-DATA) system:linking animal tracks with environmental data. Movement Ecology 1, 1 (2013), 3.

[17] Stefan Edelkamp and Stefan Schrödl. 2003. Route planning and map inferencewith global positioning traces. In Computer science in perspective. Springer,128–151.

[18] Nivan Ferreira, James T Klosowski, Carlos E Scheidegger, and Cláudio T Silva.2013. Vector Field k-Means: Clustering Trajectories by Fitting Multiple VectorFields. In Computer Graphics Forum. Wiley Online Library.

[19] Jiali Fu, Erik Jenelius, and Haris N Koutsopoulos. 2016. Driving time and pathgeneration for heavy construction sites from GPS traces. In IEEE ITSC.

[20] Xiaoming Gao, Christopher Klaiber, Drishtie Patel, and Jeff Underwood. 2019. AIis supercharging the creation of maps around the world. https://tech.fb.com/ai-is-supercharging-the-creation-of-maps-around-the-world/

[21] Tao Guo, Kazuaki Iwamura, and Masashi Koga. 2007. Towards high accuracyroad maps generation from massive GPS traces data. In IEEE IGARSS.

[22] Songtao He, Favyen Bastani, Sofiane Abbar, Mohammad Alizadeh, Hari Balakr-ishnan, Sanjay Chawla, and Sam Madden. 2018. RoadRunner: improving theprecision of road network inference from GPS trajectories. In ACM SIGSPATIALInternational Conference on Advances in Geographic Information Systems.

[23] Geoffrey Hinton. 2019. Neural Network Tutorials. List of Tutorials, http://www.cs.toronto.edu/~hinton/nntut.html.

[24] Yourong Huang, Zhu Xiao, Xiaoyou Yu, Dong Wang, Vincent Havyarimana, andJing Bai. 2019. Road Network Construction with Complex Intersections Basedon Sparsely Sampled Private Car Trajectory Data. ACM TKDD (2019).

[25] Airport Council International. 2019. Preliminary world airport traffic rankingsreleased. https://aci.aero/news/2019/03/13/preliminary-world-airport-traffic-rankings-released/. Accessed: 2019-06-16.

[26] Sera Jang, Taehwan Kim, and Eunseok Lee. 2010. Map generation system withlightweight GPS trace data. In IEEE ICACT.

[27] Ian D Jonsen, Ransom A Myers, and Joanna Mills Flemming. 2003. Meta-analysisof animal movement using state-space models. Ecology 84, 11 (2003), 3055–3063.

[28] Sophia Karagiorgou and Dieter Pfoser. 2012. On vehicle tracking data-based roadnetwork generation. In ACM SIGSPATIAL.

[29] Sophia Karagiorgou, Dieter Pfoser, and Dimitrios Skoutas. 2013. Segmentation-based road network construction. In ACM SIGSPATIAL.

[30] Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic opti-mization. arXiv preprint arXiv:1412.6980 (2014).

[31] Xiangjie Kong, Zhenzhen Xu, Guojiang Shen, Jinzhong Wang, Qiuyuan Yang,and Benshi Zhang. 2016. Urban traffic congestion estimation and predictionbased on floating car trajectory data. Future Generation Computer Systems 61(2016), 97–107.

[32] Piotr Koniusz, Anoop Cherian, and Fatih Porikli. 2016. Tensor Representationsvia Kernel Linearization for Action Recognition from 3D Skeletons. ECCV (2016).

[33] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classifi-cation with deep convolutional neural networks. In NIPS.

[34] Colin Kuntzsch, Monika Sester, and Claus Brenner. 2016. Generative models forroad network reconstruction. International Journal of Geographical InformationScience 30, 5 (2016), 1012–1039.

[35] Jae-Gil Lee, Jiawei Han, and Kyu-Young Whang. 2007. Trajectory clustering: apartition-and-group framework. In Proceedings of the 2007 ACM SIGMOD inter-national conference on Management of data. ACM, 593–604.

[36] David G Lowe et al. 1999. Object recognition from local scale-invariant features..In ICCV.

[37] Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve restrictedboltzmann machines. In ICML.

[38] Brian Niehoefer, Ralf Burda, Christian Wietfeld, Franziskus Bauer, and OliverLueert. 2009. GPS community map generation for enhanced routing methodsbased on trace-collection by mobile phones. In IEEE First International Conferenceon Advances in Satellite and Space Communications.

[39] Sorin Nistor and Aurelian Stelian Buda. 2016. GPS network noise analysis: a casestudy of data collected over an 18-month period. Journal of Spatial Science 61, 2(2016), 427–440.

[40] MY Pusadan, JL Buliali, and RVH Ginardi. 2017. Anomaly detection of flightroutes through optimal waypoint. In IOP Journal of Physics: Conference Series,Vol. 801. 012041.

[41] Kyle K Qin, Wei Shao, Yongli Ren, Jeffrey Chan, and Flora D Salim. 2019. Solvingmultiple travelling officers problem with population-based optimization algo-rithms. Neural Computing and Applications (2019), 1–27.

[42] Yuya Sasaki, Jiahao Yu, and Yoshiharu Ishikawa. 2019. Road Segment Interpola-tion for Incomplete Road Data. In IEEE BigComp.

[43] Stefan Schroedl, Kiri Wagstaff, Seth Rogers, Pat Langley, and Christopher Wilson.2004. Mining GPS traces for map refinement. Data mining and knowledgeDiscovery 9, 1 (2004), 59–87.

[44] NoiseManagement Section. 2014. Los Angeles International Airport - PreferentialRunway Use Policy.

[45] Wei Shao, Flora D. Salim, Jeffrey Chan, Kai Qin, Jiaman Ma, and Bradley Feest.2019. OnlineAirTrajClus: An Online Aircraft Trajectory Clustering for TarmacSituation Awareness. In IEEE PerCom 2019.

[46] Wei Shao, Flora D. Salim, Tao Gu, Ngoc Thanh Dinh, and Jeffrey Chan. 2018.Traveling Officer Problem: Managing Car Parking Violations Efficiently UsingSensor Data. IEEE Internet of Things Journal 5, 2 (2018), 802–810.

[47] Wei Shao, Flora D Salim, Andy Song, and Athman Bouguettaya. 2016. Clusteringbig spatiotemporal-interval data. IEEE Transactions on Big Data 2, 3 (2016),190–203.

[48] Wenhuan Shi, Shuhan Shen, and Yuncai Liu. 2009. Automatic generation of roadnetwork map from massive GPS, vehicle trajectories. In IEEE ITSC.

[49] Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networksfor large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).

[50] Rade Stanojevic, Sofiane Abbar, Saravanan Thirumuruganathan, Sanjay Chawla,Fethi Filali, and Ahid Aleimat. 2018. Robust road map inference through networkalignment of trajectories. In SIAM International Conference on Data Mining.

[51] Rade Stanojevic, Sofiane Abbar, Saravanan Thirumuruganathan, GianmarcoDe Francisci Morales, Sanjay Chawla, Fethi Filali, and Ahid Aleimat. 2018. Roadnetwork fusion for incremental map updates. In LBS. 91–109.

[52] Yusuf Tas and Piotr Koniusz. 2018. CNN-basedAction Recognition and SupervisedDomain Adaptation on 3DBody Skeletons via Kernel FeatureMaps. BMVC (2018).

[53] Lei Wang, Du Q. Huynh, and Piotr Koniusz. 2019. A Comparative Review ofRecent Kinect-based Action Recognition Algorithms. IEEE Transactions on ImageProcessing 29, 1 (2019), 15–28.

[54] Stewart Worrall and Eduardo Nebot. 2007. Automated process for generatingdigitised maps through GPS data compression. In Australasian Conference onRobotics and Automation.

[55] Wei Yang, Tinghua Ai, and Wei Lu. 2018. A method for extracting road boundaryinformation from crowdsourcing vehicle GPS trajectories. Sensors 18, 4 (2018),1261.

[56] Sichen Zhao, Wei Shao, and Haitao Zhu. 2019. The Intelligent Decision of FlightsAdjusting Rule-based flight scheduling optimisation. In ACM WSDM.

[57] Renjie Zheng, Qin Liu, Weixiong Rao, Mingxuan Yuan, Jia Zeng, and ZhongxiaoJin. 2017. Topic model-based road network inference from massive trajectories.In IEEE MDM.

[58] Zimu Zheng, Feng Wang, Dan Wang, and Liang Zhang. 2018. Buildings affectmobile patterns: developing a new urban mobility model. In ACM BuildSys.