Make It Simple: Effective Road Selection for Small-Scale Map Design Using Decision-Tree-Based Models
Abstract
:1. Introduction
2. Materials and Methods
2.1. Data
2.2. Selection Based on Regulation and Machine Learning
- Gain ratio: In the case of the DT model the criterion on which attributes were selected for splitting was the gain ratio. It is a variant of information gain that adjusts the information gain for each attribute to allow the breadth and uniformity of the attribute values to be captured. The gain ratio parameter calculates the weight of attributes with respect to the label attribute by using the information gain ratio. The higher the weight of an attribute, the more relevant it is considered.
- Minimal split size: Set equal to 4. The size of a node is the number of examples in its subset. The size of the root node is equal to the total number of examples considered. Only those nodes are split whose size is greater than or equal to the minimal split size.
- Minimal leaf size was equal to 4. The size of a leaf node is the number of examples in its subset. The tree is generated in such a way that every leaf node subset has at least the minimal leaf size number of instances.
- Minimal gain: Set equal to 0.1. The gain of a node is calculated before splitting it. The node is split if its gain is greater than the minimal gain. Higher values of the minimal gain result in fewer splits and thus a smaller tree. A too high value will completely prevent splitting and a tree with a single node is generated.
- Minimal depth: Set equal to 20. The depth of a tree varies depending upon size and nature of the examples. This parameter is used to restrict the size of the decision tree.
- Confidence: Set equal to 0.25. This parameter specifies the confidence level used for the pessimistic error calculation of pruning.
- Number of prepruning alternatives: Set equal to 3. This parameter adjusts the number of alternative nodes tried for splitting when a split is prevented by prepruning at a certain node.
- In the case of the DT-GA model, the operator named ‘optimize selection’ was additionally used, invoking a genetic algorithm (GA) to select the most relevant attributes of the given sample set. A GA is a search heuristic that mimics the process of natural evolution, such as inheritance, mutation, selection, and crossover [27].
- For the RF model, further parameters included the number of trees to be generated (set to 100) and the maximal tree depth (set equal to 10). This parameter is used to restrict the depth for each random tree set [27].
3. Results
3.1. Decision Trees
3.2. ML Model Accuracy
3.3. Maps
4. Evaluation
4.1. Qualitative Assessment
4.2. Quantitative Assessment
4.3. Variable Weights
4.4. Correlation Analysis
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- De Serres, B.; Roy, A.G. Flow direction and branching geometry at junctions in Dendritic River Networks. Prof. Geogr. 1990, 42, 149–201. [Google Scholar] [CrossRef]
- Yu, X. Road network simplification with knowledge-based spatial analysis. J. Geogr. Sci. 2001, 11, 54–62. [Google Scholar] [CrossRef]
- Zhang, H.; Li, Z. Weighted ego network for forming hierarchical structure of road networks. Int. J. Geogr. Inf. Sci. 2011, 25, 255–272. [Google Scholar] [CrossRef]
- Weiss, R.; Weibel, R. Road network selection for small-scale maps using an improved centrality-based algorithm. J. Spat. Inf. Sci. 2014, 9, 71–99. [Google Scholar] [CrossRef]
- Benz, S.A.; Weibel, R. Road network selection for medium scales using an extended stroke-mesh combination algorithm. Cartogr. Geogr. Inf. Sci. 2014, 41, 323–339. [Google Scholar] [CrossRef] [Green Version]
- Samsonov, T.E.; Krivosheina, A.M. Joint Generalization of City Points and Road Network for Smallscale Mapping. In GIScience 2012: Seventh International Conference on Geographic Information Science; Columbus, OH, USA, 2012; Available online: https://www.researchgate.net/publication/264829198_Joint_generalization_of_city_points_and_road_network_for_small-scale_mapping (accessed on 5 January 2022).
- Richardson, D.E.; Thomson, R.C. Integrating Thematic, Geometric, and Topological Information in the Generalization of Road Networks. Cartogr. Int. J. Geogr. Inf. Geovisualization 1996, 33, 75–83. [Google Scholar] [CrossRef]
- Jiang, B.; Claramunt, C. A Structural Approach to the Model Generalization of an Urban Street Network. GeoInformatica 2004, 8, 157–171. [Google Scholar] [CrossRef]
- Liu, X.; Zhan, B.; Ai, T. Road selection based on Voronoi diagrams and “strokes” in map generalization. Int. J. Appl. Earth Obs. Geoinf. 2010, 12 (Suppl. 2), 194–202. [Google Scholar] [CrossRef]
- Touya, G. A road network selection process based on data enrichment and structure detection. Trans. GIS 2010, 14, 595–614. [Google Scholar] [CrossRef] [Green Version]
- Mackaness, W.; Beard, K. Use of Graph Theory to Support Map Generalization. Cartogr. Geogr. Inf. Syst. 1993, 20, 210–221. [Google Scholar] [CrossRef]
- Yan, H. Description Approaches and Automated Generalization Algorithms for Groups of Map Objects; Springer: Singapore, 2019. [Google Scholar]
- Karsznia, I.; Weibel, R. Improving Settlement Selection for Small-scale Maps Using Data Enrichment and Machine Learning. Cartogr. Geogr. Inf. Sci. 2018, 45, 111–127. [Google Scholar] [CrossRef] [Green Version]
- Karsznia, I.; Sielicka, K. Exploring essential variables in the settlement selection for small-scale maps using machine learning. In Abstracts of the International Cartographic Association; Fujita, H., Ed.; International Cartographic Association: Tokio, Japan, 2019; Volume 1, p. 162. [Google Scholar] [CrossRef]
- Karsznia, I.; Sielicka, K. When Traditional Selection Fails: How to Improve Settlement Selection for Small-Scale Maps Using Machine Learning. ISPRS Int. J. Geo-Inf. 2020, 9, 230. [Google Scholar] [CrossRef] [Green Version]
- Sester, M.; Feng, Y.; Thiemann, F. Building generalization using deep learning. In The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XLII-4, Proceedings of the 2018 ISPRS TC IV Mid-Term Symposium “3D Spatial Information Science—The Engine of Change”, Delft, The Netherlands, 1–5 October 2018; International Society for Photogrametry and Remote Sensing: Delft, The Netherlands, 2018. [Google Scholar]
- Feng, Y.; Thiemann, F.; Sester, M. Learning Cartographic Building Generalization with Deep Convolutional Neural Network. Int. J. Geo-Inf. 2019, 8, 258. [Google Scholar] [CrossRef] [Green Version]
- Lagrange, F.; Landras, B.; Mustiere, S. Machine Learning Techniques for Determining Parameters of Cartographic Generalisation Algorithms; XIXth ISPRS Congress: Amsterdam, The Netherlands, 2000; Volume XXXIII, Pt B4, pp. 718–725. [Google Scholar]
- Balboa, J.L.G.; López, F.J.A. Generalization-oriented road line classification by means of an artificial neural network. Geoinformatica 2008, 12, 289–312. [Google Scholar] [CrossRef]
- Zhou, Q.; Li, Z. Use of Artificial Neural Networks for Selective Omission in Updating Road Networks. Cartogr. J. 2014, 51, 38–51. [Google Scholar] [CrossRef]
- Zheng, J.; Gao, Z.; Ma, J.; Shen, J.; Zhang, K. Deep Graph Convolutional Networks for Accurate Automatic Road Network Selection. ISPRS Int. J. Geo-Inf. 2021, 10, 768. [Google Scholar] [CrossRef]
- Jepsen, S.T.; Jensen, C.S.; Dyhre Nielsen, T. Relational Fusion Networks: Graph Convolutional Networks for Road Networks. IEEE Trans. Intell. Transp. Syst. 2022, 23, 418–429. [Google Scholar] [CrossRef]
- Gülgen, F. Road hierarchy with integration of attributes using fuzzy-AHP. Geocarto Int. 2014, 29, 688–708. [Google Scholar] [CrossRef]
- Han, Y.; Wang, Z.; Lu, X.; Hu, B. Application of AHP to Road Selection. ISPRS Int. J. Geo-Inf. 2020, 9, 86. [Google Scholar] [CrossRef] [Green Version]
- Regulation of the Ministry of Interior on 17 November 2011 on the Topographic Objects Database and General Geographic Objects Database, As Well As Standard Cartographic Products, Journal of Laws of 2011, No 279 item 1642. Available online: https://isap.sejm.gov.pl/isap.nsf/DocDetails.xsp?id=WDU20112791642 (accessed on 5 January 2022).
- Karsznia, I.; Sielicka, K.; Weibel, R. Optimising road selection for small-scale maps using decision tree- based models. In Abstracts of AutoCarto 23rd International Research Symposium on Cartography and GIScience Cartography and Geographic Information Society; Redlands, CA, USA, 2020; Available online: https://tinyurl.com/58yrs79a (accessed on 5 January 2022).
- RapidMiner 9. Operator Reference Manual 2019. Retrieved 12 May 2022. Available online: https://docs.rapidminer.com/latest/studio/operators/rapidminer-studio-operator-reference.pdf (accessed on 5 January 2022).
- Courtial, A.; Touya, G.; Zhang, X. Constraint-Based Evaluation of Map Images Generalized by Deep Learning. J. Geovis. Spat. Anal. 2022, 6, 13. [Google Scholar] [CrossRef]
Variable | Description | |
---|---|---|
attribute variables | road class | hierarchy of roads in terms of technical condition, featuring: motorway, expressway, main road of accelerated traffic, main road, collector road, local road, local access road |
road category | hierarchy of roads in terms of function, featuring: national roads, voivodeship roads, district roads, municipality roads | |
number of carriageways | number of road carriageways | |
type of surface | type of road surface—paved or unpaved |
Variable | Description | |
---|---|---|
attribute variables | road class | hierarchy of roads in terms of technical condition, featuring: motorway, expressway, main road of accelerated traffic, main road, collector road, local road, local access road |
road category | hierarchy of roads in terms of function, featuring: national roads, voivodeship roads, district roads, municipality roads | |
number of carriageways | number of road carriageways | |
type of surface | type of road surface—paved or unpaved | |
spatial variables | segment length | segment length in metric units |
no. of connected roads (segment) | number of roads linked to a road segment that the road section is a part of | |
no. of connected roads (section) | number of roads linked to a road section | |
connects the settlements | number of settlements the road segment connects | |
minimum number of segments leading from settlements at a scale of 1:500,000 | minimum number of road segments exiting from settlements selected as a result of ML (scale 1:500,000) with which the segment connects | |
minimum number of segments leading from settlements at a scale of 1:1,000,000 | minimum number of road segments exiting from settlements selected as a result of ML (scale 1:1,000,000) with which the segment connects | |
density of paved roads in the district | density of paved road network in the district (in km/100 km2) | |
density of paved roads in hexagon | density of paved road network in the hexagon (in km/100 km2) | |
density of roads in the district | density of road network in the district (in km/100 km2) | |
density of roads in a hexagon | density of road network in the hexagon (in km/100 km2) | |
betweenness centrality | an indicator of an edge’s centrality in a network |
Area | Basic Approach | DT | DT-GA | RF | Difference |
---|---|---|---|---|---|
All districts | 45.10% | 82.46% | 83.33% | 84.96% | +39.86% |
Białostocki | 43.70% | 74.39% | 75.98% | 80.94% | +37.24% |
Rzeszowski | 55.25% | 84.38% | 86.76% | 82.58% | +31.51% |
Kępiński | 42.10% | 86.55% | 89.62% | 91.23% | +49.13% |
Area | Basic Approach | DT | DT-GA | RF | Difference |
---|---|---|---|---|---|
All districts | 96.36% | 99.18% | 99.44% | 99.34% | +3.08% |
Białostocki | 96.46% | 99.05% | 99.79% | 99.58% | +3.33% |
Rzeszowski | 91.89% | 96.11% | 98.21% | 97.00% | +7.32% |
Kępiński | 98.39% | 99.71% | 99.86% | 99.71% | +1.47% |
Area | Basic Approach | DT-GA | RF |
---|---|---|---|
All districts | 127 | 50 | 56 |
Białostocki | 51 | 30 | 31 |
Rzeszowski | 15 | 7 | 8 |
Kępiński | 61 | 13 | 17 |
Area | Basic Approach | DT-GA | RF |
---|---|---|---|
All districts | 1 | 1 | 1 |
Białostocki | 0 | 0 | 0 |
Rzeszowski | 0 | 1 | 1 |
Kępiński | 1 | 0 | 0 |
Variable | Weight |
---|---|
number of carriageways | 0.926 |
betweenness centrality | 0.911 |
number of connected roads (segment) | 0.910 |
minimum number of sections leading from settlements at a scale of 1:500,000 | 0.891 |
road class | 0.890 |
number of connected roads (section) | 0.883 |
road category | 0.878 |
minimum number of sections leading from settlements at a scale of 1:1,000,000 | 0.873 |
connects the settlements | 0.868 |
density of roads in the district | 0.680 |
density of paved roads in the district | 0.680 |
segment length | 0.670 |
density of paved roads in a hexagon | 0.668 |
density of roads in a hexagon | 0.668 |
Road Category | Road Class | Type of Surface | Segment Length | Density of Roads in the District | Density of Paved Roads in the District | Density of Paved Roads in Hexagon | Density of Roads in Hexagon | Betweenness Centrality | Number of Carriageways | No. of Connected Roads (Stroke) | No. of Connected Roads (Segment) | Connects the Settlements | Minimum Number of Segments Leading from Settlements at a Scale of 1:500,000 | Minimum Number of Segments Leading from Settlements at a Scale of 1:1,000,000 | Variables |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1.00 | 0.54 | 0.44 | −0.11 | −0.22 | −0.22 | −0.31 | −0.31 | −0.10 | −0.23 | −0.27 | −0.08 | −0.09 | −0.09 | −0.07 | road category |
0.54 | 1.00 | 0.33 | 0.22 | 0.07 | 0.07 | 0.02 | 0.02 | 0.09 | −0.25 | −0.34 | −0.12 | 0.06 | −0.01 | 0.18 | road class |
0.44 | 0.33 | 1.00 | −0.08 | −0.12 | −0.12 | −0.18 | −0.18 | −0.02 | −0.12 | −0.39 | −0.14 | −0.02 | −0.03 | 0.05 | type of surface |
−0.11 | 0.22 | −0.08 | 1.00 | 0.86 | 0.86 | 0.87 | 0.87 | 0.25 | 0.00 | −0.19 | −0.10 | 0.22 | 0.18 | 0.28 | segment length |
−0.22 | 0.07 | −0.12 | 0.86 | 1.00 | 1.00 | 0.81 | 0.81 | 0.22 | 0.01 | −0.12 | −0.09 | 0.25 | 0.18 | 0.29 | density of roads in the district |
−0.22 | 0.07 | −0.12 | 0.86 | 1.00 | 1.00 | 0.81 | 0.81 | 0.22 | 0.01 | −0.12 | −0.09 | 0.25 | 0.18 | 0.29 | density of paved roads in the district |
−0.31 | 0.02 | −0.18 | 0.87 | 0.81 | 0.81 | 1.00 | 1.00 | 0.25 | 0.08 | −0.17 | −0.11 | 0.26 | 0.22 | 0.35 | density of paved roads in hexagon |
−0.31 | 0.02 | −0.18 | 0.87 | 0.81 | 0.81 | 1.00 | 1.00 | 0.25 | 0.08 | −0.17 | −0.11 | 0.26 | 0.22 | 0.35 | density of roads in hexagon |
−0.10 | 0.09 | −0.02 | 0.25 | 0.22 | 0.22 | 0.25 | 0.25 | 1.00 | 0.05 | 0.00 | 0.12 | 0.12 | 0.11 | 0.14 | betweenness centrality |
−0.23 | −0.25 | −0.12 | 0.00 | 0.01 | 0.01 | 0.08 | 0.08 | 0.05 | 1.00 | 0.09 | 0.12 | −0.05 | −0.02 | −0.08 | number of carriageways |
−0.27 | −0.34 | −0.39 | −0.19 | −0.12 | −0.12 | −0.17 | −0.17 | 0.00 | 0.09 | 1.00 | 0.48 | −0.18 | −0.09 | −0.27 | no. of connected roads (stroke) |
−0.08 | −0.12 | −0.14 | −0.10 | −0.09 | −0.09 | −0.11 | −0.11 | 0.12 | 0.12 | 0.48 | 1.00 | −0.19 | 0.00 | −0.21 | no. of connected roads (segment) |
−0.09 | 0.06 | −0.02 | 0.22 | 0.25 | 0.25 | 0.26 | 0.26 | 0.12 | −0.05 | −0.18 | −0.19 | 1.00 | 0.65 | 0.46 | connects the settlements |
−0.09 | −0.01 | −0.03 | 0.18 | 0.18 | 0.18 | 0.22 | 0.22 | 0.11 | −0.02 | −0.09 | 0.00 | 0.65 | 1.00 | 0.25 | minimum number of sections leading from settlements at a scale of 1:500,000 |
−0.07 | 0.18 | 0.05 | 0.28 | 0.29 | 0.29 | 0.35 | 0.35 | 0.14 | −0.08 | −0.27 | −0.21 | 0.46 | 0.25 | 1.00 | minimum number of sections leading from settlements at a scale of 1:1,000,000 |
Correlation | |||||||||||||||
–0.39 1.00 | |||||||||||||||
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Karsznia, I.; Wereszczyńska, K.; Weibel, R. Make It Simple: Effective Road Selection for Small-Scale Map Design Using Decision-Tree-Based Models. ISPRS Int. J. Geo-Inf. 2022, 11, 457. https://doi.org/10.3390/ijgi11080457
Karsznia I, Wereszczyńska K, Weibel R. Make It Simple: Effective Road Selection for Small-Scale Map Design Using Decision-Tree-Based Models. ISPRS International Journal of Geo-Information. 2022; 11(8):457. https://doi.org/10.3390/ijgi11080457
Chicago/Turabian StyleKarsznia, Izabela, Karolina Wereszczyńska, and Robert Weibel. 2022. "Make It Simple: Effective Road Selection for Small-Scale Map Design Using Decision-Tree-Based Models" ISPRS International Journal of Geo-Information 11, no. 8: 457. https://doi.org/10.3390/ijgi11080457
APA StyleKarsznia, I., Wereszczyńska, K., & Weibel, R. (2022). Make It Simple: Effective Road Selection for Small-Scale Map Design Using Decision-Tree-Based Models. ISPRS International Journal of Geo-Information, 11(8), 457. https://doi.org/10.3390/ijgi11080457