网刊加载中。。。

使用Chrome浏览器效果最佳,继续浏览,你可能不会看到最佳的展示效果,

确定继续浏览么?

复制成功,请在其他浏览器进行阅读

Urban tree species classification based on multispectral airborne LiDAR  PDF

  • HU Pei-Lun 1,2
  • CHEN Yu-Wei 1
  • Mohammad Imangholiloo 2
  • Markus Holopainen 2
  • WANG Yi-Cheng 3
  • Juha Hyyppä 1
1. Department of Remote Sensing and Photogrammetry, Finnish Geospatial Research Institute, Espoo 02150, Finland; 2. Department of Forest Sciences, University of Helsinki, Helsinki 00014, Finland; 3. Advanced Laser Technology Laboratory of Anhui Province, Hefei 230037, China

CLC: S771.8

Updated:2025-04-27

DOI:10.11972/j.issn.1001-9014.2025.02.009

  • Full Text
  • Figs & Tabs
  • References
  • Authors
  • About
CN CITE
OUTLINE

Abstract

Urban tree species provide various essential ecosystem services in cities, such as regulating urban temperatures, reducing noise, capturing carbon, and mitigating the urban heat island effect. The quality of these services is influenced by species diversity, tree health, and the distribution and the composition of trees. Traditionally, data on urban trees has been collected through field surveys and manual interpretation of remote sensing images. In this study, we evaluated the effectiveness of multispectral airborne laser scanning (ALS) data in classifying 24 common urban roadside tree species in Espoo, Finland. Tree crown structure information, intensity features, and spectral data were used for classification. Eight different machine learning algorithms were tested, with the extra trees (ET) algorithm performing the best, achieving an overall accuracy of 71.7% using multispectral LiDAR data. This result highlights that integrating structural and spectral information within a single framework can improve the classification accuracy. Future research will focus on identifying the most important features for species classification and developing algorithms with greater efficiency and accuracy.

Introduction

Today, approximately 56% of the world's population—4.4 billion people—live in cities. Urban trees play a significant role in mitigating global climate change

1 and are uniquely susceptible to climate change impacts. Urban Forest Effects model23 is widely used in urban areas globally to estimate urban forest structure, species diversity, and ecosystem functions. However, conducting urban forest inventories is labor-intensive, especially on private properties, and the results are often not spatially detailed. While remotely sensed data is commonly used in forest applications, traditional optical remote sensing methods struggle to capture three-dimensional forest structures, especially in unevenly aged, mixed-species forests with multiple canopy layers4.

Airborne laser scanning (ALS) is effective for extracting biophysical variables and revising forest inventory maps. The successful use of ALS data has been demonstrated for various applications. For example, ALS has been used to estimate tree height

56, identify tree species7-9, and estimate tree volume biomass1011, and growth1213. Tree species information at an individual tree level is particularly useful in growth and yield estimates and has been primarily studied for forest applications, such as updating forest inventories. Tree species classification using ALS has not been intensively studied compared with studies on the successful use of ALS for other forest attribute mapping because of the lack of spectral information.

Previous studies have also revealed that combining multispectral information with 3D ALS data can improve the accuracy of tree extraction and tree species classification, as we can take advantage of both datasets. However, challenging factors limit the effective operational use of the fused datasets

1415. For example, geometric and radiometric registration between two datasets is demanding because data are normally acquired at different times using different sensors. The recently developed multispectral laser scanning technique is becoming an attractive option for forest mapping because it can provide not only a dense point cloud but also spectral information, which can simplify data processing and facilitate the interpretation of data.

Given the limitations of traditional optical remote sensing in capturing three-dimensional forest structures, it is essential to explore the potential of multispectral laser scanning for urban tree inventories, particularly for species classification. This study aims to assess the feasibility of using multispectral ALS data for urban tree species classification and to analyze the information content of features derived from point clouds and intensity data.

1 Materials and methods

1.1 Study area and establishment of sample plots

The MLS datasets used in this study were acquired in a suburban area in Espoolahti, southern Finland (60°9′18″N, 24°38´24″E) in the southern Boreal Forest Zone. We choose around 822 trees in this area as our field dataset. The land area is approximately 5 km2. In our research, we concentrated solely on the vegetated areas, excluding the sea using a water mask created from topographic map data. The area included a diverse range of boreal tree species.

The points were updated through visual interpretation of Titan data and open datasets from the City of Espoo, the National Land Survey of Finland, Google Maps, and Google Street View. Field checks validated the analysis and resolved uncertainties. The reference points' attributes included species, geographic location, living conditions, tree height, and planting date for each tree.

Fig. 1  Map of the study area and tree samples in the research area.

图 1  研究区和研究区的树木样本

1.2 Multispectral ALS data

Multispectral Optech Titan data (Teledyne Optech, Toronto, ON, Canada) for the study area were collected in May and June 2016 in collaboration with TerraTec Oy (Helsinki, Finland) from a 650 m flight height. The data acquisition was carried out using a fixed-wing aircraft flying at a constant altitude. The sensor comprises three Titan channels: green (532 nm), near-infrared (1 064 nm), and shortwave infrared (1 550 nm). Each channel provided separate point clouds. In our preprocessed dataset, the point densities over land areas were approximately 9 points/m² for Channel 1, 9 points/m² for Channel 2, and 8 points/m² for Channel 3.

TerraScan (TerraSolid Oy, Helsinki, Finland) was used to preprocess the ALS data and differentiate between ground and nonground points using a standardized procedure. This procedure involved removing noise, such as points detected below the ground level or above the canopy. Subsequently, the point clouds were height-normalized. Ground elevation was subtracted from the point cloud height measurements using a digital terrain model created from the classified ground points of the three channels to eliminate potential discrepancies.

Radiometric calibration of ALS intensity is crucial to ensure successful classification. Therefore, in this study, we implemented relative radiometric calibration. We observed that the intensity values were higher in the middle of the flight path compared to other areas and decreased with scanning height. A range correction was applied to mitigate such effects.

Ic=I×Di2Dref2 , (1)

where Ic is the modified intensity, I is the original intensity, Di is the distance from the LiDAR to the point cloud and Dref  is the flying altitude (650 m).

1.3 Creating canopy height model and single tree detection

Individual trees were detected using a minimum curvature-based algorithm, which started with creating a canopy height model (CHM). According to our field dataset of each tree coordination, we set the potential crown area within 5 m2. A local maximum filtering algorithm was used to find the treetops in this area. Subsequently, the watershed segmentation method was used to delineate tree crown boundaries without setting a flow threshold in the CHM. Eventually, the point cloud of each tree from the multispectral ALS dataset was created. In the segmentation process, the shape and position of individual tree crowns were identified using the segment boundaries and the location of the highest point within each segment. In this study, first return points from all three channels were utilized to generate CHM.

1.4 Multispectral ALS data feature extraction

In this experiment, the features were primarily divided into two types: intensity features and geometric features. The maximum height (Hmax) of each tree was calculated from the highest point of all point cloud in each tree segment.

Simultaneously, we got 137 features in each channel from the multispectral ALS data.

Table 1  List of all features from Multispectral ALS data (i refers to channel numbers, and subscript F represents the single-channel intensity feature used)
表 1  从激光回波中得出的所有特征列表 (i 代表通道编号,下标 F 指使用的单通道强度特征)
FeatureDefinition
Single-channel Intensity (SCI) features
Imax Maximum intensity
Imin Minimum intensity
Imean Mean intensity
Istd The standard deviation of intensity
Icov Coefficient of variation (i.e., relative standard deviation) of intensity
Isk Skewness of intensity
Irange Range of intensity
Ikut Kurtosis of intensity
I5 to I95 Percentiles of intensity values of points above the ground threshold from 5% to 95% in 5% increments
Multi-channel Intensity (MCI) features
RiF= IiF/(I1F+I2F+I3F) Ratios of intensity features in each channel
gNDVIF= (I1F-I3F)/(I2F+I3F) Green normalized differential vegetation index (gNDVI)
gSRF= (I2F/I3F) Green simple ratio vegetation index (gSR)
Geometric features
Hmax Maximum of the heights of all points
Hmean Arithmetic mean of the height of all points above 1 m threshold
Hstd Standard deviation of height of all points above 1 m threshold
Hrange Range of normalized height of all points above 1 m threshold
P Penetration as a ratio between the number of returns below 1 m and total returns
CA Crown area as the area of the convex hull in 2D
CV Crown volume as the convex hull in 3D
CD Crown diameter calculated from crown area considering crown as a circle
HP10 to HP90 Percentiles of the points above 1 m height from 10% to 90% at 10% incremental.
D1 to D10 Di = Ni/Ntotal, where i = 1 to 10, Ni is the number of points within the ith layer when tree height was divided into 10 intervals starting from 1 m, Ntotal is the number of all points.

1.5 Tree species classification and accuracy evaluation

In this study, we use 8 machine learning algorithms to compare the classification of tree species.: extra trees (ET), random forest (RF), K-nearest neighbour (KNN), logistic regression (LR), linear discriminant analysis (LDA), classification and regression tree (CART), naive bayes (NB), support vector machine (SVM). Tree species were estimated based on prediction models by 8 machine learning algorithms using tree features as predictors and tree species as a response for correctly detected trees.

2 Results

2.1 Accuracy of classification

As presented in Fig. 2, using all the intensity and geometric features, the overall tree species classification performed best in the extra tree algorithm and reached 71.7%. When we only use channel 1 features for classification, overall values can only reach 65.7%. Only using features from channel 2 yielded overall values that can only reach 68.3%. Only using features from channel 3 yielded overall values that can only reach 64.8%. The accuracy of all the classifications for each species is shown in Fig. 3.

Fig. 2  Titan intensity image of Study area in Espoolahti (Red: Channel 1; Green: Channel 2; Blue: Channel 3).

图 2  埃斯波拉赫蒂研究区,背景为泰坦强度影像(红色:Channel 1;绿色:Channel 2;蓝色:Channel 3)。

Fig. 3  The comparison of classification accuracy of 24 tree species: ET, RF, KNN, LR, LDA, CART, NB, SVM

图 3  树种的分类准确率比较:额外树、随机森林、K-近邻、逻辑回归、线性判别分析、分类回归树、奈夫贝叶斯、支持向量机

The confusion matrix analysis reveals a model that performs well for most classes but struggles with a few, particularly Quercus and Sorbus according to Table 2 and Fig. 4. Certain classes, such as AcerLarix, and Thuja, exhibit high accuracy (≥93%), indicating the model’s ability to correctly classify instances for these classes. By addressing these shortcomings through feature refinement, data augmentation, and model optimization, the overall classification accuracy can be significantly improved. Future work should focus on integrating domain-specific knowledge to enhance feature representation and reduce class overlap.

Table 2  List of tree sample
表 2  树木样本清单
Tree speciesThe index numberNumber of Trees
Pinta-ala 1 2
Abies 2 13
Acer 3 249
Alnus 4 5
Betula 5 26
Fallopia 6 1
Fraxinus 7 2
Juglans 8 5
Larix 9 11
Malus 10 8
Picea 11 15
Pinus 12 84
Populus 13 16
Prunus 14 10
Quercus 15 23
Ribes 16 5
Salix 17 4
Sambucus 18 1
Sorbus 19 84
Syringa 20 1
Taxus 21 4
Thuja 22 2
Tilia 23 88
Ulmus 24 163

Fig. 4  The confusion matrix of classification with geometric and intensity features for each species.

图 4  利用几何特征和强度特征对每个物种进行分类的混淆矩阵。

2.2 Feature importance analysis

We also investigated which input features and channels are most relevant for tree species classification based on the measure provided by the RF algorithm for assessing feature importance. If a feature influences the prediction, permuting its values should affect the model error. If a feature is not influential, then permuting its values should have little or no effect on the model error. Table 3 lists the top three features in the classifications based on different combinations of the features. The most important features in the classification based on point cloud features were penetration and higher-level percentiles. Two density-related features at higher and middle layers were also scored as important as higher percentiles. In the case of classification using single-channel features, the 1 064nm wavelength (Channel 2) appears to provide the most valuable information for distinguishing between pine, spruce, and birch species. This is followed by the 1 550nm wavelength (Channel 1) and then the 532nm wavelength (Channel 3).

Table 3  The features have the most predictive power in different classification scenarios
表 3  在不同的分类场景中预测能力最强的特征值
CasesTop 3 features
All features I2minP1.5I3min

3 Conclusions

Multispectral LiDAR data improved the classification accuracy by approximately 5% to 10% for all channels compared to each channel. This proves our hypothesis about the ability of mALS features in classification. For example, the overall accuracy of 71.7% was obtained in multispectral LiDAR all-channel data, while accuracies of 65.7%, 68.3%, and 64.8% were achieved when using only Channel 1, Channel 2, and Channel 3, respectively. Our findings demonstrated the advantage of combining multichannel features over single-channel data in classifying urban trees. However, the sample size of each tree species in this experiment was uneven, which may have affected the model's accuracy. Consequently, a larger and more representative sample will be used in future research. The imbalance in measurement samples reduced classification accuracy to some extent. Addressing this limitation will be a key focus in subsequent studies.

In this study, eight machine learning algorithms were evaluated for their classification performance, each demonstrating distinct strengths and limitations. The selection of an appropriate classification algorithm depends on the specific characteristics of the dataset, including size, dimensionality, and the underlying relationship between features and class labels. Extra trees (ET) and random forests(RF) proved effective in our study due to their ability to handle large, high-dimensional datasets and their robustness against overfitting, which suited the conditions of our dataset. Naive Bayes (NB) was efficient and scalable, especially for high-dimensional data, but its assumption of feature independence limited its applicability in cases with high feature correlation.

It is also important to note that overall accuracy (OA) is influenced by factors such as species composition, stand structure, age, and the methods used to select the best features, which vary among studies. In this research, however, the intensity of laser returns was not calibrated. This limitation can be addressed in future studies. First, we can investigate whether calibrated intensity affects classification results. Second, the use of MCI features in this study mitigated potential variations in intensity.

In conclusion, the ability of mALS compared to single-channel ALS (SCI-Ch) data to characterize tree species in urban areas was assessed in this study. Our classification results indicate that mALS data provided more accurate results than single-channel ALS data for urban tree species classification.

References

1

Schneider AFriedl M APotere D. Mapping global urban areas using MODIS 500-m data: New methods and datasets based on ‘urban ecoregions’ [J]. Remote Sensing of Environment20101148): 1733-1746. 10.1016/j.rse.2010.03.003 [Baidu Scholar] 

2

Lee J HBang K W. Characterization of urban stormwater runoff [J]. Water Research2000346): 1773-1780. 10.1016/s0043-1354(99)00325-5 [Baidu Scholar] 

3

Escobedo F JNowak D J. Spatial heterogeneity and air pollution removal by an urban forest [J]. Landscape and Urban Planning2009903-4): 102-110. 10.1016/j.landurbplan.2008.10.021 [Baidu Scholar] 

4

Nowak DCrane DStevens Jet al. A ground-based method of assessing urban forest structure and ecosystem services [J]. Arboriculture & Urban Forestry2008346): 347-358. 10.48044/jauf.2008.048 [Baidu Scholar] 

5

Lovell J LJupp D L BCulvenor D Set al. Using airborne and ground-based ranging lidar to measure canopy structure in Australian forests [J]. Canadian Journal of Remote Sensing2003295): 607-622. 10.5589/m03-026 [Baidu Scholar] 

6

Næsset EØkland T. Estimating tree height and tree crown properties using airborne scanning laser in a boreal nature reserve [J]. Remote Sensing of Environment2002791): 105-115. 10.1016/s0034-4257(01)00243-7 [Baidu Scholar] 

7

Clark M LClark D BRoberts D A. Small-footprint lidar estimation of sub-canopy elevation and tree height in a tropical rain forest landscape [J]. Remote Sensing of Environment2004911): 68-89. 10.1016/j.rse.2004.02.008 [Baidu Scholar] 

8

Holmgren JPersson Å. Identifying species of individual trees using airborne laser scanner [J]. Remote Sensing of Environment2004904): 415-423. 10.1016/s0034-4257(03)00140-8 [Baidu Scholar] 

9

Brandtberg T. Classifying individual tree species under leaf-off and leaf-on conditions using airborne lidar [J]. ISPRS Journal of Photogrammetry and Remote Sensing2007615): 325-340. 10.1016/j.isprsjprs.2006.10.006 [Baidu Scholar] 

10

Lindberg EEysn LHollaus Met al. Delineation of tree crowns and tree species classification from full-waveform airborne laser scanning data using 3-D ellipsoidal clustering [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing201477): 3174-3181. 10.1109/jstars.2014.2331276 [Baidu Scholar] 

11

Hyyppa JKelle OLehikoinen Met al. A segmentation-based method to retrieve stem volume estimates from 3-D tree height models produced by laser scanners [J]. IEEE Transactions on Geoscience and Remote Sensing2001395): 969-975. 10.1109/36.921414 [Baidu Scholar] 

12

Ahmed RSiqueira PHensley S. A study of forest biomass estimates from lidar in the northern temperate forests of New England [J]. Remote Sensing of Environment2013130121-135. 10.1016/j.rse.2012.11.015 [Baidu Scholar] 

13

Hollaus MWagner WMaier Bet al. Airborne laser scanning of forest stem volume in a mountainous environment [J]. Sensors200778): 1559-1577. 10.3390/s7081559 [Baidu Scholar] 

14

Yu XHyyppä JKaartinen Het al. Obtaining plotwise mean height and volume growth in boreal forests using multi‐temporal laser surveys and various change detection techniques [J]. International Journal of Remote Sensing2008295): 1367-1386. 10.1080/01431160701736356 [Baidu Scholar] 

15

Yu XHyyppä JKukko Aet al. Change detection techniques for canopy height growth measurements using airborne laser scanner data [J]. Photogrammetric Engineering & Remote Sensing20067212): 1339-1348. 10.14358/pers.72.12.1339 [Baidu Scholar]