Quantitative descriptors for identifying plant species of urban landscape vegetation

Jianhua ZHOU , Yifan ZHOU

Front. Earth Sci. ›› 2010, Vol. 4 ›› Issue (4) : 457 -462.

PDF (375KB)
Front. Earth Sci. ›› 2010, Vol. 4 ›› Issue (4) : 457 -462. DOI: 10.1007/s11707-010-0128-z
RESEARCH ARTICLE
RESEARCH ARTICLE

Quantitative descriptors for identifying plant species of urban landscape vegetation

Author information +
History +
PDF (375KB)

Abstract

This paper discusses the ideas and methods of designing effective descriptors for identifying plant species of urban landscape vegetation. Fourteen of such descriptors induced from image spectrum, texture, and shape properties were designed. These descriptors were intended to meet such requirements as possessing a true physical or geometric implication relating to ecological significance, having a relatively steady segmentation threshold and being less sensitive to image types or environmental conditions during image acquisition. This study used decision trees to combine four selected descriptors for plant species identification, and the experiment was able to reach an error rate of 5.8% compared 25.9% by merely using the conventional pixel brightness values in plant species identification.

Keywords

urban landscape vegetation / plant species / machine discerning / descriptor

Cite this article

Download citation ▾
Jianhua ZHOU, Yifan ZHOU. Quantitative descriptors for identifying plant species of urban landscape vegetation. Front. Earth Sci., 2010, 4(4): 457-462 DOI:10.1007/s11707-010-0128-z

登录浏览全文

4963

注册一个新账户 忘记密码

Introduction

Plants of different species, even with the same canopy closure, often vary widely in their biomass, net productivity, and environmental effects. It is therefore necessary to distinguish between plant species when vegetation studies require precise quantification (Zhou and Sun, 1995; Zhou, 2001; Liu et al., 2008). Due to limited urban land resources and required artificial aesthetics, the urban landscape vegetation is often characterized with multiple plant species, uneven distribution, diverse spatial structure, and high-density vegetated parcels. Moreover, other types of landscape mixed with vegetation also exhibit high degrees of fragmentation and heterogeneity. All of the above issues make it a severe challenge for identifying plant species in remote sensing images, since the level of uncertainty in the detailed quantification of urban landscape vegetation seems to be beyond management (Baret et al., 2007; Buyantuyev and Wu, 2009).

To date, it seems that the reports on intelligent recognition of plant species are only seen for the secondary forest with a few plant species. The recognition is likely feasible for the species of secondary forest communities. Identifiable species are only up to 4 when using such data as radar, lidar, and multispectral scanner, etc. (e.g., Leckie et al., 2003; Santos et al., 2003). Using hyperspectral data, identifiable species can increase to 6 because the weenie differences in light absorption can differentiate different species (e.g,. Gong et al., 1998; Martin et al., 1998; Tan et al., 2005).

Therefore, the high heterogeneity in urban vegetation and other landscapes and the noises in remote sensing images would make the recognition of plant species often ill with no unique solution, if they would only rely on traditional spectral properties in the machine discerning. This paper introduces some quantitative descriptors as a feasible solution to this puzzle. The word “descriptors” in this paper means some mathematical expressions (Table 1). A descriptor expresses a certain image spectral, texture, or shape property in the level of a single pixel and/or its neighbors, a cell (seeing 3.2), or a plot (seeing scholium c below Table 1). The design of the descriptors of the present study allows the high-level knowledge of human experts fusing into descriptors and yielding some constraints over the basic operations. These constraints will likely help resolve the uncertainties in identifying plant species and speed up convergence processes of some learning machines.

Methods

The types of remote sensing images and the environmental conditions during image acquisition often vary significantly. As a result, the images in the same region, obtained at different time, will show greater differences in spectral properties. It is necessary to retrain a learning machine with corresponding training sets when using conventional spectral properties to identify plant species. Therefore, it is needed to design some so-called “generic” descriptors.

Basic principles of design

To be in broader sense, it is necessary that each descriptor proposed has to be linked to a specific nature of vegetation. These descriptors must also meet the following requirements:

1) possessing a true physical or geometric implication relating to ecological significance;

2) being of a relatively steady segmentation threshold and being less sensitive to the differences among image types or the environmental conditions under which the images are obtained.

Design of descriptor

Based on such conventional properties as NDVI, statistical indicators from a GLCM (gray-level co-occurrence matrix), shape coefficient, etc., we proposed 14 descriptors listed in Table 1. These descriptors, in accordance with the conventional classification of interpretation signs, can be divided into three types, namely, spectral, texture, and shape descriptors. Due to space limitations, Table 1 shows only the expressions of these descriptors. As the examples of definition and getparms, four descriptors of DSI, Dd, Ra in Section 2.3, and Cd in Section 3.3 were discussed in detail.

Test the validity of descriptors

With some sampling data, the descriptors were tested one by one to ensure their validity on plant species identification (Table 1). The so-called validity means that the value of a descriptor can highlight some differences among plant species in order to generate a clear threshold for the segmentation.

Here, using DSI (relative difference between saturation and brightness) as an example to validate its credit on identifying several species while comparing to other three single spectral components, i.e., H, S,and I. (Fig. 1). Plant species were obtained from 12 remote sensing images by two seasons, including cedar, metasequoia, and camphor, which are the most common species in Shanghai. It can be seen from Fig. 1 that metasequoia can be identified from others in a much better separation with the use of DSI than that with one of the components of H, S, and I. In most of cases, there is an assured threshold of DSI, which helps identify metasequoia from others. This result can demonstrate that a properly selected descriptor is able to minimize indetermination for threshold.

Other example is given to the validity by using Dd (density of dark details), one of the texture descriptors. Small fluctuations on a crown surface will yield some small dark shadows or dark details whose density and mean length can be used to indicate the roughness of a crown surface. The gray-scale morphological filtering can serve as a feasible way to get Dd. Each continuous pixel set of dark details whose set size is smaller than a specified structural element can be extracted though a series of gray-scale morphological operations, namely, morphologically closeting an image and, then, opening the result. It is verified in several preliminary tests that Dd can indicate roughness of crown surface quite well. Figure 2 shows a clustering instance in which Dd serves as one of input components. The density of dark details is shown in Fig. 2(a), in which magnolia (with broad green leaves) and camphor (usually with several heads) have higher Dd values, while metasequoia (with a smooth appearance) has lower Dd value.

The last one is an experiment to test the validity of a shape descriptor. In the clustering pattern shown in Fig. 2, the Ra (weighted mean crown diameter) also serves as one of input components whose value is the mean minor axis of plot when each axis has its plot area as a weight. Figure 2(b) shows Ra in regular gray grades, and a lighter plot shows a higher Ra, namely, a larger crown diameter, and vice versa. Thus, the relative size of crowns can also serve as one of the key descriptors to identity plant species.

Discussion

Special roles of texture and shape descriptors

Both texture and shape descriptors play important roles in identifying plant species because they have a relatively steady segmentation threshold and are less sensitive to the differences between image types or to the environmental conditions under which the images are obtained.

Based on such traditional texture properties as GLCM statistics, wavelet analysis, etc., four new texture descriptors are designed to indicate the roughness on a crown surface (Table 1). One of the primary advantages of the texture descriptor comes from the matter that a descriptor value for a pixel is calculated with a certain neighboring of the pixel rather than with a plot the pixel belongs to. Usually, a plot is a result of previously defined segmentation. A remarkable shortcoming is that it is difficult to obtain continuous homogeneous segmentation results because the value distribution of a texture descriptor is often discrete. Our solution to this problem is in the use of a so-called “cell” (see Section 3.2).

Shapes not only with bright plot but also with dark one on a crown image can be indicated by some shape descriptors on the basis of a region or a boundary of the plot. Using such conventional shape properties as a plot area, the shape coefficient as well, three new shape descriptors are designed to indicate the complexity of a crown outline and its relative size (Table 1). The value of a shape descriptor is an overall quantity of a plot; so, these descriptors can perform well against the image noises. However, there is also an obvious shortcoming in the use of shape descriptors because these descriptors have excessive reliance on the previous segmentation. If the boundaries of the plots obtained from the segmentation are not accurate enough to show those crowns on an image, it is hard to take the shape descriptors into use. Our solution to this problem is to take a multidescriptor combination, including the use of those nonshape and without-presegmentation descriptors, thus reducing overreliance on the plots.

Significance of cell use

Vegetation in urban landscape often appears scattering or line-ranged layout. Large planted areas with only a single species can be seen occasionally. Usually, it is difficult to obtain a desirable result while segmenting or clustering plant species from a whole image. In addition, other cases also often make the results invalid, such as discrete texture properties and “round tendency” in a fuzzy clustering as well. A solution is to take the use of grid-based method with which image space is separated into a finite number of cells and then to have cell segmentation or cell clustering.

With cell operations, the gray undulation and random noise of texture properties can be smoothened when taking the weight-mean of a texture descriptor in a cell as the cell value. Furthermore, it will make those formerly difficult operations (i.e., segmentation, clustering, etc.) more feasible. Most tests in this paper have used cells. For example, using combined multidescriptors can work out the result of cell clustering (Fig. 2) and using the decision tree to obtain cell multisegmentation (seeing 3.4).

The precision of a cell operation depends largely on cell size. Here, the cell size is specified as 25m × 25m, considering that those trees have larger crown sizes. To facilitate the extraction of shape properties, it is necessary to detect a full crown. According to interpretation and statistics on remote sensing images, the greatest crown diameter in Shanghai is generally between 20 m and 25 m.

Use of additional constraints

Several new descriptors were derived from conventional properties but with some additional constraints. When extracting vegetation with NDVI for the purpose of obtaining a supplemental set of vegetation pixels, let us take SAVI as an example, as shown below.

NDVI is one of the early generic properties for extracting vegetation. Using the formula NDVI=(IR-R)/(IR+R) (IR means infrared and R red brightness), the condition NDVI>0.18 is valid for extracting most plant species of urban landscape vegetation. However, those species with a smaller difference between IR and R appear to have no saturated red in a color infrared image (seeing some trees in Fig. 3(a) pointed by blue arrows). To extract these species, reducing the NDVI threshold is needed. For example, NDVI threshold should be reduced by up to 39% for the purpose of extracting metasequoia. As a result of reduced NDVI threshold, it is possible to have other nonvegetation surface types to be extracted, such as the surface of granite in shadow color.

It is verified that this problem can be solved by additional constrains. For instance, using the following formula can effectively extract supplemental vegetation pixels in the low red saturation level, like that of metasequoia on a color infrared image, and can restrain the disturbers from other surface types.
S={S|SIM,S=(NDVI>0.11)(SAVI>0.18)},
where S is the set of pixels belonging to metasequoia; IM is the set of entire image pixels; SAVI means soil-adjusted vegetation index, and SAVI=((IR-R)/(IR+R+0.5))×1.5t (Huete, 1988; Gao et al., 2000).

Therefore, we can get DS as the supplemental pixel set in the low red saturation level as follows:
Ds={Ds|(DsS)(DsNDVI>0.18))},

Figure 3(c)) shows the situation by adding set S to the original set of vegetation pixels in Fig. 3(b). Figure 3(d) shows the DS set calculated with Eq. (2). It can be seen that those poorly extracted metasequoia pixels can be supplemented quite well with DS. It is more than that with the supplement that the full range of vegetation cover can be well extracted. More importantly, the relative density of DS, namely, Cd (Table 1) also can serve as an important descriptor to extract these species in the low red saturation level like metasequoia.

Accuracy examination

When using a single descriptor, only a few individual species can be identified under specific conditions. The classification in multiproperty space with the combination of multidescriptors will help increase the number of identifiable species and thus will improve the accuracy of classification. On the basis of, the evaluation of descriptors and understanding the properties of plant species, the input vectors, each corresponding to a descriptor can be decided for a learning machine.

To make an accurate examination of identifying plant species with descriptors, we use the scene of colored infrared aerial photograph (seeing Fig. 3(a)) and the four-descriptor combination as the input property vector. These descriptors are NDVI, Dd, Ra, and Cd, which represent spectrum, texture, shape properties, and pixel supplement, respectively. A conventional combination of spectrographic brightness is also taken as a reference. The learning machine used is a decision tree.

Figures 4(b) and (c) are the two identified results with the combination of NDVI, Dd, Ra, Cd, and the combination of spectrographic brightness. The charts were formed by incising the top left corner of the two outputs because there are so many cells in the outputs that the marks of plant species cannot be shown clearly for reading. In this case, data are spaced into certain sizes of cells (cell No= 342), of which the cells with≥10% vegetation cover are 139. Table 2 shows a comparison of identification accuracy with these two combinations. Number of incorrectly divided cells is counted by visual interpretation. The error rate here means that the ratio of the number of incorrectly divided cells to the number of cells with≥10% vegetation cover. A complex species cell means such a single cell that it contains two or more species, but it is irrelevant to prioritization. The number of complex species cells is 11 (possessing 7.9%).

The error rate of identification with the combination of NDVI, Dd, Ra, and Cd is obviously lower than that with the combination of spectrographic brightness, and the difference between them is 77% (Table 2). Thus, in the multiproperty space in accordance with the combination of NDVI, Dd, Ra, and Cd, the uncertainty of identification is greatly reduced. Therefore, incorporating human expert's knowledge into these descriptors will form some constraints over the basic operations. This can also enrich the organized data levels, namely, to have a new level of spatial objects above the conventional level of a single pixel.

Conclusions

This paper proposes 14 new descriptors, which can help indicate spectrum, texture, and shape properties, respectively. Through statistical analyses, together with the tests of image classification and/or clustering, it is demonstrated that these descriptors can perform better than that with conventional properties in identifying plant species of urban landscape vegetation. In addition, the algorithm of extracting supplemental vegetation pixels can help obtain the full range of vegetation cover. Also, the relative density of supplemental vegetation pixels can serve as an important descriptor to extract those species in the low red saturation. With the cells, the random noise of the texture descriptors can be smoothened, and some algorithms are likely to be simplified. It is verified by an accurate examination that the error rate of identification with the combination of NDVI, Dd, Ra, and Cd is 77% lower than that of the combination of spectrographic brightness.

So far, using verified descriptors of the present study can identify 12 plant species of urban landscape vegetable, further improvements can be expected along with in-depth studies on suitable descriptors, and more robust learning machines need to be verified. Our ultimate goal is to make up to 30 backbone plant species identifiable, which can likely account for>90% of the total individual plant in a large-scale city.

References

[1]

Baret F, Houlès V, Guérif M (2007). Quantification of plant stress using remote sensing observations and crop models: the case of nitrogen management. J Exp Bot, 58(4): 869–880

[2]

Buyantuyev A, Wu J (2009). Urbanization alters spatiotemporal patterns of ecosystem primary production: A case study of the Phoenix metropolitan region, USA. J Arid Environ, 73(4-5): 512–520

[3]

Gao X, Huete A R, Ni W, Miura T (2000). Optical-biophysical relationships of vegetation spectra without background contamination. Remote Sens Environ, 74(3): 609–620

[4]

Gong P, Pu R L, Yu B (1998). Recognition and analysis of conifer species in various seasons and time with hyperspectral data. Journal of Remote Sensing, 2(3): 211–217 (in Chinese with English abstract)

[5]

Huete A R (1988). A soil-adjusted vegetation index (SAVI). Remote Sens Environ, 25(3): 295–309

[6]

Leckie D G, Gougeon F A, Walsworth N, Paradine D (2003). Stand delineation and composition estimation using semi-automated individual tree crown analysis. Remote Sens Environ, 85(3): 355–369

[7]

Liu C F, Zhao S, Li L, Li X M, He X Y, Chen W (2008). Difference analysis of carbon fixation and pollution removal of urban forest in Shenyang. Journal of Northwest Forestry University, 23(4): 56–61 (in Chinese with English abstract)

[8]

Martin M E, Newman S D, Aber J D, Congalton R G (1998). Determining forest species composition using high spectral resolution remote sensing data. Remote Sens Environ, 65(3): 249–254

[9]

Santos J R, Freitas C C, Araujo L S, Dutra L V, Mura J C, Gama F F, Soler L S, Sant'Anna S J S (2003). Airborne P-band SAR applied to the aboveground biomass studies in the Brazilian tropical rainforest. Remote Sens Environ, 87(4): 482–493

[10]

Tan B X, Li Z Y, Chen E X, Pang Y (2005). Forest type recognition with hyperspectral and multi-spectral remote sensing data. Journal of Northeast Forestry University, 33(S): 61–64 (in Chinese with English abstract)

[11]

Zhou J H (2001). Theory and practice on database of three-dimensional vegetable quantity. Acta Geogr Sin, 56(1): 14–23 (in Chinese with English abstract)

[12]

Zhou J H, Sun T Z (1995). Measuring model of three-dimensional green biomass through remote sensing and estimating method of environmental benefits of urban vegetation. Journal of Remote Sensing, 10(3): 162–174

RIGHTS & PERMISSIONS

Higher Education Press and Springer-Verlag Berlin Heidelberg

AI Summary AI Mindmap
PDF (375KB)

1155

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/