1. College of Civil Engineering, Tongji University, Shanghai 200092, China
2. China Construction Eighth Engineering Division Co., Ltd., Shanghai 200135, China
guo-xiao-nong@tongji.edu.cn
Show less
History+
Received
Accepted
Published
2023-03-04
2023-05-20
Issue Date
Revised Date
2024-03-22
PDF
(9981KB)
Abstract
Single-layer reticulated shells (SLRSs) find widespread application in the roofs of crucial public structures, such as gymnasiums and exhibition center. In this paper, a new neural-network-based method for structural damage identification in SLRSs is proposed. First, a damage vector index, NDL, that is related only to the damage localization, is proposed for SLRSs, and a damage data set is constructed from NDL data. On the basis of visualization of the NDL damage data set, the structural damaged region locations are identified using convolutional neural networks (CNNs). By cross-dividing the damaged region locations and using parallel CNNs for each regional location, the damaged region locations can be quickly and efficiently identified and the undamaged region locations can be eliminated. Second, a damage vector index, DS, that is related to the damage location and damage degree, is proposed for SLRSs. Based on the damaged region identified previously, a fully connected neural network (FCNN) is constructed to identify the location and damage degree of members. The effectiveness and reliability of the proposed method are verified by considering a numerical case of a spherical SLRS. The calculation results showed that the proposed method can quickly eliminate candidate locations of potential damaged region locations and precisely determine the location and damage degree of members.
In the field of civil engineering, structural damage can be caused by many factors, such as material aging, corrosion, load effect, and natural disasters [1], and their identification can be divided into two steps: determining the location of damage, and quantifying the damage [2–5].
When damage occurs in a civil engineering structure, changes in the structure’s physical characteristics, such as structural stiffness and damping, result in changes in the vibration response and modal parameters. The dynamic response of the structure can be monitored by installing various types of sensors on the structure to detect structural damage [6,7]. Owing to the development of damage identification theory, monitoring equipment, and modal analysis methods [8], the capability to identify damage has improved significantly [9]. Furthermore, in recent years, a structural health monitoring system has been developed to obtain the structural dynamic response [10,11]. From the structural dynamic response, structural damage can be detected through damage detection or modal analysis techniques [12].
As a universal function approximator, artificial neural networks (ANNs) have been successfully used to solve partial differential equation problems [13,14]. Similarly, structural damage identification can be accomplished by approximating the nonlinear mapping relationship between structural damage characteristics, damage locations, and damage degrees by using ANN [15]. However, shallow ANNs have difficulty in completing more complex nonlinear problems. To further research the problems in more complex engineering systems, application of deep neural network (DNN) has been introduced into the mathematical modeling of complex engineering problems [16,17]. DNNs achieve better approximation for complex engineering system behavior due to their stronger feature extraction and nonlinear mapping capabilities [18,19]. To solve the complex nonlinear damage identification problem, researchers have used DNNs, such as convolutional neural networks (CNNs) [20] and long short-term memory networks [21], and hybrid neural networks [22].
DNNs process the original data through multiple layers of nonlinear transformations and output higher dimensional and more abstract features. Recently, a deep CNN has been widely used for feature extraction; it can detect structural defects and identify structural damage [23] by directly extracting features from structural time history data [24] or frequency domain data [25]. Owing to the advantages of CNNs in feature extraction, it has been used for crack detection in steel structures [26], concrete members [27,28], and for damage identification in frame structures [29] and bridges [30]. However, there have been relatively few studies on the use of CNNs for damage identification in reticulated shells.
When neural networks are used for damage identification, the training data set is constructed by combining members with different damage patterns [31]. Researchers construct damage training data sets based on transmissibility data and they have accomplished structural damage identification using CNNs [32–34]. However, the transmissibility data are related to the damage location and degree simultaneously. For large-scale civil engineering structures containing many members, the direct construction of the training data set with transmissibility data or other indexes related to the damage location and degree simultaneously is prone to training data explosion, as the number of combinations of different member damage locations and degrees is extremely large. For such structures, a phased damage identification method has been developed [35]. The damage location is first determined, and the damage degree is then obtained. Such a stepwise damage identification method facilitates precise damage identification for framed structures [36] and spatial truss structures [37], and it can improve the identification efficiency.
The problems are yet to be overcome for damage identification in single-layer reticulated shells (SLRSs). First, a large number of training data sets should be constructed in advance because of the huge number of structural members. Secondly, some damage patterns are very similar because of the close spacing of modes [38]. Thirdly the presence of too many members in the damaged area reduces the accuracy of member damage identification when the traditional stepwise damage identification method is used. To solve these problems, this paper proposes: 1) a damage location index to determine the damaged region locations and damage degree index to determine the damage degree of members; 2) a CNN-based damaged region location method that can identify complex damage patterns and a cross-partitioning damaged region location method that can significantly narrow down the possible damaged region; 3) a damaged member identification method involving a fully connected neural network (FCNN) that should be applied to the narrowed-down damaged region.
This paper is organized as follows. First, the damage-sensitive feature of SLRSs is derived. Then, CNNs for damaged region location and a cross-partitioning damaged region localization method based on CNNs are developed. The CNN-based cross-partitioning damaged region localization method can help identify complex damage patterns and significantly narrow down the affected location, facilitating the achievement of a higher identification precision. Subsequently, FCNNs are constructed to identify the damaged member of the structure. Finally, the above methods are applied to a spherical SLRS to verify their effectiveness.
2 Damage sensitive features of single-layer reticulated shell
2.1 Damage localization index of single-layer reticulated shell
The vibration equation of an n-degree of freedom (n-DOF) undamped structure system can be written as
where K is the stiffness matrix of the structure, M is the mass matrix of the structure, is the ith order circular frequency of the structure, and is the ith order vibration mode of the structure.
Assuming that only K of the structure changes and that M remains unchanged when the structure is damaged, we define the structural vibration equation for n-DOFs as
where is the stiffness variation of the structure, is the variation of the ith order circular frequency of the structure, and is the variation of the ith order vibration mode of the structure.
Combing Eqs. (1) and (2) and ignoring the higher-order infinitesimal terms, we can write as
In a linear structure, a change in the structural stiffness is the linear superposition of changes in the stiffness of all elements. Hence, can be expressed as
where is the damage coefficient of the kth element in the structure, is the element-stiffness-contributed matrix of the kth element in the global stiffness matrix, and L is the total number of structural elements.
On the basis of the orthogonality of modal shapes, when r ≠ i, and . By left-multiplying Eq. (2) with and combining it with the equation , we can obtain the following equation, in which the higher-order infinitesimal has been ignored:
Using the assumption of the linear model that the modal vibrations of the structure are independent of each other, we can express the variation of the ith order vibration mode as a linear combination of the modes:
where is the weight factor corresponding to .
Substituting Eq. (6) into Eq. (5) yields
When r ≠ i, and . Therefore, Eq. (7) can be simplified as
Herein we adopt the modal mass normalization method, i.e., . Then, Eq. (1) can be rewritten as . The weight factor corresponding to can be written as
The parameter can be rewritten as
Substituting Eq. (3) into Eq. (10) yields
Combining Eqs. (3) and (11), we obtain
If there is only one damaged element in the structure or all elements have similar damage degrees, the damage coefficient would be eliminated in Eq. (12). Subsequently, Eq. (13) below is derived from Eq. (12), and it is a damage-sensitive feature of the structure and is independent of the damage degree of the elements. Therefore, Eq. (13) is referred to as damage location feature.
Generally, the damage location feature is expressed using the damage location vector DLi, which can be written as
where and are the ith-order modes of the undamaged structure and damaged structure, respectively, and and are the ith-order circular frequencies of the undamaged structure and damaged structure, respectively.
When the damage location vector DLi is to be input into the neural network for machine learning, it should be normalized. The normalized damage location vector NDLi can be written as
where is the module of vector DLi.
From Eqs. (14) and (15), a matrix related only to damage location can be constructed:
Using the damage location matrix NDL, we can perform the damage pattern identification of a damaged location by training a neural network with damage location vectors. In the damage identification of SLRS, the NDL based neural network is used to determine the potential damage areas of SLRS, thereby greatly reducing the range of candidates of damage patterns. Then, the location and degree of damaged members can be determined in the potential damage regions of SLRS by constructing corresponding neural network.
2.2 Damage degree index of single-layer reticulated shell
Equations (3) and (11) show that the vibration mode variation and the natural frequency variation of the structure are related to the damage location and degree. Hence, a vector related to both damage location and degree can be constructed:
where is the variation of the normalized vibration mode vector of the nth order mode
On the basis of the vector DS, the neural network can learn the relevant features of damage location and degree to complete the damage location and degree identification of members in potential damage areas of SLRS.
3 Convolutional neural network-based cross-partitioning damaged region location method for single-layer reticulated shell
The number of members is large in an SLRS, and hence, there are numerous potential damage patterns. Therefore, it is challenging to directly determine the damaged member and its corresponding damage degree by using a neural network. However, the SLRS can be divided into substructures with similar or even identical geometric shapes, which implies that both the member distribution and damage patterns of the substructures have a certain regularity. A specific division method can used to divide the SLRS into several substructures. In the damage identification process, the damaged region locations can be determined first, which significantly reduces the number of unreasonable damage patterns. This section proposes a CNN-based cross-partitioning damaged location method for SLRS. The method can significantly narrow down the possible damaged location of the structure.
3.1 Construction of damage pattern
The nonlinear mapping relationship between the input and the damaged substructure can be determined by training the CNN with the common features of damage patterns in the substructure. The damage location vector can be processed and input into the CNN, which extracts the high-dimensional features in the damage location vector and performs damaged region location identification.
This section describes the transformation of the damage location matrix of the structure into images, and the training of the CNN with the features of the matrix to determine the mapping relationship between the damage pattern and the damaged region location. The trained CNN can be used to identify the damaged region location. The process of constructing the damage pattern that can be input into the CNN can be summarized as follows.
S1: Some members are randomly selected from the SLRS, and their stiffnesses are reduced to simulate structural damage. The damage location matrix NDL is calculated using Eqs. (11)–(16).
S2: In the NDL matrix, NDLi is selected to construct a set of scatter points in the x, y, and z directions:
where m is the number of joints in the SLRS and Sx, Sy, and Sz are the damage pattern scatter point sets for the x, y, and z directions, respectively. Sxm, Sym, and Szm are the three-dimensional coordinates of the mth point in Sx, Sy, and Sz, respectively, and they can be calculated as follows:
where Dxm, Dym, and Dzm are the x, y, and z direction coordinates of the mth joint of the SLRS, respectively, and NDLixm, NDLiym, and NDLizm are the normalized damage location vectors at the mth node in the x, y, and z directions, respectively.
S3: The scattered points of Eq. (18) are fitted to a three-dimensional surface:
where hx, hy, and hz are the fitting functions of Sx, Sy, and Sz, respectively. The fitting function form that has a better fitting result on scatter points is chosen among various fitting function forms.
S4: The three-dimensional surfaces formed by hx, hy, and hz are projected onto the yz, xz, and xy planes to form projection images, respectively. The projection images on the yz, xz, and xy planes are denoted by Hx, Hy, and Hz, respectively. Here, Hx, Hy, and Hz are three-dimensional tensors for damage pattern identification.
3.2 Damaged region location method
The damage pattern construction method has been described in Subsection 3.1. The CNN-based damaged region location method is developed in this section. Herein, v damage patterns are constructed according to Subsection 3.1 to simulate the damage of the structure. For the vth damage pattern, the member damage is simulated by reducing the elastic moduli of kv randomly selected members to a fixed value. The position vector of damaged members and the set of damage patterns DPC can be written as
where nf is the number of selected vibration modes, dir represents the x, y, or z direction, is the vth input three-dimensional tensor, and is the nfth three-dimensional tensor damage pattern in the x, y, or z direction for the position vector of damaged members.
Assuming that the SLRS can be divided into na substructure regions under a certain partitioning scheme, and there exist combinations of damaged substructure regions when the SLRS has na substructure regions. Based on the s types of damage substructure regions combinations, s types of subsets can be constructed in the damage pattern set DPC:
where is the damage pattern subset under the sth damage substructure combination, which includes a series of damage patterns.
Based on the DPC in Eq. (22), the corresponding damaged region location set OPC can be written as
where is the damaged substructure region location vector, is the damage state of nath substructure region. It should be noted that = 1 when the nath substructure region occurs damage, and = 0 indicates that the nath substructure region is undamaged.
Similarly, s types of subsets can be constructed in the corresponding damaged region location set OPC:
where is the corresponding output subset under the sth damage substructure combination, which includes a series of corresponding damaged substructure region location vectors.
According to Eqs. (24) and (27), the damage pattern data set and corresponding damaged region location data set can be divided into different categories, and the CNN can be used to complete the classification task. In the training process, the CNN is trained through DPC and OPC. Using the trained CNN, the damaged substructure regions can be identified.
3.3 Cross-partitioning damaged region location
The partitioning scheme of substructures is flexible, and SLRS can be divided into many different substructures by using different partition methods. A cross-partitioning method for determining the damaged region location of an SLRS is proposed in this section, which can significantly narrow down the range of possible damaged locations in the structure.
If it is assumed that the SLRS can be divided into substructures by using t types of partitioning schemes, and the SLRS can be divided into na,t substructure regions for the tth type of partitioning scheme, then the combination of damaged substructure regions st can be calculated as . For the tth partitioning scheme, the damage pattern data set DPC,t and corresponding damaged region location data set OPC,t including st subsets can be constructed according to Eqs. (24) and (27). A series of damage pattern sets with different subsets can be constructed using similar methods; the damage pattern set can be expressed as DPC,1,DPC,2,…,DPC,t, and the corresponding damaged location set can be expressed as OPC,1,OPC,2,…,OPC,t.
For DPC,1,DPC,2,…,DPC,t and corresponding OPC,1,OPC,2,…,OPC,t, a total of t CNNs f1,f2,...,ft are trained for classification, in which CNN ft can classify the stth type of damage substructure regions for the tth type of partitioning scheme.
In the damaged region location determination stage, an observed damage pattern DP is input into the trained CNN classifier. The classification results of the CNN can be output as a probability vector by the SoftMax layer:
where is the probability of damage occurring in the stth substructure region combination under the tth type of partitioning scheme.
The damaged region locations determined by different CNNs usually have common regions. The final damaged region location is the intersection region of all identified damaged region locations. During the calculation of first-time cross-partitioning, the damaged region location probability matrix M1 in each region can be written as , then the M1 is changed into row vector . When the second-time cross-partitioning is calculated, the probability matrix of the second-time cross-partitioning damaged region location can be written as . The above cross-partitioning calculation is performed in order.
After cross-partitioning calculations are performed t times, the probability of damage in each region can be written as . In , the regions corresponding to values close to 0 are considered undamaged, while other regions are considered damaged areas.
The cross-partitioning damaged region location method for an SLRS is described in Fig.1. This method involves the use of multiple CNNs connected in parallel, and the classification result vectors are multiplied t–1 times. A large region of potential damage as determined by a single CNN can be reduced to a smaller region when multiple CNNs are used in parallel. Since the number of potentially damaged members and the number of corresponding damage patterns can be reduced, it is easier to determine the more precise damage location and degree of members in the smaller region.
3.4 Convolutional neural network
The CNN is used to identify damage patterns to locate the damaged region locations, and it consists of convolution, pooling, fully connected (FC), and SoftMax layers. This section presents technical details of the CNN.
3.4.1 Convolutional layer
The convolution layer is used to compute the convolution of the input data and convolutional kernel. The convolution kernel slides over the input data with a fixed stride and convolves with the input data. The feature map matrix can be calculated after the convolution operation by using Eq. (30). Subsequently, the bias matrix is added to the feature map, as shown in Eq. (31).
The operations performed for the convolutional layer are as follows:
where is the feature map matrix; is the convolution kernel; is the input data matrix; represents the convolution operation; Mc and Nc are the horizontal and vertical dimensions of the two-dimensional convolution kernel, respectively; ic and jc represent the row number and column number of the elements in the matrix Ic, respectively; mc and nc represent the row number and column number of the elements in the matrix Kc, respectively; Sc is the biased characteristic mapping matrix; bc is the bias matrix.
3.4.2 Pooling layer
The pooling layer can be divided into two layers: the maximum pooling layer and average pooling layer. If operations such as classification are performed directly after the convolutional layer, overfitting is likely to occur. The pooling layer is introduced to avoid this problem. The pooling layer resamples the feature map matrix extracted by the convolutional layer, and it compresses the feature map matrix to reduce the number of parameters in the CNN. The pooling layer can also enhance the anti-interference ability of CNN.
3.4.3 Local response normalization layer
The local response normalization (LRN) layer is added to some CNN models, and it is designed to increase the convergence speed and generalization ability. The calculation of the LRN layer can be expressed as
where is the activity of a neuron computed by applying convolutional kernel i at position (x,y) and then applying the ReLU nonlinearity, and is the response-normalized activity. The summation is over nk “adjacent” kernel maps at the same spatial position, and Nk is the total number of kernels in the layer. The constants k, nk, α, and β are hyperparameters whose values are determined using a validation data set.
3.4.4 Architecture of convolutional neural network
On the basis of the aforementioned layers of CNNs, some CNN architectures have been proposed, such as AlexNet [39], VGG [40], GoogLeNet [41], and ResNet [42]. This section describes the use of the GoogLeNet architecture to identify the damaged region location.
The network structure of GoogLeNet is shown in Fig.2(a). Conv represents the convolutional layer, Maxpool represents the max pooling layer, LRN represents the LRN layer, Avepool represents the average pooling layer, FC represents the FC layer, and SoftMax represents the SoftMax layer. In addition, the inception layer is added in GoogLeNet (Fig.2(b)). After the convolution and pooling layers, the ReLU function is selected as the activation function.
Two auxiliary classifiers, shown in Fig.2(c), are added in the middle layer of the GoogLeNet model. The model can well differentiate the features in the lower layers when calculating the loss, and this capability accelerates the convergence of the entire network. At inference time, these auxiliary classifiers are discarded. The losses of the two auxiliary classifiers are multiplied by weight factors during training and added to the overall loss of the network before backward propagation. The identification of damaged region location is a classification task, so cross entropy loss function is selected in CNN. In the training process, the weights and biases are updated iteratively through the back propagation algorithm to reduce the loss function. The loss function of the network is given by
where LL is the loss of the network during training; L1 and L2 are the losses of auxiliary classifiers 1 and 2, respectively; L3 is the loss of the main network.
The hyperparameters of the CNN are introduced in Subsection 6.2.
4 Damaged member identification for single-layer reticulated shell by using a fully connected layer neural network
According to Eq. (17), the vector DS is related to the damage location and degree. Hence, part of the normalized vibration mode variation and circular frequency variation in DS can be selected to construct an input vector. The input in the input layer of the neural network is denoted as I = DS (nf,nω), and it can be calculated as
where I is a vector of length r, with r = m × nf + nω, and nω is the number of natural frequencies selected in the calculation.
The output layer in the neural network is denoted as p, and it can be expressed as
where p is a vector with length nm, nm is the total number of members in the SLRS, ne is the number of members in the damaged region location, and is the damage degree of the neth member. The zero values in p represent the corresponding members are in undamaged states.
After the cross-partitioning damaged region location process, the size of the region that may be damaged is reduced. The FCNN is adopted for the damaged member identification. For the determined damaged region location, a total of w damage patterns are constructed to simulate the damages of the structure according to Eq. (34). The member damage is simulated by reducing the elastic moduli of kw randomly selected members from the ne members by for the wth damage pattern. The position vector of damaged members and the set of damage patterns DPNN can be written as
where is the kwth damaged member for the wth damage pattern, is the wth input vector of the FCNN, and are the normalized vibration mode variation and circular frequency variation for the position vector of damaged members and the elastic modulus reduction , respectively.
According to Eq. (35), the set of corresponding outputs OPNN can be written as
where is the wth output vector of the FCNN and is the damage degree of the neth damaged member.
In the training process, the FCNN is trained through DPNN and OPNN. In the damaged member determination stage, an observed damage pattern Iob is input into the trained FCNN, and the identification results pid can be output by the FCNN, then the damaged member and damage degree can be determined by pid.
The corresponding FCNN is then constructed to identify the damaged members and their damage degree in the smaller identified damaged region location. The architecture of the FCNN is shown in Fig.3, and mean squared error loss is used for the loss function in FCNN. Since the size of the damaged region location determined by the cross-partitioning damaged region location method is different in different damage patterns, the number of layers and neurons of the FCNN shown in Fig.3 may need to be different in order to achieve more precise identification results. We discuss in detail the hyperparameter of the FCNN used to identify damage elements in Subsection 6.3.
5 Damage identification process of single-layer reticulated shell
Combining the damaged region location method in Section 3 and the damaged member identification method in Section 4, the damage identification process of SLRS is shown in Fig.4.
In the damage identification process, the cross-partitioning damaged region location method is used to narrow down the range of possible damaged locations. After determining the damaged location, the damaged members can be identified and their damage degree can be determined.
A Kewitt-6 spherical SLRS model is used to verify the proposed damaged region location method and damaged member identification method. Model development and modal analysis of the structure are performed in ANSYS 18.0 [43], and the methods proposed in Sections 3 and 4 are used to identify the damage of the SLRS.
The Kewitt-6 SLRS has a span of 30 m, and there are 5 rings in the radial direction. The height-to-span ratio of the structure is 0.25, and the dead load is 1.5 kN/m2. The structure is made of 6061-T6 aluminum alloy with a yield stress of f0.2 = 240 MPa, an elastic modulus of E = 70 GPa, and a density of ρ = 2700 kg/m3. All the sectional dimensions of the member in SLRS are H250 mm × 150 mm × 8 mm × 10 mm. In the finite element model, pinned supports are positioned at the periphery, and element BEAM188 is used to simulate the structural members. Note that we ignore the contribution of the roof panel to the overall stiffness of the structure, yet the roof panel is simplified as concentrated mass at joints; the MASS21 element is used to simulate the mass. The finite element model of the SLRS is shown in Fig.5. The joint numbers and the member numbers are shown in Fig.6.
6.2 Cross-partitioning damaged region location
If it is assumed that damage only occurs at the members and the support always remains intact, 30 members at the outermost ring cannot affect the natural frequency and vibration mode of the structure. To simulate the random damage pattern of the structure, we randomly selected 1−10 members from the remaining 210 members to apply the damage. The elastic modulus of the damaged member is uniformly reduced by 20%. Vertical vibration modes dominate the structure because of the small height-to-span ratio. The first and second vibration modes in the z direction are selected to construct the damage pattern set. The calculation generates a total of 84000 damage patterns according to the process described by S1−S4 in Subsection 3.1. Combining Eqs. (21)–(23), we can write the input of the CNN as
The spherical SLRS in Fig.5 is divided using two types of substructure partition schemes, which are named Sub1 and Sub2 in Fig.7.
In Fig.7, the SLRS with Sub1 is divided into six fan-shaped substructures of the same size, and the SLRS with Sub2 is divided into five ring-shaped substructures of similar size. The data set of damage patterns can be written with form of subsets according to Eq. (24):
Two types of substructure division schemes are adopted for the spherical SLRS, and hence the corresponding data set of output OPC is of two types:
where and are the damaged substructure region location vectors for the partitioning schemes Sub1 and Sub2, respectively.
Similarly, the corresponding damaged region location data sets can be written with form of subsets according to Eq. (27):
For DPC,1, DPC,2, OPC,1, and OPC,2, CNN1 and CNN2 are trained for classification. The CNN1 can perform 63 types of damaged substructure region classifications for the fan-shaped substructure, and CNN2 can perform 31 types of classification of damaged substructure region for the ring-shaped substructure. The CNNs are trained in a Python environment using Keras [44], a high-level open-source deep-learning library built on top of Tensorflow [45]. The simulations described in this paper are performed on a PC with one Intel Core i5-9400F CPU and one NVIDIA RTX A4000 GPU card. In the training process, CNN1 is trained through DPC,1 and OPC,1, and CNN2 is trained through DPC,2 and OPC,2.
Based on Ref. [41] and the constructed damage data set, trial training is conducted for CNN1 and CNN2 to adjust hypermeters. The following hyperparameters of CNN with higher damage region classification accuracy are obtained, and the hyperparameters of Fig.2 are listed in Tab.1.
The hyperparameters of the CNN models used in this study are as follows.
1) The size of the input matrix is 224 × 224 × 3.
2) The dropout rates [46] in auxiliary classifiers 1 and 2 are set as 0.5. The dropout rate in the main network is set as 0.4.
3) On the basis of Eq. (32), The hyperparameters k, nk, α, and β in the LRN layer are set to be 2, 5, 0.0001, and 0.75, respectively.
4) The adaptive momentum estimation (Adam) [47] is selected as the optimizer, and its learning rate and decay rate are 3 × 10−4 and 5 × 10−6, respectively.
5) Batch normalization is used during the training process, and the batch size is 32. The training epoch is set to be 300.
To avoid high variance and bias in the generated results, the K-fold cross validation is used to evaluate the CNN model more objectively. A 10-fold cross validation is used to randomly shuffle the data set and divide it into 10 mutually exclusive subsets of the same size. Each time, the current subset is used as the validation data set and all the remaining subsets are used as the training data set for training and evaluation of the model. The ratio of the training data set size to the validation data set size for each cross validation is 9:1, and the validation data set is not used for training the CNNs during each training process.
The accuracy curves and loss curves in validation data set of CNN1 and CNN2 in the first fold cross validation are shown in Fig.8. After 300 epochs of training, the CNN can successfully extract the damage characteristics of the spherical SLRS when different damage patterns occurred. When the accuracy and loss of each cross validation tend to stabilize, the average validation accuracy during stability of each cross validation is taken as the accuracy of each cross validation. The validation accuracy of each cross validation, NK, in 10-fold cross validation and the average accuracy of the 10-fold cross validation are shown in Fig.9. According to Fig.9, the average accuracies of CNN1 (Sub1) and CNN2 (Sub2) reached 99.3% and 98.4%, respectively, for the validation data set. From the classification results for the validation data set, we can assume that the accuracy of damaged region location after one-time cross-partitioning for the validation data set is 99.3% × 98.4% = 97.7%. Then, the weights and hyperparameters of the trained CNN1 and CNN2 are used to test the accuracy of damaged region location on the testing data set.
For the trained CNN1 and CNN2, an additional 8400 testing damage patterns are constructed to input into CNN1 and CNN2, using the same process of constructing the training data set. Additionally, it should be noted that the 8400 damage patterns in the testing data set are not included in the damage pattern data set for training CNN1 and CNN2. The testing accuracies of CNN1 and CNN2 are 98.7% and 97.7%, respectively, indicating that CNN1 and CNN2 still maintain high accuracy in identifying unknown damage patterns. The one-time cross-partitioning for the testing data set is 98.7% × 97.7% = 96.4%. The testing results indicate that CNN1 and CNN2 have good generalization ability and can be used for location of damaged regions.
The six damage patterns shown in Tab.2 are selected in the testing data set as observed damage patterns, and the damage degree of selected members in Tab.2 are all set to 20%. The plane projection images corresponding to Tab.2 are shown in Fig.10. The cross-partitioning damaged region location method is used to determine the corresponding damaged region locations of the observed damage modes using trained CNN1 and CNN2. The plane projection image on the right side of Fig.10 is input into the trained CNN1 and CNN2. The CNN1 and CNN2 cross-partitioned the damaged region locations by using the cross-partitioning damaged region location method shown in Fig.1, and the damaged region location results are shown in Tab.2 and Fig.11. In Tab.2, result 1, result 2, and result 3 are the output identified damaged region locations of CNN1 and CNN2; F1, R3, and F1R3 represent the damaged region location is the first fan-shaped substructure, the third ring-shaped substructure, and the intersection region of the first fan-shaped substructure and the third ring-shaped substructure respectively. The cross-partitioning damaged region location results are shown in blue, yellow, and red colors in Fig.11, respectively.
The total number of members , , and in the damaged region location, obtained by CNN1, CNN2, and the cross-partitioning location method, are listed in Tab.3. With the cross-partitioning location method, the number of members in the located damaged region decreases to 18.3%−53.3% of the number of members obtained by the two CNNs. In particular, the accuracy loss of the cross-partitioning is acceptable. The identified damaged region location can be significantly reduced to a smaller region when the cross-partitioning damaged region location method is used. Hence, the cross-partitioning damaged region location method proposed in this paper can locate the damaged region of an SLRS more precisely.
6.3 Damaged member identification
Before identifying damaged members, it is necessary to determine the hyperparameters of FCNN with architecture shown in Fig.3. The following steps are adopted to determine the hyperparameters of the FCNN.
P1: The cross-partitioning region is constructed by randomly selecting fan-shaped and ring-shaped substructure in Subsection 6.2. The number of members in the cross-partitioning region is . The determination process of can refer to Fig.11 and Tab.3.
P2: For a determined , a total of 5000 damage patterns are constructed to simulate the damage of the spherical SLRS. Furthermore, 1−10 members are randomly selected from the members. Then, randomly select a reduction percentage for the elastic modulus of the selected members from the options of 5%, 10%, 15%, 20%, 25%, or 30%. The first and second vibration modes in the z direction and the first and second circular frequencies are selected to construct the damage pattern set. Equations (36)–(40), the set of damage patterns DPNN and the corresponding set of outputs OPNN can be written as
P3: The number of FC layers is set to 2, 3, and 4, and 100, 200, 300 neurons per layer are selected to construct different FCNNs, to generate 9 different types of FCNN.
P4: The training and testing sets are randomly divided in an overall 8:2 ratio, to train the FCNNs. When the loss of FCNN tends to stabilize, the average value of the testing loss during the stable period is taken as the loss of FCNN.
P5: The steps of P1−P4 are repeated 20 times to obtain the loss value of FCNN under different damage regions and different network hyperparameters.
The other hyperparameters of the FCNN are set as follows.
1) The dropout rate in the network is set as 0.4.
2) The Adam is selected as the optimizer with a learning rate of 3 × 10−3 and a decay rate of 5 × 10−5.
3) The training epoch is set as 300.
For each selected cross-partitioning region, a surface is used to fit the relationship between testing loss, number of neurons, and number of layers. The surfaces determined by loss, number of neurons, and number of layers under 20 different cross-partitioning regions are drawn simultaneously in Fig.12, showing similar regular variation. In Fig.12, when using more than 3 FC layers and more than 200 neurons per layer, FCNN achieved lower loss for identifying damaged members in different cross-partitioning regions. Based on the above calculation results, the FCNN with 4 FC layers and 300 neurons per layer is finally determined for damaged member identification. The determined FCNN has lower loss values in different damage regions, which indicates that the selected FCNN has better performance in identifying damaged members. All the parameters of each layer in the FCNN are shown in Tab.4.
The damage patterns shown in Tab.2 are used to further identify the damage degree of members in determined damaged region locations. For the cross-partitioning damaged region location determined in Fig.11, the FCNN with same hyperparameters are used to further identify the damaged members and their damage degree. The total number of members in patterns 1−6 is (Tab.3). The input data set DPNN and output data set OPNN are constructed using the operations described in P2. The input and output data set in patterns 1−6 are denoted by, ,…, and ,…, for distinguishing between the inputs and outputs. Correspondingly, the trained FCNNs for identifying damaged members in patterns 1−6 are labeled FC1, FC2, …, FC6, respectively.
The 10-fold cross validation is also used to randomly shuffle the above data set and divide it into 10 mutually exclusive subsets of the same size. The accuracy curves and loss curves of FC1−FC6 in the 1st fold cross validation are shown in Fig.13. The average validation accuracy during the stability stage of each cross validation is also taken as the accuracy of each cross validation, and the validation accuracy of each cross validation in 10-fold cross validations are shown in Fig.14. According to Fig.14, the average validation accuracy of FC1−FC6 is in the range of 95.7%−96.8%. The weights and hyperparameters of the trained FC1−FC6 are then used to test the accuracy of damaged member identification on the testing data set.
For the trained FC1−FC6, a further 500 testing damage patterns are constructed using the same process of constructing a training data set for each FCNN. Each testing data set is not included in the training data set for FC1−FC6. The testing accuracies of FC1−FC6 are 94.4%, 96.0%, 95.8%, 95.0%, 95.4%, and 96.0%, respectively. The testing results indicate that FC1−FC6 have good generalization ability and can be used for damaged member identification in determined damaged region locations. For the observed damage patterns 1−6 in Tab.2, the vectors input to FCNNs can be constructed according to Eq. (52). Then the trained FC1−FC6 can output the damage degree of each member in the damaged region locations. The six damage patterns are also not included in the training data sets of FC1−FC6. The identification results are shown in Tab.5 and Fig.15. In Tab.5, results 1 represents the damaged member location results, and results 2 represents the damage degree identification results. The damaged members are marked with blue lines in Fig.15. For damage patterns 1−6, the location and damage degree of the damaged member are precisely identified. Herein, we have completed precise identification of damaged members of SLRS from the prior identification of damaged region locations.
7 Conclusions
A cross-partitioning damaged region locating method based on a CNN is proposed. The method can identify complex damage patterns and significantly narrow down the specific location within a region of possible damage. A FCNN is constructed to identify the damaged members and their degree of damage in the narrowed-down damaged region location. The effectiveness of the damaged region location method and damaged member identification method is verified using a spherical SLRS model. The conclusions of this study are as follows.
1) Damage localization and damage degree indices are proposed for an SLRS, and they can be used to determine the damage location and damage degree, respectively. These two indices can be used to construct damage patterns for training neural networks.
2) A CNN-based damaged region location method for SLRSs is proposed. Furthermore, a cross-partitioning damaged region locating method involving the use of parallel CNNs is proposed; the method can significantly narrow down the possible damaged region to a more precise location within the structure.
3) Owing to the significant reduction in the range of possibilities of damage patterns after narrowing down the location of the damaged region, the damage localization and damage degree identification of members can be performed by the FCNN.
4) The damaged region localization method and damaged member identification method are verified using a spherical SLRS. The cross-partitioning damaged region location method can successfully locate the damaged region under complex conditions. With the cross-partitioning location method, the number of members in the located damaged region decreased to 18.3%−53.3% of the value for the single CNN location results. After localizing the damaged region, the corresponding FCNN can further precisely identify the damage location and damage degree of members.
When using neural networks for damage identification, it is necessary to construct a large number of damage patterns; this relies on an accurate numerical model. Further research is still needed on how to modify the numerical model to be as close as possible to the behavior of the actual structure. In addition, the accuracy of SLRS modal identification directly affects the accuracy of damage identification during the damage identification process. The study of modal parameter identification methods applicable to SLRS is also a topic that needs to be studied in the future.
Carden E P, Fanning P. Vibration based condition monitoring: A review. Structural Health Monitoring, 2004, 3(4): 355–377
[2]
Li Y Y, Cheng L, Yam L H, Wong W O. Identification of damage locations for plate-like structures using damage sensitive indices: Strain modal approach. Computers & Structures, 2002, 80(25): 1881–1894
[3]
Lu Q, Ren G, Zhao Y. Multiple damage location with flexibility curvature and relative frequency change for beam structures. Journal of Sound and Vibration, 2002, 253(5): 1101–1114
[4]
Yang Q W, Sun B X. Structural damage localization and quantification using static test data. Structural Health Monitoring, 2011, 10(4): 381–389
[5]
Truong T T, Dinh-Cong D, Lee J, Nguyen-Thoi T. An effective deep feedforward neural networks (DFNN) method for damage identification of truss structures using noisy incomplete modal data. Journal of Building Engineering, 2020, 30: 101244
[6]
Mishra M, Lourenco P B, Ramana G V. Structural health monitoring of civil engineering structures by using the internet of things: A review. Journal of Building Engineering, 2022, 48: 103954
[7]
Fan W, Qiao P Z. Vibration-based damage identification methods: A review and comparative study. Structural Health Monitoring, 2011, 10(1): 83–111
[8]
Reynders E, Houbrechts J, de Roeck G. Fully automated (operational) modal analysis. Mechanical Systems and Signal Processing, 2012, 29: 228–250
[9]
Hou R R, Xia Y. Review on the new development of vibration-based damage identification for civil engineering structures: 2010–2019. Journal of Sound and Vibration, 2021, 491: 115741
[10]
Magalhães F, Cunha A, Caetano E. Vibration based structural health monitoring of an arch bridge: From automated OMA to damage detection. Mechanical Systems and Signal Processing, 2012, 28: 212–228
[11]
BollerCChangF KFujinoYeds. Encyclopedia of Structural Health Monitoring. Hoboken, NJ: John Wiley and Sons, 2009
[12]
Deraemaeker A, Reynders E, de Roeck G, Kullaa J. Vibration-based structural health monitoring using output-only measurements under changing environment. Mechanical Systems and Signal Processing, 2008, 22(1): 34–56
[13]
Guo H, Zhuang X, Rabczuk T. A deep collocation method for the bending analysis of Kirchhoff plate. Computers, Materials & Continua, 2019, 59(2): 433–456
[14]
Anitescu C, Atroshchenko E, Alajlan N, Rabczuk T. Artificial neural network methods for the solution of second order boundary value problems. Computers, Materials & Continua, 2019, 59(1): 345–359
[15]
GuanDLiJChenJ. Optimization method of wavelet neural network for suspension bridge damage identification. In: Proceedings of the 2016 2nd International Conference on Artificial Intelligence and Industrial Engineering. Paris: Atlantis Press, 2016, 194–197
[16]
Guo H, Zhuang X, Chen P, Alajlan N, Rabczuk T. Stochastic deep collocation method based on neural architecture search and transfer learning for heterogeneous porous media. Engineering with Computers, 2022, 38(6): 5173–5198
[17]
Guo H, Zhuang X, Chen P, Alajlan N, Rabczuk T. Analysis of three-dimensional potential problems in non-homogeneous media with physics-informed deep collocation method using material transfer learning and sensitivity analysis. Engineering with Computers, 2022, 38(6): 5423–5444
[18]
Samaniego E, Anitescu C, Goswami S, Nguyen-Thanh V M, Guo H, Hamdia K, Zhuang X, Rabczuk T. An energy approach to the solution of partial differential equations in computational mechanics via machine learning: Concepts, implementation and applications. Computer Methods in Applied Mechanics and Engineering, 2020, 362: 112790
[19]
Zhuang X, Guo H, Alajlan N, Zhu H, Rabczuk T. Deep autoencoder based energy method for the bending, vibration, and buckling analysis of Kirchhoff plates with transfer learning. European Journal of Mechanics A-Solids, 2021, 87: 104225
[20]
Sony S, Dunphy K, Sadhu A, Capretz M. A systematic review of convolutional neural network-based structural condition assessment techniques. Engineering Structures, 2021, 226: 111347
[21]
Ahmed B, Mangalathu S, Jeon J S. Seismic damage state predictions of reinforced concrete structures using stacked long short-term memory neural networks. Journal of Building Engineering, 2022, 46: 103737
[22]
Zhao B X, Cheng C M, Peng Z K, Dong X J, Meng G. Detecting the early damages in structures with nonlinear output frequency response functions and the CNN-LSTM model. IEEE Transactions on Instrumentation and Measurement, 2020, 69(12): 9557–9567
[23]
Yang X C, Li H, Yu Y T, Luo X C, Huang T, Yang X. Automatic pixel-level crack detection and measurement using fully convolutional network. Computer-Aided Civil and Infrastructure Engineering, 2018, 33(12): 1090–1109
[24]
Lin Y, Nie Z, Ma H. Structural damage detection with automatic feature-extraction through deep learning. Computer-Aided Civil and Infrastructure Engineering, 2017, 32(12): 1025–1046
[25]
Duan Y F, Chen Q Y, Zhang H M, Yun C B, Wu S K, Zhu Q. CNN-based damage identification method of tied-arch bridge using spatial-spectral information. Smart Structures and Systems, 2019, 23(5): 507–520
[26]
Han Q H, Liu X, Xu J. Detection and location of steel structure surface cracks based on unmanned aerial vehicle images. Journal of Building Engineering, 2022, 50: 104098
[27]
Cha Y J, Choi W, Büyüköztürk O. Deep learning-based crack damage detection using convolutional neural networks. Computer-Aided Civil and Infrastructure Engineering, 2017, 32(5): 361–378
[28]
Kim J, Ho D, Nguyen K, Hong D, Shin S W, Yun C B, Shinozuka M. System identification of a cable-stayed bridge using vibration responses measured by a wireless sensor network. Smart Structures and Systems, 2013, 11(5): 533–553
[29]
Sajedi S O, Liang X. Vibration-based semantic damage segmentation for large-scale structural health monitoring. Computer-Aided Civil and Infrastructure Engineering, 2020, 35(6): 579–596
[30]
Liang X. Image-based post-disaster inspection of reinforced concrete bridge systems using deep learning with Bayesian optimization. Computer-Aided Civil and Infrastructure Engineering, 2019, 34(5): 415–430
[31]
Rhim J, Lee S W. A neural-network approach for damage detection and identification of structures. Computational Mechanics, 1995, 16(6): 437–443
[32]
Lei Y, Zhang Y, Mi J, Liu W, Liu L. Detecting structural damage under unknown seismic excitation by deep convolutional neural network with wavelet-based transmissibility data. Structural Health Monitoring, 2021, 20(4): 1583–1596
[33]
Cofre-Martel S, Kobrich P, Lopez Droguett E, Meruane V. Deep convolutional neural network-based structural damage localization and quantification using transmissibility data. Shock and Vibration, 2019, 2019: 9859281
[34]
Barraza J F, Droguett E L, Naranjo V M, Martins M R. Capsule Neural Networks for structural damage localization and quantification using transmissibility data. Applied Soft Computing, 2020, 97: 106732
[35]
Ni Y Q, Wang B S, Ko J M. Constructing input vectors to neural networks for structural damage identification. Smart Materials and Structures, 2002, 11(6): 825–833
[36]
QuW LChenWLiQ S. Two-step approach for joints damage diagnosis of framed structures by artificial neural networks. China Civil Engineering Journal, 2003, 36(5): 37-45 (in Chinese)
[37]
Fallah N, Vaez S R H, Mohammadzadeh A. Multi-damage identification of large-scale truss structures using a two-step approach. Journal of Building Engineering, 2018, 19: 494–505
[38]
Guo X N, Zhang J D, Zhu S J, Luo X Q, Xu H J. Damping characteristics of single-layer aluminum alloy reticulated spatial structures based on improved modal parameter identification method. Thin-walled Structures, 2021, 164: 107822
[39]
Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks. Communications of the ACM, 2017, 60(6): 84–90
[40]
SimonyanKZissermanA. Very deep convolutional networks for large-scale image recognition. In: Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015. San Diego, CA: MIT Press, 2015, 1–14
[41]
SzegedyCLiuWJiaY QSermanetPReedSAnguelovDErhanDVanhouckeVRabinovichA. Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Boston, MA: IEEE, 2015, 1–9
[42]
HeK MZhangX YRenS QSunJ. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV: IEEE, 2016, 770–778
[43]
ANSYS. Version 18.0. Canonsburg, PA: Ansys Inc. 2017
[44]
GéronA. Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow. 3rd ed. Sebastopol, CA: O'Reilly Media, 2022
[45]
Abadi M. TensorFlow: Learning functions at scale. ACM SIGPLAN Notices, 2016, 51(9): 1–1
[46]
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 2014, 15: 1929–1958
[47]
KingmaD PBaJ. Adam: A method for stochastic optimization. 2014, arXiv: 1412.6980
RIGHTS & PERMISSIONS
Higher Education Press
AI Summary 中Eng×
Note: Please be aware that the following content is generated by artificial intelligence. This website is not responsible for any consequences arising from the use of this content.