A Data-Driven Approach for Linear and Nonlinear Damage Detection Using Variational Mode Decomposition and GARCH Model

In this article, an original data-driven approach is proposed to detect both linear and nonlinear damage in structures using output-only responses. The method deploys variational mode decomposition (VMD) and a generalised autoregressive conditional heteroscedasticity (GARCH) model for signal processing and feature extraction. To this end, VMD decomposes the response signals into intrinsic mode functions (IMFs). Afterwards, the GARCH model is utilised to represent the statistics of IMFs. The model coefficients of IMFs construct the primary feature vector. Kernel-based principal component analysis (PCA) and linear discriminant analysis (LDA) are utilised to reduce the redundancy of the primary features by mapping them to the new feature space. The informative features are then fed separately into three supervised classifiers, namely support vector machine (SVM), k-nearest neighbour (kNN), and fine tree. The performance of the proposed method is evaluated on two experimentally scaled models in terms of linear and nonlinear damage assessment. Kurtosis and ARCH tests proved the compatibility of the GARCH model.


INTRODUCTION
Today's current structural engineering industry requires consideration to be directed towards structural health monitoring (SHM) and optimizing safety.With forecasts of increasing worlds' population, structural infrastructure shall be subject to increased loading and deformation.To decrease the effects and consequences of structural deterioration, SHM processes are required more frequently, with high levels of accuracy necessary to achieve asset preservation.Hence, there has been a surge in interest surrounding SHM and the development of automated defect evaluation systems in an attempt to maintain existing structural networks and allow for asset expansion.
Concerning structural behavior, damage leads to deviations in the structure's dynamic characteristics and is considered a reliable indication of anomaly diagnosis.Also, it might cause a system with a typically linear behavior to demonstrate nonlinear responses, including cracking, impacts and rattling, delamination, stick or slip, rub, or deformation in connections [1,2].
Nonlinear behavior is supposed to be unpredictable and more sophisticated compared to the linear one.As a case in point, it has been proven through experimental investigation that natural frequencies could rise instead of decrease on breathing phenomena [3].This reaction originates from the fact that the crack conversely opens and closes in the experimental test.Subsequently, the detection of nonlinear anomalies is considered more challenging compared to linear damages [4].
Over decades, researchers proposed several techniques in terms of anomaly identification.
Generally speaking, such methods are divided into physics-based (or model-based) and datadriven approaches [5].In the physics-based, anomalies are tracked utilizing monitoring variations within the simulated responses from the structural numerical model [6].This model is a detailed mathematical abstraction linking a studied system's input and output variables employing known or presumed properties [7].Post analysis is demanded for determining damage location and qualification.Finite element methods (FEMs), boundary element methods (BEMs), and spectral finite element methods (SFEMs) are some of the techniques used in this regard.However, FEMs are considered the systematic method compared to the others due to their compliance in modeling complicated structures [8].In the occurrence of damage, particular parameters of the simulated models are updated according to response measurements.Optimization algorithms are typically Submitted Journal: Engineering with Computers, Springer 3 used to minimize variations between experimental and numerical responses by comparing mechanical characteristics of stiffness, damping, or mass [6].
Despite the broad potential of physics-based approaches in damage assessment, especially for the evolution of complex systems such as multi-stories buildings and multi-span bridges, they have some limitations.For example, exact modeling of a structure entails sufficient information regarding different components of a monitored system, such as loading states, boundary conditions, material properties, and precise coordinates of members.Moreover, optimization solutions commonly experience numerical instability as well as ill-conditions dilemma [9].The performance of such optimization techniques substantially degrades proportionally to the number of variables in the problem.
On the other side, data-driven SHM provides bottom-up solutions founded on tracking changes within the output signals appropriate for complex systems where the knowledge about geometries, properties, and initial conditions is limited [5].Any sudden changes in the output signals are Data-driven methods are helpful compared to physics-based techniques when [11], first, the structure's physical characteristics are unavailable or challenging to be modeled.Secondly, there are an adequate amount of sensors installed for capturing the structure's responses.Thirdly, the computational operations are costly in the SHM project; in addition, multi-physics models consist of more physical processes in a system (e.g., thermal interactions, water precipitation, magnetostatic and chemical reactions) may not seem efficient for utilizing a large amount of sensor data.The accuracy of physics-based depends on the response measurements; the best performance is achieved in an environment with the slightest noise.In real-world structures and especially for in-servicing conditions, however, the amount of noise is considerable.As such, data-driven damage identifications deploying actual responses have revealed preferable adaptability and thereby turned into an inspiring solution in the realm of SHM [10].

Need For Research
Although nonlinear damage has been studied before and practical solutions are proposed in this realm, most focus on damage identification as the first level based on Rytter's classification levels in SHM [12].Hence, limited research has been conducted to reach higher levels (e.g., damage localization and classification).This study attempts to address nonlinear damage detection in building structures through a robust data-driven approach.Adverse conditions such as environmental and operational effects in recording responses and analyzing signals are the other crucial points that should be considered.These issues become more though in the case of buildings where the story correlations can affect the structural responses.Therefore, proposing a robust model with appropriate precision in identifying different kinds of linear and nonlinear anomalies considering these issues leads to a practical approach in assessing real-world structures under adverse conditions.Accordingly, the rest of the paper is organized as follows.In Section 2, related works are discussed, and gaps are highlighted once again.Case studies are presented in detail in Section 3. Section 4 provides the details of the proposed data-driven approach.Experimental results and discussion are given in Section 5. Finally, Section 6 concludes the work and suggests future directions.

III. BACKGROUND
Signal processing techniques play a fundamental role in data-driven SHM for analysis responses in time, frequency, or time-frequency domains.Fourier spectra, spectrum analysis, difference frequency analysis, and the high-frequency resonance technique are appropriate for damage identification, especially for gear faults and roller bearings [13].Wavelets proved the efficiency for damage and deterioration detection in building structures based on a stochastic approach [14].
Fourier transform (FT) and fast Fourier transform (FFT) are considered the main concepts for anomaly detection.A time series model is a promising tool for simulating and predicting structural signals in the time domain.Since this method is based on a partial structural dynamics model, tit can identify even a small number of vibrations [15].In this area, autoregressive (AR) models are investigated for damage and deterioration detection in buildings and bridges [16][17][18].Autoregressive and moving average model (ARMA), as well as generalized autoregressive conditional heteroscedasticity model (GARCH), have proved to be beneficial for nonlinear damage identification in building specimens [19].Transient behaviors caused by damage or adverse environmental conditions can be recognized through a signal's time-frequency form [20].
In a broad perspective, the real-world signals are linear and stationary and are coupled with noise.
Consequently, linear signal processing techniques, such as spectral analysis, are not appropriate in this realm of scope [21].Hilbert-Huang transform (HHT), introduced by Huang et al. [22], consists of two sequential steps.The first step, called empirical mode decomposition (EMD), separates the complicated initial signal into a determined and commonly limited number of intrinsic mode functions (IMFs) or modes.Each mode is an oscillatory function with time-varying frequencies that reveal the input signals' local features and correspond to different frequencies and a residue [23,24].The algorithm detects the maxima/minima recursively, assesses the envelopes using the extrema, and removes the average envelopes, which leads to isolating high-frequency bands.[25].
In the next step, the Hilbert transform (HT) includes each IMF's orthogonal pair with 90 degrees difference in the phase [26].As a result, each IMF set and the corresponding pair can evaluate instant variations of signal magnitude and frequency concerning time.Compared to wavelet analysis and Fourier transform, EMD benefits from tracing out the IMFs by interpolating between the extremums instead of using any given wavelet basis.Despite the wide usage of EMD in a variety of time-frequency applications such as medical [27], economics [28], climate predictions [29], SHM [30,31], and many other fields, it may dace with some issues like sensitivity to noise and sampling frequency which cause the performance relies on the frequency ratio [25,31,32].Some modified algorithms have been developed, including ensemble EMD (EEMD), complete ensemble EMD with adaptive noise (CEEMDAN), and Variational mode decomposition (VMD) [32] to address these limitations.VMD is a relatively new algorithm that decomposes a signal into distinctive amplitude and frequency adjusted sub-signals where together they reproduce the primary input signal [32].This approach is entirely non-recursive, and the sub-signals are extracted simultaneously; it is proven that VMD outperforms the EMD algorithm in various areas such as signals analysis and damage detection.
Variational mode decomposition has been deployed in the real SHM by some researchers.For instance, Bagheri et al. [31] calculated damping ratios for each extracted modal response obtained from VMD.The mode shape vector was obtained for each decomposed structure mode, which was then practiced for damage identification in three specimens, including numerical, experiment, and field case studies.Xin et al. [33] established two damage indices relying on modal parameters obtained from VMD.An experimental and numerical assessment demonstrated the efficiency of the method for nonlinear to find the location and severity of nonlinear damage scenarios in the models.Das and Saha [34] investigated the impact of a heavy noise environment on a new hybrid algorithm using VMD along with frequency domain decomposition (FDD).It was deducted that the hybrid method could detect damage location accurately for noises above 20%.A novel methodology is illustrated and assessed in the following sections on two experimental specimens with linear and nonlinear damage scenarios. IV.

Case STUDIES
In this section, two case studies used in this work are thoroughly explained and discussed.

A. Case study 1: Linear Damage
The first case study is a three-story metal frame with aluminum columns and floors, investigated in linear damage simulation [35].A roller at the base supports the specimen and can move horizontally using a hydraulic jack.Piezoelectric single-axis accelerometers instrument each floor.
Nine linear damage scenarios are imitated employing stiffness reduction of columns and replacement of a 1.

B. Case Study 2: Nonlinear Damage
This case is the adjusted model of the first case study and is used for studying the impact of nonlinear damage.The sampling rate is the same as the linear model and is set to 322.58 Hz with 8192 data points for each record.Ten measurements are recorded for each state.Likewise, in the initial specimen, this frame also glides on rails that enable a transmission in one direction with the aid of an actuator.Four accelerometers with a sensitivity of 1000 mV/g are attached on the opposite side of the shaker at the center of the floors; thus, they do not help determine the specimen's torsion models.
In order to simulate nonlinear damages, a mechanical bumper and a center column are installed onto the frame.This mechanism imitates the breathing crack and will cause nonlinear behaviors in the condition that the installed column hits the bumper, which is placed on the second floor.The adjustable gap between the bumper and the installed column is used for defining different degrees of nonlinearity.Hence, the larger the gap is, the smaller the nonlinear behavior becomes.The specimen's outline and the damage scenarios are provided in Fig. 3 and Table 2, respectively.Some recorded nonlinear signals are given Fig. 4, where 1 () yt, 2 () yt , 3 () yt, and 4 ()   As noted, two three-story models were presented for linear and nonlinear damage scenarios.Linear damages were simulated by reducing the cross-section area of columns, while nonlinear behavior was considered as hitting a bumper with a mid-column in the second case study.The environmental and operational conditions were also considered by adding a mass to different damage scenarios.
Story accelerations were recorded for damage identification and classification, with a novel methodology discussed in the following section.

V. PROPOSED METHOD
In this work, anomaly detection is performed in three steps.Firstly, VMD decomposes the signal into several sub-signals with separated bandwidths.Secondly, primary features are extracted using the time-series modeling, and then the number of features is reduced by KPCA and KDA.Finally, three supervised classifiers are separately deployed to discriminate different damage states within three specimens.A schematic workflow of the proposed method is depicted in Fig. 5.In the following, these stages are illustrated thoroughly.

A. Signal Processing
Herein, the input acceleration signals are decomposed using VMD so that an input signal () St is broken down into d limited-bandwidth IMFs depicted as [36]: where () At and () k t present the instantaneous amplitude and frequency of () k ut, respectively.The constructed variational problem is obtained using Hilbert transform as follows: shows the IMFs of signal t S and their center frequencies of each signal sub-band, respectively.Eq. ( 2) is presented in a Lagrange function using and as a multiplier operator and penalty factor, respectively, to solve the optimization problem Afterward, Eq. ( 4) is transformed into the time-frequency space, and the equivalent extremum solution is solved to obtain the frequency domain form of the modal element () k ut as well as the center frequency k :  The Eq. ( 7) is continued till the following criteria are satisfied: Proved that the above condition is met, the iteration procedure stops.Other Herein, the iteration is stopped; otherwise, it returns to step 2, and d IMFs can be extracted [31,36].In Figs.6-9

B.1GARCH modeling of IMFs
Generally speaking, a signal can be modeled via ARMA time series to evaluate the conditional mean.As an illustration, the ARMA(p, q) prediction for the conditional mean is formulated as [37]: where p denotes the autoregressive model order, i presents the autoregressive variable, q stands for the moving average model order, j shows the moving average variable, t denotes the residual, and c is a constant.However, the residual is usually considered to have a mean of zero with constant variance.In some time series, it is not homoscedastic and has no constant variance [37].In this case, the time-varying variance is called conditional variance that is described as: 22 11 var ( ) The GARCH model, established by Bollersl [38], is a dynamic model that addresses the conditional heteroscedasticity or volatility clustering for an innovation process using a weighted combination of past heteroscedasticity functions coupled with the squared residuals of the past.It causes a reduction in the parameters and complexity of the model.A GARCH( , ) rm model for the conditional variance of residual t is formed as: 11 rm t i t j j t j ij ba (11) In which , i b , and j a are the parameters of the GARCH model.Herein, the following constraints are defined to ensure that the conditional variance is positive: Moreover, the following formula is defined to make the covariance stationary: This paper utilizes the GRACH model to create the conditional variance model for IMFs obtained from VMD.The GARCH model showed reliable performance in nonlinear problems, as discussed in [19].The coefficients of GARCH( , ) rm , i.e., {} , is constructed as: , , , , , , , , , , , Finally, since each signal is recorded from several sensors, each record is described with 1 () where n shows the number of sensors and i d stands for the number of IMFs is used to decompose the signal of the ith sensor.Hence, the feature vector of a signal with n sensors is given as . All obtained features are not suitable for classification, and feature vectors may suffer from redundant features.Hence, we should utilize feature reduction techniques to remove such features from the feature vector.

B.2 Feature reduction
The general concept of kernel-based feature reduction is based on deploying a particular sort of nonlinear mapping function to protrude the initial vector f into a high dimensional feature space as F. Regarding the new feature space, the principal components are obtained through the regular Principal component analysis (PCA).In other words, the principal nonlinear components in the initial space correspond to the principal components in feature space F. Afterward, the kernel functions, including polynomial, radial basis function, and sigmoid, are used to perform the nonlinear mapping in KPCA [39].
Assume nonlinear mapping ; the initial data space f n is mapped into a new feature space like as [40]: For a training sample set 12 , ,..., , where M denotes training sample numbers.
Subsequently, the covariance matrix is formulated as [40]: such that Since S it is a bounded, compact, positive, and symmetric matrix, its nonzero values are also positive.For the sake of finding theses nonzero values, Schölkopf et al. [41] suggested linearly express every eigenvector of S by [40]: In order to compute expansion coefficients, the Gram matrix is formed as T R QQ , where Consequently, each component Q is computed by using kernel tricks as [40]: Accordingly, R is centralized by [40]: where Afterward, the orthonormal eigenvectors  (22) After that, the KPCA transformed feature S respectively stand for the between-class, within- class, and total scatter matrices in feature space, which are obtained by the following formulation: Assume that shows the projective function in feature space, the associated objective function in feature space is defined as: This function can be solved by eigenproblem as: And we have: Then, we can define an equivalent problem as:

C. Classification
In the next section, three classifiers are applied to the selected features previously taken and are called predictors.These classifiers are prevailing in the realm of Machine Learning, including support vector machine (SVM), fine tree, and k-nearest neighbor (kNN).SVM is a supervised training algorithm founded on the fact that measurements can be considered two-dimensional space.Each sample denotes a data point in the space and can be separated by a line in the case of a two-dimensional problem and a plane in the case of the dimensional system [43].Regarding kNN, despite its simplicity, it is common in terms of suing in large training datasets.It allocates an estimated value to a new sample on the ground of plurality or weighted of the k nearest neighbors in the training set [44].Classification using a decision tree (fine tree) algorithm is very fast and suitable for high-dimensional classification problems.A fine tree is a predictive algorithm mapping from samples about an item to conclusions about its target value.In this model, leaves represent the labels, nodes are the features, and branches denote the junction of features, resulting in label classification [45].Subsequently, the prediction using these classifiers is compared with each in the following sections.

VI. RESULTS AND DISCUSSION
This section provides the experimental results and relevant discussions.We considered the fivefold cross-validation to assess the performance of the proposed method.To this end, data were randomly partitioned into five equal-sized groups, and then, the training and test procedure was repeated for five trials.One group was considered for testing data in each trial, and other groups were used to train the classifier.Finally, results were averaged.A. The Effect of the Number of

IMFs on Residual
The number of IMFs has a considerable effect on the number of extracted features and the complexity of the proposed method.Here, we determine the efficient number of IMFs based on the mean absolute of residuals, shown in Fig. 10 for different numbers of IMFs of nonlinear signals.It is observed that residual generally reduces as the number of IMFs increases.However, the slope of reduction varies for different sensors.The residuals of sensors 2, 3, and 4 reduce faster than that of sensor 1.As observed, the residual of sensor one does not have a significant variation when the number of IMFs is more than ten.On the other side, the reduction in residuals of sensors 2, 3, and 4 is not notable for the number of IMFs greater than seven.Hence, we consider the ten IMFs for sensor one and seven IMFs for the remaining sensors.Considering 31 IMFs and two features extracted from each IMF, each recording is described with 62 features.
Following the linear case, as observed in Fig. 11, the residuals of all sensors dwindle gradually at nearly the same pace.For any figures over eight IMFs, the residual does not show significant deviations.Thus, for the linear signals, the eight values of IMFs are assigned for all sensors of stories.Considering two features for each IMF, each record is denoted through 48 features.

B. Classification Accuracy
In order to assess the stability of the proposed method and evaluate the effect of features on results, the authors considered four cases as follows:  SA: no feature reduction method is employed  SB: only KPCA is used for feature reduction  SC: only KDA is employed for feature reduction  SD: at first, KPCA and then KDA is considered for feature reduction.
The number of features in conditions SB, SC, and SD is obtained based on the normalized cumulative summation of eigenvalues (NCSE).When the NCSE reaches higher than 0.95 for the first time, the efficient number of features is obtained.Considering 1 ,, f n as sorted eigenvalues in descending order, the NSCE is calculated as follows: Classification accuracy of the proposed method for nonlinear and linear data considering kNN, SVM, and fine tree classifiers and different lengths of signals obtained from sensors are given in Table 3 and Table 4, respectively.
Concerning the nonlinear case, the minimum and maximum performance observe in scenario SA and SD with 76.92% and 98.82%, respectively.In all scenarios, fine tree classifiers seem to be more efficient compared to the other classifiers.Moreover, kNN is the second accurate classifier, and SVM indicates the lowest performance in this case.It is noteworthy that the signal length has the higher impact on the SB and the lowest on SD with the relative variation ( max ) of 9.09% and 3.69%, respectively.
Regarding the nonlinear case study, the highest and lowest performance, likewise the nonlinear case, observes in SA and SD with the accuracy of 100.0% and 89.56%, respectively.Similar to the previous case, the fine tree is the suitable classifier in all proposed scenarios.Except for the SB, kNN indicates higher performance in comparison with SVM.Scenario SB reveals less sensitivity to the signal length, whereas scenario SA shows the highest sensitivity to the signal variations based on max .

C. Confusion Matrix
In this part, the classification performance for both case studies is provided through confusion matrices.Considering the confusion matrix, we provide the recall or sensitivity (Sens.),precision (Prec.),total accuracy (Acc.), and F-score, which are defined as Sens. TP TP FN (38) Prec.TP TP FP (39) Acc. 100 TP TN TP TN FP FN (40) Prec.
The results are given in Table 5 for the linear damage and the performance metrics are computed for the nine scenarios described earlier.As indicated, the proposed method determines all damage scenarios with no errors.Consequently, this approach expresses the highest performance for discriminating linear damages based on reference to this study.
Regarding the nonlinear case study, seventeen separate states of the specimen are predicted through the presented technique, and the results are presented by the confusion matrix as depicted in Table 6.As noted, in the majority of the damage states, the prediction accuracy is 100%.
Regarding the remaining cases, which are two out of seventeen scenarios, the classification performance is 90.0%.Subsequently, the established strategy revealed considerable performance for recognizing nonlinear and linear damages with significant precision.

D. The effect of noise
The various intensity of noises is applied to the responses on the grounds of signal-noise ratio (SNR) to assess the stability of the proposed method against noise, as depicted in Fig. 12.As observed, the proposed method is efficient even in environments contaminated with severe noise (SNR = 1).Furthermore, the established approach can maintain its performance against noise where it shows insignificant variations in the case of SNR of 20 and 15.

VII. GARCH EFFECT ASSESSMENT
In this section, two tests are applied to demonstrate the compatibility of the GARCH model [46].Thus, Kurtosis and ARCH tests are provided in the following sections.

A. Kurtosis test
GARCH model is appropriate for those signals that have the shape of heavy tails.Therefore, the Kurtosis test is utilized to find out that signals have heavy tails or not.The Kurtosis for a distribution (s) is formulated as follows [46]: Where  and  denote the mean and standard deviation of distribution s, respectively, and () Es stands for the expected value of s.For Gaussian distribution, the Kurtosis value of three and higher values shows that the distribution of coefficients has a heavier tail than the Gaussian distribution.This paper applies this test to the IMFs for each sensor, and the average results for minimum and maximum values of sub-bands are presented in Table 7. Regarding the results, it can be seen that the maximum values are higher than 3, which proves that the IMFs do not have Gaussian distribution.

B. ARCH test
Based on the hypothesis provided in [47], the ARCH test is deployed to see the existence of ARCH/GARCH impact in the IMFs of each sensor.In this reference, the Lagrange multiplier test is presented based on regression.Subsequently, the test statistic is asymptotically Chi-square distributed has q degrees of freedom [46].
Thus, in this part, the ARCH test is applied to the IMFs for different sub-bands, and the average results for signals are shown in Table 8.In this table, h stands for the Boolean decision variable, where 1 shows the rejection of the null hypothesis, which depicts that no GARCH effect exists.pvalue is the significance level at which the test rejects the null hypothesis.GARCHstat and CriticalValue are the ARCH test static and critical values of the Chi-square distribution, respectively.Based on this test, if GARCHstat is less than the critical value, no GARCH effect exists.In this study, the significance level is set to 0.05, frequently deployed in [48].Notably, these results are the average of all signals; for example, the average value of h for the fourth IMF of the first sensor is 0.74, which demonstrates that 74% of the signals have the GARCH effect.Thus, in general, the results of the table prove the existence of the GARCH effect in most cases.
observed and analyzed through signal processing tools and pattern recognition procedures to determine probable damage.Independence for having an initial model and prior knowledge causes data-driven SHM to be a faster technique and an economical and practical solution for online SHM.Signal processing techniques synthesize, modify, analyze the recorded responses, and highlight different features in time, frequency, and frequency domains.Machine Learning algorithms are typically employed to identify and interpret features extracted from signals and recognize generated patterns in conjunction with such methods.Machine learning includes clustering, regression, neural networks, ensemble learning, deep learning, Bayesian methods, instance-based, decision trees, and dimensionality reduction [10].
respectively represent the recorded data by sensor 1, sensor 2, sensor 3, and sensor 4. Similar to the previous case, the time-domain presentation of responses can not indicate variations due to damages properly.

Fig. 5 .
Fig. 5. Workflow of the proposed method alternative direction of multipliers (ADMM) is deployed to optimize the constrained variational model.Subsequently, the initial signal()  St is broken down by d IMFs as described in the following: IMFs, the feature vector of signal with()  d r m features, (

m
by the projection of the mapped sample () f onto the eigenvector 12 , ,..., p n as formulated below[40]: and w S reveal the between-class and within-class scatter matrices, which are obtained as: stands for the number of samples in the kth class, and () k denotes the mean of the kth class.Afterward, the total scatters matrix is defined as values of a correspond to the nonzero eigenvalue of eigenproblem: mapping (15) is considered to extend the LDA to the nonlinear case.Hence,

Fig. 10 .
The residual of VMD of nonlinear data for different numbers of IMFs and data length.(a) length of 512, (b) length of 2048, and (c) length of 8192.

Fig. 11 .
Fig. 11.The residual of VMD of linear data for different numbers of IMFs and data length.(a) length of 512, (b) length of 2048, and (c) length of 8192.

Fig. 12 .
Fig. 12.Effect of noise on classification performance

Submitted Journal: Engineering with Computers, Springer 7 output
yt represent the recorded data by sensor 1, sensor 2, and sensor 3, respectively.It is evident that the recorded responses for all damage scenarios follow a random pattern, and usage of timedomain data can not discriminate damage status from healthy cases.Thus, there is a need to model responses through signal processing techniques find suitable features indicating variations in the signals.
2 kg mass.Hence, 50 signals are recorded for each status with a sample rate of 320 Hz.Therefore, 450 signals are acquired for all scenarios, as illustrated in Table1.As depicted, there are nine statuses, including healthy condition (S1) showing the intact structures without any changes in components, two scenarios simulate the operational and environmental effects by changing mass of floors (S2 and S3), and six damage scenarios by changing the stiffness of columns (S4-S9).

Table 2 .
[17]ge scenarios of case study 2[17] , the IMFs of linear and nonlinear signals are shown.Due to space limitations, we only present the two IMFs.

Table 3 .
Classification accuracy of the proposed method for different classifiers and lengths of nonlinear data

Table 4 .
Classification accuracy of the proposed method for different classifiers and lengths of linear data

Table 5 .
Confusion matrix for linear case study

Table 6 .
Confusion matrix for nonlinear case study

Table 7
Kurtosis test for IMFs

Table 8
ARCH test for IMFs