You do not have permission to edit this page, for the following reason:

You are not allowed to execute the action you have requested.


You can view and copy the source of this page.

x
 
1
<!-- metadata commented in wiki content
2
3
4
==A deep learning-based Prognostic Framework for Aeroengine Exhaust Gas Temperature Margin==
5
6
'''Weigang Fu<sup>1,*</sup>, Xiang Tan <sup>2</sup>, Liangzhong Ao<sup>1</sup>, Yaoming Fu<sup>1</sup> and Peng Guo<sup>2,*</sup>'''
7
8
<sup>1</sup> Aviation Engineering Institute, Civil Aviation Flight University of China, Guanghan, 618307, China
9
10
<sup>2</sup> School of Mechanical Engineering, Southwest Jiaotong University, Chengdu, 610031, China
11
12
'''*''' Correspondence: Weigang Fu ([mailto:jiaodafwg@126.com jiaodafwg@126.com] (W. F); Peng Guo([mailto:pengguo318@swjtu.edu.cn pengguo318@swjtu.edu.cn])
13
-->
14
==Abstract==
15
16
The value of the gas-path parameter, exhaust gas temperature margin (EGTM), is the critical index for predicting aeroengine performance degradation. Accurate predictions help to improve engine maintenance, replacement schedules, and flight safety. The outside air temperature (OAT), altitude of the airport, the number of flight cycles, and water washing information were chosen as the sample input variables for the data-driven prognostic model for predicting the take-off EGTM of the on-wing engine. An attention-based deep learning framework was proposed for the aeroengine performance prediction model. Specifically, the multiscale convolutional neural network (CNN) structure is designed to initially learn sequential features from raw input data. Subsequently, the long short-term memory (LSTM) structure is employed to further extract the features processed by the multiscale CNN structure. Furthermore, the proposed attention mechanism is adopted to learn the influence of features and time steps, assigning different weights according to their importance. The actual operation data of the aeroengine are used to conduct experiments, where the experimental results verify the effectiveness of our proposed method in EGTM prediction.
17
18
'''Keywords''': Convolution neural network, long short-term memory, attention mechanism, aeroengine gas-path performance, exhaust gas temperature margin
19
20
==1. Introduction==
21
22
The aeroengine operates at the highest temperature, pressure, speed, and frequency of transitional working states during the take-off phase when compared to the other flight phases like cruise and landing [1,2]. The slow decline of engine performance is inevitable during the active process. Prediction and evaluation of the decline degree of engine performance are necessary for performing preventive maintenance on the aeroengine. Most engine failures are caused by the gas-path system fault, and accurate gas-path performance prediction provides the possibility for aeroengine performance evaluation and maintenance plan optimization. This is significant for ensuring the flight safety of aircraft.
23
24
The gas path parameters include exhaust gas temperature (EGT), rotor speed, and fuel flow. The EGT margin (EGTM) is usually adopted to perform gas path analysis and monitor the engine performance degradation, which can show whether the aeroengines are in the normal state or not [3]. During actual monitoring and maintenance, the take-off EGTM [4] is usually chosen as the critical gas-path parameter to evaluate the performance state of the aeroengine. A deteriorated engine will consume more fuel, thus increasing the EGT and decreasing the EGTM [5]. The net thrust, fuel flow, low rotor speed, core rotor speed, pressure ratio, air temperature at engine fan inlet, take-off EGTM, and specific fuel consumption are regarded as input parameters for estimating the EGT [6]. These input parameters were collected by the sensors during the flight [7], and they are unknown for the prognostic analysis. In that work, the relationship between EGT decline rate and the flight cycles was established to predict the remaining life of a PW4000-94 engine. The EGTM was influenced by the unknown real-time data, obtained by sensor data during the flight, as well as the known data obtained before the flight. This work aims to utilize the above-known data as the input parameters to predict the EGTM prior to flight.
25
26
Take-off EGTM, as shown in [[#img-1|Figure 1]], reflects the state of engine performance. When the take-off EGTM equals 0 °C, the EGT has reached the red line value. The EGTM in the take-off phase directly relates to the airport outside air temperature (OAT) and altitude. However, empirical data do not exist to allow a correlation between EGTM deterioration and the OAT or altitude of the airport.
27
28
<div id='img-1'></div>
29
{| class="wikitable" style="margin: 1em auto 0.1em auto;border-collapse: collapse;width:auto;" 
30
|-style="background:white;"
31
|style="text-align: center;padding:10px;"| [[Image:Review_393375948024-image1.png|306px]]
32
|-
33
| style="background:#efefef;text-align:left;padding:10px;font-size: 85%;"| '''Figure 1'''. Effect of OAT in the airport on EGTM deterioration
34
|}
35
36
37
The value of EGTM significantly affects engine life. Reducing EGTM will extend engine life on the wing, thereby reducing operating costs. If the engine is arranged to take off at the airport at a lower OAT and altitude, EGT will likely not cross the red line. With regard to meeting the aircraft performance requirements, the engine is designed to provide a given thrust level at a temperature below the corner point OAT. As the OAT in the airport increases, more fuel is required, EGT increases, and EGTM decreases. However, at a temperature above the corner point OAT, the EGTM is less than zero, and the thrust output must be reduced. If it does not reduce, the engine will be damaged [8]. Similarly, as the altitude of the airport increases, more fuel will be required to provide a given thrust output, so the EGT will increase, resulting in a decrease in the EGTM [9].
38
39
The number of flight cycles significantly affects the take-off EGTM of aeroengines. For an available gas turbine engine, the levels of degradation drop by increasing the flight cycles, and all engine health parameters deviate slowly from their nominal values [10]. The degradation data of the booster were used to illustrate the clean compressor map and degraded compressor maps at 3000 and 6000 flight cycles for the JT9D turbofan engine [11]. The degraded maps were utilized to predict the overall degradation effects on the engine performance. In the take-off phase, the engine accelerates from idle to maximum power resulting in maximum rotor speed and EGT, which causes the turbine blade to elongate and creep [12]. In addition, abrasion between the elongated turbine blade and the stationary parts occurs. The turbine clearance increases during the thermal cycles, such as the start and stop cycles, namely flight cycles. The engine wear and higher clearance lead to a deterioration of the engine efficiency, which decreases the turbine efficiency [13]. In this case, more fuel is consumed to maintain a given thrust level, so the EGTM will decrease, and the engine performance will degrade.
40
41
Meanwhile, low cycle fatigue (LCF) [14] is associated with engines that have been in service for long periods. LCF occurs due to machine cyclic loading, like start/stop cycles, which is closely related to the flight cycles. The LCF life of a component is determined by the number and the intensity of cycles the component material must endure [15]. In contrast, the creep life of a component depends on the time it spends operating within the material's creep temperature range. The number of flight cycles can influence the communicative time effect on the wear, creep, and fatigue life of hot section components.
42
43
Periodical on-wing water washing is an efficient and economical method to improve engine performance and restore the take-off EGTM [16]. Engines in the take-off phase and the approaching landing phase of each flight cycle are more affected by the airborne pollutant due to the lower altitude they operate, making them more susceptible to compressor fouling degradation. The changing value of the mass flow rate and the compressor's efficiency due to compressor fouling can be expressed as a linear relationship concerning the flight cycle [17]. Besides the effect of online washing with different water-to-air ratios and engine loads on performance recovery [18], the effect of inlet pressure and droplet diameter of washing liquid on compressor fouling removal [19], and the recovery efficiency of power loss with washing [20] have been studied.
44
45
To predict the take-off EGTM of the on-wing engine in advance, we choose the OAT and altitude of the airport, the number of flight cycles, and water washing information as the sample input variables for a prediction algorithm. All these parameters can be obtained before the flight, making the prognostic of the engine's performance degradation known in advance. The aeroengine performance degradation prediction is a time series forecasting task. Deep architectures such as convolutional neural networks (CNN) and long short-term memory (LSTM) can extract and effectively capture the feature information of raw input data. However, the ability of normal CNN is affected by the size of convolution kernels, which should be accurately determined. Another key issue is that once the sequence is too long, the traditional LSTM cannot effectively use the location information of time series data and capture the long-term interdependence. As such, an attention mechanism was added for assigning different weights to features of different importance.
46
47
The main contributions of this paper are summarized as follows:
48
49
:1) A multiscale CNN-LSTM structure is developed to handle raw aeroengine data to learn temporal sequence features and extract useful degradation information.
50
51
:2) We propose a deep learning framework based on an attention mechanism. The attention mechanism can learn the importance of sequential features and time steps, assigning different weights according to their preference.
52
53
:3) We conduct experiments on real datasets to evaluate the effectiveness of our proposed method. The experimental results show that the proposed method shows a considerable improvement in aeroengine performance estimation.
54
:
55
:The rest of this paper is organized as follows. Section 2 reviews the related works. Section 3 describes the suggested method based on deep learning, and section 4 includes the computational experiments and an analysis of the results. Finally, section 5 concludes this work and gives future studies.
56
57
==2. Related works==
58
59
Multiple methods are utilized in engineering applications to analyze the gas-path system for aeroengine performance prediction. In general, these prediction methods can be divided into two categories: model-based methods [21]–[23] and data-driven methods [24]. The nonlinear simulation model of a twin-spool turbofan engine was constructed as a component level model by Adibhatla et al. [21]. A bank of parallel Kalman filters and a hierarchical structure were used for the multiple model adaptive estimation methods of in-flight failures test by Maybeck [22]. The nonlinear dynamics of the jet engine are linearized and a set of linear models corresponding to various operating modes of the engine at each operating point is obtained by Lu et al [23]. However, the component structures and accessory systems of aeroengines are becoming increasingly complex and integrated. Accurate modeling remains difficult with model-based methods due to the challenge of mastering various nonlinear mathematical relationships between components and systems [25].
60
61
When compared with these model-based methods, data-driven methods do not require an understanding of the complex operation mechanisms of the mechanical system. Therefore, data-driven methods have been widely used in aeroengine performance prediction. The previously collected data of gas path systems were time-series data of multiple state parameters. A CNN is designed to extract the features from this input data. The CNN prediction technology was combined with a delta fuel flow degradation baseline to estimate the performance recovery by the water washing [26]. A CNN-based multitask learning framework was proposed to accurately estimate the remaining useful life (RUL) by simultaneous learning. The estimations occurred during a health state identification, where inter-dependencies of both tasks were considered using general features extracted from the shared network [27]. A CNN and extreme gradient boosting (CNN-XGB) were combined through model averaging. A CNN-XGB with an extended time window was utilized for a RUL estimation [28].
62
63
A hybrid method of convolutional and recurrent neural network (CNN-RNN) was proposed for the RUL estimation, where it can extract the local features and capture the degradation process [29]. A RNN has the advantage of a data-driven model with short time dependencies. Nevertheless, a RNN has poor performance in dealing with long-time dependencies data. LSTM neural networks have been proposed to address these dependencies for predicting the RUL of any system. The LSTM model has the advantage of retaining time domain information for a long duration of time. The accuracy of an online LSTM method was improved by comparing it to the proposed methods in Kakati et al. [30] for RUL estimation of a turbofan engine. The LSTM, as well as the statistical process analysis, were performed to predict the fault of aeroengine components with multi-stage performance degradation [31]. The linear regression model and LSTM were utilized to construct the data-driven model of degradation trend prediction and RUL estimation [32]. RUL estimation for predictive maintenance was achieved by using the support vector regression (SVR) model and an LSTM network [33].
64
65
A novel performance degradation prediction method based on the attention model and SVR is proposed for RUL prediction. The attention mechanism can focus on the important features in the time-sequential data, while the SVR model identifies the mapping relationship between multiple state parameters and performance degradation [34]. Many hidden layers were constructed for the machine learning model and a large number of training data to learn more useful features and improve the accuracy of classification and prediction. The designed system [35] is based on reinforcement learning and a deep learning framework, which consists of an input, modeling, and a decision layer. Li et al. [36] proposed a new data-driven approach for prognostics by using deep CNN. In that work, a time window approach employed for sample preparation achieves better feature extraction by deep CNN leading to high prognostic accuracy with regards to the RUL estimation.
66
67
An intelligent deep learning method was proposed for forecasting the health evolution trend of aeroengine by Jiang et al. [37]. This method systematically blends the dispersion entropy-based multi-scale series aggregation scheme with a long LSTM neural network. Remadna et al. [38] introduced a new hybrid RUL prediction approach by combining two deep learning methods sequentially. The hybrid model uses a CNN with bidirectional LSTM networks where the CNN extracts spatial features while bidirectional LSTM extracts temporal features. Chu et al. [39] proposed an integrated deep learning approach with CNN and LSTM networks to learn the latent features and estimate RUL value with a deep survival model based on the discrete Weibull distribution. In their work, the turbofan engine degradation simulation datasets provided by NASA were utilized to validate the proposed approach.
68
69
==3. Methodology==
70
71
This section describes the aeroengine EGTM prediction problem. Next, the proposed attention-based multiscale CNN-LSTM method is introduced in detail. This includes the theoretical background of the components and the method’s overall framework.
72
73
===3.1 Problem description===
74
75
From the perspective of health management, aeroengine EGTM prediction can be regarded as a time series problem. The EGTM prediction problem can be defined as follows. The input is <math display="inline">{X}_{t}^{k}</math>, where <math display="inline">k\in R^n</math>, <math display="inline">t = (1,2, \ldots ,T)</math>. In addition, <math display="inline">{X}_{t}^{k}</math>represents the collected input data during aeroengine operation, <math>n</math> represents the number of features, and <math>T</math> represents the length of the time step. The corresponding output is the EGTM prediction result <math display="inline">{Y}_{t}</math> for each time step. EGTM is predicted in real-time by establishing the mapping relationship between the output and input data. The mapping relationship is expressed as follows:
76
77
{| class="formulaSCP" style="width: 100%; text-align: center;" 
78
|-
79
| 
80
{| style="text-align: center; margin:auto;" 
81
|-
82
| <math>{Y}_{t}=f({X}_{t}^{k})</math>
83
|}
84
| style="width: 5px;text-align: right;white-space: nowrap;" | (1)                                                                                                                                           
85
|}
86
87
88
When building the performance prediction model, the above input data are directly imported from the raw data file to predict EGTM. As such, a large amount of mixed noise exists in the data. To fully extract the time-series features of the data, we design a multiscale CNN-LSTM deep learning framework based on an attention mechanism to construct mapping relationships, as introduced in detail in the following subsections.
89
90
===3.2 Multiscale CNN===
91
92
A traditional CNN can directly process the input raw data and extract the hidden features. However, the amount of raw data is relatively large. Thus, using a single convolution kernel may cause the model to omit locally important features in the process of adaptively extracting features. By adjusting the scale of the convolution kernel and using several different convolution kernels, designing a network capable of extracting the raw data features may be possible. This results in the performance of the model prediction improving.
93
94
This work proposes an improved CNN with a multiscale convolution operation to compensate for the limitation of a traditional CNN. Specifically, each convolution layer consists of 64 convolution kernels, and we set the convolution kernel size to 1, 3, and 5. The multiscale convolution operation is embodied as a structure to extract the hidden features by performing a multiscale convolution operation on the raw data. Initially, this establishes a shallow mapping relationship between the raw data and EGTM. The specific network structure is shown in [[#img-2|Figure 2]].
95
96
<div id='img-2'></div>
97
{| class="wikitable" style="margin: 1em auto 0.1em auto;border-collapse: collapse;width:auto;" 
98
|-style="background:white;"
99
|style="text-align: center;padding:10px;"| [[Image:Review_393375948024-image2-c.png|420px]]
100
|-
101
| style="background:#efefef;text-align:left;padding:10px;font-size: 85%;"| '''Figure 2'''. Structure of multiscale CNN
102
|}
103
104
105
===3.3 Long short-term memory network===
106
107
The data in the proposed prediction model, discussed previously, are time series, the nodes of the RNN are connected along the sequence, and the RNN is designed to learn the correlation of the time series. However, the standard RNN often encounters the problem of gradient disappearance and gradient explosion during the training process. As a result, both the model’s ability to capture the previous information and its performance in modeling long-term dependencies decreases.
108
109
To solve this problem, Hochreiter proposed a new architecture named long and short-term memory network (LSTM) [42]. LSTM is a special RNN, which has been widely used in various time sequence modeling tasks such as stock market price prediction and energy consumption prediction. The advantage of LSTM involves its ability to overcome shortcomings of traditional RNN, such as the influence of gradient disappearance and gradient explosion. The basic architecture of a typical LSTM is shown in [[#img-3|Figure 3]].
110
111
<div id='img-1'></div>
112
{| class="wikitable" style="margin: 1em auto 0.1em auto;border-collapse: collapse;width:auto;" 
113
|-style="background:white;"
114
|style="text-align: center;padding:10px;"| [[Image:Review_393375948024-image3.png|336px]]
115
|-
116
| style="background:#efefef;text-align:left;padding:10px;font-size: 85%;"| '''Figure 3'''. Structure of LSTM
117
|}
118
119
120
One notable feature includes how it delicately designs the structure of the recurrent unit. The sigmoid activation function, tanh activation function, and element-wise product work together to form three gate structures: forget gate, input gate, and output gate. Two gates are used to control the state of the memory cell <math display="inline"> c</math>. The first gate is the forget gate, while the other is the input gate. When the forget gate is turned on, some information from the previous memory cell state <math display="inline"> c_{t-1}</math> could be ignored, and others will be kept. When the input gate is activating, the information from the current input <math display="inline">x_t</math> can be added to the memory cell <math display="inline">c</math>. LSTM uses the output gate to control how much information of the memory cell state <math display="inline">c_t</math> will be added to the current output <math display="inline">h_t</math>. For the given inputs <math display="inline">x_t</math>, <math display="inline">h_{t-1}</math>, and <math display="inline"> c_{t-1}</math>, the update process of LSTM for time step <math display="inline">t</math> is shown as Eq.(2)
121
122
{| class="formulaSCP" style="width: 100%; text-align: center;" 
123
|-
124
| 
125
{| style="text-align: center; margin:auto;" 
126
|-
127
| <math>\left\{ \begin{matrix}\&{i}_{t}=\sigma ({W}_{i}[{h}_{t-1},{x}_{t}]+{b}_{i})\\\&{f}_{t}=\sigma ({W}_{f}[{h}_{t-1},{x}_{t}]+{b}_{f})\\\&{o}_{t}=\sigma ({W}_{o}[{h}_{t-1},{x}_{t}]+{b}_{o})\\\&{\tilde{c}}_{t}=\tanh({W}_{c}[{h}_{t-1},{x}_{t}]+{b}_{c})\\\&{c}_{t}={f}_{t}\odot {c}_{t-1}+{i}_{t}\odot {\tilde{c}}_{t}\\\&{h}_{t}={o}_{t}\odot \tanh({c}_{t})\end{matrix}\right.</math>
128
|}
129
| style="width: 5px;text-align: right;white-space: nowrap;" | (2)
130
|}
131
132
133
In the above equation, <math>{W}_{i}</math>,'  ''W<sub>f</sub>''','''W<sub>o</sub>''', '''''and ''W<sub>c</sub>'' are the weight matrices for connections''''',''''' '' '''<nowiki/>''b<sub>i</sub>, b<sub>f</sub> ,'<nowiki/>''<nowiki/>'<nowiki/>''b<sub>o</sub> and b<sub>C</sub> are the bias vectors,and  <math>\sigma (\cdot )</math> and tanh are the sigmoid and'' ''tanh functions, respectively. As mentioned above, LSTM has played an essential role in various tasks required to model time series data, which demonstrates the effectiveness of LSTM in addressing time series prediction problems. However, the regression of the standard LSTM is often based on the features learned in the last time step.''' '''It cannot accurately control the sequence impact of each time on the output, leading to a decrease in the final prediction accuracy. Hence, we added an attention mechanism to the proposed framework for learning the importance of each time step. So the neural network can learn and extract helpful feature information more thoroughly.
134
135
===3.4  Attention mechanism===
136
137
In recent years, an attention mechanism has been widely used in various tasks of deep learning, such as image caption generation [40], speech recognition [41], and visual question answering [42]. An attention mechanism, inspired by the ability of humans to focus on specific information while ignoring others selectively, can make deep learning more targeted when extracting the features for improving the accuracy of related prediction tasks. In addition, this operation does not increase the cost of model calculation or storage.
138
139
<div id='img-4'></div>
140
{| class="wikitable" style="margin: 1em auto 0.1em auto;border-collapse: collapse;width:auto;" 
141
|-style="background:white;"
142
|style="text-align: center;padding:10px;"| [[Image:Review_393375948024-image5-c.png|270px]]
143
|-
144
| style="background:#efefef;text-align:left;padding:10px;font-size: 85%;"| '''Figure 4'''. Structure of the attention mechanism 
145
|}
146
147
148
Applying the attention mechanism shown in [[#img-4|Figure 4]] to the EGTM prediction is achieved by assigning different weights to different features for focusing on the regions of different importance. Adding the weights into the neural network is useful for distinguishing various features. The first step of the specific process defines the input of the attention layer as the learned feature state <math display="inline">H = (h_1, h_2,.....,h_T)</math>. This is activated by the LSTM layer, where the calculation formula of the score <math display="inline">S_t</math> of the <math>t</math>-th feature is expressed by the following equation
149
150
{| class="formulaSCP" style="width: 100%;border-collapse: collapse;width: 100%;text-align: center;" 
151
|-
152
| 
153
{| style="vertical-align: top;margin:auto;width: 100%;"
154
|-
155
| <math>{S}_{t}=\tanh\left( {W}_{h}{h}_{t}+b\right)</math> 
156
|}
157
|  style="text-align: right;vertical-align: top;width: 5px;text-align: right;white-space: nowrap;"|(3)
158
|}
159
160
In Eq.(3), <math>{W}_{h}</math> and <math>b</math> represent the weight matrix and the bias vector, respectively, while tanh serves as an activation function. After the score <math display="inline">S_t</math> is obtained, it is normalized by the softmax function
161
162
{| class="formulaSCP" style="width: 100%;border-collapse: collapse;width: 100%;text-align: center;" 
163
|-
164
| 
165
{| style="vertical-align: top;margin:auto;width: 100%;"
166
|-
167
| <math>{a}_{t}=\frac{\exp\,(S_{t})}{\displaystyle\sum_{t=1}^{T}\exp\,({S}_{t})}</math>
168
|}
169
|  style="text-align: right;width: 5px;text-align: right;white-space: nowrap;"|(4)
170
|}
171
172
The final output feature <math>c</math> of the attention mechanism can be expressed as:
173
174
{| class="formulaSCP" style="width: 100%;border-collapse: collapse;width: 100%;text-align: center;" 
175
|-
176
| 
177
{| style="vertical-align: top;margin:auto;width: 100%;"
178
|-
179
| <math>c=\sum _{t=1}^{T}{a}_{t}{h}_{t}</math>
180
|}
181
|  style="text-align: right;width: 5px;text-align: right;white-space: nowrap;"|(5)
182
|}
183
184
===3.5 Overall framework ===
185
186
[[#img-5|Figure 5]] shows the overall framework of our proposed method for EGTM prediction. It is a multiscale CNN-LSTM model based on a multiscale convolution kernel and an attention mechanism. The general framework comprises three substructures: a CNN layer (including multiscale convolution layer, pooling layer, and feature fusion layer), a LSTM layer, and an attention layer.
187
188
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
189
 [[Image:Review_393375948024-image6-c.png|312px]] </div>
190
191
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
192
'''Figure 5:''' Structure of the overall framework</div>
193
194
The operation process of the proposed model starts with feature extraction performed on the raw collected data of aeroengines. The process utilizes a multiscale CNN structure to perform convolution operations for extracting representative features. The CNN consists of three multiple convolution layers of different scales, the maxpool layer, and the fusion layer. Next, the method employs the LSTM structure for further feature learning to find trends within the data. Then, the features processed by LSTM are transferred to the attention layer. Using the attention mechanism to learn the entire sequence simultaneously, the position information of the features can be considered with weights generated to inform the neural network with regards to extracting useful detailed information for improving the performance of the aeroengine EGTM prediction. Subsequently, the features learned by LSTM are merged with the importance weights generated by the attention mechanism. Finally, the regression layer is used to output the results of EGTM prediction. The inputs of the key modules (the multiscale CNN layer, the LSTM layer, the attention layer, and the merged layer) are summarized in Table 1.
195
196
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
197
'''Table 1:''' Inputs of each key layer</div>
198
199
{| style="width: 88%;margin: 1em auto 0.1em auto;border-collapse: collapse;" 
200
|-
201
|  style="border-top: 1pt solid black;border-bottom: 1pt solid black;text-align: center;"|Layer
202
|  style="border-top: 1pt solid black;border-bottom: 1pt solid black;text-align: center;"|Input
203
|-
204
|  style="border-top: 1pt solid black;text-align: center;"|Multiscale CNN (MCNN)
205
|  style="border-top: 1pt solid black;text-align: center;"|Raw aeroengine data
206
|-
207
|  style="text-align: center;"|LSTM
208
|  style="text-align: center;"|Features learned by MCNN
209
|-
210
|  style="text-align: center;"|Attention mechanism
211
|  style="text-align: center;"|Features learned by LSTM
212
|-
213
|  style="border-bottom: 1pt solid black;text-align: center;"|Merge layer
214
|  style="border-bottom: 1pt solid black;text-align: center;"|Features learned by LSTM
215
216
Weights generated by the attention mechanism
217
|}
218
219
==4 Experiments==
220
221
===4.1 Experimental Datasets===
222
223
To verify the performance of the proposed method, the EGTM data are collected from a civil aviation turbofan engine. The altitude data from the airport and flight cycles are obtained from historical flight records and a future flight plan. The OAT data in the airport are inferred from historical flight records and a future flight plan [43]. The historical atmosphere information of the airports and the water washing information can be collected from engine maintenance records.
224
225
===4.2 Data Preprocessing===
226
227
====4.2.1 Normalization====
228
229
Data from different sources have various units and scales, which could affect the accuracy of EGTM prediction. Therefore, the raw input is normalized to speed up training convergence and improve the generalization ability. This paper adopts the min-max normalization method to preprocess data. In general, the raw input data is mapped to the interval 0~1. Specifically, for the input data <math display="inline">{X}_{t}^{k}</math>, we normalize it as follows:
230
231
{| class="formulaSCP" style="width: 100%;border-collapse: collapse;width: 100%;text-align: center;" 
232
|-
233
| 
234
{| style="vertical-align: top;margin:auto;width: 100%;"
235
|-
236
| <math>\tilde{{X}_{t}^{k}}=\frac{{X}_{t}^{k}-\mathrm{min}\,\left\{ {X}^{k}\right\} }{\mathrm{max}\,\left\{ {X}^{k}\right\} -\mathrm{min}\,\left\{ {X}^{k}\right\} }</math>
237
|}
238
|  style="text-align: right;width: 5px;text-align: right;white-space: nowrap;"|(6)
239
|}
240
241
242
where <math display="inline">\tilde{{X}_{t}^{k}}</math> represents the normalized data,  <math display="inline">\mathrm{max}\,\left\{ {X}^{k}\right\}</math>  and <math display="inline">\mathrm{min}\,\left\{ {X}^{k}\right\}</math>  represent the maximum and
243
244
minimum values in the sequence, respectively.
245
246
====4.2.2  Sliding time-window processing====
247
248
In problems based on multivariate time series, time series data sampled at a longer temporal sequence usually have more information than a data point with a single time step. A sliding window is used for data segmentation to use the multivariate temporal information and generate the network inputs. An example of data segmentation through sliding time window processing is shown in Figure 6. The window size is denoted as ''s'', and the sliding step is set to be expressed as ''l''. The short sliding step can increase the number of experimental samples to reduce the risk of overfitting and ensure the stability of the training process. As such, the sliding step is set to a value of 1.0. We will discuss the impact of time window size on the model prediction performance in Section 4.5.1.
249
250
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
251
 [[Image:Review_393375948024-image7-c.png|336px]] </div>
252
253
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
254
'''Figure 6:''' Sliding time window processing</div>
255
256
===4.3  Evaluation Criteria===
257
258
To evaluate the performance of the RUL prediction, we use two commonly adopted evaluation criteria, root mean square error (RMSE) and mean absolute error (MAE). RMSE and MAE are defined as follows by equations (7) and (8), respectively:
259
260
{| class="formulaSCP" style="width: 100%;border-collapse: collapse;width: 100%;text-align: center;" 
261
|-
262
| 
263
{| style="vertical-align: top;margin:auto;width: 100%;"
264
|-
265
| 
266
|}
267
| 
268
{| style="text-align: center;margin:auto;width: 100%;"
269
|-
270
| <math>RMSE=\sqrt{\frac{1}{N}\sum _{i=1}^{N}({R}_{i}-{P}_{i}{)}^{2}}</math>
271
|}
272
|  style="text-align: right;text-align: right;white-space: nowrap;"|(7)  
273
|-
274
| 
275
{| style="vertical-align: top;margin:auto;width: 100%;"
276
|-
277
| 
278
|}
279
| 
280
{| style="text-align: center;margin:auto;width: 100%;"
281
|-
282
| <math>MAE=\frac{1}{N}\sum _{i=1}^{N}\left| {R}_{i}-{P}_{i}\right|</math> 
283
|}
284
|  style="text-align: right;text-align: right;white-space: nowrap;"|(8)    
285
|}
286
287
288
In equations (7) and (8), ''N'' represents the number of testing samples, and ''R''<sub>i</sub> and ''P''<sub>i</sub> represent the actual EGTM and predicted EGTM of the ''i''-th sample, respectively.
289
290
===4.4 Structural Parameters===
291
292
The number of the hidden units and the size of the convolutional kernel in each layer are set to the same value to simplify the parameter selection. The mini-batch gradient descent method is used to train the network, and the batch size is set to 8. The Adam algorithm has the advantages of the back-propagation algorithm and possesses an excellent ability with handling non-stationary data, so the Adam algorithm is adopted to train the neural networks.
293
294
The structural parameters of attention-based multiscale CNN-LSTM are determined by contrast experiments. The parameters include the output dimension of CNN, the number of stacked layers of LSTM, and the dimension of hidden layers. They are determined this way obtain a better prediction performance. When deciding the output dimensions of CNN by contrast experiments, the output dimensions are used as variables, while other structural parameters are used as definite values, simultaneously. Similarly, the number of stacked layers and hidden layer dimensions of LSTM are also determined through comparative experiments. Table 2 lists the final parameters of each part of the structure.
295
296
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
297
'''Table 2''': Structural Parameters of our method</div>
298
299
{| style="width: 48%;margin: 1em auto 0.1em auto;border-collapse: collapse;" 
300
|-
301
|  style="border-top: 1pt solid black;border-bottom: 1pt solid black;text-align: center;"|Layers
302
|  style="border-top: 1pt solid black;border-bottom: 1pt solid black;text-align: center;"|Parameters
303
|-
304
|  rowspan='3' style="border-bottom: 1pt solid black;text-align: center;"|CNN
305
|  style="text-align: center;"|Hidden units: 64
306
|-
307
|  style="text-align: center;"|Pool: Maxpooling1D
308
|-
309
|  style="border-bottom: 1pt solid black;text-align: center;"|Activation: Relu
310
|-
311
|  rowspan='3' style="border-bottom: 1pt solid black;text-align: center;"|LSTM
312
|  style="text-align: center;"|Hidden units: 128
313
|-
314
|  style="text-align: center;"|Num_layer: 4
315
|-
316
|  style="border-bottom: 1pt solid black;text-align: center;"|Dropout: 0.2
317
|-
318
|  rowspan='3' style="border-bottom: 1pt solid black;text-align: center;"|Attention
319
|  style="text-align: center;"|Dense:128
320
|-
321
|  style="text-align: center;"|Activation: Relu
322
|-
323
|  style="border-bottom: 1pt solid black;text-align: center;"|Dense: 1
324
|}
325
326
327
===4.5  Experimental Results and Analysis===
328
329
To verify the effectiveness of the proposed method, we first analyze the influence of window size on the prediction performance of EGTM. Then, ablation experiments are conducted to establish the role of the proposed multiscale CNN architecture and attention mechanism in improving the model's accuracy.
330
331
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
332
''' 
333
{|
334
|-
335
| [[Image:Review_393375948024-image8.png|246px]]
336
| [[Image:Review_393375948024-image9.png|center|246px]]
337
|}
338
'''</div>
339
340
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
341
'''Figure 7:''' Effect of the time window size on the prognostic performance</div>
342
343
During the data processing, mentioned above, the selection of the time window size determines the input data of the network. Different time series data contain various information, so a reasonable time window size must be chosen. To evaluate the impact of the time window size, we conducted experiments to analyze the influence of window size on the prediction performance of EGTM. The results with different window sizes (i.e., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12) are plotted in Figure 7. The figure shows fluctuations in the model's performance, but the overall trend indicates the performance improves when the window size is first increased. This may be due to the EGTM prediction containing more sequence information. When the window size is set to 4, the RMSE value and MAE reach a minimum. Increasing the window size leads to decreased model performance. Therefore, as the optimal size, the length of time window size is set to 4.
344
345
Furthermore, ablation experiments are conducted for the proposed method to evaluate the effectiveness of the proposed multiscale CNN structure and attention mechanism in improving the prediction accuracy of EGTM. Specifically, we conducted experiments on a traditional CNN-LSTM, a traditional CNN-LSTM with an attention mechanism, a multiscale CNN-LSTM, and our proposed method.
346
347
For training, we set the epoch to 300 and the mean square error (MSE) as the loss function. The parameters used in each model are identical. As shown in Figure 8, the red line indicates that the proposed method produces the smallest degree of error throughout the epochs.
348
349
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
350
 [[Image:Review_393375948024-image10-c.png|408px]] </div>
351
352
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
353
'''Figure 8:''' Training loss of the proposed method and other models in ablation experiments</div>
354
355
As shown in Figure 9, the performance of a multiscale CNN-LSTM and a traditional CNN-LSTM integrated with an attention mechanism performs better than a traditional CNN-LSTM. In addition, the proposed method in this paper has the highest accuracy in EGTM prediction when compared with other methods. This verifies the effectiveness of our proposed method in enhancing the performance of extracting data features from the input data.
356
357
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
358
 [[Image:Review_393375948024-image11.png|354px]] </div>
359
360
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
361
'''Figure 9:''' Performance comparison of the proposed method and other models in ablation experiments</div>
362
363
To analyze the predicted accuracy of our model with regard to EGTM, we compare the predicted EGTM with the actual EGTM of the aeroengine and plot the results in Figure 10. The trajectory of the predicted EGTM is similar to the real EGTM of the aeroengine when compared with other methods. This supports effectiveness of our proposed model in learning aeroengine degradation information. When compared with other stages, the error tends to be larger as EGTM is between 70 and 75, which may be a result of how the aeroengine begins to degenerate rapidly in this state.
364
365
{| style="width: 100%;margin: 1em auto 0.1em auto;border-collapse: collapse;" 
366
|-
367
|  style="text-align: center;vertical-align: top;width: 49%;"|[[Image:Review_393375948024-image12.png|264px]] 
368
|  style="text-align: center;vertical-align: top;"|
369
|  colspan='2'  style="text-align: center;vertical-align: top;width: 47%;"|[[Image:Review_393375948024-image13.png|264px]] 
370
|-
371
|  style="text-align: center;vertical-align: top;"|(a) proposed method
372
|  colspan='2'  style="text-align: center;vertical-align: top;"|
373
|  style="text-align: center;vertical-align: top;"|(b) CNN-LSTM
374
|-
375
|  style="text-align: center;vertical-align: top;width: 49%;"|[[Image:Review_393375948024-image14.png|270px]] 
376
|  colspan='2'  style="text-align: center;vertical-align: top;"|
377
|  style="text-align: center;vertical-align: top;width: 46%;"|[[Image:Review_393375948024-image15.png|264px]] 
378
|-
379
|  style="text-align: center;vertical-align: top;"|(c) multiscale CNN-LSTM
380
|  colspan='2'  style="text-align: center;vertical-align: top;"|
381
|  style="text-align: center;vertical-align: top;"|(d) traditional CNN-LSTM with attention
382
|}
383
384
385
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
386
'''Figure 10: '''Analysis of EGTM prediction results of different models</div>
387
388
==5 Conclusion and Future Work==
389
390
This paper proposes an attention-based multiscale CNN-LSTM framework for aeroengine performance prediction. First, a multiscale CNN is developed to extend the feature information of the raw input data to a longer time scale. Then, LSTM is used to learn the time-series features. An attention mechanism is adopted to obtain the importance of the time series data and assign different importance weights to the features for improving the prediction accuracy. Experiments on real data sets verify the effectiveness of the designed method in aeroengine performance prediction.
391
392
In prognostics and health management (PHM), accurate prediction performance is significant for ensuring the reliability of aeroengines and making maintenance plans and flight schedules. The price of data collection for the whole life of an aeroengine is relatively high, as it may take a long period of several years to track the degradation process. However, there are individual differences among aeroengines due to different manufacturing, assembly, and even model types. As such, they likely work under various scenarios. Hence, data-driven methods perform worse than expected. To address this problem and meet the actual needs of airline management and maintenance, the knowledge of transfer learning can be used to enhance the generalization ability of a data-driven prognostic model [44], so the trained model can be applied to the performance prediction of other aeroengines.
393
394
<span id='_Hlk103065805'></span><span style="text-align: center; font-size: 75%;">'''Funding:''' This research was funded by the Science and Technology Foundation of Sichuan, grant number 2022YFG0356, the Key Research and Development Plan Foundation of Tibet, grant number XZ202101ZY0017G, and the Aeroengine Operational Safety and Control Technology Research Center Project of CAFUC, grant number JG2019-02-03 and D202105.</span>
395
396
'''Conflicts of Interest:''' The authors declare no conflict of interest.
397
398
==References==
399
400
[1] A. H. Epstein, “Aeropropulsion for Commercial Aviation in the Twenty-First Century and Research Directions Needed,” ''AIAA Journal'', vol. 52, no. 5, pp. 901–911, May 2014, doi: 10.2514/1.J052713.
401
402
[2] X. Zhou, X. Fu, M. Zhao, and S. Zhong, “Regression model for civil aero-engine gas path parameter deviation based on deep domain-adaptation with Res-BP neural network,” ''Chinese Journal of Aeronautics'', vol. 34, no. 1, pp. 79–90, Jan. 2021, doi: 10.1016/j.cja.2020.08.051.
403
404
[3] Z. Chen, X. Yuan, M. Sun, J. Gao, and P. Li, “A hybrid deep computation model for feature learning on aero-engine data: applications to fault detection,” ''Applied Mathematical Modelling'', vol. 83, pp. 487–496, Jul. 2020, doi: 10.1016/j.apm.2020.02.002.
405
406
[4] W. Mao, J. He, and M. J. Zuo, “Predicting Remaining Useful Life of Rolling Bearings Based on Deep Feature Representation and Transfer Learning,” ''IEEE Transactions on Instrumentation and Measurement'', vol. 69, no. 4, pp. 1594–1608, Apr. 2020, doi: 10.1109/TIM.2019.2917735.
407
408
[5] R. Jakubowski, “Evaluation of performance properties of two combustor turbofan engine,” ''EiN'', vol. 17, no. 4, pp. 575–581, Sep. 2015, doi: 10.17531/ein.2015.4.13.
409
410
[6] M. Ilbas and M. Turkmen, “Estimation of exhaust gas temperature using artificial neural network in turbofan engines,” ''Journal of Thermal Sciences and Technology'', vol. 32, no. 2, pp. 11–18, 2012.
411
412
[7] J. Liu, F. Lei, C. Pan, D. Hu, and H. Zuo, “Prediction of remaining useful life of multi-stage aero-engine based on clustering and LSTM fusion,” ''Reliability Engineering & System Safety'', vol. 214, p. 107807, Oct. 2021, doi: 10.1016/j.ress.2021.107807.
413
414
[8] T. Wensky, L. Winkler, and J. Friedrichs, “Environmental Influences on Engine Performance Degradation,” Dec. 2010, pp. 249–254. doi: 10.1115/GT2010-22748.
415
416
[9] V. Bermúdez, J. R. Serrano, P. Piqueras, and B. Diesel, “Fuel consumption and aftertreatment thermal management synergy in compression ignition engines at variable altitude and ambient temperature,” ''International Journal of Engine Research'', p. 14680874211035016, Jul. 2021, doi: 10.1177/14680874211035015.
417
418
[10] R. Kurz and K. Brun, “Gas Turbine Tutorial - Maintenance And Operating Practices Effects On Degradation And Life.,” 2007, doi: 10.21423/R15W7F.
419
420
[11] Z. Wei, S. Zhang, S. Jafari, and T. Nikolaidis, “Gas turbine aero-engines real time on-board modelling: A review, research challenges, and exploring the future,” ''Progress in Aerospace Sciences'', vol. 121, p. 100693, Feb. 2020, doi: 10.1016/j.paerosci.2020.100693.
421
422
[12] N. Ejaz, I. N. Qureshi, and S. A. Rizvi, “Creep failure of low pressure turbine blade of an aircraft engine,” ''Engineering Failure Analysis'', vol. 18, no. 6, pp. 1407–1414, Sep. 2011, doi: 10.1016/j.engfailanal.2011.03.010.
423
424
[13] W. Xue, S. Gao, D. Duan, H. Zheng, and S. Li, “Investigation and simulation of the shear lip phenomenon observed in a high-speed abradable seal for use in aero-engines,” ''Wear'', vol. 386–387, pp. 195–203, Sep. 2017, doi: 10.1016/j.wear.2017.06.019.
425
426
[14] S. Bai, H.-Z. Huang, Y.-F. Li, A. Yu, and Z. Deng, “A modified damage accumulation model for life prediction of aero-engine materials under combined high and low cycle fatigue loading,” ''Fatigue & Fracture of Engineering Materials & Structures'', vol. 44, no. 11, pp. 3121–3134, 2021, doi: 10.1111/ffe.13566.
427
428
[15] J. Lin, J. Zhang, G. Zhang, G. Ni, and F. Bi, “Aero-engine blade fatigue analysis based on nonlinear continuum damage model using neural networks,” ''Chin. J. Mech. Eng.'', vol. 25, no. 2, pp. 338–345, Mar. 2012, doi: 10.3901/CJME.2012.02.338.
429
430
[16] D. Chen and J. Sun, “Fuel and emission reduction assessment for civil aircraft engine fleet on-wing washing,” ''Transportation Research Part D: Transport and Environment'', vol. 65, pp. 324–331, Dec. 2018, doi: 10.1016/j.trd.2018.05.013.
431
432
[17] D. Giesecke, U. Igie, P. Pilidis, K. Ramsden, and P. Lambart, “Performance and Techno-Economic Investigation of On-Wing Compressor Wash for a Short-Range Aero Engine,” Jul. 2013, pp. 235–244. doi: 10.1115/GT2012-68995.
433
434
[18] S. Madsen and L. E. Bakken, “Gas Turbine Operation Offshore: On-Line Compressor Wash Operational Experience,” presented at the ASME Turbo Expo 2014: Turbine Technical Conference and Exposition, Sep. 2014. doi: 10.1115/GT2014-25272.
435
436
[19] L. Wang, Z. Yan, F. Long, X. Shi, and J. Tang, “Parametric study of online aero-engine washing systems,” in ''2016 IEEE International Conference on Aircraft Utility Systems (AUS)'', Oct. 2016, pp. 273–277. doi: 10.1109/AUS.2016.7748058.
437
438
[20] U. Igie, P. Pilidis, D. Fouflias, K. Ramsden, and P. Laskaridis, “Industrial Gas Turbine Performance: Compressor Fouling and On-Line Washing,” ''Journal of Turbomachinery'', vol. 136, no. 10, Jun. 2014, doi: 10.1115/1.4027747.
439
440
[21] S. Adibhatla, T. Lewis, S. Adibhatla, and T. Lewis, “Model-based intelligent digital engine control (MoBIDEC),” in ''33rd Joint Propulsion Conference and Exhibit'', American Institute of Aeronautics and Astronautics, 1997. doi: 10.2514/6.1997-3192.
441
442
[22] P. S. Maybeck, “Multiple model adaptive algorithms for detecting and compensating sensor and actuator/surface failures in aircraft flight control systems,” ''International Journal of Robust and Nonlinear Control'', vol. 9, no. 14, pp. 1051–1070, 1999, doi: 10.1002/(SICI)1099-1239(19991215)9:14<1051::AID-RNC452>3.0.CO;2-0.
443
444
[23] F. Lu, Y. Chen, J. Huang, D. Zhang, and N. Liu, “An integrated nonlinear model-based approach to gas turbine engine sensor fault diagnostics,” ''Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering'', vol. 228, no. 11, pp. 2007–2021, Sep. 2014, doi: 10.1177/0954410013511596.
445
446
[24] Y. S. Chati and H. Balakrishnan, “Data-Driven Modeling of Aircraft Engine Fuel Burn in Climb Out and Approach,” ''Transportation Research Record'', vol. 2672, no. 29, pp. 1–11, Dec. 2018, doi: 10.1177/0361198118780876.
447
448
[25] A. Bobrinskoy, M. Gatti, O. Guerineau, F. Cazaurang, B. Bluteau, and E. Recherche, “Model-based fault detection and isolation design for flight-critical actuators in a harsh environment,” in ''2012 IEEE/AIAA 31st Digital Avionics Systems Conference (DASC)'', Oct. 2012, pp. 7D5-1-7D5-8. doi: 10.1109/DASC.2012.6382423.
449
450
[26] Z. Cui, S. Zhong, and Z. Yan, “Fuel savings model after aero-engine washing based on convolutional neural network prediction,” ''Measurement'', vol. 151, p. 107180, Feb. 2020, doi: 10.1016/j.measurement.2019.107180.
451
452
[27] T. S. Kim and S. Y. Sohn, “Multitask learning for health condition identification and remaining useful life prediction: deep convolutional neural network approach,” ''J Intell Manuf'', vol. 32, no. 8, pp. 2169–2179, Dec. 2021, doi: 10.1007/s10845-020-01630-w.
453
454
[28] X. Zhang ''et al.'', “Remaining Useful Life Estimation Using CNN-XGB With Extended Time Window,” ''IEEE Access'', vol. 7, pp. 154386–154397, 2019, doi: 10.1109/ACCESS.2019.2942991.
455
456
[29] X. Zhang, Y. Dong, L. Wen, F. Lu, and W. Li, “Remaining Useful Life Estimation Based on a New Convolutional and Recurrent Neural Network,” in ''2019 IEEE 15th International Conference on Automation Science and Engineering (CASE)'', Aug. 2019, pp. 317–322. doi: 10.1109/COASE.2019.8843078.
457
458
[30] P. Kakati, D. Dandotiya, and B. Pal, “Remaining Useful Life Predictions for Turbofan Engine Degradation Using Online Long Short-Term Memory Network,” presented at the ASME 2019 Gas Turbine India Conference, Jan. 2020. doi: 10.1115/GTINDIA2019-2368.
459
460
[31] J. Liu, C. Pan, F. Lei, D. Hu, and H. Zuo, “Fault prediction of bearings based on LSTM and statistical process analysis,” ''Reliability Engineering & System Safety'', vol. 214, p. 107646, Oct. 2021, doi: 10.1016/j.ress.2021.107646.
461
462
[32] C. Wang, Z. Zhu, N. Lu, Y. Cheng, and B. Jiang, “A data-driven degradation prognostic strategy for aero-engine under various operational conditions,” ''Neurocomputing'', vol. 462, pp. 195–207, Oct. 2021, doi: 10.1016/j.neucom.2021.07.080.
463
464
[33] C. Chen, N. Lu, B. Jiang, and C. Wang, “A Risk-Averse Remaining Useful Life Estimation for Predictive Maintenance,” ''IEEE/CAA Journal of Automatica Sinica'', vol. 8, no. 2, pp. 412–422, Feb. 2021, doi: 10.1109/JAS.2021.1003835.
465
466
[34] C. Che, H. Wang, X. Ni, and Q. Fu, “Performance degradation prediction of aeroengine based on attention model and support vector regression,” ''Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering'', vol. 236, no. 2, pp. 410–416, Feb. 2022, doi: 10.1177/09544100211014743.
467
468
[35] L. Li, J. Liu, S. Wei, G. Chen, E. Blasch, and K. Pham, “Smart robot-enabled remaining useful life prediction and maintenance optimization for complex structures using artificial intelligence and machine learning,” in ''Sensors and Systems for Space Applications XIV'', Apr. 2021, vol. 11755, pp. 100–108. doi: 10.1117/12.2589045.
469
470
[36] X. Li, Q. Ding, and J.-Q. Sun, “Remaining useful life estimation in prognostics using deep convolution neural networks,” ''Reliability Engineering & System Safety'', vol. 172, pp. 1–11, Apr. 2018, doi: 10.1016/j.ress.2017.11.021.
471
472
[37] W. Jiang, N. Zhang, X. Xue, Y. Xu, J. Zhou, and X. Wang, “Intelligent Deep Learning Method for Forecasting the Health Evolution Trend of Aero-Engine With Dispersion Entropy-Based Multi-Scale Series Aggregation and LSTM Neural Network,” ''IEEE Access'', vol. 8, pp. 34350–34361, 2020, doi: 10.1109/ACCESS.2020.2974190.
473
474
[38] I. Remadna, S. L. Terrissa, R. Zemouri, S. Ayad, and N. Zerhouni, “Leveraging the Power of the Combination of CNN and Bi-Directional LSTM Networks for Aircraft Engine RUL Estimation,” in ''2020 Prognostics and Health Management Conference (PHM-Besançon)'', May 2020, pp. 116–121. doi: 10.1109/PHM-Besancon49106.2020.00025.
475
476
[39] C.-H. Chu, C.-J. Lee, and H.-Y. Yeh, “Developing Deep Survival Model for Remaining Useful Life Estimation Based on Convolutional and Long Short-Term Memory Neural Networks,” ''Wireless Communications and Mobile Computing'', vol. 2020, p. e8814658, Dec. 2020, doi: 10.1155/2020/8814658.
477
478
[40] S. Hochreiter and J. Schmidhuber, “Long Short-Term Memory,” ''Neural Comput.'', vol. 9, no. 8, pp. 1735–1780, Nov. 1997, doi: 10.1162/neco.1997.9.8.1735.
479
480
[41] M. Liu, L. Li, H. Hu, W. Guan, and J. Tian, “Image caption generation with dual attention mechanism,” ''Information Processing & Management'', vol. 57, no. 2, p. 102178, Mar. 2020, doi: 10.1016/j.ipm.2019.102178.
481
482
[42] S. Chen, M. Zhang, X. Yang, Z. Zhao, T. Zou, and X. Sun, “The Impact of Attention Mechanisms on Speech Emotion Recognition,” ''Sensors'', vol. 21, no. 22, Art. no. 22, Jan. 2021, doi: 10.3390/s21227530.
483
484
[43] A. Fanfarillo, B. Roozitalab, W. M. Hu, G. Cervone. “Probabilistic forecasting using deep generative models,” [javascript:void(0) ''Geoinformatica''],'' ''vol. 25, no. 1, pp. 127–141, Jan. 2021, doi:10.1007/s10707-020-00425-8.
485
486
[44] W. Zhang, X. Li, H. Ma, Z. Luo, and X. Li, “Transfer learning using deep representation regularization in remaining useful life prediction across operating conditions,” ''Reliability Engineering & System Safety'', vol. 211, p. 107556, Jul. 2021, doi: 10.1016/j.ress.2021.107556.
487

Return to Fu et al 2022a.

Back to Top

Document information

Published on 16/05/23
Accepted on 08/05/23
Submitted on 08/05/22

Volume 39, Issue 2, 2023
DOI: 10.23967/j.rimni.2023.05.002
Licence: CC BY-NC-SA license

Document Score

1

Views 46
Recommendations 1

Share this document

claim authorship

Are you one of the authors of this document?