Ensemble forecast of tropical cyclone tracks based on deep neural networks

Chong WANG , Qing XU , Yongcun CHENG , Yi PAN , Hong LI

Front. Earth Sci. ›› 2022, Vol. 16 ›› Issue (3) : 671 -677.

PDF (2139KB)
Front. Earth Sci. ›› 2022, Vol. 16 ›› Issue (3) : 671 -677. DOI: 10.1007/s11707-021-0931-8
RESEARCH ARTICLE
RESEARCH ARTICLE

Ensemble forecast of tropical cyclone tracks based on deep neural networks

Author information +
History +
PDF (2139KB)

Abstract

A nonlinear artificial intelligence ensemble forecast model has been developed in this paper for predicting tropical cyclone (TC) tracks based on the deep neural network (DNN) by using the 24-h forecast data from the China Meteorological Administration (CMA), Japan Meteorological Agency (JMA) and Joint Typhoon Warning Center (JTWC). Data from a total of 287 TC cases over the Northwest Pacific Ocean from 2004 to 2015 were used to train and validate the DNN based ensemble forecast (DNNEF) model. The comparison of model results with Best Track data of TCs shows that the DNNEF model has a higher accuracy than any individual forecast center or the traditional ensemble forecast model. The average 24-h forecast error of 82 TCs from 2016 to 2018 is 63 km, which has been reduced by 17.1%, 16.0%, 20.3%, and 4.6%, respectively, compared with that of CMA, JMA, JTWC, and the error-estimation based ensemble method. The results indicate that the nonlinear DNNEF model has the capability of adjusting the model parameter dynamically and automatically, thus improving the accuracy and stability of TC prediction.

Graphical abstract

Keywords

tropical cyclone track / deep neural network / ensemble forecast

Cite this article

Download citation ▾
Chong WANG, Qing XU, Yongcun CHENG, Yi PAN, Hong LI. Ensemble forecast of tropical cyclone tracks based on deep neural networks. Front. Earth Sci., 2022, 16(3): 671-677 DOI:10.1007/s11707-021-0931-8

登录浏览全文

4963

注册一个新账户 忘记密码

1 Introduction

Tropical cyclones (TCs) can cause enormous damage to human lives and properties owing to severe flooding, destructive wind and coastal inundation from storm surges. The accurate forecast of TC track is essential for emergency responders to resist the influence of TCs effectively.

Traditional objective TC forecast techniques can be divided into three categories: dynamic numerical models, statistical prediction method and ensemble forecast technology. Dynamic numerical models use dynamic equations to achieve the forecast of TC tracks (Jeffries et al., 1993; Elsberry, 1995). With better understanding of TC dynamics and the development of computing powers, a variety of numerical models have been developed and widely applied for operational TC forecast such as the GRAPES (Global/Regional Assimilation and PrEdiction System) model developed at China Meteorological Administration (CMA) (e.g., Zhang and Shen, 2008), the GFS (Global Forecast System) model developed at National Centers for Environmental Prediction (NCEP) of the United States (e.g., Rappaport et al., 2009; Saha et al., 2014), the IFS (Integrated Forecasting System) model developed at the European Centre for Medium-Range Weather Forecasts (ECMWF) (e.g., Raddaway, 2012), and so on. With the application of advanced data assimilation techniques in numerical models, the accuracy of TC track prediction has been significantly improved (Elsberry, 1995; Hamill et al., 2011) and the 24-h forecast error is less than 90 km (Peng et al., 2017). However, these numerical models have different architectures, such as the representation of topography, data assimilation schemes, treatment of physics and dynamics, spatial resolutions, and initialization etc., which makes the forecast results have different deviations from the observations (Plu, 2011; Chen et al., 2013). In recent years, the improvement of TC track forecast accuracy has slowed down significantly, and some scientists believe that numerical models have reached a bottleneck (Landsea and Cangialosi, 2018).

The statistical prediction models are usually based on the regression method, among which the Climatology and Persistance (CLIPER) model proposed by Neumann and Lawrence (1975) is considered to be the most classic one (Veigas, 1996; Knaff et al., 2007). The CLIPER model establishes the regression between the TC track and climatic persistence factors, such as the longitude and latitude of the TC center, the maximum wind speed and minimum air pressure in the past few hours. Although the demand for computing resources has reduced greatly, the forecast accuracy is usually worse than that of numerical models.

In the late 1960s and early 1970s, an ensemble forecast technology based on the initial value disturbance and mode disturbance was proposed by Epstein (1969) and Leith (1974). Since then, the technology has been widely used for predicting the tracks of TCs, wind field and sea surface temperature all over the world (Toth and Kalnay, 1997; Buizza et al., 1999; Leutbecher and Palmer, 2008; Buckingham et al., 2010; Li et al., 2019). However, this method requires a huge amount of computing resources. Later, Krishnamurti et al. (1999) and Zhi et al. (2011) proposed some kinds of multi-model ensemble method for predicting the wind field as the weighted average of the results from several independent numerical models. This method uses a weighted average formula to fit the complex relationships between different numerical model results. Yuan et al. (2017) improved the weighting method based on the error-estimation principle and developed an ensemble model for TC track forecast by using the historical forecast results from several forecast centers. The overall accuracy of the ensemble model has been increased, but the empirical parameter in this model needs to be determined first at each forecast time, which leads to some uncertainty and instability of the forecast results to some extent. In addition, the model cannot completely simulate the complex relationship between different forecast models due to its linearity.

Recently, neural networks, especially the deep neural networks (DNNs) have been applied to TC track prediction. Jin et al. (2015) and Zhu et al. (2016) used pure linear neural network (PLNN) and Bayesian neural network (BNN) to predict the 24-h TC track in the South China Sea, and the average forecast errors are 122.8 km and 125.4 km respectively. By inputting historical TC track data into a long-short-term memory (LSTM) model, Gao et al. (2018) achieved an accuracy of 105.7 km for TCs in the Northwest Pacific Ocean. Although the accuracy is not very high, these studies show that the neural networks have great potential in the fast prediction of TC tracks due to its high-speed computing performance.

The DNN is a feedforward neural network, which is inspired by the notion that a neuron’s computation involves a weighted sum of the input values. The network can be viewed as consisting of numerous piecewise functions and is able to simulate complex nonlinear functions (Sze et al., 2017). It has achieved great success in applications of speech recognition, image recognition, time series prediction and other research fields (Krizhevsky et al., 2012; Deng et al., 2013; Silver et al., 2016; Esteva et al., 2017). In this study, we aim to develop a 24-h TC track forecast model by combining the ensemble technique with DNN, i.e., a DNN based ensemble model which integrates forecast results from several individual forecast centers. In this way, the nonlinear relationship between different forecast models can be learned effectively. We organize the paper as follow. Section 2 introduces the data set and methods. The results of our model will be presented and discussed in Section 3. Section 4 is the conclusions.

2 Materials and methods

2.1 Materials

The input data of the DNN based ensemble forecast (DNNEF) model are 24-h forecast results of TC tracks, i.e., the longitude or/and latitude of the TC center within the next 24 h, published every 6 h by CMA (available at CMA Tropical Cyclone Data Center website), Japan Meteorological Agency (JMA, available at Japan Meteorological Agency website) and Joint Typhoon Warning Center (JTWC, available at Naval Oceanographic Portal-Public Facing website) in the United States. The Best Track data provided by CMA (Ying et al., 2014) with a time interval of 6 h is used as the “ground truth data” for verification of the model.

Totally, we obtained 369 TC cases over the Northwest Pacific (NWP) Ocean from 2004 to 2018. Data for 287 TCs in the first 12 years are randomly divided into training and validation data sets at a ratio of 8:2, which are used for training the DNNEF model and preventing model overfitting, respectively. The remaining data between 2016 and 2018 (Fig.1) are set as test data and used to evaluate the model performance. One can see from Fig.1(a) that the TCs generated over the NWP Ocean travel on different paths. Some move to the west, some to the northwest, while others suddenly change their directions. As shown in Fig.1(b), there are totally 28 abrupt turning TCs which turned more than 60° along the movement direction within 24 h according to the definition proposed by Wu et al. (2011).

2.2 Methods

A typical DNN model contains a combination of input layer, hidden layer and output layer. The hidden layer is used to simulate the complex nonlinear relationship between the input and output. According to the different input and output parameters, we designed three schemes for TC track forecast, as shown in Tab.1. Scheme A consists of two models which predict the longitude and latitude of the TC center, respectively, with the longitude or latitude information forecasted by CMA, JMA and JTWC as the model input. Scheme B also includes two models, but with both longitude and latitude forecast results from three centers as input in either the longitude or the latitude forecast model. Scheme C only contains one model, and the input of this model is the same as that in Scheme B, but the longitude and latitude are predicted at the same time.

Schemes A, B and C have the same architecture except for the different parameters in the input and output layers, which is shown in Fig.2. Each DNNEF model consists of one input layer, two hidden layers, and one output layer. The activate function is “linear” in the output layer, and “sigmoid” in the other layers. The loss function is set as “mse.” Take the longitude forecast model in Scheme A as an example, with the 24-h forecast results of TC longitude from three forecast centers as the model input, the first hidden layer will establish 18 links between the three inputs and the six neurons. Then 24 links will be generated between the four neurons in the second hidden layer and the six neurons in the first hidden layer. Finally, four links are established between the second hidden layer and the output layer. In this way, the DNN model will learn a total of 62 parameters. As shown in Fig.3, the model stops training when the loss of the training and validation data tends to be stable. The model with the lowest loss of the validation data is then retained as the model of the test data.

Since the TC track forecast results from JTWC are usually released later than CMA or JMA, we will also investigate how many models of input data are necessary for an accurate and efficient forecast in an DNNEF model by comparing the performance of our models which combines the forecast results from all the three forecast centers (DNNEF-CJJ model) and that from CMA and JMA only (DNNEF-CJ model).

3 Results and discussion

To evaluate the performance of the DNNEF model, the forecast error is presented as the distance D between the forecasted and observed TC centers, which is calculated as

D=R × arcos[cos (Y1)× cos( Y2)× cos( X1X2)+sin ( Y1)× sin ( Y2)],

where R is the radius of the earth; X1 and Y1 are the longitude and latitude forecasted by the model, respectively; X2 and Y2 are the observed longitude and latitude, respectively.

We also compared our model results with that from the error-estimation based ensemble forecast (EEF) model proposed by Yuan et al. (2017):

Pi= P0i+k× ei,

ai= 1/|ei| i=1N(1/| ei|),

Pe= i=1NaiP i,

where P0i is the TC center position vector of the original forecast from the ith forecast center, including the longitude and latitude; P i is the position vector of the corrected forecast value; k is the empirical parameter, ei is the estimated deviation between the TC center position predicted by the ith forecast center and the real TC track; ai is the weight coefficient of the forecast result from the ith center; N is the total number of the forecast centers; Pe is the final TC center position vector predicted by the EEF model.

Compared with Scheme A, when both latitude and longitude are input into the DNNEF model, the forecast result of Scheme B is slightly better (Tab.1), indicating that the additional introduction of latitude (or longitude) into the model helps to improve the performance of the longitude (or latitude) forecast model. However, the result of Scheme C gets worse if the latitude and longitude forecast models are integrated into one model. One possible reason is that for a single latitude or longitude forecast model, we only need to adjust each model to the optimal configuration. The integration of the two models in Scheme C makes the problem more complicated and difficult to get better results based on the existing data.

As shown in Tab.2 and Fig.4, our DNN based ensemble forecast models in Scheme B have a higher accuracy than all the individual models of the forecast center or the EEF model for 24-h TC track prediction. Compared with CMA, JMA, JTWC and the EEF model, the average forecast error of the two DNNEF models both reduced by 17%, 16%, 20%, 5%, respectively. Although our DNNEF model only performs slightly better than the error-estimation based ensemble forecast model, it is more stable because there is no need to adjust the empirical parameters manually at each forecast time. In addition, the model can better simulate the complex nonlinear relationship between different numerical models.

If we compare the DNNEF-CJJ and DNNEF-CJ models, we can find that the 3-year average forecast errors of the two models are the same, although the former has additional input data from JTWC. In 2017, the accuracy of the DNNEF-CJJ model is even a little lower than the DNNEF-CJ model partly due to the relatively unstable forecast results of JTWC in this year. This indicates that the DNNEF model can produce a fairly accurate prediction of TC tracks with the combination of CMA and JMA outputs. At the same time, considering that the JTWC forecast results are generally released later than CMA and JMA, the DNNEF-CJ model may be more efficient and suitable for the operational forecast of TC tracks in the Northwest Pacific Ocean.

The forecast error of the latitude and longitude of TC center in 2016−2018 predicted by the DNN based ensemble forecast models and three forecast centers are shown inTab.3 and Fig.5. One can see the JTWC, JMA, and CMA TC forecast data are not homogeneous. The forecast error bias changes from year to year. Take the JTWC forecast result as an example, it tends to have track error bias to the left or west of a TC track in 2016 and 2017 TC seasons. In 2018 season, this bias changes to a right or eastward bias. Although the forecast error bias of different forecast centers changes every year, both the DNNEF-CJJ and DNNEF-CJ models can well adapt to the inter-annual variation of the bias and achieve better forecast results. This indicates that the DNN based ensemble forecast method is able to restrain the propagation of uncertainties caused by these biases to a certain extent.

We further analyzed the model performance for individual TCs. Fig.6 shows the comparison of 24-h forecast results from different models with the observations of TC Lionrock in August 2016 and Ampil in July 2018. Lionrock was generated on August 17, 2016. It became a tropical storm on August 18, intensified into TC category 5 days later, and made landfall on August 30 near Ofunato in northeastern Honshu. Ampil was formed on the evening of July 18, 2018. It made landfall in Shanghai’s Chongming Island at noon on July 22, with the intensity continuing weakened and disappeared on July 24. These two TCs caused huge economic damage to Japan and China. As shown in Fig.6, both are steering TCs which are more difficult to forecast.

The 24-h forecast results of CMA, JMA, and JTWC are all improved greatly by the DNNEF models. For Lionrock, compared with CMA and JMA, the TC track forecast error of our DNNEF-CJ model is significantly reduced by 14% and 29%, respectively. For Lionrock, which suddenly turned its direction at (130.5°E, 23.0°N), the track forecast error of our DNNEF-CJ model is significantly reduced by 29% compared with that of JMA. As shown in Fig.7, the accuracy of the DNN based model is higher than that of the three forecast centers in the whole life cycle of the TC and the maximum improvement in the model performance is more than 80% compared with CMA or JTWC forecasts. In general, the DNN based ensemble model is more helpful to modify the track forecast at the early and weakening stage of the TC. For Ampil, the forecast errors reduced by 22% compared with that of CMA. Also, the DNNEF models capture the movement of Ampil more accurately at its early stage, while JMA’s forecasts during this period are quite different from the observations. This indicates that the trend of the TC track predicted by the DNNEF model is more stable, thus avoiding the abnormally large error that might exist in the forecast results of different agencies at some time.

To investigate the performance of the DNN based ensemble model in forecasting the movement of TCs with sudden track changes between 2016 and 2018 (Fig.1(b)), we carried out a statistical analysis. Take the DNN-CJ model as an example, as shown in Tab.2, the 24-h track forecast accuracy is significantly improved by 14.5%, 13.4%, 18.4%, and 2.7%, respectively, compared with that of CMA, JMA, JTWC and EEF, indicating that the DNN ensemble forecast model also has the ability to accurately predict the tracks of abrupt turning TCs.

4 Conclusions

In this study, a completely objective model for predicting the 24-h TC tracks was developed based on the deep neural network. With an ensemble of the forecast results from CMA, JMA and/or JTWC, the DNN based ensemble forecast model has a higher accuracy than that applied by the individual forecast centers. Both the overall forecast results of TCs between 2016 and 2018 and case studies show that the DNNEF model is more accurate and stable.

Compared with the traditional ensemble forecast model, our DNNEF model need no manual adjustment of empirical parameters. The model is able to simulate the nonlinear relationship between the forecast results from different models or forecast centers and correct the results automatically. Also, it performs slightly better in terms of model accuracy.

By inputting and outputting different parameters in the DNNEF model in three schemes, we find that the input of both longitude and latitude information helps to improve the performance of our forecast model. The average 24-h forecast error of TCs from 2016 to 2018 has been significantly reduced by 17.1%, 16.0%, and 20.3%, respectively, compared with that from the CMA, JMA and JTWC. The comparison between the DNNEF-CJ and DNNEF-CJJ models shows that the former, which combines the outputs from only two forecast centers (CMA and JMA), can predict the TC track more quickly with comparable accuracy. Therefore, the DNNEF-CJ model is more suitable for operational forecast due to simple data processing and high practicability.

References

[1]

BuckinghamC, MarchokT, GinisI, RothsteinL, RoweD. ( 2010). Short- and medium-range prediction of tropical and transitioning cyclone tracks within the NCEP global ensemble forecasting system. Weather Forecast, 25( 6): 1736– 1754

[2]

BuizzaR, MilleerM, PalmerT N. ( 1999). Stochastic representation of model uncertainties in the ECMWF ensemble prediction system. Q J R Meteorol Soc, 125( 560): 2887– 2908

[3]

ChenG M, YuH, CaoQ, ZengZ. ( 2013). The performance of global models in TC track forecasting over the Western North Pacific from 2010 to 2012. Trop Cyclone Res Rev, 2( 3): 149– 158

[4]

DengL LiJ Huang J T YaoK YuD Seide F SeltzerM ZweigG HeX Williams J GongY AceroA (2013). Recent advances in deep learning for speech research at Microsoft. In: Proceedings of International Conference on Acoustics, Speech, and Signal. Vancouver: 8604– 8608

[5]

ElsberryR L. ( 1995). Recent advancements in dynamical tropical cyclone track predictions. Meteorol Atmos Phys, 56( 1−2): 81– 99

[6]

EstevaA, KuprelB, NovoaR A, KoJ, SwetterS M, BlauH M, ThrunS. ( 2017). Dermatologist-level classification of skin cancer with deep neural networks. Nature, 542( 7639): 115– 118

[7]

EpsteinE S. ( 1969). Stochastic dynamic prediction. Tellus, 21( 6): 739– 759

[8]

GaoS, ZhaoP, PanB, LiY, ZhouM, XuJ, ZhongS, ShiZ. ( 2018). A nowcasting model for the prediction of typhoon tracks based on a long short term memory neural network. Acta Oceanol Sin, 37( 5): 8– 12

[9]

HamillT M, WhitakerJ S, FiorinoM, BenjaminS G. ( 2011). Global ensemble predictions of 2009’s tropical cyclones initialized with an ensemble kalman filter. Mon Weather Rev, 139( 2): 668– 688

[10]

JeffriesR A SampsonC R CarrL E ChuJ (1993). Tropical cyclone forecasters reference guide numerical track forecast guidance. Tech. Rep. No. NRL/PU/7515–93–0011

[11]

JinJ, LiM, JinL. ( 2015). Data normalization to accelerate training for linear neural net to predict tropical cyclone tracks. Math Probl Eng, 2015: 931629

[12]

KnaffJ A, SampsonC R, DeMariaM, MarchokT P, GrossJ M, McAdieC J. ( 2007). Statistical tropical cyclone wind radii prediction using climatology and persistence. Weather Forecast, 22( 4): 781– 791

[13]

KrishnamurtiT N, KishtawalC M, LaRowT E, BachiochiD R, ZhangZ, WillifordC E, GadgilS, SurendranS. ( 1999). Improved weather and seasonal climate forecasts from multimodel superensemble. Science, 285( 5433): 1548– 1550

[14]

KrizhevskyA, SutskeverI, HintonG E. ( 2012). ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems, 1097– 1105

[15]

LandseaC W CangialosiJ P ( 2018). Have we reached the limits of predictability for tropical cyclone track forecasting? Bull Am Meteorol Soc, 99(11): 2237–2243

[16]

LeithC E. ( 1974). Theoretical skill of Monte Carlo forecasts. Mon Weather Rev, 102( 6): 409– 418

[17]

LeutbecherM, PalmerT N. ( 2008). Ensemble forecasting. J Comput Phys, 227( 7): 3515– 3539

[18]

LiH, LuoJ, XuM. ( 2019). Ensemble data assimilation and prediction of typhoon and associated hazards using TEDAPS: Evaluation for 2015–18 Seasons. Front Earth Sci, 13( 4): 733– 743

[19]

NeumannC J, LawrenceM B. ( 1975). An operational experiment in the satistical-dynamical prediction of tropical cyclone motion. Mon Weather Rev, 103( 8): 665– 673

[20]

PengX, FeiJ, HuangX, ChengX. ( 2017). Evaluation and error analysis of official forecasts of tropical cyclones during 2005–14 over the Western North Pacific. Part I: storm Tracks. Weather Forecast, 32( 2): 689– 712

[21]

PluM. ( 2011). A new assessment of the predictability of tropical cyclone tracks. Mon Weather Rev, 139( 11): 3600– 3608

[22]

RaddawayB (2012). Newsletter No.130-Winter 2011/12. Available at ECMWF website

[23]

RappaportE N, FranklinJ L, AvilaL A, BaigS R, BevenJ L II, BlakeE S, BurrC A, JiingJ G, JuckinsC A, KnabbR D, LandseaC W, MainelliM, MayfieldM, McAdieC J, PaschR J, SiskoC, StewartS R, TribbleA N. ( 2009). Advances and challenges at the National Hurricane Center. Weather Forecast, 24( 2): 395– 419

[24]

SahaS, MoorthiS, WuX, WangJ, NadigaS, TrippP, BehringerD, HouY T, ChuangH, IredellM, EkM, MengJ, YangR, MendezM P, van den DoolH, ZhangQ, WangW, ChenM, BeckerE. ( 2014). The NCEP climate forecast system version2. J Clim, 27( 6): 2185– 2208

[25]

SilverD, HuangA, MaddisonC J, GuezA, SifreL, van den DriesscheG, SchrittwieserJ, AntonoglouI, PanneershelvamV, LanctotM, DielemanS, GreweD, NhamJ, KalchbrennerN, SutskeverI, LillicrapT, LeachM, KavukcuogluK, GraepelT, HassabisD. ( 2016). Mastering the game of Go with deep neural networks and tree search. Nature, 529( 7587): 484– 489

[26]

SzeV, ChenY, YangT, EmerJ S. ( 2017). Efficient processing of deep neural networks: a tutorial and survey. Proc IEEE, 105( 12): 2295– 2329

[27]

TothZ, KalnayE. ( 1997). Ensemble forecasting at NCEP and the breeding method. Mon Weather Rev, 125( 12): 3297– 3319

[28]

VeigasK W (1996). The development of statistical-physical hurricane prediction model. Final Report, U.S.W.B. Contract Cwb 10966, Travellers Weather Research Center, Hartford, CT, 1996. 19

[29]

WuL, ZongH, LiangJ. ( 2011). Observational analysis of sudden tropical cyclone track changes in the vicinity of the East China Sea. J Atmos Sci, 68( 12): 3012– 3031

[30]

YingM, ZhangW, YuH, LuX, FengJ, FanY, ZhuY, ChenD. ( 2014). An overview of the China Meteorological Administration tropical cyclone database. J Atmos Ocean Technol, 31( 2): 287– 301

[31]

YuanJ ChenY PanY Dong J LuoY (2017). Improvement of ensemble forecast of typhoon track in the Northwestern Pacific. Marine Forecasts, 34(2): 37– 42 (in Chinese)

[32]

ZhangR H, ShenX S. ( 2008). On the development of GRAPES-A new generation of the national operational NWP system in China. Chin Sci Bull, 53: 3429– 3432

[33]

ZhiX, ZhangL, BaiY. ( 2011). Application of the multi-model ensemble forecast in the QPF. In: Proceedings of International Conference on Information Science and Technology, 657– 660

[34]

ZhuL, JinJ, CannonA J, HsiehW W. ( 2016). Bayesian neural networks based bootstrap aggregating for tropical cyclone tracks prediction in South China Sea. In: Proceeding of International Conference on Neural Information Processing, 475– 482

RIGHTS & PERMISSIONS

Higher Education Press

AI Summary AI Mindmap
PDF (2139KB)

1796

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/