Next Article in Journal
Reservoir Sediment Management and Downstream River Impacts for Sustainable Water Resources—Case Study of Shihmen Reservoir
Previous Article in Journal
Reconstruction of Planforms and Architecture of the Meandering Paleo-Channels—A Case Study of the 1st Member of Shanxi Formation, Central of Sulige Gas Field, Ordos Basin
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improvement of Extreme Value Modeling for Extreme Rainfall Using Large-Scale Climate Modes and Considering Model Uncertainty

1
School of Civil and Environmental Engineering, Yonsei University, Seoul 03722, Korea
2
School of Civil Engineering and Environmental Science, University of Oklahoma, Norman, OK 73072, USA
3
High Impact Weather Research Department, National Institute of Meteorological Sciences, Seogwipo 63568, Korea
*
Author to whom correspondence should be addressed.
Water 2022, 14(3), 478; https://doi.org/10.3390/w14030478
Submission received: 10 December 2021 / Revised: 29 January 2022 / Accepted: 2 February 2022 / Published: 6 February 2022
(This article belongs to the Section Hydrology)

Abstract

:
Extreme value modeling for extreme rainfall is one of the most important processes in the field of hydrology. For the improvement of extreme value modeling and its physical meaning, large-scale climate modes have been widely used as covariates of distribution parameters, as they can physically account for climate variability. This study proposes a novel procedure for extreme value modeling of rainfall based on the significant relationship between the long-term trend of the annual maximum (AM) daily rainfall and large-scale climate indices. This procedure is characterized by two main steps: (a) identifying significant seasonal climate indices (SCIs), which impact the long-term trend of AM daily rainfall using statistical approaches, such as ensemble empirical mode decomposition, and (b) selecting an appropriate generalized extreme value (GEV) distribution among the stationary GEV and nonstationary GEV (NS-GEV) using time and SCIs as covariates by comparing their model fit and uncertainty. Our findings showed significant relationships between the long-term trend of AM daily rainfall over South Korea and SCIs (i.e., the Atlantic Meridional Mode, Atlantic Multidecadal Oscillation in the fall season, and North Atlantic Oscillation in the summer season). In addition, we proposed a model selection procedure considering both the Akaike information criterion and residual bootstrap method to select an appropriate GEV distribution among a total of 59 GEV candidates. As a result, the NS-GEV with SCI covariates generally showed the best performance over South Korea. We expect that this study can contribute to estimating more reliable extreme rainfall quantiles using climate covariates.

1. Introduction

Extreme value modeling for hydrological extreme events (e.g., heavy rainfall and floods) is an essential task for the design of hydraulic structures. For extreme value modeling, the block maxima approach has been widely used, and it uses a sequence of maximum observations extracted from equal periods, such as annual maximum (AM) daily rainfall. The set of block maxima is assumed to be independent and identically distributed (iid) and is fitted with a probability distribution model, such as a generalized extreme value (GEV) distribution, to estimate the design quantiles of extreme hydrological events.
As the impact of climate change has become a significant issue, many efforts have been made to consider nonstationarity in hydrologic applications. One popular approach applies a variety of candidate nonstationary models to the nonstationary data and selects an appropriate model based on model diagnostics. It generally employs the maximum likelihood estimation (MLE) method to estimate nonstationary model parameters due to its adaptability to changes in model structures [1]. So far, this approach has been studied extensively and can be referred to as a “user-friendly” method. However, there is an issue regarding ergodicity in nonstationary extreme value modeling from a statistical point of view [2,3,4]. More specifically, statistical properties from temporal statistics would involve the assumption of ergodicity since AM time series can be theoretically considered as a stochastic process. However, if the observed data has a trend or is affected by external variables (i.e., the process is nonstationary), the ergodicity cannot hold and inductive inference from the data would not be possible [3,4,5]. In short, it is necessary to determine the relationships between model parameters and covariates prior to model parameter estimation to deal with the ergodicity issue. This is an important issue that needs to be addressed in terms of statistical hydrology, but hydrological applications mostly accept nonstationary models, as we mentioned earlier. Hence, we apply the standardized approach in this study, and the limitations regarding ergodicity will be discussed in the discussion section later.
For nonstationary extreme value modeling, the nonstationary GEV (NS-GEV) is a representative model, and it employs time or exogenous covariates (e.g., large-scale climate modes and hydrological variables) [6,7,8,9,10,11,12,13,14,15]. Here, time t is commonly used as a covariate for modeling the linear or polynomial trend of GEV parameters. It is a simple and straightforward way to apply nonstationarity to the probability distribution model. However, considering only time t as a covariate could lead to problems such as an increase in uncertainty of quantile estimates and distortion of the probability distribution model according to extrapolation [16,17]. In addition, the trends of hydrological data can vary in the short or long term because of climate variability and other external forces [18,19,20]. Thus, large-scale climate modes have been employed as potential climate covariates since the mid-2000s as they can consider natural climate variability.
In general, extreme value modeling for rainfall with climate covariates has been performed as follows: (1) identifying significant climate indices related to extreme rainfall; and (2) selecting an appropriate probability distribution model among the various GEV models based on the statistical evaluation of the model fit. As such, various forms of NS-GEV can be modeled to estimate the magnitude and occurrence frequency of extreme rainfall events using climate indices as covariates [8,9,11,12,14,21]. Then, information criteria such as Akaike information criterion (AIC) and Bayesian information criterion (BIC) are generally used to select the most appropriate model. For example, Vasiliades et al. [8] considered the four climate indices (i.e., the Southern Oscillation Index (SOI), North Atlantic Oscillation (NAO), Pacific Decadal Oscillation (PDO), and Mediterranean Oscillation Index (MOI)) reported to illustrate the dependence of the Mediterranean climate on large-scale low-frequency climate patterns. These indices were employed as covariates for the GEV model with a conditional density network, and an appropriate GEV model was finally selected based on the corrected AIC (AICc) and BIC. Agilan and Umamahesh [9] conducted extreme value modeling for developing a nonstationary rainfall intensity-duration-frequency curve. They used five covariates, including El-Nino-Southern Oscillation (ENSO) and Indian Ocean Dipole (IOD), which are known to impact extreme rainfall over India; then, they selected an appropriate model using the AICc. They concluded that the global processes (i.e., global warming, ENSO cycle, and IOD) are the best covariates for the long-duration extreme rainfall of Hyderabad city, India.
As shown in the abovementioned studies, many previous works considered several climate indices reported in the literature for modeling NS-GEV of rainfall extremes. The climate indices were directly used as covariates of location and scale parameters, which are closely linked to the mean and variance of the data, respectively. It could be possible to model a more reasonable NS-GEV by considering the connection between climate indices and the trend inherent in extreme rainfall. For this, a preliminary analysis is necessary to determine the trends in the mean and variance of AM rainfall and to identify the impact of climate indices on these trends. Recently, many researchers have employed modern signal processing known as ensemble empirical mode decomposition (EEMD) to extract long-term trends from a given data series. The EEMD has the advantage of detecting long-term trends after extracting oscillatory patterns from an original data series [22,23]. It is also successfully applied to hydro-climatology variables [24,25,26,27]. For example, Kim et al. [27] identified that the behavior of the Atlantic Multidecadal Oscillation (AMO) index indicates a long-term trend in the monthly precipitation series of South Korea through a statistical analysis procedure along with EEMD. Chen et al. [26] employed the EEMD to explore the trend of the AM daily precipitation data. They extracted long-term trends in the mean and variance of the AM daily precipitation data and successfully identified that the extracted trends can be applied to extreme value modeling.
The increase of uncertainty in design levels is one of the most important issues in nonstationary extreme value modeling. The use of a more complex model (i.e., nonstationary distribution model) generally allows a good fit for the given samples, but this could provide quantile estimates with a large amount of uncertainty [3]. In general, however, the best model is selected using only the information criteria that assess the model performance based on the maximized log-likelihood (i.e., model fit) [26,28,29,30,31,32,33], so that the selected nonstationary model usually yields unreliable design quantiles [16]. Cooley [34] demonstrated that relying solely on the information criteria is not enough to select an appropriate model because they cannot take into account uncertainty. Nevertheless, most of the previous studies assessed uncertainties in model parameters and design quantiles after selecting the best model based on the information criteria [13,16,17,21,35,36,37,38,39,40,41]. Therefore, to provide more reliable design quantiles, the uncertainty of the candidate models should be assessed prior to selecting an appropriate model. The bootstrap method is generally recommended to measure the uncertainty of quantile estimates [36,37,38,42]. It is computationally efficient and provides realistic asymmetric confidence intervals (CIs) without asymptotic assumptions [3,10].
Many studies have attempted extreme value modeling by applying the NS-GEV with climate covariates to estimate rainfall quantiles considering climate variability. However, most of these are limited to modeling for several rainfall gauging stations [8,9,11,12,14,36], and few studies propose an overall procedure from the identification of appropriate climate indices impacting regional rainfall extremes to the selection of an appropriate nonstationary distribution model with climate covariates (e.g., India [21], southern France [38], and southern U.S. [43]). To the best of our knowledge, there has been less attention to considering the uncertainty in the model selection procedure. The main objective of this study is to propose a procedure for extreme value modeling with large-scale climate modes and to consider the uncertainty of model selection. It can be divided into two main subsections: (a) identifying significant seasonal climate indices (SCIs) that impact the long-term trends of AM daily rainfall using the EEMD, and (b) selecting an appropriate GEV distribution considering both model fit and uncertainty among the stationary GEV (ST-GEV) and various NS-GEVs using time and SCIs as covariates. The EEMD was applied to AM daily rainfall observed at 61 stations over South Korea, and the residue was extracted that represent the long-term trend of AM daily rainfall. By conducting correlation analysis between the extracted residues and the various climate indices, significant SCIs that impact the trend of AM daily rainfall were selected. Extreme value modeling was then performed using the significant climate indices as covariates of GEV parameters. Considering both model fit and uncertainty, an appropriate GEV distribution was finally selected at each station. We also discussed the physical meaning of the significant climate indices selected over South Korea and the feasibility of the procedure for nonstationary extreme value modeling.

2. Data

2.1. Annual Maximum Daily Rainfall Data

The spatial domain of the study is South Korea, East Asia. Rainfall over South Korea is mainly affected by the subtropical East Asian summer monsoon and typhoons. Therefore, approximately 70% of the annual rainfall is concentrated in the rainy season from June to September [44,45,46]. We extracted annual maximum (AM) daily rainfall data observed at 61 stations over South Korea. The data recorded until 2018 cover a minimum of 40 years with no missing year. Figure 1 illustrates the study area and the location of the rainfall stations, and associated information is presented in Table 1. The monthly rate of the AM daily rainfall occurrences over South Korea was identified, as depicted in Figure 2. Most of the AM daily rainfall occurred between June and September, particularly in July and August.

2.2. Climate Indices

Climate indices are quantitative indicators representing large-scale climate modes in the ocean-atmospheric system. They can be characterized by variations in the sea surface temperature, sea level pressure, atmospheric circulation, and so on. Each climate index is derived through various measurements and is widely used to represent regional climatic characteristics [27]. In the current study, we used a total of 12 climate indices that can cover the observed rainfall data periods: Atlantic Meridional Mode (AMM), AMO, Arctic Oscillation (AO), NAO, Niño 1 + 2 (NINO12), Niño 3.4 (NINO34), Niño 4 (NINO4), North Pacific pattern (NP), PDO, Pacific Meridional Mode (PMM), Pacific-North American pattern (PNA), and SOI. These climate indices were obtained from the Physical Sciences Division (PSD) of the National Oceanic and Atmospheric Administration Earth System Research Laboratory (NOAA/ESRL) (www.esrl.noaa.gov/psd/data/climateindices/list (accessed on 7 June 2019)).

3. Methodology

This study is grouped largely into two main steps to conduct extreme value modeling with large-scale climate modes: Step 1. identifying significant climate indices that impact the trend of AM daily rainfall based on statistical approaches; Step 2. selecting an appropriate GEV distribution by comparing the performance of various ST- and NS-GEV with climate covariates. The procedure can be subdivided into five steps. Figure 3 presents a brief overview of this study, and the details of the methodology are described in the following subsections.

3.1. Seasonal Climate Indices

All climate indices are provided on a monthly scale. To employ the SCIs as covariates in nonstationary extreme value modeling, we converted the monthly climate indices to seasonal ones by seasonal averaging (i.e., Spring: March-April-May (MAM); Summer: June-July-August (JJA); Fall: September-October-November (SON); and Winter: December-January-February (DJF)). To consider the time lag for using SCIs as a predictor, the SCIs observed prior to the occurrence of AM daily rainfall should be used. As most AM daily rainfall events over South Korea occur in the JJA season (see Figure 2), the four seasons of lagged SCIs before the JJA season are considered as described in Table 2.

3.2. Ensemble Empirical Mode Decomposition

EEMD is a decomposition method that has been used widely in hydrological time series. By decomposing an original time series into a set of intrinsic mode functions (IMFs), the EEMD can subsequently extract a long-term trend inherent in the time series. The IMFs indicate physically meaningful information, and they should satisfy two conditions: (1) the number of extrema and zero crossings must either be equal to each other or differ at most by 1 in the original time series, and (2) the mean of the upper and lower envelopes, which are defined by connecting all local minima and maxima, should be zero at any point [47]. A monotonic function that remains after the decomposition is a residue, which indicates a long-term trend. Here, we briefly describe the EEMD procedure with an example of a time series, as shown in Figure 4. The red-dotted and blue-dotted lines are the upper and lower envelopes of the original time series, respectively, and the gray-dotted line is the mean value of the local maxima and local minima. Assuming that there is an original time series, we can obtain N number of IMFs as follows: (1) identify the upper and lower envelopes by connecting the local maxima and minima ( y u ( t ) ,   y l ( t ) ) using a cubic spline; (2) calculate the mean value of the local maxima and local minima as y m ( t ) = ( y u ( t ) + y l ( t ) ) / 2 ; (3) extract the mean value from the original time series as h ( t ) = y ( t ) y m ( t ) ; (4) repeat steps (1) to (3) until h ( t ) satisfies the condition of the IMF; (5) define a new time series by extracting h ( t ) from y ( t ) and repeat steps (1) to (5) until no more IMFs can be extracted. Finally, y ( t ) is composed of the sum of the IMFs and a residue ( r e s ( t ) ) , as presented in Equation (1):
y ( t ) = i = 1 N I M F i ( t ) + r e s ( t )
where N is the number of IMFs.
In this study, the EEMD is applied to AM daily rainfall to extract the long-term trend of AM daily rainfall as follows (for additional details, see Chen et al. [26]): (1) define a residue r e s m ( t ) , extracted from the original time series y ( t ) using the EEMD procedure, as a long-term trend in the mean of AM daily rainfall (hereafter referred to as the mean trend); (2) calculate the time series of the variance as v a r ( t ) = ( y ( t ) r e s m ( t ) ) 2 ; (3) define a residue r e s v ( t ) , extracted from the time series of the variance v a r ( t ) using the EEMD procedure, as a long-term trend in the variance of AM daily rainfall (hereafter referred to as the variance trend).

3.3. Spearman’s Rank Correlation Analysis

Spearman’s rank correlation is one of the most widely used statistical estimators to measure the statistical dependence between two different hydrometeorological series [14,38,48,49]. It is based on the nonparametric rank correlation and describes the statistical relationship using a monotonic function. Spearman’s rank correlation coefficient ( r s ) between two time series X 1 ( t ) and X 2 ( t ) is defined by Equation (2):
r s = 1 6 t = 1 n d t 2 n ( n 2 1 )
where d t = r a n k ( X 1 ( t ) ) r a n k ( X 2 ( t ) ) is the difference between the two ranks of each observation at time t , and n is the number of observations. If the two time series have a perfect monotonic function, the value of r s is close to −1 or 1. As a rule of thumb, a value of r s between 0.7 and 1.0 (−0.7 and −1.0) represents a strong positive (negative) correlation, a value between 0.3 and 0.7 (−0.3 and −0.7) represents a moderate positive (negative) correlation, and a value between 0 and 0.3 (0 and −0.3) represents a weak positive (negative) correlation [50,51,52].
To identify the significant SCIs impacting AM daily rainfall over South Korea, r s between the SCI and the mean and variance trends of AM daily rainfall are calculated for all employed stations. Then, the percentage of stations with a significant r s value ( P s i g ) at the 1% significance level is calculated for each SCI.

3.4. Generalized Extreme Value Distribution Modeling

The GEV distribution is widely used for estimating the magnitude and occurrence probability of hydrological extreme events. Let X = x 1 ,   x 2 ,   ,   x n be the AM time series data with sample size n , which are assumed to be independent and identically distributed (iid). The cumulative distribution function ( F ( x ) ) of the ST-GEV for X is expressed by Equation (3):
F ( x ) = exp [ { 1 + ξ x μ σ } 1 / ξ ]
where μ is the location parameter related to the mean of the data, σ (>0) is the scale parameter related to the variability of the data, and ξ is the shape parameter related to the heaviness of the distribution tail.
When a trend or the impact of external variables in the observed data is considered in extreme value modeling, the distribution parameters can be modeled as functions of covariates, such as time or climate indices. In this study, the location and scale parameters of the GEV distribution were defined as functions of time-dependent covariates as follows:
μ ( t ) = μ 0 + μ 1 c o v 1 ( t ) + μ 2 c o v 2 ( t ) + + μ p c o v p ( t )
σ ( t ) = exp ( σ 0 + σ 1 c o v 1 ( t ) + σ 2 c o v 2 ( t ) + + σ p c o v p ( t ) )
where p is the number of covariates and c o v p ( t ) is the p -th covariate at time t . As the scale parameter should be positive for all t , an exponential function was employed as a link function for the scale parameter. The shape parameter was assumed to be stationary, as it is difficult to estimate the shape parameter with precision [1,10].
The combinations of significant SCIs are used as covariates of location and scale parameters. Depending on the nonstationarity in the GEV parameters, two types of NS-GEV are defined: NS-GEV(1,0,0), in which only a location parameter is assumed to be a function of covariates, and NS-GEV(1,1,0), in which both location and scale parameters are assumed to be functions of covariates. The parameters of GEV distributions are estimated using the MLE method.

3.5. Appropriate Model Selection Considering Uncertainty

The Akaike information criterion (AIC) is an information-theoretic model selection method based on Kullback–Leibler information loss [53]. Since the AIC evaluates model performance based on the maximized log-likelihood and the number of parameters for the fitted distribution, a distribution model with a good fit and parameter parsimony was selected as an appropriate model. The equation of the AIC is expressed as
A I C = 2 log ( M L ) + 2 k
where log ( M L ) is the maximized log-likelihood of the fitted distribution and k is the number of distribution parameters. By comparing the AIC value of candidate distribution models, the model with the smallest AIC value is selected as an appropriate model, considering both the parameter parsimony and the goodness-of-fit. The AIC of each model can be rescaled as follows:
r A I C = A I C min ( A I C )
where r A I C is the rescaled A I C and min ( A I C ) is the smallest A I C among all candidate models. The models with r A I C 2 can also be reasonable choices [53].
The bootstrap method is generally recommended to measure the uncertainty of the quantile estimate in a nonstationary distribution model because it is computationally efficient and provides realistic asymmetric confidence intervals (CIs) without asymptotic assumptions [3,10]. For the NS-GEV, the residual bootstrap method has been employed to calculate the confidence intervals for the parameter and quantile estimates [9,14,54,55,56]. The residual bootstrap method can be conducted as follows:
  • For the fitted GEV distribution, transform the AM time series data ( X = x 1 ,   x 2 ,   ,   x n ) into the standardized residuals with no trend ( X ˜ = x ˜ 1 ,   x ˜ 2 ,   ,   x ˜ n ) as follows [1]:
    x t ˜ = 1 ξ ^ ln [ 1 + ξ ^ ( x t μ ^ ( t ) σ ^ ( t ) ) ] ,    t = 1 ,   2 , , n
    where the μ ^ ( t ) , σ ^ ( t ) , and ξ ^ are the location, scale, and shape parameters of the fitted GEV distribution, respectively.
  • Obtain a new sample of X ˜ by resampling residuals with replacement and back-transforming the resampled residuals using Equation (8).
  • For back-transformed samples, estimate the T-year quantile at each time t ( Q ^ T ( t ) , t = 1 ,   2 , , n ) using the same GEV distribution.
  • Repeat steps (2)–(3) N times and calculate the time-averaged 95% CIs for the T-year quantiles ( C I Q T 95 % ) as follows:
    C I Q T 95 % = 1 n t = 1 n ( Q ^ T 97.5 % ( t ) Q ^ T 2.5 % ( t ) )
    where Q ^ T 97.5 % ( t ) and Q ^ T 2.5 % ( t ) are the 97.5th and 2.5th percentiles of N ordered samples of Q ^ T ( t ) .
In this study, the model selection procedure combining the AIC and residual bootstrap method is performed to select an appropriate GEV model considering both model fit and uncertainty. First, the AIC values of all GEV candidates are calculated, and the GEV distributions with r A I C 2 are selected. Second, for each selected GEV distribution, the residual bootstrap method is repeated 1000 times to calculate the time-averaged 95% CIs for the 100-year quantiles ( C I Q 100 95 % ). Finally, the GEV distribution with the smallest C I Q 100 95 % is selected as an appropriate distribution model.

4. Results

4.1. Mean and Variance Trends of AM Daily Rainfall Extracted Using EEMD

At first, the residue of AM daily rainfall is extracted using EEMD. Figure 5 presents the time series data and extracted residues of AM daily rainfall and its variance at two representative stations (Ulleungdo and Daegu) for illustration purposes. On the left side, the black line indicates the observed AM daily rainfall at the station, and the red line represents a residue ( r e s m ( t ) ) indicating the long-term mean trend of AM daily rainfall. On the right side, the black line is the time series of the variance, and the red line is a residue ( r e s v ( t ) ) indicating the long-term variance trend of AM daily rainfall. Chen et al. [26] modeled these residues as linear or quadratic functions of time and directly used them as time-dependent parameters of nonstationary probability distribution models. In this study, the extracted residues were used to find the significant climate indices that impact the long-term trend of AM daily rainfall.

4.2. Significant Seasonal Climate Indices for AM Daily Rainfall over South Korea

Table 3 presents the percentage of stations with a significant r s value ( P s i g ) between the lagged SCIs and r e s m ( t ) . A P s i g greater than 50% is highlighted in bold. Among the 12 climate indices, the AMM, AMO, and NAO indices include SCIs with more than 50% P s i g . Table 4 presents the P s i g between the lagged SCIs and r e s v ( t ) . Similar to the results presented in Table 3, the AMM, AMO, and NAO indices include SCIs with more than 50% P s i g .
The r s values between the mean trend of AM daily rainfall and (a) AMM, (b) AMO, (c) NAO indices are depicted in Figure 6. The seasonal AMM indices indicate positive r s values in the overall inland area, whereas they indicate negative r s values in some eastern and southern coastal areas of the Korean Peninsula. For each season, the average absolute r s value for all stations ( | r s | ¯ ) was calculated. Among the four seasonal AMM indices, the AMM index for the SON(−1) season, AMM_SON(−1), indicates the strongest correlation ( | r s | ¯ = 0.55) with the mean trend of AM daily rainfall over South Korea in Figure 6a. The results of the AMO index are very similar to those of the AMM_SON(−1). For all seasons, the AMO index indicates a strong correlation with the mean trend of AM daily rainfall in Figure 6b. Among them, AMO_SON(−1) indicates the strongest correlation ( | r s | ¯ = 0.59). In contrast to the results of the AMO indices, the NAO index demonstrates different patterns according to the seasons. While the NAO_JJA(−1) and NAO_SON(−1) present overall negative correlations with the mean trend of AM daily rainfall, the NAO_DJF(−1) and NAO_MAM indicate very weak positive correlations in Figure 6c. In particular, NAO_JJA(−1) presents the strongest correlation ( | r s | ¯ = 0.34), and its spatial pattern is opposite to those of AMM_SON(−1) and AMO_SON(−1).
The r s values between the variance trend of AM daily rainfall and the (a) AMM, (b) AMO, and (c) NAO indices are depicted in Figure 7. The spatial pattern of r s values of the variance trend is similar to that of the mean trend, except that the r s values in some inland areas (lying between latitude 35.5° N–37.5° N and longitude 126.5° E–129.5° E) have an opposite sign compared with those of the mean trend. As with the results for the mean trend, AMM_SON(−1), AMO_SON(−1), and NAO_JJA(−1) present the strongest correlations ( | r s | ¯ = 0.56, 0.60, and 0.36, respectively) with the variance trend of AM daily rainfall over South Korea. Therefore, these three SCIs were finally selected as the significant climate indices linked closely to the long-term trends of AM daily rainfall over South Korea.

4.3. Appropriate Model Selection Considering Model Fit and Uncertainty

The combinations of three significant SCIs selected in Section 4.2 (AMM_SON(−1), AMO_SON(−1), NAO_JJA(−1)) were used as covariates of location and scale parameters in the GEV distribution. As presented in Table 5, 59 GEV distributions including ST-GEV and NS-GEV with time covariates were considered as candidate models. Following the model selection procedure described in Section 3.5, an appropriate GEV distribution was selected for each station. Table 6 presents the model selection result at the Daegu station (No. 143) as an example. Among the 59 GEV candidate models, 9 NS-GEVs and ST-GEV with r A I C 2 were selected to assess the uncertainty. Thereafter, the bootstrap time-averaged 95% CIs for the 100-year quantile ( C I Q 100 95 % ) were calculated. As a result, the NS-GEV(1,0,0) that uses AMO_SON(−1) as covariates of location parameter and constant scale parameter was finally selected as an appropriate GEV model (as shown in bold in Table 5). To evaluate the quality of the model fit, the QQ-plots were also examined. Figure 8 presents the QQ-plots of the ST-GEV, NS-GEV with time covariate, and selected NS-GEV with SCI covariates at the Daegu station (No. 143). The selected NS-GEV with SCI covariate provides a better fit than the ST-GEV and NS-GEV with time covariate.
Figure 9 illustrates the model selection results at 61 stations over South Korea according to the distribution type classified in Table 5. The ST-GEV, NS-GEV with time covariate, and NS-GEV with SCI covariates were selected at 18 stations (29.5%), 3 stations (4.9%), and 40 stations (65.6%), respectively. It is worth noting that the NS-GEV with SCI covariates was selected at a majority of stations, whereas the NS-GEV with time covariate was selected at only 3 stations. In the case of the NS-GEV with SCI covariates, the number of stations where the NS-GEV(1,0,0) was selected was larger than that where the NS-GEV(1,1,0) was selected.

5. Discussions

5.1. Significant Climate Indices for Annual Maximum Rainfall over South Korea

Based on the correlation analysis with the long-term trend of AM daily rainfall over South Korea, the three significant SCIs (AMM_SON(−1), AMO_SON(−1), and NAO_JJA(−1)) were selected. In nonstationary extreme value modeling, it is necessary to verify that external variables such as climate indices are physically meaningful and justifiable to use as covariates [10]. Therefore, the physical mechanisms of teleconnections between the selected climate indices and the East Asian summer monsoons (EASMs), which mainly affect rainfall over South Korea, were investigated by previous literature as follows.
The AMO, which exhibits an oscillation of the sea surface temperatures (SSTs) in the Atlantic, is one of the most significant indices leading to global-scale climate variability, particularly in the Northern Hemisphere [57,58]. Although the influence of the AMO on the variability in EASMs is not fully understood, there have been several studies to explain the relationship between AMO and EASM variability [58,59,60,61]. Lu et al. [58] suggested that a positive AMO causes positive SST anomalies in the eastern Indian Ocean and maritime continent, and this atmospheric heating leads to an anticyclonic anomaly at the low troposphere over the western North Pacific, resulting in strong EASMs. Si and Ding [60] described that through the AMO-Northern Hemisphere teleconnection, the positive and negative AMO affect the SST anomalies in the midlatitude central Pacific and the geopotential height anomalies over the Huanghe–Huaihe River and Yangtze River, resulting in variability in summer precipitation over East Asia. Li et al. [61] explained that the tropical easterly jet (TEJ) is a significant component of the Afro-Asian monsoon system and directly affects rainfall near East Asia. They described that warm AMO phases cause intensification of the TEJ, which could increase rainfall over East Asia. On the other hand, there is a lack of studies on how the AMM physically affects the EASMs. However, the AMM, characterized by meridional variations in SST and sea level pressure (SLP) in the tropical Atlantic, is highly correlated with AMO and is affected by AMO on decadal time scales [62]. In addition, the AMM_SON(−1) and AMO_SON(−1) also present very similar patterns, as displayed in Figure 5 and Figure 6. These results imply that the AMM may affect the EASMs indirectly.
The NAO, which describes the variation of the sea level pressure between the Icelandic Low and the Azores High [63], is known to affect the weather and climate over East Asia in addition to the North Atlantic [64,65,66,67]. In particular, some studies have illustrated that the summer NAO affects the EASMs [68,69,70]. Linderholm et al. [68] suggested that the North Atlantic storm tracks and transient eddy activity related to the summer NAO can affect the EASMs. Sun and Wang [69] described that the summer NAO can cause a meridional dipole pattern over East Asia by changing the zonal wave activity over the Eurasian Continent, and it would finally contribute to summer rainfall over central and northern East Asia. Wang et al. [70] explained that the summer NAO affects the summer rainfall in East China through an intermediate bridge effect, resulting from the thermal forcing of the Tibetan Plateau.
Although there are some differences in the lag and time scale of teleconnections between the climate indices and the EASMs, the abovementioned studies support the acceptability of the selected climate indices by explaining the physical mechanism for teleconnections. In addition, this implies that the residue of AM daily rainfall extracted by the EEMD was properly used to detect the significant SCIs that impact extreme rainfall over South Korea. The selected SCIs could be employed as predictors in extreme value modeling for AM daily rainfall.

5.2. Consideration of Uncertainty in Model Selection

In nonstationary extreme value modeling, the uncertainty caused by the increase in the complexity of the model is a crucial issue [3,4]. However, information criteria such as AIC or BIC (Bayesian information criterion) are solely used to determine the best model [8,9,14,26,32,33]. Although these methods attempt to select a less complex model considering model parsimony, they are not enough to consider model uncertainty. In this study, a model selection procedure combining the information criterion and residual bootstrap method was proposed to consider both the model fit and uncertainty. That is, instead of selecting the best model with the smallest AIC value, distribution models whose r A I C is equal to or less than 2 were first selected as acceptable candidates. Among the selected candidate models, the model with the smallest uncertainty measured by the residual bootstrap method was finally selected as the appropriate model. This stepwise procedure allows us to select the most appropriate model with acceptable goodness-of-fit and low uncertainty and to estimate more reliable design quantiles in practice. In addition, it reduces the analytical and computational burden of uncertainty assessment by narrowing down the number of model candidates. Even if several exogenous variables are used as covariates for nonstationary extreme value modeling, a number of candidate models can be considered due to probability distribution types [19], combinations of covariates [8,9], and their link function types [26,31,33]. Therefore, our suggested model selection procedure can be an efficient and reasonable way to select the most appropriate model.

5.3. Suggestions for Extreme Value Modeling with SCI Covariates

Extreme value modeling was conducted using the three significant SCIs (AMM_SON(−1), AMO_SON(−1), and NAO_JJA(−1)) as covariates of GEV parameters. Consequently, the ratio of stations where the NS-GEV with SCI covariates was selected was 65.6%, whereas the ratio of stations where the NS-GEV with time covariate was selected was 4.9%. This result indicates a notable improvement in extreme value modeling in terms of both model fit and uncertainty by applying the significant SCIs as covariates. Moreover, the NS-GEV with SCI covariates could rationally explain the long-term trend and variability of AM daily rainfall compared to ST-GEV and NS-GEV with the time covariate. In nonstationary extreme value modeling, much attention has focused on modeling the relationship between extreme data and their attributions [71,72], instead of modeling the linear or polynomial trend in extreme data using time t. Our results also support the use of climate indices as covariates for modeling extreme rainfall. Therefore, here, we propose to identify the significant climate indices properly based on EEMD and perform extreme value modeling using the selected climate indices to estimate more reliable rainfall quantiles in practice.

5.4. Limitations of this Study

One limitation of this study is that we only consider the GEV distribution to conduct nonstationary extreme value modeling, as the GEV distribution is recommended for the estimation of the design quantile of extreme rainfall in South Korea [73]. Other distribution models, such as generalized logistic, generalized normal, and Pearson type III distributions, can also be the candidate distributions for fitting AM rainfall series [74,75,76]. To the best of our knowledge, there are a limited number of studies that have considered nonstationarity in other distributions so far [35,77]. Therefore, we believe that this study can be improved by applying nonstationarity to various probability distributions and comparing their performance with that of NS-GEVs for future studies.
Another limitation is the sensitivity of data length to the result. In this study, we used observed rainfall and climate indices available until 2018. Due to the chaotic nature of the climate system, there is a possibility of providing varying results by adding the latest observations. For example, the SCI may be differently selected because there is no significant difference in P s i g between seasons (see Table 3 and Table 4). This can lead to another source of uncertainty in nonstationary extreme value modeling. Thus, we remain this limitation for future studies that should be assessed for the selection of significant SCIs.
Last but not least, we employ the nonstationary extreme value modeling based on the “user-friendly” method, not considering the ergodicity issue, as we mentioned earlier in the Introduction section. Thus, this study estimated parameters of model candidates with all possible combinations of SCIs by directly applying the MLE method to the nonstationary data. Then, we selected the best model by comparing all candidate models. To deal with the ergodicity issue, it is necessary to define the deterministic relationships between the selected SCI and extreme rainfall based on the physical mechanism prior to parameter estimation. Since the significant SCIs that affect the long-term trend of extreme rainfall were identified throughout this study, we believe further study can be extended to adopt the ergodicity issue together with the current study.

6. Conclusions

In nonstationary extreme value modeling, it is essential to reflect the trend in statistical characteristics, such as the mean and variance of the observations, because nonstationarity is generally considered by the time-dependent location and/or scale parameters of the probability distribution model. Using physically meaningful information, such as climate indices as covariates, a probability distribution model can be improved in terms of not only considering nonstationarity but also reducing uncertainty. In this study, the procedure of extreme value modeling with large-scale climate modes was proposed, from the identification of appropriate SCIs impacting regional AM rainfall based on EEMD to the selection of appropriate NS-GEV, considering both model fit and uncertainty. Using the EEMD, residues that indicate the long-term trends in the mean and variance were extracted from AM daily rainfall. Subsequently, the correlation coefficient was calculated between the extracted residues and various lagged SCIs. After the identification of appropriate SCIs, the AMM_SON(−1), AMO_SON(−1), and NAO_JJA(−1) were finally selected as significant SCIs with an impact on the long-term trends in both the mean and variance of AM daily rainfall over South Korea. The combinations of these significant SCIs were employed as covariates of location and scale parameters for constructing various NS-GEVs.
As the uncertainty increases with the complexity of the probability distribution model, it is necessary to consider the uncertainty to select an appropriate probability distribution model. However, there have been few studies that consider uncertainty in the model selection procedure. In this study, the model selection procedure combining the AIC and residual bootstrap method was proposed to select an appropriate GEV model among ST-GEV, NS-GEV with time covariate, and NS-GEV with SCI covariates. Thus, the NS-GEV with SCI covariates was selected as an appropriate model for more than 65% of the applied stations in South Korea.
Nowadays, uncertainty is the main issue in nonstationary extreme value modeling. Rather than simply using the time covariate, employing appropriate climate indices as covariates can reduce uncertainty. The EEMD could be employed to detect the significant climate indices for using the covariates of the nonstationary extreme value model because it can extract the long-term trend inherent in the time series. Further, by selecting an appropriate probability distribution model considering both model fit and uncertainty, more accurate and reliable quantiles could be estimated. Although this study focused on the determination of significant climate indices and their application to extreme value modeling of extreme rainfall over South Korea, the method and discussion presented are expected to be extended to various hydrological variables, as well as to other regions.

Author Contributions

Conceptualization, H.K., T.K. and J.-H.H.; Data curation, H.K.; Formal analysis, H.K.; Funding acquisition, J.-H.H.; Investigation, H.K. and T.K.; Methodology, H.K., T.K. and J.-Y.S.; Supervision, J.-H.H.; Visualization, H.K.; Writing—original draft, H.K. and T.K.; Writing—review & editing, J.-Y.S. and J.-H.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (No. 2019R1A2C2010854).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The climate indices used in this study are available on the Physical Sciences Division (PSD) of the National Oceanic and Atmospheric Administration Earth System Research Laboratory (NOAA/ESRL) (www.esrl.noaa.gov/psd/data/climateindices/list (accessed on 10 December 2021)).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Coles, S. An Introduction to Statistical Modeling of Extreme Values; Springer: London, UK, 2001. [Google Scholar]
  2. Koutsoyiannis, D.; Montanari, A. Negligent killing of scientific concepts: The stationarity case. Hydrol. Sci. J. 2015, 60, 1174–1183. [Google Scholar] [CrossRef]
  3. Serinaldi, F.; Kilsby, C.G. Stationarity is undead: Uncertainty dominates the distribution of extremes. Adv. Water Resour. 2015, 77, 17–36. [Google Scholar] [CrossRef] [Green Version]
  4. Mondal, A.; Mujumdar, P.P. Hydrologic Extremes Under Climate Change: Non-stationarity and Uncertainty. In Sustainable Water Resources Planning and Management under Climate Change; Kolokytha, E., Oishi, S., Teegavarapu, R.S.V., Eds.; Springer: Singapore, 2017; pp. 39–60. ISBN 978-981-10-2051-3. [Google Scholar]
  5. Montanari, A.; Koutsoyiannis, D. Modeling and mitigating natural hazards: Stationarity is immortal! Water Resour. Res. 2014, 50, 9748–9756. [Google Scholar] [CrossRef]
  6. Cheng, L.; Aghakouchak, A. Nonstationary precipitation intensity-duration-frequency curves for infrastructure design in a changing climate. Sci. Rep. 2014, 4, 7093. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Prosdocimi, I.; Kjeldsen, T.R.; Miller, J.D. Detection and attribution of urbanization effect on flood extremes using nonstationary flood-frequency models. Water Resour. Res. 2015, 51, 4244–4262. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Vasiliades, L.; Galiatsatou, P.; Loukas, A. Nonstationary Frequency Analysis of Annual Maximum Rainfall Using Climate Covariates. Water Resour. Manag. 2015, 29, 339–358. [Google Scholar] [CrossRef]
  9. Agilan, V.; Umamahesh, N.V. What are the best covariates for developing non-stationary rainfall Intensity-Duration-Frequency relationship? Adv. Water Resour. 2017, 101, 11–22. [Google Scholar] [CrossRef]
  10. Salas, J.D.; Obeysekera, J.; Vogel, R.M. Techniques for assessing water infrastructure for nonstationary extreme events: A review. Hydrol. Sci. J. 2018, 63, 325–352. [Google Scholar] [CrossRef]
  11. Ouarda, T.B.M.J.; El-Adlouni, S. Bayesian nonstationary frequency analysis of hydrological variables. J. Am. Water Resour. Assoc. 2011, 47, 496–505. [Google Scholar] [CrossRef]
  12. El Adlouni, S.; Ouarda, T.B.M.J.; Zhang, X.; Roy, R.; Bobée, B. Generalized maximum likelihood estimators for the nonstationary generalized extreme value model. Water Resour. Res. 2007, 43, W03410. [Google Scholar] [CrossRef]
  13. Ganguli, P.; Coulibaly, P. Does nonstationarity in rainfall require nonstationary intensity-duration-frequency curves? Hydrol. Earth Syst. Sci. 2017, 21, 6461–6483. [Google Scholar] [CrossRef] [Green Version]
  14. Cannon, A.J. A flexible nonlinear modelling framework for nonstationary generalized extreme value analysis in hydroclimatology. Hydrol. Process. 2010, 24, 673–685. [Google Scholar] [CrossRef]
  15. Zhang, X.; Wang, J.; Zwiers, F.W.; Groisman, P.Y. The Influence of Large-Scale Climate Variability on Winter Maximum Daily Precipitation over North America. J. Clim. 2010, 23, 2902–2915. [Google Scholar] [CrossRef]
  16. Mondal, A.; Daniel, D. Return levels under nonstationarity: The need to update infrastructure design strategies. J. Hydrol. Eng. 2019, 24, 1–11. [Google Scholar] [CrossRef]
  17. Agilan, V.; Umamahesh, N.V. Covariate and parameter uncertainty in non-stationary rainfall IDF curve. Int. J. Climatol. 2018, 38, 365–383. [Google Scholar] [CrossRef]
  18. Kenyon, J.; Hegerl, G.C. Influence of modes of climate variability on global precipitation extremes. J. Clim. 2010, 23, 6248–6262. [Google Scholar] [CrossRef] [Green Version]
  19. López, J.; Francés, F. Non-stationary flood frequency analysis in continental Spanish rivers, using climate and reservoir indices as external covariates. Hydrol. Earth Syst. Sci. 2013, 17, 3189–3203. [Google Scholar] [CrossRef] [Green Version]
  20. Casanueva, A.; Rodríguez-Puebla, C.; Frías, M.D.; González-Reviriego, N. Variability of extreme precipitation over Europe and its relationships with teleconnection patterns. Hydrol. Earth Syst. Sci. 2014, 18, 709–725. [Google Scholar] [CrossRef] [Green Version]
  21. Jha, S.; Das, J.; Goyal, M.K. Low frequency global-scale modes and its influence on rainfall extremes over India: Nonstationary and uncertainty analysis. Int. J. Climatol. 2021, 41, 1873–1888. [Google Scholar] [CrossRef]
  22. Wu, Z.; Huang, N.E. Ensemble Empirical Mode Decomposition: A Noise-Assisted Data Analysis Method. Adv. Adapt. Data Anal. 2009, 1, 1–41. [Google Scholar] [CrossRef]
  23. Castino, F.; Bookhagen, B.; Strecker, M.R. Rainfall variability and trends of the past six decades (1950–2014) in the subtropical NW Argentine Andes. Clim. Dyn. 2017, 48, 1049–1067. [Google Scholar] [CrossRef]
  24. Kim, Y.; Cho, K. Sea level rise around Korea: Analysis of tide gauge station data with the ensemble empirical mode decomposition method. J. Hydro-Environ. Res. 2016, 11, 138–145. [Google Scholar] [CrossRef]
  25. Zhang, X.; Li, J.; Liu, X.; Chen, Z. Improved EEMD-based standardization method for developing long tree-ring chronologies. J. For. Res. 2019, 31, 2217–2224. [Google Scholar] [CrossRef] [Green Version]
  26. Chen, P.-C.; Wang, Y.-H.; You, G.J.-Y.; Wei, C.-C. Comparison of methods for non-stationary hydrologic frequency analysis: Case study using annual maximum daily precipitation in Taiwan. J. Hydrol. 2017, 545, 197–211. [Google Scholar] [CrossRef]
  27. Kim, T.; Shin, J.-Y.; Kim, S.; Heo, J.-H. Identification of relationships between climate indices and long-term precipitation in South Korea using ensemble empirical mode decomposition. J. Hydrol. 2018, 557, 726–739. [Google Scholar] [CrossRef]
  28. Katz, R.W. Statistical Methods for Nonstationary Extremes. In Extremes in a Changing Climate: Detection, Analysis and Uncertainty; AghaKouchak, A., Easterling, D., Hsu, K., Schubert, S., Sorooshian, S., Eds.; Springer: Dordrecht, The Netherlands, 2013; pp. 15–37. ISBN 978-94-007-4479-0. [Google Scholar]
  29. Kim, H.; Kim, S.; Shin, H.; Heo, J.-H. Appropriate model selection methods for nonstationary generalized extreme value models. J. Hydrol. 2017, 547, 557–574. [Google Scholar] [CrossRef]
  30. Strupczewski, W.G.; Singh, V.P.; Feluch, W. Non-stationarity approach to at-site flood frequency modeling I. Maximum likelihood estimation. J. Hydrol. 2001, 248, 123–142. [Google Scholar] [CrossRef]
  31. Thiombiano, A.N.; St-Hilaire, A.; El Adlouni, S.E.; Ouarda, T.B.M.J. Nonlinear response of precipitation to climate indices using a non-stationary Poisson-generalized Pareto model: Case study of southeastern Canada. Int. J. Climatol. 2018, 38, e875–e888. [Google Scholar] [CrossRef]
  32. Villarini, G.; Smith, J.A.; Ntelekos, A.A.; Schwarz, U. Annual maximum and peaks-over-threshold analyses of daily rainfall accumulations for Austria. J. Geophys. Res. Atmos. 2011, 116, D05103. [Google Scholar] [CrossRef]
  33. Villarini, G.; Smith, J.A.; Serinaldi, F.; Ntelekos, A.A.; Schwarz, U. Analyses of extreme flooding in Austria over the period 1951–2006. Int. J. Climatol. 2012, 32, 1178–1192. [Google Scholar] [CrossRef]
  34. Cooley, D. Return Periods and Return Levels Under Climate Change. In Extremes in a Changing Climate: Detection, Analysis and Uncertainty; AghaKouchak, A., Easterling, D., Hsu, K., Schubert, S., Sorooshian, S., Eds.; Springer: Dordrecht, The Netherlands, 2013; pp. 97–114. ISBN 978-94-007-4479-0. [Google Scholar]
  35. Hesarkazzazi, S.; Arabzadeh, R.; Hajibabaei, M.; Rauch, W.; Kjeldsen, T.R.; Prosdocimi, I.; Castellarin, A.; Sitzenfrei, R. Stationary vs. non-stationary modelling of flood frequency distribution across northwest England. Hydrol. Sci. J. 2021, 66, 729–744. [Google Scholar] [CrossRef]
  36. Ouarda, T.B.M.J.; Charron, C.; St-Hilaire, A. Uncertainty of stationary and nonstationary models for rainfall frequency analysis. Int. J. Climatol. 2020, 40, 2373–2392. [Google Scholar] [CrossRef]
  37. O’Brien, N.L.; Burn, D.H. A nonstationary index-flood technique for estimating extreme quantiles for annual maximum streamflow. J. Hydrol. 2014, 519, 2040–2048. [Google Scholar] [CrossRef]
  38. Tramblay, Y.; Neppel, L.; Carreau, J.; Najib, K. Non-stationary frequency analysis of heavy rainfall events in southern France. Hydrol. Sci. J. 2013, 58, 280–294. [Google Scholar] [CrossRef]
  39. Silva, A.T.; Naghettini, M.; Portela, M.M. On some aspects of peaks-over-threshold modeling of floods under nonstationarity using climate covariates. Stoch. Environ. Res. Risk Assess. 2016, 30, 207–224. [Google Scholar] [CrossRef]
  40. Obeysekera, J.; Salas, J.D. Quantifying the Uncertainty of Design Floods under Nonstationary Conditions. J. Hydrol. Eng. 2014, 19, 1438–1446. [Google Scholar] [CrossRef]
  41. Hu, Y.; Liang, Z.; Chen, X.; Liu, Y.; Wang, H.; Yang, J.; Wang, J.; Li, B. Estimation of design flood using EWT and ENE metrics and uncertainty analysis under non-stationary conditions. Stoch. Environ. Res. Risk Assess. 2017, 31, 2617–2626. [Google Scholar] [CrossRef]
  42. Ng, J.L.; Abd Aziz, S.; Huang, Y.F.; Mirzaei, M.; Wayayok, A.; Rowshon, M.K. Uncertainty analysis of rainfall depth duration frequency curves using the bootstrap resampling technique. J. Earth Syst. Sci. 2019, 128, 113. [Google Scholar] [CrossRef] [Green Version]
  43. Ossandón, Á.; Rajagopalan, B.; Kleiber, W. Spatial-temporal multivariate semi-Bayesian hierarchical framework for extreme precipitation frequency analysis. J. Hydrol. 2021, 600, 126499. [Google Scholar] [CrossRef]
  44. Seo, Y.A.; Lee, Y.; Park, J.S.; Kim, M.K.; Cho, C.; Baek, H.J. Assessing changes in observed and future projected precipitation extremes in South Korea. Int. J. Climatol. 2015, 35, 1069–1078. [Google Scholar] [CrossRef]
  45. Wi, S.; Valdés, J.B.; Steinschneider, S.; Kim, T.-W. Non-stationary frequency analysis of extreme precipitation in South Korea using peaks-over-threshold and annual maxima. Stoch. Environ. Res. Risk Assess. 2016, 30, 583–606. [Google Scholar] [CrossRef]
  46. Sung, J.H.; Kim, Y.O.; Jeon, J.J. Application of distribution-free nonstationary regional frequency analysis based on L-moments. Theor. Appl. Climatol. 2018, 133, 1219–1233. [Google Scholar] [CrossRef]
  47. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Snin, H.H.; Zheng, Q.; Yen, N.C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hubert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. A Math. Phys. Eng. Sci. 1998, 454, 903–995. [Google Scholar] [CrossRef]
  48. King, A.D.; Klingaman, N.P.; Alexander, L.V.; Donat, M.G.; Jourdain, N.C.; Maher, P. Extreme Rainfall Variability in Australia: Patterns, Drivers, and Predictability. J. Clim. 2014, 27, 6035–6050. [Google Scholar] [CrossRef] [Green Version]
  49. Dehghan, Z.; Eslamian, S.S.; Modarres, R. Spatial clustering of maximum 24-h rainfall over Urmia Lake Basin by new weighting approaches. Int. J. Climatol. 2018, 38, 2298–2313. [Google Scholar] [CrossRef]
  50. Ratner, B. The correlation coefficient: Its values range between +1/−1, or do they? J. Target. Meas. Anal. Mark. 2009, 17, 139–142. [Google Scholar] [CrossRef] [Green Version]
  51. Corder, G.W.; Foreman, D.I. Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach; Wiley: Hoboken, NJ, USA, 2009; ISBN 9780470454619. [Google Scholar]
  52. Hinkle, D.E.; Wiersma, W.; Jurs, S.G. Applied Statistics for the Behavioral Sciences, 5th ed.; Houghton Mifflin: Boston, MA, USA, 2003; ISBN 9780618124053. [Google Scholar]
  53. Burnham, K.P.; Anderson, D.R. Multimodel inference: Understanding AIC and BIC in model selection. Sociol. Methods Res. 2004, 33, 261–304. [Google Scholar] [CrossRef]
  54. Kharin, V.V.; Zwiers, F.W. Estimating extremes in transient climate change simulations. J. Clim. 2005, 18, 1156–1173. [Google Scholar] [CrossRef]
  55. Khaliq, M.N.; Ouarda, T.B.M.J.; Ondo, J.-C.; Gachon, P.; Bobée, B. Frequency analysis of a sequence of dependent and/or non-stationary hydro-meteorological observations: A review. J. Hydrol. 2006, 329, 534–552. [Google Scholar] [CrossRef]
  56. Hanel, M.; Buishand, T.A.; Ferro, C.A.T. A nonstationary index flood model for precipitation extremes in transient regional climate model simulations. J. Geophys. Res. Atmos. 2009, 114, D15107. [Google Scholar] [CrossRef]
  57. Knight, J.R.; Allan, R.J.; Folland, C.K.; Vellinga, M.; Mann, M.E. A signature of persistent natural thermohaline circulation cycles in observed climate. Geophys. Res. Lett. 2005, 32, L20708. [Google Scholar] [CrossRef] [Green Version]
  58. Lu, R.; Dong, B.; Ding, H. Impact of the Atlantic Multidecadal Oscillation on the Asian summer monsoon. Geophys. Res. Lett. 2006, 33, 1–5. [Google Scholar] [CrossRef] [Green Version]
  59. Li, S.; Bates, G.T. Influence of the Atlantic Multidecadal Oscillation on the winter climate of East China. Adv. Atmos. Sci. 2007, 24, 126–135. [Google Scholar] [CrossRef]
  60. Si, D.; Ding, Y. Oceanic forcings of the interdecadal variability in east Asian summer rainfall. J. Clim. 2016, 29, 7633–7649. [Google Scholar] [CrossRef]
  61. Li, Y.; Ding, Y.; Li, W. Interdecadal variability of the Afro-Asian summer monsoon system. Adv. Atmos. Sci. 2017, 34, 833–846. [Google Scholar] [CrossRef]
  62. Vimont, D.J.; Kossin, J.P. The Atlantic Meridional Mode and hurricane activity. Geophys. Res. Lett. 2007, 34, 1–5. [Google Scholar] [CrossRef] [Green Version]
  63. van Loon, H.; Rogers, J.C. The Seesaw in Winter Temperatures between Greenland and Northern Europe. Part I: General Description. Mon. Weather Rev. 1978, 106, 296–310. [Google Scholar] [CrossRef] [Green Version]
  64. Liu, X.; Yin, Z.-Y. Spatial and Temporal Variation of Summer Precipitation over the Eastern Tibetan Plateau and the North Atlantic Oscillation. J. Clim. 2001, 14, 2896–2909. [Google Scholar] [CrossRef] [Green Version]
  65. Gu, W.; Li, C.; Li, W.; Zhou, W.; Chan, J.C.L. Interdecadal unstationary relationship between NAO and east China’s summer precipitation patterns. Geophys. Res. Lett. 2009, 36, L13702. [Google Scholar] [CrossRef]
  66. Wu, Z.; Wang, B.; Li, J.; Jin, F.-F. An empirical seasonal prediction model of the east Asian summer monsoon using ENSO and NAO. J. Geophys. Res. Atmos. 2009, 114, D18120. [Google Scholar] [CrossRef]
  67. Jin, D.; Guan, Z. Summer Rainfall Seesaw between Hetao and the Middle and Lower Reaches of the Yangtze River and Its Relationship with the North Atlantic Oscillation. J. Clim. 2017, 30, 6629–6643. [Google Scholar] [CrossRef]
  68. Linderholm, H.W.; Ou, T.; Jeong, J.H.; Folland, C.K.; Gong, D.; Liu, H.; Liu, Y.; Chen, D. Interannual teleconnections between the summer North Atlantic Oscillation and the East Asian summer monsoon. J. Geophys. Res. Atmos. 2011, 116, 1–13. [Google Scholar] [CrossRef]
  69. Sun, J.; Wang, H. Changes of the connection between the summer North Atlantic Oscillation and the East Asian summer rainfall. J. Geophys. Res. Atmos. 2012, 117, 1–12. [Google Scholar] [CrossRef]
  70. Wang, Z.; Yang, S.; Lau, N.C.; Duan, A. Teleconnection between summer NAO and East China rainfall variations: A bridge effect of the Tibetan Plateau. J. Clim. 2018, 31, 6433–6444. [Google Scholar] [CrossRef]
  71. Merz, B.; Vorogushyn, S.; Uhlemann, S.; Delgado, J.; Hundecha, Y. HESS Opinions More efforts and scientific rigour are needed to attribute trends in flood time series. Hydrol. Earth Syst. Sci. 2012, 16, 1379–1387. [Google Scholar] [CrossRef] [Green Version]
  72. Eden, J.M.; Wolter, K.; Otto, F.E.L.; Jan van Oldenborgh, G. Multi-method attribution analysis of extreme precipitation in Boulder, Colorado. Environ. Res. Lett. 2016, 11, 124009. [Google Scholar] [CrossRef] [Green Version]
  73. Ministry of Environment. Standard Guidelines for Design Flood Estimation; Ministry of Environment: Sejong City, Korea, 2019.
  74. Zaifoglu, H.; Akintug, B.; Yanmaz, A.M. Regional frequency analysis of precipitation using time series clustering approaches. J. Hydrol. Eng. 2018, 23, 1–13. [Google Scholar] [CrossRef]
  75. Kyselý, J.; Picek, J. Regional growth curves and improved design value estimates of extreme precipitation events in the Czech Republic. Clim. Res. 2007, 33, 243–255. [Google Scholar] [CrossRef]
  76. Wang, Z.; Zeng, Z.; Lai, C.; Lin, W.; Wu, X.; Chen, X. A regional frequency analysis of precipitation extremes in Mainland China with fuzzy c-means and L-moments approaches. Int. J. Climatol. 2017, 37, 429–444. [Google Scholar] [CrossRef]
  77. Kim, S.; Nam, W.; Ahn, H.; Kim, T.; Heo, J.-H. Comparison of nonstationary generalized logistic models based on Monte Carlo simulation. Proc. IAHS 2015, 371, 65–68. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The location of 61 rainfall gauging stations in South Korea.
Figure 1. The location of 61 rainfall gauging stations in South Korea.
Water 14 00478 g001
Figure 2. Rate of AM daily rainfall occurrences over South Korea.
Figure 2. Rate of AM daily rainfall occurrences over South Korea.
Water 14 00478 g002
Figure 3. Overview of a procedure of extreme value modeling with climate indices for extreme rainfall in this study.
Figure 3. Overview of a procedure of extreme value modeling with climate indices for extreme rainfall in this study.
Water 14 00478 g003
Figure 4. An example of extracting IMF from an original time series.
Figure 4. An example of extracting IMF from an original time series.
Water 14 00478 g004
Figure 5. The time series data and extracted residues of AM daily rainfall and its variance at the (a) Ulleungdo (No. 115) and (b) Daegu stations (No. 143).
Figure 5. The time series data and extracted residues of AM daily rainfall and its variance at the (a) Ulleungdo (No. 115) and (b) Daegu stations (No. 143).
Water 14 00478 g005aWater 14 00478 g005b
Figure 6. Maps with the correlation coefficient between SCI and mean residue of AM daily rainfall over South Korea. (a) AMM, (b) AMO, (c) NAO.
Figure 6. Maps with the correlation coefficient between SCI and mean residue of AM daily rainfall over South Korea. (a) AMM, (b) AMO, (c) NAO.
Water 14 00478 g006
Figure 7. Maps with the correlation coefficient between SCI and variance residue of AM daily rainfall over South Korea. (a) AMM, (b) AMO, (c) NAO.
Figure 7. Maps with the correlation coefficient between SCI and variance residue of AM daily rainfall over South Korea. (a) AMM, (b) AMO, (c) NAO.
Water 14 00478 g007
Figure 8. QQ-plots of (a) ST-GEV, (b) NS-GEV with time covariate, and (c) selected NS-GEV with SCI covariates at the Daegu station (No. 143).
Figure 8. QQ-plots of (a) ST-GEV, (b) NS-GEV with time covariate, and (c) selected NS-GEV with SCI covariates at the Daegu station (No. 143).
Water 14 00478 g008
Figure 9. Model selection results at 61 stations over South Korea.
Figure 9. Model selection results at 61 stations over South Korea.
Water 14 00478 g009
Table 1. Locations and record lengths of the rainfall gauging stations in this study.
Table 1. Locations and record lengths of the rainfall gauging stations in this study.
CodeNameLongitudeLatitudeRecord Length (year)CodeNameLongitudeLatitudeRecord Length (year)
90Sokcho128.638.350201Ganghwa126.437.745
100Daegwallyeong128.737.746202Yangpyeong127.537.545
101Chuncheon127.737.952203Icheon127.537.345
105Gangneung128.937.857211Inje128.238.145
108Seoul127.037.657212Hongcheon127.937.745
112Incheon126.637.557221Jecheon128.237.245
114Wonju127.937.345226Boeun127.736.545
115Ulleungdo130.937.557232Cheonan127.136.845
119Suwon127.037.354235Boryeong126.636.345
127Chungju128.037.045236Buyeo126.936.345
129Seosan126.536.850238Geumsan127.536.145
130Uljin129.437.046243Buan126.735.745
131Cheongju127.436.651244Imsil127.335.645
133Daejeon127.436.449245Jeongeup126.935.645
135Chupungnyeong128.036.257247Namwon127.335.445
138Pohang129.436.057260Jangheung126.934.745
140Gunsan126.836.050261Haenam126.634.645
143Daegu128.635.957262Goheung127.334.645
146Jeonju127.235.857272Yeongju128.536.945
152Ulsan129.335.657273Mungyeong128.136.645
156Gwangju126.935.257277Yeongdeok129.436.545
159Busan129.035.157278Uiseong128.736.445
162Tongyeong128.434.850279Gumi128.336.145
165Mokpo126.434.857281Yeongcheon129.036.045
168Yoesu127.734.757284Geochang127.935.745
170Wando126.734.445285Hapcheon128.235.645
174Suncheon127.435.045288Miryang128.735.545
184Jeju126.533.557289Sancheong127.935.445
188Seongsan126.933.445294Geoje128.634.945
189Seogwipo126.633.257295Namhae127.934.845
192Jinju128.035.249
Table 2. Four seasons of lagged SCIs were considered in this study.
Table 2. Four seasons of lagged SCIs were considered in this study.
AbbreviationJJA(−1)SON(−1)DJF(−1)MAM
DescriptionAveraged from Jun. to Aug. (summer season) in the previous year of AM rainfall occurrenceAveraged from Sep. to Nov. (fall season) in the previous year of AM rainfall occurrenceAveraged from Dec. to Feb. (winter season) in the previous year of AM rainfall occurrenceAveraged from Mar. to May (spring season) in the year of AM rainfall occurrence
Table 3. Percentage of stations with a significant r s value between the lagged SCI and r e s m ( t ) at the 1% significance level.
Table 3. Percentage of stations with a significant r s value between the lagged SCI and r e s m ( t ) at the 1% significance level.
Climate IndexSeason
JJA(−1)SON(−1)DJF(−1)MAM
AMM-77%67.2%-
AMO75.4%77%77%77%
AO----
NAO57.4%-13.1%-
NINO124.9%---
NINO4----
NINO3419.7%---
NP---6.6%
PDO9.8%--9.8%
PMM----
PNA---8.2%
SOI----
Table 4. Percentage of stations with a significant r s value between the lagged SCI and r e s v ( t ) at the 1% significance level.
Table 4. Percentage of stations with a significant r s value between the lagged SCI and r e s v ( t ) at the 1% significance level.
Climate IndexSeason
JJA(−1)SON(−1)DJF(−1)MAM
AMM-75.4%68.9%-
AMO72.1%82%80.3%73.8%
AO----
NAO68.9%-13.1%-
NINO123.3%---
NINO4----
NINO3414.8%---
NP---3.3%
PDO13.1%--11.5%
PMM----
PNA---8.2%
SOI----
Table 5. Description of 59 GEV candidate models in this study.
Table 5. Description of 59 GEV candidate models in this study.
Distribution TypeGEV ParametersThe Number of Models
Location, μ ( t ) Scale, σ ( t ) Shape, ξ
ST-GEV μ σ ξ 1
NS-GEV(1,0,0) with time covariate μ 0 + μ 1 t σ ξ 1
NS-GEV(1,1,0) with time covariate μ 0 + μ 1 t exp ( σ 0 + σ 1 t ) ξ 1
NS-GEV(1,0,0) with SCI covariates
(1)
μ 0 + μ 1 S C I 1 ( t )
(2)
μ 0 + μ 1 S C I 2 ( t )
(3)
μ 0 + μ 1 S C I 3 ( t )
(4)
μ 0 + μ 1 S C I 1 ( t ) + μ 2 S C I 2 ( t )
(5)
μ 0 + μ 1 S C I 1 ( t ) + μ 2 S C I 3 ( t )
(6)
μ 0 + μ 1 S C I 2 ( t ) + μ 2 S C I 3 ( t )
(7)
μ 0 + μ 1 S C I 1 ( t ) + μ 2 S C I 2 ( t ) +
(8)
μ 3 S C I 3 ( t )
σ ξ 7
NS-GEV(1,1,0) with SCI covariates
(1)
μ 0 + μ 1 S C I 1 ( t )
(2)
μ 0 + μ 1 S C I 2 ( t )
(3)
μ 0 + μ 1 S C I 3 ( t )
(4)
μ 0 + μ 1 S C I 1 ( t ) + μ 2 S C I 2 ( t )
(5)
μ 0 + μ 1 S C I 1 ( t ) + μ 2 S C I 3 ( t )
(6)
μ 0 + μ 1 S C I 2 ( t ) + μ 2 S C I 3 ( t )
(7)
μ 0 + μ 1 S C I 1 ( t ) + μ 2 S C I 2 ( t ) +
(8)
μ 3 S C I 3 ( t )
(1)
exp ( σ 0 + σ 1 S C I 1 ( t ) )
(2)
exp ( σ 0 + σ 1 S C I 2 ( t ) )
(3)
exp ( σ 0 + σ 1 S C I 3 ( t ) )
(4)
exp ( σ 0 + σ 1 S C I 1 ( t ) + σ 2 S C I 2 ( t ) )
(5)
exp ( σ 0 + σ 1 S C I 1 ( t ) + σ 2 S C I 3 ( t ) )
(6)
exp ( σ 0 + σ 1 S C I 2 ( t ) + σ 2 S C I 3 ( t ) )
(7)
exp ( σ 0 + σ 1 S C I 1 ( t ) + σ 2 S C I 2 ( t ) +
(8)
σ 3 S C I 3 ( t ) )
ξ 49
Note. S C I 1 : AMM_SON(−1), S C I 2 : AMO_SON(−1), S C I 3 : NAO_JJA(−1).
Table 6. Model selection result at the Daegu station (No. 143).
Table 6. Model selection result at the Daegu station (No. 143).
Distribution TyperAIC ValueGEV ParametersUncertainty,
Location, μ ( t ) Scale, σ ( t ) Shape, ξ C I Q 100 95 %
NS-GEV(1,0,0) with SCI covariates0 μ 0 + μ 1 S C I 1 ( t ) σ ξ 102.2
NS-GEV(1,0,0) with SCI covariates0.24 μ 0 + μ 1 S C I 2 ( t ) σ ξ 88.8
NS-GEV(1,1,0) with SCI covariates1.69 μ 0 + μ 1 S C I 2 ( t ) exp ( σ 0 + σ 1 S C I 3 ( t ) ) ξ 116.6
ST-GEV1.74 μ σ ξ 109.5
NS-GEV(1,1,0) with SCI covariates1.80 μ 0 + μ 1 S C I 1 ( t ) exp ( σ 0 + σ 1 S C I 2 ( t ) ) ξ 121.3
NS-GEV(1,0,0) with SCI covariates1.81 μ 0 + μ 1 S C I 1 ( t ) + μ 2 S C I 2 ( t ) σ ξ 103.1
NS-GEV(1,1,0) with SCI covariates1.81 μ 0 + μ 1 S C I 2 ( t ) exp ( σ 0 + σ 1 S C I 2 ( t ) ) ξ 109.7
NS-GEV(1,0,0) with SCI covariates1.82 μ 0 + μ 1 S C I 2 ( t ) + μ 2 S C I 3 ( t ) σ ξ 108.2
NS-GEV(1,0,0) with SCI covariates1.95 μ 0 + μ 1 S C I 1 ( t ) + μ 2 S C I 3 ( t ) σ 115.6
NS-GEV(1,1,0) with SCI covariates1.95 μ 0 + μ 1 S C I 1 ( t ) exp ( σ 0 + σ 1 S C I 3 ( t ) ) 124.7
Note. S C I 1 : AMM_SON(−1),   S C I 2 : AMO_SON(−1), S C I 3 : NAO_JJA(−1).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kim, H.; Kim, T.; Shin, J.-Y.; Heo, J.-H. Improvement of Extreme Value Modeling for Extreme Rainfall Using Large-Scale Climate Modes and Considering Model Uncertainty. Water 2022, 14, 478. https://doi.org/10.3390/w14030478

AMA Style

Kim H, Kim T, Shin J-Y, Heo J-H. Improvement of Extreme Value Modeling for Extreme Rainfall Using Large-Scale Climate Modes and Considering Model Uncertainty. Water. 2022; 14(3):478. https://doi.org/10.3390/w14030478

Chicago/Turabian Style

Kim, Hanbeen, Taereem Kim, Ju-Young Shin, and Jun-Haeng Heo. 2022. "Improvement of Extreme Value Modeling for Extreme Rainfall Using Large-Scale Climate Modes and Considering Model Uncertainty" Water 14, no. 3: 478. https://doi.org/10.3390/w14030478

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop