
popular in the literature and is based on the
interexceedances times of overpassing a high
threshold. The intervals estimator motivated the
work of Süveges, who proposed a maximum
likelihood estimator, the so called Kgaps estimator,
[9], which also depends of a K parameter. A detailed
synthesis of the EI estimation options may be found
at the book by [10], or in the recent paper of [11].
Variations of the initial blocks and runs estimators
include the disjoint blocks estimator and the sliding
blocks estimator, [12]. Based on the maxima
method, [13], proposed a semiparametric approach
producing more efficient estimates. Relevant
asymptotic results and a moments EI estimator is
presented in the paper from [14]. To reduce bias,
[15], applies the Jackknife methodology. A key
problem with EI estimation is the choice of the
threshold to be considered, originating the process of
exceedances. Some of the existent estimators are too
sensitive to the choice of that parameter, [16].
Recent research addresses to this problem, proposing
a new way to select the threshold in a context of non
parametric estimation of the extremal index of
stochastic processes, [17]. When academics are
called to apply their results to real data, it is frequent
not to encounter the conditions assumed to validate
the results. In that sense, there is the recognized need
for robust procedures which are not so vulnerable in
the absence of those conditions. The estimator
proposed in [18], is a robust proposal to estimate the
EI.
More recently, an interesting proposal uses artificial
censoring of interexceedances times with some
evidence of stability improvement and less
sensitivity to the choice of the parameter evolved, D,
in this case, and to the choice of the high threshold,
[19]. In the present paper, two different proposals
are considered to estimate the EI: a robust estimator,
based on the Negative Binomial (NB) model, and a
proportion estimator, also based in the
interexceedances times.
Following the introduction, Section 2 refers to
the basic principles of EVT in the case of IID
samples and for specific dependent sequences. In
Section 3 and 4, two recently proposed EI estimators
are presented. Section 5 presents other known
estimators to be considered in the data analysis.
Section 6 contains the results of applying both
robust and non robust estimators to a real data set.
The paper concludes in Section 7 with comments
and conclusions.
2 EVT classic theory
2.1 IID case
The theory of extreme values is highly developed
when dealing with IID samples. In this case, there
exists the well known theorem in EVT that allows us
to obtain the distribution of the limit of the
maximum when appropriately normalized, and
whenever such a limit exists. This theorem provides
a convenient way to understand the behaviour of
extreme values in large samples drawn
independently and identically distributed,
(Y1, Y2, ..., Yn), with some unknown common
cumulative distribution function (CDF), F(y). The
Fisher, Tippett, Gnedenko extremal types theorem
determines the limiting CDF of the maximum as a
member of the general extreme value (EV)
distribution family. If Yn:n=max{Y1, Y2, ..., Yn}
and if there are constants an>0and bn∈Rsuch
that the standardized maximum Z= (Yn:n−bn)/an
converges for some non degenerate distribution,
then, such a limit CDF is of the type of
EVξ(y) =
exp(−(1 + ξy)−1/ξ),1 + ξy > 0,
if ξ= 0
exp(−exp(−y)), y ∈R,
if ξ= 0.
(1)
Several advancements have expanded the
applicability of EVT to dependent sequences, just as
sketched in Section 2.2. This is especially relevant
in the analysis of time series, where dependencies
among observations are typical. In such cases, recent
developments in EVT have adapted to account for
these dependencies, offering insights into the
distribution of extremes and enhancing our
understanding of their behaviour over.
2.2 EVT for dependent structures
When we have an IID sequence, extreme values tend
to be rare events, usually far apart from each other in
time. But when some dependence structure is
present, this affects their behaviour, originating
clusters of high values close to each other.
Dependent observations tend to exhibit similar
patterns. When an extreme value is observed, the
next observation is likely to be close to the previous
one. The presence of autocorrelation induces
proximity between observations, leading to clusters
of high values rather than isolated extremes.
In some sequences, those clusters appear
separated in time. So in limit, the clusters
ocuurrence times tend to be independent. This paper
addresses to the problem of estimating a parameter
that measures the interdependence among sequences
WSEAS TRANSACTIONS on SYSTEMS
DOI: 10.37394/23202.2024.23.25
M. Cristina Miranda,
Manuela Souto De Miranda, M. Ivette Gomes