next up previous
Next: Methods Up: Introduction Previous: Fuzzy Logic

Measures of Skill

Performance evaluation of meteorological algorithms is commonly done using the contingency table, which is a 2$\times$2 table defined as:
\begin{displaymath}
\left\{
\begin{array}{rr}
a & b\\
c & d
\end{array}\right\}
\end{displaymath} (2)

where $d$ is the number of correctly detected (or forecast) events (often referred to as ``hits''), $b$ the number of false alarms and $c$ the number of events not detected (``misses''). The number of correctly classified non-events, $a$, is not clearly defined in the analysis presented here. We use, as our performance measure, the critical success index (CSI) which is defined on the above table as  [4,19]:
\begin{displaymath}
CSI = \frac{d}{b+c+d}
\end{displaymath} (3)

The CSI is zero for an algorithm that possesses no skill and is equal to one for an algorithm that performs flawlessly. At opposite ends of the performance spectrum of meteorological algorithms are the original WSR-88D tornadic vortex signature detection algorithm, with a CSI of 0.02 [14], and the enhanced WSR-88D hail detection algorithm, with a CSI of 0.49 [20].

It is apparent that Equation 3 hinges on a binary classification - either the event occurs or it does not. Yet, marginal events are often observed. There are three ways of dealing with this when designing algorithms:

We decided to use a three-level classification, i.e., regions are BWERs, marginal BWERs or non-BWERs. Scoring the skill of such a classifier can be done based on the $3\times3$ table:

\begin{displaymath}
\left\{
\begin{array}{rrr}
a & b & c\\
d & e & f\\
g & h & i\\
\end{array}\right\}
\end{displaymath} (4)

in which rows represent the truth values 0, 0.5 and 1. The columns represent the output of the scheme. Thus, $h$ would represent BWERs (true value = 1) that were misidentified as marginal BWERs (output = 0.5) by the scheme.

We can then describe a modified CSI based on the $3\times3$ table as:

\begin{displaymath}
CSI_{mod} = \frac{i+e}{c+g+\frac{b+d+h+f}{2}+i+e}
\end{displaymath} (5)

where the ``misses'' and ``false alarms'' are weighted by the degree to which they are truly ``misses'' and ``false alarms''. The CSI of Equation 3 (where marginal BWERs are counted neither when detected nor when missed) is given in terms of the $3\times3$ table by:
\begin{displaymath}
CSI = \frac{i}{c+g+i}
\end{displaymath} (6)

The final results in this paper are presented using both the traditional measure in Equation 6 and the truer measure given by Equation 5.


next up previous
Next: Methods Up: Introduction Previous: Fuzzy Logic
Lakshman : lakshman@nssl.noaa.gov