Skew normal distribution

In probability theory and statistics, the skew normal distribution is a continuous probability distribution that generalises the normal distribution to allow for non-zero skewness.

Definition

Let $\phi (x)$ denote the standard normal probability density function

\phi (x)={\frac {1}{\sqrt {2\pi }}}e^{-{\frac {x^{2}}{2}}}

with the cumulative distribution function given by

\Phi (x)=\int _{-\infty }^{x}\phi (t)\ dt={\frac {1}{2}}\left[1+\operatorname {erf} \left({\frac {x}{\sqrt {2}}}\right)\right],

where "erf" is the error function. Then the probability density function (pdf) of the skew-normal distribution with parameter $\alpha$ is given by

f(x)=2\phi (x)\Phi (\alpha x).\,

This distribution was first introduced by O'Hagan and Leonard (1976). Approximations to this distribution that are easier to manipulate mathematically have been given by Ashour and Abdel-Hamid (2010) and by Mudholkar and Hutson (2000).

A stochastic process that underpins the distribution was described by Andel, Netuka and Zvara (1984).[1] Both the distribution and its stochastic process underpinnings were consequences of the symmetry argument developed in Chan and Tong (1986), which applies to multivariate cases beyond normality, e.g. the skew multivariate t distribution and others. The distribution is a particular case of a general class of distributions with probability density functions of the form $f(x)=2\phi (x)\Phi (\alpha x)$ , where $\phi (\cdot )$ is any PDF symmetric about zero and $\Phi (\cdot )$ is any CDF whose PDF is symmetric about zero.[2]

To add location and scale parameters to this, one makes the usual transform $x\rightarrow {\frac {x-\xi }{\omega }}$ . One can verify that the normal distribution is recovered when $\alpha =0$ , and that the absolute value of the skewness increases as the absolute value of $\alpha$ increases. The distribution is right skewed if $\alpha >0$ and is left skewed if $\alpha <0$ . The probability density function with location $\xi$ , scale $\omega$ , and parameter $\alpha$ becomes

f(x)={\frac {2}{\omega }}\phi \left({\frac {x-\xi }{\omega }}\right)\Phi \left(\alpha \left({\frac {x-\xi }{\omega }}\right)\right).\,
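The density above can be evaluated directly from its definition. The following is a minimal sketch using only the Python standard library; the function names and the trapezoidal check are illustrative choices, not part of any particular library.

```python
import math


def std_normal_pdf(x: float) -> float:
    """Standard normal density phi(x)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def std_normal_cdf(x: float) -> float:
    """Standard normal CDF Phi(x), written with the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def skew_normal_pdf(x: float, xi: float = 0.0, omega: float = 1.0,
                    alpha: float = 0.0) -> float:
    """Density (2/omega) * phi(z) * Phi(alpha * z) with z = (x - xi) / omega."""
    if omega <= 0.0:
        raise ValueError("omega must be positive")
    z = (x - xi) / omega
    return 2.0 / omega * std_normal_pdf(z) * std_normal_cdf(alpha * z)


def integrate_pdf(xi: float, omega: float, alpha: float,
                  half_width: float = 12.0, n: int = 20000) -> float:
    """Trapezoidal rule over xi +/- half_width * omega (illustrative sanity check)."""
    lo = xi - half_width * omega
    hi = xi + half_width * omega
    h = (hi - lo) / n
    total = 0.5 * (skew_normal_pdf(lo, xi, omega, alpha)
                   + skew_normal_pdf(hi, xi, omega, alpha))
    for k in range(1, n):
        total += skew_normal_pdf(lo + k * h, xi, omega, alpha)
    return total * h
```

With `alpha=0` the function reduces to the ordinary normal density, and `integrate_pdf` should return a value very close to 1 for any admissible parameters.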

Note, however, that the skewness ( $\gamma _{1}$ ) of the distribution is limited to a range slightly narrower than the interval $(-1,1)$ : its absolute value cannot exceed approximately 0.9953.

As has been shown,[3] the mode (maximum) of the distribution is unique. For general $\alpha$ there is no analytic expression for the mode $m_{o}$ of the standardized distribution ( $\xi =0$ , $\omega =1$ ), but a quite accurate (numerical) approximation is:

m_{o}(\alpha )\approx \mu _{z}-{\frac {\gamma _{1}\sigma _{z}}{2}}-{\frac {\mathrm {sgn} (\alpha )}{2}}\exp \left(-{\frac {2\pi }{|\alpha |}}\right)

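where $\mu _{z}={\sqrt {\frac {2}{\pi }}}\delta$ , $\sigma _{z}={\sqrt {1-\mu _{z}^{2}}}$ , $\delta ={\frac {\alpha }{\sqrt {1+\alpha ^{2}}}}$ , and $\gamma _{1}$ is the skewness of the distribution.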
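As a rough illustration, the approximation for the standardized mode can be coded and compared against a brute-force search for the maximum of $2\phi (x)\Phi (\alpha x)$. This is a sketch under stated assumptions: the helper names and the grid search are hypothetical, the case $\alpha =0$ is handled separately (the mode is then 0), and the skewness is computed as $\gamma _{1}={\frac {4-\pi }{2}}\mu _{z}^{3}/\sigma _{z}^{3}$.

```python
import math


def mode_approx(alpha: float) -> float:
    """Approximate mode m_o(alpha) of the standardized skew normal."""
    if alpha == 0.0:
        return 0.0  # the normal case; also avoids dividing by |alpha| = 0
    delta = alpha / math.sqrt(1.0 + alpha * alpha)
    mu_z = math.sqrt(2.0 / math.pi) * delta
    sigma_z = math.sqrt(1.0 - mu_z * mu_z)
    gamma_1 = (4.0 - math.pi) / 2.0 * mu_z ** 3 / sigma_z ** 3
    sgn = math.copysign(1.0, alpha)
    return (mu_z - gamma_1 * sigma_z / 2.0
            - sgn / 2.0 * math.exp(-2.0 * math.pi / abs(alpha)))


def mode_grid(alpha: float, lo: float = -3.0, hi: float = 3.0,
              n: int = 60001) -> float:
    """Brute-force argmax of 2*phi(x)*Phi(alpha*x) on an evenly spaced grid."""
    def pdf(x: float) -> float:
        phi = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
        big_phi = 0.5 * (1.0 + math.erf(alpha * x / math.sqrt(2.0)))
        return 2.0 * phi * big_phi

    step = (hi - lo) / (n - 1)
    best_x, best_f = lo, pdf(lo)
    for k in range(1, n):
        x = lo + k * step
        f = pdf(x)
        if f > best_f:
            best_x, best_f = x, f
    return best_x


if __name__ == "__main__":
    for a in (0.5, 1.0, 2.0, 5.0, -3.0):
        print(a, mode_approx(a), mode_grid(a))
```

The grid search is only a cross-check; for a given location and scale the mode is $\xi +\omega m_{o}(\alpha )$.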

Estimation

Maximum likelihood estimates for $\xi$ , $\omega$ , and $\alpha$ can be computed numerically, but no closed-form expression for the estimates is available unless $\alpha =0$ . If a closed-form expression is needed, the method of moments can be applied to estimate $\alpha$ from the sample skew, by inverting the skewness equation. This yields the estimate

|\delta |={\sqrt {{\frac {\pi }{2}}{\frac {|{\hat {\gamma }}_{1}|^{\frac {2}{3}}}{|{\hat {\gamma }}_{1}|^{\frac {2}{3}}+((4-\pi )/2)^{\frac {2}{3}}}}}}

where $\delta ={\frac {\alpha }{\sqrt {1+\alpha ^{2}}}}$ , and ${\hat {\gamma }}_{1}$ is the sample skew. The sign of $\delta$ is the same as the sign of ${\hat {\gamma }}_{1}$ . Consequently, ${\hat {\alpha }}=\delta /{\sqrt {1-\delta ^{2}}}$ .

The maximum (theoretical) skewness is obtained by setting ${\delta =1}$ in the skewness equation, giving $\gamma _{1}\approx 0.9952717$ . However, it is possible that the sample skewness is larger, and then $\alpha$ cannot be determined from these equations. When using the method of moments in an automatic fashion, for example to give starting values for maximum likelihood iteration, one should therefore let (for example) $|{\hat {\gamma }}_{1}|=\min(0.99,|(1/n)\sum {((x_{i}-{\bar {x}})/s)^{3}}|)$ , where ${\bar {x}}$ and $s$ are the sample mean and sample standard deviation.
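The method-of-moments recipe, including the cap at 0.99, might be sketched as follows. Several details are assumptions rather than part of the text above: the function names are hypothetical, $s$ is taken to be the population (divide-by-$n$) standard deviation, and the estimates of $\omega$ and $\xi$ are obtained by matching the variance $\omega ^{2}(1-2\delta ^{2}/\pi )$ and the mean $\xi +\omega \delta {\sqrt {2/\pi }}$ of the distribution to their sample counterparts.

```python
import math

MAX_ABS_SKEW = 0.99  # cap from the text, just below the theoretical maximum


def skewness_from_delta(delta: float) -> float:
    """Theoretical skewness gamma_1 as a function of delta."""
    mu_z = delta * math.sqrt(2.0 / math.pi)
    return (4.0 - math.pi) / 2.0 * mu_z ** 3 / (1.0 - mu_z * mu_z) ** 1.5


def delta_from_skewness(g1: float) -> float:
    """Invert the skewness equation; |g1| is clipped to MAX_ABS_SKEW first."""
    g = min(abs(g1), MAX_ABS_SKEW)
    r = g ** (2.0 / 3.0)
    c = ((4.0 - math.pi) / 2.0) ** (2.0 / 3.0)
    abs_delta = math.sqrt(math.pi / 2.0 * r / (r + c))
    return math.copysign(abs_delta, g1)  # delta takes the sign of the skewness


def method_of_moments(xs):
    """Return (xi, omega, alpha) estimates from a sequence of observations."""
    n = len(xs)
    mean = sum(xs) / n
    s = math.sqrt(sum((x - mean) ** 2 for x in xs) / n)  # assumed 1/n convention
    g1 = sum(((x - mean) / s) ** 3 for x in xs) / n
    delta = delta_from_skewness(g1)
    alpha = delta / math.sqrt(1.0 - delta * delta)
    omega = s / math.sqrt(1.0 - 2.0 * delta * delta / math.pi)
    xi = mean - omega * delta * math.sqrt(2.0 / math.pi)
    return xi, omega, alpha


if __name__ == "__main__":
    # Hypothetical right-skewed data, used only to exercise the functions.
    print(method_of_moments([0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 5.0]))
```

Such estimates are intended mainly as starting values for a numerical maximum likelihood fit; when the cap is active, the resulting $\alpha$ only reflects the chosen bound, not the data.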

Concern has been expressed about the impact of skew normal methods on the reliability of inferences based upon them.[4]

Related distributions

The exponentially modified normal distribution is another 3-parameter distribution that is a generalization of the normal distribution to skewed cases. The skew normal still has a normal-like tail in the direction of the skew, with a shorter tail in the other direction; that is, its density is asymptotically proportional to $e^{-kx^{2}}$ for some positive $k$ . Thus, in terms of the seven states of randomness, it shows "proper mild randomness". In contrast, the exponentially modified normal has an exponential tail in the direction of the skew; its density is asymptotically proportional to $e^{-k|x|}$ . In the same terms, it shows "borderline mild randomness".

Thus, the skew normal is useful for modeling skewed distributions which nevertheless have no more outliers than the normal, while the exponentially modified normal is useful for cases with an increased incidence of outliers in (just) one direction.

References

[1] Andel, J., Netuka, I. and Zvara, K. (1984). On threshold autoregressive processes. Kybernetika, 20, 89–106.
[2] Azzalini, A. (1985). "A class of distributions which includes the normal ones". Scandinavian Journal of Statistics. 12: 171–178.
[3] Azzalini, Adelchi; Capitanio, Antonella (2014). The skew-normal and related families. pp. 32–33. ISBN 978-1-107-02927-9.
[4] Pewsey, Arthur (2000). "Problems of inference for Azzalini's skewnormal distribution". Journal of Applied Statistics. 27 (7): 859–870.

Ashour, S., and Abdel-Hamid, M. (2010). Approximate skew normal distribution. Journal of Advanced Research, 1, 341–350.
Chan, K-S. and Tong, H. (1986). A note on certain integral equations associated with non-linear time series analysis. Probability Theory and Related Fields, 73, 153–158.
O'Hagan, A. and Leonard, T. (1976). Bayes estimation subject to uncertainty about parameter constraints. Biometrika, 63, 201–202.
Mudholkar, G. S. and Hutson, A. D. (2000). The epsilon-skew-normal distribution for analyzing near-normal data. Journal of Statistical Planning and Inference, 83, 291–309.

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

Skew Normal
Probability density function
Cumulative distribution function
Parameters	$\xi$ location (real); $\omega$ scale (positive, real); $\alpha$ shape (real)
Support	$x\in (-\infty ,+\infty )$
PDF	${\frac {2}{\omega {\sqrt {2\pi }}}}e^{-{\frac {(x-\xi )^{2}}{2\omega ^{2}}}}\int _{-\infty }^{\alpha \left({\frac {x-\xi }{\omega }}\right)}{\frac {1}{\sqrt {2\pi }}}e^{-{\frac {t^{2}}{2}}}\ dt$
CDF	$\Phi \left({\frac {x-\xi }{\omega }}\right)-2T\left({\frac {x-\xi }{\omega }},\alpha \right)$ , where $T(h,a)$ is Owen's T function
Mean	$\xi +\omega \delta {\sqrt {\frac {2}{\pi }}}$ where $\delta ={\frac {\alpha }{\sqrt {1+\alpha ^{2}}}}$
Mode	$\xi +\omega m_{o}(\alpha )$
Variance	$\omega ^{2}\left(1-{\frac {2\delta ^{2}}{\pi }}\right)$
Skewness	$\gamma _{1}={\frac {4-\pi }{2}}{\frac {\left(\delta {\sqrt {2/\pi }}\right)^{3}}{\left(1-2\delta ^{2}/\pi \right)^{3/2}}}$
Ex. kurtosis	$2(\pi -3){\frac {\left(\delta {\sqrt {2/\pi }}\right)^{4}}{\left(1-2\delta ^{2}/\pi \right)^{2}}}$
MGF	$M_{X}\left(t\right)=2\exp \left(\xi t+{\frac {\omega ^{2}t^{2}}{2}}\right)\Phi \left(\omega \delta t\right)$
CF	$e^{it\xi -t^{2}\omega ^{2}/2}\left(1+i\,{\textrm {Erfi}}\left({\frac {\delta \omega t}{\sqrt {2}}}\right)\right)$
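The CDF entry in the table above can be illustrated numerically. The sketch below assumes the standard integral definition of Owen's T function, $T(h,a)={\frac {1}{2\pi }}\int _{0}^{a}{\frac {e^{-h^{2}(1+x^{2})/2}}{1+x^{2}}}\,dx$ , and evaluates it with composite Simpson's rule; the function names and the number of subintervals are illustrative choices, and a dedicated library routine would normally be preferable.

```python
import math


def std_normal_cdf(x: float) -> float:
    """Standard normal CDF Phi(x) via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def owens_t(h: float, a: float, n: int = 2000) -> float:
    """Owen's T(h, a) by composite Simpson's rule (n must be even)."""
    if a == 0.0:
        return 0.0

    def g(x: float) -> float:
        return math.exp(-0.5 * h * h * (1.0 + x * x)) / (1.0 + x * x)

    step = a / n  # a negative a gives a negative step, so T(h, -a) = -T(h, a)
    total = g(0.0) + g(a)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * g(k * step)
    return total * step / 3.0 / (2.0 * math.pi)


def skew_normal_cdf(x: float, xi: float = 0.0, omega: float = 1.0,
                    alpha: float = 0.0) -> float:
    """CDF Phi(z) - 2*T(z, alpha) with z = (x - xi) / omega."""
    if omega <= 0.0:
        raise ValueError("omega must be positive")
    z = (x - xi) / omega
    return std_normal_cdf(z) - 2.0 * owens_t(z, alpha)


if __name__ == "__main__":
    # At z = 0 the formula reduces to 1/2 - arctan(alpha)/pi.
    print(skew_normal_cdf(0.0, alpha=1.0))
```

For $\alpha =0$ the Owen's T term vanishes and the function returns the ordinary normal CDF.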