In probability theory, a sub-Gaussian distribution, the distribution of a sub-Gaussian random variable, is a probability distribution with strong tail decay. More specifically, the tails of a sub-Gaussian distribution are dominated by (i.e. decay at least as fast as) the tails of a Gaussian. This property gives sub-Gaussian distributions their name.
Formally, the probability distribution of a random variable is called sub-Gaussian if there is a positive constant C such that for every ,
- .
Alternatively, a random variable is considered sub-Gaussian if its distribution function is upper bounded (up to a constant) by the distribution function of a Gaussian. Specifically, we say that is sub-Gaussian if for all we have that:
where is constant and is a mean zero Gaussian random variable.[1]: Theorem 2.6
Definitions
The sub-Gaussian norm of , denoted as , is defined by
which is the Orlicz norm of generated by the Orlicz function By condition below, sub-Gaussian random variables can be characterized as those random variables with finite sub-Gaussian norm.
Sub-Gaussian properties
Let be a random variable. The following conditions are equivalent:
- for all , where is a positive constant;
- , where is a positive constant;
- for all , where is a positive constant.
Proof. By the layer cake representation,
After a change of variables , we find that
Using the Taylor series for :
we obtain that
which is less than or equal to for . Take , then
More equivalent definitions
The following properties are equivalent:
- The distribution of is sub-Gaussian.
- Laplace transform condition: for some B, b > 0, holds for all .
- Moment condition: for some K > 0, for all .
- Moment generating function condition: for some , for all such that . [2]
- Union bound condition: for some c > 0, for all n > c, where are i.i.d copies of X.
Examples
A standard normal random variable is a sub-Gaussian random variable.
Let be a random variable with symmetric Bernoulli distribution (or Rademacher distribution). That is, takes values and with probabilities each. Since , it follows that
and hence is a sub-Gaussian random variable.
Maximum of Sub-Gaussian Random Variables
Consider a finite collection of subgaussian random variables, X1, ..., Xn, with corresponding subgaussian parameters . The random variable Mn = max(X1, ..., Xn) represents the maximum of this collection. The expectation can be bounded above by . Note that no independence assumption is needed to form this bound.[1]
See also
Notes
- 1 2 Wainwright MJ. High-Dimensional Statistics: A Non-Asymptotic Viewpoint. Cambridge: Cambridge University Press; 2019. doi:10.1017/9781108627771, ISBN 9781108627771.
- ↑ Vershynin, R. (2018). High-dimensional probability: An introduction with applications in data science. Cambridge: Cambridge University Press. pp. 33–34.
References
- Kahane, J.P. (1960). "Propriétés locales des fonctions à séries de Fourier aléatoires". Studia Mathematica. 19: 1–25. doi:10.4064/sm-19-1-1-25.
- Buldygin, V.V.; Kozachenko, Yu.V. (1980). "Sub-Gaussian random variables". Ukrainian Mathematical Journal. 32 (6): 483–489. doi:10.1007/BF01087176.
- Ledoux, Michel; Talagrand, Michel (1991). Probability in Banach Spaces. Springer-Verlag.
- Stromberg, K.R. (1994). Probability for Analysts. Chapman & Hall/CRC.
- Litvak, A.E.; Pajor, A.; Rudelson, M.; Tomczak-Jaegermann, N. (2005). "Smallest singular value of random matrices and geometry of random polytopes" (PDF). Advances in Mathematics. 195 (2): 491–523. doi:10.1016/j.aim.2004.08.004.
- Rudelson, Mark; Vershynin, Roman (2010). "Non-asymptotic theory of random matrices: extreme singular values". Proceedings of the International Congress of Mathematicians 2010. pp. 1576–1602. arXiv:1003.2990. doi:10.1142/9789814324359_0111.
- Rivasplata, O. (2012). "Subgaussian random variables: An expository note" (PDF). Unpublished.
- Vershynin, R. (2018). "High-dimensional probability: An introduction with applications in data science" (PDF). Volume 47 of Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge.
- Zajkowskim, K. (2020). "On norms in some class of exponential type Orlicz spaces of random variables". Positivity. An International Mathematics Journal Devoted to Theory and Applications of Positivity. 24(5): 1231--1240. arXiv:1709.02970. doi.org/10.1007/s11117-019-00729-6.