Tukey g-and-h Parametric Distribution

The Tukey g-and-h (TGH) distribution is a flexible family of parametric distributions that can accommodate a wide range of skewness and kurtosis values, making it useful for modeling non-normal data.

Usage

tukeyGH(
  x,
  type = "d",
  location = 0,
  scale = 1,
  g = 0,
  h = 0,
  log = FALSE,
  log.p = FALSE,
  n = NULL
)

Arguments

x: A numeric vector or a single value, depending on the function being called.
type: A character string specifying the function to be called (d for density, p for cumulative distribution, q for quantile, or r for random number generation).
location: The location parameter of the TGH distribution.
scale: The scale parameter of the TGH distribution.
g: The skewness parameter of the TGH distribution.
h: The kurtosis parameter of the TGH distribution.
log: If TRUE, the log-density is returned for the density function. Default is FALSE.
log.p: If TRUE, the log-probability is returned for the cumulative distribution and quantile functions. Default is FALSE.
n: For random number generation, the number of random values to be generated. Default is NULL.

Value

A numeric vector.

Details

The Tukey g-and-h distribution, introduced by John W. Tukey in 1977, is defined by two transformation functions, g and h, which are applied to a standard normal distribution. The g transformation controls the skewness of the distribution, while the h transformation controls the kurtosis (or heaviness of the tails). The TGH distribution is given by:

$$T_{g,h}(Z) = \frac{1}{g} (e^{g Z} -1) e^{\frac{1}{2} h Z^2}$$

Where $Z$ is a random variable with standard normal distribution. The parameters $g$ and $h$ stand for the bias and elongation of the tails, respectively, of Tukey’s $g$-and-$h$ distribution.

References

Tukey, J.W., (1977). Modern techniques in data analysis. NSF‐sponsored regional research conference at Southeastern Massachusetts University, North Dartmouth, MA.
Martinez, J., Iglewicz, B., (1984). Some properties of the Tukey g and h family of distributions. Communications in Statistics: Theory and Methods, 13(3):353–369.
Hoaglin, D.C., (1985). Summarizing shape numerically: The g-and-h distributions. In: Hoaglin, D.C., Mosteller, F., Tukey, J.W., (eds), Data Analysis for Tables, Trends, and Shapes. New York:Wiley.

Author

Christian L. Goueguel

Examples

x <- seq(-5, 5, length.out = 100)
y1 <- tukeyGH(x, type = "d", location = 0, scale = 1, g = 0, h = 0)
y2 <- tukeyGH(x, type = "d", location = 0, scale = 1, g = 0.5, h = 0)
y3 <- tukeyGH(x, type = "d", location = 0, scale = 1, g = -0.5, h = 0)
y4 <- tukeyGH(x, type = "d", location = 0, scale = 1, g = 0, h = 0.1)
y5 <- tukeyGH(x, type = "d", location = 0, scale = 1, g = 0.5, h = 0.1)
y6 <- tukeyGH(x, type = "d", location = 0, scale = 1, g = -0.5, h = 0.1)
plot(x, y1, type = "b", col = "red", pch = 16,
main = "Tukey g-and-h distribution", ylim = c(0, 0.6))
lines(x, y2, type = "b", col = "blue", pch = 15)
lines(x, y3, type = "b", col = "green", pch = 17)
lines(x, y4, type = "b", col = "black", pch = 6)
lines(x, y5, type = "b", col = "gold", pch = 18)
lines(x, y6, type = "b", col = "orange", pch = 8)
legend(
"topright",
legend = c(
"g = 0, h = 0", "g = 0.5, h = 0", "g = -0.5, h = 0",
"g = 0, h = 0.1", "g = 0.5, h = 0.1", "g = -0.5, h = 0.1"),
col = c("red", "blue", "green", "black", "gold", "orange"),
lty = 1
)