The McDonald exponentiated gamma distribution and its statistical properties

Al-Babtain, Abdulhakim A; Merovci, Faton; Elbatal, Ibrahim

doi:10.1186/2193-1801-4-2

Research
Open access
Published: 12 February 2015

The McDonald exponentiated gamma distribution and its statistical properties

Abdulhakim A Al-Babtain¹,
Faton Merovci² &
Ibrahim Elbatal³

SpringerPlus volume 4, Article number: 2 (2015) Cite this article

2425 Accesses
2 Citations
Metrics details

Abstract

In this paper, we propose a five-parameter lifetime model called the McDonald exponentiated gamma distribution to extend beta exponentiated gamma, Kumaraswamy exponentiated gamma and exponentiated gamma, among several other models. We provide a comprehensive mathematical treatment of this distribution. We derive the moment generating function and the rth moment. We discuss estimation of the parameters by maximum likelihood and provide the information matrix.

AMS Subject Classification

Primary 62N05; secondary 90B25

1 Introduction

The gamma distribution is the most popular model for analyzing skewed data and hydrological processes. One of the important families of distributions in lifetime tests is the exponentiated gamma (EG) distribution. The exponentiated gamma (EG) distribution has been introduced by Gupta et al. 1998 which has cumulative distribution function (c.d.f.) and a probability density function (p.d.f.) of the form, respectively;

(1)

where λ and θ are scale and shape parameters respectively. The corresponding probability density function (pdf) is given by

(2)

Shawky and Bakoban 2008 discussed the exponentiated gamma distribution as an important model of life time models and derived Bayesian and non-Bayesian estimators of the shape parameter, reliability and failure rate functions in the case of complete and type-II censored samples. Also order statistics from exponentiated gamma distribution and associated inference was discussed by Shawky and Bakoban 2009. Ghanizadeh, et al. 2011, dealt with the estimation of parameters of the exponentiated gamma (EG) distribution with presence of k outliers. The maximum likelihood and moment estimators were derived. These estimators are compared empirically using Monte Carlo simulation. Singh et al. 2011b proposed Bayes estimators of the parameter of the exponentiated gamma distribution and associated reliability function under general entropy loss function for a censored sample. The proposed estimators were compared with the corresponding Bayes estimators obtained under squared error loss function and maximum likelihood estimators through their simulated risks. Khan and Kumar 2011established the explicit expressions and some recurrence relations for single and product moments of lower generalized order statistics from exponentiated gamma distribution. Sing et al. 2011a where proposed Bayes estimators of the parameter of the exponentiated gamma distribution and associated reliability function under general entropy loss function for a censored sample. Feroze ans Aslam 2012 introduced Bayesian analysis of exponentiated gamma distribution under type II censored samples. Recently, Nasiri et al. 2013 discussed Classical and Bayesian estimation of parameters on the generalized exponentiated gamma distribution.

2 Mc-Donald generalized distribution

Consider an arbitrary parent cdf G(x). The probability density function (pdf) f(x) of the new class of distributions called the Mc-Donald generalized distributions (denoted with the prefix "Mc" for short) is defined by

(3)

where a > 0,b > 0 and c > 0 are additional shape parameters. (See Corderio et al. (2012) for additional details). Note that g(x) is the pdf of parent distribution, . Introduction of this additional shape parameters is specially to introduce skewness. Also, this allows us to vary tail weight. It is important to note that for c = 1 we obtain a sub-model of this generalization which is a beta generalization (see Eugene et al. 2002) and for a = 1, we have the Kumaraswamy (Kw), [Kumaraswamy generalized distributions (see Cordeiro and Castro 2011)). For random variable X with density function (2), we write X ∼ Mc-G. The probability density function (3) will be most tractable when G(x) and g(x) have simple analytic expressions. The corresponding cumulative function for this generalization is given by

(4)

where denotes the incomplete beta function ratio (Gradshteyn and Ryzhik 2000). Equation (4) can also be rewritten as follows

(5)

where

is the well-known hypergeometric functions which are well established in the literature (see, Gradshteyn and Ryzhik 2000). Some mathematical properties of the cdf F(x) for any Mc-G distribution defined from a parent G(x) in Equation 5, could, in principle, follow from the properties of the hypergeometric function, which are well established in the literature (Gradshteyn and Ryzhik 2000 Sec. 9.1). One important benefit of this class is its ability to skewed data that cannot properly be fitted by many other existing distributions. Mc- G family of densities allows for higher levels of flexibility of its tails and has a lot of applications in various fields including economics, finance, reliability, engineering, biology and medicine.

The hazard function (hf) and reverse hazard functions (rhf) of the Mc-G distribution are given by

(6)

and

respectively. Recently Cordeiro et al. 2012 presented results on the McDonald normal distribution. Cordeiro et al. 2012 proposed McDonald Weibull distribution, Merovci and Elbatal 2013 proposed McDonald modified Weibull distribution, Elbatal et al. 2014 proposed McDonald generalized linear failure rate Distribution, Elbatal and Merovci 2014 introduced McDonald Pareto distribution and Marciano et al. 2012 obtained the statistical properties of the Mc - Γ and applied the model to reliability data. In this paper we introduce a new class of distribution, called McDonald exponentiated gamma (McEG) distribution which extends the exponentiated gamma model and has several other models as special cases. since it has more shape parameters, yielding a large variety of forms. It can also be useful for testing the goodness of fit of its sub-models.

The outline of this paper is as follows. In Section 2, the McDonald exponentiated gamma (MceG) and related family distributions are introduced. The series expansion for the density, hazard and reverse hazard functions, and other properties are presented in Section 3. Section 4 provides expansions for the cumulative and density functions. In Section 5, we present the statistical properties, in particular moments, moment generating function. The distribution of the order statistics is expressed in Section 6. Section 7 provides least squares and weighted least squares estimators. Maximum likelihood estimates of the parameters index to the distribution are discussed in Section 8. Section 9 provides applications to real data sets. Section 10 ends with some conclusions.

3 McDonald exponentiated gamma distribution

In this section we studied the five parameter McDonald exponentiated gamma (McEG) distribution. Using G(x) and g(x) in (3) to be the cdf and pdf of (1) and (2). The pdf of the McEG distribution is given by

(7)

where x > 0 and φ = (λ,θ,a,b,c). The corresponding cdf of the McEG distribution is given by

(8)

also, the cdf can be written as follows

(9)

where

Figures 1 and 2 illustrates some of the possible shapes of the pdf and cdf of the McEG distribution for selected values of the parameters λ,θ,a,b and c, respectively.

The hazard rate function and reversed hazard rate function of the new distribution are given by

(10)

and

(11)

respectively.

Figures 3 and 4 illustrates some of the possible shapes of the hazard and reversed hazard of the McEG distribution for selected values of the parameters λ,θ,a,b and c, respectively.

4 Expansions for the cumulative and density functions

In this section,we present a series expansion of the McEG cdf and pdf. distribution depending if the parameter b > 0 is real non- integer or integer. First, if |z| < 1 and b > 0 is real non- integer, we have in this subsection, we present some representations of cdf and pdf of (McEG) Equations 7 and (8) are straightforward to compute using any software with algebraic facilities. The mathematical relation given below will be useful in this subsection. If b is a positive real non integer and |z| ≤ 1,then

(12)

Using the expansion (12) in (8), the cdf of the McEG distribution becomes

(13)

If b > 0 is an integer, then

(14)

Similarly, if b > 0 is real non- integer the pdf is given by

and

(15)

for b > 0 is an integer. Where are constants such that and G(x,λ,θc(a + j)) is a finite mixture of exponentiated gamma distribution with λ and θc(a + j) are scale and shape parameters respectively. The graphs below are the pdf, cdf, survival function, h(x), and τ(x) of the McEG distribution for different values of parameters λ,θ,a,b and c.

5 Statistical properties

This section is devoted to studying statistical properties of the (McEG) distribution, specifically quantile function, moments and moment generating function

5.1 Quantile function and simulation

The quantile function corresponding to (7) is F(x_q) = P(X ≤ x_q) where (x_q)_(McEG) = F^-1(u),is given by the following relation

(16)

Simulating the McEG random variable is straightforward. Let U be a uniform variate on the unit interval (0, 1). Thus, by means of the inverse transformation method, we consider the random variable X given by the relation

(17)

5.2 Moments

In this subsection we discuss the r_th moment for (McEG) distribution. Moments are necessary and important in any statistical analysis, especially in applications. It can be used to study the most important features and characteristics of a distribution (e.g., tendency, dispersion, skewness and kurtosis). We use the results presented earlier, which was obtained by expanding the pdf.

Theorem 3.1. If X has McEG(φ,x),φ = (λ,θ,a,b,c) then the r_th moment of X is given by the following

(18)

where

Proof. Let X be a random variable with density function (7). The r_th ordinary moment of the (McEG) distribution is given by

(19)

Using the fact that

(20)

we obtain

(21)

again using the binomial series expansion

(22)

but

(23)

thus Equation 21 becomes

let λ(k + 1)x = t then

(24)

which completes the proof. Based on the first four moments of the (McEG) distribution, the measures of skewness A(φ) and kurtosis k(φ) of the (McEG) distribution can obtained as

(25)

and

(26)

5.3 Moment generating function

In this subsection we derived the moment generating function of (McEG) distribution.

Theorem 3.2. If X has (McEG) distribution, then the moment generating function M_X(t) has the following form

(27)

Proof. We start with the well known definition of the moment generating function given by

(28)

let x(λ(k + 1)-t) = z then

which completes the proof.

6 Conditional moments, residual life and reversed failure rate function

For lifetime models, it is also of interest to find the conditional moments and the mean residual lifetime function. The conditional moments for (McEG) distribution is given by

(29)

using (20), (22) and (23), Equation 29 becomes

(30)

where

Given that a component survives up to time t ≥ 0, the residual life is the period beyond t until the time of failure and defined by the conditional random variable X - t|X > t. In reliability, it is well known that the mean residual life function and ratio of two consecutive moments of residual life determine the distribution uniquely (Gupta and Gupta, 1983). Therefore, we obtain the r^th-order moment of the residual life via the general formula

Applying the binomial expansion of (x-t)^r and substituting f(x,φ) given by (7) into the above formula gives

(31)

where is the upper incomplete gamma function. Also the mean residual life of the McEG distribution is given by

(32)

On the other hand, we analogously discuss the reversed residual life and some of its properties. The reversed residual life can be defined as the conditional random variable t - X|X ≤ t which denotes the time elapsed from the failure of a component given that its life is less than or equal to t. This random variable may also be called the inactivity time (or time since failure); for more details you may see (Kundu and Nanda, 2010; Nanda, Singh, Misra, and Paul, 2003). Also, in reliability, the mean reversed residual life and ratio of two consecutive moments of reversed residual life characterize the distribution uniquely. the reversed failure (or reversed hazard) rate function is given by Equation 11. The r^th-order moment of the reversed residual life can be obtained by the well known formula

(33)

Applying the binomial expansion of (t-x)^r and substituting f(x,φ) given by (2.1) into the above formula gives

(34)

where is the lower incomplete gamma function. Thus the mean of the reversed residual life of the McEG distribution is given by

Using m(t)and m₂(t) we obtain the variance of the reversed residual life of the McEG distribution, and hence the coefficient of variation of the reversed residual life of the McEG distribution can be easily obtained.

7 Distribution of the order statistics

In this section, we derive closed form expressions for the pdfs of the r_th order statistic of the (McEG) distribution, also, the measures of skewness and kurtosis of the distribution of the r_th order statistic in a sample of size n for different choices of n;r are presented in this section. Let X₁,X₂,…,X_n be a simple random sample from (McEG) distribution with pdf and cdf given by (7) and (9), respectively.

Let X₁,X₂,…,X_n denote the order statistics obtained from this sample. We now give the probability density function of X_r:n, say f_r:n(x,φ) and the moments of X_r:n, r = 1,2,…,n. Therefore, the measures of skewness and kurtosis of the distribution of the X_r:n are presented. The probability density function of X_r:n is given by

(35)

where F(x,φ) and f(x,φ) are the cdf and pdf of the (McEG) distribution given by (7), (8), respectively, and since 0<F(x,φ) < 1, for x > 0, by using the binomial series expansion of [1 - F(x,φ)]^n-r, given by

(36)

we have

(37)

substituting from (7) and (8) into (37), we can express the k_th ordinary moment of the r_th order statistics X_r:n say as a liner combination of the k_th moments of the (McEG) distribution with different shape parameters. Therefore, the measures of skewness and kurtosis of the distribution of X_r:n can be calculated.

8 Estimation and inference

In this section, we determine the maximum likelihood estimates (MLEs) of the parameters of the (McEG) distribution from complete samples only. Let X₁,X₂,…,X_n be a random sample of size n from McEG (λ,θ,a,b,c).The likelihood function for the vector of parameters φ = (λ,θ,a,b,c) can be written as

(38)

Taking the log-likelihood function for the vector of parameters φ = (λ,θ,a,b,c) we get

(39)

The log-likelihood can be maximized either directly or by solving the nonlinear likelihood equations obtained by differentiating (39). The components of the score vector are given by

(40)

(41)

(42)

(43)

and

(44)

We can find the estimates of the unknown parameters by maximum likelihood method by setting these above non-linear Eqs. 40- (44) to zero and solve them simultaneously. Therefore, we have to use mathematical package to get the MLE of the unknown parameters. Also, all the second order derivatives exist. Thus we have the inverse dispersion matrix is given by

(45)

The elements of Hessian matrix is given in the Appendix.

By solving this inverse dispersion matrix these solutions will yield asymptotic variance and covariances of these ML estimators for ,, , and Using (44), we approximate 100(1 - γ)% confidence intervals for λ,θ,a,b and c are determined respectively as

where z_γ is the upper 100γ_the percentile of the standard normal distribution.

We can compute the maximized unrestricted and restricted log-likelihood functions to construct the likelihood ratio (LR) test statistic for testing on some the McEG sub-models. For example, we can use the LR test statistic to check whether the McEG distribution for a given data set is statistically superior to the EG distribution. In any case, hypothesis tests of the type H₀:φ = φ₀ versus H₀:φ ≠ φ₀ can be performed using a LR test. In this case, the LR test statistic for testing H₀ versus H₁ is , where and are the MLEs under H₁ and H₀, respectively. The statistic ω is asymptotically (as n → ∞) distributed as , where k is the length of the parameter vector θ of interest. The LR test rejects H₀ if , where denotes the upper 100γ% quantile of the distribution.

9 Application

In this section, we compare the results of fitting the McEG and EG distributions to real data sets. Sixty-three breaking strengths of glass fibres of length 1.5 cm were reported by Smith and Naylor (1987). No units for the breaking strengths were given. The The data are as follows:

The LR test statistic to test the hypotheses H₀:a = b = c = 1 versus H₁:a ≠ 1 ∨ b ≠ 1 ∨ c ≠ 1 is , so we reject the null hypothesis.

In order to compare the two distribution models, we consider criteria like -2ℓ, AIC (Akaike information criterion)and CAIC (corrected Akaike information criterion) for the data set. The better distribution corresponds to smaller -2ℓ, AIC and CAIC values:

where k is the number of parameters in the statistical model, n the sample size and ℓ is the maximized value of the log-likelihood function under the considered model. Also, here for calculating the values of KS we use the sample estimates of θ,α,a,b and c. Table 1 shows the MLEs under both distributions, Table 2 shows the values of -2ℓ, AIC and CAIC values. The values in Table 2 indicate that the McEG distribution leads to a better fit than the EG distribution.

A density plot compares the fitted densities of the models with the empirical histogram of the observed data (Figure 5). The fitted density for the McEG model is closer to the empirical histogram than the fits of the EG model.

Empirical, fitted McEG and EG cdf of the data set is given in Figure 6. PP of McEG, EG and KEG distribution are given, respectively in Figures 6, 7, 8 and 9.

Table 1 Estimated parameters of the EG and McEG distribution for the data set

Full size table

Table 2 Criteria for comparison

Full size table

10 Simulated data

In this subsection, we provided an algorithm to generated a random sample from the McEG distribution for the given values of its parameters and sample size n. The simulation process consists the following steps:

1.
Set n, and Θ = (λ,θ,a,b,c).
2.
Set initial value x⁰ for the random starting.
3.
Set j = 1.
4.
Generate U ∼ Uniform (0,1).
5.
Update x⁰ by using the Newton’s formula such as

6.
If ∣x⁰-x^⋆∣ ≤ ε, (very small, ε > 0 tolerance limit). Then, x^⋆ will be the desired sample from F(x).
7.
If ∣x⁰-x^⋆∣ > ε, then, set x⁰ = x^⋆ and go to step 5.
8.
Repeat steps 4-7, for j = 1,2,…,n and obtained x₁,x₂,…,x_n.

Using the above algorithm, we generated a sample of size 100 from McEG distribution for arbitrary values of λ = 0.1,θ = 0.5,a = 0.3,b = 4 and c = 5. The simulated sample is given by

The maximum likelihood estimates with corresponding confidence intervals are calculated based on the simulated sample. The MLEs of (λ,θ,a,b,c) are

respectively. The asymptotic confidence intervals for (λ,θ,a,b,c) are obtained as (0 ∼ 0.278), (0 ∼ 13.295), (0 ∼ 0.444), (0 ∼ 8.342) and (0 ∼ 21.30274044) respectively.

The pdf and empirical, fitted McEG cdf of the simulated data are given in (Figure 10) and (Figure 11).

11 Conclusion

Here we propose a new model, the so-called the McEG distribution which extends the EG distribution in the analysis of data with real support. An obvious reason for generalizing a standard distribution is because the generalized form provides larger flexibility in modeling real data. We derive expansions for the moments and for the moment generating function. The estimation of parameters is approached by the method of maximum likelihood, also the information matrix is derived. We consider the likelihood ratio statistic to compare the model with its baseline model. An application of the McEG distribution to real data show that the new distribution can be used quite effectively to provide better fits than EG distribution.

Appendix

The elements of Hessian matrix are:

References

Cordeiro GM, de Castro M: A new family of generalized distributions.J Stat Comput Simul 2011,81(7):883–898. 10.1080/00949650903530745
Article MathSciNet MATH Google Scholar
Cordeiro GM, Cintra RJ, Rêgo LC, Ortega EMM: The McDonald normal distribution.Pak J Stat Oper Res 2012,8(3):301–329.
Article MathSciNet Google Scholar
Corderio GM, Hashimoto EM, Ortega EMM: The McDonald-Weibull model, statistics.Statistics 2012,48(2):256–278.
Article Google Scholar
Eugene N, Lee C, Famoye F: Beta-normal distribution and its applications.Commun Stat Theory Methods 2002, 31:497–512. 10.1081/STA-120003130
Article MathSciNet MATH Google Scholar
Elbatal I, Merovci F, Marzouk W: McDonald generalized linear failure rate distribution.Pak J Stat Oper Res 2014,10(3):267–288.
Article MathSciNet Google Scholar
Elbatal I, Merovci F: A note on a generalization of the exponentiated pareto distribution.Econ Qual Control 2014,29(1):77–87.
Article MATH Google Scholar
Feroze N, Aslam M: Bayesian analysis of exponentiated gamma distribution under type II censored samples.Sci J Pure Appl Sci 2012,1(1):30–39.
Google Scholar
Ghanizadeh A, Pazira H, Lotfi R: Classical estimations of the exponentiated gamma distribution parameters with presence of k outliers.Aust J Basic Appl Sci 2011,5(3):571–579.
Google Scholar
Gupta RC, Gupta PL, Gupta RD: Modeling failure time data by Lehman alternatives.Commun Statistics-Theory Methods 1998,27(4):887–904. 10.1080/03610929808832134
Article MathSciNet MATH Google Scholar
Gradshteyn IS, Ryzhik IM: Table of integrals, series, and products. Academic Press, San Diego; 2000.
MATH Google Scholar
Khan RU, Kumar D: Lower generalized order statistics from exponentiated gamma distribution and its characterization.ProbStat Forum 2011, 4:25–38.
MathSciNet MATH Google Scholar
Marciano FW, Nascimento ADC, Santos-Neto M, Cordeiro GM: The Mc-Gammadistribution and its statistical properties: an application to reliability data.Int J Stat Probability 2012,1(1):53–71.
Article Google Scholar
Merovci F, Elbatal I: The McDonald modified weibull distribution: properties and applications.arXiv preprint arXiv 2013, 1309.2961.
Google Scholar
Nasiri P, Lotfi R, Veisipour H: Classical and Bayesian estimation of parameters on the generalized exponentiated gamma distribution.Sci Res Essays 2013,8(8):309–314.
Google Scholar
Singh SK, Singh U, Kumar D: Bayesian estimation of the exponentiated gamma parameter and reliability function under assymetric loss function.REVSTAT–Stat J 2011,9(3):247–260.
MATH Google Scholar
Singh U, Kumar D, Singh, SK: Bayesian estimation of the exponentiated gamma parameter and reliability function under asymmetric loss function.REVSTAT – Stat J 2011,9(3):247–260.
MathSciNet MATH Google Scholar
Shawky AI, Bakoban RA: Bayesian and non-Bayesian estimations on the exponentiated gamma distribution.Appl Math Sci 2008,2(51):2521–2530.
MathSciNet MATH Google Scholar
Shawky AI, Bakoban RA: Order statistics from exponentiated gamma distribution and associated inference.Int J Contemp Math Sci 2009,4(2):71–91.
MathSciNet MATH Google Scholar

Download references

Acknowledgements

This project was supported by King Saud University, Deanship of Scientific Research, College of Sciences Research Center.

Author information

Authors and Affiliations

Statistics and Operations Research, College of Science, King Saud University, P.O. Box 2455, Riyadh, 11451, Saudi Arabia
Abdulhakim A Al-Babtain
Department of Mathematics, University of Prishtina “Hasan Prishtina” & University of Mitrovica “Isa Boletini”, Mother Teresa, Av=5, 10000, Prishtinë, Kosovo
Faton Merovci
Institute of Statistical Studies and Research, Department of Mathematical Statistics, Cairo University, Cairo, Egypt
Ibrahim Elbatal

Authors

Abdulhakim A Al-Babtain
View author publications
You can also search for this author in PubMed Google Scholar
Faton Merovci
View author publications
You can also search for this author in PubMed Google Scholar
Ibrahim Elbatal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Abdulhakim A Al-Babtain, Faton Merovci or Ibrahim Elbatal.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

The authors, viz AA, FM, and IE with the consultation of each other carried out this work and drafted the manuscript together. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Al-Babtain, A.A., Merovci, F. & Elbatal, I. The McDonald exponentiated gamma distribution and its statistical properties. SpringerPlus 4, 2 (2015). https://doi.org/10.1186/2193-1801-4-2

Download citation

Received: 20 November 2014
Accepted: 15 December 2014
Published: 12 February 2015
DOI: https://doi.org/10.1186/2193-1801-4-2

The McDonald exponentiated gamma distribution and its statistical properties

Abstract

Abstract

AMS Subject Classification

1 Introduction

2 Mc-Donald generalized distribution

3 McDonald exponentiated gamma distribution

4 Expansions for the cumulative and density functions

5 Statistical properties

5.1 Quantile function and simulation

5.2 Moments

5.3 Moment generating function

6 Conditional moments, residual life and reversed failure rate function

7 Distribution of the order statistics

8 Estimation and inference

9 Application

10 Simulated data

11 Conclusion

Appendix

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Additional information

Competing interests

Authors’ contributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords