Acessibilidade / Reportar erro

A COMPARISON OF THE PERFORMANCE OF THE GEOMETRIC PROCESS AND ITS EXTENSIONS

ABSTRACT

The geometric process (GP) is a stochastic process that was an extension of the renewal process. It was introduced by Lam (1988)LAM Y. 1988. A note on the optimal replacement problem. Advances in Applied Probability, 20(2): 479-482. in 1988 with an intention to model the failure process of a repairable system whose the times between failures become shorter and shorter after repairs and repair times become longer and longer. The GP has been widely studied in the literature of reliability and maintenance and applied in optimisation of maintenance policies. Some authors have proposed various versions of its extensions (or the GP-like models), including the α-series process, the threshold GP, the extended Poisson process, the doubly GP, and the doubly-ratio GP. Some papers also compare the performance of the GP with that of other models, but not with the performance of the extensions of the GP. This paper therefore reviews the GP-like models, compares the performance of the GP and its extensions in terms of the Akaike information criterion (AIC), the corrected AIC (AICc) and the maximum likelihood (ML) based on 25 real-world datasets. Besides, the least square methods for estimating the parameters in some models are discussed, which is used for model performance of GP and GP-like models. The finding is useful for practitioners in their selection of the GP-like models.

Keywords:
Geometric process; failure process; recurrent event data analysis

1 INTRODUCTION

Modelling the failure process of a system is an important research topic in the reliability and maintenance research community and is normally done by using stochastic processes. The geometric process (GP), which is introduced by Lam (1988LAM Y. 1988. A note on the optimal replacement problem. Advances in Applied Probability, 20(2): 479-482.), is one of such stochastic processes. The GP has been widely used for modelling system failure processes. It is able to describe a stochastically increasing or decreasing trend, which is a characteristic in many practical applications in different fields Zhang (1999ZHANG YL. 1999. An optimal geometric process model for a cold standby repairable system. Reliability Engineering & System Safety , 63(1): 107-110.); Lam et al. (2004)LAM Y, ZHU LX, CHAN JS & LIU Q. 2004. Analysis of data from a series of events by a geometric process model. Acta Mathematicae Applicatae Sinica, English Series, 20(2): 263-282.; Wang & Yam (2017WANG GJ & YAM RC. 2017. Generalized geometric process and its application in maintenance problems. Applied Mathematical Modelling, 49: 554-567.); Pekalp & Aydoğdu (2021PEKALP MH & AYDOĞDU H. 2021. Power series expansions for the probability distribution, mean value and variance functions of a geometric process with gamma interarrival times. Journal of Computational and Applied Mathematics, 388: 113287.). In fact, the GP has received a lot of attention and has been extended into different variants, such as the general repair geometric process Finkelstein (1993FINKELSTEIN MS. 1993. A scale model of general repair. Microelectronics Reliability, 33(1): 41-44.), the α-series process Braun et al. (2005BRAUN WJ, LI W & ZHAO YQ. 2005. Properties of the geometric and related processes. Naval Research Logistics (NRL), 52(7): 607-616.), the threshold geometric process Chan et al. (2006CHAN JS, YU PL, LAM Y & HO AP. 2006. Modelling SARS data using threshold geometric process. Statistics in medicine, 25(11): 1826-1839.), the extended Possion process Wu & Clements-Croome (2006WU S & CLEMENTS-CROOME D. 2006. A novel repair model for imperfect maintenance. IMA Journal of Management Mathematics, 17(3): 235-243.), the doubly geometric process Wu (2018)WU S. 2018. Doubly geometric processes and applications. Journal of the Operational Research Society, pp. 1-13. and the doubly-ratio geometric process Wu (2022)WU S. 2022. The double ratio geometric process for the analysis of recurrent events. Naval Research Logistics (NRL) , pp. 484-495.. However, little research specifically compares the performance of these models based on real world datasets. This paper aims to compare the performance of the GP and its extensions.

The remainder of this paper is structured as follows. Section 2 describes the background of the GP and its extensions. Section 3 describes the least square estimation for models. Section 4 describes the test design and methods for estimating the performance of the models. Section 5 describes datasets for comparison in this paper and shows the result of the test for independence. Section 6 describes the comparison result among datasets. Section 7 concludes the findings and proposes future research.

2 GP AND ITS EXTENSIONS

This section introduces the GP and its extensions in detail. Before that, relevant concepts are introduced. Assume that X and Y are two random variables. If for every real number u, the inequality

P ( X u ) P ( Y u )

holds, then X is stochastically greater than or equal to Y , or Y is stochastically less than or equal to X. Then, the monotonicity of a stochastic process can be defined.

Definition 1.Ross et al. (1996ROSS SM, KELLY JJ, SULLIVAN RJ, PERRY WJ, MERCER D, DAVIS RM, WASHBURN TD, SAGER EV, BOYCE JB & BRISTOW VL. 1996. Stochastic processes. vol. 2. Wiley New York.) Given a stochastic process {X k = 1, 2...}, if X k st X k+1 or X k st X k+1 for k = 1, 2... then {X k , k = 1, 2, ...} is stochastically increasing or decreasing.

Lam (1988LAM Y. 1988. A note on the optimal replacement problem. Advances in Applied Probability, 20(2): 479-482.) proposes the definition of the geometric process as shown below.

Definition 2.Lam (1988LAM Y. 1988. A note on the optimal replacement problem. Advances in Applied Probability, 20(2): 479-482.) Given a sequence of non-negative random variables {X k , k = 1, 2, . . .}, if they are independent and the cdf of X k is given by F(a k−1 x) for k = 1, 2, . . . , where a is a positive constant, then {X k , k = 1, 2, · · ·} is called a geometric process (GP).

We refer to the random variable X k as the kth inter-arrival time in what follows.

Remark 1

From Definitions 1 and 2, the following results have been obtained.

  • If a > 1, then {X k , k = 1, 2, · · ·} is stochastically decreasing.

  • If a < 1, then {X k , k = 1, 2, · · ·} is stochastically increasing.

  • If a = 1, then {X k , k = 1, 2, · · ·} is a renewal process (RP).

As the GP has been extended into several variants Braun et al. (2005BRAUN WJ, LI W & ZHAO YQ. 2005. Properties of the geometric and related processes. Naval Research Logistics (NRL), 52(7): 607-616.); Wu & Clements-Croome (2006WU S & CLEMENTS-CROOME D. 2006. A novel repair model for imperfect maintenance. IMA Journal of Management Mathematics, 17(3): 235-243.); Chan et al. (2006CHAN JS, YU PL, LAM Y & HO AP. 2006. Modelling SARS data using threshold geometric process. Statistics in medicine, 25(11): 1826-1839.); Zhang & Wang (2016ZHANG YL & WANG GJ. 2016. An extended geometric process repair model for a cold standby repairable system with imperfect delayed repair. International Journal of Systems Science: Operations & Logistics, 3(3): 163-175.); Wu (2018WU S. 2018. Doubly geometric processes and applications. Journal of the Operational Research Society, pp. 1-13., 2022WU S. 2022. The double ratio geometric process for the analysis of recurrent events. Naval Research Logistics (NRL) , pp. 484-495.), they can be regarded as as a special case of the following definition Wu et al. (2020)WU D, PENG R & WU S. 2020. A review of the extensions of the geometric process, applications, and challenges. Quality and Reliability Engineering International, 36(2): 436-446..

Definition 3.Given a sequence of non-negative variables {X k , k = 1, 2, ...}, if variables are independent the distribution function of X k is given byF(g(δ1,k)xh(δ2,k))for k = 1, 2, ..., where g(δ 1 , k) is a positive function of δ 1 and h(δ 2 , k) is a positive function of δ 2 and k, and δ 1 and δ 2 are estimable parameter vectors, then {X k , k = 1, 2, · · ·} is a generalized GP.

Based on Remark 1 and Definition 3, the following results can be obtained:

  • If g(δ 1 , k) increase over k and h(δ 2 , k) = 1, then {X k , k = 1, 2, · · · n} is stochastically decreasing.

  • If g(δ 1 , k) decrease over k and h(δ 2 , k) = 1, then {X k , k = 1, 2, · · · n} is stochastically increasing.

  • If a = 1, then {X k , k = 1, 2, · · ·} is a (RP).

According to Wu (2018WU S. 2018. Doubly geometric processes and applications. Journal of the Operational Research Society, pp. 1-13.), the GP has the following limitations

  • Based on the definition of the GP, {X k , k = 1, 2, ...} always change monotonously over k. That means the GP cannot be used to describe the situation with non-monotonous failure ratio/patterns.

  • Suppose that X 1 follows the Weibull distribution. The generalised GP has constant g(δ 1 , k) and h(δ 2 , k) over k’s, which means both of scale and shape parameters of X k are independent of k. However, this may be too stringent that the GP may not be suitable for many real-world data.

In order to improve these limitations, several GP variants have been introduced and these will be discussed in the following part.

2.1 Extensions of the GP

2.1.1 α-series process

The α-series process (α-series) is proposed by Braun et al. (2005BRAUN WJ, LI W & ZHAO YQ. 2005. Properties of the geometric and related processes. Naval Research Logistics (NRL), 52(7): 607-616.). It considers that the expected number of counts at an arbitrary time does not exist for the decreasing geometric process. Then the decreasing version of the α-series process has a finite expected number of counts within certain conditions.

According to Braun et al. (2005BRAUN WJ, LI W & ZHAO YQ. 2005. Properties of the geometric and related processes. Naval Research Logistics (NRL), 52(7): 607-616.), suppose Y 1 ,Y 2 ,... is a sequence of independent and identically distributed random variables, it defines

X k A = Y k k a ,

where XkA is a related process to XkG, which is a GP. Then,

Y k = X k A k a , (1)

where a is a real-valued parameter, then the sum of XkA is given by

S n A = k - 1 n X k A ,

set

S 0 A = 0 ,

and the point process can be defined

N A ( t ) = sup { k : S k A t } ,

where N A (t) is an α-series process.

Besides, it indicates that for the α-series process, E(N A (t)) < ∞ in the increasing case, as well as in the decreasing case, provided that the random variable Y has a sufficient number of moments. Then

  • if α < 0, then the E(N A (t)) is finite for all t > 0.

  • if 0 ⩽ α ⩽ 1/2 and E(|Yn|2+ξ)< , then the E(N A (t)) is finite for all t > 0.

  • if 1/2 ⩽ α ⩽ 1 and E(|Yn|(1+ξ)/(1-α))< , then the E(N A (t)) is finite for all t > 0.

Then the definition of an α-series process can be obtained.

Definition 4.Braun et al. (2005BRAUN WJ, LI W & ZHAO YQ. 2005. Properties of the geometric and related processes. Naval Research Logistics (NRL), 52(7): 607-616.) Given a sequence of non-negative random variables {X k , k = 1, 2, ...}, if they are independent and the cdf of X k is given by F(k a x) for k = 1, 2..., where a is a positive constant.

For an α-series process, based on Definitions 1 and 2, g(δ 1 , k) = k a and h(δ 2 , k) = 1 can be obtained.

2.1.2 Threshold geometric process

The threshold geometric process (TGP) is proposed by Chan et al. (2006CHAN JS, YU PL, LAM Y & HO AP. 2006. Modelling SARS data using threshold geometric process. Statistics in medicine, 25(11): 1826-1839.). The main characterise of TGP is that it can describe non-monotonous trends in a failure process. According to Chan et al. (2006)CHAN JS, YU PL, LAM Y & HO AP. 2006. Modelling SARS data using threshold geometric process. Statistics in medicine, 25(11): 1826-1839., it has separate GPs and the kth GP is denoted

GP k = { X i , T k i < T k + 1 } , k = 1 , . . . , K ,

to each trend with turning point T k . Then each GP has its a and the following definition can be introduced

Definition 5.Chan et al. (2006CHAN JS, YU PL, LAM Y & HO AP. 2006. Modelling SARS data using threshold geometric process. Statistics in medicine, 25(11): 1826-1839.) Given a sequence of non-negative random variables {X k , k = 1, 2, ...}, if they are independent and the cdf of X k is given byF(aki-Tkx)for k = 1, 2..., where a is a positive constant, then {X k , k = 1, 2, ...} is called a threshold geometric process.

Then, based on Definitions 1 and 2, g(δ1,k)=aki-Tk and h(δ 2 , k) = 1 can be obtained.

2.1.3 Extended Possion Process

The Extended Possion Process (EPP) is proposed by Wu & Clements-Croome (2006WU S & CLEMENTS-CROOME D. 2006. A novel repair model for imperfect maintenance. IMA Journal of Management Mathematics, 17(3): 235-243.). Similar to the TGP and DPG, the EPP can model a failure process with non-monotonous trends.

Definition 6.Wu & Clements-Croome (2006WU S & CLEMENTS-CROOME D. 2006. A novel repair model for imperfect maintenance. IMA Journal of Management Mathematics, 17(3): 235-243.) Given a sequence of non-negative random variables {X k , k = 1, 2, ...}, if they are independent and the cdf of X k is given byF((αak-1+βbk-1)x)for k = 1, 2..., where α + β ≠ 0, α, β ≥ 0, a ≥ 0 and 0 < b ≤ 1, then {X k , k = 1, 2, ...} is called as an extended possion process (EPP).

Besides, a EPP has these two main properties:

  • if αa k−1 ≠ 0 and b = 1,

    1. if a > 1, then the EPP can model a failure process with decreasing failure intensity functions with respect to k;

    2. if a < 1, then the EPP can model a failure process with increasing failure intensity functions with respect to k;

  • if a = 1, b < 1 and βb k−1 ≠ 1, then {X k , k = 1, 2, ...} can model the process with increasing increasing failure intensities over k.

Then, based on Definitions 1 and 2, g(δ1,k)=(αak-1+βbk-1) and h(δ 2 , k) = 1 can be obtained.

2.1.4 Doubly geometric process

The doubly geometric process (DGP) is proposed by Wu (2018WU S. 2018. Doubly geometric processes and applications. Journal of the Operational Research Society, pp. 1-13.). Different from the GP, the DGP can model a situation that the shape parameters of the lifetime distributions of inter-arrival times Xk changes k. Besides, it can model not only monotonously increasing or decreasing stochastic processes, but also a failure process with different trends (stochastically increasing or decreasing), which is similar to TGP. The definition of a DPG is given in the following definition.

Definition 7.wu2017doubly Given a sequence of non-negative random variables {X k , k = 1, 2, ...}, if they are independent and the cdf of X k is given by F(a k−1 x h(k) ) for k = 1, 2..., where a is a positive constant, h(k) is a function of k and the likelihood of the parameters in h(k) has a known closed form, and h(k) > 0, then {X k , k = 1, 2, ...} is called as a doubly geometric process (DGP). The a k−1 is refereed as the scale impact factor and h(k) as the shape impact factor.

The series {a k−1 , k = 1, 2...} and {h(k), k = 1, 2...} can be two different geometric series. According to Definition 7, if h(k) equals to 1, then the shape parameter becomes constant over k if X 1 follows a Weibull distribution. If h(k) is not 1, Wu (2018WU S. 2018. Doubly geometric processes and applications. Journal of the Operational Research Society, pp. 1-13.) uses

h ( k ) = ( 1 + log ( k ) ) b

where log is the logarithm with base 10 and b is a parameter. Based on Definition 7, a DGP can model more flexible processes and it has the following properties

  • if 0 < a < 1 and b < 0, then {X k , k = 1, 2, ...} is stochastically increasing.

  • if a > 1 and b < 0, then {X k , k = 1, 2, ...} is stochastically decreasing.

  • if 0 < a < 1 and 0 < b < 4.898226, then {X k , k = 1, 2, ...} is stochastically increasing.

  • if a > 1 and 0 < b < 4.898226, then {X k , k = 1, 2, ...} is stochastically decreasing.

Then, based on Definitions 1 and 2, g(δ 1 , k) = a k−1 and h(δ2,k)=(1+log(k))b can be obtained.

2.1.5 Doubly-ratio geometric process

The doubly-ratio geometric process (DRGP) is proposed by Wu (2022WU S. 2022. The double ratio geometric process for the analysis of recurrent events. Naval Research Logistics (NRL) , pp. 484-495.). Suppose that the hazard function of X k is denoted by r k (t) and that {X k , k = 1, 2, . . .} follows the GP, then

h k ( t ) = a h k - 1 ( a t ) , (2)

where the two a’s play different roles in and have different implications in describing maintenance effectiveness: the first a describes the effectiveness on how the hazard function is affected and the second a (i.e., the one multiplying t in the parentheses) describes the effectiveness on how the age of the item under maintenance is affected. Therefore, the following definition of the DRGP can be given

Definition 8.(Wu, 2022WU S. 2022. The double ratio geometric process for the analysis of recurrent events. Naval Research Logistics (NRL) , pp. 484-495.) Given a sequence of non-negative random variables {X k , k = 1, 2, ...}, if they are independent and the cdf of X k is given byFk(t)=1-(1-F1(akt))bk/akfor k = 1, 2..., where a k and b k are positive parameters and a 1 = b 1 = 1. Then {X k , k = 1, 2, ...} is a double-ratio geometric process.

It has the following proprieties

  • If both a k and b k are increasing in k, then the DRGP is stochastically decreasing,

  • If both a k and b k are decreasing in k, then the DRGP is stochastically increasing, and

  • If a k or b k is increasing in k and b k or a k is decreasing in k, then the DRGP may not be stochastically monotonic.

Based on Definitions 4, 5, 6, 7 and 8, the following table summarizes the characterises of the GP and its extensions.

Table 1
GP and its extensions.

3 LEAST SQUARE ESTIMATION

In the Section 4.2 in Lam (2007LAM Y. 2007. The geometric process and its applications. World Scientific: Singapore.), the least squares estimators of a GP is given. Similarly, the least square method can be applied to other GP’s extensions.

3.1 Least square estimation of α-series

For the α-series process, let

Y k = k a X k , k = 1 , 2 , . . . . n . (3)

Then {Y k , k = 1, 2, ...} is a sequence of i.i.d random variables, so is {lnY k , k = 1, 2, ...}. Taking logarithm on the both sides of Eq (3) give

ln Y k = a ln k + ln X k , k = 1 , 2 , . . . n . (4)

Let

ln Y k = μ α + e k , k = 1 , 2 , . . . n , (5)

then,

ln X k = μ α - a ln k + e k , k = 1 , 2 , . . . n . (6)

Lemma 1. Denote Q α as the sum squares of errors

Q α = k = 1 n ( ln X k - μ α + a ln k ) 2 , (7)

minimising Q α then the least square estimator of µ α is given by

μ α = 1 n k = 1 n ln X k + a ln k . (8)

Proof. Set the derivatives of Q α to 0,

Q α μ α = 0 ,

then

- 2 k = 1 n ( ln X k - μ α + a ln ( k ) ) = 0 , k = 1 n μ α = k = 1 n ( ln X k + a ln ( k ) ) , n μ α = k = 1 n ( ln X k + a ln ( k ) ) , μ α = 1 n k = 1 n ln X k + a ln k .

This completes the proof.

3.2 Least square estimation of TGP

According to Chan et al. (2006CHAN JS, YU PL, LAM Y & HO AP. 2006. Modelling SARS data using threshold geometric process. Statistics in medicine, 25(11): 1826-1839.), for the TGP, let

Y j = a j 1 - k X j + T k - 1 - 1 , k = 1 , 2 , n , j = T k - 1 , T k - 1 + 1 , T k - 1 , (9)

where T 0 = 0. Then taking logarithm on the both sides of Eq. (9) gives

ln Y j = ( 1 - k ) ln a + ln X j + T k - 1 - 1 , k = 1 , 2 , n , j = T k - 1 , T k - 1 + 1 , T k - 1 . (10)

Let

ln Y j = μ j + e j + T k - 1 - 1 , k = 1 , 2 , n , j = T k - 1 , T k - 1 + 1 , T k - 1 , (11)

then,

ln X j + T k - 1 - 1 = μ j - ( 1 - k ) ln a + e j + T k - 1 - 1 , k = 1 , 2 , n , j = T k - 1 , T k - 1 + 1 , T k - 1 . (12)

Lemma 2. Denote Q tgp as the sum squares of errors

Q t g p = j = T k - 1 T k - 1 Q j , (13)

and

j = T k - 1 T k - 1 Q j = k = 1 n j = T k - 1 T k - 1 ( X j + T k - 1 - 1 - μ j a j 1 - k ) 2 , (14)

minimising Q tgp then the least square estimator of µ j is given by

μ j = k = 1 n j = T k - 1 T k - 1 ( X j + T k - 1 - 1 ) k = 1 n j = T k - 1 T k - 1 a j 1 - k . (15)

Proof. Set the derivative of Q tgp to 0,

Q 0 μ 0 = 0 , Q 1 μ 1 = 0 , Q j μ j = 0 ,

then

- 2 a j 1 - k k = 1 n j = T k - 1 T k - 1 ( X j + T k - 1 - 1 - μ j a j 1 - k ) = 0 , k = 1 n j = T k - 1 T k - 1 ( X j + T k - 1 - 1 - μ j a j 1 - k ) = 0 , k = 1 n j = T k - 1 T k - 1 ( X j + T k - 1 - 1 ) - μ j k = 1 n j = T k - 1 T k - 1 a j 1 - k = 0 , μ j = k = 1 n j = T k - 1 T k - 1 ( X j + T k - 1 - 1 ) k = 1 n j = T k - 1 T k - 1 a j 1 - k .

This completes the proof.

3.3 Least square estimation of EPP

Similarly, for the EPP, let

Y k = ( α a k - 1 + β b k - 1 ) X k , k = 1 , 2 , . . . . n . (16)

Then {Y k , k = 1, 2, ...} is a sequence of i.i.d random variables, so is {lnY k , k = 1, 2, ...}. Taking logarithm on the both sides of Eq. (16) give

ln Y k = ln ( α a k - 1 + β b k - 1 ) + ln X k , k = 1 , 2 , . . . n . (17)

Let

ln Y k = μ e p p + e k , k = 1 , 2 , . . . n , (18)

then,

ln X k = μ e p p - ln ( α a k - 1 + β b k - 1 ) + e k , k = 1 , 2 , . . . n . (19)

Lemma 3. Denote Q epp as the sum squares of errors

Q e p p = k = 1 n ( ln X k - μ e p p + ln ( α a k - 1 + β b k - 1 ) ) 2 , (20)

minimising Q epp then the least square estimator of µ epp is given by

μ e p p = 1 n k = 1 n ln X k + ln ( α a k - 1 + β b k - 1 ) ) (21)

Proof. Set the derivatives of Q epp to 0,

Q e p p μ e p p = 0 ,

Then

- 2 k = 1 n ( ln X k - μ e p p + ln ( α a k - 1 + β b k - 1 ) ) = 0 , k = 1 n μ e p p = k = 1 n ( ln X k + ln ( α a k - 1 + β b k - 1 ) ) , n μ e p p = k = 1 n ( ln X k + ln ( α a k - 1 + β b k - 1 ) ) , μ e p p = 1 n k = 1 n ( ln X k + ln ( α a k - 1 + β b k - 1 ) ) .

This completes the proof.

3.4 Least square estimation of DGP

Lemma 4. Denote Q dgp as the sum squares of errors, the Q dgp is

Q d p g = k = 1 n ( ln X k - μ d g p - ( k - 1 ) ln a h ( k ) ) 2 ,

where h ( k ) = ( 1 + log ( k ) ) b and minimising Q dpg , then, the µ dgp is given by

μ d g p = k = 1 n ( ln X k + k - 1 h ( k ) ln a ) n k = 1 n ( 1 h ( k ) ) . (22)

Proof. Set the derivatives of Qdgp to 0,

Q d g p μ d g p = 0 ,

then

- 2 h ( k ) k = 1 n ( ln X k - 1 h ( k ) μ d g p + k - 1 h ( k ) ln a ) = 0 , k = 1 n ( ln X k - 1 h ( k ) μ d g p + k - 1 h ( k ) ln a ) = 0 , k = 1 n μ d g p h ( k ) = k = 1 n ln X k + k = 1 n k - 1 h ( k ) ln a , n μ d g p k = 1 n 1 h ( k ) = k = 1 n ln X k + k = 1 n k - 1 h ( k ) ln a , μ e p p = 1 n k = 1 n ( ln X k + ln ( α a k - 1 + β b k - 1 ) ) .

This completes the proof.

4 TEST DESIGN

4.1 Independent and identically distributed test

One important assumption of the GP and its extension is that {X n , n = 1, 2, ...} are independent and identically distributed (i.i.d) random variables. According to Theorem 4.2.1, Equations 4.2.1 and 4.2.2 in Lam (2007LAM Y. 2007. The geometric process and its applications. World Scientific: Singapore.), if denote

Y k = X 2 k X 2 k - 1 , k = 1 , 2 . . . ,

and

Y k ' = X 2 k + 1 X 2 k , k = 1 , 2 . . . ,

if {X k , k = 1, 2, ...} is a GP, then {Y k , k = 1, 2, ...} and {Y k ′, k = 1, 2, ...} are respectively two sequence of i.i.d random variable.

The statistical R package ”spgs” is used in this paper for the i.i.d test (download at https://cran.r-project. org/web/packages/spgs/index.html).

4.2 Evaluation of the model performance

In this paper, the Akaike information criterion (AIC), the corrected AIC (AICc) will be used as index to compare the fitness among different GPs. The model with lower AIC score is better in this test.

AIC = - 2 ln ( L ) + 2 q

and

AICc = AIC + 2 q ( q + 1 ) n - q - 1

where q is the number of parameters in a model, L is the maximum likelihood (ML) and n is the number of observations in a dataset. Among the candidate model with the smallest AIC and AICc is regarded the best model.

The ML will be used to estimate the performance of a model. The model with the largest ML score regarded the best.

The statistical R package ”GenSA” is used in this paper for optimization strategy of parameters (download at https://cran.r-project.org/web/packages/GenSA/index.html).

5 DATASETS

The following table shows the datasets and their reference.

Table 2
Summary of datasets.

In Table 2, 25 datasets are used in this paper. The following Table 3 shows independence test for each dataset. The dataset with a p-value lower than 0.05 means that it does not follow the independence assumption, therefore, such datasets will not be used for further comparisons.

Table 3
Summary of independence test.

According to the above result, the following findings can be obtained:

  • The size of our most dataset is not huge enough (n ≦ 30) so that it is impossible to use Chi-square test.

  • Datasets 8, 16, 21 and 22 (BUS510, Xie, Hong Kong SARA and Singapore SARA) are not satisfied with the conditions of independence as the independence p-value is less than 0.05. Thus, there datasets will be ignored in this paper.

It is worth noticing that the Section 5 in Lam (2007LAM Y. 2007. The geometric process and its applications. World Scientific: Singapore.) briefly studies the model performance by the GP, the RP or the homogeneous Poisson process (HPP) model, the Cox-Lewis model and the Weibull process (WP) model based on ten real datasets. Among them, six datasets, which are 11, 13, 19, 21, 22 and 24, will be used in this paper. To compare the fitness of models, the mean squared error (MSE) and the maximum percentage error (MPE) will be used to evaluate the model performance. In Lam (2007)LAM Y. 2007. The geometric process and its applications. World Scientific: Singapore., the GP model is the best model among these four models.

In this paper, more different datasets, which have been shown in Table 2 will be used for comparison of the performance among more GP’s extensions in this paper. AIC, AICc and ML will be used to evaluate the model performance. The next section describes numerical results in this paper.

6 CASE STUDIES

The following Table 4 shows the model performance based on 25 datasets. The order of data in each sheet from top to bottom is AIC, AICc and ML.

It is worth noticing that the results are shown in the sequence in the rank of model performance.

Table 4
The values of AIC, AICc, and ML of the models.

According to the above table, the following result can be obtained:

  • In terms of the AIC result

    • - Based on the rank of model performance, the result shows that AICTGP > AICRP > AICα-series > AICGP = AICDRGP > AICEPP > AICDGP.

    • - Based on the rank of the average AIC score, AICTGP > AICDRGP > AICRP > AICGP > AICα-series > AICDRGP > AICEPP.

    • - The TGP has the best performance (6 best scores), the second one is the RP (5 best scores), then the α-series (4 best scores), EPP (2 best scores) and DRGP (2 best scores) have similar performance.

    • - The EPP works better than the DGP in these 23 datasets. However, the DGP model has lower average AIC score than the EPP.

    • - The DRGP has the second lowest average AIC score, its combined performance is better than other five models and is second only to the TGP.

  • In terms of the AICc result

    • - Based on the rank of model performance, the result shows that AICcRP > AICcTGP > AICcα-series = AICcGP = AICcDGP = AICcDRGP > AICcEPP.

    • - Based on the rank of the average AICc score, the result shows that AICcRP > AICcTGP > AICcDGP > AICcGP > AICcEPP > AICcα-series > AICcDRGP.

    • - The RP has the best performance (9 best scores), second one is the TGP (7 best scores). The performance of these two models is more pronounced in terms of AICc.

    • - In terms of AICc, TGP has the best performance, which is different from the result of AIC.

    • - The performance of the GP is better in terms of AICc (comparing to the AIC).

    • - The performance of the DRGP has obvious disadvantage in terms of AICc. It is 623.80 which is much higher than other six models.

  • In terms of the ML result

    • - The TGP has significant advantage on model performance.

    • - Based on the rank of model performance, the result shows that MLTGP > MLDRGP > MLEPP > MLRP = MLGP = MLα-series = MLDGP.

    • - Based on the rank of the average ML score, the result shows that MLTGP > MLDRGP > MLDGP > MLEPP > MLGP > MLRP > MLα-series.

    • - The performance of both DRGP and DGP has significant increase comparing with the result in terms of AIC and AICc.

The following table shows the best model for each dataset in terms of different index.

Table 5
Result of the performance for datasets.

7 CONCLUSIONS

This paper compared the performances of the geometric process (GP) and its extensions based on 25 real-world datasets. These extensions, the α-series process, the threshold geometric process (TGP), the extended possion process (EPP), the doubly geometric process (DGP) and the doubly-ratio geometric process (DRGP), have been used in this paper. Three performances metrics, AIC (Akaike Information Criterion), AICc (correction of AIC) and ML (Maximum Likelihood) were used. Besides, it discussed the LSE (least square estimation) as an alternative method to estimate the model parameters.

According to the comparison results, based on the AIC, the best model is the TGP and the second best is RP. Then in terms of the AICc, the best model is the DRGP and the second best is the TGP. Based on the result of the ML, the best model is the TGP and the second best is the DRGP. These results would give supports for choosing the best model in practical applications.

References

  • ANDREWS DF & HERZBERG AM. 2012. Data: a collection of problems from many fields for the student and research worker. Springer Science & Business Media.
  • ASCHER H & FEINGOLD H. 1984. Repairable systems reliability: modeling, inference, misconceptions and their causes. M. Dekker New York.
  • BRAUN WJ, LI W & ZHAO YQ. 2005. Properties of the geometric and related processes. Naval Research Logistics (NRL), 52(7): 607-616.
  • CHAKRABORTY S & CHAKRAVARTY D. 2012. Discrete gamma distributions: Properties and parameter estimations. Communications in Statistics-Theory and Methods, 41(18): 3301-3324.
  • CHAN JS, YU PL, LAM Y & HO AP. 2006. Modelling SARS data using threshold geometric process. Statistics in medicine, 25(11): 1826-1839.
  • FINKELSTEIN MS. 1993. A scale model of general repair. Microelectronics Reliability, 33(1): 41-44.
  • GUIDA M & PULCINI G. 2009. Reliability analysis of mechanical systems with bounded and bathtub shaped intensity function. IEEE Transactions on Reliability, 58(3): 432-443.
  • GUPTA S, MAITI J, KUMAR R & KUMAR U. 2009. A control chart guided maintenance policy selection. International Journal of Mining, Reclamation and Environment, 23(3): 216-226.
  • HAND D, DALY F, MCCONWAY K, LUNN D & OSTROWSKI E. 1993. A Handbook of Small Data Sets. CRC Press. Available at: https://books.google.co.uk/books?id=m4O1DwAAQBAJ
    » https://books.google.co.uk/books?id=m4O1DwAAQBAJ
  • KUMAR U & KLEFSJÖ B. 1992. Reliability analysis of hydraulic systems of LHD machines using the power law process model. Reliability Engineering & System Safety, 35(3): 217-224.
  • LAM Y. 1988. A note on the optimal replacement problem. Advances in Applied Probability, 20(2): 479-482.
  • LAM Y. 2007. The geometric process and its applications. World Scientific: Singapore.
  • LAM Y, ZHU LX, CHAN JS & LIU Q. 2004. Analysis of data from a series of events by a geometric process model. Acta Mathematicae Applicatae Sinica, English Series, 20(2): 263-282.
  • LEE L. 1980. Testing adequacy of the Weibull and log linear rate models for a Poisson process. Technometrics, 22(2): 195-199.
  • MODARRES M. 2006. Risk analysis in engineering: techniques, tools, and trends. CRC press.
  • MUSA JD. 1979. Software reliability data. Technical Report in Rome Air Development Center, .
  • PEKALP MH & AYDOĞDU H. 2021. Power series expansions for the probability distribution, mean value and variance functions of a geometric process with gamma interarrival times. Journal of Computational and Applied Mathematics, 388: 113287.
  • PERCY DF & ALKALI BM. 2007. Scheduling preventive maintenance for oil pumps using generalized proportional intensities models. International Transactions in Operational Research, 14(6): 547-563.
  • PROSCHAN F. 2000. Theoretical explanation of observed decreasing failure rate. Technometrics , 42(1): 7-11.
  • ROSS SM, KELLY JJ, SULLIVAN RJ, PERRY WJ, MERCER D, DAVIS RM, WASHBURN TD, SAGER EV, BOYCE JB & BRISTOW VL. 1996. Stochastic processes. vol. 2. Wiley New York.
  • WANG GJ & YAM RC. 2017. Generalized geometric process and its application in maintenance problems. Applied Mathematical Modelling, 49: 554-567.
  • WU D, PENG R & WU S. 2020. A review of the extensions of the geometric process, applications, and challenges. Quality and Reliability Engineering International, 36(2): 436-446.
  • WU S. 2018. Doubly geometric processes and applications. Journal of the Operational Research Society, pp. 1-13.
  • WU S. 2022. The double ratio geometric process for the analysis of recurrent events. Naval Research Logistics (NRL) , pp. 484-495.
  • WU S & CLEMENTS-CROOME D. 2006. A novel repair model for imperfect maintenance. IMA Journal of Management Mathematics, 17(3): 235-243.
  • XIE M & LAI CD. 1996. Reliability analysis using an additive Weibull model with bathtub-shaped failure rate function. Reliability Engineering & System Safety , 52(1): 87-93.
  • ZHANG YL. 1999. An optimal geometric process model for a cold standby repairable system. Reliability Engineering & System Safety , 63(1): 107-110.
  • ZHANG YL & WANG GJ. 2016. An extended geometric process repair model for a cold standby repairable system with imperfect delayed repair. International Journal of Systems Science: Operations & Logistics, 3(3): 163-175.

Publication Dates

  • Publication in this collection
    12 Dec 2022
  • Date of issue
    2022

History

  • Received
    16 Mar 2022
  • Accepted
    29 July 2022
Sociedade Brasileira de Pesquisa Operacional Rua Mayrink Veiga, 32 - sala 601 - Centro, 20090-050 Rio de Janeiro RJ - Brasil, Tel.: +55 21 2263-0499, Fax: +55 21 2263-0501 - Rio de Janeiro - RJ - Brazil
E-mail: sobrapo@sobrapo.org.br