1. Introduction
Bivariate count data occur in many contexts, often as the counts of two events, objects or individuals during a certain period of time. For example, such counts occur in epidemiology when two kinds of related diseases are examined, in criminology when two kinds of crimes are committed, in business when the volume of sales of two correlated products are observed or in manufacturing when two similar products are produced.
In real application, the observed time series data are often discrete, over-dispersed (the empirical variance is greater than the empirical mean) and have other features such as time dependence. Many univariate models have been proposed to deal with integer-valued time series data based on the univariate binomial thinning operator , which is proposed by Steutel and van Harn [1]:
(1)
where X is a non-negative integer-valued random variable and . The INAR(1) model [2], The BAR(1) model [3], the INAR(p) model [4], The PDINAR(1) model [5] and the BARIO model [6] are very popular in analyzing non-negative integer-valued time series; see Weiß [7], Scotto et al. [8] and Davis et al. [9] for recent reviews on this topic. Motivated by infinite-patch metapopulation models discussed in Buckley and Pollett [10], Weiß [11] proposed an extension to the popular Poisson INAR(1) model, which is characterized by time-dependent innovations, i.e., the mean of the innovation is linearly increased by the previous population size. An important advantage of this model is that it gives a reasonable interpretation for immigration, which becomes more attractive if the current population is large; see Weiß [11] for an application to iceberg order data.Univariate models are extensively investigated in the literature, but relatively few multivariate models, especially for bivariate versions, have been studied in detail. Franke and Rao [12] proposed a multivariate INAR(1) model, which is generalized to the p-order case by Latour [13]. Pedeli and Karlis [14] discussed a tractable bivariate INAR(1) model, which can be used to deal with bivariate count data with equi-dispersion and over-dispersion, but with small flexibility. See Pedeli and Karlis [15] for the estimation of the BINAR model and Pedeli and Karlis [16] for a further discussion of the properties of the multivariate INAR(1) model. Based on hierarchical dynamic models, Ravishanker et al. [17] described a Bayesian framework for estimation and prediction for multivariate times series of counts. Popović [18] proposed a bivariate INAR(1) model with random coefficients based on different binomial thinning operators. The above models assumed the innovations of their marginal models are independent and identically distributed. Based on a finite range of counts, Scotto et al. [19] considered the density-dependent bivariate binomial autoregressive models by using their state-dependent thinning concept. Li et al. [20] proposed a bivariate random coefficient INAR(1) model with asymmetric Hermite innovations.
Inspired by Weiß [11], we aim at providing a bivariate INAR model to analyze bivariate time series with time dependence and cross-correlation. The first contribution is that this paper gives an available method to capture the time-dependence trend by imposing the past information in the distribution of the innovational vector, which in turn makes the cross-correlation between the two entries into an innovation vector. The second contribution is that the new model not only allows autocorrelation but also allows cross-correlation. The third contribution is that this paper illustrates the stationarity and ergodicity of the extended bivariate INAR process and its two subprocesses, which is important to derive the consistence and asymptotic normality of the CML estimation.
The remainder of the paper is organized as follows. In Section 2, we first give brief reviews of the bivariate Poisson distribution and the bivariate binomial thinning operator, based on which we give the definition of the new bivariate INAR(1) model. Conditional maximum likelihood (CML) estimates and the asymptotic properties of unknown parameters are discussed in Section 3. A simulation and two real data examples that show the effectiveness of the new model are given in Section 4 and Section 5, respectively. Conclusions are made in Section 6.
2. A New Bivariate INAR(1) Model
For readability, we first give a brief review of the bivariate Poisson distribution.
If the joint probability mass function (pmf) of satisfies
(2)
where and , then is called a bivariate Poisson random variable with parameters , i.e.,BPFrom Kocherlakota and Kocherlakota [21], we obtain the fact that if follows BP, there must exist three mutually independent random variables , , such that and , where and follow Poisson, Poisson and Poisson, respectively. Then, we have the conclusion that . In addition, , given in (2), is continuous and differentiable. For convenience, we denote . By using Lemma A3 in Li et al. [20], we obtain that
(3)
(4)
(5)
Applying the univariate binomial thinning operator given in (1) to the bivariate case with leads to the bivariate binomial thinning operator:
where , and are non-negative integer-valued random variables, and all the thinnings are performed independent of each other.By calculation, Denoting as the variance matrix of the Bernoulli random variables with , we obtain that Furthermore, if all the counting series of and are independent, .
In the following, we give the definition of the new bivariate INAR(1) model, which not only includes the property of the models defined by Pedeli and Karlis [14,16] but also allows the innovation vectors to be time-dependent.
Let be non-negative integer-valued bivariate random vector. If the process satisfies
(6)
then is said to follow the extended bivariate INAR(1) process, where , , for any with , , .For simplicity, we denote the new model as the EBINAR(1) model. It is easy to see that the ith equation of model (6) is presented by:
(7)
Notice that the model given by (7) is similar to the one discussed in Weiß [11], the main difference is that involves two paralleled survivors and . It is known that the EBINAR(1) process has two parts: the first part consists of the survivors of the elements of the system at the preceding time , denoted by ; the other part is comprised by the time-dependent innovation vector , which implies that the mean of the innovation vector is linearly increased by the previous population size.
Remark 1. (1). If both and are diagonal matrices, the component equation given in (7) becomes to the one discussed by Weiß [11].
(2). If is diagonal and , model (6) becomes the one discussed in Pedeli and Karlis [14], but it is worth mentioning that the autoregression matrix in Pedeli and Karlis [14] is diagonal, which means that it causes no cross-correlation in the counts.
(3). If is non-diagonal and , model (6) becomes the one discussed in Pedeli and Karlis [16], which accounts for cross-correlation in the counts, but they still keep the innovations of their marginal models independent and identically distributed such that the time dependence can not to be captured.
To derive the pmf the EBINAR(1) process, we first denote is the convolution of , with and . By calculation, we obtain that
(8)
Furthermore, by using Lemma A3 in Li et al. [20],
(9)
(10)
Second, we denote , , and let and . Then, the conditional probability distribution of the EBINAR(1) process takes the following form:
(11)
where , , with and .If the largest eigenvalue of non-negative matrix is less than 1, then the bivariate marginal distribution of model (6) can be expressed in terms of the bivariate innovation vectors:
(12)
where is an identity matrix, and is the initial value of the process.In what follows, we first discuss the stationarity and ergodicity of processes (6) and (7), respectively. Second, we obtain the first two-moment of and , respectively. Third, we give a necessary and sufficient condition for the existence of and for any fixed positive integer k. These properties are necessary to derive the asymptotic properties of the estimators.
Let follow (6), with . If the largest eigenvalue of Γ is less than 1, there exists a strictly stationary and ergodic process satisfying (7).
Let , and be independent of each other and each of them be independent and identically distributed, i.e., , and , where means the convolution of the distributions and , and ; see Weiß [11] for details. According to the concepts of bivariate binomial thinning and the additivity of binomial distribution and Poisson distribution, (7) can be rewritten as
(13)
Since , we have and . Denote and , then In addition, that and , thus, . Therefore, the Theorem of Heathcote [22] holds. Hence, there exists a stationary marginal distribution of (7), i.e., there exists a strictly stationary process satisfying (7). Similarly, we also have a similar conclusion for . □
To prove the stationaity of the EBINAR(1) process, we first introduce a sequence of random variables that could be considered as approximations to with
where the largest eigenvalues of the non-negative matrices , and are less than 1, all of the non-negative matrices , and are invertible, , is independent with and follows a Poisson distribution with the parameter , .If the conditions of Theorem 1 hold, there exists a strictly stationary process satisfying (6).
Because
are identically distributed for and are identically distributed. Thus, is strictly stationary. Now, we suppose is strictly stationary. Then,(14)
and(15)
Because the joint distributions of the variables involved in the right-hand sides of (14) and (15) are identical, thus, and in the left-hand side of the two equations are identically distributed. Hence, the process is strictly stationary. Therefore, is a strictly stationary process. □
If the conditions of Theorem 1 hold, is a null recurrent and ergodic Markov chain.
First, we prove is null recurrent. Because then with probability one. Let and , . Then, we obtain that
According to Theorem 2, there exists an such that Then, we have Hence,
Therefore, . Thus, , i.e., is a recurrent state.
Second, we illustrate the ergodicity. For all states , , , we have
where denotes the transition probability from state to state . Thus, is a homogeneous Markov chain. Since , thus Denote n-state transition probability from state to state with . For a given , the conditional probability function of the random vector is derived by: then for all . According to (12), every state can be reached from any other state with positive probability in a finite number of steps, analogously. Hence, is irreducible. By (12), k steps of conditional probability distribution are obtained with:Note that the first multiplier (V) is positive, which can be obtained by a similar method to (11). Denoting , then we have:
thus, the second part, (VI), is also positive. Therefore, , with probability one, i.e., , is aperiodic. Hence, is an ergodic Markov chain. □Note that and
where involves the first moments of and . Hence, the first two moments of are finite. Thus, is stationary and ergodic by Theorem 2, Theorem 3 and Shumway and Stoffer [23].If the conditions of Theorem 1 hold, the first two moments and covariance matrix of exist and
(1). ;
(2). , if exists, where denotes the identity matrix;
(3). , .
In addition, if , where , and . Specifically, if and are diagonal matrices,
(1) and (2) are easy to prove by the moment property of , and we omit them. Here, we only give the proof of (3):
In fact, and . Hence,
where Let , and be the largest eigenvalues of , and , respectively. If and are diagonal matrices, then is a nonsingular matrix. Hence, is obtained. □If the conditions of Theorem 1 hold, the first two moments and covariance matrices of exist and:
(1). ;
(2). if exists;
(3). , .
(1) is easy to prove by the distribution of . We only need to prove (2) and (3).
(2). by the definition of . can be obtained directly by (6).
(3). According to the construction of the EBINAR(1) model, we have:
(16)
(17)
then is achieved by substituting (16) into (17), i.e., . Note that i.e., the formula holds for . □For any fixed positive integer k, it is a necessary and sufficient condition that is , .
For convenience, let and be diagonal matrices.
Necessity. According to Lemma 2.1 of Silva and Oliveira [24], , where involves the moments of of order and . Then,
(18)
Thus, by (18). Hence, if , i.e., . Similarly, if . Hence, if , .
Sufficiency. We know that holds for by Theorems 4 and 5. The sufficient condition can be proved by induction with respect to k. Now suppose that , . According to (13), we define
where , , and are independent of each other and each of them is independent and identically distributed, i.e., , , and . Using the univariate binomial thinning operator, and admit the representations:(19)
(20)
where and . It is easy to see both and are non-decreasing. According to Lemma 2.1 of [24], we have: where and involve the moments of and of order , and and involve the moments of and of order , respectively. According to (19) and (20), we obtain:(21)
(22)
Using (21) and (22),
(23)
where . Note that the numerator in (23) involves the moments of and of order and is finite; thus, is finite if . In addition that both and are non-negative; thus, and are finite. □3. Parameter Estimation
In this section, we consider the conditional maximum likelihood estimation for model (6). Let , . Suppose that are generated by the EBINAR(1) model with the true parameter value .
By (11), the conditional log-likelihood function can be written as:
(24)
where with , , , , and are given in (2) and (8), respectively.By using (3)–(5), and (9) and (10), we can derive the score equation:
(25)
whereThe maximizer of (24) is the CML estimate of , which is obtained by numerically maximizing the log-likelihood (24) or by solving the score Equation (25). To study the asymptotic behaviour of the estimator, we make the following two Assumptions about the parameter space and the underlying process.
The parametric space Θ is compact with , where , , , , and are finite positive constants, and is an interior point in Θ.
If there exists a , such that , a.s., then where is the probability measure under the true parameter
To derive the identification of the EBINAR(1) model, we give the following two Lemmas.
Let , for Then, if , then .
By the assumption:
we obtain: . □Similarly, we denote . If , then we have by the same method.
If is the strictly stationary and ergodic solution of model (6), Assumptions 1 and 2 hold, then model (6) is identifiable.
According to Lemma 1, we conclude that if , then . Similarly, if , then . Thus, if , , then , , According to (7), we have , If , then we have , otherwise
then and , which contradicts the fact that , .By Assumption 2, for given and , we have
Thus, Hence, model (6) is identifiable. □
Suppose that is the strictly stationary and ergodic solution of model (6) and Assumptions 1 and 2 hold. As , there exists an estimator such that .
To prove the strong consistence of , we need to check all the assumptions given in Theorems 4.1.2 and 4.1.3 in Amemiya [25]. Let , then . We observe that is a measurable function of for all , and is continuous in an open and convex neighborhood of , then there at least exists a point such that attains the maximum value at .
Thus,
Note that is a stationary and ergodic time series, and in terms of Theorem 4.1.2 in Amemiya [25], in probability as . By Jensen’s inequality, we have:
(26)
Thus, attains a strict local maximum at by (26) and Lemma 2. Hence, the conditions of Theorem 4.1.2 in Amemiya [25] are fulfilled; thus, there exists an estimator such that , . □
If the conditions of Theorem 7 hold, as ,
where
To prove the asymptotic normality of , we need to verify the assumptions of Theorem 4.1.3 in Amemiya [25].
First, by Proposition 1 in Freeland and McCabe [26], it is easy to obtain all the partial derivatives in a similar way, i.e., exist and are three times continuous differentiable in ; thus, exists and is continuous in , for any . Thus, there at least exists a point such that attains the maximum value at . Hence,
For convenience, we denote: and . We only need to prove converges to a finite and non-stochastic function . Let , then . Hence, the conditions of Theorem 4.1.3 in [25] are fulfilled. Thus, converges to 0 in probability uniformly in . Furthermore, converges to 0 in probability, when , .
Second, it is easy to see because .
Using the ergodic theorem,
Using the martingale central limit theorem and the Cramér–Wold device, it is direct to show that
Third, there exists an such that and by Theorem 4. By the Taylor expansion, we have
(27)
where lies in between and . We observe that the in (27) by (25), then(28)
Hence, the asymptotic normality of follows from (28). □
4. Simulation
In this section, we conduct a simulation study to illustrate the finite sample property of the CML estimate. The simulation is carried out in by using the function for the optimization of the conditional log-likelihood function.
In the simulation, we generate data from the non-diagonal EBINAR(1) model and the diagonal EBINAR(1) model. The sizes of samples are chosen to be 50, 100, 200, 500 and 1000 to reflect relatively small, small, moderate, large and relatively large sample sizes, and we use 500 replications. For the simulated sample, performances of the estimators are evaluated by mean squared error (MSE) and mean absolute deviation error (MADE), where , , where is the estimator of in the ith replication and m denotes replication times. The used parameter combinations of are listed as follows:
(1). For a non-diagonal model: ;
(2). For a diagonal model: ,
Table 1, Table 2, Table 3, Table 4 and Table 5 show that the MSE and MADE decrease with the increase in T for diagonal and non-diagonal models, which implies that the estimators are consistent.
To illustrate the location and dispersion of the estimates, we present the boxplots of the estimates for the non-diagonal and I of diagonal parameter combinations in Figure 1 and Figure 2; the others are similar.
Figure 1 and Figure 2 illustrate the large sample properties of the estimators on a limited sample size. In general, the estimated medians are apparently closer to the real parameter values with the sample size increases. Regarding dispersion issues, both the interquartile ranges and the overall ranges of the produced values are narrower with the sample size increases.
5. Illustrative Examples
In this section, we apply the proposed model to two crime datasets coming from different number of car beats, which is the unique ID for the observation unit’s geography in Pittsburgh Police Department. The crime data is available online at “The Forecasting Principles” site (
According to Cohen and Gorr [27], the occurrence of criminal mischief may be accompanied by burglary behavior, so does for the robbery. Hence, the monthly counts of burglary and CMIS (or those of burglary and robbery) may exhibit dependence. In this section, we take the monthly counts of burglary and CMIS in beat 11 and the monthly counts of burglary and robbery in beat 26 as examples.
5.1. Monthly Counts of Burglary and CMIS in Beat 11
In this part, we consider the monthly number of burglary and criminal mischief (CMIS) from January 1990 to December 2001 in the geographic ID = 11. Table 6 gives the statistics of the counts of burglaries and CMIS.
Table 6 shows that both the counts of burglaries and CMIS are over-dispersed because their variances are greater than their means. In contrast, this relationship can also be illustrated by the cross-correlation graph of the samples, which are given in Figure 3.
From Figure 3, the counts of burglaries are weakly dependent with those of CMIS. Their plots of sample path, autocorrelation function (ACF) and partial autocorrelation function (PACF) are given in Figure 4, which show that the analyzed data sets are bivariate integer-valued time series with some characteristics of mutual influence.
To give quantitative results about cross-correlation, we compare our model with the following models:
Full BINAR-BP with following [16];
Full BINAR-NB with following bivariate negative binomial distribution with parameters ; see [14,16] for detail.
As the goodness-of-fit criteria, we use the Akaike information criterion (AIC), the Bayesian information criterion (BIC) and the mean square error of the Pearson residuals (PRMS), which is equal to , where denotes the number of estimated parameters and denotes standardized Pearson residuals.
The CML estimate and approximated standard error (SE) of the parameter, including the fitted values of PRMS, AIC, BIC and log-likelihood function (Log Lik), are summarized in Table 7, where the approximated standard error is computed by using the estimated version of the robust sandwich matrix ; see Theorem 8 for details.
Table 7 shows that the EBINAR(1) model takes the highest Log Lik value and the lowest AIC, BIC and PRMS for the monthly number of burglaries and CMIS. Hence, the EBINAR(1) model is more suitable for the data sets.
5.2. Monthly Counts of Burglaries and Robberies in Beat 26
In this part, we consider the monthly number of burglaries and robberies from January 1990 to December 2001 in the geographic ID = 26; see Table 8 for some of their statistics.
Table 8 shows the monthly number of burglary and robbery are over-dispersed. In contrast, this relationship can also be illustrated by their cross-correlation graph given in Figure 5, which shows that the counts of burglary are significantly dependent on those of robbery.
To further illustrate the the monthly number of burglaries and robberies in beat 26, we present their sample path, ACF and PACF plots in Figure 6, from which we can conclude that the analyzed data set exhibits some characteristics of mutual influence.
To give quantitative result about cross-correlation, we compare our model with the Full BINAR-BP and Full BINAR-NB models. The CML estimate and SE, including the fitted PRMS, AIC, BIC and Log Lik, are summarized in Table 9.
Table 9 shows that the EBINAR(1) model takes the highest Log Lik value and the lowest AIC, BIC and PRMS for burglaries and robberies in beat 26. Hence, EBINAR(1) model is more suitable.
To sum up, our findings reveal that there are some connections for the burglary and CMIS in beat 11 and those for the burglary and robbery in beat 26, which agrees with the conclusion of Cohen and Gorr [27]. Of course, the counts of burglary may be affected by other crime activities, such as simple assault, vagrancy and trespassing, which will be studied in a further study.
For the two real datasets, our EBINAR(1) model performs best, but it is not clear enough regarding predicting unseen data. To further illustrate the better performance of the new model in prediction, one available solution is to conduct a further experiment when dividing the considered data into a training set and test set. In addition such experiment will be considered in future study of the crime data.
6. Concluding Remarks
This paper proposes a more flexible model for bivariate integer-valued time series data, i.e., the EBINAR(1) model, whose innovation vector is time-dependent. It is a generalization of the EINAR(1) model [11] to the two-dimensional case as well as a generalization of the BINAR(1) model [14,16], but with more flexibility. We discuss some necessary properties of the model, the CML estimators of parameters and their large-sample properties. Simulation was conducted to examine the finite sample performance of estimators. Real data examples are provided to illustrate our model to be effective relative to existing models.
To make the bivariate INAR-type models more flexible with respect to real-data applications in some cases, it may be interesting to include explanatory covariates or periodicity in the model to account for dependence through thinning operations on several factors, which will be considered in another project: see Aknouche et al. [28] and Chen and Khamthong [29].
Conceptualization, H.C. and F.Z.; methodology, H.C. and F.Z.; software, H.C., F.Z. and X.L.; validation, H.C., F.Z. and X.L.; formal analysis, H.C.; investigation, H.C.; resources, F.Z.; data curation, X.L.; writing—original draft preparation, H.C. and F.Z.; writing—review and editing, H.C. and F.Z.; visualization, H.C. and F.Z.; supervision, F.Z. All authors have read and agreed to the published version of the manuscript.
Crime data are available online at The Forecasting Principles site and were downloaded on 23 September 2016 from
We are very grateful to anonymous referees for providing several exceptionally helpful comments which led to a significant improvement in the manuscript.
The authors declare no conflict of interests in the publication of this paper.
The following abbreviations are used in this manuscript:
inverse of matrix | |
transpose of the matrix or vector | |
Euclidean norm of a matrix or vector; | |
| absolute value of a univariate variable; |
convergence in distribution; | |
convergence in probability one; | |
pmf | probability mass function; |
CML | conditional maximun likelihood; |
AIC | Akaike information criterion; |
BIC | Bayesian information criterion; |
SE | standard error; |
PRMS | mean square error of the Pearson residual; |
Para. | parameter. |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 2. Boxplots of the CML estimates for diagonal EBINAR(1) model with parameter I.
Figure 3. Cross-correlation between the monthly number of burglaries and CMIS in beat 11.
Figure 4. Beat 11: (1) monthly number of burglary, (2) monthly number of CMIS, (3) ACF of burglary, (4) ACF of CMIS, (5) PACF of burglary, (6) PACF of CMIS.
Figure 5. Cross-correlation between the monthly number of burglaries and robberies in beat 26.
Figure 6. Beat 26: (1) monthly number of burglaries, (2) monthly number of robberies, (3) ACF of burglaries, (4) ACF of robberies, (5) PACF of burglaries, (6) PACF of robberies.
Results for non-diagonal EBINAR(1) model.
Size |
|
|
|
|
|
|
|
|
|
|
|
|
---|---|---|---|---|---|---|---|---|---|---|---|---|
50 | MSE | 0.0095 | 0.0122 | 0.0082 | 0.0177 | 0.0048 | 0.0187 | 0.0100 | 0.0380 | 0.0172 | 0.0246 | 0.0160 |
MADE | 0.0561 | 0.0537 | 0.0390 | 0.0460 | 0.0371 | 0.0487 | 0.0409 | 0.0750 | 0.0857 | 0.0991 | 0.0811 | |
100 | MSE | 0.0069 | 0.0043 | 0.0033 | 0.0127 | 0.0063 | 0.0070 | 0.0099 | 0.0090 | 0.0128 | 0.0210 | 0.0150 |
MADE | 0.0473 | 0.0269 | 0.0287 | 0.0417 | 0.0399 | 0.0355 | 0.0378 | 0.0578 | 0.0763 | 0.0913 | 0.0736 | |
200 | MSE | 0.0044 | 0.0019 | 0.0034 | 0.0108 | 0.0058 | 0.0063 | 0.0054 | 0.0063 | 0.0178 | 0.0185 | 0.0170 |
MADE | 0.0421 | 0.0281 | 0.0326 | 0.0291 | 0.0426 | 0.0363 | 0.0372 | 0.0532 | 0.0901 | 0.0876 | 0.0824 | |
500 | MSE | 0.0033 | 0.0008 | 0.0011 | 0.0105 | 0.0044 | 0.0027 | 0.0008 | 0.0061 | 0.0029 | 0.0044 | 0.0063 |
MADE | 0.0317 | 0.0161 | 0.0213 | 0.0469 | 0.0379 | 0.0293 | 0.0225 | 0.0556 | 0.0446 | 0.0529 | 0.0465 | |
1000 | MSE | 0.0002 | 0.0001 | 0.0005 | 0.0041 | 0.0014 | 0.0007 | 0.0002 | 0.0006 | 0.0006 | 0.0015 | 0.0048 |
MADE | 0.0116 | 0.0079 | 0.0167 | 0.0413 | 0.0280 | 0.0225 | 0.0114 | 0.0190 | 0.0199 | 0.0345 | 0.0379 |
Results for diagonal EBINAR(1) model with parameter I.
Size |
|
|
|
|
|
|
|
|
---|---|---|---|---|---|---|---|---|
50 | MSE | 0.0031 | 0.0146 | 0.0205 | 0.0030 | 1.2188 | 0.6134 | 0.3256 |
MADE | 0.0406 | 0.1021 | 0.1250 | 0.0398 | 0.7710 | 0.5880 | 0.5059 | |
100 | MSE | 0.0020 | 0.0062 | 0.0134 | 0.0023 | 0.7887 | 0.4527 | 0.2778 |
MADE | 0.0323 | 0.0665 | 0.0976 | 0.0330 | 0.6125 | 0.4978 | 0.4843 | |
200 | MSE | 0.0015 | 0.0045 | 0.0088 | 0.0010 | 0.4832 | 0.3250 | 0.2572 |
MADE | 0.0300 | 0.0524 | 0.0775 | 0.0217 | 0.5016 | 0.3995 | 0.4742 | |
500 | MSE | 0.0007 | 0.0031 | 0.0043 | 0.0010 | 0.2240 | 0.1528 | 0.2198 |
MADE | 0.0202 | 0.0396 | 0.0495 | 0.0227 | 0.3655 | 0.2670 | 0.4312 | |
1000 | MSE | 0.0005 | 0.0020 | 0.0022 | 0.0004 | 0.1965 | 0.1147 | 0.1789 |
MADE | 0.0156 | 0.0352 | 0.0377 | 0.0142 | 0.3100 | 0.2084 | 0.3954 |
Results for diagonal EBINAR(1) model with parameter II.
Size |
|
|
|
|
|
|
|
|
---|---|---|---|---|---|---|---|---|
50 | MSE | 0.0154 | 0.0183 | 0.0122 | 0.0191 | 0.7510 | 1.1007 | 0.3183 |
MADE | 0.0871 | 0.0988 | 0.0903 | 0.0949 | 0.6894 | 0.8269 | 0.5008 | |
100 | MSE | 0.0059 | 0.0089 | 0.0059 | 0.0072 | 0.4333 | 0.6728 | 0.2336 |
MADE | 0.0470 | 0.0742 | 0.0599 | 0.0582 | 0.4957 | 0.5889 | 0.4442 | |
200 | MSE | 0.0042 | 0.0044 | 0.0041 | 0.0053 | 0.2876 | 0.4939 | 0.1983 |
MADE | 0.0411 | 0.0499 | 0.0475 | 0.0470 | 0.3796 | 0.4866 | 0.4193 | |
500 | MSE | 0.0027 | 0.0035 | 0.0036 | 0.0025 | 0.1038 | 0.3240 | 0.1899 |
MADE | 0.0344 | 0.0400 | 0.0414 | 0.0326 | 0.2636 | 0.4216 | 0.4107 | |
1000 | MSE | 0.0013 | 0.0022 | 0.0017 | 0.0011 | 0.0730 | 0.0855 | 0.1352 |
MADE | 0.0238 | 0.0303 | 0.0307 | 0.0204 | 0.1978 | 0.2221 | 0.3512 |
Results for diagonal EBINAR(1) model with parameter III.
Size |
|
|
|
|
|
|
|
|
---|---|---|---|---|---|---|---|---|
50 | MSE | 0.0027 | 0.0048 | 0.0473 | 0.0083 | 0.0078 | 0.0083 | 0.0013 |
MADE | 0.0258 | 0.0428 | 0.1533 | 0.0546 | 0.0620 | 0.0586 | 0.0316 | |
100 | MSE | 0.0036 | 0.0064 | 0.0429 | 0.0102 | 0.0089 | 0.0087 | 0.0017 |
MADE | 0.0359 | 0.0485 | 0.1486 | 0.0640 | 0.0680 | 0.0638 | 0.0330 | |
200 | MSE | 0.0060 | 0.0059 | 0.0380 | 0.0059 | 0.0054 | 0.0047 | 0.0017 |
MADE | 0.0341 | 0.0469 | 0.1239 | 0.0469 | 0.0541 | 0.0507 | 0.0321 | |
500 | MSE | 0.0018 | 0.0042 | 0.0082 | 0.0031 | 0.0046 | 0.0039 | 0.0016 |
MADE | 0.0312 | 0.0429 | 0.0638 | 0.0380 | 0.0477 | 0.0426 | 0.0287 | |
1000 | MSE | 0.0011 | 0.0030 | 0.0037 | 0.0020 | 0.0020 | 0.0016 | 0.0005 |
MADE | 0.0253 | 0.0395 | 0.0380 | 0.0331 | 0.0384 | 0.0341 | 0.0182 |
Results for diagonal EBINAR(1) model with parameter IV.
Size |
|
|
|
|
|
|
|
|
---|---|---|---|---|---|---|---|---|
50 | MSE | 0.0031 | 0.0146 | 0.0205 | 0.0030 | 1.2188 | 0.6134 | 0.3256 |
MADE | 0.0406 | 0.1021 | 0.1250 | 0.0398 | 0.7710 | 0.5880 | 0.5059 | |
100 | MSE | 0.0020 | 0.0062 | 0.0134 | 0.0023 | 0.7887 | 0.4527 | 0.2778 |
MADE | 0.0323 | 0.0665 | 0.0976 | 0.0330 | 0.6125 | 0.4978 | 0.4843 | |
200 | MSE | 0.0015 | 0.0045 | 0.0088 | 0.0010 | 0.4832 | 0.3250 | 0.2572 |
MADE | 0.0300 | 0.0524 | 0.0775 | 0.0217 | 0.5016 | 0.3995 | 0.4742 | |
500 | MSE | 0.0007 | 0.0031 | 0.0043 | 0.0010 | 0.2240 | 0.1528 | 0.2198 |
MADE | 0.0202 | 0.0396 | 0.0495 | 0.0227 | 0.3655 | 0.2670 | 0.4312 | |
1000 | MSE | 0.0005 | 0.0020 | 0.0022 | 0.0004 | 0.1965 | 0.1147 | 0.1789 |
MADE | 0.0156 | 0.0352 | 0.0377 | 0.0142 | 0.3100 | 0.2084 | 0.3954 |
Summary statistics for the monthly number of burglaries and CMIS in beat 11.
Data | Mean | Variance | Minimum | Median | Maximum |
---|---|---|---|---|---|
Burglary | 2.8819 | 4.1188 | 0 | 3 | 10 |
CMIS | 6.3819 | 10.0839 | 1 | 6 | 22 |
Estimates for the monthly numbers of burglaries and those of CMIS in beat 11.
EBINAR(1) | Full BINAR(1)-NB | Full BINAR(1)-BP | ||||||
---|---|---|---|---|---|---|---|---|
Para. | Estimate | SE | Para. | Estimate | SE | Para. | Estimate | SE |
|
0.1689 | 0.1559 |
|
0.2784 | 0.0665 |
|
0.2993 | 0.0838 |
|
0.0179 | 0.0411 |
|
0.0217 | 0.0092 |
|
0.0217 | 0.0215 |
|
0.0390 | 0.1447 |
|
0.1060 | 0.0550 |
|
0.1060 | 0.0719 |
|
0.1131 | 0.1236 |
|
0.5010 | 0.0295 |
|
0.1934 | 0.0551 |
|
0.0690 | 0.1460 | ||||||
|
0.0093 | 0.0559 | ||||||
|
0.1014 | 0.1809 | ||||||
|
0.1354 | 0.1414 | ||||||
|
1.0007 | 0.4372 |
|
1.9814 | 0.0186 |
|
1.5164 | 0.2561 |
|
3.3190 | 0.5478 |
|
2.3137 | 0.0166 |
|
4.5258 | 0.4493 |
|
0.5273 | 0.2628 |
|
0.1374 | 0.9759 |
|
0.4044 | 0.2274 |
PRMS | 0.0064 | 0.0245 | 0.0103 | |||||
AIC | 1315.4620 | 1387.8913 | 1350.9488 | |||||
BIC | 1348.1300 | 1408.6800 | 1371.7375 | |||||
Log Lik | −646.7310 | −686.9457 | −668.4744 |
Summary statistics for the monthly number of burglaries and robberies in beat 26.
Data | Mean | Variance | Minimum | Median | Maximum |
---|---|---|---|---|---|
Burglary | 3.9306 | 9.7434 | 0 | 3 | 15 |
Robbery | 3.0625 | 9.6394 | 0 | 2 | 17 |
Estimates for the monthly number of burglaries and robberies in beat 26.
EBINAR(1) | Full BINAR(1)-NB | Full BINAR(1)-BP | ||||||
---|---|---|---|---|---|---|---|---|
Para. | Estimate | SE | Para. | Estimate | SE | Para. | Estimate | SE |
|
0.3117 | 0.0654 |
|
0.2314 | 0.3042 |
|
0.2765 | 0.0537 |
|
0.2086 | 0.0611 |
|
0.3172 | 0.2442 |
|
0.0927 | 0.0471 |
|
0.0900 | 0.0511 |
|
0.1099 | 0.2834 |
|
0.0001 | 0.0000 |
|
0.1906 | 0.1163 |
|
0.4361 | 0.2244 |
|
0.4249 | 0.0415 |
|
0.0671 | 0.0706 | ||||||
|
0.2280 | 0.0653 | ||||||
|
0.1233 | 0.0511 | ||||||
|
0.3358 | 0.1161 | ||||||
|
0.2043 | 0.2048 |
|
2.2310 | 0.0026 |
|
1.7652 | 0.2048 |
|
0.4139 | 0.1139 |
|
1.1708 | 0.0076 |
|
0.9604 | 0.1601 |
|
0.5599 | 0.1187 |
|
0.4073 | 0.7189 |
|
0.7778 | 0.1494 |
PRMS | 0.0087 | 0.0748 | 0.0992 | |||||
AIC | 1320.8092 | 1344.6968 | 1357.7718 | |||||
BIC | 1353.4771 | 1365.4855 | 1378.5604 | |||||
Log Lik | −649.4046 | −665.3484 | −671.8859 |
References
1. Steutel, F.W.; van Harn, K. Discrete analogues of self-decomposability and stability. Ann. Probab.; 1979; 7, pp. 893-899. [DOI: https://dx.doi.org/10.1214/aop/1176994950]
2. Al-Osh, M.A.; Alzaid, A.A. First-order integer-valued autoregressive process. J. Time Ser. Anal.; 1987; 8, pp. 261-275. [DOI: https://dx.doi.org/10.1111/j.1467-9892.1987.tb00438.x]
3. McKenzie, E. Some simple models for discrete variate time series. Water Resoure Bull.; 1985; 21, pp. 645-650. [DOI: https://dx.doi.org/10.1111/j.1752-1688.1985.tb05379.x]
4. Du, J.; Li, Y. The integer valued autoregressive INAR(p) model. J. Time Ser. Anal.; 1991; 12, pp. 129-142.
5. Alzaid, A.A.; Omair, M.A. Poisson difference integer valued autoressive model of order one. Bull. Malays. Math. Sci. Soc.; 2014; 37, pp. 465-485.
6. Chen, H.; Li, Q.; Zhu, F. Binomial AR(1) processes with innovational outliers. Commun. Stat. Theory Methods; 2021; 50, pp. 446-472. [DOI: https://dx.doi.org/10.1080/03610926.2019.1635704]
7. Weiß, C.H. Thinning operations for modeling time series of counts—A survey. Adv. Stat. Anal.; 2008; 92, pp. 319-341. [DOI: https://dx.doi.org/10.1007/s10182-008-0072-3]
8. Scotto, M.G.; Wei, C.H.; Gouveia, S. Thinning-based models in the analysis of integer-valued time series: A review. Stat. Model.; 2015; 15, pp. 590-618. [DOI: https://dx.doi.org/10.1177/1471082X15584701]
9. Davis, R.A.; Fokianos, K.; Holan, S.H.; Joe, H.; Livsey, J.; Lund, R.; Pipiras, V.; Ravishanker, N. Count time series: A methodological review. J. Am. Stat. Assoc.; 2021; 116, pp. 1533-1547. [DOI: https://dx.doi.org/10.1080/01621459.2021.1904957]
10. Buckley, F.M.; Pollett, P.K. Limit theorems for discrete-time metapopulation models. Probab. Surv.; 2010; 7, pp. 53-83. [DOI: https://dx.doi.org/10.1214/10-PS158]
11. Weiß, C.H. A Poisson INAR(1) model with serially dependent innovations. Metrika; 2015; 78, pp. 829-851. [DOI: https://dx.doi.org/10.1007/s00184-015-0529-9]
12. Franke, J.; Rao, T.S. Multivariate First-Order Integer Valued Autoregressions; Technical Report Department of Mathematics, UMIST: Manchester, UK, 1995.
13. Latour, A. The multivariate GINAR(p) process. Adv. Appl. Probab.; 1997; 29, pp. 228-248. [DOI: https://dx.doi.org/10.2307/1427868]
14. Pedeli, X.; Karlis, D. A bivariate INAR(1) processes with application. Stat. Model.; 2011; 11, pp. 325-349. [DOI: https://dx.doi.org/10.1177/1471082X1001100403]
15. Pedeli, X.; Karlis, D. On estimation of the bivariate Poisson INAR process. Commun. Stat. Simul. Comput.; 2013; 42, pp. 514-533. [DOI: https://dx.doi.org/10.1080/03610918.2011.639001]
16. Pedeli, X.; Karlis, D. Some properties of multivariate INAR(1) processes. Comput. Stat. Data Anal.; 2013; 67, pp. 213-225. [DOI: https://dx.doi.org/10.1016/j.csda.2013.05.019]
17. Ravishanker, N.; Serhiyenko, V.; Willig, M.R. Hierarchical dynamic models for multivariate times series of counts. Stat. Its Interface; 2014; 7, pp. 559-570. [DOI: https://dx.doi.org/10.4310/SII.2014.v7.n4.a11]
18. Popović, P.M. A bivariate INAR(1) model with different thinning parameters. Stat. Pap.; 2016; 57, pp. 517-538. [DOI: https://dx.doi.org/10.1007/s00362-015-0667-1]
19. Scotto, M.G.; Wei, C.H.; Silva, M.E.; Pereira, I. Bivariate binomial autoregressive models. J. Multivar. Anal.; 2014; 125, pp. 233-251. [DOI: https://dx.doi.org/10.1016/j.jmva.2013.12.014]
20. Li, Q.; Chen, H.; Liu, X. A new bivariate random coefficient INAR(1) model with applications. Symmetry; 2022; 14, 39. [DOI: https://dx.doi.org/10.3390/sym14010039]
21. Kocherlakota, S.; Kocherlakota, K. Bivariate Discrete Distributions; Marcel Dekker: New York, NY, USA, 1992; pp. 87-97.
22. Heathcote, C.R. Corrections and comments on the paper “A branching process allowing immigration”. J. R. Stat. Soc. Ser. B; 1966; 28, pp. 213-217. [DOI: https://dx.doi.org/10.1111/j.2517-6161.1966.tb00634.x]
23. Shumway, R.H.; Stoffer, D.S. Time Series Analysis and Its Applications with R examples; 3rd ed. Springer: New York, NY, USA, 2011.
24. Silva, M.E.; Oliveira, V.L. Difference equations for the higher-order moments and cumulants of the INAR(1) model. J. Time Ser. Anal.; 2004; 25, pp. 317-333. [DOI: https://dx.doi.org/10.1111/j.1467-9892.2004.01685.x]
25. Amemiya, T. Advanced Econometrics; Harvard University Press: Cambridge, MA, USA, 1985; pp. 110-112.
26. Freeland, R.K.; McCabe, B.P.M. Analysis of low count time series data by poisson autoregression. J. Time Ser. Anal.; 2004; 25, pp. 701-722. [DOI: https://dx.doi.org/10.1111/j.1467-9892.2004.01885.x]
27. Cohen, J.; Gorr, W.L. Development of Crime Forecasting and Mapping Systems for Use by Police; Inter-University Consortium for Political and Social Research: New York, NY, USA, 2005; [DOI: https://dx.doi.org/10.3886/ICPSR04545.v1]
28. Aknouche, A.; Bentarzi, W.; Demouche, N. On periodic ergodicity of a general periodic mixed Poisson autoregression. Stat. Probab. Lett.; 2018; 134, pp. 15-21. [DOI: https://dx.doi.org/10.1016/j.spl.2017.10.014]
29. Chen, C.W.S.; Khamthong, K. Bayesian modelling of nonlinear negative binomial integer-valued GARCHX models. Stat. Model.; 2020; 20, pp. 537-561. [DOI: https://dx.doi.org/10.1177/1471082X19845541]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Recently, there has been a growing interest in integer-valued time series models, especially in multivariate models. Motivated by the diversity of the infinite-patch metapopulation models, we propose an extension to the popular bivariate INAR(1) model, whose innovation vector is assumed to be time-dependent in the sense that the mean of the innovation vector is linearly increased by the previous population size. We discuss the stationarity and ergodicity of the observed process and its subprocesses. We consider the conditional maximum likelihood estimate of the parameters of interest, and establish their large-sample properties. The finite sample performance of the estimator is assessed via simulations. Applications on crime data illustrate the model.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer