Copula-Based Estimation Methods for a Common Mean

Full text

Turn on search term navigation

1. Introduction

Bivariate outcomes often arise in meta-analyses on scientific studies, such as education and medicine. Educational researchers may analyze bivariate exam scores on verbal and mathematics [1,2], or on mathematics and statistics [3]. Medical experts may analyze bivariate risk scores on myocardial infection and cardiovascular death for diabetes patients [4,5]. Bivariate meta-analyses are statistical methods designed for these meta-analytical studies [6]. Dependence between two outcomes should be considered while performing bivariate meta-analyses. If one simply considers univariate (marginal) analysis for each outcome separately, any possible dependence between the outcomes is ignored. Riley [2] and Copas et al. [7] showed that ignoring the dependence between two outcomes increases the error for estimating parameters due to the loss of information. In medical research, dependence itself can be of clinical importance, e.g., dependence between two survival outcomes in meta-analysis [8,9,10,11].

In the traditional bivariate meta-analyses, the parameters of interest are the means of a bivariate normal model [6]. However, the bivariate normal model is not flexible enough to describe the true dependence structure of real meta-analyses. It will be shown that the bivariate normal mode fits poorly to the dependence structure of real bivariate meta-analyses (Section 8). This has motivated researchers to consider alternative models.

As an alternative to the bivariate normal model, recent papers have adopted “copula” models for bivariate meta-analyses [3,5,12,13,14,15]. Copula models are flexible as they allow a variety of dependence structures. Copulas consist of both symmetric copulas (e.g., the normal copula) and asymmetric copulas (e.g., the Clayton copula). Copula models have become very popular in all areas of science by replacing the traditional multivariate normal models. In astronomy, Takeuchi [16] constructed the bivariate luminosity density functions using the FGM copula; see reference [17] for the application of the FGM copula to engineering. In ecology, Ghosh et al. [18] applied copulas to model the dependence structure in environmental and biological variables. In environmental science, Alidoost et al. [19] used bivariate copulas in the analysis of temperature. See the survey of [20] for applications to energy, forestry, and environmental sciences. The books of [21,22] are devoted to the applications of copulas in survival analysis; see also references [11,23,24,25].

While bivariate copula models for meta-analyses are promising, there are only a few methodologically and theoretically solid studies on copula-based bivariate meta-analysis. For instance, the detailed theoretical studies of [3] are limited to the FGM copula. Other copula-based meta-analyses published in biostatistical journals, such as [5,12,13,14,15], are proposed without theoretical details. Furthermore, copula-based bivariate meta-analyses have not been implemented in a free software environment.

Therefore, the goal of this article is to fully develop the methodologies and theories of the copula-based bivariate meta-analysis for estimating the common mean vector. This work is regarded as a large generalization of our previous methodological/theoretical studies under the FGM copula model [3] to a broad class of copula models. In this article, we obtain theoretical results, including the formula of the information matrix and large sample theories. Our theoretical results guarantee the applications of many copulas, such as the Clayton, Gumbel, Frank, and normal copulas, in addition to the FGM copula. In addition, we developed a new R package, “CommonMean.Copula” [26], to implement the proposed methods under the five copulas. Therefore, the aim of the article is to make a solid development of the methodologies, theories, and practical implementations of copula-based bivariate meta-analysis for the common mean, which are not yet available in the literature.

The article is organized as follows. Section 2 reviews the background of this research. Section 3 introduces the proposed model and estimator. Section 4 provides the asymptotic theory and Section 5 gives confidence sets. Section 6 introduces our new R package. Section 7 conducts simulations to check the accuracy of the proposed methods. Section 8 analyzes two real datasets for illustration. Section 9 extends the proposed methods to non-normal data. Finally, Section 10 concludes with a discussion.

2. Background

This section reviews the literature on bivariate meta-analyses and the concept of copulas.

2.1. Bivariate Meta-Analysis

We review the bivariate meta-analysis method for bivariate continuous outcomes [6,27]. For each study $i$ , let the bivariate outcomes, $Y_{i 1}$ and $Y_{i 2}$ , follow a bivariate normal distribution

(1) $Y_{i} = [\begin{matrix} Y_{i 1} \\ Y_{i 2} \end{matrix}] ~ N (μ = [\begin{matrix} μ_{1} \\ μ_{2} \end{matrix}], Ω_{i} = [\begin{matrix} σ_{i 1}^{2} & ρ_{i} σ_{i 1} σ_{i 2} \\ ρ_{i} σ_{i 1} σ_{i 2} & σ_{i 2}^{2} \end{matrix}]), i = 1, 2, \dots, n,$

where

ρ_{i} \in (- 1, 1)

is the within-study correlation for each

i

. In Equation (1), all the responses (

Y_{i}

s) share the common mean vector (

μ

). The covariance matrix

Ω_{i}

is assumed to be known (from the

i

-th study) in usual bivariate meta-analyses. We do not consider a setting where the covariance is unknown [28,29].

Then, the MLE of the common mean vector is quite easily computed as

${\hat{μ}}_{n}^{Normal} = [\begin{matrix} {\hat{μ}}_{n, 1}^{Normal} \\ {\hat{μ}}_{n, 2}^{Normal} \end{matrix}] = {(\sum_{i = 1}^{n} Ω_{i}^{- 1})}^{- 1} \sum_{i = 1}^{n} Ω_{i}^{- 1} Y_{i} .$

One could use the R package mvmeta [30], although the above computation is easy.

The bivariate normal model (1) does not allow for a different dependence structure between the two outcomes. In practice, the bivariate normal model (1) can be too restrictive, as there are various dependence patterns between two outcomes. For example, to model the luminosity function of galaxies, Takeuchi [16] pointed out that the FGM copula model offers a more ideal shape than the normal copula model from a physical point of view. Such a limitation motivates us to construct a general copula model that can describe various dependence structures.

2.2. Copulas

This subsection prepares the basic terms on copulas that will subsequently be used.

A copula is a bivariate distribution function whose margins are uniformly distributed on the unit interval [31,32]. Copulas are indispensable tools when modelling a dependence structure between two random variables. We specifically consider the following parametric copulas.

The normal copula: The copula function is

$C_{ρ}^{Normal} (u, v) = Φ_{ρ} \{Φ^{- 1} (u), Φ^{- 1} (v)\}, - 1 < ρ < 1, 0 < u, v < 1,$

where

Φ_{ρ} (\cdot, \cdot)

is the cumulative distribution function (CDF) of the bivariate standard normal distribution with correlation

ρ

and

Φ^{- 1}

is the inverse of the standard normal CDF

Φ

. While this copula is easy to understand, it has a complex form involving two implicit functions

Φ_{ρ}

and

Φ^{- 1}

. The following two copulas provide simpler forms than the normal copula.

The Farlie–Gumbel–Morgenstern (FGM) copula [33]: The copula function is

$C_{θ}^{FGM} (u, v) = u v \{1 + θ (1 - u) (1 - v)\}, - 1 \leq θ \leq 1, 0 < u, v < 1 .$

The FGM copula has a very simple form, and is a fundamental copula, which has been extended to a variety of copulas, called the generalized FGM copulas [34,35,36,37,38].

The Clayton copula [39]: The copula function is

$C_{α}^{Clayton} (u, v) = {(u^{- α} + v^{- α} - 1)}^{- 1 / α}, α > 0, 0 < u, v < 1 .$

The Clayton copula is one of the simplest and most frequently used copulas in applications. The Clayton copula is derived from the gamma frailty model, leading to its remarkable popularity in survival data analysis [22,40]. It has a lower tail dependence [31], but is not tractable for modeling negative dependence.

The Gumbel copula [41]: The copula function is

$C_{β}^{Gumbel} (u, v) = \exp [- {{(- \log u)}^{β} + {(- \log v)}^{β}}^{1 / β}], β \geq 1, 0 < u, v < 1 .$

The Gumbel copula is a popular copula with upper tail dependence [31]. The Gumbel copula does not offer a negative dependence, as in the Clayton copula.

The Frank copula [42]: The copula function is

$C_{γ}^{Frank} (u, v) = - \frac{1}{γ} \log \{1 + \frac{(e^{- γ u} - 1) (e^{- γ v} - 1)}{e^{- γ} - 1}\}, γ \neq 0, 0 < u, v < 1 .$

The Frank copula does not have tail dependence [31]. Unlike the Clayton and Gumbel copulas, it can model both positive and negative dependences as the normal copula.

Under the null parameter (e.g., $θ = 0$ ), all the above copulas reduce to the independence copula $Π (u, v) = u v$ . As the parameter departs from the null, the dependence gets stronger.

We define the notations for partial derivatives (if they exist) as

$C^{[j, k]} (u, v) = \frac{\partial^{j + k}}{\partial u^{j} \partial v^{k}} C (u, v); j, k \in \{0, 1, 2, \dots\} .$

For instance,

$C^{[1, 0]} (u, v) = \frac{\partial}{\partial u} C (u, v), C^{[0, 1]} (u, v) = \frac{\partial}{\partial v} C (u, v), C^{[1, 1]} (u, v) = \frac{\partial^{2}}{\partial u \partial v} C (u, v),$

where

C^{[1, 1]}

is called the copula density.

The copula is symmetric if $C^{[1, 1]} (u, v) = C^{[1, 1]} (1 - u, 1 - v)$ . This means that the normal and FGM copulas are symmetric while the Clayton and Gumbel copulas are asymmetric. This symmetry should not be confused with the exchangeability $C (u, v) = C (v, u)$ . All the aforementioned parametric copulas are exchangeable.

3. Proposed Methods

This section proposes a general copula-based approach for estimating a bivariate common mean vector. We first define the bivariate copula model and provide sufficient conditions for the copula parameter to be identifiable. We then develop a maximum likelihood estimator (MLE) for the common mean vector. In addition, we derive the expression for the information matrix.

3.1. General Copula Model for the Common Mean

This subsection proposes a new model for estimating the common mean in bivariate meta-analyses.

For $i = 1, 2, \dots, n$ , let $Y_{i} = (Y_{i 1}, Y_{i 2})$ be a random vector satisfying

$Y_{i 1} ~ N (μ_{1}, σ_{i 1}^{2}), Y_{i 2} ~ N (μ_{2}, σ_{i 2}^{2}), μ \equiv E (Y_{i}) = [\begin{matrix} E (Y_{i 1}) \\ E (Y_{i 2}) \end{matrix}] = [\begin{matrix} μ_{1} \\ μ_{2} \end{matrix}],$

$Ω_{i} \equiv C o v (Y_{i}) = [\begin{matrix} V a r (Y_{i 1}) & C o v (Y_{i 1}, Y_{i 2}) \\ C o v (Y_{i 1}, Y_{i 2}) & V a r (Y_{i 2}) \end{matrix}] = [\begin{matrix} σ_{i 1}^{2} & ρ_{i} σ_{i 1} σ_{i 2} \\ ρ_{i} σ_{i 1} σ_{i 2} & σ_{i 2}^{2} \end{matrix}] .$

Here, we call $μ = (μ_{1}, μ_{2})$ the ‘common mean vector’ since it is common across $i = 1, 2, \dots, n$ . Our target is the estimation of $μ$ when $Ω_{i}$ , $i = 1, 2, \dots, n$ are known. In general, $Ω_{i} \neq Ω_{j}$ for some $i \neq j$ , and, therefore, the random vectors $Y_{i}$ , $i = 1, 2, \dots, n$ are independent but not identically distributed (i.n.i.d.). While the marginal normality is specified, the bivariate normality is unspecified. We only specify the equation $Corr (Y_{i 1}, Y_{i 2}) = ρ_{i}$ , where $ρ_{i}$ is known.

We now specify a bivariate distribution for $Y_{i}$ . According to Sklar’s Theorem [43], for copulas $C_{θ_{i}},$ $i = 1, 2, \dots, n,$ we define the bivariate CDFs

$P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = C_{θ_{i}} \{Φ (\frac{y_{1} - μ_{1}}{σ_{i 1}}), Φ (\frac{y_{2} - μ_{2}}{σ_{i 2}})\}, i = 1, 2, \dots, n .$

However, since $ρ_{i}$ is known, the copula can be restricted. To see the problem clearly, we define the correlation function $ρ_{C} : Θ \mapsto R_{C}$ as

$ρ_{C} (θ) = E \{(\frac{Y_{i 1} - μ_{1}}{σ_{i 1}}) (\frac{Y_{i 2} - μ_{2}}{σ_{i 2}})\} = \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} z_{1} z_{2} d C_{θ} \{Φ (z_{1}), Φ (z_{2})\},$

where

R_{C} \equiv \{ρ_{C} (θ) : θ \in Θ\}

denotes the range of

ρ_{C}

that depends on the choice of

C_{θ}

. The correlation function

ρ_{C}

does not depend on

μ

. For the copula to be useful in real meta-analyses,

θ_{i}

has to be identifiable from

ρ_{i}

. This means that one has to be able to solve the equation

ρ_{C} (θ) = ρ

. Now, we define our general copula model for a bivariate common mean vector.

Definition 1.

(Copula-based common mean model): The copula-based common mean model is

(2) $P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = C_{θ_{i}} \{Φ (\frac{y_{1} - μ_{1}}{σ_{i 1}}), Φ (\frac{y_{2} - μ_{2}}{σ_{i 2}})\}, i = 1, 2, \dots, n,$

where the copula parameter

θ_{i}

is identified by

ρ_{C} (θ_{i}) = ρ_{i}

for

i = 1, 2, \dots, n

To explain the flexibility and generality of our model, we give examples for $C_{θ_{i}}$ .

Example 1.

(the normal copula): Under the normal copula, the model in Equation (2) becomes

$P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = C_{ρ_{i}}^{Normal} \{Φ (\frac{y_{1} - μ_{1}}{σ_{i 1}}), Φ (\frac{y_{2} - μ_{2}}{σ_{i 2}})\} = Φ_{ρ_{i}} (\frac{y_{1} - μ_{1}}{σ_{i 1}}, \frac{y_{2} - μ_{2}}{σ_{i 2}}) .$

Under this model, the correlation function is the identity function

ρ_{C^{Normal}} (ρ) = ρ

. In addition, one has the copula parameter space

Θ_{C^{Normal}} = (- 1, 1)

, and the range of correlations

R_{C^{Normal}} = (- 1, 1)

. Without doubt, for any

ρ_{i} \in (- 1, 1)

, the copula parameter can be identified.

Example 2.

(the FGM copula): Under the FGM copula, the model in Equation (2) becomes

$P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = Φ (\frac{y_{1} - μ_{1}}{σ_{i 1}}) Φ (\frac{y_{2} - μ_{2}}{σ_{i 2}}) [1 + θ_{i} \{1 - Φ (\frac{y_{1} - μ_{1}}{σ_{i 1}})\} \{1 - Φ (\frac{y_{2} - μ_{2}}{σ_{i 2}})\}] .$

Under this model, the correlation function is

ρ_{C^{FGM}} (θ) = θ / π

for

- 1 \leq θ \leq 1

[44]. Thus, the copula parameter is identified by

θ_{i} = π ρ_{i}

, as long as

ρ_{i} \in [- 1 / π, 1 / π]\approx[- 0.32, 0.32]

. If

ρ_{i} \notin [- 1 / π, 1 / π]

, we suggest

θ_{i} = - 1

θ_{i} = 1

, using

θ_{i} \equiv π ρ_{i}^{*}

, where

ρ_{i}^{*} = m i n [- 1 / π, m a x {ρ_{i}, 1 / π}]

. Hence,

θ_{i}

can still be identified by

ρ_{i}

. This boundary enforcement is illustrated in Figure 1.

Example 3.

(The Clayton copula): Under the Clayton copula, the model in Equation (2) becomes

$P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = {\{Φ {(\frac{y_{1} - μ_{1}}{σ_{i 1}})}^{- α_{i}} + Φ {(\frac{y_{2} - μ_{2}}{σ_{i 2}})}^{- α_{i}} - 1\}}^{- 1 / α_{i}} .$

for

α_{i} > 0

. The correlation function does not have a closed-form, and is written as

$ρ_{C^{Clayton}} (α) = (α + 1) \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \frac{z_{1} z_{2} φ (z_{1}) φ (z_{2})}{Φ {(z_{1})}^{α + 1} Φ {(z_{2})}^{α + 1} {Φ {(z_{1})}^{- α} + Φ {(z_{2})}^{- α} - 1}^{1 / α + 2}} d z_{1} d z_{2} .$

It is known that

{l i m}_{α \to 0} ρ_{C^{Clayton}} (α) = 0

and

{l i m}_{α \to \infty} ρ_{C^{Clayton}} (α) = 1

. In addition, if

α_{2} \geq α_{1}

then

C_{α_{2}}^{Clayton} (u, v) \geq C_{α_{1}}^{Clayton} (u, v)

for all

u, v \in (0, 1)

[45]. Then, we conclude that the range of the correlation is

R_{C^{Clayton}} = (0, 1)

. Thus, one can identify by solving

ρ_{C^{Clayton}} (α_{i}) = ρ_{i}

numerically if

ρ_{i} > 0

. If

ρ_{i} \leq 0

, we suggest the independence model (Figure 1)

$P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = Φ (\frac{y_{1} - μ_{1}}{σ_{i 1}}) Φ (\frac{y_{2} - μ_{2}}{σ_{i 2}}) .$

Example 4.

(The Gumbel copula): Under the Gumbel copula, the model in Equation (2) becomes

$P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = \exp (- {[{\{- \log Φ (\frac{y_{1} - μ_{1}}{σ_{i 1}})\}}^{β_{i}} + {\{- \log Φ (\frac{y_{2} - μ_{2}}{σ_{i 2}})\}}^{β_{i}}]}^{1 / β_{i}})$

for

β_{i} \geq 1

. Similar to the Clayton copula, the correlation function does not have a closed-form, and is not displayed here. It is known that

ρ_{C^{Gumbel}} (1) = 0

and

{l i m}_{β \to \infty} ρ_{C^{Gumbel}} (β) = 1

. If

ρ_{i} < 0

, we suggest the independence model as in the Clayton copula.

Example 5.

(The Frank copula): Under the Frank copula, the model in Equation (2) becomes

$P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = - \frac{1}{γ_{i}} \log (1 + \frac{[\exp \{- γ_{i} Φ (\frac{y_{1} - μ_{1}}{σ_{i 1}})\} - 1] [\exp \{- γ_{i} Φ (\frac{y_{2} - μ_{2}}{σ_{i 2}})\} - 1]}{e^{- γ_{i}} - 1})$

for

γ_{i} \neq 0

. Again, the correlation function does not have a closed-form, and is not displayed here. It is known that

{l i m}_{γ \to - \infty} ρ_{C^{Frank}} (γ) = - 1

and

{l i m}_{γ \to \infty} ρ_{C^{Frank}} (γ) = 1

. Thus, the Frank copula parameter does not require boundary correction.

3.2. Statistical Inference Methods

This subsection develops statistical inference methods under the proposed model.

We propose the MLE for $μ$ under the general copula model (Definition 1) in Equation (2). Suppose that the copula density $C_{θ}^{[1, 1]}$ exists. Then, the joint density of $Y_{i}$ is

$f_{i, μ}^{} (y) = \frac{\partial^{2}}{\partial y_{1} \partial y_{2}} P r (Y_{i 1} \leq y_{1}, Y_{i 2} \leq y_{2}) = \frac{1}{σ_{i 1} σ_{i 2}} φ (\frac{y_{1} - μ_{1}}{σ_{i 1}}) φ (\frac{y_{2} - μ_{2}}{σ_{i 2}}) C_{θ_{i}}^{[1, 1]} \{Φ (\frac{Y_{1} - μ_{1}}{σ_{i 1}}), Φ (\frac{Y_{2} - μ_{2}}{σ_{i 2}})\} .$

where

y = (y_{1}, y_{2})

and

φ (\cdot)

is the density of

N (0, 1)

. Given the samples, the log-likelihood function is

$ℓ_{n} (μ) = constant + \sum_{i = 1}^{n} \log [C_{θ_{i}}^{[1, 1]} \{Φ (\frac{Y_{i 1} - μ_{1}}{σ_{i 1}}), Φ (\frac{Y_{i 2} - μ_{2}}{σ_{i 2}})\}] - \frac{1}{2} \sum_{j = 1}^{2} \sum_{i = 1}^{n} {(\frac{Y_{i j} - μ_{j}}{σ_{i j}})}^{2} .$

The MLE of the common mean vector is defined as

${\hat{μ}}_{n} = [\begin{matrix} {\hat{μ}}_{n, 1} \\ {\hat{μ}}_{n, 2} \end{matrix}] = \underset{μ \in R^{2}}{argmax} ℓ_{n} (μ),$

where

R = (- \infty, \infty)

is a real line. The MLE does not have a closed-form expression except for the normal copula. Thus, the MLE can also be obtained by the Newton–Raphson algorithm or some software functions (e.g., the R functions optim or nlm). One may also apply our R package CommonMean.Copula [26], which will be explained in Section 6.

3.3. Information Matrix

For the MLE to be well-behaved, it is necessary to show that the (Fisher) information matrix exists and is non-singular. In other words, the MLE, without verifying these conditions, may have some problems, e.g., the non-existence, inconsistency, or inefficiency of the MLE. Furthermore, the information matrix describes how a copula influences the MLE.

We define the $2 \times 2$ information matrix $I_{i} (μ)$ for $i = 1, 2, \dots, n$ as

$I_{i, j k} (μ) = E \{\frac{\partial \log f_{i, μ} (Y_{i})}{\partial μ_{j}} \frac{\partial \log f_{i, μ} (Y_{i})}{\partial μ_{k}}\}, j, k = 1, 2 .$

The following theorem gives the formula of the information matrix.

Lemma 1.

If $C_{θ}^{[3, 1]}$ , $C_{θ}^{[1, 3]}$ , and $C_{θ}^{[2, 2]}$ exist in ${(0, 1)}^{2}$ , for each $i$ , the following equalities hold

$E \{\frac{\partial \log f_{i, μ} (Y_{i})}{\partial μ_{j}} \frac{\partial \log f_{i, μ} (Y_{i})}{\partial μ_{k}}\} = E \{- \frac{\partial^{2} \log f_{i, μ} (Y_{i})}{\partial μ_{j} \partial μ_{k}}\}; j, k = 1, 2 .$

The proof of Lemma 1 is given in Appendix A.1.

Many copulas have $C_{θ}^{[3, 1]}$ , $C_{θ}^{[1, 3]}$ , and $C_{θ}^{[2, 2]}$ in ${(0, 1)}^{2}$ , such as the normal, FGM, and Clayton copulas (Appendix A.2.). The following theorem gives the formula of the information matrix.

Theorem 1.

Under the copula-based model (Definition 1), the information matrix does not depend on $μ$ . Furthermore, if $C_{θ}^{[3, 1]}$ , $C_{θ}^{[1, 3]}$ , and $C_{θ}^{[2, 2]}$ exist in ${(0, 1)}^{2}$ , it can be decomposed into the sum of the information matrix for the independent model and the additional information by the copula,

(3) $I_{i} = [\begin{matrix} \frac{1}{σ_{i 1}^{2}} & 0 \\ 0 & \frac{1}{σ_{i 2}^{2}} \end{matrix}] + [\begin{matrix} \frac{1}{σ_{i 1}^{2}} E_{C}^{11} (θ_{i}) & \frac{1}{σ_{i 1} σ_{i 2}} \{E_{C}^{12} (θ_{i}) - ρ_{C} (θ_{i})\} \\ \frac{1}{σ_{i 1} σ_{i 2}} \{E_{C}^{12} (θ_{i}) - ρ_{C} (θ_{i})\} & \frac{1}{σ_{i 2}^{2}} E_{C}^{22} (θ_{i}) \end{matrix}],$

where

$E_{C}^{11} (θ_{i}) = E {\{\frac{φ (Z_{i 1}) C_{θ_{i}}^{[2, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}\}}^{2},$

$E_{C}^{22} (θ_{i}) = E {\{\frac{φ (Z_{i 2}) C_{θ_{i}}^{[1, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}\}}^{2},$

$E_{C}^{12} (θ_{i}) = E [\frac{φ (Z_{i 1}) φ (Z_{i 2}) C_{θ_{i}}^{[2, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\} C_{θ_{i}}^{[1, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} {Φ (Z_{i 1}), Φ (Z_{i 2})}^{2}}] .$

Theorem 1 can be proved by straightforward calculations as Lemma 1 (Appendix A.1.). Theorem 1 helps us interpret the role of the copula $C_{θ_{i}}$ on the information matrix.

Theorem 2.

The determinant of $I_{i}$ can be expressed as

$\det (I_{i}) = \frac{1}{σ_{i 1}^{2} σ_{i 2}^{2}} {E_{C}^{11} (θ_{i}) E_{C}^{22} (θ_{i}) - E_{C}^{12} {(θ_{i})}^{2}} + \frac{1}{σ_{i 1}^{2} σ_{i 2}^{2}} {E_{C}^{11} (θ_{i}) + E_{C}^{22} (θ_{i}) + 2 ρ_{C} (θ_{i}) E_{C}^{12} (θ_{i})} + \frac{1}{σ_{i 1}^{2} σ_{i 2}^{2}} {1 - ρ_{C} {(θ_{i})}^{2}} .$

In addition,

\det (I_{i}) > 0

and

I_{i}

is positive definite.

Proof of Theorem 2.

The expression of $\det (I_{i})$ is obtained by straightforward calculations. Clearly, we have $|ρ_{C} (θ_{i})| < 1$ . Then, by the Cauchy-Schwarz inequality,

$E_{C}^{11} (θ_{i}) E_{C}^{22} (θ_{i}) \geq E_{C}^{12} {(θ_{i})}^{2} .$

Furthermore, by the arithmetic-geometric mean inequality, we have

$E_{C}^{11} (θ_{i}) + E_{C}^{22} (θ_{i}) \geq 2 {E_{C}^{11} (θ_{i}) E_{C}^{22} (θ_{i})}^{1 / 2} \geq 2 |E_{C}^{12} (θ_{i})| > 2 |ρ_{C} (θ_{i}) E_{C}^{12} (θ_{i})| .$

Then we obtain $\det (I_{i}) > 0$ . Since $E_{C}^{11} (θ_{i}) / σ_{i 1}^{2} + 1 / σ_{i 1}^{2} > 0$ , both the upper left $1 \times 1$ and $2 \times 2$ determinants of $I_{i}$ are positive. Thus, $I_{i}$ is positive definite. □

Based on Theorem 1, one can derive the information matrix $I_{i} (μ)$ for parametric copulas. Below, we show examples of the normal, FGM, and Clayton copulas.

Example 6.

(The normal copula): Under the normal copula

$E_{C^{Normal}}^{11} (ρ_{i}) = E_{C^{Normal}}^{22} (ρ_{i}) = \frac{ρ_{i}^{2}}{1 - ρ_{i}^{2}}, E_{C^{Normal}}^{12} (ρ_{i}) = - \frac{ρ_{i}^{3}}{1 - ρ_{i}^{2}} .$

Then, by Theorem 1, the information matrix in Equation (3) becomes

$I_{i} = [\begin{matrix} \frac{1}{σ_{i 1}^{2}} & 0 \\ 0 & \frac{1}{σ_{i 2}^{2}} \end{matrix}] + [\begin{matrix} \frac{1}{σ_{i 1}^{2}} \frac{ρ_{i}^{2}}{1 - ρ_{i}^{2}} & \frac{1}{σ_{i 1} σ_{i 2}} (- \frac{ρ_{i}^{3}}{1 - ρ_{i}^{2}} - ρ_{i}) \\ \frac{1}{σ_{i 1} σ_{i 2}} (- \frac{ρ_{i}^{3}}{1 - ρ_{i}^{2}} - ρ_{i}) & \frac{1}{σ_{i 2}^{2}} \frac{ρ_{i}^{2}}{1 - ρ_{i}^{2}} \end{matrix}] = \frac{1}{1 - ρ_{i}^{2}} [\begin{matrix} \frac{1}{σ_{i 1}^{2}} & - \frac{ρ_{i}}{σ_{i 1} σ_{i 2}} \\ - \frac{ρ_{i}}{σ_{i 1} σ_{i 2}} & \frac{1}{σ_{i 2}^{2}} \end{matrix}] = Ω_{i}^{- 1} .$

and its determinant is

\det (I_{i}^{Normal}) = 1 / (σ_{i 1}^{2} σ_{i 2}^{2})

. Clearly,

I_{i}^{Normal}

is positive definite.

Example 7.

(The FGM copula): Under the FGM copula

$E_{C^{FGM}}^{11} (θ_{i}) = E_{C^{FGM}}^{22} (θ_{i}) = 4 θ_{i}^{2} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \frac{φ {(z_{1})}^{3} {1 - 2 Φ (z_{2})}^{2} φ (z_{2})}{1 + θ_{i} \{1 - 2 Φ (z_{1})\} \{1 - 2 Φ (z_{2})\}} d z_{1} d z_{2},$

$E_{C^{FGM}}^{12} (θ_{i}) = 4 θ_{i}^{2} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \frac{φ {(z_{1})}^{2} \{1 - 2 Φ (z_{1})\} φ {(z_{2})}^{2} \{1 - 2 Φ (z_{2})\}}{1 + θ_{i} \{1 - 2 Φ (z_{1})\} \{1 - 2 Φ (z_{2})\}} d z_{1} d z_{2}, ρ_{C^{FGM}} (θ) = \frac{θ}{π} .$

Then, by Theorem 1, the information matrix in Equation (3) becomes

$I_{i}^{FGM} = [\begin{matrix} \frac{1}{σ_{i 1}^{2}} & 0 \\ 0 & \frac{1}{σ_{i 2}^{2}} \end{matrix}] + [\begin{matrix} \frac{1}{σ_{i 1}^{2}} E_{C^{FGM}}^{11} (θ_{i}) & \frac{1}{σ_{i 1} σ_{i 2}} E_{C^{FGM}}^{12} (θ_{i}) - \frac{θ_{i}}{π σ_{i 1} σ_{i 2}} \\ \frac{1}{σ_{i 1} σ_{i 2}} E_{C^{FGM}}^{12} (θ_{i}) - \frac{θ_{i}}{π σ_{i 1} σ_{i 2}} & \frac{1}{σ_{i 2}^{2}} E_{C^{FGM}}^{22} (θ_{i}) \end{matrix}] .$

By Theorem 2, its determinant is

$\det (I_{i}) = \frac{1}{σ_{i 1}^{2} σ_{i 2}^{2}} \{E_{C^{FGM}}^{11} {(θ_{i})}^{2} - E_{C^{FGM}}^{12} {(θ_{i})}^{2}\} + \frac{2}{σ_{i 1}^{2} σ_{i 2}^{2}} \{E_{C^{FGM}}^{11} (θ_{i}) + \frac{θ_{i}}{π} E_{C^{FGM}}^{12} (θ_{i})\} + \frac{1}{σ_{i 1}^{2} σ_{i 2}^{2}} (1 - \frac{θ_{i}^{2}}{π^{2}}) .$

This result agrees with [3] who considered the FGM model.

Example 8.

(The Clayton copula): Under the Clayton copula

$E_{C^{Clayton}}^{11} (α_{i}) = E_{C^{Clayton}}^{22} (α_{i}) = (α_{i} + 1) \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \frac{φ {(z_{1})}^{3} φ (z_{2}) {α_{i} Φ {(z_{1})}^{- α_{i}} - (α_{i} + 1) Φ {(z_{2})}^{- α_{i}} + (α_{i} + 1)}^{2}}{Φ {(z_{1})}^{α_{i} + 3} Φ {(z_{2})}^{α_{i} + 1} {Φ {(z_{1})}^{- α_{i}} + Φ {(z_{2})}^{- α_{i}} - 1}^{1 / α_{i} + 4}} d z_{1} d z_{2},$

$E_{C^{Clayton}}^{12} (α_{i}) = \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \frac{φ {(z_{1})}^{2} {α_{i} Φ {(z_{1})}^{- α_{i}} - (α_{i} + 1) Φ {(z_{2})}^{- α_{i}} + (α_{i} + 1)}}{Φ {(z_{1})}^{α_{i} + 2} {Φ {(z_{1})}^{- α_{i}} + Φ {(z_{2})}^{- α_{i}} - 1}^{1 / 2 α_{i} + 2}} \times \frac{φ {(z_{2})}^{2} {α_{i} Φ {(z_{2})}^{- α_{i}} - (α_{i} + 1) Φ {(z_{1})}^{- α_{i}} + (α_{i} + 1)}}{Φ {(z_{2})}^{α_{i} + 2} {Φ {(z_{1})}^{- α_{i}} + Φ {(z_{2})}^{- α_{i}} - 1}^{1 / 2 α_{i} + 2}} d z_{1} d z_{2},$

$ρ_{C^{Clayton}} (α_{i}) = (α_{i} + 1) \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \frac{z_{1} z_{2} φ (z_{1}) φ (z_{2})}{Φ {(z_{1})}^{α_{i} + 1} Φ {(z_{2})}^{α_{i} + 1} {Φ {(z_{1})}^{- α_{i}} + Φ {(z_{2})}^{- α_{i}} - 1}^{1 / α_{i} + 2}} d z_{1} d z_{2} .$

Then, by Theorems 1 and 2, we obtain

I_{i}^{Clayton}

and

\det (I_{i}^{Clayton})

accordingly.

4. Asymptotic Theory

To assess the sampling variability of ${\hat{μ}}_{n}$ , its asymptotic distribution is presented in this section.

A technical burden comes from the fact that our samples $Y_{i}$ , $i = 1, 2, \dots, n$ are independent and non-identically distributed (i.n.i.d.) owing to heterogeneous variances ( $Ω_{i} \neq Ω_{j}, i \neq j$ ). The existence of the asymptotic distribution requires the stabilization of the information matrix [3,46,47] in large samples. For the asymptotic variance of ${\hat{μ}}_{n}$ , to be defined, we assume the existence of a $2 \times 2$ positive definite matrix $I \equiv \lim_{n \to \infty} \sum_{i = 1}^{n} I_{i} / n$ . We further assume that the copula’s derivatives $C_{θ}^{[4, 1]}$ , $C_{θ}^{[3, 2]}$ , $C_{θ}^{[2, 3]}$ , and $C_{θ}^{[1, 4]}$ exist in ${(0, 1)}^{2}$ . With these conditions and many other technical conditions given in [48], we establish the consistency and asymptotic normality of ${\hat{μ}}_{n}$ :

Theorem 3.

Under the copula model (Definition 1), if some regularity conditions hold, then

(a)
Existence and consistency: With probability tending to one, there exists the MLE ${\hat{μ}}_{n} = ({\hat{μ}}_{n, 1}, {\hat{μ}}_{n, 2})$ such that ${\hat{μ}}_{n} \to^{p} μ$ , as $n \to \infty$ ;
(b)
Asymptotic normality: $n^{1 / 2} ({\hat{μ}}_{n} - μ) \to^{d} N (0, I^{- 1})$ , as $n \to \infty$ .

The proof of Theorem 3 and the required regularity conditions are given in the Ph.D dissertation of [48]. The proof approximates $n^{1 / 2} ({\hat{μ}}_{n} - μ)$ by the sum of independent random variables, and then applies the weak law of large numbers for i.n.i.d. random variables from Theorem 1.14 in [49] and the Lindeberg–Feller multivariate central limit theorem from Proposition 2.27 in [50]. The proof is fairly technical, but similar to those of Theorem 6.5.1 in [51], Theorem 1 in [47], and Theorem 5.1 in [3].

5. SE and Confidence Sets

As Section 4 has established the asymptotic theory to evaluate the variability of the proposed MLE, we can derive the SE, confidence interval (CI), and confidence ellipse (CE).

Let $g : R^{2} \mapsto R$ be a differentiable function, and $g (μ)$ be the parameter of interest. For instance, $g (μ) = μ_{1}$ and $g (μ) = μ_{2} - μ_{1}$ can be considered. The SE of $g ({\hat{μ}}_{n})$ is

$SE \{g ({\hat{μ}}_{n})\} = {[{{\{\frac{\partial g (μ)}{\partial μ}\}}^{T} {\{- \frac{\partial^{2} ℓ_{n} (μ)}{\partial μ \partial μ^{T}}\}}^{- 1} \{\frac{\partial g (μ)}{\partial μ}\}|}_{μ = {\hat{μ}}_{n}}]}^{1 / 2} .$

This formula is based on the delta method and the large sample approximation

$I \approx \frac{1}{n} \sum_{i = 1}^{n} I_{i} \approx {- \frac{1}{n} \frac{\partial^{2} ℓ_{n} (μ)}{\partial μ \partial μ^{T}}|}_{μ = {\hat{μ}}_{n}} .$

The 95% CI for $g (μ)$ is $g ({\hat{μ}}_{n}) \pm 1.96 \times SE \{g ({\hat{μ}}_{n})\}$ .

Moreover, based on Theorem 3, we construct a 95% CE for $μ$ :

$CE = \{μ : {({\hat{μ}}_{n} - μ)}^{T} ({- \frac{\partial^{2} ℓ_{n} (μ)}{\partial μ \partial μ^{T}}|}_{μ = {\hat{μ}}_{n}}) ({\hat{μ}}_{n} - μ) \leq χ_{df = 2, 0.95}^{2}\},$

where

χ_{df = 2, 0.95}^{2}

is be the 95% point of the

χ^{2}

-distribution with two degrees of freedom.

6. R Package

We implement the proposed methods in an R package CommonMean.Copula [26]. R users can easily compute the MLE with its SE and 95% CI under the FGM, Clayton, Gumbel, Frank, and normal copulas. In this package, the log-likelihood is maximized by the R optim function, where the initial values are set as the univariate estimators

$μ_{j}^{(0)} = {(\sum_{i = 1}^{n} \frac{1}{σ_{i j}^{2}})}^{- 1} \sum_{i = 1}^{n} \frac{Y_{i j}}{σ_{i j}^{2}}, j = 1, 2 .$

For illustration, we fitted the Clayton copula by the following R codes:

- some outputs are omitted for brevity –

Here, $CommonMean1 shows ${\hat{μ}}_{n, 1}^{Clayton} = 33.95$ , $SE ({\hat{μ}}_{n, 1}^{Clayton}) = 0.439$ , and the 95% CI (33.089, 34.812); $CommonMean2 is similar. $V shows the covariance matrix $C o v ({\hat{μ}}_{n}^{Clayton})$ . $‘Log-likelihood values’ shows $ℓ_{n}^{Clayton} ({\hat{μ}}_{n}^{Clayton}) = - 285.65$ . One can fit other copulas by changing “Clayton” to “FGM”, “Gumbel”, “Frank”, or “normal”.

7. Simulation Studies

We conducted Monte Carlo simulations to examine the accuracy of the proposed methods. We report the results for the Clayton copula; more results are available from [48].

We generated $Y_{i}$ , $i = 1, 2, \dots, n$ , under the Clayton copula with $α_{i} ~ G a m m a (64, 1 / 8)$ , $G a m m a (4, 1 / 2)$ , or $G a m m a (1, 1)$ , leading to $E [α_{i}] = 8$ , $E [α_{i}] = 2$ , or $E [α_{i}] = 1$ , respectively. In all three cases, we have $V a r [α_{i}] = 1$ . Without loss of generality, we set $μ = (0, 0)$ . To set $σ_{i 1}^{2}$ and $σ_{i 2}^{2}$ , we followed the simulation setting of [52]. That is, $σ_{i 1}^{2}, σ_{i 2}^{2} ~ χ_{d f = 1}^{2} / 4$ , restricted in the interval $[0.009, 0.6]$ . This setting leads to $E [σ_{i 1}^{2}]= E[σ_{i 2}^{2}] = 0.173$ . Based on the generated data, we computed ${\hat{μ}}_{n, 1}^{Clayton}$ , ${\hat{μ}}_{n, 2}^{Clayton} - {\hat{μ}}_{n, 1}^{Clayton}$ , and ${\hat{μ}}_{n}^{Clayton}$ , and their SEs and 95% CIs (CEs) by using the R function CommonMean.Copula (Section 6). We then evaluated the coverage probability (CP) of the 95% CI (CE) to see how the confidence set can cover the true value. We consider a small sample size $n \in \{5, 10, 15\}$ and a large sample size $n \in \{50, 100, 300\}$ . Our simulations are based on 1000 repetitions.

Table 1 summarizes the results. For ${\hat{μ}}_{n, 1}^{Clayton}$ and ${\hat{μ}}_{n, 2}^{Clayton} - {\hat{μ}}_{n, 1}^{Clayton}$ , the SDs of the estimates decrease when $n$ increases from $n = 5$ to $n = 300$ . We report the boxplots summarizing the 1000 repetitions for ${\hat{μ}}_{n, 1}^{Clayton}$ in Figure 2. This clearly visualizes how the variability of the estimates vanishes as the sample sizes increase. Table 1 also shows that the SDs are close to the average SEs, except for $n = 5$ (due to the very small samples). Consequently, the CPs are close enough to the nominal level of 0.95, especially when sample sizes are large, which is consistent with our asymptotic theories. For ${\hat{μ}}_{n}^{Clayton}$ , the CPs of the 95% CEs are also reasonably close to the nominal level. In summary, the proposed estimators and the asymptotic theory work fairly well in finite samples.

8. Data Analysis

We analyze two real datasets to illustrate the usefulness of the proposed methods.

8.1. The Entrance Exam Data

The first dataset we analyzed was the entrance exam scores on mathematics and statistics, which was introduced by [3]. The data come from undergrad students who took written exams from 2013 to 2017 to enter the Graduate Institute of Statistics, National Central University, Taiwan. The possible score range is from 0 to 100 for both subjects. Let $i = 1, 2, \dots, 5$ be indices for years 2013, 2014, …, 2017. Table 2 provides the data, including the values of mathematics ( $Y_{i 1}$ = mean math score) and statistics ( $Y_{i 2}$ = mean stat score), and their covariance matrix ( $Ω_{i}$ ).

We fitted the data to the proposed model using the R function CommonMean.Copula(.) in our R package (Section 6). Table 3 summarizes the fitted results for the FGM, Clayton, Gumbel, Frank, and normal copulas. According to the values of the log-likelihood, the Gumbel copula produces the best fit, the Frank copula the second best, and the bivariate normal model the worst fit. The FGM copula failed to capture the dependence and fitted at the boundary $θ_{i} = 1$ for all $i$ (Table 2).

Since the number of unknown parameters across different copulas is the same, model selection by the Akaike information criterion (AIC) is equivalent to model selection by the log-likelihood value. An alternative way of selecting a copula is based on a leave-one-out cross validation (CV), defined as

$C V = \sum_{i = 1}^{n} {{(Y_{i 1} - {\hat{μ}}_{n, 1}^{(- i)})}^{2} + {(Y_{i 2} - {\hat{μ}}_{n, 2}^{(- i)})}^{2}},$

where

{\hat{μ}}_{n, 1}^{(- i)}

and

{\hat{μ}}_{n, 2}^{(- i)}

are the MLE obtained without the ith sample. Here,

C V

measures how a sample is predicted by the others under a copula model. A smaller

C V

corresponds to a better performance of the model.

Table 3 reports the values of $C V$ for each copula. It shows that the Clayton copula has the best performance while the Gumbel copula has the worst. The normal copula has the second worst performance. Overall, our analysis clearly shows the insufficiency of the bivariate normal model.

Figure 3 shows the 95% CEs for the mean vector $μ$ . This visualizes how the resultant estimates vary from the choice of copulas. Interestingly, the CE under the Clayton copula is far away from the other four, although it has a larger log-likelihood value than the normal copula. The normal and Clayton copulas produce the rotated oval shape of the CEs, representing a positive dependence between math and stat scores. The FGM and Gumbel copulas produce similar shapes for their CE. We adopt the 95% CE given by the Gumbel copula because it has the largest log-likelihood value.

Figure 4 gives the 3D plot of the log-likelihood surface under the Gumbel copula model. The plot shows that the estimate of the common mean ${\hat{μ}}_{n}^{Gumbel} = (37.67, 42.56)$ attains the global maximum of the log-likelihood function.

8.2. The Blood Pressure Data

The second dataset we used contains 10 studies that examined the effectiveness of hypertension treatment for lowering blood pressure. Each study provides complete data on two treatment effects, the difference in systolic blood pressure (SBP) and diastolic blood pressure (DBP) between the treatment and the control groups, where these differences are adjusted for the participants’ baseline blood pressures. The within-study correlations of the two outcomes range from $ρ_{i} = 0.45$ to $ρ_{i} = 0.78$ , exhibiting positive dependence. This dataset is available in R package mvmeta [30] and was previously analyzed by [53].

We fitted the data to the proposed copula models using the R function CommonMean.Copula(.) in our R package (Section 6). Table 4 summarizes the fitted results for all the copulas. Based on the log-likelihood values, the Frank copula produces the best fit, the Gumbel copula the second best, and the Clayton copula produces the worst fit. The FGM copula failed to capture the dependence and fitted at the boundary $θ_{i} = 1$ for all $i$ . Again, our analysis reveals the insufficiency of the bivariate normal model; the Frank copula best captured the correlations in the blood pressure data. We also compared $C V$ across all the copulas (Table 4). The results show that the Clayton copula has the best performance while the normal copula has the worst. Again, our analysis shows the insufficiency of the normal model.

Figure 5 shows the 95% CEs for the mean vector $μ$ . The CE under the Clayton and normal copula are far away from the other three. The CE under the FGM copula was almost fully covered by the CE under the Frank copula. We adopt the 95% CE given by the Frank copula, since it has the largest log-likelihood value (Table 4).

Figure 6 depicts the 3D plot of the log-likelihood surface under the Frank copula model. The plot shows that the estimate of the common mean ${\hat{μ}}_{n}^{Frank} = (- 9.20, - 3.94)$ attains the global maximum of the log-likelihood function.

9. Extension to Non-Normal Models

So far, we have considered a common mean model under the marginal normality. This section explains how the proposed methods can be extended to non-normal models. For this reason, we specifically consider a common mean model under the marginal exponential distributions.

${\overline{F}}_{j} (y) \equiv P r (Y_{i j} > y) = \exp (- λ_{j} y), y > 0, λ_{j} > 0, j = 1, 2 .$

Thus, the common mean vector is $μ = (1 / λ_{1}, 1 / λ_{2}) .$

We consider the Clayton copula to specify the bivariate distribution because it has simple derivatives with respect to the copula parameter [54]. Therefore, we propose a bivariate common mean Clayton copula model with exponential margins as follows:

(4) $P r (Y_{i 1} > y_{1}, Y_{i 2} > y_{2}) = C_{α_{i}}^{Clayton} \{{\overline{F}}_{1} (y_{1}), {\overline{F}}_{2} (y_{2})\} = {\exp (α_{i} λ_{1} y_{1}) + \exp (α_{i} λ_{2} y_{2}) - 1}^{- 1 / α_{i}},$

where

α_{i}

is known for

i = 1, 2, \dots, n

. Note that copula

C_{α_{i}}^{Clayton}

is a survival copula for (

Y_{i 1}, Y_{i 2})

as the usual way to model a survival function [22]. Using similar arguments to [55], the information matrix with respect to

λ = (λ_{1}, λ_{2})

can be decomposed as

(5) $I_{i} (λ) = [\begin{matrix} \frac{1}{λ_{1}^{2}} & 0 \\ 0 & \frac{1}{λ_{2}^{2}} \end{matrix}] + [\begin{matrix} \frac{2 α_{i}^{2} (α_{i} + 1)}{λ_{1}^{2} (3 α_{i} + 1)} & - \frac{α_{i} (2 α_{i} + 1)}{λ_{1} λ_{2}} ϕ (α_{i}) \\ - \frac{α_{i} (2 α_{i} + 1)}{λ_{1} λ_{2}} ϕ (α_{i}) & \frac{2 α_{i}^{2} (α_{i} + 1)}{λ_{2}^{2} (3 α_{i} + 1)} \end{matrix}],$

where

$ϕ (α) = \frac{1}{3 α + 1} + \frac{1}{2 (3 α + 1) (2 α + 1)} \{Ψ (\frac{1}{2 α}) - Ψ (\frac{α + 1}{2 α})\}, Ψ (α) = \frac{d^{2} \log Γ (α)}{d α^{2}} .$

See Appendix A.3. for detailed derivations. The expression of $I_{i} (λ)$ is an extension of Theorem 3 to the exponential model. With the information matrix, the properties of the MLE and the asymptotic theory are similar to the normal models.

We conducted Monte Carlo simulations to examine the correctness of Equation (5) by comparing it with their empirical version. We set $λ_{1} = λ_{2} = 1$ and $α_{i} = 1$ for all $i$ . We generated data $(Y_{i 1}, Y_{i 2}), i = 1, \dots, n$ from the model in Equation (4) and computed the empirical versions of $I_{i, 11} (λ)$ and $I_{i, 12} (λ)$ as

$\frac{1}{n} \sum_{i = 1}^{n} \frac{\partial^{2} \log f_{i, λ}^{Clayton} (Y_{i 1}, Y_{i 2})}{\partial λ_{1}^{2}}, \frac{1}{n} \sum_{i = 1}^{n} \frac{\partial^{2} \log f_{i, λ}^{Clayton} (Y_{i 1}, Y_{i 2})}{\partial λ_{1} \partial λ_{2}} .$

The formulas for the derivatives of the log-density are found in Equations (A1) and (A2) in Appendix A.3. Our simulations were based on 1000 repetitions with $n \in \{100, 200, 300, 400, 500\}$ .

Figure 7 depicts the simulation results based on 1000 repetitions. It clearly shows that the empirical versions are scattered around the theoretical values of $I_{i, 11} (λ) = 2$ and $I_{i, 12} (λ) = - 1.16$ . The variability of the empirical versions vanishes as $n$ increases. The simulation results assert the correctness of Equation (5).

10. Conclusions

At present, copula models are very popular in all areas of science. Bivariate meta-analyses are among those research areas that require sophisticated copula-based methods and theories. Nonetheless, there are only a few studies on copula-based bivariate meta-analysis from a methodological/theoretical perspective. This article fully develops the methodologies and theories of the copula-based bivariate meta-analysis, specifically for estimating the common mean vector. These developments will provide solid methodological/theoretical bases that are not available to date.

In this article, we emphasize the flexibility of the proposed copula models that allow for a variety of dependence structures. In the two real data examples, we employed the log-likelihood value as a criterion for model selection (Section 8). Even if the best copula is selected, it still raises the issue of goodness-of-fit, which is difficult to assess under the meta-analysis setting. The classical methods, such as Kolmogorov–Smirnov or Cramér–von Mises type statistics, cannot be directly applied to the non-identically distributed samples for which the empirical distribution function is difficult to interpret. Therefore, the development of goodness-of-fit tests is a possible research direction.

The fundamental assumption made in the proposed model is the common mean model, with known within-study correlations. The common mean assumption, although convenient for summarizing the data for a small number of studies [56], may not always hold in real meta-analyses [6]. Therefore, the extension of the proposed estimator to random means (random-effects models) or ordered means [57,58] is an important direction for future research. To model the random effects, we need another bivariate copula. The estimation problem for these hierarchical copula-based models is beyond the scope of the paper. Nonetheless, the results presented in this paper serve as fundamental knowledge before the exploration of more advanced models.

Author Contributions

Conceptualization, J.-H.S. and T.E.; methodology, J.-H.S. and T.E.; data curation, J.-H.S. and T.E.; writing, J.-H.S., Y.K., Y.-T.C. and T.E.; supervision, J.-H.S., Y.K., Y.-T.C. and T.E.; funding acquisition, Y.K. and Y.-T.C. All authors have read and agreed to the published version of the manuscript.

Funding

Konno Y. is financially supported by JSPS KAKENHI Grant Number 19K11867. Chang Y.-T. is supported by JSPS KAKENHI Grant Numbers JP26330047 and JP18K11196.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors thank five reviewers for their helpful comments that improved the manuscript. The authors kindly thank the Special Issue editors for their invitation to submit our work to Symmetry.

Conflicts of Interest

The authors declare no conflict of interest.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures and Tables

View Image - Figure 1. The boundary correction for the correlation coefficient under the bivariate FGM and Clayton models. The first step forces the correlation [Forumla omitted. See PDF.] to fall in a range that can be modeled by the chosen copula. The second step transforms the corrected correlation [Forumla omitted. See PDF.] to the corresponding copula parameter.

Figure 1. The boundary correction for the correlation coefficient under the bivariate FGM and Clayton models. The first step forces the correlation [Forumla omitted. See PDF.] to fall in a range that can be modeled by the chosen copula. The second step transforms the corrected correlation [Forumla omitted. See PDF.] to the corresponding copula parameter.

View Image - Figure 2. The boxplots summarizing the 1000 repetitions for [Forumla omitted. See PDF.] with true parameter [Forumla omitted. See PDF.] under the copula parameters [Forumla omitted. See PDF.], [Forumla omitted. See PDF.], or [Forumla omitted. See PDF.]. The sample size varies from [Forumla omitted. See PDF.] to [Forumla omitted. See PDF.].

Figure 2. The boxplots summarizing the 1000 repetitions for [Forumla omitted. See PDF.] with true parameter [Forumla omitted. See PDF.] under the copula parameters [Forumla omitted. See PDF.], [Forumla omitted. See PDF.], or [Forumla omitted. See PDF.]. The sample size varies from [Forumla omitted. See PDF.] to [Forumla omitted. See PDF.].

View Image - Figure 3. The MLE and the 95% CE (colored region) for the common means based on the exam score data. The copulas are signified by colors: blue = FGM; gray = Clayton; orange = Gumbel; pink = Frank; green = normal.

Figure 3. The MLE and the 95% CE (colored region) for the common means based on the exam score data. The copulas are signified by colors: blue = FGM; gray = Clayton; orange = Gumbel; pink = Frank; green = normal.

View Image - Figure 4. The 3D plot of the log-likelihood value under the Gumbel copula based on the entrance exam data. The maximum occurs at [Forumla omitted. See PDF.].

Figure 4. The 3D plot of the log-likelihood value under the Gumbel copula based on the entrance exam data. The maximum occurs at [Forumla omitted. See PDF.].

View Image - Figure 5. The MLE and the 95% CE (colored region) for the common means based on the blood pressure data. The correspondence copulas of the colored regions are: blue = FGM; gray = Clayton; orange = Gumbel; pink = Frank; green = normal.

Figure 5. The MLE and the 95% CE (colored region) for the common means based on the blood pressure data. The correspondence copulas of the colored regions are: blue = FGM; gray = Clayton; orange = Gumbel; pink = Frank; green = normal.

View Image - Figure 6. The 3D plot of the log-likelihood value under the Frank copula based on the blood pressure data. The maximum occurs at [Forumla omitted. See PDF.].

Figure 6. The 3D plot of the log-likelihood value under the Frank copula based on the blood pressure data. The maximum occurs at [Forumla omitted. See PDF.].

View Image - Figure 7. The boxplots summarizing the 1000 repetitions for the empirical versions of [Forumla omitted. See PDF.] and [Forumla omitted. See PDF.] (dashed lines) under the Clayton copula model in Equation (4) with parameter = 1. The sample size varies from [Forumla omitted. See PDF.] to [Forumla omitted. See PDF.].

Figure 7. The boxplots summarizing the 1000 repetitions for the empirical versions of [Forumla omitted. See PDF.] and [Forumla omitted. See PDF.] (dashed lines) under the Clayton copula model in Equation (4) with parameter = 1. The sample size varies from [Forumla omitted. See PDF.] to [Forumla omitted. See PDF.].

Table 1

Simulation results based on 1000 repetitions.

		${\hat{μ}}_{n, 1}^{Clayton}$			${\hat{μ}}_{n, 2}^{Clayton} - {\hat{μ}}_{n, 1}^{Clayton}$			${\hat{μ}}_{n}^{Clayton}$
Parameters	$n$	SD	SE	CP	SD	SE	CP	CP
$E [α_{i}] = 8$	5	0.064	0.046	0.888	0.042	0.033	0.885	0.859
	10	0.033	0.026	0.913	0.023	0.019	0.931	0.894
	15	0.021	0.019	0.933	0.015	0.014	0.936	0.919
	50	0.010	0.009	0.952	0.007	0.007	0.954	0.950
	100	0.007	0.006	0.955	0.005	0.005	0.943	0.944
	300	0.004	0.004	0.948	0.003	0.003	0.942	0.948
$E [α_{i}] = 2$	5	0.105	0.092	0.938	0.100	0.086	0.920	0.909
	10	0.061	0.057	0.937	0.060	0.055	0.929	0.919
	15	0.049	0.045	0.938	0.045	0.042	0.943	0.930
	50	0.023	0.023	0.959	0.022	0.021	0.944	0.943
	100	0.016	0.016	0.942	0.015	0.015	0.957	0.950
	300	0.009	0.009	0.946	0.008	0.008	0.946	0.949
$E [α_{i}] = 1$	5	0.115	0.105	0.932	0.128	0.120	0.935	0.922
	10	0.069	0.068	0.950	0.079	0.075	0.937	0.937
	15	0.058	0.053	0.941	0.062	0.058	0.947	0.937
	50	0.028	0.027	0.942	0.029	0.030	0.955	0.943
	100	0.019	0.019	0.942	0.021	0.020	0.936	0.945
	300	0.010	0.011	0.958	0.011	0.012	0.960	0.959

SD = standard deviation, SE = standard error, CP = coverage probability of the 95% CI (CE).

Table 2

The entrance exam data from [3].

$i$	Year	Mean Math Score $(Y_{i 1})$	Mean Stat Score $(Y_{i 2})$	Covariance Matrix $(Ω_{i})$	Copula Parameter
$i$	Year	Mean Math Score $(Y_{i 1})$	Mean Stat Score $(Y_{i 2})$	Covariance Matrix $(Ω_{i})$	$ρ_{i}$	$θ_{i}$	$α_{i}$	$β_{i}$	$γ_{i}$
1	2013	35.17	30.41	$[\begin{matrix} 1.77 & 0.89 \\ 0.89 & 2.99 \end{matrix}]$	0.38	1.00	0.67	1.34	2.68
2	2014	23.43	31.63	$[\begin{matrix} 1.89 & 1.76 \\ 1.76 & 3.61 \end{matrix}]$	0.67	1.00	1.92	1.90	6.00
3	2015	30.74	48.11	$[\begin{matrix} 2.15 & 2.12 \\ 2.12 & 6.13 \end{matrix}]$	0.58	1.00	1.37	1.65	4.67
4	2016	50.91	65.22	$[\begin{matrix} 3.87 & 2.91 \\ 2.91 & 5.02 \end{matrix}]$	0.66	1.00	1.82	1.85	5.76
5	2017	61.62	40.22	$[\begin{matrix} 3.17 & 2.10 \\ 2.10 & 3.29 \end{matrix}]$	0.65	1.00	1.75	1.83	5.60

$ρ_{i}$ = the Pearson correlation; $θ_{i}$ = the FGM copula parameter; $α_{i}$ = the Clayton copula parameter; $β_{i}$ = the Gumbel copula parameter; $γ_{i}$ = the Frank copula parameter.

Table 3

Estimation results for the entrance exam data.

Copula	Math: Estimate (95% CI)	Stat: Estimate (95% CI)	Log-likelihood	$C V$
FGM	37.16 (35.85, 38.47)	41.17 (39.65, 42.70)	−291.80	2723.91
Clayton	32.56 (31.71, 33.40)	43.80 (42.55, 45.05)	−322.84	2644.03
Gumbel	37.67 (36.30, 39.03)	42.56 (40.79, 44.33)	−279.28	2860.21
Frank	37.23 (35.97, 38.49)	39.76 (38.13, 41.40)	−287.63	2738.09
Normal	35.83 (34.51, 37.16)	38.64 (36.94, 40.34)	−342.65	2773.41

Table 4

Estimation results for the blood pressure data.

Copula	SBP: Estimate (95% CI)	DBP: Estimate (95% CI)	Log-likelihood	$C V$
FGM	−9.18 (−9.32, −9.04)	−3.94 (−4.00, −3.89)	−530.29	177.23
Clayton	−9.53 (−9.70, −9.36)	−4.34 (−4.38, −4.29)	−787.02	163.04
Gumbel	−8.99 (−9.16, −8.83)	−3.94 (−4.00, −3.89)	−514.67	184.67
Frank	−9.20 (−9.40, −9.00)	−3.94 (−3.99, −3.88)	−513.34	179.81
Normal	−8.43 (−8.60, −8.25)	−3.95 (−4.01, −3.90)	−771.82	206.49

SBP = the difference in systolic blood pressure; DBP = the difference in diastolic blood pressure.

Appendix A

Appendix A.1. Proof of Lemma 1

We first prepare a lemma:

Lemma A1.

Under the general copula model (Definition 1), if $C_{θ}^{[2, 2]}$ exists in ${(0, 1)}^{2}$ , the correlation function has alternative expressions $ρ_{C} (θ_{i}) = E [\frac{Z_{i 2} φ (Z_{i 1}) C_{θ_{i}}^{[2, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}] = E [\frac{Z_{i 1} φ (Z_{i 2}) C_{θ_{i}}^{[1, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}] = E [\frac{φ (Z_{i 1}) φ (Z_{i 2}) C_{θ_{i}}^{[2, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}],$ where $Z_{i 1}$ and $Z_{i 2}$ have the joint density $f_{i} (z_{1}, z_{2}) = φ (z_{1}) φ (z_{2}) C_{θ_{i}}^{[1, 1]} \{Φ (z_{1}), Φ (z_{2})\} .$

Proof of Lemma A1.

We only prove the first identity for illustration. If $C_{θ}^{[2, 2]}$ exists, then $ρ_{C} (θ_{i}) = \int_{- \infty}^{\infty} z_{2} φ (z_{2}) \int_{- \infty}^{\infty} z_{1} φ (z_{1}) C_{θ_{i}}^{[1, 1]} \{Φ (z_{1}), Φ (z_{2})\} d z_{1} d z_{2} = \int_{- \infty}^{\infty} z_{2} φ (z_{2}) \int_{- \infty}^{\infty} φ {(z_{1})}^{2} C_{θ_{i}}^{[2, 1]} \{Φ (z_{1}), Φ (z_{2})\} d z_{1} d z_{2},$ where the last equality follows from Stein’s identity. Thus, we obtain $ρ_{C} (θ_{i}) = \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} z_{2} φ (z_{2}) φ {(z_{1})}^{2} C_{θ_{i}}^{[2, 1]} \{Φ (z_{1}), Φ (z_{2})\} d z_{1} d z_{2} = \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} z_{2} φ (z_{1}) \frac{C_{θ_{i}}^{[2, 1]} \{Φ (z_{1}), Φ (z_{2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (z_{1}), Φ (z_{2})\}} f_{i} (z_{1}, z_{2}) d z_{1} d z_{2} = E [\frac{Z_{i 2} φ (Z_{i 1}) C_{θ_{i}}^{[2, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}] .$ The proof completes. □

Lemma A1 is a generalization of Lemma 3.2 in [3].

Now, we prove Lemma 1 for $j = 1$ and $k = 2$ . If $C_{θ}^{[2, 2]}$ exists, by straightforward calculations, $E \{\frac{\partial \log f_{i, μ} (Y_{i})}{\partial μ_{j}} \frac{\partial \log f_{i, μ} (Y_{i})}{\partial μ_{k}}\} = \frac{1}{σ_{i 1} σ_{i 2}} (E (Z_{i 1} Z_{i 2}) - E [\frac{Z_{i 2} φ (Z_{i 1}) C_{θ_{i}}^{[2, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}] - E [\frac{Z_{i 1} φ (Z_{i 2}) C_{θ_{i}}^{[1, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}] + E [\frac{φ (Z_{i 1}) φ (Z_{i 2}) C_{θ_{i}}^{[2, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\} C_{θ_{i}}^{[1, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} {\{Φ (Z_{i 1}), Φ (Z_{i 2})\}}^{2}}]) .$

On the other hand, $E \{- \frac{\partial^{2} \log f_{i, μ} (Y_{i})}{\partial μ_{j} \partial μ_{k}}\} = \frac{1}{σ_{i 1} σ_{i 2}} (E [\frac{φ (Z_{i 1}) φ (Z_{i 2}) C_{θ_{i}}^{[2, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\} C_{θ_{i}}^{[1, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} {\{Φ (Z_{i 1}), Φ (Z_{i 2})\}}^{2}}] - E [\frac{φ (Z_{i 1}) φ (Z_{i 2}) C_{θ_{i}}^{[2, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}]) .$

Based on the above results, it suffices to show $E (Z_{i 1} Z_{i 2}) - E [\frac{Z_{i 2} φ (Z_{i 1}) C_{θ_{i}}^{[2, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}] - E [\frac{Z_{i 1} φ (Z_{i 2}) C_{θ_{i}}^{[1, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}] + E [\frac{φ (Z_{i 1}) φ (Z_{i 2}) C_{θ_{i}}^{[2, 2]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}{C_{θ_{i}}^{[1, 1]} \{Φ (Z_{i 1}), Φ (Z_{i 2})\}}] = 0$ which is asserted by Lemma A1. Hence, the proof is completed. □

Appendix A.2. Derivatives for Copulas

The normal copula: $C_{ρ}^{Normal [1, 1]} (u, v) = \frac{1}{\sqrt{1 - ρ^{2}}} \exp \{- \frac{ρ^{2} Φ^{- 1} {(u)}^{2}}{2 (1 - ρ^{2})} - \frac{ρ^{2} Φ^{- 1} {(v)}^{2}}{2 (1 - ρ^{2})} + \frac{ρ Φ^{- 1} (u) Φ^{- 1} (v)}{1 - ρ^{2}}\},$ $C_{ρ}^{Normal [2, 1]} (u, v) = \frac{\sqrt{2 π}}{\sqrt{1 - ρ^{2}}} \exp \{\frac{(1 - 2 ρ^{2}) Φ^{- 1} {(u)}^{2}}{2 (1 - ρ^{2})} - \frac{ρ^{2} Φ^{- 1} {(v)}^{2}}{2 (1 - ρ^{2})} + \frac{ρ Φ^{- 1} (u) Φ^{- 1} (v)}{1 - ρ^{2}}\}$ $\times \{\frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}} - \frac{ρ^{2} Φ^{- 1} (u)}{1 - ρ^{2}}\},$ $C_{ρ}^{Normal [2, 2]} (u, v) = \frac{2 π}{\sqrt{1 - ρ^{2}}} \exp \{\frac{(1 - 2 ρ^{2}) Φ^{- 1} {(u)}^{2}}{2 (1 - ρ^{2})} - \frac{(1 - 2 ρ^{2}) Φ^{- 1} {(v)}^{2}}{2 (1 - ρ^{2})} + \frac{ρ Φ^{- 1} (u) Φ^{- 1} (v)}{1 - ρ^{2}}\}$ $\times [\{\frac{ρ Φ^{- 1} (u)}{1 - ρ^{2}} - \frac{ρ^{2} Φ^{- 1} (v)}{1 - ρ^{2}}\} \{\frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}} - \frac{ρ^{2} Φ^{- 1} (u)}{1 - ρ^{2}}\} + \frac{ρ}{1 - ρ^{2}}],$ $C_{ρ}^{Normal [3, 1]} (u, v) = \frac{2 π}{\sqrt{1 - ρ^{2}}} \exp \{\frac{(2 - 3 ρ^{2}) Φ^{- 1} {(u)}^{2}}{2 (1 - ρ^{2})} - \frac{ρ^{2} Φ^{- 1} {(v)}^{2}}{2 (1 - ρ^{2})} + \frac{ρ Φ^{- 1} (u) Φ^{- 1} (v)}{1 - ρ^{2}}\}$ $\times [\{\frac{(1 - 2 ρ^{2}) Φ^{- 1} (u)}{1 - ρ^{2}} + \frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}}\} \{\frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}} - \frac{ρ^{2} Φ^{- 1} (u)}{1 - ρ^{2}}\} - \frac{ρ^{2}}{1 - ρ^{2}}],$ $C_{ρ}^{Normal [3, 2]} (u, v) = \frac{{(2 π)}^{3 / 2}}{\sqrt{1 - ρ^{2}}} \exp \{\frac{(2 - 3 ρ^{2}) Φ^{- 1} {(u)}^{2}}{2 (1 - ρ^{2})} - \frac{(1 - 2 ρ^{2}) Φ^{- 1} {(v)}^{2}}{2 (1 - ρ^{2})} + \frac{ρ Φ^{- 1} (u) Φ^{- 1} (v)}{1 - ρ^{2}}\}$ $\times [\{\frac{(1 - 2 ρ^{2}) Φ^{- 1} (u)}{1 - ρ^{2}} + \frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}}\} \{\frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}} - \frac{ρ^{2} Φ^{- 1} (u)}{1 - ρ^{2}}\} - \frac{ρ^{2}}{1 - ρ^{2}}$ $+ \frac{ρ}{1 - ρ^{2}} \{\frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}} + \frac{ρ^{2} Φ^{- 1} (u)}{1 - ρ^{2}}\} + \frac{ρ}{1 - ρ^{2}} \{\frac{(1 - 2 ρ^{2}) Φ^{- 1} (u)}{1 - ρ^{2}} + \frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}}\}],$ $C_{ρ}^{Normal [4, 1]} (u, v) = \frac{{(2 π)}^{3 / 2}}{\sqrt{1 - ρ^{2}}} \exp \{\frac{(3 - 4 ρ^{2}) Φ^{- 1} {(u)}^{2}}{2 (1 - ρ^{2})} - \frac{ρ^{2} Φ^{- 1} {(v)}^{2}}{2 (1 - ρ^{2})} + \frac{ρ Φ^{- 1} (u) Φ^{- 1} (v)}{1 - ρ^{2}}\}$ $\times [\{\frac{(1 - 2 ρ^{2}) Φ^{- 1} (u)}{1 - ρ^{2}} + \frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}}\} \{\frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}} - \frac{ρ^{2} Φ^{- 1} (u)}{1 - ρ^{2}}\} - \frac{ρ^{2}}{1 - ρ^{2}}$ $+ \frac{1 - 2 ρ^{2}}{1 - ρ^{2}} \{\frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}} + \frac{ρ^{2} Φ^{- 1} (u)}{1 - ρ^{2}}\} - \frac{ρ^{2}}{1 - ρ^{2}} \{\frac{(1 - 2 ρ^{2}) Φ^{- 1} (u)}{1 - ρ^{2}} + \frac{ρ Φ^{- 1} (v)}{1 - ρ^{2}}\}] .$

The FGM copula: $C_{θ}^{FGM [1, 1]} (u, v) = 1 + θ (1 - 2 u) (1 - 2 v), C_{θ}^{FGM [2, 1]} (u, v) = - 2 θ (1 - 2 v),$ $C_{θ}^{FGM [2, 2]} (u, v) = 4 θ, C_{θ}^{FGM [3, 1]} (u, v) = C_{θ}^{FGM [3, 2]} (u, v) = C_{θ}^{FGM [4, 1]} (u, v) = 0 .$

The Clayton copula: $C_{α}^{Clayton [j, k]} (u, v) = (α + 1) u^{- α - j} v^{- α - k} {(u^{- α} + v^{- α} - 1)}^{- 1 / α - (j + k)} ω_{j, k} (u, v),$ where $(j, k) \in \{(1, 1), (2, 1), (2, 2), (3, 1), (3, 2), (4, 1)\},$ $ω_{1, 1} (u, v) = 1, ω_{2, 1} (u, v) = α u^{- α} + (α + 1) (1 - v^{- α}),$ $ω_{2, 2} (u, v) = - α (α + 1) u^{- 2 α} - (α + 1) u^{- α} + (4 α^{2} + 3 α + 1) u^{- α} v^{- α} - α (α + 1) v^{- 2 α} - (α + 1) v^{- α} + {(α + 1)}^{2} .$ $ω_{3, 1} (u, v) = α (α - 1) u^{- 2 α} + (4 α - 1) (α + 1) (1 - v^{- α}) u^{- α} + (α + 1) (α + 2) {(1 - v^{- α})}^{2},$ $ω_{3, 2} (u, v) = - α (α - 1) (α + 1) u^{- 3 α} - (α + 1) (3 α^{2} + 4 α - 1) u^{- 2 α} + 3 (α - 1) {(α + 1)}^{2} u^{- α}$ $+ α (α + 1) (α + 2) v^{- 3 α} - (α - 1) (α + 1) (α + 2) v^{- 2 α} - (α + 1) {(α + 2)}^{2} v^{- α}$ $+ (11 α^{3} + 7 α^{2} + α - 1) u^{- 2 α} v^{- α} - (α + 1) (11 α^{2} + 5 α + 2) u^{- α} v^{- 2 α}$ $+ (α + 1) (8 α^{2} + 5 α + 5) u^{- α} v^{- α} + {(α + 1)}^{2} (α + 2) .$ $ω_{4, 1} (u, v) = α (α - 1) (α - 2) u^{- 3 α} + (α - 1) (α + 1) (11 α - 2) (1 - v^{- α}) u^{- 2 α}$ $+ (α + 1) (11 α^{2} + 14 α - 7) {(1 - v^{- α})}^{2} u^{- α} + (α + 1) (α + 2) (α + 3) {(1 - v^{- α})}^{3} .$

Appendix A.3. The Information Matrix under the Clayton Copula with Exponential Margins

The Clayton copula model with exponential margins is given as $P r (Y_{i 1} > y_{1}, Y_{i 2} > y_{2}) = C_{α_{i}}^{Clayton} \{{\overline{F}}_{1} (y_{1}), {\overline{F}}_{2} (y_{2})\} = {\exp (α_{i} λ_{1} y_{1}) + \exp (α_{i} λ_{2} y_{2}) - 1}^{- 1 / α_{i}} .$

Then, the joint density is $f_{i, λ}^{Clayton} (y_{1}, y_{2}) = \frac{\partial^{2}}{\partial y_{1} \partial y_{2}} P r (Y_{i 1} > y_{1}, Y_{i 2} > y_{2}) = \frac{(α_{i} + 1) λ_{1} λ_{2} e^{α_{i} λ_{1} y_{1} + α_{i} λ_{2} y_{2}}}{{(e^{α_{i} λ_{1} y_{1}} + e^{α_{i} λ_{2} y_{2}} - 1)}^{1 / α_{i} + 2}},$ where $λ = (λ_{1}, λ_{2})$ . The log-density is $\log f_{i, λ}^{Clayton} (y_{1}, y_{2}) = \log (α_{i} + 1) + \log λ_{1} + \log λ_{2} + α_{i} λ_{1} y_{1} + α_{i} λ_{2} y_{2} - (\frac{1}{α_{i}} + 2) \log (e^{α_{i} λ_{1} y_{1}} + e^{α_{i} λ_{2} y_{2}} - 1) .$

The first-order partial derivative of $\log f_{i, λ}^{Clayton} (y_{1}, y_{2})$ with respect to $λ_{j}$ is $\frac{\partial \log f_{i, λ}^{Clayton} (y_{1}, y_{2})}{\partial λ_{j}} = \frac{1}{λ_{j}} + α_{i} y_{j} - (2 α_{i} + 1) \frac{y_{j} e^{α_{i} λ_{j} y_{j}}}{e^{α_{i} λ_{1} y_{1}} + e^{α_{i} λ_{2} y_{2}} - 1}, j = 1, 2 .$

The second-order partial derivatives of $\log f_{α_{i}}^{Clayton} (y_{1}, y_{2})$ are (A1) $\frac{\partial^{2} \log f_{i, λ}^{Clayton} (y_{1}, y_{2})}{\partial λ_{j}^{2}} = - \frac{1}{λ_{j}^{2}} - α_{i} (2 α_{i} + 1) \{\frac{y_{j}^{2} e^{α_{i} λ_{j} y_{j}}}{e^{α_{i} λ_{1} y_{1}} + e^{α_{i} λ_{2} y_{2}} - 1} - \frac{y_{j}^{2} e^{2 α_{i} λ_{j} y_{j}}}{{(e^{α_{i} λ_{1} y_{1}} + e^{α_{i} λ_{2} y_{2}} - 1)}^{2}}\}, j = 1, 2,$ (A2) $\frac{\partial^{2} \log f_{i, λ}^{Clayton} (y_{1}, y_{2})}{\partial λ_{1} \partial λ_{2}} = α_{i} (2 α_{i} + 1) \frac{y_{1} y_{2} e^{α_{i} λ_{1} y_{1} + α_{i} λ_{2} y_{2}}}{{(e^{α_{i} λ_{1} y_{1}} + e^{α_{i} λ_{2} y_{2}} - 1)}^{2}} .$

Then, the Fisher information matrix is $I_{i, j k}^{Clayton} (λ) = E \{- \frac{\partial^{2} \log f_{i, λ}^{Clayton} (Y_{i})}{\partial λ_{j} \partial λ_{k}}\}, j, k = 1, 2 .$

For $j = k = 1$ , we have $I_{i, 11}^{Clayton} (λ) = E \{- \frac{\partial^{2} \log f_{i, λ}^{Clayton} (Y_{i})}{\partial λ_{1}^{2}}\}$ $= \frac{1}{λ_{1}^{2}} + \frac{α_{i} (2 α_{i} + 1)}{λ_{1}^{2}} [E (\frac{λ_{1}^{2} Y_{i 1}^{2} e^{α_{i} λ_{1} Y_{i 1}}}{e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1}) - E \{\frac{λ_{1}^{2} Y_{i 1}^{2} e^{2 α_{i} λ_{1} Y_{i 1}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\}] .$

To compute the first expectation, we consider the change in variable $λ_{j} y_{j} = x_{j}$ , $j = 1, 2$ . Then, $E (\frac{λ_{1}^{2} Y_{i 1}^{2} e^{α_{i} λ_{1} Y_{i 1}}}{e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1}) = (α_{i} + 1) \int_{0}^{\infty} \int_{0}^{\infty} \frac{λ_{1}^{3} y_{1}^{2} λ_{2} e^{2 α_{i} λ_{1} y_{1} + α_{i} λ_{2} y_{2}}}{{(e^{α_{i} y_{1}} + e^{α_{i} y_{2}} - 1)}^{1 / α_{i} + 3}} d y_{1} d y_{2}$ $= (α_{i} + 1) \int_{0}^{\infty} \int_{0}^{\infty} \frac{x_{1}^{2} e^{2 α_{i} x_{1} + α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d x_{1} d x_{2}$ $= (α_{i} + 1) \int_{0}^{\infty} x_{1}^{2} e^{2 α_{i} x_{1}} \int_{0}^{\infty} \frac{e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d x_{1} d x_{2} .$

For the inner integral, $\int_{0}^{\infty} \frac{e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d x_{2} = - \frac{1}{2 α_{i} + 1} \{{\frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}}|}_{0}^{\infty}\} = \frac{1}{2 α_{i} + 1} e^{- (2 α_{i} + 1) x_{1}} .$

It follows that $\int_{0}^{\infty} x_{1}^{2} e^{2 α_{i} x_{1}} \{\frac{1}{2 α_{i} + 1} e^{- (2 α_{i} + 1) x_{1}}\} d x_{1} = \frac{1}{2 α_{i} + 1} \int_{0}^{\infty} x_{1}^{2} e^{- x_{1}} d x_{1} = \frac{2}{2 α_{i} + 1} .$

Thus, we obtain $E (\frac{λ_{1}^{2} Y_{i 1}^{2} e^{α_{i} λ_{1} Y_{i 1}}}{e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1}) = \frac{2 (α_{i} + 1)}{2 α_{i} + 1} .$

The second expectation is computed as $E \{\frac{λ_{1}^{2} Y_{i 1}^{2} e^{2 α_{i} λ_{1} Y_{i 1}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\} = (α_{i} + 1) \int_{0}^{\infty} \int_{0}^{\infty} \frac{λ_{1}^{3} y_{1}^{2} λ_{2} e^{3 α_{i} λ_{1} y_{1} + α_{i} λ_{2} y_{2}}}{{(e^{α_{i} y_{1}} + e^{α_{i} y_{2}} - 1)}^{1 / α_{i} + 4}} d y_{1} d y_{2}$ $= (α_{i} + 1) \int_{0}^{\infty} \int_{0}^{\infty} \frac{x_{1}^{2} e^{3 α_{i} x_{1} + α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 4}} d x_{1} d x_{2}$ $= (α_{i} + 1) \int_{0}^{\infty} x_{1}^{2} e^{3 α_{i} x_{1}} \int_{0}^{\infty} \frac{e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 4}} d x_{1} d x_{2} .$

For the inner integral, $\int_{0}^{\infty} \frac{e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 4}} d x_{2} = - \frac{1}{3 α_{i} + 1} \{{\frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}}|}_{0}^{\infty}\} = \frac{1}{3 α_{i} + 1} e^{- (3 α_{i} + 1) x_{1}} .$

It follows that $\int_{0}^{\infty} x_{1}^{2} e^{3 α_{i} x_{1}} \{\frac{1}{3 α_{i} + 1} e^{- (3 α_{i} + 1) x_{1}}\} d x_{1} = \frac{1}{3 α_{i} + 1} \int_{0}^{\infty} x_{1}^{2} e^{- x_{1}} d x_{1} = \frac{2}{3 α_{i} + 1} .$

Thus, we obtain $E \{\frac{λ_{1}^{2} Y_{i 1}^{2} e^{2 α_{i} λ_{1} Y_{i 1}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\} = \frac{2 (α_{i} + 1)}{3 α_{i} + 1} .$

Combining the above results, we have $I_{i, 11}^{Clayton} (λ) = \frac{1}{λ_{1}^{2}} + \frac{1}{λ_{1}^{2}} \frac{2 α_{i}^{2} (α_{i} + 1)}{3 α_{i} + 1} .$

In a similar fashion, for $j = k = 2$ , we also have $I_{i, 22}^{Clayton} (λ) = \frac{1}{λ_{2}^{2}} + \frac{1}{λ_{2}^{2}} \frac{2 α_{i}^{2} (α_{i} + 1)}{3 α_{i} + 1} .$

For $j = 1$ , $k = 2$ , we have $I_{i, 12}^{Clayton} (λ) = E \{- \frac{\partial^{2} \log f_{i, λ}^{Clayton} (Y_{i})}{\partial λ_{1} \partial λ_{2}}\} = - (\frac{1}{α_{i}} + 2) E \{\frac{λ_{1} λ_{2} Y_{i 1} Y_{i 2} e^{α_{i} λ_{1} Y_{i 1} + α_{i} λ_{2} Y_{i 2}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\}$ $= - \frac{α_{i} (2 α_{i} + 1)}{λ_{1} λ_{2}} E \{\frac{λ_{1} λ_{2} Y_{i 1} Y_{i 2} e^{α_{i} λ_{1} Y_{i 1} + α_{i} λ_{2} Y_{i 2}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\} .$

We consider $E \{\frac{λ_{1} λ_{2} Y_{i 1} Y_{i 2} e^{α_{i} λ_{1} Y_{i 1} + α_{i} λ_{2} Y_{i 2}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\} = (α_{i} + 1) \int_{0}^{\infty} \int_{0}^{\infty} \frac{λ_{1}^{2} λ_{2}^{2} y_{1} y_{2} e^{2 α_{i} λ_{1} y_{1} + 2 α_{i} λ_{2} y_{2}}}{{(e^{α_{i} y_{1}} + e^{α_{i} y_{2}} - 1)}^{1 / α_{i} + 4}} d y_{1} d y_{2}$ $= (α_{i} + 1) \int_{0}^{\infty} \int_{0}^{\infty} \frac{x_{1} x_{2} e^{2 α_{i} x_{1} + 2 α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 4}} d x_{1} d x_{2}$ $= (α_{i} + 1) \int_{0}^{\infty} x_{1} e^{2 α_{i} x_{1}} \int_{0}^{\infty} \frac{x_{2} e^{2 α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 4}} d x_{1} d x_{2} .$

For the inner integral, $\int_{0}^{\infty} \frac{x_{2} e^{2 α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 4}} d x_{1}$ $= - \frac{1}{3 α_{i} + 1} \int_{0}^{\infty} x_{2} e^{α_{i} x_{2}} d \{\frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}}\}$ $= - \frac{1}{3 α_{i} + 1} \{{\frac{x_{2} e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}}|}_{0}^{\infty} - \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d (x_{2} e^{α_{i} x_{2}})\}$ $= \frac{1}{3 α_{i} + 1} \int_{0}^{\infty} \frac{e^{α_{i} x_{2}} + α_{i} x_{2} e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d x_{2}$ $= \frac{1}{3 α_{i} + 1} \int_{0}^{\infty} \frac{e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d x_{2} + \frac{α_{i}}{3 α_{i} + 1} \int_{0}^{\infty} \frac{x_{2} e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d x_{2},$ where the second equality follows from integration by parts. For the first integral, $\frac{1}{3 α_{i} + 1} \int_{0}^{\infty} \frac{e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d x_{2}$ $= - \frac{1}{(3 α_{i} + 1) (2 α_{i} + 1)} ({\frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}}|}_{0}^{\infty})$ $= \frac{1}{(3 α_{i} + 1) (2 α_{i} + 1)} e^{- 2 (α_{i} + 1) x_{1}} .$

The second integral is $\frac{α_{i}}{3 α_{i} + 1} \int_{0}^{\infty} \frac{x_{2} e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 3}} d x_{2}$ $= - \frac{α_{i}}{(3 α_{i} + 1) (2 α_{i} + 1)} \int_{0}^{\infty} x_{2} d (\frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}})$ $= - \frac{α_{i}}{(3 α_{i} + 1) (2 α_{i} + 1)} \{{\frac{x_{2}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}}|}_{0}^{\infty} - \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}} d x_{2}\}$ $= \frac{α_{i}}{(3 α_{i} + 1) (2 α_{i} + 1)} \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}} d x_{2},$ where the second equality follows from integration by parts. Now, the expectation becomes $E \{\frac{λ_{1} λ_{2} Y_{i 1} Y_{i 2} e^{α_{i} λ_{1} Y_{i 1} + α_{i} λ_{2} Y_{i 2}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\}$ $= \frac{α_{i} + 1}{(3 α_{i} + 1) (2 α_{i} + 1)} \int_{0}^{\infty} x_{1} e^{- x_{1}} d x_{1} + \frac{α_{i} (α_{i} + 1)}{(3 α_{i} + 1) (2 α_{i} + 1)} \int_{0}^{\infty} \int_{0}^{\infty} \frac{x_{1} e^{2 α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}} d x_{1} d x_{2},$ $= \frac{α_{i} + 1}{(3 α_{i} + 1) (2 α_{i} + 1)} + \frac{α_{i} (α_{i} + 1)}{(3 α_{i} + 1) (2 α_{i} + 1)} \int_{0}^{\infty} \int_{0}^{\infty} \frac{x_{1} e^{2 α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}} d x_{1} d x_{2} .$

For the integral in the above expression, we consider its inner integral, $\int_{0}^{\infty} \frac{x_{1} e^{2 α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 2}} d x_{1}$ $= - \frac{1}{α_{i} + 1} \int_{0}^{\infty} x_{1} e^{α_{i} x_{1}} d \{\frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 1}}\}$ $= - \frac{1}{α_{i} + 1} \{{\frac{x_{1} e^{α_{i} x_{1}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 1}}|}_{0}^{\infty} - \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 1}} d (x_{1} e^{α_{i} x_{1}})\}$ $= \frac{1}{α_{i} + 1} \int_{0}^{\infty} \frac{e^{α_{i} x_{2}} + α_{i} x_{1} e^{α_{i} x_{1}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 1}} d x_{1}$ $= \frac{1}{α_{i} + 1} \int_{0}^{\infty} \frac{e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 1}} d x_{1} + \frac{α_{i}}{α_{i} + 1} \int_{0}^{\infty} \frac{x_{1} e^{α_{i} x_{1}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 1}} d x_{1},$ where the second last equality follows from integration by parts. We compute the above two integrals separately. We have $\frac{1}{α_{i} + 1} \int_{0}^{\infty} \frac{e^{α_{i} x_{2}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 1}} d x_{1} = - \frac{1}{α_{i} + 1} \{{\frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}}|}_{0}^{\infty}\} = \frac{1}{α_{i} + 1} e^{- x_{2}} .$

On the other hand, we have $\frac{α_{i}}{α_{i} + 1} \int_{0}^{\infty} \frac{x_{1} e^{α_{i} x_{1}}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i} + 1}} d x_{1}$ $= - \frac{α_{i}}{α_{i} + 1} \int_{0}^{\infty} x_{1} d \{\frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}}\}$ $= - \frac{α_{i}}{α_{i} + 1} \{{\frac{x_{1}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}}|}_{0}^{\infty} - \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}} d x_{1}\}$ $= \frac{α_{i}}{α_{i} + 1} \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}} d x_{1} .$

Then, $\int_{0}^{\infty} \{\frac{1}{α_{i} + 1} e^{- x_{2}} + \frac{α_{i}}{α_{i} + 1} \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}} d x_{1}\} d x_{2} = \frac{1}{α_{i} + 1} + \frac{α_{i}}{α_{i} + 1} \int_{0}^{\infty} \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}} d x_{1} d x_{2} .$

Combine all the results, one has $E \{\frac{λ_{1} λ_{2} Y_{i 1} Y_{i 2} e^{α_{i} λ_{1} Y_{i 1} + α_{i} λ_{2} Y_{i 2}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\}$ $= \frac{α_{i} + 1}{(3 α_{i} + 1) (2 α_{i} + 1)} + \frac{α_{i}}{(3 α_{i} + 1) (2 α_{i} + 1)} + \frac{α_{i}^{2}}{(3 α_{i} + 1) (2 α_{i} + 1)} \int_{0}^{\infty} \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}} d x_{1} d x_{2}$ $= \frac{1}{3 α_{i} + 1} + \frac{α_{i}^{2}}{(3 α_{i} + 1) (2 α_{i} + 1)} \int_{0}^{\infty} \int_{0}^{\infty} \frac{1}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}} d x_{1} d x_{2} .$

Let $α_{i} x_{1} = s$ and $α_{i} x_{2} = t$ , according to [52], we have $\int_{0}^{\infty} \int_{0}^{\infty} \frac{α_{i}^{2}}{{(e^{α_{i} x_{1}} + e^{α_{i} x_{2}} - 1)}^{1 / α_{i}}} d x_{1} d x_{2} = \int_{0}^{\infty} \int_{0}^{\infty} \frac{1}{{(e^{s} + e^{t} - 1)}^{1 / α_{i}}} d s d t = \frac{1}{2} \{Ψ (\frac{1}{2 α_{i}}) - Ψ (\frac{α_{i} + 1}{2 α_{i}})\} .$

Hence, we obtain $E \{\frac{λ_{1} λ_{2} Y_{i 1} Y_{i 2} e^{α_{i} λ_{1} Y_{i 1} + α_{i} λ_{2} Y_{i 2}}}{{(e^{α_{i} λ_{1} Y_{i 1}} + e^{α_{i} λ_{2} Y_{i 2}} - 1)}^{2}}\} = \frac{1}{3 α_{i} + 1} + \frac{1}{2 (3 α_{i} + 1) (2 α_{i} + 1)} \{Ψ (\frac{1}{2 α_{i}}) - Ψ (\frac{α_{i} + 1}{2 α_{i}})\} .$

Finally, combining the above results, we have $I_{i, 12}^{Clayton} (λ) = - \frac{α_{i} (2 α_{i} + 1)}{λ_{1} λ_{2}} ϕ (α_{i}),$ where $ϕ (α) = \frac{1}{3 α_{i} + 1} + \frac{1}{2 (3 α + 1) (2 α + 1)} \{Ψ (\frac{1}{2 α}) - Ψ (\frac{α + 1}{2 α})\} .$

References

1. Gleser, L.J.; Olkin, L. Stochastically dependent effect sizes. The Handbook of Research Synthesis; Russel Sage Foundation: New York, NY, USA, 1994.

2. Riley, R.D. Multivariate meta-analysis: The effect of ignoring within-study correlation. J. R. Stat. Soc. Ser. A Stat. Soc.; 2009; 172, pp. 789-811. [DOI: https://dx.doi.org/10.1111/j.1467-985X.2008.00593.x]

3. Shih, J.-H.; Konno, Y.; Chang, Y.-T.; Emura, T. Estimation of a common mean vector in bivariate meta-analysis under the FGM copula. Statistics; 2019; 53, pp. 673-695. [DOI: https://dx.doi.org/10.1080/02331888.2019.1581782]

4. Nissen, S.E.; Wolski, K. Effect of Rosiglitazone on the Risk of Myocardial Infarction and Death from Cardiovascular Causes. N. Engl. J. Med.; 2007; 356, pp. 2457-2471. [DOI: https://dx.doi.org/10.1056/NEJMoa072761] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17517853]

5. Yamaguchi, Y.; Maruo, K. Bivariate beta-binomial model using Gaussian copula for bivariate meta-analysis of two binary outcomes with low incidence. Jpn. J. Stat. Data Sci.; 2019; 2, pp. 347-373. [DOI: https://dx.doi.org/10.1007/s42081-019-00037-z]

6. Mavridis, D.; Salanti, G. A practical introduction to multivariate meta-analysis. Stat. Methods Med. Res.; 2011; 22, pp. 133-158. [DOI: https://dx.doi.org/10.1177/0962280211432219]

7. Copas, J.B.; Jackson, D.; White, I.; Riley, R.D. The role of secondary outcomes in multivariate meta-analysis. J. R. Stat. Soc. Ser. C Appl. Stat.; 2018; 67, pp. 1177-1205. [DOI: https://dx.doi.org/10.1111/rssc.12274]

8. Burzykowski, T.; Molenberghs, G.; Buyse, M.; Geys, H.; Renard, D. Validation of surrogate end points in multiple randomized clinical trials with failure time end points. J. R. Stat. Soc. Ser. C Appl. Stat.; 2001; 50, pp. 405-422. [DOI: https://dx.doi.org/10.1111/1467-9876.00244]

9. Rotolo, F.; Paoletti, X.; Michiels, S. surrosurv: An R package for the evaluation of failure time surrogate endpoints in individual patient data meta-analysis of randomized clinical trials. Comput. Methods Programs Biomed.; 2018; 155, pp. 189-198. [DOI: https://dx.doi.org/10.1016/j.cmpb.2017.12.005]

10. Rotolo, F.; Paoletti, X.; Burzykowski, T.; Buyse, M.; Michiels, S. A Poisson approach to the validation of failure time surrogate endpoints in individual patient data meta-analyses. Stat. Methods Med. Res.; 2019; 28, pp. 170-183. [DOI: https://dx.doi.org/10.1177/0962280217718582]

11. Emura, T.; Sofeu, C.L.; Rondeau, V. Conditional copula models for correlated survival endpoints: Individual patient data meta-analysis of randomized controlled trials. Stat. Methods Med. Res.; 2021; 30, pp. 2634-2650. [DOI: https://dx.doi.org/10.1177/09622802211046390]

12. Kuss, O.; Hoyer, A.; Solms, A. Meta-analysis for diagnostic accuracy studies: A new statistical model using beta-binomial dis-tributions and bivariate copulas. Stat. Med.; 2014; 33, pp. 17-30. [DOI: https://dx.doi.org/10.1002/sim.5909]

13. Nikoloulopoulos, A.K. A vine copula mixed effect model for trivariate meta-analysis of diagnostic test accuracy studies ac-counting for disease prevalence. Stat. Methods Med. Res.; 2017; 26, pp. 2270-2286. [DOI: https://dx.doi.org/10.1177/0962280215596769]

14. Nikoloulopoulos, A.K. A D-vine copula mixed model for joint meta-analysis and comparison of diagnostic tests. Stat. Methods Med. Res.; 2018; 28, pp. 3286-3300. [DOI: https://dx.doi.org/10.1177/0962280218796685]

15. Nikoloulopoulos, A.K. A multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic studies in the presence of non-evaluable subjects. Stat. Methods Med. Res.; 2020; 29, pp. 2988-3005. [DOI: https://dx.doi.org/10.1177/0962280220913898]

16. Takeuchi, T.T. Constructing a bivariate distribution function with given marginals and correlation: Application to the galaxy luminosity function. Mon. Not. R. Astron. Soc.; 2010; 406, pp. 1830-1840. [DOI: https://dx.doi.org/10.1111/j.1365-2966.2010.16778.x]

17. Ota, S.; Kimura, M. Effective estimation algorithm for parameters of multivariate Farlie–Gumbel–Morgenstern copula. Jpn. J. Stat. Data Sci.; 2021; 4, pp. 1049-1078. [DOI: https://dx.doi.org/10.1007/s42081-021-00118-y]

18. Ghosh, S.; Sheppard, L.W.; Holder, M.T.; Loecke, T.D.; Reid, P.C.; Bever, J.D.; Reuman, D.C. Copulas and their potential for ecology. Adv. Ecol. Res.; 2020; 62, pp. 409-468. [DOI: https://dx.doi.org/10.1016/bs.aecr.2020.01.003]

19. Alidoost, F.; Stein, A.; Su, Z. The use of bivariate copulas for bias correction of reanalysis air temperature data. PLoS ONE; 2019; 14, e0216059. [DOI: https://dx.doi.org/10.1371/journal.pone.0216059]

20. Bhatti, M.I.; Do, H.Q. Recent development in copula and its applications to the energy, forestry and environmental sciences. Int. J. Hydrog. Energy; 2019; 44, pp. 19453-19473. [DOI: https://dx.doi.org/10.1016/j.ijhydene.2019.06.015]

21. Emura, T.; Chen, Y.H. Analysis of Survival Data with Dependent Censoring: Copula-Based Approaches; JSS Research Series in Statistics; Springer: Singapore, 2018.

22. Emura, T.; Matsui, S.; Rondeau, V. Survival Analysis with Correlated Endpoints, Joint Frailty-Copula Models; JSS Research Series in Statistics; Springer: Singapore, 2019.

23. Emura, T.; Shih, J.-H.; Ha, I.D.; Wilke, R.A. Comparison of the marginal hazard model and the sub-distribution hazard model for competing risks under an assumed copula. Stat. Methods Med. Res.; 2020; 29, pp. 2307-2327. [DOI: https://dx.doi.org/10.1177/0962280219892295]

24. Peng, M.; Xiang, L.; Wang, S. Semiparametric regression analysis of clustered survival data with semi-competing risks. Comput. Stat. Data Anal.; 2018; 124, pp. 53-70. [DOI: https://dx.doi.org/10.1016/j.csda.2018.02.003]

25. Huang, X.-W.; Wang, W.; Emura, T. A copula-based Markov chain model for serially dependent event times with a dependent terminal event. Jpn. J. Stat. Data Sci.; 2020; 4, pp. 917-951. [DOI: https://dx.doi.org/10.1007/s42081-020-00087-8]

26. Shih, J.H. Common Mean. Copula: Bivariate Common Mean Vector under Copula Models; CRAN. 2022; Available online: https://CRAN.R-project.org/package=CommonMean.Copula (accessed on 8 January 2022).

27. Berkey, C.S.; Hoaglin, D.C.; Antczak-bouckoms, A.; Mosteller, F.; Colditz, G.A. Meta-analysis of multiple outcomes by regression with random effect. Stat. Med.; 1998; 17, pp. 2537-2550. [DOI: https://dx.doi.org/10.1002/(SICI)1097-0258(19981130)17:22<2537::AID-SIM953>3.0.CO;2-C]

28. Shinozaki, N. A note on estimating the common mean of k normal distributions and the stein problem. Commun. Stat.-Theory Methods; 1978; 7, pp. 1421-1432. [DOI: https://dx.doi.org/10.1080/03610927808827724]

29. Malekzadeh, A.; Kharrati-Kopaei, M. Inferences on the common mean of several normal populations under hetero-scedasticity. Comput. Stat.; 2018; 33, pp. 1367-1384. [DOI: https://dx.doi.org/10.1007/s00180-017-0789-0]

30. Gasparrini, A. Mvmeta: Multivariate and Univariate Meta-Analysis and Meta-Regression; CRAN. 2019; Available online: https://CRAN.R-project.org/package=mvmeta (accessed on 8 January 2022).

31. Nelsen, R. An Introduction to Copulas. Technometrics. Lett.; 2000; 42, 317. [DOI: https://dx.doi.org/10.2307/1271100]

32. Durante, F.; Sempi, C. Principles of Copula Theory; CRC/Chapman & Hall: Boca Raton, FL, USA, 2016.

33. Morgenstern, D. Einfache Beispiele zweidimensionaler Verteilungen. Mitt. Für Math. Stat.; 1956; 8, pp. 34-235.

34. Bairamov, I.G.; Kotz, S.; Bekci, M. New generalized Farlie-Gumbel-Morgenstern distributions and concomitants of order statistics. J. Appl. Stat.; 2001; 28, pp. 521-536. [DOI: https://dx.doi.org/10.1080/02664760120047861]

35. Bairamov, I.; Kotz, S. Dependence structure and symmetry of Hunag-Kotz FGM distributions and their extensions. Metrika; 2002; 56, pp. 55-72. [DOI: https://dx.doi.org/10.1007/s001840100158]

36. Amini, M.; Jabbari, H.; Borzadaran, G.R.M. Aspects of Dependence in Generalized Farlie-Gumbel-Morgenstern Distributions. Commun. Stat.-Simul. Comput.; 2011; 40, pp. 1192-1205. [DOI: https://dx.doi.org/10.1080/03610918.2011.568149]

37. Domma, F.; Giordano, S. A copula-based approach to account for dependence in stress-strength models. Stat. Pap.; 2012; 54, pp. 807-826. [DOI: https://dx.doi.org/10.1007/s00362-012-0463-0]

38. Chesneau, C. A new two-dimensional relation copula inspiring generalized version of the Farlie-Gumbel-Morgenstern copula. Res. Commun. Math. Math. Sci.; 2021; 13, pp. 99-128.

39. Clayton, D.G. A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika; 1978; 65, pp. 141-151. [DOI: https://dx.doi.org/10.1093/biomet/65.1.141]

40. Duchateau, L.; Janssen, P. The Frailty Model; Springer: New York, NY, USA, 2007.

41. Gumbel, E.J. Distributions de valeurs extrêmes en plusieurs dimensions. Publ. Inst. Statist. Univ. Paris; 1960; 9, pp. 171-173.

42. Frank, M.J. On the simultaneous associativity of F(x, y) and x+y−F(x, y). Aequ. Math.; 1979; 19, pp. 194-226. [DOI: https://dx.doi.org/10.1007/BF02189866]

43. Sklar, A. Fonctions de répartition à n dimensions et leurs marges. Publ. Inst. Statist. Univ. Paris; 1959; 8, pp. 229-231.

44. Schucany, W.R.; Parr, W.C.; Boyer, J.E. Correlation structure in Farlie-Gumbel-Morgenstern distributions. Biometrika; 1978; 65, pp. 650-653. [DOI: https://dx.doi.org/10.1093/biomet/65.3.650]

45. Nelsen, R.B. Dependence and Order in Families of Archimedean Copulas. J. Multivar. Anal.; 1997; 60, pp. 111-122. [DOI: https://dx.doi.org/10.1006/jmva.1996.1646]

46. Bradley, R.A.; Gart, J.J. The asymptotic properties of ML estimators when sampling from associated populations. Biometrika; 1962; 49, pp. 205-214. [DOI: https://dx.doi.org/10.1093/biomet/49.1-2.205]

47. Emura, T.; Hu, Y.H.; Konno, Y. Asymptotic inference for maximum likelihood estimators under the special exponential family with double-truncation. Stat. Pap.; 2017; 58, pp. 877-909. [DOI: https://dx.doi.org/10.1007/s00362-015-0730-y]

48. Shih, J.H. Copula-Based Statistical Inferences for a Common Mean Vector and Correlation Ratios Using Bivariate Data. Ph.D. Thesis; National Central University Library: Taoyuan, Taiwan, 2020; Available online: https://etd.lib.nctu.edu.tw/cgi-bin/gs32/ncugsweb.cgi/ccd=GLZeNP/record?r1=2&h1=2 (accessed on 3 October 2020).

49. Shao, J. Mathematical Statistics; Springer: New York, NY, USA, 2003.

50. Van der Vaart, A.W. Asymptotic Statistics; Cambridge University Press: Cambridge, UK, 1998.

51. Lehmann, E.L.; Casella, G. Theory of Point Estimation; 2nd ed. Springer: New York, NY, USA, 1998.

52. Kontopantelis, E.; Reeves, D. Performance of statistical methods for meta-analysis when true study effects are non-normally distributed: A simulation study. Stat. Methods Med Res.; 2010; 21, pp. 409-426. [DOI: https://dx.doi.org/10.1177/0962280210392008]

53. Jackson, D.; White, I.R.; Riley, R.D. A matrix-based method of moments for fitting the multivariate random effects model for meta-analysis and meta-regression. Biom. J.; 2013; 55, pp. 231-245. [DOI: https://dx.doi.org/10.1002/bimj.201200152]

54. Schepsmeier, U.; Stöber, J. Derivatives and Fisher information of bivariate copulas. Stat. Pap.; 2013; 55, pp. 525-542. [DOI: https://dx.doi.org/10.1007/s00362-013-0498-x]

55. Oakes, D. A Model for Association in Bivariate Survival Data. J. R. Stat. Soc. Ser. B Stat. Methodol.; 1982; 44, pp. 414-422. [DOI: https://dx.doi.org/10.1111/j.2517-6161.1982.tb01222.x]

56. Nakatochi, M.; Kanai, M.; Nakayama, A.; Hishida, A.; Kawamura, Y.; Ichihara, S.; Matsuo, H. Genome-wide me-ta-analysis identifies multiple novel loci associated with serum uric acid levels in Japanese individuals. Commun. Biol.; 2019; 2, pp. 1-10. [DOI: https://dx.doi.org/10.1038/s42003-019-0339-0]

57. Taketomi, N.; Konno, Y.; Chang, Y.-T.; Emura, T. A Meta-Analysis for Simultaneously Estimating Individual Means with Shrinkage, Isotonic Regression and Pretests. Axioms; 2021; 10, 267. [DOI: https://dx.doi.org/10.3390/axioms10040267]

58. Taketomi, N.; Michimae, H.; Chang, Y.-T.; Emura, T. meta.shrinkage: An R package for meta-analyses for simultaneously estimating individual means. Algorithms; 2022; accepted

Word count: 9194

Show less

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Traditional bivariate meta-analyses adopt the bivariate normal model. As the bivariate normal distribution produces symmetric dependence, it is not flexible enough to describe the true dependence structure of real meta-analyses. As an alternative to the bivariate normal model, recent papers have adopted “copula” models for bivariate meta-analyses. Copulas consist of both symmetric copulas (e.g., the normal copula) and asymmetric copulas (e.g., the Clayton copula). While copula models are promising, there are only a few studies on copula-based bivariate meta-analysis. Therefore, the goal of this article is to fully develop the methodologies and theories of the copula-based bivariate meta-analysis, specifically for estimating the common mean vector. This work is regarded as a generalization of our previous methodological/theoretical studies under the FGM copula to a broad class of copulas. In addition, we develop a new R package, “CommonMean.Copula”, to implement the proposed methods. Simulations are performed to check the proposed methods. Two real dataset are analyzed for illustration, demonstrating the insufficiency of the bivariate normal model.

Details

Title

Copula-Based Estimation Methods for a Common Mean Vector for Bivariate Meta-Analyses

Author

Jia-Han, Shih¹; Konno, Yoshihiko²; Yuan-Tsung, Chang³; Emura, Takeshi⁴

¹ Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan; [email protected]
² Department of Mathematical and Physical Sciences, Japan Women’s University, Tokyo 112-8681, Japan; [email protected]
³ Department of Social Information, Mejiro University, Tokyo 161-8539, Japan; [email protected]
⁴ Biostatistics Center, Kurume University, Kurume, Fukuoka 830-0011, Japan

First page

186

Publication year

2022

Publication date

2022

Publisher

MDPI AG

e-ISSN

20738994

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.3390/sym14020186

ProQuest document ID

2633192578

Copula-Based Estimation Methods for a Common Mean Vector for Bivariate Meta-Analyses

Jump to:

Full text

Abstract

Details

Suggested sources