Bayesian Uncertainty Quantification for

1. Introduction

The distribution of subsurface properties is mainly controlled by the location of distinct geologic facies with sharp contrasts in properties, such as permeability and porosity, across facies boundaries [1]. For example, in a fluvial setting, high permeability channel sands are often embedded in a nearly impermeable background, causing the dominant fluid movement to be restricted within these channels. Under such conditions, the channel geometry plays an important role in determining the flow behavior in the subsurface. Consequently, in predicting the flow through highly heterogeneous porous formations, it is important to model facies boundaries accurately and to properly account for the uncertainties in these models. Traditional geostatistical techniques for subsurface characterization have typically relied on two-point correlation functions to describe the spatial variability. Such spatial fields do not reproduce discrete and irregular geologic features (geo-bodies) such as fluvial channels [2,3,4]. The success of object-based models, such as discrete Boolean models [5], is heavily dependent on the parameters that specify the object size, shape, proportion, and orientation. Typically, these parameters are highly uncertain, particularly in the early stages of subsurface characterization [2,6]. For example, in a channel type environment, the channel sands may be observed at only a few well locations. There are many plausible channel geometries that will satisfy the channel sand and well intersections. Thus, the stochastic models for channels will require the specification of random variables that govern the channel boundaries. All the parameters have considerable uncertainty associated with them and will impact fluid flow in the subsurface. A considerable amount of prior information is typically available for building the facies models for fluid flow simulation [1].
These include well logs and cores, seismic data, and geologic conceptualization based on outcrops and analogs. Although prior information plays a vital role in reducing uncertainty and preserving geologic realism, it is imperative that the geologic models reproduce the dynamic response based on flow and transport data. In the last decade, significant progress has been made in conditioning pixel-based geologic models to flow and transport data [4,7,8,9,10,11,12]. The approach typically involves the solution of an inverse problem requiring the minimization of a suitably defined objective function. Both gradient-based methods and combinatorial optimization methods have been used for this purpose. The existing approaches are not readily applicable to facies-based models where the primary goal is to locate the facies boundaries and preserve the contrast in facies properties.

In this article, we consider a Bayesian approach where the solution to the inverse problem is given by the posterior distribution of the subsurface properties given the dynamic response on flow and transport data. The Bayesian approach has a natural mechanism of regularization in the form of prior information and provides quantitative assessments of the uncertainties of important model parameters. This provides a better understanding of their direct and indirect effects on the response of the physical system. Such Bayesian models have been used for uncertainty quantification in subsurface models [13,14,15,16,17], and also in other areas such as seismic modeling [18,19]. Here we use a Bayesian hierarchical model that preserves the facies architecture, at the same time populating the petrophysical properties within the facies in a geologically consistent manner by incorporating available static and dynamic information. To maintain the contrast in facies properties, we represent the facies boundaries using level sets, which provide a systematic method for morphing the facies shapes to reconstruct a wide variety of facies geometries [20,21,22,23]. Although level sets have recently been used to represent facies boundaries [24], the novelty here is that we employ an efficient Bayesian hierarchical uncertainty quantification technique to perturb the facies boundaries and properties to match the dynamic response such as multiphase production history. The description of the facies boundaries in our level set approach is based on a parameterization of the pseudo-velocity fields that deform the interfaces. We will mostly focus on smooth interfaces that require smooth velocity fields in the level set method. The space of smooth velocity fields can be parameterized with fewer parameters, thus providing us with a smaller dimensional uncertainty space to explore.

The dynamic flow and transport data that we consider in our simulations are the fractional flow data, i.e., the fraction of water produced relative to the total production rate. A significant part of the computational expense in any dynamic data integration method is the modeling of flow and transport through high-resolution geologic models. To precondition these simulations, we adopt a multi-stage MCMC method that minimizes the number of fine-scale flow simulations during MCMC sampling. In this approach, simplified models based on the mixed multiscale finite element method (MsFEM) are used to screen the proposals before running detailed fine-scale simulations. Note that our forward model consists of coupled flow and transport equations. The mixed MsFEM is used to solve the flow equation on a coarse grid, and the resulting coarse-grid velocity field is then used to compute the fractional flow.

A major part of this article is devoted to studying the regularity of the posterior measure with respect to the prior measure. In particular, we estimate the difference in the expectation of a function with respect to the full and truncated posterior distributions. Here, the full posterior distribution refers to the posterior computed using the entire parameter space, while the truncated posterior distribution refers to the posterior computed using a truncated parameter space. The error in the fractional flow (the quantity that is often measured) is obtained in terms of the truncation error in the Karhunen–Loève (K-L) expansion. In particular, we show that the error is proportional to the truncation error in the K-L expansion. Moreover, we show that the constants in these error estimates are independent of the dimension of the parameter space. This independence is very important for our application, as the dimension of the parameter space without K-L truncation, which depends on the discretization of the domain, can be very large. Convergence of MCMC methods on such a high dimensional parameter space is computationally infeasible. The results confirm the validity of using a reduced dimensional parameterization in this context, for which the MCMC methods become feasible. Numerical results are also presented in support of the theoretical bounds of the posterior error due to the truncation. For efficient sampling of the posterior, we use a two-stage MCMC method that utilizes an inexpensive mixed MsFEM on an upscaled coarse field to screen out bad proposals before they are evaluated in the final stage, which needs a more expensive solver.

The paper is organized in the following way. In Section 2, we discuss the Bayesian inverse problem setup and prior parameterization. Section 3 is devoted to the estimation of the posterior error due to the truncation in prior parameterization. In Section 4, we briefly describe the sampling algorithms. Numerical results are presented in Section 5.

2. Geological Models, Parameterization, and the Bayesian Inverse Problem

2.1. Fine and Coarse Models

We consider two-phase flow in a subsurface formation (denoted by $Ω$ ) under the assumption that the displacement is dominated by viscous effects. For clarity of exposition, we neglect the effects of gravity, compressibility, and capillary pressure, although our proposed approach is independent of the choice of physical mechanisms. Porosity will also be considered to be constant. The two phases will be referred to as water and oil (or a non-aqueous phase liquid), designated by subscripts w and o, respectively. We write Darcy’s law for each phase as:

(1) $v_{j} = - \frac{k_{r j} (S)}{μ_{j}} k \nabla p,$

where $v_{j}$ is the phase velocity, k is the permeability tensor, $k_{r j}$ is the relative permeability to phase j ($j = o, w$), S is the water saturation (volume fraction), and p is the pressure. Combining Darcy’s law with a statement of conservation of mass allows us to express the governing equations in terms of pressure and saturation equations:

(2) $\nabla \cdot (λ (S) k \nabla p) = Q_{s},$

(3) $\frac{\partial S}{\partial t} + v \cdot \nabla f (S) = 0,$

where $λ$ is the total mobility, $Q_{s}$ is a source term, f is the fractional flux of water, and $v$ is the total velocity, which are respectively given by:

(4) $λ (S) = \frac{k_{r w} (S)}{μ_{w}} + \frac{k_{r o} (S)}{μ_{o}},$

(5) $f (S) = \frac{k_{r w} (S) / μ_{w}}{k_{r w} (S) / μ_{w} + k_{r o} (S) / μ_{o}},$

(6) $v = v_{w} + v_{o} = - λ (S) k \cdot \nabla p .$

The above descriptions are referred to as the fine-scale model of the two-phase flow problem.
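As a small illustration of Equations (4) and (5), the sketch below evaluates the total mobility and the fractional flux of water. The quadratic relative permeabilities $k_{r w} = S^{2}$, $k_{r o} = (1 - S)^{2}$ and the viscosity values are assumptions made only for this example; the text does not specify them.

```python
def total_mobility(s, mu_w=1.0, mu_o=5.0):
    """Total mobility, Eq. (4), with ASSUMED quadratic relative permeabilities."""
    krw = s ** 2            # assumed k_rw(S) = S^2
    kro = (1.0 - s) ** 2    # assumed k_ro(S) = (1 - S)^2
    return krw / mu_w + kro / mu_o


def fractional_flux(s, mu_w=1.0, mu_o=5.0):
    """Fractional flux of water, Eq. (5): water mobility over total mobility."""
    krw = s ** 2
    return (krw / mu_w) / total_mobility(s, mu_w, mu_o)


# Evaluate f(S) on a few saturations between 0 and 1.
saturations = [i / 10 for i in range(11)]
flux = [fractional_flux(s) for s in saturations]
```

With these assumed curves, `flux` rises monotonically from 0 at $S = 0$ to 1 at $S = 1$, which is the qualitative behavior expected of $f (S)$.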

As for the coarse-scale model, we consider single-phase flow-based multiscale simulation methods. This technique is similar to upscaling methods [25], except that instead of computing effective properties, multiscale basis functions are calculated. These basis functions are coupled through a variational formulation of the problem. For multi-phase flow and transport simulations, the conservative fine-scale velocity is often needed. For this reason, mixed MsFEM is used. We refer to [26,27] for the mixed MsFEM method and its use in two-phase flow and transport. In our simulations, the multiscale basis functions are computed for the velocity once with $λ = 1$ . These basis functions are used later without any update for solving two-phase flow equations. As a result, we obtain a coarse-scale velocity field that is used for solving the transport equation on the coarse grid. We note that mixed MsFEM can be implemented on unstructured grids [28].

In this article, we will infer about the permeability field based on fractional flow, denoted by $F (t)$ . F for a two-phase water-oil flow is defined as the fraction of water in the produced fluid and is given by $q_{w} / q_{t}$ , where $q_{t} = q_{o} + q_{w}$ , with $q_{o}$ and $q_{w}$ representing the flow rates of oil and water respectively at the production edge of the model. In mathematical notation,

(7) $F (t) = \frac{\int_{\partial Ω^{o u t}} v_{n} f (S) d l^{F}}{\int_{\partial Ω^{o u t}} v_{n} d l^{F}},$

where $\partial Ω^{o u t}$ is the outflow boundary and $v_{n}$ is the normal velocity field. The fractional flow of oil is $1 - F (t)$. Note that inference based on the fractional flow of water is equivalent to inference based on the fractional flow of oil.
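A discrete counterpart of Equation (7) can be sketched as a flux-weighted average of $f (S)$ over the outflow edges. The function name, the edge-wise data layout, and the numbers below are hypothetical and serve only to make the formula concrete.

```python
def fractional_flow(normal_velocity, water_flux_fraction, edge_lengths):
    """Discrete Eq. (7): ratio of water outflow to total outflow.

    Each list holds one value per edge of the outflow boundary:
    v_n, f(S), and the edge length dl.
    """
    water = sum(v * f * dl for v, f, dl in
                zip(normal_velocity, water_flux_fraction, edge_lengths))
    total = sum(v * dl for v, dl in zip(normal_velocity, edge_lengths))
    return water / total


# Three outflow edges with made-up values.
v_n = [2.0, 1.0, 1.0]     # normal velocities
f_s = [1.0, 0.5, 0.0]     # f(S) on each edge
dl = [0.5, 0.5, 0.5]      # edge lengths
F = fractional_flow(v_n, f_s, dl)
```

Because the weights $v_{n} \, d l$ are nonnegative on an outflow boundary, the result always lies between the smallest and largest edge values of $f (S)$.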

2.2. The Bayesian Inverse Problem

Our main objective is to quantify the uncertainty in the permeability field given the observed fractional flow data. For a given permeability field k, we denote the corresponding fractional flow obtained by solving the forward model as $F_{k}$ . The corresponding observed fractional flow data is denoted by $F_{o b s}$ . The non-linear forward mapping from permeability field k to $F_{k}$ is not one-to-one. In addition to that, the observed data also contain measurement errors. We define the combined model error and measurement error as a random error $ϵ$ . The model can be written as:

(8) $F_{o b s} = F_{k} + ϵ,$

where $ϵ$ is distributed as $N (0, σ_{f}^{2} I)$, i.e., $P (F_{o b s} | k)$ is assumed to be $N (F_{k}, σ_{f}^{2} I)$. The Bayesian solution to the inverse problem is given by the posterior probability distribution $P (k | F_{o b s})$.

In the following subsection, we will represent the facies and interfaces of the permeability field through K-L expansions. The parameterization of the interfaces will be based on pseudo-velocity fields in a level set method. The parameter space to express smooth velocity fields is usually small, and thus one can substantially reduce the dimension of the parameter space and at the same time preserve the contrast in facies properties. Since the number of parameters is the dimension of the inverse problem, a small dimension requires less computation to obtain convergence results.

Let $θ$ parameterize the permeability field within facies and $α$ parameterize the velocity in the level set method, then the permeability field k is completely parameterized by $θ$ and $α$ . We assume $θ$ and $α$ are independent. By Bayes’ theorem the posterior distribution can be written as:

(9) $\begin{matrix} π (θ, α) = P (θ, α | F_{o b s}) & \propto & P (F_{o b s} | θ, α) P (θ, α) \\ & = & P (F_{o b s} | θ, α) P (θ) P (α) \quad \text{[since } θ \text{ and } α \text{ are independent]}, \end{matrix}$

where $P (F_{o b s} | θ, α)$ is the likelihood, and $P (θ)$ and $P (α)$ are the priors for $θ$ and $α$, respectively. Without proper regularization, the inverse problem of inferring the permeability from the fractional flow data is ill-posed due to the non-linear forward model. However, the Bayesian formulation contains a natural mechanism of regularization in the form of the prior probability distribution and casts the inverse solution in the form of the posterior distribution.
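The structure of Equation (9) can be sketched as an unnormalized log-posterior. The Gaussian likelihood follows Equation (8), and the standard normal priors on the components of $θ$ and $α$ are an assumption consistent with the K-L parameterization used later. The function `toy_forward` is a hypothetical stand-in for the flow and transport solver, not the forward model of this article.

```python
import math


def log_posterior(theta, alpha, f_obs, forward_model, sigma_f):
    """Unnormalized log of Eq. (9): log-likelihood plus independent log-priors."""
    f_sim = forward_model(theta, alpha)
    misfit = sum((o - s) ** 2 for o, s in zip(f_obs, f_sim))
    log_likelihood = -0.5 * misfit / sigma_f ** 2          # from N(F_k, sigma_f^2 I)
    log_prior = -0.5 * (sum(t * t for t in theta)          # assumed N(0, 1) on theta_i
                        + sum(a * a for a in alpha))       # assumed N(0, 1) on alpha_i
    return log_likelihood + log_prior


def toy_forward(theta, alpha):
    """HYPOTHETICAL smooth response at three times; a placeholder for the simulator."""
    return [math.tanh(t_k * math.exp(theta[0]) + alpha[0]) for t_k in (0.5, 1.0, 1.5)]


# Synthetic data generated at a reference parameter, then two evaluations.
f_obs = toy_forward([0.2], [0.1])
lp_ref = log_posterior([0.2], [0.1], f_obs, toy_forward, sigma_f=0.05)
lp_off = log_posterior([1.0], [0.1], f_obs, toy_forward, sigma_f=0.05)
```

Here the prior term acts as the regularizer mentioned above: even where the data misfit is flat, the posterior still penalizes parameters far from the prior mean.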

2.3. Parameterization of Permeability Field

In this section, we introduce the parameterization of the permeability field. First, a heterogeneous permeability field is decomposed into several high and low permeability subregions, where each region represents a facies (see Figure 1 for an illustration). This type of hierarchical representation allows us to write the permeability field as:

(10) $k (x) = \sum_{i} k_{i} (x) I_{Ω_{i}} (x),$

where $I_{Ω_{i}}$ is an indicator function of region $Ω_{i}$ (i.e., $I (x) = 1$ if $x \in Ω_{i}$ and $I (x) = 0$ otherwise). We then need to parameterize the facies and the interfaces. For the interfaces, we use the level set method, while the permeability field within each facies is assumed to follow a log-Gaussian distribution with a known spatial covariance. Note that in our approach the permeability field description is finite dimensional, whereas the partial differential equations (PDEs) of the forward problem are posed in an infinite dimensional setting.

2.3.1. Parameterization of Interfaces

Suppose that any interface is the zero level set of a function $φ$, i.e., $φ (x, τ) = 0$ . The evolution equation for an interface is given by:

(11) $\frac{\partial φ}{\partial τ} + w \cdot \nabla φ = 0,$

where w is a pseudo-velocity field and $τ$ is pseudo-time. If there is more than one interface, we denote the $i$th interface by $φ_{i}$, so that $φ = φ_{i}$ for the different interfaces. More information on the level set method can be found in Refs. [20,22]. The key step is to specify w in (11) to describe and update the boundaries of the facies.

We consider a set of pseudo-velocity fields W, where $W = \{ w \mid w \text{ admits fixed streamlines, and } | w | \text{ is constant along streamlines on } Ω \}$. In other words, we assume that the direction of the streamlines is fixed and the magnitude of the velocity along a certain streamline is the same. However the magnitudes of the velocity vary among different streamlines. In general, one can take streamline directions to be random. To keep the dimension of the parameter space small, we take streamlines to be fixed. For example, in our numerical experiment, vertical streamlines are used. We assume further that the magnitude of velocity field $w \in W$ follows the expansion,

(12) $| w | = \sum_{i} α_{i} ϕ_{i} (z), α_{i} \sim N (0, 1), z \in Ω^{'} .$

The $ϕ_{i} (z)$’s are a spatial basis for the magnitude of the velocity field and are defined on the lower dimensional space of the interface, i.e., $ϕ_{i} (z)$ lives in $Ω^{'} \subset Ω$, where $dim (Ω^{'}) = dim (Ω) - 1$. For example, if $| w |$ is a second order stochastic process on $Ω^{'}$ with a given covariance structure, then $ϕ_{i} = \sqrt{λ_{i}} ψ_{i}$ in (12) is an $L_{2} (Ω^{'})$ basis. In this case, $| w |$ is expressed as a K-L expansion. More details about the K-L expansion are recalled in the next subsection.

Now, if the initial interface is $φ (x_{0}, τ_{0}) = 0$ at $τ_{0}$ , the interface at $τ_{0} + τ$ can be written as $φ (x_{0} + \int_{τ_{0}}^{τ} w (τ) d τ, τ_{0} + τ) = 0$ . Any interface corresponds to a pseudo-velocity field $w \in W$ and a time $τ$ . Therefore, all interfaces of interest can be generated through the evolution Equation (11), with pair $(w, τ)$ . The following lemma proves that the set of interfaces generated through this one-step movement is well defined. Otherwise, the map between the interface set and the pseudo-velocity field W is not one-to-one.

Lemma 1.

For any $φ (x, τ) = 0$ , ∃ $\tilde{w} \in W$ with Expansion (12), such that $φ (x, τ) = 0$ can be obtained from $φ (x_{0}, τ_{0}) = 0$ through evolution Equation (11).

Proof.

For any $w_{1}, w_{2} \in W$ with $| w_{1} | = \sum_{i} α_{1 i} ϕ_{i} (z)$ and $| w_{2} | = \sum_{i} α_{2 i} ϕ_{i} (z)$ , $z \in Ω^{'}$ , the new interface formed by moving the initial $φ (x_{0}, τ_{0}) = 0$ with $w_{1}$ and $w_{2}$ consecutively in time $τ_{1}$ and $τ_{2}$ is $φ (x_{0} + \int_{τ_{0}}^{τ_{1}} w_{1} d τ + \int_{τ_{1}}^{τ_{2}} w_{2} d τ, τ_{0} + τ_{1} + τ_{2}) = 0$ .

Assuming $τ_{0} = 0$ , we can choose $τ = \sqrt{τ_{1}^{2} + τ_{2}^{2}}$ , and let ${\tilde{α}}_{i} = (α_{1 i} τ_{1} + α_{2 i} τ_{2}) / τ$ , so ${\tilde{α}}_{i} \sim N (0, 1)$ . For $\tilde{w} \in W$ with $| \tilde{w} | = \sum_{i} {\tilde{α}}_{i} ϕ_{i} (z)$ , we have the distance of any particle in an interface moved by (11) in time interval $τ$ as the arc length:

$| \tilde{w} | τ = \sum_{i} {\tilde{α}}_{i} ϕ_{i} (z) τ = \sum_{i} \frac{α_{1 i} τ_{1} + α_{2 i} τ_{2}}{τ} τ ϕ_{i} (z) = \sum_{i} α_{1 i} τ_{1} ϕ_{i} (z) + \sum_{i} α_{2 i} τ_{2} ϕ_{i} (z) = | w_{1} | τ_{1} + | w_{2} | τ_{2} .$

Since $w_{1}$ , $w_{2}$ , and $\tilde{w}$ have the same direction at any location, this implies that $\int_{0}^{τ} \tilde{w} d τ = \int_{0}^{τ_{1}} w_{1} d τ + \int_{τ_{1}}^{τ_{2}} w_{2} d τ$ . Therefore, the new interface:

$φ (x, τ) = φ (x_{0} + \int_{0}^{τ} \tilde{w} d τ, τ) = φ (x_{0} + \int_{0}^{τ_{1}} w_{1} d τ + \int_{τ_{1}}^{τ_{2}} w_{2} d τ, τ_{0} + τ_{1} + τ_{2}) = 0 .$

Namely, any interface can be obtained by moving the initial interface in a certain time period once by a $\tilde{w} \in W$, a Gaussian random field with deterministic direction. □
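The algebra in the proof of Lemma 1 can be checked numerically. The sketch below combines two consecutive moves into one, using the choice $τ = \sqrt{τ_{1}^{2} + τ_{2}^{2}}$ from the proof. The coefficient values are arbitrary, and the variance computation assumes that $α_{1 i}$ and $α_{2 i}$ are independent $N (0, 1)$ variables, which the proof appears to use implicitly.

```python
import math


def combine_moves(alpha1, tau1, alpha2, tau2):
    """Return (alpha_tilde, tau) of the single move equivalent to two moves."""
    tau = math.sqrt(tau1 ** 2 + tau2 ** 2)
    alpha_tilde = [(a1 * tau1 + a2 * tau2) / tau
                   for a1, a2 in zip(alpha1, alpha2)]
    return alpha_tilde, tau


# Arbitrary illustrative coefficients and pseudo-times.
alpha1, tau1 = [0.4, -1.2, 0.7], 0.3
alpha2, tau2 = [-0.9, 0.5, 1.1], 0.4
alpha_tilde, tau = combine_moves(alpha1, tau1, alpha2, tau2)

# Coefficients of phi_i(z) in the total displacement, computed both ways.
two_step = [a1 * tau1 + a2 * tau2 for a1, a2 in zip(alpha1, alpha2)]
one_step = [a * tau for a in alpha_tilde]

# Var(alpha_tilde_i) for independent alpha_1i, alpha_2i ~ N(0, 1).
variance = (tau1 ** 2 + tau2 ** 2) / tau ** 2
```

The lists `two_step` and `one_step` agree term by term, and `variance` equals 1, which is why $\tilde{α}_{i}$ remains a standard normal variable.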

In our numerical experiments, we consider vertical streamlines in $Ω$ . The pseudo-velocity is then $w = (w_{x}, w_{y}) = (0, w_{y})$ and the magnitude along streamlines is assumed to be $w_{y} = | w | = \sum_{i} α_{i} ϕ_{i} (x)$ , $α_{i} \sim N (0, 1)$ , $x \in Ω^{'}$ . The Lemma holds for this case, and Figure 2 illustrates the process. To get a simpler model, we also have numerical examples determining the velocity field via its values at certain discrete locations. The velocity values at these locations are updated as shown in Figure 3. In this case, the basis functions $ϕ_{i}$ ’s in (12) are taken to be the indicator functions for each location, and $α_{i}$ ’s are assumed to be linear between nodes.
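For vertical streamlines, the moved interface can be sketched as a height function, and the facies decomposition (10) reduces to a test of which side of the interface a point lies on. This is a simplified illustration under several assumptions: a two-facies unit square, a cosine basis for $ϕ_{i}$, constant permeability inside each facies (instead of the log-Gaussian fields used in the article), and hand-picked coefficients.

```python
import math


def moved_interface(x, y0, alpha, tau):
    """Interface height at x after moving y = y0 for pseudo-time tau.

    The speed is |w|(x) = sum_i alpha_i * phi_i(x), with an ASSUMED
    cosine basis phi_i(x) = cos(i * pi * x).
    """
    speed = sum(a * math.cos(i * math.pi * x) for i, a in enumerate(alpha))
    return y0 + tau * speed


def permeability(x, y, alpha, tau, y0=0.5, k_low=1.0, k_high=100.0):
    """Two-facies field in the form of Eq. (10): k_high below the interface."""
    # The level set function here is phi(x, y) = y - moved_interface(x, ...).
    return k_high if y < moved_interface(x, y0, alpha, tau) else k_low


n = 20
alpha = [0.5, -0.3]   # illustrative coefficients, not sampled from the prior
tau = 0.2
# Cell-centered permeability values on an n-by-n grid over the unit square.
grid = [[permeability((i + 0.5) / n, (j + 0.5) / n, alpha, tau)
         for i in range(n)] for j in range(n)]
```

Changing `alpha` deforms the boundary while the contrast between `k_high` and `k_low` stays sharp, which is the property the level set parameterization is meant to preserve.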

2.3.2. Parameterization within Facies

Now we describe the parameterization of the permeability field within the facies. The function $Y (x, ω) = log [k (x, ω)]$ is often considered instead of $k (x, ω)$, as permeability is positive and its distribution is right-skewed. Since one of the most commonly used stochastic descriptions of spatial fields is based on a two-point correlation function, our parameterization of the permeability field starts from the correlation function of $Y (x, ω)$ , i.e., $R (x, y) = E [Y (x, ω) Y (y, ω)]$ , where $E [\cdot]$ refers to the expectation and x, y are points in the spatial domain. $R (x, y)$ is assumed to be known. The Karhunen–Loève (K-L) expansion [29] can be used to parameterize the permeability field $k (x, ω)$ or $Y (x, ω)$ as a linear combination of products of random components and spatially dependent components. The expansion is done by representing the permeability field in terms of an optimal $L^{2}$ basis. By truncating the expansion, we can represent the permeability field by fewer random parameters.

Here we briefly recall some properties of the K-L expansion. Suppose $Y (x, ω)$ is a second order stochastic process with $E \int_{Ω} Y^{2} (x, ω) d x < \infty$ . For simplicity, we assume that $E [Y (x, ω)] = 0$. Given an orthonormal basis $\{Φ_{i}\}$ in $L^{2} (Ω)$ , we can expand $Y (x, ω)$ as a general Fourier series:

$Y (x, ω) = \sum_{i = 1}^{\infty} Y_{i} (ω) Φ_{i} (x), Y_{i} (ω) = \int_{Ω} Y (x, ω) Φ_{i} (x) d x .$

The special $L^{2}$ basis $\{Φ_{i}\}$ which makes the random variables $Y_{i}$ uncorrelated is of interest here. Namely, $E (Y_{i} Y_{j}) = 0$ for all $i \neq j$ . The basis functions $\{Φ_{i}\}$ satisfy:

$E (Y_{i} Y_{j}) = \int_{Ω} Φ_{i} (x) d x \int_{Ω} R (x, y) Φ_{j} (y) d y = 0, i \neq j .$

Since $\{Φ_{i}\}$ is a complete basis in $L^{2} (Ω)$, it follows that $Φ_{i} (x)$ are eigenfunctions of $R (x, y)$:

(13) $\int_{Ω} R (x, y) Φ_{i} (y) d y = λ_{i} Φ_{i} (x), i = 1, 2, \dots,$

where $λ_{i} = E [Y_{i}^{2}] > 0$. Furthermore, we have:

$R (x, y) = \sum_{i = 1}^{\infty} λ_{i} Φ_{i} (x) Φ_{i} (y) .$

Denote $θ_{i} = Y_{i} / \sqrt{λ_{i}}$, then $θ_{i}$ satisfies $E (θ_{i}) = 0$ and $E (θ_{i} θ_{j}) = δ_{i j}$. It follows that:

(14) $Y (x, ω) = \sum_{i = 1}^{\infty} \sqrt{λ_{i}} θ_{i} (ω) Φ_{i} (x),$

where $Φ_{i}$ and $λ_{i}$ satisfy (13). The eigenvalues $λ_{i}$ can be ordered as $λ_{1} \geq λ_{2} \geq \dots$. The $L^{2}$ basis functions $Φ_{i} (x)$ are deterministic and resolve the spatial dependence of the permeability field. The randomness is represented by the scalar random variables $θ_{i}$. The expansion (14) is called the Karhunen–Loève expansion.

In practice, a K-L expansion with finite terms rather than the infinite expansion (14) is used. Given an analytical correlation function $R (x, y)$ , we can represent it on a discretized mesh. If we discretize the domain $Ω$ by an $N \times N$ rectangular mesh, we can obtain at most N pairs of eigenvalues $λ_{i}$ and eigenvectors ${\tilde{Φ}}_{i} (x)$ . The ${\tilde{Φ}}_{i}$ ’s are discrete fields. To simplify the notation, we still use $Φ_{i}$ to denote ${\tilde{Φ}}_{i}$ . In this case, the continuous K-L expansion (14) is reduced to finite terms. We get a “discretized” K-L expansion:

(15) $Y_{N} = \sum_{i = 1}^{N} \sqrt{λ_{i}} θ_{i} Φ_{i} .$
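A minimal sketch of the discretized expansion (15) on a one-dimensional grid is given below. It assumes NumPy, a uniform grid, and an exponential covariance with a hypothetical correlation length; none of these choices are taken from the article. The eigenpairs come from a symmetric eigendecomposition of the covariance matrix scaled by the cell size, which is a simple quadrature version of (13).

```python
import numpy as np


def discrete_kl(x, corr_length=0.2, variance=1.0):
    """Eigenpairs of an ASSUMED exponential covariance on a uniform 1D grid."""
    h = x[1] - x[0]                                   # cell size, used as quadrature weight
    dist = np.abs(x[:, None] - x[None, :])
    R = variance * np.exp(-dist / corr_length)        # R(x, y) sampled on the grid
    eigvals, eigvecs = np.linalg.eigh(R * h)          # discrete analogue of Eq. (13)
    order = np.argsort(eigvals)[::-1]                 # sort so lambda_1 >= lambda_2 >= ...
    lam = eigvals[order]
    phi = eigvecs[:, order] / np.sqrt(h)              # scale so that sum(phi_i^2) * h = 1
    return lam, phi


def sample_log_permeability(lam, phi, theta):
    """Truncated expansion in the form of Eq. (15) with the first len(theta) modes."""
    m = len(theta)
    return phi[:, :m] @ (np.sqrt(lam[:m]) * theta)


x = np.linspace(0.0, 1.0, 50)
lam, phi = discrete_kl(x)
rng = np.random.default_rng(0)
Y = sample_log_permeability(lam, phi, rng.standard_normal(10))
```

With this scaling, summing $λ_{i} Φ_{i} (x) Φ_{i} (y)$ over all modes reproduces the sampled covariance matrix, mirroring the series for $R (x, y)$ above.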

3. Posterior Error Introduced by Truncation

In this section, we study the regularity of the posterior distribution with respect to the parameterization errors. The errors are due to using a truncated series for the velocity $| w |$ and the parameterization of permeability Y within facies. In future discussion, we use w to denote $| w |$ for simplicity.

Consider a permeability field $k (x, ω)$ in $Ω$ (see Figure 1), which has s facies $\{Ω_{i}\}_{i = 1}^{s}$ and $\tilde{s}$ interfaces $\{φ_{i}\}_{i = 1}^{\tilde{s}}$ . The number of interfaces is assumed to be known and the interfaces are continuous. Each facies is described by a covariance function $R_{i} (x, y)$ as in Section 2.3.2. Then, the permeability field $k (x, ω)$ is a function given by:

$k (x, ω) = \sum_{i = 1}^{s} k_{i} (x, ω) I_{Ω_{i}} (x),$

where $I_{Ω_{i}}$ is an indicator function on $Ω_{i}$. Through the K-L expansion, the permeability of each facies $Ω_{i}$ is $k_{i} (x, ω) = exp \{Y_{i} (x, ω)\} = exp \{\sum_{j = 1}^{\infty} \sqrt{λ_{i j}} θ_{i j} ψ_{i j} (x)\}$, and each interface is formed by moving the initial interface $φ_{i} (x, t_{0}) = 0$ by a velocity field with deterministic direction $w_{i} = \sum_{j = 1}^{\infty} α_{i j} ϕ_{i j} (z)$ as in Section 2.3.1. Then, the permeability field $k (x, ω)$ can be written as:

$k (x, θ, α) = \sum_{i = 1}^{s} exp (Y_{i}) I_{Ω_{i} (α)} (x) .$

Considering the finite discretized case allows us to write $Y_{i}$ and $w_{i}$ in each $Ω_{i}$ as $Y_{N_{i}} = \sum_{j = 1}^{N_{i}} \sqrt{λ_{i j}^{(θ)}} θ_{i j} ψ_{i j} (x)$ , $i = 1, \dots, s$ , $x \in Ω$, and $w_{{\tilde{N}}_{i}} = \sum_{j = 1}^{{\tilde{N}}_{i}} \sqrt{λ_{i j}^{(α)}} α_{i j} ϕ_{i j} (z)$ , $i = 1, \dots, \tilde{s}$ , $z \in Ω^{'}$, with $dim Ω = dim Ω^{'} + 1$ . Since $λ_{i j}^{(θ)}$ and $λ_{i j}^{(α)}$ usually decay to 0 quickly, the truncated K-L expansions $Y_{M_{i}} = \sum_{j = 1}^{M_{i}} \sqrt{λ_{i j}^{(θ)}} θ_{i j} ψ_{i j}$ and $w_{{\tilde{M}}_{i}} = \sum_{j = 1}^{{\tilde{M}}_{i}} \sqrt{λ_{i j}^{(α)}} α_{i j} ϕ_{i j}$ can be used to reduce the dimension of the parameter space, which in turn saves CPU time when sampling from the posterior distribution. We denote $θ = (θ_{11}, \dots, θ_{1 N_{1}}, \dots, θ_{s 1}, \dots, θ_{s N_{s}})$ and $α = (α_{11}, \dots, α_{1 {\tilde{N}}_{1}}, \dots, α_{\tilde{s} 1}, \dots, α_{\tilde{s} {\tilde{N}}_{\tilde{s}}})$ , where $(θ_{i 1}, \dots, θ_{i N_{i}})$ describe the permeability field $k (θ, α)$ within the $i$th facies and $(α_{j 1}, \dots, α_{j {\tilde{N}}_{j}})$ describe the $j$th interface. $θ_{M}$ and $α_{\tilde{M}}$ are truncations of $θ$ and $α$ , respectively. Then, the corresponding representations of the permeability field in the full and truncated cases are given by:

$\begin{matrix} k (θ, α) & = & \sum_{i = 1}^{s} exp (\sum_{j = 1}^{N_{i}} \sqrt{λ_{i j}^{(θ)}} θ_{i j} ψ_{i j}) I_{{Ω_{i} (α_{i 1}, \dots, α_{i {\tilde{N}}_{i}})}}, \\ k (θ_{M}, α_{\tilde{M}}) & = & \sum_{i = 1}^{s} exp (\sum_{j = 1}^{M_{i}} \sqrt{λ_{i j}^{(θ)}} θ_{i j} ψ_{i j}) I_{{Ω_{i} (α_{i 1}, \dots, α_{i {\tilde{M}}_{i}})}} . \end{matrix}$

Correspondingly, the two posterior distributions of the permeability field in the Bayesian framework are given by:

$\begin{matrix} π (θ, α) & \propto & G (θ, α) \prod_{i = 1}^{s} π_{0} (θ_{i 1}, \dots, θ_{i N_{i}}) \prod_{j = 1}^{\tilde{s}} π_{0} (α_{j 1}, \dots, α_{j {\tilde{N}}_{j}}), \\ \tilde{π} (θ, α) & \propto & \tilde{G} (θ_{M}, α_{\tilde{M}}) \prod_{i = 1}^{s} π_{0} (θ_{i 1}, \dots, θ_{i N_{i}}) \prod_{j = 1}^{\tilde{s}} π_{0} (α_{j 1}, \dots, α_{j {\tilde{N}}_{j}}), \end{matrix}$

where $G (θ, α) = exp (- \int_{0}^{T} | F_{o b s} - F (k (θ, α); t) |^{2} d t / σ_{f}^{2})$, $\tilde{G} (θ_{M}, α_{\tilde{M}}) = exp (- \int_{0}^{T} | F_{o b s} - F (k (θ_{M}, α_{\tilde{M}}); t) |^{2} d t / σ_{f}^{2})$, and $F_{o b s}$ is the observed fractional flow data. The prior $π_{0} (θ, α)$ is assumed to be the product Gaussian measure.

It is clear that this truncation process affects the posterior inference. Our goal here is to estimate the error introduced by this truncation, which can also provide guidance for choosing $M_{i}$ and ${\tilde{M}}_{j}$ to meet specific accuracy requirements. A case with a single facies is considered first before we state the main theorem involving multiple facies.

Theorem 1.

Assume that the assumptions in Appendix A hold, and suppose the log-permeability field $Y = \sum_{i = 1}^{N} \sqrt{λ_{i}} θ_{i} ψ_{i}$ is a stationary spatial process on a bounded region, with the truncation $\tilde{Y} = \sum_{i = 1}^{M} \sqrt{λ_{i}} θ_{i} ψ_{i}$ , and $f (θ)$ is square integrable with respect to the Gaussian measure, i.e., $\int | f (θ) |^{2} π_{0} (θ) d θ < \infty$ , then:

(16) $\begin{matrix} | E_{π (θ)} [f (θ)] - E_{\tilde{π} (θ)} [f (θ)] | \leq C {\{\sum_{i = M + 1}^{N} λ_{i}\}}^{\frac{1}{2}}, \end{matrix}$

where C is independent of dimension N.

To prove this result, the main estimate of the fractional flow error $| F (k_{1}; t) - F (k_{2}; t) |$ is obtained from estimates for the governing PDE system (1)–(3). Details are provided in Appendix A. For the general case, when the permeability field has more than one facies, we can state the main theorem.

Theorem 2.

Assume that the assumptions in Appendix A hold, and suppose the discretized K-L expansions of the log permeability field and the random velocity field are given by $Y_{N_{i}} = \sum_{j = 1}^{N_{i}} \sqrt{λ_{i j}^{(θ)}} θ_{i j} ψ_{i j} (x)$ and $w_{{\tilde{N}}_{i}} = \sum_{j = 1}^{{\tilde{N}}_{i}} \sqrt{λ_{i j}^{(α)}} α_{i j} ϕ_{i j} (z)$ , where all $Y_{N_{i}}$ and $w_{{\tilde{N}}_{i}}$ are stationary spatial processes on a bounded region, and the truncated expansions are $Y_{M_{i}} = \sum_{j = 1}^{M_{i}} \sqrt{λ_{i j}^{(θ)}} θ_{i j} ψ_{i j}$ and $w_{{\tilde{M}}_{i}} = \sum_{j = 1}^{{\tilde{M}}_{i}} \sqrt{λ_{i j}^{(α)}} α_{i j} ϕ_{i j}$ , respectively. Assume that $f (θ, α)$ is a square integrable function with respect to the Gaussian measure, i.e., $\int | f (θ, α) |^{2} π_{0} (θ, α) d θ d α < \infty$ , then:

(17) $\begin{matrix} |E_{π (θ, α)} [f (θ, α)] - E_{\tilde{π} (θ, α)} [f (θ, α)]| \leq C_{1} max_{1 \leq i \leq s} {\{\sum_{j = M_{i} + 1}^{N_{i}} λ_{i j}^{(θ)}\}}^{\frac{1}{2}} + C_{2} max_{1 \leq i \leq \tilde{s}} {\{\sum_{j = {\tilde{M}}_{i} + 1}^{{\tilde{N}}_{i}} λ_{i j}^{(α)}\}}^{\frac{1}{2}}, \end{matrix}$

where $C_{1}$ and $C_{2}$ are independent of dimension $N_{i}$ and ${\tilde{N}}_{i}$ .

Proof.

Note that:

$|E_{π (θ, α)} [f (θ, α)] - E_{\tilde{π} (θ, α)} [f (θ, α)]| \leq C (E_{1} + E_{2}),$

where:

$\begin{matrix} E_{1} & = & \int | f (θ, α) | \cdot | \tilde{G} (θ, α_{\tilde{M}}) - \tilde{G} (θ_{M}, α_{\tilde{M}}) | π_{0} (θ, α) d (θ, α), \\ E_{2} & = & \int | f (θ, α) | \cdot | G (θ, α) - \tilde{G} (θ, α_{\tilde{M}}) | π_{0} (θ, α) d (θ, α) . \end{matrix}$

It is clear that Lemma A3 can be generalized to the multi-facies case to get $| \tilde{G} (θ, α_{\tilde{M}}) - \tilde{G} (θ_{M}, α_{\tilde{M}}) | \leq \frac{C}{σ_{f}^{2}} \sum_{i = 1}^{s} ∥ k (θ_{i 1}, \dots, θ_{i N_{i}}) - k (θ_{i 1}, \dots, θ_{i M_{i}}) ∥_{L_{2} (Ω_{i} (α_{\tilde{M}}))}$ . Then,

$\begin{matrix} E_{1} & \leq & \frac{C}{σ_{f}^{2}} \int | f (θ, α) | \sum_{i = 1}^{s} ∥ k (θ_{i 1}, \dots, θ_{i N_{i}}) - k (θ_{i 1}, \dots, θ_{i M_{i}}) ∥_{L_{2} (Ω_{i} (α_{\tilde{M}}))} π_{0} (θ, α) d (θ, α) \\ & \leq & \frac{C}{σ_{f}^{2}} \int \sum_{i = 1}^{s} \int | f (θ, α) | \cdot ∥ k (θ_{i 1}, \dots, θ_{i N_{i}}) - k (θ_{i 1}, \dots, θ_{i M_{i}}) ∥_{L_{2} (Ω_{i} (α_{\tilde{M}}))} π_{0} (θ) d θ \, π_{0} (α) d α \\ & \leq & \frac{C}{σ_{f}^{2}} \int \sum_{i = 1}^{s} \left\{ \int | f (θ, α) |^{2} π_{0} (θ) d θ \right\}^{\frac{1}{2}} \cdot \left\{ \int ∥ k (θ_{i 1}, \dots, θ_{i N_{i}}) - k (θ_{i 1}, \dots, θ_{i M_{i}}) ∥_{L_{2} (Ω_{i} (α_{\tilde{M}}))}^{2} π_{0} (θ) d θ \right\}^{\frac{1}{2}} π_{0} (α) d α \\ & \leq & C \int \sum_{i = 1}^{s} \left\{ \sum_{j = M_{i} + 1}^{N_{i}} λ_{i j}^{(θ)} \right\}^{\frac{1}{2}} π_{0} (α) d α \leq C max_{1 \leq i \leq s} \left\{ \sum_{j = M_{i} + 1}^{N_{i}} λ_{i j}^{(θ)} \right\}^{\frac{1}{2}}, \end{matrix}$

by Theorem 1. To estimate $E_{2}$, the corresponding error estimation for permeability fields is also needed, i.e.,

$\begin{matrix} ∥ k (θ, α) - k (θ, α_{\tilde{M}}) ∥_{L_{2} (Ω)}^{2} & = & \int_{Ω} \left| \sum_{i = 1}^{s} k_{i} I_{Ω_{i} (α)} - \sum_{i = 1}^{s} k_{i} I_{Ω_{i} (α_{\tilde{M}})} \right|^{2} d x \leq C \sum_{i = 1}^{\tilde{s}} \int_{Ω^{'}} k_{i}^{2} | w_{{\tilde{N}}_{i}} - w_{{\tilde{M}}_{i}} |^{2} d z \\ & \leq & C \sum_{i = 1}^{\tilde{s}} \int_{Ω^{'}} k_{i}^{2} \left| \sum_{j = {\tilde{M}}_{i} + 1}^{{\tilde{N}}_{i}} \sqrt{λ_{i j}^{(α)}} α_{i j} ϕ_{i j} \right|^{2} d z \leq C \sum_{i = 1}^{\tilde{s}} \int_{Ω^{'}} k_{i}^{2} exp \left( 2 \sum_{j = {\tilde{M}}_{i} + 1}^{{\tilde{N}}_{i}} \sqrt{λ_{i j}^{(α)}} α_{i j} ϕ_{i j} \right) d z . \end{matrix}$

Then we can estimate $E_{2}$ as:

$\begin{matrix} E_{2} & \leq & \frac{C}{σ_{f}^{2}} \left\{ \int | f (θ, α) |^{2} π_{0} (θ, α) d (θ, α) \right\}^{\frac{1}{2}} \cdot \left\{ \int ∥ k (θ, α) - k (θ, α_{\tilde{M}}) ∥_{L_{2} (Ω)}^{2} π_{0} (θ, α) d (θ, α) \right\}^{\frac{1}{2}} \\ & \leq & \frac{C}{σ_{f}^{2}} \left\{ \int \sum_{i = 1}^{\tilde{s}} \int_{Ω^{'}} k_{i}^{2} exp \left( 2 \sum_{j = {\tilde{M}}_{i} + 1}^{{\tilde{N}}_{i}} \sqrt{λ_{i j}^{(α)}} α_{i j} ϕ_{i j} \right) d z \, π_{0} (θ, α) d (θ, α) \right\}^{\frac{1}{2}} \\ & \leq & \frac{C}{σ_{f}^{2}} \left\{ \int_{Ω^{'}} \left[ \sum_{i = 1}^{\tilde{s}} \int k_{i}^{2} π_{0} (θ_{i}) d θ_{i} \int exp \left( 2 \sum_{j = {\tilde{M}}_{i} + 1}^{{\tilde{N}}_{i}} \sqrt{λ_{i j}^{(α)}} α_{i j} ϕ_{i j} \right) π_{0} (α_{i}) d α_{i} \right] d z \right\}^{\frac{1}{2}} \\ & \leq & \frac{C}{σ_{f}^{2}} \left\{ \sum_{i = 1}^{\tilde{s}} \int_{Ω^{'}} exp \left( 2 \sum_{j = 1}^{N} λ_{i j}^{(θ)} ψ_{i j}^{2} \right) exp \left( 2 \sum_{j = {\tilde{M}}_{i} + 1}^{{\tilde{N}}_{i}} λ_{i j}^{(α)} ϕ_{i j}^{2} \right) d z \right\}^{\frac{1}{2}} \\ & \leq & C max_{1 \leq i \leq \tilde{s}} \left\{ \sum_{j = {\tilde{M}}_{i} + 1}^{{\tilde{N}}_{i}} λ_{i j}^{(α)} \right\}^{\frac{1}{2}} . \end{matrix}$

Since all $Y_{N_{i}}$ and $w_{{\tilde{N}}_{i}}$ are stationary spatial processes on a bounded region, i.e., spatial processes where the covariance function depends only on the distance and not on the spatial location, the eigenfunctions $\{ψ_{i j}\}$ and $\{ϕ_{i j}\}$ are uniformly $L_{\infty} (Ω)$ bounded (see [30]). Thus:

$\begin{matrix} |E_{π (θ, α)} [f (θ, α)] - E_{\tilde{π} (θ, α)} [f (θ, α)]| \leq C_{1} max_{1 \leq i \leq s} {\{\sum_{j = M_{i} + 1}^{N_{i}} λ_{i j}^{(θ)}\}}^{\frac{1}{2}} + C_{2} max_{1 \leq i \leq \tilde{s}} {\{\sum_{j = {\tilde{M}}_{i} + 1}^{{\tilde{N}}_{i}} λ_{i j}^{(α)}\}}^{\frac{1}{2}} . \end{matrix}$

Please note that as $C_{1}$ and $C_{2}$ are independent of $N_{i}$ and ${\tilde{N}}_{i}$, when $M_{i} \to N_{i}$ and ${\tilde{M}}_{i} \to {\tilde{N}}_{i}$, we have $|E_{π (θ, α)} (f (θ, α)) - E_{\tilde{π} (θ, α)} (f (θ, α))| \to 0$. □

The theorem shows that the dimension of the inverse problem can be lowered by truncating the K-L expansions. Without truncation, the dimension of the K-L expansions of the permeability field and the pseudo-velocity field, which is determined by the dimension of the discretization, can be very large. A large dimension requires more iterations for the MCMC method to converge and therefore makes the computation expensive, if not infeasible. The theorem bounds the error between the two posteriors and confirms that truncating the parameter space does not introduce unbounded errors. To choose $M_{i}$ and ${\tilde{M}}_{i}$ a priori, estimates of $C_{1}$ and $C_{2}$ are needed. The constants $C_{1}$ and $C_{2}$ are independent of $N_{i}$ and ${\tilde{N}}_{i}$, but they depend on the square integrable function $f$ and other factors. It is also possible to estimate $C$ numerically.
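As a rough illustration of how the bound could guide the choice of a truncation level, the sketch below computes the tail factor $\{\sum_{j > M} λ_{j}\}^{\frac{1}{2}}$ for every $M$ and picks the smallest $M$ whose tail factor falls below a tolerance. The function names, the example spectrum, and the tolerance are hypothetical; because the constants $C_{1}$ and $C_{2}$ are unknown, the tolerance applies to the tail factor only, not to the posterior error itself.

```python
import numpy as np

def tail_bound(eigenvalues):
    """Return t with t[M] = sqrt(sum of the eigenvalues left out when M terms are kept)."""
    lam = np.sort(np.asarray(eigenvalues, dtype=float))[::-1]  # descending order
    # suffix[M] = lam[M] + lam[M + 1] + ... and suffix[N] = 0
    suffix = np.concatenate([np.cumsum(lam[::-1])[::-1], [0.0]])
    return np.sqrt(suffix)

def smallest_truncation(eigenvalues, tol):
    """Smallest M such that the tail factor is at most tol (tol >= 0)."""
    t = tail_bound(eigenvalues)
    return int(np.argmax(t <= tol))  # index of the first entry that meets the tolerance

# Hypothetical, rapidly decaying spectrum used only for illustration.
lam = np.array([4.0, 1.0, 0.25, 0.0625])
M = smallest_truncation(lam, tol=0.6)
print(M, tail_bound(lam))
```

For a channelized field, the same computation would be repeated for each facies and each interface, taking the largest tail factor as in Theorem 2.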

4. MCMC Sampling from the Truncated Posterior

In this section, we introduce the sampling scheme used in the numerical examples in Section 5.2. To see the advantages of this scheme, we first state the standard Metropolis–Hastings algorithm and then point out the motivation of using the scheme.

For a channelized permeability field, the standard Metropolis–Hastings (M-H) algorithm can be formulated in the following way to sample from the truncated posterior distribution $P (k | F_{o b s})$ .

Algorithm (M-H MCMC [31]): Suppose at the $n$th step, we have permeability field $k_{n} (α_{n}, θ_{n})$ .

Step 1. Generate $α$ from a distribution $q_{α} (α | α_{n})$ and $θ$ from a distribution $q_{θ} (θ | θ_{n})$ . Then the entire permeability field is proposed using (10).

Step 2. Accept k as a sample with probability:

(18) $\begin{matrix} γ (k_{n}, k) & = & min \{1, \frac{π (k) q (k_{n} | k)}{π (k_{n}) q (k | k_{n})}\} \\ = & min \{1, \frac{P (F_{o b s} | k)}{P (F_{o b s} | k_{n})} \cdot \frac{P (α) P (θ)}{P (α_{n}) P (θ_{n})} \cdot \frac{q_{α} (α_{n} | α) q_{θ} (θ_{n} | θ)}{q_{α} (α | α_{n}) q_{θ} (θ | θ_{n})}\}, \end{matrix}$

i.e., take $k_{n + 1} = k$ with probability $γ (k_{n}, k)$, and $k_{n + 1} = k_{n}$ with probability $1 - γ (k_{n}, k)$. □
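The two steps above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes a symmetric Gaussian random-walk proposal (so the proposal ratio in (18) cancels), stacks all parameters into one vector `x`, and replaces the expensive posterior by a toy two-dimensional Gaussian `log_target`. All names are hypothetical.

```python
import numpy as np

def metropolis_hastings(log_target, x0, step, n_iter, rng):
    """Random-walk Metropolis-Hastings; returns the chain and the acceptance rate."""
    x = np.asarray(x0, dtype=float)
    logp = log_target(x)
    chain = np.empty((n_iter, x.size))
    accepted = 0
    for n in range(n_iter):
        # Step 1: symmetric proposal around the current state.
        proposal = x + step * rng.standard_normal(x.size)
        logp_prop = log_target(proposal)
        # Step 2: accept with probability min(1, pi(proposal) / pi(x)), in log form.
        if np.log(rng.uniform()) < logp_prop - logp:
            x, logp = proposal, logp_prop
            accepted += 1
        chain[n] = x  # a rejected proposal repeats the current state
    return chain, accepted / n_iter

def log_target(x):
    """Toy stand-in for the log posterior: a standard Gaussian."""
    return -0.5 * np.sum(x**2)

rng = np.random.default_rng(0)
chain, rate = metropolis_hastings(log_target, np.zeros(2), 0.5, 5000, rng)
print(rate, chain[500:].mean(axis=0))
```

In the setting of this paper, each call to `log_target` would involve a fine-scale forward solve, which is why the rejected proposals dominate the cost.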

Starting with an initial permeability sample $k_{0}$ , the MCMC algorithm generates a Markov chain $\{k_{n}\}$ with the transition kernel:

$\begin{matrix} K_{r} (k_{n}, k) & = & γ (k_{n}, k) q (k | k_{n}) + (1 - \int γ (k_{n}, k) q (k | k_{n}) d k) δ_{k_{n}} (k) \\ = & γ (k_{n}, k) q_{α} (α | α_{n}) q_{θ} (θ | θ_{n}) + (1 - \int γ (k_{n}, k) q_{α} (α | α_{n}) q_{θ} (θ | θ_{n}) d α d θ) δ_{α_{n}} (α) δ_{θ_{n}} (θ) . \end{matrix}$

The target distribution $π (k)$ is the stationary distribution of the Markov chain $\{k_{n}\}$, so $k_{n}$ represents a sample generated from $π (k)$ after the chain converges and reaches a steady state.

The main disadvantage of the MCMC algorithm is the high computational cost in solving the coupled nonlinear PDE system (1)–(3) on the fine-grid in order to compute $F_{k}$ in the target distribution $π (k)$ . Typically, the MCMC method in our simulations converges to the steady-state after thousands of iterations, and the acceptance rate is also quite low. A large amount of CPU time is spent on simulating the rejected samples. The MCMC method can be improved by adapting the proposal distribution $q (k | k_{n})$ to the target distribution using a coarse-scale model. The process essentially modifies the proposal distribution $q (k | k_{n})$ by incorporating the coarse-scale information. The algorithm for a general two-stage MCMC method with upscaling was introduced in [32].

Let $F_{k}^{*}$ be the fractional flow computed by solving the coarse-scale model of (1)–(3) for the given k. The coarse-scale solve uses mixed MsFEM [28] for the pressure, while the saturation is solved on a coarse grid. The fine-scale target distribution $π (k)$ is approximated on the coarse scale by $π^{*} (k)$ . Here, we have:

$π (k) \propto exp (- \frac{∥ F_{o b s} - F_{k} ∥^{2}}{σ_{f}^{2}}) P (k),$

$π^{*} (k) \propto exp (- \frac{(G (∥ F_{o b s} - F_{k}^{*} ∥))^{2}}{σ_{c}^{2}}) P (k),$

where the function $G$ is estimated based on offline computations using independent samples from the prior. Using the coarse-scale distribution $π^{*} (k)$ as a filter, the two-stage MCMC can be described as follows.

Algorithm (Two-stage MCMC [32]): Suppose at the $n$th step, we have $α_{n}$ , $θ_{n}$ , and permeability field $k_{n}$ .

Coarse stage:

Step 1. At $k_{n}$ , generate a trial proposal $\tilde{k}$ by drawing $α$ from $q_{α} (α | α_{n})$ and $θ$ from $q_{θ} (θ | θ_{n})$ . The fractional flow $F_{\tilde{k}}^{*}$ is computed by solving the coarse-scale model.

Step 2. Take the proposal as:

$k = \left\{ \begin{matrix} \tilde{k} & \text{with probability } γ_{p} (k_{n}, \tilde{k}), \\ k_{n} & \text{with probability } 1 - γ_{p} (k_{n}, \tilde{k}) . \end{matrix} \right.$

The acceptance probability is given by:

(19) $γ_{p} (k_{n}, \tilde{k}) = min \{1, \frac{P^{*} (F_{o b s} | \tilde{k})}{P^{*} (F_{o b s} | k_{n})} \cdot \frac{P (α) P (θ)}{P (α_{n}) P (θ_{n})} \cdot \frac{q_{α} (α_{n} | α) q_{θ} (θ_{n} | θ)}{q_{α} (α | α_{n}) q_{θ} (θ | θ_{n})}\} .$

Therefore, the final proposal k is generated from the effective instrumental distribution:

$Q (k | k_{n}) = γ_{p} (k_{n}, k) q (k | k_{n}) + (1 - \int γ_{p} (k_{n}, k) q (k | k_{n}) d k) δ_{k_{n}} (k) .$

If $k = \tilde{k}$ , go to Step 3. Otherwise, i.e., if $k = k_{n}$ , return to Step 1.

Fine stage:

Step 3. Accept k as a sample with probability:

(20) $γ_{f} (k_{n}, k) = min (1, \frac{Q (k_{n} | k) π (k)}{Q (k | k_{n}) π (k_{n})}),$

i.e., $k_{n + 1} = k$ with probability $γ_{f} (k_{n}, k)$, and $k_{n + 1} = k_{n}$ with probability $1 - γ_{f} (k_{n}, k)$. □

Using the argument as in [32], the acceptance probability (20) can be simplified as:

$γ_{f} (k_{n}, k) = min (1, \frac{π (k) π^{*} (k_{n})}{π (k_{n}) π^{*} (k)}) .$
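A minimal sketch of the two-stage scheme with the simplified fine-stage probability is given below. It rests on several assumptions that are not from the paper: a symmetric random-walk proposal, toy Gaussian stand-ins `log_fine` and `log_coarse` for $\log π$ and $\log π^{*}$, and the convention that an iteration rejected at the coarse stage records the current state again. The point of the sketch is only that the expensive function is evaluated for proposals that pass the coarse screen.

```python
import numpy as np

def two_stage_mcmc(log_fine, log_coarse, x0, step, n_iter, rng):
    """Two-stage Metropolis-Hastings; returns the chain and the number of fine evaluations."""
    x = np.asarray(x0, dtype=float)
    lf, lc = log_fine(x), log_coarse(x)
    chain = np.empty((n_iter, x.size))
    fine_evals = 0
    for n in range(n_iter):
        trial = x + step * rng.standard_normal(x.size)
        lc_trial = log_coarse(trial)
        # Coarse stage: screen the trial with the cheap approximation pi*.
        if np.log(rng.uniform()) < lc_trial - lc:
            lf_trial = log_fine(trial)  # the expensive solve happens only here
            fine_evals += 1
            # Fine stage: min(1, pi(k) pi*(k_n) / (pi(k_n) pi*(k))) in log form.
            if np.log(rng.uniform()) < (lf_trial - lf) - (lc_trial - lc):
                x, lf, lc = trial, lf_trial, lc_trial
        chain[n] = x
    return chain, fine_evals

def log_fine(x):
    """Toy stand-in for the fine-scale log posterior."""
    return -0.5 * np.sum(x**2)

def log_coarse(x):
    """Toy stand-in for the coarse-scale approximation (a slightly wider Gaussian)."""
    return -0.5 * np.sum(x**2) / 1.5

rng = np.random.default_rng(1)
chain, fine_evals = two_stage_mcmc(log_fine, log_coarse, np.zeros(2), 1.0, 10000, rng)
print(fine_evals, chain[500:].var(axis=0))
```

Although the coarse target differs from the fine one, the fine-stage correction keeps the fine target as the stationary distribution, so the sample variance in this toy run should be close to one.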

In our numerical example, we use a random walk to generate proposals, i.e., at the $n$th step, we propose $α = α_{n} + h_{α} u_{α}$, where $u_{α}$ is generated from a $N (0, I)$ distribution. Similarly, we propose $θ = θ_{n} + h_{θ} u_{θ}$, where $u_{θ}$ is also generated from a $N (0, I)$ distribution. Here $h_{α}$ and $h_{θ}$ represent the step size of the jump in each iteration of the Metropolis–Hastings algorithm. The values of $h_{α}$ and $h_{θ}$ control the convergence of the MCMC algorithm. The prior distribution of $α$ can be taken to be $N (α_{o}, σ_{α}^{2} I)$. Similarly, the prior distribution of $θ$ can be taken to be $N (θ_{o}, σ_{θ}^{2} I)$. Then, our acceptance probability (18) is given by:

(21) $γ (k_{n}, k) = min \{1, \frac{exp (\frac{- ∥ F_{o b s} - F_{k} ∥^{2}}{2 σ_{f}^{2}})}{exp (\frac{- ∥ F_{o b s} - F_{k_{n}} ∥^{2}}{2 σ_{f}^{2}})} \frac{exp (\frac{- ∥ θ - θ_{o} ∥^{2}}{2 σ_{θ}^{2}} + \frac{- ∥ α - α_{o} ∥^{2}}{2 σ_{α}^{2}})}{exp (\frac{- ∥ θ_{n} - θ_{o} ∥^{2}}{2 σ_{θ}^{2}} + \frac{- ∥ α_{n} - α_{o} ∥^{2}}{2 σ_{α}^{2}})}\} .$

The acceptance probability (19) in the two-stage MCMC algorithm is similar.
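In practice, a ratio such as (21) is usually evaluated in log form to avoid underflow of the exponentials. The sketch below assumes that the squared misfit $∥ F_{o b s} - F_{k} ∥^{2}$ has already been computed by the forward model and is passed in as `misfit_sq`; the function names and the numerical values are hypothetical.

```python
import numpy as np

def log_posterior(misfit_sq, theta, alpha, theta0, alpha0,
                  sigma_f, sigma_theta, sigma_alpha):
    """Log of the unnormalized posterior: Gaussian likelihood times Gaussian priors."""
    log_like = -misfit_sq / (2.0 * sigma_f**2)
    log_prior = (-np.sum((theta - theta0)**2) / (2.0 * sigma_theta**2)
                 - np.sum((alpha - alpha0)**2) / (2.0 * sigma_alpha**2))
    return log_like + log_prior

def acceptance_probability(log_post_current, log_post_proposed):
    """min(1, ratio) computed from log posteriors; the symmetric random walk cancels."""
    return float(np.exp(min(0.0, log_post_proposed - log_post_current)))

# Hypothetical values: both states sit at the prior means, only the misfit differs.
zero2, zero3 = np.zeros(2), np.zeros(3)
current = log_posterior(0.02, zero2, zero3, zero2, zero3, 0.1, 1.0, 1.0)
proposed = log_posterior(0.04, zero2, zero3, zero2, zero3, 0.1, 1.0, 1.0)
print(acceptance_probability(current, proposed))
```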

We also use a simple relation for modeling coarse- and fine-scale errors. In particular, $G$ is taken to be a linear function with the condition $G (0) = 0$ . Then, $π^{*} (k)$ becomes:

$π^{*} (k) \propto exp (- \frac{∥ F_{o b s} - F_{k}^{*} ∥^{2}}{σ_{c}^{2}}) P (k),$

i.e., on the coarse scale $F_{o b s} | k$ is assumed to follow a $N (F_{k}^{*}, σ_{c}^{2} I)$ distribution,

$P^{*} (F_{o b s} | k) \propto exp (- \frac{∥ F_{o b s} - F_{k}^{*} ∥^{2}}{σ_{c}^{2}}),$

where $σ_{c}$ is the precision associated with the coarse-scale model. The parameter $σ_{c}$ plays an important role in improving the acceptance rate of the preconditioned MCMC method. The optimal value of $σ_{c}$ depends on the correlation between $∥ F_{o b s} - F_{k} ∥$ and $∥ F_{o b s} - F_{k}^{*} ∥$, which can be estimated by offline computations.
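One possible way to carry out such an offline estimate is sketched below. It is a heuristic under stated assumptions rather than the authors' procedure: paired fine and coarse misfits are collected for a few prior samples, a linear $G$ with $G (0) = 0$ is fitted by least squares, and $σ_{c}$ is chosen so that the coarse likelihood mimics the fine one, which under the fitted relation gives $σ_{c} = σ_{f} / a$ for slope $a$. The misfit values are placeholders, not results from the paper.

```python
import numpy as np

def fit_linear_G(coarse_misfit, fine_misfit):
    """Least-squares slope a of fine = a * coarse, constrained through the origin."""
    c = np.asarray(coarse_misfit, dtype=float)
    f = np.asarray(fine_misfit, dtype=float)
    return float(np.dot(c, f) / np.dot(c, c))

def coarse_sigma(sigma_f, slope):
    """If fine = a * coarse, then fine^2 / sigma_f^2 = coarse^2 / (sigma_f / a)^2."""
    return sigma_f / slope

# Placeholder misfits for three hypothetical prior samples.
coarse = [1.0, 2.0, 3.0]
fine = [2.0, 4.0, 6.0]
slope = fit_linear_G(coarse, fine)
sigma_c = coarse_sigma(0.1, slope)
print(slope, sigma_c)
```

A weak correlation between the two misfits would show up as a poor fit, in which case a larger $σ_{c}$ (a less aggressive filter) would be the cautious choice.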

Assuming that on the fine-scale $F_{o b s} | k$ follows a $N (F_{k}, σ_{f}^{2} I)$ distribution, i.e.,

$P (F_{o b s} | k) \propto exp (- \frac{∥ F_{o b s} - F_{k} ∥^{2}}{σ_{f}^{2}}),$

the acceptance probability (20) becomes:

(22) $γ_{f} (k_{n}, k) = min (1, \frac{exp (- \frac{∥ F_{o b s} - F_{k} ∥^{2}}{σ_{f}^{2}}) exp (- \frac{∥ F_{o b s} - F_{k_{n}}^{*} ∥^{2}}{σ_{c}^{2}})}{exp (- \frac{∥ F_{o b s} - F_{k_{n}} ∥^{2}}{σ_{f}^{2}}) exp (- \frac{∥ F_{o b s} - F_{k}^{*} ∥^{2}}{σ_{c}^{2}})}) .$

5. Numerical Results

5.1. Convergence Estimation

The numerical results are presented here to validate the results from Theorems 1 and 2 on truncation error estimation. We consider the water-cut function (7) as our $f$ in the theorems, because it is an important property for reservoirs. First, the tests are completed for a single-facies permeability field, and then channelized cases are considered.

5.1.1. Single Facies

In the first simulation example, a permeability field without any channelized structure is considered. To describe the permeability field, a two-point correlation function is defined as:

(23) $R (x, y) = σ^{2} exp (- \frac{| x_{1} - y_{1} |^{2}}{2 l_{1}^{2}} - \frac{| x_{2} - y_{2} |^{2}}{2 l_{2}^{2}}) .$

The K-L expansion is then used to describe the permeability field. The quantity of interest $f (θ)$ is taken to be the water-cut function $F$. One injector at $(0, 0.5)$ and one producer at $(1, 0.5)$ are considered when we run the forward model in the reference permeability field to get the fractional flow as discussed in Section 2.1. The numerical results for the error estimation are shown in Table 1 and Table 2.

We consider two sets of correlation lengths in our numerical examples. In the first example, we take $l_{1} = 0.1$ , $l_{2} = 0.4$ , and $σ^{2} = 2$ in a $50 \times 50$ fine-scale grid. $θ$ ’s are generated from a log-normal distribution. Eigenvalues decrease rapidly as shown in Figure 4.
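The discrete K-L expansion for the covariance (23) can be sketched as an eigenvalue problem for the covariance matrix on the grid. The code below is an illustration under assumptions: the covariance is sampled at cell centres of an $n \times n$ grid on the unit square, the cell area is used as the quadrature weight, and a $20 \times 20$ grid replaces the $50 \times 50$ grid of the text to keep the example small. Function and variable names are hypothetical.

```python
import numpy as np

def kl_eigenpairs(n, l1, l2, sigma2):
    """Eigenpairs of the covariance (23) sampled at the centres of an n x n grid on [0, 1]^2."""
    centres = (np.arange(n) + 0.5) / n
    x1, x2 = np.meshgrid(centres, centres, indexing="ij")
    p1, p2 = x1.ravel(), x2.ravel()
    d1 = p1[:, None] - p1[None, :]
    d2 = p2[:, None] - p2[None, :]
    R = sigma2 * np.exp(-d1**2 / (2.0 * l1**2) - d2**2 / (2.0 * l2**2))
    # The cell area 1 / n^2 approximates the integral operator; eigh returns ascending order.
    lam, vec = np.linalg.eigh(R / n**2)
    order = np.argsort(lam)[::-1]
    return lam[order], vec[:, order]

lam, vec = kl_eigenpairs(20, 0.1, 0.4, 2.0)
energy = np.cumsum(lam) / lam.sum()  # fraction of the total variance kept by the first M terms
print(lam[:5], energy[19])
```

With this normalization the eigenvalues sum to $σ^{2}$ times the area of the domain, so `energy` gives a direct view of how quickly the spectrum decays.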

The MCMC method with a random walk proposal of step size $0.3$ is used to draw samples from $π (θ)$ for several numbers of retained K-L terms. The chains are run for 10,000 iterations with the first 500 samples discarded as the burn-in period. The Monte Carlo integration retaining all the terms in the discrete K-L expansion is taken as the true value of $E_{π (θ)} F (θ)$ . Samples from the truncated posteriors are then used to compute $E_{\tilde{π} (θ)} F (θ)$ for comparison with the true value.

In the second example, we take the case with $l_{1} = l_{2} = 0.2$ and $σ^{2} = 2$ . Table 1 and Table 2 show the results with a different number of retained K-L terms $M$ for the first and second example, respectively. In both cases, the errors decrease at a rate governed by the sum of the eigenvalue remainders of $R (x)$ . This can be observed more clearly from Figure 5, where the data sets $\{ (\sum_{i = M + 1}^{N} λ_{i}^{(θ)})^{\frac{1}{2}}, | E_{π} F - E_{\tilde{π}} F | \}$ can be fitted by a straight line. Namely, the relationship between $| E_{π} F - E_{\tilde{π}} F |$ and ${(\sum_{i = M + 1}^{N} λ_{i}^{(θ)})}^{\frac{1}{2}}$ is linear, as indicated by Theorem 1, when the errors in computing $F (θ)$ are ignored.

5.1.2. Channelized Reservoirs

In our next example, we consider a permeability field with three facies. It is assumed that there is a high permeability layer in the middle and low permeability layers in the two ends. The corresponding two interfaces are chosen randomly with the condition that the upper facies boundary is always above the lower facies boundary. The two different channels are populated using two log-Gaussian random fields generated from truncated K-L expansions with two-point correlation function (23). The high permeable layer has correlation lengths $l_{x} = 0.1$ , $l_{y} = 0.4$ , and $σ = 1$ , and the low permeable layer has correlation lengths $l_{x} = l_{y} = 0.2$ , and $σ = 0.4$ . For both interfaces, a 1-d version of Equation (23) is used with correlation length $l = 0.05$ and $σ = 1.5$ .

We take the generated permeability field as the reference, and run the forward model with one injector at $(0, 0.5)$ and one producer at $(1, 0.5)$ in this reference permeability field to get the fractional flow data $F_{o b s}$ . An MCMC chain is run for 10,000 iterations to get the posterior samples of the permeability field, with the first 500 samples as a burn-in period.

The estimations of posterior errors for a different number of terms in the truncations of K-L expansions, similar to Table 1 and Table 2, are reported in Table 3. The Monte Carlo integration retaining all the terms in the discrete K-L expansions is considered to be the true value. In Table 3, we can see that the error between the true value and the estimated value from the truncated posterior decreases consistently as we increase the number of the terms retained in K-L expansion. If we further plot the errors, we can see that they lie on a plane (see Figure 6) as indicated in Theorem 2.

5.2. Matching Permeability with Reduced Parameters

In this example, we will show that the reference permeability field can be recovered from matching the observations and that the accuracy of such estimates is certainly affected by the truncation of expansions. We consider a high permeable layer in the middle and low permeable layers in the two ends with the same correlation lengths as in Section 5.1.2. The interfaces are taken as a linear interpolation of independent points.

In the first part, we truncate the K-L expansion and retain only the first 20 terms for the two permeability fields. We consider 25 points on the facies. So the dimension of $θ$ is 40 and the dimension of $α$ is 25. The two-stage MCMC method is used to sample from the posterior. The initial facies boundaries are taken to be straight lines joining the two ends of the known facies boundaries. We use random walks to perturb $θ$ and $α$ with step sizes $0.25$ and $0.05$ , respectively, and with independent Gaussian priors for $θ$ and $α$ . We run the MCMC chain for 10,000 iterations and leave out the first 500 samples as the burn-in period.
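The interface description used in this example (a high-permeability layer bounded by two interfaces, each a linear interpolation of a few points) can be sketched as follows. The function name, grid size, and permeability values are hypothetical; `k_in` and `k_out` may be scalars, as here, or $n \times n$ arrays holding the log-Gaussian fields of each facies.

```python
import numpy as np

def channel_field(lower_pts, upper_pts, k_in, k_out, n):
    """Permeability on an n x n grid over [0, 1]^2; entry [i, j] sits at x = c[i], y = c[j]."""
    centres = (np.arange(n) + 0.5) / n
    xs_lo = np.linspace(0.0, 1.0, len(lower_pts))
    xs_up = np.linspace(0.0, 1.0, len(upper_pts))
    lower = np.interp(centres, xs_lo, lower_pts)  # interface height above each grid column
    upper = np.interp(centres, xs_up, upper_pts)
    y = centres[None, :]
    # Indicator function of the middle facies: between the two interfaces.
    inside = (y >= lower[:, None]) & (y <= upper[:, None])
    return np.where(inside, k_in, k_out)

# Straight initial interfaces, as used to start the chain in this example.
field = channel_field([0.4, 0.4], [0.6, 0.6], 100.0, 1.0, 10)
print(field[0])
```

Perturbing the interface points by a random walk and calling the function again gives the proposed channel geometry for the next MCMC step.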

In Figure 7, the reference permeability field, the initial permeability field, and the mean of the posterior permeability field are shown. We can see that the sample mean is very close to the reference field. On the left plot of Figure 8 we can see that the sample estimate of the fractional flow of oil is quite close to the observed data. From the right plot of Figure 8 we can see that the combined error decreases nearly to zero and stays there, which shows that the Markov chain has converged. The two-stage MCMC has a higher acceptance rate [32] (four times higher in these calculations) because it rejects poor proposals quickly in the first stage, which is inexpensive. Next, we repeat the same procedure of sampling the posterior but we retain 25 terms in the K-L expansion for the two permeability fields. We use the same reference permeability field and the fractional flow of oil data. The numerical results are shown in Figure 9 and Figure 10. We can see that the sample mean of the permeability field is more accurate than in the previous example with 20 K-L coefficients.

6. Conclusions

In this article, subsurface characterization for flows in highly heterogeneous porous media is studied. We consider channelized spatial fields to describe the permeability field where channel boundaries are assumed to have random locations and described via a level set approach. In particular, we use smooth velocity fields to change the channel boundaries within a level set framework and, thus, the parameterization of channel boundaries can be mapped to that of smooth velocity fields. This gives a reduced dimensional parameterization. Permeability distribution within each channel is assumed to be log-Gaussian and described via K-L expansion. One of our main contributions is the study of the regularity of posterior distribution. In particular, we study errors introduced in the posterior measure by truncating the prior distribution. The result from the theorem allows us to carry out the Bayesian uncertainty analysis in a finite-dimensional space. This makes the analysis easy and avoids involving “infinite” dimensional probabilistic spaces. We show that the posterior error introduced by truncation is bounded by a function of eigenvalues up to a constant, where this constant is independent of the dimension of the stochastic space. The latter guarantees that the truncation of K-L expansion based on a discretization will not introduce an unbounded error for the corresponding posterior distribution. The subsurface characterization within the Bayesian framework is based on the MCMC samples from the posterior distribution. We use an efficient two-stage MCMC that utilizes mixed MsFEM to screen the proposals. The numerical results show the validity of the proposed parameterization to interfaces and the error estimations.

One limitation of our study is that the results about the posterior error bounds are based on the assumptions that the saturation is a smooth field and permeability is a second-order stationary spatial process whose prior distribution is a Gaussian process. The complete set of assumptions is given in Appendix A. Although the stationarity assumption is quite reasonable for permeability description within facies boundaries, relaxing some of these assumptions could be beneficial for more complex reservoir models. In the future, we would like to extend our Bayesian methodology to non-stationary permeability fields. Studying the posterior error bounds for non-Gaussian spatial processes and non-smooth velocity fields is also of interest for future research.

Author Contributions

Conceptualization, A.M. and J.W.; methodology, A.M. and J.W.; software, A.M. and J.W.; validation, A.M. and J.W.; formal analysis, A.M. and J.W.; investigation, A.M. and J.W.; resources, A.M. and J.W.; writing—original draft preparation, A.M. and J.W.; writing—review and editing, A.M. and J.W.; visualization, A.M. and J.W.; supervision, A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

We would like to thank the reviewers for their valuable comments and suggestions, which helped us to improve the quality of the article.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations and mathematical notations are used in this manuscript:

K-L	Karhunen–Loève
MCMC	Markov chain Monte Carlo
MsFEM	Multiscale finite element method
M-H	Metropolis–Hastings
PDE	Partial differential equation
$v_{j}$	Velocity of phase j
k	Permeability
$k_{r j}$	Relative permeability to phase j
S	Water saturation
p	Pressure
f	Fractional flux of water
$λ$	Total mobility
$F (t)$	Fractional flow
$\partial Ω^{o u t}$	Outflow boundary
$v_{n}$	Normal velocity field
$F_{o b s}$	Observed fractional flow data
$F_{k}$	Fractional flow obtained by running the forward model to permeability k
$ϵ$	Random error
$σ_{f}^{2}$	Error variance
w	Pseudo-velocity field
$τ$	Pseudo-time
$φ_{i}$	The $i$th interface
$λ_{i}$	$i$th eigenvalue from the K-L expansion of permeability
$Φ_{i}$	$i$th eigenvector from the K-L expansion of permeability
$θ_{i}$	$i$th K-L coefficient
$α_{i}$	$i$th coefficient for the velocity field
$ϕ_{i}$	$i$th spatial basis for the velocity field
Y	Log permeability
$π (θ, α)$	Posterior
$π_{0}$	Prior
$P$	Flowmap
T	Time of flight

Appendix A

Our goal is to estimate the difference in expected values of a function with respect to two different posteriors, where one of them is a truncation of the other. We consider the domain $Ω = [0, 1] \times [0, 1]$ and assume that $\nabla p \in L_{\infty} (Ω)$ , $k \in L_{\infty} (Ω)$ , and $v \in L_{\infty} (Ω)$ , where p is pressure, k is permeability field, and v is velocity. The lemmas and theorems in this section are obtained under assumptions described in the following paragraph.

Assumptions: (i) Without loss of generality, we assume $p = 1$ and $S = 1$ on $x = 0$ ; $p = 0$ on $x = 1$ ; and no flow boundary conditions on the lateral boundaries $y = 0$ and $y = 1$ . (ii) The saturation is a smooth field. Note that if the velocity and initial conditions are smooth functions, then the saturation will be a smooth spatial field. (iii) The permeability field k is a stationary spatial process. (iv) The prior distribution is a multivariate Gaussian distribution with identity covariance matrix.

Assuming that (i)–(iv) hold, we first find an upper bound of the difference between two saturation fields via the difference of the permeability fields in an appropriate norm.

Lemma A1.

$∥ S_{1} - S_{2} ∥_{L_{2} (Ω)} \leq C {∥ k_{1} - k_{2} ∥}_{L_{2} (Ω)}$ , where $S_{1}$ and $S_{2}$ are water saturations.

Proof.

In order to estimate the difference of saturation, we need the concept of time of flight. For a particle that starts at a point ℘ at $t = 0$ and moves with velocity v, the flow map $P (℘, T)$ is its position at time $t = T$ , i.e.,

$\frac{d P}{d T} = v (P), P (℘, 0) = ℘ .$

Time of flight T characterizes particle motion under the velocity field, since velocity is a function of the spatial variable:

$\frac{d T}{d P} = \frac{1}{v (P)}, T = \int_{℘}^{P} \frac{d r}{v (r)} .$

Then, by [33] we have:

(A1) $\begin{matrix} ∥ S_{1} - S_{2} ∥_{L_{2} (Ω)} & \leq & C ∥ T_{1} - T_{2} ∥_{L_{2} (Ω)} \leq C {∥\int_{℘}^{P} \frac{d r}{v_{1} (r)} - \int_{℘}^{P} \frac{d r}{v_{2} (r)}∥}_{L_{2} (Ω)} \\ & \leq & C {∥\int_{℘}^{P} \frac{v_{2} (r) - v_{1} (r)}{v_{1} (r) v_{2} (r)} d r∥}_{L_{2} (Ω)} \leq C {∥ v_{2} - v_{1} ∥}_{L_{2} (Ω)}, \end{matrix}$

since $v_{1}, v_{2} \in L_{\infty} (Ω)$. On the other hand, $v (x) = - k (x) \nabla p$ , therefore,

$\begin{matrix} ∥ v_{1} - v_{2} ∥_{L_{2} (Ω)} & = & ∥ k_{1} \nabla p_{1} - k_{2} \nabla p_{2} ∥_{L_{2} (Ω)} \\ \leq & ∥ k_{1} \nabla (p_{1} - p_{2}) ∥_{L_{2} (Ω)} + ∥ k_{1} - k_{2} ∥_{L_{2} (Ω)} {∥ \nabla p_{2} ∥}_{L_{\infty} (Ω)} \\ \leq & ∥ k_{1} \nabla (p_{1} - p_{2}) ∥_{L_{2} (Ω)} + C {∥ k_{1} - k_{2} ∥}_{L_{2} (Ω)} . \end{matrix}$

In addition, since $d i v (k_{1} \nabla p_{1}) = 0, d i v (k_{2} \nabla p_{2}) = 0$ , then $d i v (k_{1} \nabla p_{1}) - d i v (k_{2} \nabla p_{2}) = 0$ , and further $d i v (k_{1} \nabla (p_{1} - p_{2})) = d i v ((k_{2} - k_{1}) \nabla p_{2})$ , so:

$\begin{matrix} ∥ k_{1} \nabla (p_{1} - p_{2}) ∥_{L_{2} (Ω)} & = & ∥ (k_{2} - k_{1}) \nabla p_{2} ∥_{L_{2} (Ω)} \leq ∥ k_{1} - k_{2} ∥_{L_{2} (Ω)} {∥ \nabla p_{2} ∥}_{L_{\infty} (Ω)} \\ \leq & C ∥ k_{1} - k_{2} ∥_{L_{2} (Ω)} . \end{matrix}$

Therefore,

(A2) $∥ v_{1} - v_{2} ∥_{L_{2} (Ω)} \leq C {∥ k_{1} - k_{2} ∥}_{L_{2} (Ω)} .$

Then, from (A1) and (A2), we have:

$∥ S_{1} - S_{2} ∥_{L_{2} (Ω)} \leq C {∥ k_{1} - k_{2} ∥}_{L_{2} (Ω)} .$

□
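The time-of-flight step used in the proof above can be checked numerically in one dimension. The sketch below is only an illustration with hypothetical speed profiles: it evaluates $T = \int d r / v (r)$ by the trapezoidal rule and compares the change in $T$ with the change in $v$. Note that the comparison relies on both speeds being bounded below (here by 1), since the difference of the integrands is $(v_{2} - v_{1}) / (v_{1} v_{2})$.

```python
import numpy as np

def time_of_flight(v, r):
    """Trapezoidal approximation of the integral of 1 / v along the path points r."""
    g = 1.0 / np.asarray(v, dtype=float)
    return float(np.sum(0.5 * (g[1:] + g[:-1]) * np.diff(r)))

r = np.linspace(0.0, 1.0, 201)
v1 = 1.0 + 0.5 * r  # hypothetical speed profiles, both bounded below by 1
v2 = 1.0 + 0.6 * r
gap_T = abs(time_of_flight(v1, r) - time_of_flight(v2, r))
gap_v = float(np.max(np.abs(v1 - v2)))
print(gap_T, gap_v)
```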

In the Bayesian framework, the reference fractional flow or water-cut $F (k; t) = \int_{0}^{t} \int_{0}^{1} v (1, y) S (1, y, t) d y d t$ is matched to the observed data to get the target posterior distribution. Next, we will estimate the difference between two water-cut responses via the corresponding permeability fields.

Lemma A2.

$| F (k_{1}; t) - F (k_{2}; t) |^{2} \leq C {∥ k_{1} - k_{2} ∥}_{L_{2} (Ω)}^{2}$ , where $k_{1}$ and $k_{2}$ are permeabilities, and $F (k_{1}; t)$ and $F (k_{2}; t)$ are water-cut functions.

Proof.

Note that:

$\begin{matrix} F (k; t) & = & \int_{0}^{t} \int_{0}^{1} v (1, y) S (1, y, t) d y d t \\ = & \int_{0}^{t} [\int_{0}^{1} v (1, y) S (1, y, t) d y - \int_{0}^{1} v (0, y) S (0, y, t) d y] d t \\ + & \int_{0}^{t} \int_{0}^{1} v (0, y) S (0, y, t) d y d t . \end{matrix}$

Using $S (0, y, t) = 1$ and $S_{t} + v \cdot \nabla S = 0$ , it follows $\int_{0}^{1} v (0, y) S (0, y, t) d y = \int_{0}^{1} v (0, y) d y = \int_{0}^{1} v (s, y) d y$ for any $s \in [0, 1]$ , since v is divergence free. Then,

$\begin{matrix} F (k_{1}; t) & = & \int_{0}^{t} [\int_{\partial Ω} v_{1} (x, y) S_{1} (x, y, t) d y] d t + \int_{0}^{t} \int_{0}^{1} v_{1} (0, y) d y d t \\ & = & \int_{0}^{t} [\int_{Ω} d i v \{ v_{1} (x, y) S_{1} (x, y, t) \} d x d y] d t + \int_{0}^{t} \int_{0}^{1} v_{1} (s, y) d y d t \\ & = & \int_{0}^{t} [\int_{Ω} v_{1} (x, y) \cdot \nabla S_{1} (x, y, t) d x d y] d t + \int_{0}^{t} \int_{0}^{1} v_{1} (s, y) d y d t \\ & = & \int_{0}^{t} [- \int_{Ω} {(S_{1})}_{t} d x d y] d t + \int_{0}^{t} \int_{0}^{1} v_{1} (s, y) d y d t \\ & = & - \int_{Ω} S_{1} (x, y, t) d x d y + \int_{Ω} S_{1} (x, y, 0) d x d y + \int_{0}^{t} \int_{0}^{1} v_{1} (s, y) d y d t . \end{matrix}$

A similar result can be obtained for $F (k_{2}; t)$ . Then,

$\begin{matrix} | F (k_{1}; t) - F (k_{2}; t) |^{2} & = & | \int_{Ω} (S_{2} (x, y, t) - S_{1} (x, y, t)) d x d y + \int_{Ω} (S_{1} (x, y, 0) - S_{2} (x, y, 0)) d x d y \\ + \int_{0}^{t} \int_{0}^{1} (v_{1} (s, y) - v_{2} (s, y)) {d y d t |}^{2} \\ \leq & C (\int_{Ω} | (S_{2} (x, y, t) - S_{1} (x, y, t)) |^{2} d x d y + \int_{Ω} | S_{1} (x, y, 0) - S_{2} (x, y, 0) |^{2} d x d y \\ + \int_{0}^{t} \int_{Ω} | v_{1} (x, y) - v_{2} {(x, y) |}^{2} d x d y d t) \leq C ∥ k_{1} - k_{2} ∥_{L_{2} (Ω)}^{2}, \end{matrix}$

by Lemma A1. □

Next, we consider the case with single facies and the permeability that is described via K-L expansion. In particular, we assume $k (x, ω) = exp (\sum_{i = 1}^{N} θ_{i} ϕ_{i} (x))$ and consider the truncated expansion $k (x, ω) = exp (\sum_{i = 1}^{M} θ_{i} ϕ_{i} (x))$ . Then, the posterior distributions can be written as:

$π (θ) \propto G (θ_{1}, \dots, θ_{N}) π_{0} (θ), \tilde{π} (θ) \propto \tilde{G} (θ_{1}, \dots, θ_{M}) π_{0} (θ),$

where $π (θ)$ is the posterior to be sampled, $\tilde{π} (θ)$ is an approximation of $π (θ)$, and $π_{0} (θ)$ is the prior distribution. $G (θ_{1}, \dots, θ_{N})$ and $\tilde{G} (θ_{1}, \dots, θ_{M})$ are likelihoods, where:

$\begin{matrix} G (θ_{1}, \dots, θ_{N}) & = & exp (- \frac{\int_{0}^{T} {| F_{o b s} - F (k_{1} (θ_{1}, \dots, θ_{N}); t) |}^{2} d t}{σ_{f}^{2}}), \\ \tilde{G} (θ_{1}, \dots, θ_{M}) & = & exp (- \frac{\int_{0}^{T} {| F_{o b s} - F (k_{2} (θ_{1}, \dots, θ_{M}); t) |}^{2} d t}{σ_{f}^{2}}) . \end{matrix}$

Next, we estimate the difference between G and $\tilde{G}$ .

Lemma A3.

$| G (θ_{1}, \dots, θ_{N}) - \tilde{G} (θ_{1}, \dots, θ_{M}) | \leq \frac{C}{σ_{f}^{2}} {∥ k_{1} - k_{2} ∥}_{L_{2} (Ω)}$ .

Proof.

It is clear that $| F (k_{1}; t) - F_{o b s} |$ and $| F (k_{2}; t) - F_{o b s} |$ are bounded. Then, by Lemma A2:

$\begin{matrix} | G (θ_{1}, \dots, θ_{N}) - \tilde{G} (θ_{1}, \dots, θ_{M}) | \\ \leq & \frac{C}{σ_{f}^{2}} |\int_{0}^{T} | F_{o b s} - F (k_{1}; t) |^{2} d t - \int_{0}^{T} {| F_{o b s} - F (k_{2}; t) |}^{2} d t| \\ \leq & \frac{C}{σ_{f}^{2}} (\int_{0}^{T} | 2 F_{o b s} - F (k_{2}; t) - F (k_{1}; t) |^{2} {d t)}^{\frac{1}{2}} \cdot (\int_{0}^{T} | F (k_{1}; t) - F (k_{2}; t) {|^{2} d t)}^{\frac{1}{2}} \\ \leq & \frac{C}{σ_{f}^{2}} (\int_{0}^{T} | F (k_{1}; t) - F (k_{2}; t) |^{2} {d t)}^{\frac{1}{2}} \leq \frac{C}{σ_{f}^{2}} {∥ k_{1} - k_{2} ∥}_{L_{2} (Ω)} . \end{matrix}$

□

Theorem A1.

(Theorem 1) Suppose that the permeability field k is a stationary spatial process on a bounded region and $f (θ)$ is square integrable with respect to Gaussian measure, i.e., $\int | f (θ) |^{2} π_{0} (θ) d θ < \infty$ , then:

(A3) $|E_{π (θ)} [f (θ)] - E_{\tilde{π} (θ)} [f (θ)]| \leq C {\{\sum_{i = M + 1}^{N} λ_{i}\}}^{\frac{1}{2}},$

where C is independent of dimension N.

Proof.

If $f (θ)$ is square integrable with respect to Gaussian measure (e.g., a polynomial function), we can show that:

$\begin{matrix} |E_{π (θ)} [f (θ)] - E_{\tilde{π} (θ)} [f (θ)]| & \leq & C \int | f (θ) | \, | G (θ_{1}, \dots, θ_{N}) - \tilde{G} (θ_{1}, \dots, θ_{M}) | π_{0} (θ) d θ \\ & \leq & \frac{C}{σ_{f}^{2}} \int | f (θ) | \, ∥ k_{1} - k_{2} ∥_{L_{2}} π_{0} (θ) d θ \\ & \leq & \frac{C}{σ_{f}^{2}} \left( \int | f (θ) |^{2} π_{0} (θ) d θ \right)^{\frac{1}{2}} \left( \int ∥ k_{1} - k_{2} ∥_{L_{2}}^{2} π_{0} (θ) d θ \right)^{\frac{1}{2}} \\ & \leq & \frac{C}{σ_{f}^{2}} \left( \int ∥ k_{1} - k_{2} ∥_{L_{2}}^{2} π_{0} (θ) d θ \right)^{\frac{1}{2}} . \end{matrix}$

To estimate the error of truncation of K-L expansion, let $k_{1} = exp (\sum_{i = 1}^{N} θ_{i} \sqrt{λ_{i}} ψ_{i})$ and $k_{2} = exp (\sum_{i = 1}^{M} θ_{i} \sqrt{λ_{i}} ψ_{i})$ . We assume $θ_{i} \sim N (0, 1)$ for simplicity, then:

$\begin{matrix} {|\int f (θ) π (θ) d θ - \int f (θ) \tilde{π} (θ) d θ|}^{2} \\ \leq & \frac{C}{σ_{f}^{4}} \int {∥exp (\sum_{i = 1}^{N} θ_{i} \sqrt{λ_{i}} ψ_{i}) - exp (\sum_{i = 1}^{M} θ_{i} \sqrt{λ_{i}} ψ_{i})∥}_{L_{2}}^{2} π_{0} (θ) d θ \\ \leq & \frac{C}{σ_{f}^{4}} \int_{Ω} \int exp (2 \sum_{i = 1}^{M} θ_{i} \sqrt{λ_{i}} ψ_{i}) {[1 - exp (\sum_{i = M + 1}^{N} θ_{i} \sqrt{λ_{i}} ψ_{i})]}^{2} π_{0} (θ) d θ d x d y \\ \leq & \frac{C}{σ_{f}^{4}} \int_{Ω} I_{1} I_{2} d x d y, \end{matrix}$

where:

$\begin{matrix} I_{1} & = & \int \dots \int exp (2 \sum_{i = 1}^{M} θ_{i} \sqrt{λ_{i}} ψ_{i}) π_{0} (θ_{1}, \dots, θ_{M}) d θ_{1} \dots d θ_{M} \\ & = & \prod_{i = 1}^{M} \frac{1}{\sqrt{2 π}} \int exp (- \frac{1}{2} {(θ_{i} - 2 \sqrt{λ_{i}} ψ_{i})}^{2} + 2 λ_{i} ψ_{i}^{2}) d θ_{i} = exp (2 \sum_{i = 1}^{M} λ_{i} ψ_{i}^{2}), \end{matrix}$

because the $ψ_{i}$'s are bounded, and:

$\begin{matrix} I_{2} & = & \int \dots \int {[1 - exp (\sum_{i = M + 1}^{N} θ_{i} \sqrt{λ_{i}} ψ_{i})]}^{2} π_{0} (θ_{M + 1}, \dots, θ_{N}) d θ_{M + 1} \dots d θ_{N} \\ & = & \int \dots \int \{ 1 - 2 exp (\sum_{i = M + 1}^{N} θ_{i} \sqrt{λ_{i}} ψ_{i}) + exp (2 \sum_{i = M + 1}^{N} θ_{i} \sqrt{λ_{i}} ψ_{i}) \} \prod_{i = M + 1}^{N} \frac{1}{\sqrt{2 π}} exp (- \frac{θ_{i}^{2}}{2}) d θ_{i} \\ & \leq & 1 - 2 (1 + \frac{1}{2} \sum_{i = M + 1}^{N} λ_{i} ψ_{i}^{2}) + 1 + 2 \sum_{i = M + 1}^{N} λ_{i} ψ_{i}^{2} (exp (2 \sum_{i = M + 1}^{N} λ_{i} ψ_{i}^{2}) + \frac{1}{2}) \\ & \leq & C exp (2 \sum_{i = M + 1}^{N} λ_{i} ψ_{i}^{2}) \sum_{i = M + 1}^{N} λ_{i} ψ_{i}^{2} . \end{matrix}$

Since k is a stationary spatial process on a bounded region, i.e., a spatial process where the covariance function depends only on the distance and not on the spatial location, the eigenfunctions $\{ψ_{i}\}$ are uniformly $L_{\infty} (Ω)$ bounded by [30]. Thus,

$\begin{matrix} |\int f (θ) π (θ_{1}, \dots, θ_{N}) d θ - \int f (θ) \tilde{π} (θ_{1}, \dots, θ_{N}) d θ| \\ \leq & \frac{C}{σ_{f}^{2}} {\{\int_{Ω} I_{1} I_{2} d x d y\}}^{\frac{1}{2}} \leq C {\{\int_{Ω} exp (2 \sum_{i = 1}^{N} λ_{i} ψ_{i}^{2}) \sum_{i = M + 1}^{N} λ_{i} ψ_{i}^{2} d x d y\}}^{\frac{1}{2}} \leq C {\{\sum_{i = M + 1}^{N} λ_{i}\}}^{\frac{1}{2}} . \end{matrix}$

□
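The Gaussian moment identity behind $I_{1}$ in the proof above, namely that $θ \sim N (0, 1)$ gives an expected value of $exp (2 θ \sqrt{λ} ψ)$ equal to $exp (2 λ ψ^{2})$, can be verified numerically. The sketch below uses Gauss–Hermite quadrature from NumPy; the values of `lam` and `psi` are arbitrary placeholders for one eigenvalue and one pointwise eigenfunction value.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermegauss

def gaussian_mean(func, degree=40):
    """E[func(theta)] for theta ~ N(0, 1) via Gauss-Hermite quadrature."""
    nodes, weights = hermegauss(degree)  # quadrature for the weight exp(-x^2 / 2)
    return float(np.sum(weights * func(nodes)) / np.sqrt(2.0 * np.pi))

lam, psi = 0.3, 0.8  # placeholder eigenvalue and eigenfunction value at one point
numeric = gaussian_mean(lambda t: np.exp(2.0 * t * np.sqrt(lam) * psi))
closed_form = float(np.exp(2.0 * lam * psi**2))
print(numeric, closed_form)
```

Taking the product of this identity over $i = 1, \dots, M$ gives the closed form of $I_{1}$, since the coefficients $θ_{i}$ are independent under the prior.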

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures and Tables

Figure 1. Illustration of the permeability field with facies.

Figure 2. Interface evolution by moving initial interface with different vertical velocity fields.

Figure 3. Interface updates using velocity representation at some fixed points.

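Figure 4. Ordered eigenvalues for $l_{1} = 0.1$, $l_{2} = 0.4$, and $σ^{2} = 2$.

Figure 5. Linear fit of $\{ (\sum_{i = M + 1}^{N} λ_{i}^{(θ)})^{\frac{1}{2}}, | E_{π} F - E_{\tilde{π}} F | \}$. Left: $l_{1} = 0.1$, $l_{2} = 0.4$, $σ^{2} = 2$, and $σ_{f}^{2} = 0.001$; Right: $l_{1} = 0.2$, $l_{2} = 0.2$, $σ^{2} = 2$, and $σ_{f}^{2} = 0.005$.

Figure 6. Plots of $(\max \{ (\sum_{j = M_{1} + 1}^{N_{1}} λ_{1 j}^{(θ)})^{\frac{1}{2}}, (\sum_{j = M_{2} + 1}^{N_{2}} λ_{2 j}^{(θ)})^{\frac{1}{2}} \}, (\sum_{i = \tilde{M} + 1}^{\tilde{N}} λ_{i}^{(α)})^{\frac{1}{2}}, | E_{π} F - E_{\tilde{π}} F |)$.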

Figure 7. Top left: The true log-permeability field. Top right: The initial log-permeability field. Bottom left: One of the sampled log-permeability fields. Bottom right: The mean of the sampled log-permeability fields from two-stage MCMC using 20 K-L terms.

Figure 8. Left: The red line designates the fine-scale reference fractional flow of oil, the blue line designates the initial fractional flow of oil, and the green line designates the fractional flow of oil corresponding to the mean of the sampled permeability fields from two-stage MCMC. Right: Fractional flow errors vs. accepted iterations when sampling from the posterior distribution with 20 terms retained in the K-L expansion.

Figure 9. Top left: The true log-permeability field. Top right: The initial log-permeability field. Bottom left: One of the sampled log-permeability fields. Bottom right: The mean of the sampled log-permeability fields from two-stage MCMC using 25 K-L terms.

Figure 10. Left: The red line designates the fine-scale reference fractional flow of oil, the blue line designates the initial fractional flow of oil, and the green line designates the fractional flow of oil corresponding to the mean of the sampled permeability fields from two-stage MCMC. Right: Fractional flow errors vs. accepted iterations when sampling from the posterior distribution with 25 terms retained in the K-L expansion.

Table 1

Posterior errors $| E_{π} F - E_{\tilde{π}} F |$ when the K-L expansion is truncated to M terms. Here $l_{1} = 0.1$ , $l_{2} = 0.4$ , $σ^{2} = 2$ , and $σ_{f}^{2} = 0.001$ .

M	${(\sum_{i = M + 1}^{N} λ_{i}^{(θ)})}^{\frac{1}{2}}$	$| E_{π} F - E_{\tilde{π}} F |$
5	1.111681	0.081809
10	0.750662	0.106264
15	0.517555	0.063635
20	0.337901	0.030207
25	0.189272	0.017931
30	0.071924	0.011225

Table 2

Posterior errors $| E_{π} F - E_{\tilde{π}} F |$ when the K-L expansion is truncated to M terms. Here $l_{1} = 0.2$ , $l_{2} = 0.2$ , $σ^{2} = 2$ , and $σ_{f}^{2} = 0.005$ .

M	${(\sum_{i = M + 1}^{N} λ_{i}^{(θ)})}^{\frac{1}{2}}$	$| E_{π} F - E_{\tilde{π}} F |$
5	1.176697	0.308118
10	0.820661	0.191601
15	0.566938	0.119590
20	0.378454	0.059173
25	0.248267	0.033023
30	0.123347	0.014965
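The relationship that Tables 1 and 2 tabulate (and that Figure 5 shows as a linear fit) can be examined with an ordinary least-squares line through the points (tail term, posterior error). The sketch below copies the values from the two tables above; the helper name `linear_fit` and the choice of `numpy.polyfit` are assumptions made for illustration and are not taken from the article.

```python
import numpy as np

# (tail term (sum_{i>M} lambda_i)^(1/2), posterior error |E_pi F - E_pi~ F|)
# for M = 5, 10, 15, 20, 25, 30, copied from Tables 1 and 2.
tables = {
    "Table 1": (
        np.array([1.111681, 0.750662, 0.517555, 0.337901, 0.189272, 0.071924]),
        np.array([0.081809, 0.106264, 0.063635, 0.030207, 0.017931, 0.011225]),
    ),
    "Table 2": (
        np.array([1.176697, 0.820661, 0.566938, 0.378454, 0.248267, 0.123347]),
        np.array([0.308118, 0.191601, 0.119590, 0.059173, 0.033023, 0.014965]),
    ),
}


def linear_fit(tail, err):
    """Least-squares line err ~ slope * tail + intercept, plus Pearson r."""
    slope, intercept = np.polyfit(tail, err, 1)
    r = np.corrcoef(tail, err)[0, 1]
    return float(slope), float(intercept), float(r)


fits = {name: linear_fit(tail, err) for name, (tail, err) in tables.items()}

for name, (slope, intercept, r) in fits.items():
    print(f"{name}: slope={slope:.4f}, intercept={intercept:.4f}, r={r:.4f}")
```

A positive slope with a correlation close to one is what the bound $C {\{\sum_{i = M + 1}^{N} λ_{i}\}}^{\frac{1}{2}}$ would suggest. With only six points per table, the fit is an indication rather than a test of the estimate.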

Table 3

Posterior errors $| E_{π} F - E_{\tilde{π}} F |$ when the K-L expansion is truncated to M terms for different facies.

$M_{1}$	$M_{2}$	$\tilde{M}$	${(\sum_{j = M_{1} + 1}^{N_{1}} λ_{1 j}^{(θ)})}^{\frac{1}{2}}$	${(\sum_{j = M_{2} + 1}^{N_{2}} λ_{2 j}^{(θ)})}^{\frac{1}{2}}$	${(\sum_{i = \tilde{M} + 1}^{\tilde{N}} λ_{i}^{(α)})}^{\frac{1}{2}}$	$| E_{π} F - E_{\tilde{π}} F |$
5	5	5	0.526235	0.786077	0.853727	0.109464
10	5	5	0.367011	0.786077	0.853727	0.116172
10	10	10	0.367011	0.530798	0.477141	0.051925
15	10	10	0.253542	0.530798	0.477141	0.093109
15	15	10	0.253542	0.365967	0.477141	0.053869
20	15	15	0.169250	0.365967	0.210844	0.047356
20	20	15	0.169250	0.238932	0.210844	0.019996

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

In this article, we study uncertainty quantification for flows in heterogeneous porous media. We use a Bayesian approach in which the solution to the inverse problem is the posterior distribution of the permeability field given the flow and transport data. Permeability fields within facies are assumed to be described by two-point correlation functions, while the interfaces that separate facies are represented via smooth pseudo-velocity fields in a level set formulation to obtain a reduced-dimensional parameterization. The permeability fields within facies and the pseudo-velocity fields representing interfaces can be described using the Karhunen–Loève (K-L) expansion, in which one can select the dominant modes. We study the error that such truncations introduce in the posterior distribution by estimating the difference in the expectation of a function with respect to the full and truncated posteriors. The theoretical result shows that this error can be bounded by the tail of the K-L eigenvalues, with constants independent of the dimension of the discretization. This result guarantees the feasibility of such truncations with respect to posterior distributions. To speed up Bayesian computations, we use an efficient two-stage Markov chain Monte Carlo (MCMC) method that uses the mixed multiscale finite element method (MsFEM) to screen the proposals. The numerical results show the validity of the proposed parameterization of channel geometry and of the error estimates.

Details

Title

Bayesian Uncertainty Quantification for Channelized Reservoirs via Reduced Dimensional Parameterization

Author

Mondal, Anirban¹; Jia, Wei²

¹ Department of Mathematics, Applied Mathematics, and Statistics, Case Western Reserve University, Cleveland, OH 44106, USA
² Bayesian Learning Corp., 825 S Golden West Ave, Arcadia, CA 91007, USA; [email protected]

First page

1067

Publication year

2021

Publication date

2021

Publisher

MDPI AG

e-ISSN

22277390

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.3390/math9091067

ProQuest document ID

2530157123
