1. Introduction
The distribution of subsurface properties are mainly controlled by the location of distinct geologic facies with sharp contrasts in properties, such as permeability and porosity, across facies boundaries [1]. For example, in a fluvial setting, high permeability channel sands are often embedded in a nearly impermeable background causing the dominant fluid movement to be restricted within these channels. Under such conditions, the channel geometry plays an important role in determining the flow behavior in the subsurface. Consequently, in predicting the flow through highly heterogeneous porous formations, it is important to model facies boundaries accurately and to properly account for the uncertainties in these models. Traditional geostatistical techniques for subsurface characterization have typically relied on two-point correlation functions to describe the spatial variability. Such spatial fields do not reproduce discrete and irregular geologic features (geo-bodies) such as fluvial channels [2,3,4]. The success of object-based models, such as discrete Boolean or object-based models [5], is heavily dependent on the parameters to specify the object size, shapes, proportion, and orientation. Typically, these parameters are highly uncertain, particularly in the early stages of subsurface characterization [2,6]. For example, in a channel type environment, the channel sands may be observed at only a few well locations. There are many plausible channel geometries that will satisfy the channel sand and well intersections. Thus, the stochastic models for channels will require the specification of random variables that govern the channel boundaries. All the parameters have considerable uncertainty associated with them and will impact fluid flow in the subsurface. A considerable amount of prior information is typically available for building the facies models for fluid flow simulation [1]. These include well logs and cores, seismic data, and geologic conceptualization based on outcrops and analogs. Although prior information plays a vital role in reducing uncertainty and preserving geologic realism, it is imperative that the geologic models reproduce the dynamic response based on flow and transport data. In the last decade, significant progress has been made in conditioning pixel-based geologic models to flow and transport data [4,7,8,9,10,11,12]. The approach typically involves the solution of an inverse problem requiring the minimization of a suitably defined objective function. Both gradient-based methods and combinatorial optimization methods have been used for this purpose. The existing approaches are not readily applicable to facies-based models where the primary goal is to locate the facies boundaries and preserve the contrast in facies properties.
In this article, we consider a Bayesian approach where the solution to the inverse problem is given by the posterior distribution of the subsurface properties given the dynamic response on flow and transport data. The Bayesian approach has a natural mechanism of regularization in the form of prior information and provides quantitative assessments of the uncertainties of important model parameters. This provides a better understanding of their direct and indirect effects on the response of the physical system. Such Bayesian models have been used for uncertainty quantification in subsurface models [13,14,15,16,17], and also in other areas such as seismic modeling [18,19]. Here we use a Bayesian hierarchical model that preserves the facies architecture, at the same time populating the petrophysical properties within the facies in a geologically consistent manner by incorporating available static and dynamic information. To maintain the contrast in facies properties, we represent the facies boundaries using level sets, which provide a systematic method for morphing the facies shapes to reconstruct a wide variety of facies geometries [20,21,22,23]. Although level sets have recently been used to represent facies boundaries [24], the novelty here is that we employ an efficient Bayesian hierarchical uncertainty quantification technique to perturb the facies boundaries and properties to match the dynamic response such as multiphase production history. The description of the facies boundaries in our level set approach is based on a parameterization of the pseudo-velocity fields that deform the interfaces. We will mostly focus on smooth interfaces that require smooth velocity fields in the level set method. The space of smooth velocity fields can be parameterized with fewer parameters, thus providing us with a smaller dimensional uncertainty space to explore.
The dynamic and transport flow data that we consider in our simulations is the fractional flow data, which is the fraction of water produced in relation to the total production rate. A significant part of the computational expense in any dynamic data integration method is the modeling of flow and transport through high-resolution geologic models. To precondition these simulations, we adopt a multi-stage MCMC method to minimize the number of fine-scale flow simulations during MCMC sampling. In this approach, simplified models using mixed MsFEM are used to screen the proposals before running detailed fine-scale simulations. Note that our forward model consists of coupled flow and transport equations. The mixed MsFEM is used to solve the flow equation on a coarse grid and further use the velocity field on a coarse grid to compute the fractional flow.
A major part of this article is devoted to studying the regularity of the posterior measure with respect to the prior measure. In particular, we estimate the difference in the expectation of a function with respect to full and truncated posterior distributions. Here, the full posterior distribution refers to the posterior computed using all parameter space, while the truncated posterior distribution refers to the posterior computed using a truncated parameter space. The error in the fractional flow (the quantity that is often measured) is obtained in terms of the truncation error in K-L expansion. In particular, we show that the error is proportional to the truncation error in K-L expansion. Moreover, we show that the constants in these error estimates are independent of the dimension of the parameter space. The latter is very important for our application, as the dimension of the parameter space without K-L truncation, which depends on the discretization of the domain, can be very large. The convergence of MCMC methods on such high dimensional parameter space is infeasible. The results confirm the validity of using a reduced dimensional parameterization in this context, for which the MCMC methods become feasible. Numerical results are also presented in support of the theoretical bounds of the posterior error due to the truncation. For efficient sampling of the posterior we use a two-stage MCMC method, that utilizes an inexpensive mixed MsFEM in a up-scaled coarse field to screen the bad proposals, before being accepted in the final stage, which needs a more expensive solver.
The paper is organized in the following way. In Section 2, we discuss the Bayesian inverse problem setup and prior parameterization. Section 3 is devoted to the estimation of the posterior error due to the truncation in prior parameterization. In Section 4, we briefly describe the sampling algorithms. Numerical results are presented in Section 5.
2. Geological Models, Parameterization, and the Bayesian Inverse Problem
2.1. Fine and Coarse Models
We consider two-phase flow in a subsurface formation (denoted by ) under the assumption that the displacement is dominated by viscous effects. For clarity of exposition, we neglect the effects of gravity, compressibility, and capillary pressure, although our proposed approach is independent of the choice of physical mechanisms. Porosity will also be considered to be constant. The two phases will be referred to as water and oil (or a non-aqueous phase liquid), designated by subscripts w and o, respectively. We write Darcy’s law for each phase as:
(1)
where is the phase velocity, k is the permeability tensor, is the relative permeability to phase j (), S is the water saturation (volume fraction), and p is the pressure. Combining Darcy’s law with a statement of conservation of mass allows us to express the governing equations in terms of pressure and saturation equations:(2)
(3)
where is the total mobility, is a source term, f is the fractional flux of water, and is the total velocity, which are respectively given by:(4)
(5)
(6)
The above descriptions are referred to as the fine-scale model of the two-phase flow problem.As for the coarse-scale model, we consider single-phase flow-based multiscale simulation methods. This technique is similar to upscaling methods [25], except that instead of computing effective properties, multiscale basis functions are calculated. These basis functions are coupled through a variational formulation of the problem. For multi-phase flow and transport simulations, the conservative fine-scale velocity is often needed. For this reason, mixed MsFEM is used. We refer to [26,27] for the mixed MsFEM method and its use in two-phase flow and transport. In our simulations, the multiscale basis functions are computed for the velocity once with . These basis functions are used later without any update for solving two-phase flow equations. As a result, we obtain a coarse-scale velocity field that is used for solving the transport equation on the coarse grid. We note that mixed MsFEM can be implemented on unstructured grids [28].
In this article, we will infer about the permeability field based on fractional flow, denoted by . F for a two-phase water-oil flow is defined as the fraction of water in the produced fluid and is given by , where , with and representing the flow rates of oil and water respectively at the production edge of the model. In mathematical notation,
(7)
where is the outflow boundary and is the normal velocity field. The fractional flow of oil is . It is to be noted that the inference based on fractional flow of water or fractional flow of oil are equivalent.2.2. The Bayesian Inverse Problem
Our main objective is to quantify the uncertainty in the permeability field given the observed fractional flow data. For a given permeability field k, we denote the corresponding fractional flow obtained by solving the forward model as . The corresponding observed fractional flow data is denoted by . The non-linear forward mapping from permeability field k to is not one-to-one. In addition to that, the observed data also contain measurement errors. We define the combined model error and measurement error as a random error . The model can be written as:
(8)
where is distributed as , i.e., is assumed to be . The Bayesian solution to the inverse problem is given by the posterior probability distribution .In the following subsection, we will represent the facies and interfaces of the permeability field through K-L expansions. The parameterization of the interfaces will be based on pseudo-velocity fields in a level set method. The parameter space to express smooth velocity fields is usually small, and thus one can substantially reduce the dimension of the parameter space and at the same time preserve the contrast in facies properties. Since the number of parameters is the dimension of the inverse problem, a small dimension requires less computation to obtain convergence results.
Let parameterize the permeability field within facies and parameterize the velocity in the level set method, then the permeability field k is completely parameterized by and . We assume and are independent. By Bayes’ theorem the posterior distribution can be written as:
(9)
where is the likelihood, and and are the prior for and respectively. Without proper regularization, such an inverse problem of inferring about the permeability given the fractional flow data is ill-posed due to the non-linear forward model. However, the Bayesian formulation contains a natural mechanism of regularization in the form of prior probability distribution and casts the inverse solution in the form of the posterior distribution.2.3. Parameterization of Permeability Field
In this section, we introduce the parameterization of the permeability field. First, a heterogeneous permeability field is decomposed into several high and low permeable subregions, where each region represents a facies (see Figure 1 for illustration). This type of hierarchical representation allows us to write of the permeability field as:
(10)
where is an indicator function of region (i.e., if and otherwise). We then need to parameterize facies and interfaces. For interface, we use the level set method, while the permeability field within each facies is assumed to follow a log-Gaussian distribution with a known spatial covariance. It is to be noted that in our approach the permeability field description is defined on a finite dimension whereas the partial differential equations (PDEs) to solve the forward problem are defined on an infinite dimensional setting.2.3.1. Parameterization of Interfaces
Suppose that any interface is a zero level set function . The evolution equation for an interface is given by:
(11)
where w is a pseudo-velocity field and is pseudo-time. We denote as the interface if there are more than one, then can be written as for different interfaces. More information on the level set method can be found in Refs. [20,22]. A key is to specify w for (11) to describe and update the boundaries of facies.We consider a set of pseudo-velocity fields W, where {w admits fixed streamlines, and is constant along streamlines on }. In other words, we assume that the direction of the streamlines is fixed and the magnitude of the velocity along a certain streamline is the same. However the magnitudes of the velocity vary among different streamlines. In general, one can take streamline directions to be random. To keep the dimension of the parameter space small, we take streamlines to be fixed. For example, in our numerical experiment, vertical streamlines are used. We assume further that the magnitude of velocity field follows the expansion,
(12)
’s are a spatial basis for the magnitude of the velocity field and defined on the lower dimensional space of the interface, i.e., lives in , where . For example, assume that is a second order stochastic process on with a given covariance structure, then in (12) is basis. In this case, is expressed to be a K-L expansion. More details about K-L expansion will be recalled in the next subsection.Now, if the initial interface is at , the interface at can be written as . Any interface corresponds to a pseudo-velocity field and a time . Therefore, all interfaces of interest can be generated through the evolution Equation (11), with pair . The following lemma proves that the set of interfaces generated through this one-step movement is well defined. Otherwise, the map between the interface set and the pseudo-velocity field W is not one-to-one.
For any , ∃ with Expansion (12), such that can be obtained from through evolution Equation (11).
For any with and , , the new interface formed by moving the initial with and consecutively in time and is .
Assuming , we can choose , and let , so . For with , we have the distance of any particle in an interface moved by (11) in time interval as the arc length:
Since , , and have the same direction at any location, this implies that . Therefore, the new interface:
Namely, any interface can be obtained by moving the initial interface in a certain time period once by a , a Gaussian random field with deterministic direction. □In our numerical experiments, we consider vertical streamlines in . The pseudo-velocity is then and the magnitude along streamlines is assumed to be , , . The Lemma holds for this case, and Figure 2 illustrates the process. To get a simpler model, we also have numerical examples determining the velocity field via its values at certain discrete locations. The velocity values at these locations are updated as shown in Figure 3. In this case, the basis functions ’s in (12) are taken to be the indicator functions for each location, and ’s are assumed to be linear between nodes.
2.3.2. Parameterization within Facies
Now we describe the parameterization of the permeability field within the facies. The function is often considered instead of as permeability is positive and distributed right skewed. Since one of the most commonly used stochastic description of spatial fields is based on a two-point correlation function, our parameterization of permeability field then starts from the correlation function of , i.e., , where refers to the expectation and x, y are points in the spatial domain. is assumed to be known. The Karhunen–Loève expansion [29] can be used to get a parameterization of permeability field or as a linear combination of a random component and spatial dependent component. The expansion is done by representing the permeability field in terms of an optimal basis. By truncating the expansion, we can represent the permeability matrix by fewer random parameters.
Here we briefly recall some properties of the K-L expansion. Suppose is a second order stochastic process with . For simplicity, we assume that Given an orthonormal basis in , we can expand as a general Fourier series:
The special basis which makes the random variables uncorrelated is of interest here. Namely, for all . The basis functions satisfy:
Since is a complete basis in , it follows that are eigenfunctions of ,(13)
where . Furthermore, we have:Denote , then satisfies and . It follows that:(14)
where and satisfy (13). The eigenvalues s can be ordered as . The basis functions are deterministic and resolve the spatial dependence of the permeability field. The randomness is represented by the scalar random variables s. The expansion (14) is called the Karhunen–Loève expansion.In practice, a K-L expansion with finite terms rather than the infinite expansion (14) is used. Given an analytical correlation function , we can represent it on a discretized mesh. If we discretize the domain by an rectangular mesh, we can obtain at most N pairs of eigenvalues and eigenvectors . The ’s are discrete fields. To simplify the notation, we still use to denote . In this case, the continuous K-L expansion (14) is reduced to finite terms. We get a “discretized” K-L expansion:
(15)
3. Posterior Error Introduced by Truncation
In this section, we study the regularity of the posterior distribution with respect to the parameterization errors. The errors are due to using a truncated series for the velocity and the parameterization of permeability Y within facies. In future discussion, we use w to denote for simplicity.
Consider a permeability field in , see Figure 1, which has s facies and interfaces . The number of interfaces are assumed to be known and the interfaces are continuous. Each facies is described by a covariance matrix as in Section 2.3.2. Then, the permeability field is a function given by:
where is an indicator function on . Through K-L expansion, the permeability of each facies is , and each interface is formed by moving the initial interface by a velocity field with deterministic direction as in Section 2.3.1. Then, the permeability field can be written as:Considering the finite discretized case allows us to write and in each as , , and , , with . Note that and usually decrease to 0 fast, the truncated K-L expansions, i.e., and can be used to reduce the dimension of the parameter space, which in turn would save CPU time while sampling from the posterior distribution. We denote and , where describe the permeability field within the facies and describe the interface. and are truncations of and , respectively. Then, the corresponding representations of the permeability field in full and truncated case are given by:
Correspondingly, the two posterior distributions of the permeability field in the Bayesian framework are given by:
where , , and is the observed fractional flow data. The priors is assumed to be the product Gaussian measure.It is clear that this truncation process affects the posterior inference. Our goal here is to estimate the error introduced by this truncation, which can also provide a guidance to choose and for specific requirements. A case with single facies is considered first before we state the main theorem involving multiple facies.
Assume that the assumptions in Appendix A hold, and suppose the log-permeability field is a stationary spatial process on a bounded region, with the truncation , and is a square integrable with respect to Gaussian measure, i.e., , then:
(16)
where C is independent of dimension N.To prove this result, the main estimation of the fractional flow error is obtained from estimation of the governing PDE system (1)–(3). Details are provided in Appendix A. For a general case, when the permeability field has more than one facies, we can state the main theorem.
Assume that the assumptions in Appendix A hold, and suppose the discretized K-L expansion of the log permeability field and the random velocity field are given by and , where all and are stationary spatial processes on a bounded region, and the truncated expansions are and respectively. Assume that is a square integrable function with respect to Gaussian measure, i.e., , then:
(17)
where and are independent of dimension and .Note that:
where:It is clear that Lemma A3 can be generalized to the multi-facies case to get . Then,
by Theorem 1. To estimate , the corresponding error estimation for permeability fields is also needed, i.e.,Then we can estimate as:
Since all and are stationary spatial processes on a bounded region, i.e., spatial processes where the covariance function depends only on the distance not on the spatial location, the eigenfunctions and are uniform bounded (see [30]). Thus: Please note that as and are independent of and , when and , we have . □The theorem here shows the possibility to lower the dimension of the inverse problem by truncating K-L expansions. Without truncation, the dimension of K-L expansions of the permeability field and pseudo-velocity field, which is decided by the dimension of discretization, can be very large. A large dimension requires more iterations for the MCMC method to converge and therefore makes the computation expensive, if not infeasible. The theorem provides a bound to the error of the two posteriors and confirms the validity of the truncation of the parameter space without introducing unbounded errors. To choose and a priori, estimations of and are needed. The constants and are independent of and , but they are dependent on the square integrable function f and other factors. It is also possible to estimate C numerically.
4. MCMC Sampling from the Truncated Posterior
In this section, we introduce the sampling scheme used in the numerical examples in Section 5.2. To see the advantages of this scheme, we first state the standard Metropolis–Hastings algorithm and then point out the motivation of using the scheme.
For channelized permeability field, the standard Metropolis–Hastings (M-H) algorithm can be formed in the following way to sample from the truncated posterior distribution .
Algorithm (M-H MCMC [31]): Suppose at the step, we have permeability field .
Step 1. Generate from a distribution and from a distribution . Then the entire permeability field is proposed using (10).
Step 2. Accept k as a sample with probability:
(18)
i.e., take with probability , and with probability . □Starting with an initial permeability sample , the MCMC algorithm generates a Markov chain with the transition kernel as:
The target distribution is the stationary distribution of the Markov chain , so represents the sample generated from after the chain converges and reaches a steady state.The main disadvantage of the MCMC algorithm is the high computational cost in solving the coupled nonlinear PDE system (1)–(3) on the fine-grid in order to compute in the target distribution . Typically, the MCMC method in our simulations converges to the steady-state after thousands of iterations, and the acceptance rate is also quite low. A large amount of CPU time is spent on simulating the rejected samples. The MCMC method can be improved by adapting the proposal distribution to the target distribution using a coarse-scale model. The process essentially modifies the proposal distribution by incorporating the coarse-scale information. The algorithm for a general two-stage MCMC method with upscaling was introduced in [32].
Let be the fractional flow computed by solving the coarse-scale model of (1)–(3) for the given k. This is done with mixed MsFEM [28]. Mixed MsFEM is used to solve pressure, and saturation is solved on a coarse grid. The fine-scale target distribution is approximated on the coarse-scale by . Here, we have:
where the function is estimated based on offline computations using independent samples from the prior. Using the coarse-scale distribution as a filter, the two-stage MCMC can be described as follows.Algorithm (Two-stage MCMC [32]): Suppose at the step, we have , , and permeability field .
Coarse stage:
Step 1. At , generate a trial proposal from distribution and . The fractional flow is computed by solving the coarse-scale model.
Step 2. Take the proposal as:
The acceptance probability is given by:(19)
Therefore, the final proposal k is generated from the effective instrumental distribution:If , go to the Step 3. Otherwise, i.e., , return to Step 1.
Fine stage:
Step 3. Accept k as a sample with probability:
(20)
i.e., with probability , and with probability . □Using the argument as in [32], the acceptance probability (20) can be simplified as:
In our numerical example, we use a random walk to generate proposals, i.e., at the step, we propose , where is generated from a distribution. Similarly, we propose , where is also generated from a distribution. Here and represent the step size of the jump in each iteration of the Metropolis–Hastings algorithm. The values of and control the convergence of the MCMC algorithm. The prior distribution of can be taken to be . Similarly, the prior distribution of can be taken to be . Then, our acceptance probability (18) is given by:(21)
The acceptance probability (19) in the two-stage MCMC algorithm is similar.We also use a simple relation for modeling coarse- and fine-scale errors. In particular, is taken to be a linear function with the condition . Then, becomes:
i.e., on the coarse-scale is assumed to follow distribution, where is the precision associated with the coarse-scale model. The parameter plays an important role in improving the acceptance rate of the preconditioned MCMC method. The optimal value of depends on the correlation between and , which can be estimated by offline computations.Assuming that on the fine-scale follows a distribution, i.e.,
the acceptance probability (21) becomes:(22)
5. Numerical Results
5.1. Convergence Estimation
The numerical results are presented here to validate the results from Theorems 1 and 2 on truncation error estimation. We consider the water-cut function (7) as our in the theorems, because it is an important property for reservoirs. First, the tests are completed for single facies permeability field, and then channelized cases are considered.
5.1.1. Single Facies
In the first simulation example, a permeability field without any channelized structure is considered. To describe the permeability field, a two-point correlation function is defined as:
(23)
K-L expansion is then used to describe the permeability field. (the quantity of interest) is taken to be water-cut function F. One injector at and one producer at are considered when we run the forward model in the reference permeability field to get the fractional flow as discussed in Section 2.1. The numerical results for the error estimation are shown in Table 1 and Table 2.We consider two sets of correlation lengths in our numerical examples. In the first example, we take , , and in a fine-scale grid. ’s are generated from a log-normal distribution. Eigenvalues decrease rapidly as shown in Figure 4.
The MCMC method with a random walk proposal of step-size is used to get samples from . A different number of K-L terms are taken into account. The chains are run for 10,000 iterations with the first 500 samples as the burn-in period. The Monte Carlo integration retaining all the terms in the discrete K-L expansion is considered to be the true value of . Samples with a different number of truncated terms are taken to compute in different cases to compare with the true one.
In the second example, we take the case with and . Table 1 and Table 2 show the results with a different number of truncated K-L terms M for the first and second example respectively. In both cases, the errors decrease with the same convergence rate related to the sum of eigenvalue remainders of . This can be observed more clearly from Figure 5, where the data sets can be fitted as a straight line. Namely, the relationship between and is linear as shown in Theorem 1, while ignoring the errors in computing .
5.1.2. Channelized Reservoirs
In our next example, we consider a permeability field with three facies. It is assumed that there is a high permeability layer in the middle and low permeability layers in the two ends. The corresponding two interfaces are chosen randomly with the condition that the upper facies boundary is always above the lower facies boundary. The two different channels are populated using two log-Gaussian random fields generated from truncated K-L expansions with two-point correlation function (23). The high permeable layer has correlation lengths , , and , and the low permeable layer has correlation lengths , and . For both interfaces, a 1-d version of Equation (23) is used with correlation length and .
We take the generated permeability field as the reference, and run the forward model with one injector at and one producer at in this reference permeability field to get the fractional flow data . An MCMC chain is run for 10,000 iterations to get the posterior samples of the permeability field, with the first 500 samples as a burn-in period.
The estimations of posterior errors for a different number of terms in the truncations of K-L expansions, similar to Table 1 and Table 2, are reported in Table 3. The Monte Carlo integration retaining all the terms in the discrete K-L expansions is considered to be the true value. In Table 3, we can see that the error between the true value and the estimated value from the truncated posterior decreases consistently as we increase the number of the terms retained in K-L expansion. If we further plot the errors, we can see that they lie on a plane (see Figure 6) as indicated in Theorem 2.
5.2. Matching Permeability with Reduced Parameters
In this example, we will show that the reference permeability field can be recovered from matching the observations and that the accuracy of such estimates is certainly affected by the truncation of expansions. We consider a high permeable layer in the middle and low permeable layers in the two ends with the same correlation lengths as in Section 5.1.2. The interfaces are taken as a linear interpolation of independent points.
In the first part, we truncate the K-L expansion and retain only the first 20 terms for the two permeability fields. We consider 25 points on the facies. So the dimension of is 40 and the dimension of is 25. The two-stage MCMC method is used to sample from the posterior. The initial facies boundaries are taken to be straight lines joining the two ends of the known facies boundaries. We use random walks to perturb and with step sizes and , respectively, and with independent Gaussian priors for and . We run the MCMC chain for 10,000 iterations and leave out the first 500 samples as the burn-in period.
In Figure 7, the reference permeability field, the initial permeability field, and the mean of the posterior permeability field are shown. We can see that the sample mean is very close to the reference field. On the left plot of Figure 8 we can see that the sample estimate of the fractional flow of oil is quite close to the observed data. From the right plot of Figure 8 we can see that combined error decreases nearly to zero and stays there, which shows that the Markov chain has converged. The two-stage MCMC has a higher acceptance rate [32] (four times in these calculations) because it rejects the bad proposal quickly in the first stage, which is inexpensive. Next, we repeat the same procedure of sampling the posterior but we retain 25 terms in the K-L expansion for the two permeability fields. We use the same reference permeability field and the fractional flow of oil data. The numerical results are shown in Figure 9 and Figure 10. We can see the sampled mean of the permeability field is more accurate than the previous example with 20 K-L coefficients.
6. Conclusions
In this article, subsurface characterization for flows in highly heterogeneous porous media is studied. We consider channelized spatial fields to describe the permeability field where channel boundaries are assumed to have random locations and described via a level set approach. In particular, we use smooth velocity fields to change the channel boundaries within a level set framework and, thus, the parameterization of channel boundaries can be mapped to that of smooth velocity fields. This gives a reduced dimensional parameterization. Permeability distribution within each channel is assumed to be log-Gaussian and described via K-L expansion. One of our main contributions is the study of the regularity of posterior distribution. In particular, we study errors introduced in the posterior measure by truncating the prior distribution. The result from the theorem allows us to carry out the Bayesian uncertainty analysis in a finite-dimensional space. This makes the analysis easy and avoids involving “infinite” dimensional probabilistic spaces. We show that the posterior error introduced by truncation is bounded by a function of eigenvalues up to a constant, where this constant is independent of the dimension of the stochastic space. The latter guarantees that the truncation of K-L expansion based on a discretization will not introduce an unbounded error for the corresponding posterior distribution. The subsurface characterization within the Bayesian framework is based on the MCMC samples from the posterior distribution. We use an efficient two-stage MCMC that utilizes mixed MsFEM to screen the proposals. The numerical results show the validity of the proposed parameterization to interfaces and the error estimations.
One limitation of our study is that the results about the posterior error bounds are based on the assumptions that the saturation is a smooth field and permeability is a second-order stationary spatial process whose prior distribution is a Gaussian process. The complete set of assumptions are given in Appendix A. Although the stationarity assumption is quite reasonable for permeability description within facies boundaries, relaxing some of these assumptions could be beneficial for more complex reservoir models. In future, we would like to extend our Bayesian methodology for non-stationary permeability fields. Studying the posterior error bounds for non-Gaussian spatial processes and non-smooth velocity fields are also of interest for future research.
Author Contributions
Conceptualization, A.M. and J.W.; methodology, A.M. and J.W.; software, A.M. and J.W.; validation, A.M. and J.W.; formal analysis, A.M. and J.W.; investigation, A.M. and J.W.; resources, A.M. and J.W.; writing—original draft preparation, A.M. and J.W.; writing—review and editing, A.M. and J.W.; visualization, A.M. and J.W.; supervision, A.M. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Acknowledgments
We would like to thank the reviewers for their valuable comments and suggestions, which helped us to improve the quality of the article.
Conflicts of Interest
The authors declare no conflict of interest.
Abbreviations
The following abbreviations and mathematical notations are used in this manuscript:
K-L | Karhunen–Loève |
MCMC | Markov chain Monte Carlo |
MsFEM | Multiscale finite element method |
M-H | Metropolis Hastings |
PDE | Partial differential equation |
Velocity of phase j | |
k | Permeability |
Relative permeability to phase j | |
S | Water saturation |
p | Pressure |
f | Fractional flux of water |
Total mobility | |
Fractional flow | |
Outflow boundary | |
Normal velocity field | |
Observed fractional flow data | |
Fractional flow obtained by running the forward model to permeability k | |
Random error | |
Error variance | |
w | Pseudo-velocity field |
Pseudo-time | |
The interface | |
Eigen value from the K-L expansion of permeability | |
Eigen vector from the K-L expansion of permeability | |
K-L coefficient | |
Coefficient for the velocity field | |
Spatial basis for the velocity field | |
Y | Log permeability |
Posterior | |
Prior | |
Flowmap | |
T | Time to flight |
Appendix A
Our goal is to estimate the difference in expected values of a function with respect to two different posteriors, where one of them is a truncation of the other. We consider the domain and assume that , , and , where p is pressure, k is permeability field, and v is velocity. The lemmas and theorems in this section are obtained under assumptions described in the following paragraph.
Assumptions: (i) Without loss of generality, we assume and on ; on ; and no flow boundary conditions on the lateral boundaries and . (ii) The saturation is a smooth field. Note that if the velocity and initial conditions are smooth functions, then the saturation will be a smooth spatial field. (iii) The permeability field k is a stationary spatial process. (iv) The prior distribution is multivariate Gaussian distribution with identity covariance matrix.
Assume that (i)–(iv) hold, then we first find an upper bound of the difference between two saturation fields via the difference of the permeability fields in an appropriate norm.
Lemma A1.
, where and are water saturations.
Proof.
In order to estimate the difference of saturation, we need the concept of time of flight. For a particle that starts at a point ℘ at and moves with velocity v, the flow map is its position at time , i.e.,
Time of flight T characterizes particle motion under the velocity field, since velocity is a function of the spatial variable:Then, by [33] we have:
(A1)
since .On the other hand, , therefore,
In addition, since , then , and further , so:
Therefore,
(A2)
Then, from (A1) and (A2), we have:
□
In the Bayesian framework, the reference fractional flow or water-cut is matched to the observed data to get the target posterior distribution. Next, we will estimate the difference between two water-cut responses via the corresponding permeability fields.
Lemma A2.
, where and are permeabilities, and and are water-cut functions.
Proof.
Note that:
Using and , it follows for any , since v is divergence free. Then,
A similar result can be obtained for . Then,
by Lemma A1. □Next, we consider the case with single facies and the permeability that is described via K-L expansion. In particular, we assume and consider the truncated expansion . Then, the posterior distributions can be written as:
where is the posterior needed to be sampled, is an approximation of , and is the prior distribution. and are likelihoods, where:Next, we estimate the difference between G and .
Lemma A3.
.
Proof.
It is clear that and are bounded. Then, by Lemma A2:
□
Theorem A1.
(Theorem 1) Suppose that the permeability field k is a stationary spatial process on a bounded region and is square integrable with respect to Gaussian measure, i.e., , then:
(A3)
where C is independent of dimension N.
Proof.
If is square integrable with respect to Gaussian measure (e.g., a polynomial function), we can show that:
To estimate the error of truncation of K-L expansion, let and . We assume for simplicity, then:
where: because ’s are bounded, and:Since k is a stationary spatial process on a bounded region, i.e., for a spatial process where the covariance function depends only on the distance not on the spatial location, then by [30], is uniform bounded. Thus,
□
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures and Tables
Figure 2. Interface evolution by moving initial interface with different vertical velocity fields.
Figure 5. Linear fit of {(∑i=M+1Nλi(θ))12,|EπF−Eπ˜F|}. Left: l1=0.1, l2=0.4, σ2=2, and σf2=0.001; Right: l1=0.2, l2=0.2, σ2=2, and σf2=0.005.
Figure 6. Plots of (max{(∑j=M1+1N1λ1j(θ))12, (∑j=M2+1N2λ2j(θ))12},(∑i=M˜+1N˜λi(α))12,|EπF−Eπ˜F|).
Figure 7. Top left: The true log-permeability field. Top right: Initial log-permeability field. Bottom left: One of the sampled log-permeability field. Bottom Right: The mean of the sampled log-permeability field from two-stage MCMC using 20 K-L terms.
Figure 8. Left: Red line designates the fine-scale reference fractional flow of oil, the blue line designates the initial fractional flow of oil, and the green line designates fractional flow of oil corresponding to mean of the sampled permeability field from two-stage MCMC. Right: Fractional flow errors vs. accepted iterations when sampled from the posterior distribution retaining 20 terms in K-L expansion.
Figure 9. Top left: The true log-permeability field. Top right: Initial log-permeability field. Bottom left: One of the sampled log-permeability field. Bottom Right: The mean of the sampled log-permeability field from two-stage MCMC using 25 K-L terms.
Figure 10. Left: Red line designates the fine-scale reference fractional flow of oil, the blue line designates the initial fractional flow of oil, and the green line designates fractional flow of oil corresponding to mean of the sampled permeability field from two-stage MCMC. Right: Fractional flow errors vs. accepted iterations when sampled from the posterior distribution retaining 25 terms in K-L expansion.
Posterior errors when the K-L expansion is truncated to M terms. Here , , , and .
M | ||
---|---|---|
5 | 1.111681 | 0.081809 |
10 | 0.750662 | 0.106264 |
15 | 0.517555 | 0.063635 |
20 | 0.337901 | 0.030207 |
25 | 0.189272 | 0.017931 |
30 | 0.071924 | 0.011225 |
Posterior errors when the K-L expansion is truncated to M terms. Here , , , and .
M | ||
---|---|---|
5 | 1.176697 | 0.308118 |
10 | 0.820661 | 0.191601 |
15 | 0.566938 | 0.119590 |
20 | 0.378454 | 0.059173 |
25 | 0.248267 | 0.033023 |
30 | 0.123347 | 0.014965 |
Posterior errors when the K-L expansion is truncated to M terms for different facies.
5 | 5 | 5 | 0.526235 | 0.786077 | 0.853727 | 0.109464 |
10 | 5 | 5 | 0.367011 | 0.786077 | 0.853727 | 0.116172 |
10 | 10 | 10 | 0.367011 | 0.530798 | 0.477141 | 0.051925 |
15 | 10 | 10 | 0.253542 | 0.530798 | 0.477141 | 0.093109 |
15 | 15 | 10 | 0.253542 | 0.365967 | 0.477141 | 0.053869 |
20 | 15 | 15 | 0.169250 | 0.365967 | 0.210844 | 0.047356 |
20 | 20 | 15 | 0.169250 | 0.238932 | 0.210844 | 0.019996 |
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
In this article, we study uncertainty quantification for flows in heterogeneous porous media. We use a Bayesian approach where the solution to the inverse problem is given by the posterior distribution of the permeability field given the flow and transport data. Permeability fields within facies are assumed to be described by two-point correlation functions, while interfaces that separate facies are represented via smooth pseudo-velocity fields in a level set formulation to get reduced dimensional parameterization. The permeability fields within facies and pseudo-velocity fields representing interfaces can be described using Karhunen–Loève (K-L) expansion, where one can select dominant modes. We study the error of posterior distributions introduced in such truncations by estimating the difference in the expectation of a function with respect to full and truncated posteriors. The theoretical result shows that this error can be bounded by the tail of K-L eigenvalues with constants independent of the dimension of discretization. This result guarantees the feasibility of such truncations with respect to posterior distributions. To speed up Bayesian computations, we use an efficient two-stage Markov chain Monte Carlo (MCMC) method that utilizes mixed multiscale finite element method (MsFEM) to screen the proposals. The numerical results show the validity of the proposed parameterization to channel geometry and error estimations.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 Department of Mathematics, Applied Mathematics, and Statistics, Case Western Reserve University, Cleveland, OH 44106, USA
2 Bayesian Learning Corp., 825 S Golden West Ave, Arcadia, CA 91007, USA;