TetraFEM: Numerical Solution of Partial

Full text

Turn on search term navigation

1. Introduction

Partial differential equations (PDEs) describe many real-life processes in the observable world in lots of practical fields such as engineering, economics, sociology, biology, etc. If one could solve such an equation, i.e., calculate the target function that defines the process, the evolution of the process becomes predictable. The owner of the solution gets an opportunity to simulate the process on a computer to acquire extensive knowledge about its behavior and, therefore, obtain a huge advantage. So the solutions of PDEs are of huge interest to science and industry. There are families of methods for treating PDEs numerically, including finite difference methods, finite volume methods, and finite element methods.

The equations that are interesting in a practical sense are often very complex [1]. Analytical (exact) solutions either do not exist or are extremely hard to find. In order to make progress, numerical methods are employed to produce a solution as a finite set of approximated values.

One of the most flexible and efficient methods for the numerical solution of PDEs is the finite element method (FEM) [2,3]. It is one of the most popular methods for boundary value problems arising in engineering and mathematical modeling in 2D and 3D. Its most appealing feature is the convenience of handling domains of complicated shapes. The main idea of the finite element method is to approximate a target function, $u (x, y)$ , as a sum of N known basis functions ${φ_{i} (x, y)}_{i = 1}^{N}$ multiplied by coefficients ${{\bar{u}}_{i}}_{i = 1}^{N}$ :

(1) $u (x, y) \approx u_{N} (x, y) : = \sum_{i = 1}^{N} {\bar{u}}_{i} φ_{i} (x, y) .$

These basis functions are linked with the discretization of the problem, i.e., to obtain a better solution, one must push the number N.

The main limitation of this approach is the inevitable trade-off between computational resources such as computer RAM, physical time, and the quality of the solution. The production of a satisfactory result usually takes very dense discretization, which leads to enormous amounts of computer RAM and processor runtime being consumed by the algorithm. There are numerous ways to improve the method in various aspects, including extended FEM [4], the virtual element method [5], and enrichment FEM [6], but in the present work, we adhere to a basic approach.

To tackle these challenges, we employ Tensor Train (TT) methods for the numerical solution of PDEs. The Tensor Train decomposition [7] can be used for a compressed approximate representation of multidimensional arrays—it decomposes a huge tensor into a composition of smaller ones, with an exponential reduction in the number of encoding parameters, as shown in Figure 1. At the same time, the complexity of all algebraic operations in this format also has poly-logarithmic scaling in terms of the full tensor size [7]. The quality of the compression is determined by the so-called Tensor Train ranks of a tensor. For example, any smooth analytical function computed on a uniform grid has restricted TT-ranks [8,9,10,11], and so, it is compressed efficiently, but a random vector possesses the full rank (which is equal to the size of the vector), and so, it does not make sense to perform a tensor decomposition of it. Eventually, a dedicated Functional Tensor Train format [12,13] was developed specifically as a continuous function approximation technique.

Tensor trains are widely researched in the context of different numerical problems, including solutions of linear systems of equations [14], optimization [15,16], machine learning [17,18,19], and numerical solutions of PDEs [9,20,21,22,23,24].

In addition, an important feature of tensor networks is that they can be quite efficiently mapped to a quantum computer [25].

Unlike similar works on the application of Quantized Tensor Trains to finite element discretizations of PDEs [26,27], this work considers the following:

Universal nonlinear domain transformer tailored to the QTT format. In particular, our transformer is suitable for curvilinear rectangles and degenerate angles, which reduces the number of subdomain splittings and simplifies their structure, leading to lower QTT ranks.
Efficient assembly of FEM matrices in the course of iterations for a nonlinear problem and/or time stepping in a time-dependent problem. In particular, the stitching of matrices corresponding to different subdomains is free from the element-wise manipulations which are suboptimal for the TT structure. Instead, the total matrix is given simply by matrix-TT products of precomputed reference-generating matrices and a diagonal matrix made of a QTT tensor of coefficients.

The paper is organized as follows. In Section 2, we describe the tensor format and the basic finite element method. Section 3 presents the domain transformer as well as the algorithms for the operator assembly and subdomain stitching. In Section 4, we provide the results for different test problems, including the Poisson equation and incompressible Navier–Stokes equations. We conclude the paper in Section 5.

2. Background

2.1. Quantized Tensor Train Format

Tensor Train and its special case, Quantized Tensor Train (QTT), are methods for compressed representation of multidimensional arrays [28] which are generalizations of matrix decompositions like SVD and skeleton decomposition [29] onto d-dimensional tensors. Tensor Train decomposition is given by

(2) $\begin{matrix} \begin{matrix} x (i_{1}, \dots, i_{d}) = \sum_{α_{0}, \dots, α_{d - 1}, α_{d}} & G_{1} (α_{0}, i_{1}, α_{1}) G_{2} (α_{1}, i_{2}, α_{2}) \\ \dots G_{d} (α_{d - 1}, i_{d}, α_{d}), \end{matrix} \end{matrix}$

where

G_{j}

is the 3-dimensional tensor called the TT-core. The main characteristic of such a representation is the TT-rank, r, which is equal to the maximum size among bond indices

α_{0}, α_{1}, \dots, α_{d}

and expresses the correlations between the dimensions of the tensor. In the case of weak correlations (low rank r), this format allows one to store and perform operations with tensors with logarithmic complexity in their size. The case of separable indices, for example, leads to the TT-rank of one. The graphical representation of a tensor train is shown in Figure 1. The TT operator (matrix) has two spatial indices for each dimension

(3) $\begin{matrix} \begin{matrix} A ((i_{1}, j_{1}), \dots, (i_{d}, j_{d})) = \sum_{α_{0}, \dots, α_{d - 1}, α_{d}} & G_{1} (α_{0}, i_{1}, j_{1}, α_{1}) G_{2} (α_{1}, i_{2}, j_{2}, α_{2}) \\ \dots G_{d} (α_{d - 1}, i_{d}, j_{d}, α_{d}) . \end{matrix} \end{matrix}$

The Quantized Tensor Train format is obtained by reshaping the original tensor so that all its outer dimensions are 2. It allows us to apply tensor decompositions to problems that are not particularly high-dimensional and still obtain the benefits of tensor decompositions. Also, for a wide range of functions, including exponential, polynomial, trigonometrical, and its combinations, the QTT ranks have low theoretical estimations.

It can be seen in (2) that the memory required to store the compressed tensor is $O (n d r^{2})$ vs. $O (n^{d})$ memory of the full tensor, which leads to the exponential advantage in the case of small TT-ranks.

In this subsection, as well as in Table 1, we list all the operations regarding tensors, which are used in our method, and provide the designations:

Basic linear algebra operations between tensors.
Required operations include summation (+), Hadamard (element-wise) multiplication (∘), finding matrix–matrix and matrix–vector products (@ and $MV (\cdot, \cdot)$ respectively) and Kronecker product ( $kron (\cdot, \cdot)$ ). Also, in this work, an optimization-based version of matrix–matrix product ( $AMEnMM (\cdot, \cdot)$ ) is used.
Conversion between full and compressed format, tensor rounding.
To analyze the results, it must be possible to switch between different representations of the data. The algorithms for compressing and decompressing the data are straightforward [7]. Rounding is the procedure of creating a tensor similar to the given one, but with a lower rank with specified accuracy. The rounding is required to limit the ranks that are increasing during the solution according to Table 1.
Cross-approximation of functions.
This algorithm lets one approximate a function in TT format with only $O (d n r^{2})$ function evaluations [30]. This algorithm is used to represent the Jacobian of the area transformation and to set the initial and boundary conditions.
Matrix construction.
Tensor Train format for TT-matrices allows one to perform matrix transposition, construction of a diagonal matrix from a tensor and constructing a shift matrix of given dimensions (i.e., matrix with the ones placed above the main diagonal being the only non-zero elements).
Solution of linear systems.
For performing matrix inversion, we decided to use the Alternating Minimal Energy (AMEn) algorithm, which is a combination of the single-block Density Matrix Renormalization Group [31] and the steepest descent [32] algorithms [14].

All the TT algorithms used in this paper, including the AMEn algorithm for the solution of linear systems [14], are implemented in the ttpy 1.2.0. package for Python [33].

2.2. Finite Element Method

We use the well-known finite element method since it stacks well with the Tensor Train approach and because, unlike finite difference and finite volume methods, FEM provides us with matrix operators of convenient constant shapes while handling complex areas.

For the basic finite element method, formulas for the stiffness and mass matrix elements are as follows:

(4) $\begin{matrix} S_{i j} = \int \int (\nabla φ_{i}, \nabla φ_{j}) d x d y, M_{i j} = \int \int φ_{i} φ_{j} d x d y . \end{matrix}$

Matrices discretizing the first derivatives are derived similarly:

(5) $\begin{matrix} {(D_{μ})}_{i j} = \int \int (φ_{i}, \frac{\partial φ_{j}}{\partial μ}) d x d y, μ \in {x, y} . \end{matrix}$

Here,

φ_{i} (x, y)

are FEM basis functions. These matrices are then computed numerically. We will show later how to efficiently transfer the assembly process to the Tensor Train format. Also, we will observe that such matrices are effectively represented in such a format.

In addition, it is experimentally shown that the finite element matrices usually have low QTT ranks, and therefore, are well-compressible, which can be seen later on in the paper and in [26]. This means that all tensor operations from Table 1 will have lower asymptotic complexity than operations with full and even sparse matrices.

3. Quantized Tensor Train Finite Element Method

In this section, we describe our main contribution: the Quantized Tensor Train framework for the assembly and solution of Finite Element problems in general domains.

3.1. Domain Splitting and Transformation

To solve on the domain D, we cut it on s subdomains $D^{1}, \dots, D^{s}$ , which are curvilinear quadrilaterals connected by the sides. The sides of the subdomain are specified by four parametrized differentiable curves:

$\begin{matrix} f_{k} : t ⟶ (x, y), t \in [- 1, 1], k = 1, \dots 4, \end{matrix}$

and to ensure that the curves coincide at the corners of the subdomain, we assert the following:

$\begin{matrix} \{\begin{matrix} f_{0} (- 1) = f_{3} (- 1), \\ f_{1} (- 1) = f_{0} (1), \\ f_{2} (1) = f_{1} (1), \\ f_{3} (1) = f_{2} (- 1) . \end{matrix} \end{matrix}$

After specifying the functions, the differentiable mapping from a reference $[- 1, 1] \times [- 1, 1]$ square to a quadrilateral can be constructed by transfinite interpolation [34]:

(6) $\begin{matrix} N (ξ, η) = \frac{(1 - η)}{2} f_{0} (ξ) + \frac{(1 + ξ)}{2} f_{1} (η) + \frac{(1 + η)}{2} f_{2} (ξ) + \frac{(1 - ξ)}{2} f_{3} (η) - \\ \frac{(1 - ξ) (1 - η)}{4} f_{0} (- 1) - \frac{(1 + ξ) (1 - η)}{4} f_{1} (- 1) - \frac{(1 + ξ) (1 + η)}{4} f_{2} (1) - \frac{(1 - ξ) (1 + η)}{4} f_{3} (1) . \end{matrix}$

and example is shown in Figure 2. The Jacobian of this transformation will be as follows:

(7) $\begin{matrix} J = [\begin{matrix} J_{11} & J_{12} \\ J_{21} & J_{22} \end{matrix}] = [\begin{matrix} \frac{\partial x}{\partial ξ} & \frac{\partial x}{\partial η} \\ \frac{\partial y}{\partial ξ} & \frac{\partial y}{\partial η} \end{matrix}] = [\begin{matrix} \frac{\partial N (ξ, η)}{\partial ξ} & \frac{\partial N (ξ, η)}{\partial η} \end{matrix}] . \end{matrix}$

3.2. Operators Assembly in Tensor Train Format

To set the finite element mesh, a uniform Cartesian grid of size $2^{d} \times 2^{d}$ is introduced on the reference square and then projected onto each quadrilateral with (6). Here, d is the number of TT-cores of size 2 required to represent the mesh along a single dimension; also, let us denote $n = 2^{d}$ and $N = 4^{d}$ . Using this and also the analytic formula for the Jacobian (7), finite element matrices from (4) and (5) are constructed for each of the s subdomains. So (4) becomes

(8) $\begin{matrix} \begin{matrix} S_{i j} = \underset{[- 1, 1] \times [- 1, 1]}{\int \int} ({(J^{T})}^{- 1} \nabla Φ_{i}, {(J^{T})}^{- 1} \nabla Φ_{j}) | J | d x d y, \\ M_{i j} = \underset{[- 1, 1] \times [- 1, 1]}{\int \int} Φ_{i} Φ_{j} | J | d x d y, \end{matrix} \end{matrix}$

where

Φ

is a first-order Lagrange basis function, so for the finite element discretization of the reference square, bilinear

Q_{1}

elements are utilized.

Let us denote

(9) $\begin{matrix} R = [\begin{matrix} (J_{22}^{2} + J_{12}^{2}) / | J | & - (J_{22} J_{21} + J_{12} J_{11}) / | J | \\ - (J_{22} J_{21} + J_{12} J_{11}) / | J | & (J_{11}^{2} + J_{21}^{2}) / | J | \end{matrix}] . \end{matrix}$

Here

| J |

and

J_{i j}, {i, j} \in {1, 2}

are the functions of

ξ

and

η

To numerically compute the integral, $2 \times 2$ Gauss–Legendre quadrature is employed [35], so instead of the integral, the weighted sum is computed.

Now, we move to the assembly of discretized operators in Tensor Train format. We begin with the description for a single subdomain in this subsection and will describe the concatenation later.

To represent the values of the Jacobian J for the arbitrary $ξ$ and $η$ , the cross-approximation approach is utilized [30]. Since subdomain transformation is defined by explicit differentiable functions, the TT-ranks of the the Jacobians in most cases will be small. So, we cross-approximate the values on the quadrature points mesh of the reference domain as shown in Figure 3. The resulting sets of TT-vectors computed for the Jacobians from (7), (9) we will denote with bold symbols $J, R$ , and $| J |$ for the determinant of the Jacobian. Also we denote the TT vector of Gaussian quadrature weights as $W$ .

The matrix $Φ$ contains the values of basis functions on quadrature points of the reference domain, as shown in Figure 3, and is assembled with Kronecker products. Analogously, $Φ' = [\begin{matrix} Φ_{ξ} \\ Φ_{η} \end{matrix}]$ is the pair of matrices populated with values of partial derivatives of basis functions.

Now we have everything to assemble the finite element matrices:

(10) $\begin{matrix} S = \sum_{μ = 1}^{2} \sum_{χ = 1}^{2} Φ_{μ}' @ diag (W \circ R_{μ χ}) @ {(Φ_{χ}')}^{T}, \\ M = Φ @ diag (W \circ | J |) @ Φ^{T}, \\ D_{x} = Φ @ C_{x}, \\ D_{y} = Φ @ C_{y}, \end{matrix}$

where

(11) $\begin{matrix} C_{x} = diag (W \circ J_{22}) @ {(Φ_{1}')}^{T} - diag (W \circ J_{21}) @ {(Φ_{2}')}^{T}, \\ C_{y} = - diag (W \circ J_{12}) @ {(Φ_{1}')}^{T} + diag (W \circ J_{11}) @ {(Φ_{2}')}^{T} . \end{matrix}$

Here,

diag (\cdot)

is a function that converts a TT-vector into a diagonal TT-matrix. After the assembly, we usually perform

round ()

on the result to significantly truncate the ranks.

3.3. Subdomain Concatenation

During the assembly, subdomain finite element matrices are constructed in TT format on each of the s quadrilaterals. Then, the subdomain matrices are joined into <<big>> domain block matrices of shape $(s N \times s N)$ by the subdomain concatenation. In the end of the assembly, we are left with the mass matrix $M$ , the stiffness matrix $S$ , and derivative matrices $D_{x}, D_{y}$ . In this subsection, we will consider the stiffness matrix $S$ , because the concatenation algorithm is identical for every type of matrix.

The idea here is to unite the basis functions on the boundaries of seamed subdomains while adding a penalizing equation to the block matrix.

Let us define the interface matrices, which are responsible for handling joined boundaries:

$\begin{matrix} P_{i j}^{k} & = \{\begin{matrix} 1 & if i = j and i - th node of k - th subdomain is a node of another subdomain \\ 0 & else, \end{matrix} \\ P_{i j}^{k l} & = \{\begin{matrix} 1 & if i - th node of k - th subdomain is j - th node of l - th subdomain \\ 0 & else . \end{matrix} \end{matrix}$

And the block of the block matrix

S

is defined as follows:

(12) $S^{k l} = \{\begin{matrix} S^{l} + c P^{l} & if k = l \\ P^{k l} S^{l} - c P^{k l} & else, \end{matrix}$

where

S^{k}

is the stiffness matrix of the k-th subdomain, c is the maximum absolute value across all the main diagonals of

S^{k}

for every k.

The assembly of the block matrix is performed with Kronecker product in the TT format:

(13) $S = \sum_{k, l = 1}^{s} S^{k l} \otimes I^{k l},$

where

I^{k l}

is the

s \times s

matrix with the only non-zero element

I_{k l}^{k l} = 1

After the assembly, the TT operators are rounded with high precision to save the important properties of the matrices.

Since it is not possible to modify individual elements of the Tensor Train, the mask approach is used to impose boundary conditions:

$T_{i j}^{I} = \{\begin{matrix} 1 & if i = j and i is a boundary node index \\ 0 & else . \end{matrix}$

Here,

T^{I}

stands for the mask for inner (non-boundary) nodes and

T^{O} = E - T^{I}

stands for the outer (boundary) nodes, where

E

is the indentity matrix. The masks are diagonal matrices so that after multiplying the solution by the mask, the corresponding values are left as is, and the rest are zeroed. Subsequently, to impose the boundary values, one adds up the vector multiplied by the inner mask and the boundary values multiplied by the outer mask:

$u = MV (T^{I}, u^{I}) + MV (T^{O}, u^{O}) .$

3.4. Nonlinear Operators Reassembly with Coefficients

In this subsection we will describe the procedure of assembly of the finite element operators with some coefficient function. This function can be, for example, time-dependent, so having an efficient algorithm for the reassembly is crucial for the performance of the computation. For example, the Navier–Stokes equation has the nonlinear convective operator $(u \cdot \nabla)$ , which depends on the fluid velocities that are non-constant during the simulation.

Also we modify the stitching procedure introduced in [26] by precomputing the block interface matrices. This allows to perform the subdomain concatenation as a matrix–matrix multiplicaton and a matrix addition. From (12) and (13), it directly follows that (for $s = 3$ )

(14) $S = P^{I} @ [\begin{matrix} S^{1} \\ S^{2} \\ S^{3} \end{matrix}] + c P^{II},$

where

$P^{I} = [\begin{matrix} I & P^{21} & P^{31} \\ P^{12} & I & P^{32} \\ P^{13} & P^{23} & I \end{matrix}], P^{II} = [\begin{matrix} P^{11} & - P^{21} & - P^{31} \\ - P^{12} & P^{22} & - P^{32} \\ - P^{13} & - P^{23} & P^{33} \end{matrix}] .$

The full algorithm for the assembly of the nonlinear convective operator for a 3-part domain:

First evaluate the function on the quadrature points. The velocites ${(u^{i})}^{q}, {(v^{i})}^{q},$ $i \in 1, \dots, s$ can be evaluated in TT format on each subdomain by computing the sum of basis functions values over the finite elements (see Figure 3) and then assembled into the big vector. Another way is to use cross-approximation.
Then, we can put the coefficient in between the matrix products in (10) and perform the standard matrix–matrix product and obtain the block-diagonal convective derivative matrices:
(15) $\begin{matrix} D_{x}^{u} = [\begin{matrix} Φ \\ Φ \\ Φ \end{matrix}] @ diag ([\begin{matrix} {(u^{1})}^{q} \\ {(u^{2})}^{q} \\ {(u^{3})}^{q} \end{matrix}]) @ [\begin{matrix} C_{x}^{1} \\ C_{x}^{2} \\ C_{x}^{3} \end{matrix}], \\ D_{y}^{v} = [\begin{matrix} Φ \\ Φ \\ Φ \end{matrix}] @ diag ([\begin{matrix} {(v^{1})}^{q} \\ {(v^{2})}^{q} \\ {(v^{3})}^{q} \end{matrix}]) @ [\begin{matrix} C_{y}^{1} \\ C_{y}^{2} \\ C_{y}^{3} \end{matrix}] . \end{matrix}$
Then the subdomain concatenation is performed according to (14):
$D_{x}^{u} = P^{I} @ D_{x}^{u} + c P^{II}, D_{y}^{v} = P^{I} @ D_{y}^{v} + c P^{II} .$

Here, we propose two approaches:

Performing the assembly by exact matrix–matrix products in Tensor Train format and truncating the excessive ranks by TT rounding. The exact multiplication takes $O (n d r^{6})$ operations, and since it is performed between two high-rank tensors/matrices, the time required between time steps is not optimal.
Employing approximate matrix–matrix multiplication via AMEn. This iterative method requires $O (n d r^{4})$ operations, which is preferable for the high-rank case. Also, the rounding is not required since the relative tolerance is set before the call. The convergence is achieved only with a single sweep of a method because of a close initial guess from the previous time step.

3.5. Example Application: Incompressible Navier–Stokes Equations

The incompressible Navier–Stokes equations are considered:

(16) $\{\begin{matrix} \frac{\partial u}{\partial t} + u \cdot \nabla u = - \nabla p + \frac{1}{R e} Δ u, \\ \nabla \cdot u = 0, \end{matrix}$

where

u = {[u v]}^{T}

stands for the X- and Y-components of the velocity, p is the pressure and

R e

is the Reynolds number. The first equation describes the time evolution of the medium and the second one represents incompressibility.

The Navier–Stokes equations are widely used in different fields of science and engineering, including aircraft modeling, weather prediction, modeling of natural currents, and artificial flows [36].

Chorin’s predictor-corrector scheme is employed to perform the time-stepping procedure [37]. The main idea is to treat viscous and pressure forces separately. Each time step consists of three substeps:

The predictor step. The intermediate velocity, $u^{*}$ , is computed for the momentum equation without the pressure:
$u^{*} = u^{n} + Δ t (- u^{n} \cdot \nabla u^{n} + \frac{1}{R e} Δ u^{n})$
The pressure step. Then Poisson’s equation for the pressure is solved.
$Δ p^{n + 1} = \frac{1}{Δ t} \nabla \cdot u^{*}$
The corrector step. Finally, the pressure term is added to the equation to satisfy the continuity condition.
$u^{n + 1} = u^{*} - Δ t \nabla p^{n + 1}$

The discrete version of the algorithm with vectors and operators in Tensor Train format:

Compute $u^{q}, v^{q}$ and reassemble $D_{x}^{u}$ and $D_{y}^{v}$ as presented in Section 3.4.
Compute the tentative velocities $u^{*}, v^{*}$ :
$\begin{matrix} u^{*} = M^{- 1} (M + Δ t (\frac{1}{R e} S + D_{x}^{u} + D_{y}^{v})) u^{n}, \\ v^{*} = M^{- 1} (M + Δ t (\frac{1}{R e} S + D_{x}^{u} + D_{y}^{v})) v^{n} . \end{matrix}$
Solve the Poisson equation for the pressure:
$p = - \frac{1}{Δ t} S^{- 1} (D_{x} u^{*} + D_{y} v^{*}) .$
Perform the corrector step:
$\begin{matrix} u^{n + 1} = u^{*} - Δ t M^{- 1} D_{x} p, \\ v^{n + 1} = v^{*} - Δ t M^{- 1} D_{y} p . \end{matrix}$
Impose boundary conditions with masks:
$\begin{matrix} u^{n + 1} = T_{u}^{I} u^{n + 1} + T_{u}^{O} u^{BC}, \\ v^{n + 1} = T_{v}^{I} v^{n + 1} + T_{v}^{O} v^{BC} . \end{matrix}$

Here, by multiplying by the inverted mass matrix $M^{- 1}$ , we imply solving a linear system with the matrix.

4. Numerical Results

We validate our methodology by solving multiple model problems. In Section 4.1, we consider the Poisson equation in a triangular domain and compare the results with [26]. In Section 4.2, we solve the Poisson equation on a single domain of quadrilateral shape and the two-part domain of the same shape with a curved boundary to ensure the correctness of the subdomain stitching. In Section 4.3, the same equation is also solved, button curved domains like a circle or an annulus. And finally, in Section 4.4 and Section 4.5, the set of incompressible Navier–Stokes equations is solved in a lid-driven cavity, and a more complex L-shaped domain via Chorin’s projection method and the runtime of the method is compared with the full format.

4.1. Poisson Equation in a Triangle

Here, we solve a Poisson equation in a triangle:

(17) $\{\begin{matrix} - Δ u = f, \\ {u |}_{\partial Ω} = 0, \end{matrix}$

where

f \equiv 1

, and

Ω

is shown in Figure 4. The domain is taken from [26], but instead of three subdomains, we specify a triangle as a single quadrilateral with a single degenerated angle.

The algorithm for finding the solution goes as follows:

Mass and stiffness operators $M, S$ and masks corresponding to zero Dirichlet boundary conditions $T^{I}, T^{O}$ are assembled in the QTT format. The right-hand side $f$ is constructed using cross-approximation of the function.
Boundary conditions are applied to operators:
$\begin{matrix} \bar{M} = T^{I} @ M, \\ \bar{S} = T^{I} @ S + T^{O} . \end{matrix}$
The right-hand side is multiplied by the mass matrix, and the stiffness matrix is inverted to find the solution:
$\begin{matrix} u = solve (\bar{S}, MV (\bar{M}, f)) . \end{matrix}$

Then, the solution TT-vector is decompressed to the full format, and the error between the numerical and analytical solution is computed.

In Figure 5, we analyze the convergence via the Runge rule since we are not provided with the analytical solution. On each step, we refine the mesh and solve the equation. The error is then computed between the new solution downscaled to the previous mesh and the solution for the previous mesh. The second order is present despite the singularity in the degenerated corner.

The runtime comparison is presented in Figure 6. For the basic non-tensor method, we assembled the system in compressed sparse column format and solved via the conjugate gradient method taken from $scipy . spase 1.12.0$ Python package. The advantage of tensor trains on moderate computational meshes is observed.

We compare the ranks and matrix sizes with [26] on Figure 7. The ranks of stiffness matrix in our case are significantly lower. There are two factors causing this. The first one is that a single subdomain has fewer degrees of freedom and does not require stitching, which also increases the ranks. The second one is that putting the Jacobian values in the Z-order disrupts the smooth structure of the function vector, which is present for the regular order QTT.

4.2. Poisson Equation in a Quadrilateral Domain

Here we solve the Poisson Equation (17) but in a quad-shaped domain, specified by linear coordinate transformation $(ξ, η) \to (x, y)$ from the reference $[- 1, 1] \times [- 1, 1]$ square. We start from a manufactured solution $\hat{u} (x, y) = sin (π ξ (x, y)) sin (π η (x, y))$ , and construct the forcing term to match this solution, $f (x, y) \equiv - \frac{\partial^{2} \hat{u}}{\partial x^{2}} - \frac{\partial^{2} \hat{u}}{\partial y^{2}}$ .

We compare two ways of transforming the domain:

Use the single original quadrilateral domain, transformed into the reference domain.
The original domain is split into two parts with the sinusoidal seam as shown in Figure 8, and each part is transformed into the reference domain.

The algorithm for finding the numerical solution of (17) is the same as for the quadrilateral domain, and the right-hand side of the equation is $f (x, y) \equiv 1$ . The errors and TT-ranks are presented in Figure 9 and Figure 10. Despite having much higher TT-ranks due to the curved seam, the two-part domain representation requires much less memory than the full format.

4.3. Poisson Equation in a Circle/Annulus

Here, the Equation (17) is solved in a circle

$Ω^{\circ} = {(x, y) : \sqrt{x^{2} + y^{2}} \leq 3 π}$

and an annulus

$Ω^{⊙} = {(x, y) : 2 π \leq \sqrt{x^{2} + y^{2}} \leq 3 π}$

which are depicted in Figure 11. The right-hand side and analytic solution are

$f (x, y) \equiv 1, \hat{u} (x, y) = \frac{{(3 π)}^{2} - x^{2} - y^{2}}{4},$

respectively for the circular domain and

$f (x, y) = \frac{cos \sqrt{x^{2} + y^{2}}}{\sqrt{x^{2} + y^{2}}} - sin \sqrt{x^{2} + y^{2}}, \hat{u} (x, y) = sin \sqrt{x^{2} + y^{2}},$

for the annular one.

The circular domain consists of a single subdomain which is a unit square projected on a circle. This is an example of a <<bad>> case for the Tensor Train decomposition since the ranks of the stiffness matrix are growing exponentially, which can be seen in Figure 12. One reason for that may be four singularities of the Jacobian, one for each corner. However, in terms of memory consumption, the compression, compared to the sparse format, is present on denser meshes. Also, the error convergence is not affected, as shown in Figure 13, since the matrix TT ranks are only related to the time and memory consumption of the algorithms.

On the other hand, the annular-shaped domain consists of four subdomains with low ranks, so, despite containing 4 times as many nodes, the mesh is compressed efficiently.

4.4. Navier–Stokes Flow in a Lid-Driven Cavity

Here, we solve the Navier–Stokes equations via Chorin’s method described in Section 3.5 using the regular finite element method and TetraFEM for the discretization and compare the results.

The computational domain is a square $Ω = {0 \leq x \leq 1, 0 \leq y \leq 1}$ , the boundary conditions for the velocities are zeros on every wall, except some constant velocity on the top lid $u (x, 1) = 1$ for $0 \leq x \leq 1$ . For the pressure boundary, conditions are not specified.

The streamlines of the steady-state flow obtained via the tensorized algorithm are visually identical to the benchmark results obtained in [38] as shown in Figure 14. To quantitatively compare the results, we show 1D-slices of velocity vectors in Figure 15, together with the benchmark solutions of Ghia, Ghia, and Shin [38].

Figure 16 and Figure 17 show the advantage of TetraFEM in terms of the required time.

Since the ranks of the solution are not very big, the compression comparing with the full format computations is present. Increasing the number of mesh points provides better scaling, which can be seen in Figure 16.

For the full format, we used sparse format for the matrices and conjugate gradient method for the solution of the linear systems. The relative tolerance of the method is the same as the Tensor Train rounding tolerance of the tensorized method.

The numerical solution of the nonstationary incompressible Navier–Stokes equations in the arbitrary 2D domain was achieved via the Tensor Train Finite element method with the speed-up, compared to the classical finite element methods. The combination of low-rank Tensor Train operators for the smoothly transformed domains and the reasonable ranks of the solution led to significant memory reduction and speed-up even on moderate mesh discretizations for the nonlinear time-stepping problem.

4.5. Navier–Stokes Flow in a Backward-Facing Step Domain

Here we solve the Navier–Stokes equations in a more complex domain using the same algorithm as in previous subsection.

The L-shaped domain (backward-facing step) consists of three concatenated rectangular domains. The parabolic inflow boundary condition is imposed on the left side, the outflow <<do nothing>> condition is on the right side; everything else is a no-slip wall.

The visual depiction of the stationary flow is shown on Figure 18. The correct physical behavior of the fluid can be observed. The multi-domain system assembly/reassembly procedures presented in Section 3.3 and Section 3.4 are tested on a nonlinear fluid simulation problem.

5. Discussion

In this paper, we presented a methodology for solving partial differential equations in complex domains using Tensor Trains. The TT compression provides a milder (sublinear) asymptotic scaling compared to the straightforward vector solution. However, the constant in front of the complexity is higher due to its dependence on TT ranks. Potentially large TT ranks of nontrivial solutions are the main limitation of the method. Another one is finding the appropriate transformation to match the computational domain. In our examples, the TT approach becomes faster than the standard conjugate gradient solver, starting from grid sizes of 200–300. It depends on the application in which grid size (governed by the discretization error) is practical. For example, to resolve small eddies in the solution to the Navier–Stokes equation, the grid sizes in the order of hundreds are not excessive. However, more complex (e.g., multiscale) solutions may also exhibit higher TT ranks. The proposed methodology should be evaluated case by case. Nonetheless, for multiscale but regular structures which require fine discretization but yield low TT ranks, the developed approach should be beneficial. In our examples, we have demonstrated that our approach is faster than existing TT methods for PDEs on nontrivial domains.

A breakthrough against high TT ranks may come from quantum computing. Specifically, the TT decomposition can be converted exactly into a quantum circuit of $O (log r)$ qubits [25,39,40,41,42], see Figure 19. The total complexity should thus become poly-logarithmic in the original problem size, which will provide an opportunity to solve many practical problems in the future. However, this assumes perfect quantum hardware, allowing one to operate thousands of qubits and gates with controllable noise. Such a state of the hardware is not there yet but is expected to emerge in the next decade.

Author Contributions

Conceptualization, methodology, writing, E.K., S.D., M.P. and A.M.; software, E.K.; visualization, E.K. All authors have read and agreed to the published version of the manuscript.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

Egor Kornev, Michael Perelshtein and Artem Melnikov were employed by the Terra Quantum AG. Sergey Dolgov is a consultant for Terra Quantum AG. The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PDE	Partial differential equation
TT	Tensor Train
QTT	Quantized Tensor Train
FEM	Finite element method
SVD	Singular value decomposition
AMEn	Alternating minimal energy method

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Figures and Table

View Image - Figure 1. Graphical representation of an operator in Tensor Train format. A huge tensor is decomposed into a product of smaller tensors.

Figure 1. Graphical representation of an operator in Tensor Train format. A huge tensor is decomposed into a product of smaller tensors.

View Image - Figure 2. The projection of a reference square onto the subdomain from (6). The sides of the subdomain are set with four smooth explicitly defined functions.

Figure 2. The projection of a reference square onto the subdomain from (6). The sides of the subdomain are set with four smooth explicitly defined functions.

View Image - Figure 3. The finite element projected on a reference square. Here, black dots with indices represent the positions of mesh points, and red symbols denote the positions of quadrature points.

Figure 3. The finite element projected on a reference square. Here, black dots with indices represent the positions of mesh points, and red symbols denote the positions of quadrature points.

Figure 4. The scatter plot of the solution on triangular domain. The black dots represent the corners of the quadrilateral domain.

View Image - Figure 5. Relative Frobenius norm errors resulting from the Runge rule for different mesh discretizations. The X-axis denotes the number of grid points along a single dimension. The second order of convergence with respect to [Forumla omitted. See PDF.] is observed (where h is the linear size of a grid cell).

Figure 5. Relative Frobenius norm errors resulting from the Runge rule for different mesh discretizations. The X-axis denotes the number of grid points along a single dimension. The second order of convergence with respect to [Forumla omitted. See PDF.] is observed (where h is the linear size of a grid cell).

View Image - Figure 6. Times required to solve the Poisson equation in a triangle for different mesh discretizations in sparse format, Tensor Train format, and Z-order QTT format from [26], respectively. The system with a sparse matrix was solved via the congugate gradient method with the same relative tolerance as the TT-solver.

Figure 6. Times required to solve the Poisson equation in a triangle for different mesh discretizations in sparse format, Tensor Train format, and Z-order QTT format from [26], respectively. The system with a sparse matrix was solved via the congugate gradient method with the same relative tolerance as the TT-solver.

View Image - Figure 7. The left plot depicts the numbers of elements required to represent the stiffness matrices in sparse format, Tensor Train format, and Z-order QTT format from [26] for differently sized meshes, which can also be interpreted as memory requirement for the operator storage. It can be noted that for our method the sizes are significantly lower. The right plot shows the effective TT ranks of stiffness matrices and solution vectors.

Figure 7. The left plot depicts the numbers of elements required to represent the stiffness matrices in sparse format, Tensor Train format, and Z-order QTT format from [26] for differently sized meshes, which can also be interpreted as memory requirement for the operator storage. It can be noted that for our method the sizes are significantly lower. The right plot shows the effective TT ranks of stiffness matrices and solution vectors.

View Image - Figure 8. Quadrialteral domains in which the Poisson equation is solved. The right one in the figure consists of two connected curvilinear domains.

Figure 8. Quadrialteral domains in which the Poisson equation is solved. The right one in the figure consists of two connected curvilinear domains.

View Image - Figure 9. Relative Frobenius norm error between the analytical and numerical solution vectors for different mesh discretizations of the quadrilateral domains. The X-axis denotes the number of grid points along a single dimension. The second order of convergence is observed.

Figure 9. Relative Frobenius norm error between the analytical and numerical solution vectors for different mesh discretizations of the quadrilateral domains. The X-axis denotes the number of grid points along a single dimension. The second order of convergence is observed.

View Image - Figure 10. Comparison between amount of elements required to represent the stiffness matrices in sparse, Tensor Train, and full format for quadrilaterals (left). The extra number of elements for the joint quad domain is caused by the curved seam, which increases the TT-rank. Nevertheless, the growth of TT effective ranks of the stiffness matrix stagnate during the mesh refinement (right).

Figure 10. Comparison between amount of elements required to represent the stiffness matrices in sparse, Tensor Train, and full format for quadrilaterals (left). The extra number of elements for the joint quad domain is caused by the curved seam, which increases the TT-rank. Nevertheless, the growth of TT effective ranks of the stiffness matrix stagnate during the mesh refinement (right).

Figure 11. A single-subdomain circle and an annulus consisting of four subdomains.

View Image - Figure 12. The amounts of elements required to represent the stiffness matrix for the circle and the annulus in sparse and Tensor Train format are shown on the left plot. The effective ranks are depicted on the right plot. The circular domain is an example of a shape that is not suitable for the Tensor Train format.

Figure 12. The amounts of elements required to represent the stiffness matrix for the circle and the annulus in sparse and Tensor Train format are shown on the left plot. The effective ranks are depicted on the right plot. The circular domain is an example of a shape that is not suitable for the Tensor Train format.

View Image - Figure 13. Relative Frobenius norm error between the analytical and numerical solution vectors for different mesh discretizations for the circle/annulus case. The X-axis denotes the number of grid points along a single dimension of a single subdomain. The second order convergence is observed.

Figure 13. Relative Frobenius norm error between the analytical and numerical solution vectors for different mesh discretizations for the circle/annulus case. The X-axis denotes the number of grid points along a single dimension of a single subdomain. The second order convergence is observed.

View Image - Figure 14. Streamplot of the lid-driven cavity problem steady-state solution. The Reynolds number is [Forumla omitted. See PDF.]. The relative TT-rounding tolerance is [Forumla omitted. See PDF.]. Note the eddies in the bottom right and left corners.

Figure 14. Streamplot of the lid-driven cavity problem steady-state solution. The Reynolds number is [Forumla omitted. See PDF.]. The relative TT-rounding tolerance is [Forumla omitted. See PDF.]. Note the eddies in the bottom right and left corners.

View Image - Figure 15. Profiles of X-velocity on a vertical centerline [Forumla omitted. See PDF.] of a square cavity (left) and Y-velocity on a horizontal line y = 0.5 (right). Discretization is [Forumla omitted. See PDF.] points. Red symbols refer to the data from [38].

Figure 15. Profiles of X-velocity on a vertical centerline [Forumla omitted. See PDF.] of a square cavity (left) and Y-velocity on a horizontal line y = 0.5 (right). Discretization is [Forumla omitted. See PDF.] points. Red symbols refer to the data from [38].

View Image - Figure 16. The time-step averaged timings of the different parts of the algorithm. The left plot shows the time taken by the solution of the pressure Poisson equation, and the the time for the operator reassemby is depicted on the right plot. The time required for the Poisson equation by the TT-algorithm scales much better than the full format version. It can be observed that the operator reassembly is much slower in TT format and is the major part of the algorithm. It can also be noted that the reassembly by AMEn outperforms the TT-MatMat approach with the growth of the mesh.

Figure 16. The time-step averaged timings of the different parts of the algorithm. The left plot shows the time taken by the solution of the pressure Poisson equation, and the the time for the operator reassemby is depicted on the right plot. The time required for the Poisson equation by the TT-algorithm scales much better than the full format version. It can be observed that the operator reassembly is much slower in TT format and is the major part of the algorithm. It can also be noted that the reassembly by AMEn outperforms the TT-MatMat approach with the growth of the mesh.

View Image - Figure 17. The mean time for a time step for different meshes. The advantage of Tensor format on denser meshes is observed due to better scaling.

Figure 17. The mean time for a time step for different meshes. The advantage of Tensor format on denser meshes is observed due to better scaling.

Figure 18. Streamplot of the incompressible Navier–Stokes flow in L-shaped domain for [Forumla omitted. See PDF.].

View Image - Figure 19. Any Tensor Train of rank r can be exactly encoded into a quantum circuit using sequential multi-qubit operations, each of which acts on [Forumla omitted. See PDF.] qubits.

Figure 19. Any Tensor Train of rank r can be exactly encoded into a quantum circuit using sequential multi-qubit operations, each of which acts on [Forumla omitted. See PDF.] qubits.

Table 1

Operations with its resulting ranks and complexities. $x$ , $y$ , and $z$ are tensor train vectors of the same dimensions. Their ranks are $r (x)$ , $r (y)$ , and $r (z)$ , respectively. $A$ is a tensor train matrix of rank $r (A)$ . f is a function of s variables to evaluate on a list of tensors of the same shape.

№	Operation	Result Rank	Complexity
1	$z = x \cdot const$	$r (z) = r (x)$	$O (d r (x))$
2	$z = x + y$	$r (z) \leq r (x) + r (y)$	$O (n d {(r (x) + r (y))}^{2})$
3	$z = x \circ y$	$r (z) \leq r (x) r (y)$	$O (n d r^{3} (x) r^{3} (y))$
4	$z = MV (A, x)$	$r (z) \leq r (A) r (x)$	$O (n d r^{3} (A) r^{3} (x))$
5	$solve (A x) = y$	$r (x)$	$O (n d r^{3} (x) r (A))$
6	$z = round (x, ε)$	$r (z) \leq r (x)$	$O (n d r^{3} (x))$
7	$z = kron (x, y)$	$\max (r (x), r (y))$	$O (n d {(r (x) + r (y))}^{2})$
8	$z = cross (f, [x_{1}, \dots, x_{s}])$	$r (z)$	$O (n d r^{2} (z))$

References

1. Bazighifan, O.; Ali, A.H.; Mofarreh, F.; Raffoul, Y.N. Extended Approach to the Asymptotic Behavior and Symmetric Solutions of Advanced Differential Equations. Symmetry; 2022; 14, 686. [DOI: https://dx.doi.org/10.3390/sym14040686]

2. Brenner, S.; Scott, R. The Mathematical Theory of Finite Element Methods; Texts in Applied Mathematics Springer: New York, NY, USA, 2007.

3. Ciarlet, P.G. The Finite Element Method for Elliptic Problems; SIAM: Philadelphia, PA, USA, 2002.

4. Moës, N.; Dolbow, J.; Belytschko, T. A finite element method for crack growth without remeshing. Int. J. Numer. Methods Eng.; 1999; 46, pp. 131-150. [DOI: https://dx.doi.org/10.1002/(SICI)1097-0207(19990910)46:1<131::AID-NME726>3.0.CO;2-J]

5. Beirão da Veiga, L.; Brezzi, F.; Cangiani, A.; Manzini, G.; Marini, L.D.; Russo, A. Basic principles of Virtual Element Methods. Math. Model. Methods Appl. Sci.; 2013; 23, pp. 199-214. [DOI: https://dx.doi.org/10.1142/S0218202512500492]

6. Dell’Accio, F.; Di Tommaso, F.; Guessab, A.; Nudo, F. Enrichment strategies for the simplicial linear finite elements. Appl. Math. Comput.; 2023; 451, 128023. [DOI: https://dx.doi.org/10.1016/j.amc.2023.128023]

7. Oseledets, I. Tensor-Train Decomposition. SIAM J. Sci. Comput.; 2011; 33, pp. 2295-2317. [DOI: https://dx.doi.org/10.1137/090752286]

8. Oseledets, I. Constructive Representation of Functions in Low-Rank Tensor Formats. Constr. Approx.; 2010; 37, pp. 1-8. [DOI: https://dx.doi.org/10.1007/s00365-012-9175-x]

9. Dolgov, S.V.; Khoromskij, B.N.; Oseledets, I.V. Fast solution of parabolic problems in the tensor train/quantized tensor train format with initial application to the Fokker–Planck equation. SIAM J. Sci. Comput.; 2012; 34, pp. A3016-A3038. [DOI: https://dx.doi.org/10.1137/120864210]

10. Schneider, R.; Uschmajew, A. Approximation rates for the hierarchical tensor format in periodic Sobolev spaces. J. Complex.; 2013; [DOI: https://dx.doi.org/10.1016/j.jco.2013.10.001]

11. Griebel, M.; Harbrecht, H. Analysis of tensor approximation schemes for continuous functions. Found. Comput. Math.; 2023; 23, pp. 219-240. [DOI: https://dx.doi.org/10.1007/s10208-021-09544-6]

12. Bigoni, D.; Engsig-Karup, A.P.; Marzouk, Y.M. Spectral tensor-train decomposition. SIAM J. Sci. Comput.; 2016; 38, pp. A2405-A2439. [DOI: https://dx.doi.org/10.1137/15M1036919]

13. Gorodetsky, A.; Karaman, S.; Marzouk, Y. A continuous analogue of the tensor-train decomposition. Comput. Methods Appl. Mech. Engrg.; 2019; 347, pp. 59-84. [DOI: https://dx.doi.org/10.1016/j.cma.2018.12.015]

14. Dolgov, S.; Savostyanov, D. Alternating minimal energy methods for linear systems in higher dimensions. SIAM J. Sci. Comput.; 2014; 36, pp. 1-24. [DOI: https://dx.doi.org/10.1137/140953289]

15. Sozykin, K.; Chertkov, A.; Schutski, R.; Phan, A.H.; Cichocki, A.; Oseledets, I. TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning. arXiv; 2022; arXiv: 2205.00293

16. Morozov, D.; Melnikov, A.; Shete, V.; Perelshtein, M. Protein-protein docking using a tensor train black-box optimization method. arXiv; 2023; arXiv: 2302.03410

17. Novikov, A.; Podoprikhin, D.; Osokin, A.; Vetrov, D.P. Tensorizing neural networks. Adv. Neural Inf. Process. Syst.; 2015; 28, pp. 1-9.

18. Sagingalieva, A.; Kurkin, A.; Melnikov, A.; Kuhmistrov, D.; Perelshtein, M.; Melnikov, A.; Skolik, A.; Von Dollen, D. Hyperparameter optimization of hybrid quantum neural networks for car classification. arXiv; 2022; arXiv: 2205.04878

19. Naumov, A.; Melnikov, A.; Abronin, V.; Oxanichenko, F.; Izmailov, K.; Pflitsch, M.; Melnikov, A.; Perelshtein, M. Tetra-AML: Automatic Machine Learning via Tensor Networks. arXiv; 2023; arXiv: 2303.16214

20. Blazek, J. Computational Fluid Dynamics: Principles and Applications; Butterworth-Heinemann: Oxford, UK, 2015.

21. Dolgov, S.; Pearson, J.W. Preconditioners and Tensor Product Solvers for Optimal Control Problems from Chemotaxis. SIAM J. Sci. Comput.; 2019; 41, pp. B1228-B1253. [DOI: https://dx.doi.org/10.1137/18M1198041]

22. Gourianov, N.; Lubasch, M.; Dolgov, S.; van den Berg, Q.Y.; Babaee, H.; Givi, P.; Kiffner, M.; Jaksch, D. A quantum-inspired approach to exploit turbulence structures. Nat. Comput. Sci.; 2022; 2, pp. 30-37. [DOI: https://dx.doi.org/10.1038/s43588-021-00181-1]

23. Ion, I.G.; Loukrezis, D.; De Gersem, H. Tensor train based isogeometric analysis for PDE approximation on parameter dependent geometries. Comput. Methods Appl. Mech. Eng.; 2022; 401, 115593. [DOI: https://dx.doi.org/10.1016/j.cma.2022.115593]

24. Ion, I.G. Low-Rank Tensor Decompositions for Surrogate Modeling in forward and inverse Problems. Ph.D. Thesis; Technische Universität Darmstadt: Darmstadt, The Netherland, 2024; [DOI: https://dx.doi.org/10.26083/tuprints-00026678]

25. Schön, C.; Solano, E.; Verstraete, F.; Cirac, J.I.; Wolf, M.M. Sequential Generation of Entangled Multiqubit States. Phys. Rev. Lett.; 2005; 95, 110503. [DOI: https://dx.doi.org/10.1103/PhysRevLett.95.110503] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/16196992]

26. Markeeva, L.; Tsybulin, I.; Oseledets, I. QTT-isogeometric solver in two dimensions. J. Comput. Phys.; 2021; 424, 109835. [DOI: https://dx.doi.org/10.1016/j.jcp.2020.109835]

27. Kazeev, V.A.; Schwab, C. Approximation of Singularities by Quantized-Tensor FEM. PAMM; 2015; 15, pp. 743-746. [DOI: https://dx.doi.org/10.1002/pamm.201510353]

28. Khoromskij, B. O(dlogN)-Quantics Approximation of N-d Tensors in High-Dimensional Numerical Modeling. Constr. Approx.; 2009; 34, pp. 257-280. [DOI: https://dx.doi.org/10.1007/s00365-011-9131-1]

29. Sekmen, A.; Aldroubi, A.; Koku, A.B.; Hamm, K. Matrix resconstruction: Skeleton decomposition versus singular value decomposition. Proceedings of the 2017 International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS); Seattle, WA, USA, 9–12 July 2017; pp. 1-8. [DOI: https://dx.doi.org/10.23919/SPECTS.2017.8046777]

30. Oseledets, I.; Tyrtyshnikov, E. TT-cross approximation for multidimensional arrays. Linear Algebra Its Appl.; 2010; 432, pp. 70-88. [DOI: https://dx.doi.org/10.1016/j.laa.2009.07.024]

31. White, S.R. Density matrix formulation for quantum renormalization groups. Phys. Rev. Lett.; 1992; 69, pp. 2863-2866. [DOI: https://dx.doi.org/10.1103/PhysRevLett.69.2863]

32. Saad, Y. Iterative Methods for Sparse Linear Systems; 2nd ed. Other Titles in Applied Mathematics; SIAM: Philadelphia, PA, USA, 2003; [DOI: https://dx.doi.org/10.1137/1.9780898718003]

33. Oseledets, I. ttpy 1.2.0. 2017; Available online: https://github.com/oseledets/ttpy (accessed on 15 October 2024).

34. Gordon, W.J.; Thiel, L.C. Transfinite mappings and their application to grid generation. Appl. Math. Comput.; 1982; 10–11, pp. 171-233. [DOI: https://dx.doi.org/10.1016/0096-3003(82)90191-6]

35. Elman, H.; Silvester, D.; Wathen, A. Finite Elements and Fast Iterative Solvers: With Applications in Incompressible Fluid Dynamics; Oxford University Press: Oxford, UK, 2006.

36. Temam, R. Navier–Stokes Equations: Theory and Numerical Analysis; American Mathematical Society: Providence, RI, USA, 2024; Volume 343.

37. Chorin, A.J. The numerical solution of the Navier-Stokes equations for an incompressible fluid. Bull. Am. Math. Soc.; 1967; 73, pp. 928-931. [DOI: https://dx.doi.org/10.1090/S0002-9904-1967-11853-6]

38. Ghia, U.; Ghia, K.; Shin, C. High-Re solutions for incompressible flow using the Navier-Stokes equations and a multigrid method. J. Comput. Phys.; 1982; 48, pp. 387-411. [DOI: https://dx.doi.org/10.1016/0021-9991(82)90058-4]

39. Melnikov, A.A.; Termanova, A.A.; Dolgov, S.V.; Neukart, F.; Perelshtein, M.R. Quantum state preparation using tensor networks. Quantum Sci. Technol.; 2023; 8, 035027. [DOI: https://dx.doi.org/10.1088/2058-9565/acd9e7]

40. Ran, S.J. Encoding of matrix product states into quantum circuits of one- and two-qubit gates. Phys. Rev. A; 2020; 101, 032310. [DOI: https://dx.doi.org/10.1103/PhysRevA.101.032310]

41. Zhou, P.F.; Hong, R.; Ran, S.J. Automatically differentiable quantum circuit for many-qubit state preparation. Phys. Rev. A; 2021; 104, 042601. [DOI: https://dx.doi.org/10.1103/PhysRevA.104.042601]

42. Rudolph, M.S.; Chen, J.; Miller, J.; Acharya, A.; Perdomo-Ortiz, A. Decomposition of Matrix Product States into Shallow Quantum Circuits. arXiv; 2022; arXiv: 2209.00595[DOI: https://dx.doi.org/10.1088/2058-9565/ad04e6]

Word count: 6490

Show less

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

In this paper, we present a methodology for the numerical solving of partial differential equations in 2D geometries with piecewise smooth boundaries via finite element method (FEM) using a Quantized Tensor Train (QTT) format. During the calculations, all the operators and data are assembled and represented in a compressed tensor format. We introduce an efficient assembly procedure of FEM matrices in the QTT format for curvilinear domains. The features of our approach include efficiency in terms of memory consumption and potential expansion to quantum computers. We demonstrate the correctness and advantages of the method by solving a number of problems, including nonlinear incompressible Navier–Stokes flow, in differently shaped domains.

Details

Title

TetraFEM: Numerical Solution of Partial Differential Equations Using Tensor Train Finite Element Method

Author

Kornev, Egor¹

; Dolgov, Sergey²; Perelshtein, Michael¹

; Melnikov, Artem¹

¹ Terra Quantum AG, Kornhausstrasse 25, 9000 St. Gallen, Switzerland
² Terra Quantum AG, Kornhausstrasse 25, 9000 St. Gallen, Switzerland; Department of Mathematical Sciences, University of Bath, Claverton Down, Bath BA2 7AY, UK

First page

3277

Publication year

2024

Publication date

2024

Publisher

MDPI AG

e-ISSN

22277390

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.3390/math12203277

ProQuest document ID

3120735473

TetraFEM: Numerical Solution of Partial Differential Equations Using Tensor Train Finite Element Method

Jump to:

Full text

2. Background

2.1. Quantized Tensor Train Format

2.2. Finite Element Method

3. Quantized Tensor Train Finite Element Method

3.1. Domain Splitting and Transformation

3.2. Operators Assembly in Tensor Train Format

3.3. Subdomain Concatenation

3.4. Nonlinear Operators Reassembly with Coefficients

3.5. Example Application: Incompressible Navier–Stokes Equations

4. Numerical Results

4.1. Poisson Equation in a Triangle

4.2. Poisson Equation in a Quadrilateral Domain

4.3. Poisson Equation in a Circle/Annulus

4.4. Navier–Stokes Flow in a Lid-Driven Cavity

4.5. Navier–Stokes Flow in a Backward-Facing Step Domain

Abstract

Details

Suggested sources