Table 1 - uploaded by Bram Metsch
Content may be subject to copyright.
Setup time in seconds for Example 1.

Setup time in seconds for Example 1.

Source publication
Technical Report
Full-text available
Multigrid methods (MG) are known to be optimal solvers for large sparse linear systems arising from the finite element, finite difference or finite volume discretization of a partial differential equation (PDE). Algebraic multigrid methods (AMG) extend this approach to wide a class of problems, e.g. anisotropic operators or unstructured grids. Howe...

Contexts in source publication

Context 1
... we expect that the setup times for the CGC and CGC-E schemes are very similar. In Table 1 we give the setup times for this prob- lem. We see that both CGC variants are faster than the Falgout algorithm. ...
Context 2
... the CGC and CGC-E algorithm, this means that more edges are constructed in the coarse grid selec- tion graph, while the Falgout algorithm needs to employ the boundary treatment on twice as many points. For the operator complexities, c.f. Table 10, we see an almost equal increase for the Falgout and CGC-E algorithm. In contrast, the CGC algorithm shows a larger deterioration than in the grid-aligned case, which can be explained from the large amount of strongly coupled coarse grid points on different processor subdomains. ...
Context 3
... contrast, the CGC algorithm shows a larger deterioration than in the grid-aligned case, which can be explained from the large amount of strongly coupled coarse grid points on different processor subdomains. From the solution times given in Table 11 and the convergence factors given in Table 12 we see that the classical CGC algorithm again does not provide an effi- cient multigrid cycle without agglomeration. The other algorithms only exhibit some deterioration as we go from the sequential to the parallel setting, but the parallel con- vergence factors are independent of the number of pro- cessors. ...
Context 4
... contrast, the CGC algorithm shows a larger deterioration than in the grid-aligned case, which can be explained from the large amount of strongly coupled coarse grid points on different processor subdomains. From the solution times given in Table 11 and the convergence factors given in Table 12 we see that the classical CGC algorithm again does not provide an effi- cient multigrid cycle without agglomeration. The other algorithms only exhibit some deterioration as we go from the sequential to the parallel setting, but the parallel con- vergence factors are independent of the number of pro- cessors. ...
Context 5
... Table 13 we show the setup times for this example. We see that while both CGC variants exhibit compara- ble times up to 64 processors, this is no longer true for 512 processors any more. ...
Context 6
... contrast, the CGC-E algorithm, which combines an efficient coarse grid matching and fast coarsening, achieves the fastest setup. From Table 14 we see that the operator complexity increases for all coarsening methods as the number of processors grows. Hence, we cannot expect solution-time scalability. ...
Context 7
... the solution time mainly depends on the com- putational work done per multigrid cycle which in turn is related to the operator complexity. This can be seen from the solution times given in Table 15. As we increase the number of processors from 64 to 512, we see a large jump in the solution times due to hardware and inter- connect bandwidth limitations. ...

Similar publications

Preprint
Full-text available
Algebraic multigrid (AMG) methods are among the most efficient solvers for linear systems of equations and they are widely used for the solution of problems stemming from the discretization of Partial Differential Equations (PDEs). The most severe limitation of AMG methods is the dependence on parameters that require to be fine-tuned. In particular...
Thesis
Full-text available
This thesis deals with the development of a new Algebraic Multigrid method (AMG) for the solution of linear systems arising from Generalized Finite Difference Methods (GFDM). In particular, we consider the Finite Pointset Method, which is based on GFDM. Being a meshfree method, FPM does not rely on a mesh and can therefore deal with moving geometri...
Article
Full-text available
This paper presents the development of an Adaptive Algebraic Multiscale Solver for Compressible flow (C-AMS) in heterogeneous porous media. Similar to the recently developed AMS for incompressible (linear) flows [Wang et al., JCP, 2014], C-AMS operates by defining primal and dual-coarse blocks on top of the fine-scale grid. These coarse grids facil...
Thesis
Full-text available
We consider a second-order, elliptic partial differential equation (PDE) discretized by the Hybrid High-Order (HHO) method. HHO is a polyhedral method that handles arbitrary polynomial orders, and for which globally coupled unknowns are located at faces. To efficiently solve the linear system arising after static condensation, this work proposes no...
Preprint
Full-text available
The numerical simulation of structural mechanics applications via finite elements usually requires the solution of large-size and ill-conditioned linear systems, especially when accurate results are sought for derived variables interpolated with lower order functions, like stress or deformation fields. Such task represents the most time-consuming k...

Citations

... In addition, it should be possible to apply the operator in parallel in order to ensure scalability. The tested coarsening techniques include the classical Ruge Stueben technique (RGST) (Ruge and Stüben, 1987) using a third pass for correcting inconsistencies on the interprocessor boundaries, the Cleary Luby Jones Plassman scheme (CLJP) (Cleary et al., 1998), the Falgout scheme (FALG) (Henson and Meier Yang, 2002), the hybrid modified independent set technique (HMIS) and the extended coarse grid classification algorithm (ECGC) of Griebel et al. (2006b). The RGST method shows a good sequential performance (Griebel et al., 2006a, Section 4) and is often used as a basis for enhanced coarsening techniques. ...
... It has been shown that the construction of coarsenings for highly partitioned test cases is difficult to achieve. The ECGC algorithm addresses this problem showing robust coarsening rates that are independent of the number of partitions (Griebel et al., 2006b). Table 1 gives results for the combination of several operator coarsening methods. ...
Article
Large scale industrial induction heating models require economical and robust computational methods. There is an increasing demand for efficient models for use in optimisation procedures, where a large number of parameter combinations must be solved. The goal of this work is to demonstrate that the multiphysics model of induction heating can efficiently be described by a weak coupling of the eddy current model with heat transfer and material evolution to derive a numerical model that is both robust and scalable. The solution of the electromagnetic problem is accelerated by using an auxiliary multigrid method. It will be demonstrated that the numerical convergence is mostly unaffected by discontinuities of the magnetic permeability, which can, for instance, be found at material boundary interfaces. A crankshaft will be used as an example for the broad class of problems that can be described by this induction heating model.
... For example, one can apply multigrid techniques to the whole system (1)- (2) to solve problems in areas of computational fluid dynamics [22], [30], [40], [50], [52], [24], [25] constrained optimization [43], [44], [45], [46], mixed finite elements [1], [23] and elsewhere. For parallel multigrid see e.g [26], [27], [28]. ...
... In caseλ 1 andλ N are the two conjugate complex roots of (19), it follows that (26) must also hold for ρ(H(τ 1 , τ 2 , ω 2 , a)) to be minimized. So, (27) holds if either (19) has real or conjugate complex roots. However, if (27) holds, then (13) becomes ...
... So, (27) holds if either (19) has real or conjugate complex roots. However, if (27) holds, then (13) becomes ...
Article
Full-text available
In this paper we study the impact of two types of preconditioning on the numerical solution of large sparse augmented linear systems. The first preconditioning matrix is the lower triangular part whereas the second is the product of the lower triangular part with the upper triangular part of the augmented system’s coefficient matrix. For the first preconditioning matrix we form the Generalized Modified Extrapolated Successive Overrelaxation (GMESOR) method, whereas the second preconditioning matrix yields the Generalized Modified Preconditioned Simultaneous Displacement (GMPSD) method, which is an extrapolated form of the Symmetric Successive Overrelaxation method. We find sufficient conditions for each aforementioned iterative method to converge. In addition, we develop a geometric approach, for determining the optimum values of their parameters and corresponding spectral radii. It is shown that both iterative methods studied (GMESOR and GMPSD) attain the same rate of convergence. Numerical results confirm our theoretical expectations.
... Algebraic coarsening is typically used as a 'black box' method [15] and the coarsening requires computation of Galerkin products which are a kind of sparse matrix-matrix multiplications. Creating the pyramid of coarsened problems, which dominates the preparation time, has also undergone parallelization [22,23,26], but is difficult to implement efficiently for GPUs [5,44]. Many GPU implementations of AMG perform the preparation on the CPU [29] or make use of special auxiliary grids [44]. ...
Article
Full-text available
We present an algorithm for general sparse matrix-matrix multiplication (SpGEMM) on many-core architectures, such as GPUs. SpGEMM is implemented by iterative row merging, similar to merge sort, except that elements with duplicate column indices are aggregated on the fly. The main kernel merges small numbers of sparse rows at once using subwarps of threads to realize an early compression effect which reduces the overhead of global memory accesses. The performance is compared with a parallel CPU implementation as well as with three GPU-based implementations. Measurements performed for computing the matrix square for 21 sparse matrices show that the proposed method consistently outperforms the other methods. Analysis showed that the performance is achieved by utilizing the compression effect and the GPU caching architecture. An improved performance was also found for computing Galerkin products which are required by algebraic multigrid solvers. The performance was particularly good for seven-point stencil matrices arising in the context of diffuse optical imaging and the improved performance allows one to perform image reconstruction at higher resolution using the same computational resources.
... The drawback of the CGC algorithm is that it can not handle arbitrary small coarsenings, i.e. the algorithm fails, if the number of nodes is relatively small compared to the number of processors. [Griebel et al., 2006b] mentions that this leads to smallest coarsenings that scale with the number of cpus n proc , so that the coarsest node size N behaves like N = O(n proc ). ...
... The extended coarse grid classification (ECGC) algorithm remedies this problem. It is proposed in [Griebel et al., 2006b]. The ECGC method extends the original CGC method, by replacing local coarsenings that can be interpolated by outside grid points using empty grids in the global selection step. ...
... This allows arbitrary levels of coarsenings on the global level up to a single point. [Griebel et al., 2006b] provides several example results for the CGC and ECGC method in comparison to a Falgout scheme. It shows that the convergence factors and solution times of the CGC and the ECGC method are comparable to the Falgout method. ...
Article
Electromagnetic induction heating is an efficient process allowing to directly heat up a prescribed area beneath the surface of metallic workpieces to enable quenching. This work presents a mathematical model for the coupled electromagnetic/heat transfer process as well as numerical solution methods. The electromagnetic model is based on a magnetic vector potential formulation. The source currents are prescribed using a voltage potential formulation enabling the modelling of arbitrary inductor geometries. The heat transfer problem is modelled using the classical heat diffusion equation. The electromagnetic model is fully transient, in order to allow the introduction of non-linear effects. The space discretisation is based on an edge finite element approach using a global domain including air, workpiece and inductor. The resulting linear system of equations of the implicit formulation is sparse and semi-definite, including a large kernel. It is demonstrated that a preconditioner based on the auxiliary space algebraic multigrid method in connection with a Krylov solver substantially reduces the solution time of the electromagnetic problem in comparison to classical solution methods and can be effectively applied in parallel. Applications for the heat treatment of a gearwheel and for an automotive crankshaft are presented. The surface heat treatment of complex geometrical parts requires the introduction of a relative movement of workpiece and inductor to ensure a homogeneous surface treatment. A novel method is proposed, which is based on a discrete level set representation of the inductor motion that can be used to generate conforming finite element meshes in a Lagrangian setting.
Article
We prove that no curvature-dimension bound CD(K, N) holds in any Heisenberg group H(n). On the contrary, the measure contraction property MC P(0, 2n + 3) holds and is optimal for the dimension 2n + 3. For the nonexistence of a curvature-dimension bound, we prove that the generalized "geodesic" Brunn-Minkowski inequality is false in H(n). We also show in a new and direct way (and for all n is an element of N\{0}), that the general "multiplicative" Brunn-Minkowski inequality with dimension N > 2n + 1 is false.
Article
no. 283 Diese Arbeit ist mit Unterstützung des von der Deutschen Forschungs-gemeinschaft getragenen Sonderforschungsbereiches 611 an der Univer-sität Bonn entstanden und als Manuskript vervielfältigt worden. Abstract. By introducing a new class of test functions, we prove that weak solutions to the thin–film equation with Neumann or periodic boundary conditions remain positive for all times provided initial data are "flat", i.e. close to a constant steady state in a suitable sense. As a consequence they are unique and, for smooth initial data, classical.
Article
no. 285 Diese Arbeit ist mit Unterstützung des von der Deutschen Forschungs-gemeinschaft getragenen Sonderforschungsbereiches 611 an der Univer-sität Bonn entstanden und als Manuskript vervielfältigt worden. Abstract We study a system of stochastically interacting particles (vortices) and show that for a large number of vortices the weighted empirical measure of the system approximates the solution of the three-dimensional Navier-Stokes equation in vorticity form.
Article
Full-text available
We study the existence theory for parabolic variational inequalities in weighted $L^2$ spaces with respect to excessive measures associated with a transition semigroup. We characterize the value function of optimal stopping problems for finite and infinite dimensional diffusions as a generalized solution of such a variational inequality. The weighted $L^2$ setting allows us to cover some singular cases, such as optimal stopping for stochastic equations with degenerate diffusion coefficient. As an application of the theory, we consider the pricing of American-style contingent claims. Among others, we treat the cases of assets with stochastic volatility and with path-dependent payoffs.