-
PTHelper: An open source tool to support the Penetration Testing process
Authors:
Jacobo Casado de Gracia,
Alfonso Sánchez-Macián
Abstract:
Offensive security is one of the state of the art measures to protect enterprises and organizations. Penetration testing, broadly called pentesting, is a branch of offensive security designed to find, rate and exploit these vulnerabilities, in order to assess the security posture of an organization. This process is often time-consuming and the quantity of information that pentesters need to manage…
▽ More
Offensive security is one of the state of the art measures to protect enterprises and organizations. Penetration testing, broadly called pentesting, is a branch of offensive security designed to find, rate and exploit these vulnerabilities, in order to assess the security posture of an organization. This process is often time-consuming and the quantity of information that pentesters need to manage might also be difficult to handle. This project takes a practical approach to solve the automation of pentesting and proposes a usable tool, called PTHelper. This open-source tool has been designed in a modular way to be easily upgradable by the pentesting community, and uses state of the art tools and artificial intelligence to achieve its objective.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
A fitted scheme for a Caputo initial-boundary value problem
Authors:
J. L. Gracia,
E. O'Riordan,
M. Stynes
Abstract:
In this paper we consider an initial-boundary value problem with a Caputo time derivative of order $α\in(0,1)$. The solution typically exhibits a weak singularity near the initial time and this causes a reduction in the orders of convergence of standard schemes. To deal with this singularity, the solution is computed with a fitted difference scheme on a graded mesh. The convergence of this scheme…
▽ More
In this paper we consider an initial-boundary value problem with a Caputo time derivative of order $α\in(0,1)$. The solution typically exhibits a weak singularity near the initial time and this causes a reduction in the orders of convergence of standard schemes. To deal with this singularity, the solution is computed with a fitted difference scheme on a graded mesh. The convergence of this scheme is analysed using a discrete maximum principle and carefully chosen barrier functions. Sharp error estimates are proved, which show an enhancement in the convergence rate compared with the standard L1 approximation on uniform meshes, and also indicate an optimal choice for the mesh grading. This optimal mesh grading is less severe than the optimal grading for the standard L1 scheme. Furthermore, the dependence of the error on the final time forms part of our error estimate. Numerical experiments are presented which corroborate our theoretical results.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Beyond domain alignment: Revealing the effect of intrinsic magnetic order on electrochemical water splitting
Authors:
Emma van der Minne,
Lucas Korol,
Lidewij M. A. Krakers,
Michael Verhage,
Carlos M. M. Rosário,
Thijs J. Roskamp,
Raymond J. Spiteri,
Chiara Biz,
Mauro Fianchini,
Guus Rijnders,
Kees Flipse,
Jose Gracia,
Guido Mul,
Hans Hilgenkamp,
Robert J. Green,
Gertjan Koster,
Christoph Baeumer
Abstract:
To reach a long term viable green hydrogen economy, rational design of active oxygen evolution reaction (OER) catalysts is critical. An important hurdle in this reaction originates from the fact that the reactants are singlet molecules, whereas the oxygen molecule has a triplet ground state with parallel spin alignment, implying that magnetic order in the catalyst is essential. Accordingly, multip…
▽ More
To reach a long term viable green hydrogen economy, rational design of active oxygen evolution reaction (OER) catalysts is critical. An important hurdle in this reaction originates from the fact that the reactants are singlet molecules, whereas the oxygen molecule has a triplet ground state with parallel spin alignment, implying that magnetic order in the catalyst is essential. Accordingly, multiple experimentalists reported a positive effect of external magnetic fields on OER activity of ferromagnetic catalysts. However, it remains a challenge to investigate the influence of the intrinsic magnetic order on catalytic activity. Here, we tuned the intrinsic magnetic order of epitaxial La$_{0.67}$Sr$_{0.33}$MnO$_{3}$ thin film model catalysts from ferro- to paramagnetic by changing the temperature in-situ during water electrolysis. Using this strategy, we show that ferromagnetic ordering below the Curie temperature enhances OER activity. Moreover, we show a slight current density enhancement upon application of an external magnetic field and find that the dependence of magnetic field direction correlates with the magnetic anisotropy in the catalyst film. Our work thus suggests that both the intrinsic magnetic order in La$_{0.67}$Sr$_{0.33}$MnO$_{3}$ films and magnetic domain alignment increase their catalytic activity. We observe no long-range magnetic order at the catalytic surface, implying that the OER enhancement is connected to the magnetic order of the bulk catalyst. Combining the effects found with existing literature, we propose a unifying picture for the spin-polarized enhancement in magnetic oxide catalysts.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
A singularly perturbed convection-diffusion parabolic problem with incompatible boundary/initial data
Authors:
Jose Luis Gracia,
Eugene O'Riordan
Abstract:
A singularly perturbed parabolic problem of convection-diffusion type with incompatible inflow boundary and initial conditions is examined. In the case of constant coefficients, a set of singular functions are identified which match certain incompatibilities in the data and also satisfy the associated homogenous differential equation. When the convective coefficient only depends on the time variab…
▽ More
A singularly perturbed parabolic problem of convection-diffusion type with incompatible inflow boundary and initial conditions is examined. In the case of constant coefficients, a set of singular functions are identified which match certain incompatibilities in the data and also satisfy the associated homogenous differential equation. When the convective coefficient only depends on the time variable and the initial/boundary data is discontinuous, then a mixed analytical/numerical approach is taken. In the case of variable coefficients and the zero level of compatibility being satisfied (i.e. continuous boundary/initial data), a numerical method is constructed whose order of convergence is shown to depend on the next level of compatibility being satisfied by the data. Numerical results are presented to support the theoretical error bounds established for both of the approaches examined in the paper.
△ Less
Submitted 19 December, 2022;
originally announced December 2022.
-
Callback-based Completion Notification using MPI Continuations
Authors:
Joseph Schuchart,
Philipp Samfass,
Christoph Niethammer,
José Gracia,
George Bosilca
Abstract:
Asynchronous programming models (APM) are gaining more and more traction, allowing applications to expose the available concurrency to a runtime system tasked with coordinating the execution. While MPI has long provided support for multi-threaded communication and non-blocking operations, it falls short of adequately supporting APMs as correctly and efficiently handling MPI communication in differ…
▽ More
Asynchronous programming models (APM) are gaining more and more traction, allowing applications to expose the available concurrency to a runtime system tasked with coordinating the execution. While MPI has long provided support for multi-threaded communication and non-blocking operations, it falls short of adequately supporting APMs as correctly and efficiently handling MPI communication in different models is still a challenge. We have previously proposed an extension to the MPI standard providing operation completion notifications using callbacks, so-called MPI Continuations. This interface is flexible enough to accommodate a wide range of different APMs.
In this paper, we present an extension to the previously described interface that allows for finer control of the behavior of the MPI Continuations interface. We then present some of our first experiences in using the interface in the context of different applications, including the NAS parallel benchmarks, the PaRSEC task-based runtime system, and a load-balancing scheme within an adaptive mesh refinement solver called ExaHyPE. We show that the interface, implemented inside Open MPI, enables low-latency, high-throughput completion notifications that outperform solutions implemented in the application space.
△ Less
Submitted 22 December, 2021;
originally announced December 2021.
-
Quo Vadis MPI RMA? Towards a More Efficient Use of MPI One-Sided Communication
Authors:
Joseph Schuchart,
Christoph Niethammer,
José Gracia,
George Bosilca
Abstract:
The MPI standard has long included one-sided communication abstractions through the MPI Remote Memory Access (RMA) interface. Unfortunately, the MPI RMA chapter in the 4.0 version of the MPI standard still contains both well-known and lesser known short-comings for both implementations and users, which lead to potentially non-optimal usage patterns. In this paper, we identify a set of issues and p…
▽ More
The MPI standard has long included one-sided communication abstractions through the MPI Remote Memory Access (RMA) interface. Unfortunately, the MPI RMA chapter in the 4.0 version of the MPI standard still contains both well-known and lesser known short-comings for both implementations and users, which lead to potentially non-optimal usage patterns. In this paper, we identify a set of issues and propose ways for applications to better express anticipated usage of RMA routines, allowing the MPI implementation to better adapt to the application's needs. In order to increase the flexibility of the RMA interface, we add the capability to duplicate windows, allowing access to the same resources encapsulated by a window using different configurations. In the same vein, we introduce the concept of MPI memory handles, meant to provide life-time guarantees on memory attached to dynamic windows, removing the overhead currently present in using dynamically exposed memory. We will show that our extensions provide improved accumulate latencies, reduced overheads for multi-threaded flushes, and allow for zero overhead dynamic memory window usage.
△ Less
Submitted 15 November, 2021;
originally announced November 2021.
-
Parameter-uniform numerical methods for singularly perturbed linear transport problems
Authors:
J. L. Gracia,
A. Navas-Montilla,
E. O'Riordan
Abstract:
Pointwise accurate numerical methods are constructed and analysed for three classes of singularly perturbed first order transport problems. The methods involve piecewise-uniform Shishkin meshes and the numerical approximations are shown to be parameter-uniformly convergent in the maximum norm. A transport problem from the modelling of fluid-particle interaction is formulated and used as a test pro…
▽ More
Pointwise accurate numerical methods are constructed and analysed for three classes of singularly perturbed first order transport problems. The methods involve piecewise-uniform Shishkin meshes and the numerical approximations are shown to be parameter-uniformly convergent in the maximum norm. A transport problem from the modelling of fluid-particle interaction is formulated and used as a test problem for these numerical methods. Numerical results are presented to illustrate the performance of the numerical methods and to confirm the theoretical error bounds established in the paper.
△ Less
Submitted 27 July, 2021;
originally announced July 2021.
-
Normal matrices
Authors:
Gorka Armentia,
Juan-Miguel Gracia,
Francisco-Enrique Velasco
Abstract:
Let $A$ be a square complex matrix and $z$ a complex number. The distance, with respect to the spectral norm, from $A$ to the set of matrices which have $z$ as an eigenvalue is less than or equal to the distance from $z$ to the spectrum of $A$. If these two distances are equal for a sufficiently large finite set of numbers $z$ which are not in the spectrum of $A$, then the matrix $A$ is normal.
Let $A$ be a square complex matrix and $z$ a complex number. The distance, with respect to the spectral norm, from $A$ to the set of matrices which have $z$ as an eigenvalue is less than or equal to the distance from $z$ to the spectrum of $A$. If these two distances are equal for a sufficiently large finite set of numbers $z$ which are not in the spectrum of $A$, then the matrix $A$ is normal.
△ Less
Submitted 1 June, 2021;
originally announced June 2021.
-
Magnetic domain wall dynamics in the precessional regime: influence of the Dzyaloshinskii-Moriya interaction
Authors:
J. Pena Gracia,
A. Fassatoui,
J. Vogel,
A. Thiaville,
S. Pizzini
Abstract:
The domain wall dynamics driven by an out of plane magnetic field was measured for a series of magnetic trilayers with different strengths of the interfacial Dzyaloshinskii-Moriya interaction (DMI). The features of the field-driven domain wall velocity curves strongly depend on the amplitude of the HD field stabilising chiral Néel walls. The measured Walker velocity, which in systems with large DM…
▽ More
The domain wall dynamics driven by an out of plane magnetic field was measured for a series of magnetic trilayers with different strengths of the interfacial Dzyaloshinskii-Moriya interaction (DMI). The features of the field-driven domain wall velocity curves strongly depend on the amplitude of the HD field stabilising chiral Néel walls. The measured Walker velocity, which in systems with large DMI is maintained after the Walker field giving rise to a velocity plateau up to the Slonczewski field HS, can be easily related to the DMI strength. Yet, when the DMI field HD and the domain wall demagnetising field HDW have comparable values, a careful analysis needs to be done in order to evaluate the impact of the DMI on the domain wall velocity. By means of a one-dimensional model and 2D simulations, we successfully extend this method to explain experimental results to cases where HD and HDW are comparable.
△ Less
Submitted 17 July, 2021; v1 submitted 16 December, 2020;
originally announced December 2020.
-
Fibers are not (P)Threads: The Case for Loose Coupling of Asynchronous Programming Models and MPI Through Continuations
Authors:
Joseph Schuchart,
Christoph Niethammer,
José Gracia
Abstract:
Asynchronous programming models (APM) are gaining more and more traction, allowing applications to expose the available concurrency to a runtime system tasked with coordinating the execution. While MPI has long provided support for multi-threaded communication and non-blocking operations, it falls short of adequately supporting APMs as correctly and efficiently handling MPI communication in differ…
▽ More
Asynchronous programming models (APM) are gaining more and more traction, allowing applications to expose the available concurrency to a runtime system tasked with coordinating the execution. While MPI has long provided support for multi-threaded communication and non-blocking operations, it falls short of adequately supporting APMs as correctly and efficiently handling MPI communication in different models is still a challenge. Meanwhile, new low-level implementations of light-weight, cooperatively scheduled execution contexts (fibers, aka user-level threads (ULT)) are meant to serve as a basis for higher-level APMs and their integration in MPI implementations has been proposed as a replacement for traditional POSIX thread support to alleviate these challenges.
In this paper, we first establish a taxonomy in an attempt to clearly distinguish different concepts in the parallel software stack. We argue that the proposed tight integration of fiber implementations with MPI is neither warranted nor beneficial and instead is detrimental to the goal of MPI being a portable communication abstraction. We propose MPI Continuations as an extension to the MPI standard to provide callback-based notifications on completed operations, leading to a clear separation of concerns by providing a loose coupling mechanism between MPI and APMs. We show that this interface is flexible and interacts well with different APMs, namely OpenMP detached tasks, OmpSs-2, and Argobots.
△ Less
Submitted 12 November, 2020;
originally announced November 2020.
-
Parameter-uniform approximations for a singularly perturbed convection-diffusion problem with a discontinuous initial condition
Authors:
Jose Luis Gracia,
Eugene O'Riordan
Abstract:
A singularly perturbed parabolic problem of convection-diffusion type with a discontinuous initial condition is examined. A particular complimentary error function is identified which matches the discontinuity in the initial condition. The difference between this analytical function and the solution of the parabolic problem is approximated numerically. A co-ordinate transformation is used so that…
▽ More
A singularly perturbed parabolic problem of convection-diffusion type with a discontinuous initial condition is examined. A particular complimentary error function is identified which matches the discontinuity in the initial condition. The difference between this analytical function and the solution of the parabolic problem is approximated numerically. A co-ordinate transformation is used so that a layer-adapted mesh can be aligned to the interior layer present in the solution. Numerical analysis is presented for the associated numerical method, which establishes that the numerical method is a parameter-uniform numerical method. Numerical results are presented to illustrate the pointwise error bounds established in the paper.
△ Less
Submitted 8 February, 2022; v1 submitted 19 October, 2020;
originally announced October 2020.
-
Numerical approximations to a singularly perturbed convection-diffusion problem with a discontinuous initial condition
Authors:
Jose Luis Gracia,
Eugene O'Riordan
Abstract:
A singularly perturbed parabolic problem of convection-diffusion type with a discontinuous initial condition is examined. An analytic function is identified which matches the discontinuity in the initial condition and also satisfies the homogenous parabolic differential equation associated with the problem. The difference between this analytical function and the solution of the parabolic problem i…
▽ More
A singularly perturbed parabolic problem of convection-diffusion type with a discontinuous initial condition is examined. An analytic function is identified which matches the discontinuity in the initial condition and also satisfies the homogenous parabolic differential equation associated with the problem. The difference between this analytical function and the solution of the parabolic problem is approximated numerically, using an upwind finite difference operator combined with an appropriate layer-adapted mesh. The numerical method is shown to be parameter-uniform. Numerical results are presented to illustrate the theoretical error bounds established in the paper.
△ Less
Submitted 14 February, 2022; v1 submitted 19 October, 2020;
originally announced October 2020.
-
Collectives in hybrid MPI+MPI code: design, practice and performance
Authors:
Huan Zhou,
Jose Gracia,
Naweiluo Zhou,
Ralf Schneider
Abstract:
The use of hybrid scheme combining the message passing programming models for inter-node parallelism and the shared memory programming models for node-level parallelism is widely spread. Existing extensive practices on hybrid Message Passing Interface (MPI) plus Open Multi-Processing (OpenMP) programming account for its popularity. Nevertheless, strong programming efforts are required to gain perf…
▽ More
The use of hybrid scheme combining the message passing programming models for inter-node parallelism and the shared memory programming models for node-level parallelism is widely spread. Existing extensive practices on hybrid Message Passing Interface (MPI) plus Open Multi-Processing (OpenMP) programming account for its popularity. Nevertheless, strong programming efforts are required to gain performance benefits from the MPI+OpenMP code. An emerging hybrid method that combines MPI and the MPI shared memory model (MPI+MPI) is promising. However, writing an efficient hybrid MPI+MPI program -- especially when the collective communication operations are involved -- is not to be taken for granted.
In this paper, we propose a new design method to implement hybrid MPI+MPI context-based collective communication operations. Our method avoids on-node memory replications (on-node communication overheads) that are required by semantics in pure MPI. We also offer wrapper primitives hiding all the design details from users, which comes with practices on how to structure hybrid MPI+MPI code with these primitives. The micro-benchmarks show that our collectives are comparable or superior to those in pure MPI context. We have further validated the effectiveness of the hybrid MPI+MPI model (which uses our wrapper primitives) in three computational kernels, by comparison to the pure MPI and hybrid MPI+OpenMP models.
△ Less
Submitted 22 July, 2020;
originally announced July 2020.
-
MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+MPI Parallel Codes
Authors:
Huan Zhou,
Jose Gracia,
Ralf Schneider
Abstract:
The advent of multi-/many-core processors in clusters advocates hybrid parallel programming, which combines Message Passing Interface (MPI) for inter-node parallelism with a shared memory model for on-node parallelism. Compared to the traditional hybrid approach of MPI plus OpenMP, a new, but promising hybrid approach of MPI plus MPI-3 shared-memory extensions (MPI+MPI) is gaining attraction. We d…
▽ More
The advent of multi-/many-core processors in clusters advocates hybrid parallel programming, which combines Message Passing Interface (MPI) for inter-node parallelism with a shared memory model for on-node parallelism. Compared to the traditional hybrid approach of MPI plus OpenMP, a new, but promising hybrid approach of MPI plus MPI-3 shared-memory extensions (MPI+MPI) is gaining attraction. We describe an algorithmic approach for collective operations (with allgather and broadcast as concrete examples) in the context of hybrid MPI+MPI, so as to minimize memory consumption and memory copies. With this approach, only one memory copy is maintained and shared by on-node processes. This allows the removal of unnecessary on-node copies of replicated data that are required between MPI processes when the collectives are invoked in the context of pure MPI. We compare our approach of collectives for hybrid MPI+MPI and the traditional one for pure MPI, and also have a discussion on the synchronization that is required to guarantee data integrity. The performance of our approach has been validated on a Cray XC40 system (Cray MPI) and NEC cluster (OpenMPI), showing that it achieves comparable or better performance for allgather operations. We have further validated our approach with a standard computational kernel, namely distributed matrix multiplication, and a Bayesian Probabilistic Matrix Factorization code.
△ Less
Submitted 14 July, 2020;
originally announced July 2020.
-
Performance and energy consumption of HPC workloads on a cluster based on Arm ThunderX2 CPU
Authors:
Filippo Mantovani,
Marta Garcia-Gasulla,
José Gracia,
Esteban Stafford,
Fabio Banchelli,
Marc Josep-Fabrego,
Joel Criado-Ledesma,
Mathias Nachtmann
Abstract:
In this paper, we analyze the performance and energy consumption of an Arm-based high-performance computing (HPC) system developed within the European project Mont-Blanc 3. This system, called Dibona, has been integrated by ATOS/Bull, and it is powered by the latest Marvell's CPU, ThunderX2. This CPU is the same one that powers the Astra supercomputer, the first Arm-based supercomputer entering th…
▽ More
In this paper, we analyze the performance and energy consumption of an Arm-based high-performance computing (HPC) system developed within the European project Mont-Blanc 3. This system, called Dibona, has been integrated by ATOS/Bull, and it is powered by the latest Marvell's CPU, ThunderX2. This CPU is the same one that powers the Astra supercomputer, the first Arm-based supercomputer entering the Top500 in November 2018. We study from micro-benchmarks up to large production codes. We include an interdisciplinary evaluation of three scientific applications (a finite-element fluid dynamics code, a smoothed particle hydrodynamics code, and a lattice Boltzmann code) and the Graph 500 benchmark, focusing on parallel and energy efficiency as well as studying their scalability up to thousands of Armv8 cores. For comparison, we run the same tests on state-of-the-art x86 nodes included in Dibona and the Tier-0 supercomputer MareNostrum4. Our experiments show that the ThunderX2 has a 25% lower performance on average, mainly due to its small vector unit yet somewhat compensated by its 30% wider links between the CPU and the main memory. We found that the software ecosystem of the Armv8 architecture is comparable to the one available for Intel. Our results also show that ThunderX2 delivers similar or better energy-to-solution and scalability, proving that Arm-based chips are legitimate contenders in the market of next-generation HPC systems.
△ Less
Submitted 10 July, 2020; v1 submitted 9 July, 2020;
originally announced July 2020.
-
Orchestrating NLP Services for the Legal Domain
Authors:
Julián Moreno-Schneider,
Georg Rehm,
Elena Montiel-Ponsoda,
Víctor Rodriguez-Doncel,
Artem Revenko,
Sotirios Karampatakis,
Maria Khvalchik,
Christian Sageder,
Jorge Gracia,
Filippo Maganza
Abstract:
Legal technology is currently receiving a lot of attention from various angles. In this contribution we describe the main technical components of a system that is currently under development in the European innovation project Lynx, which includes partners from industry and research. The key contribution of this paper is a workflow manager that enables the flexible orchestration of workflows based…
▽ More
Legal technology is currently receiving a lot of attention from various angles. In this contribution we describe the main technical components of a system that is currently under development in the European innovation project Lynx, which includes partners from industry and research. The key contribution of this paper is a workflow manager that enables the flexible orchestration of workflows based on a portfolio of Natural Language Processing and Content Curation services as well as a Multilingual Legal Knowledge Graph that contains semantic information and meaningful references to legal documents. We also describe different use cases with which we experiment and develop prototypical solutions.
△ Less
Submitted 28 March, 2020;
originally announced March 2020.
-
Semantic Relatedness for Keyword Disambiguation: Exploiting Different Embeddings
Authors:
María G. Buey,
Carlos Bobed,
Jorge Gracia,
Eduardo Mena
Abstract:
Understanding the meaning of words is crucial for many tasks that involve human-machine interaction. This has been tackled by research in Word Sense Disambiguation (WSD) in the Natural Language Processing (NLP) field. Recently, WSD and many other NLP tasks have taken advantage of embeddings-based representation of words, sentences, and documents. However, when it comes to WSD, most embeddings mode…
▽ More
Understanding the meaning of words is crucial for many tasks that involve human-machine interaction. This has been tackled by research in Word Sense Disambiguation (WSD) in the Natural Language Processing (NLP) field. Recently, WSD and many other NLP tasks have taken advantage of embeddings-based representation of words, sentences, and documents. However, when it comes to WSD, most embeddings models suffer from ambiguity as they do not capture the different possible meanings of the words. Even when they do, the list of possible meanings for a word (sense inventory) has to be known in advance at training time to be included in the embeddings space. Unfortunately, there are situations in which such a sense inventory is not known in advance (e.g., an ontology selected at run-time), or it evolves with time and its status diverges from the one at training time. This hampers the use of embeddings models for WSD. Furthermore, traditional WSD techniques do not perform well in situations in which the available linguistic information is very scarce, such as the case of keyword-based queries. In this paper, we propose an approach to keyword disambiguation which grounds on a semantic relatedness between words and senses provided by an external inventory (ontology) that is not known at training time. Building on previous works, we present a semantic relatedness measure that uses word embeddings, and explore different disambiguation algorithms to also exploit both word and sentence representations. Experimental results show that this approach achieves results comparable with the state of the art when applied for WSD, without training for a particular domain.
△ Less
Submitted 25 February, 2020;
originally announced February 2020.
-
Singularly perturbed reaction-diffusion problems with discontinuities in the initial and/or the boundary data
Authors:
Jose Luis Gracia,
Eugene O'Riordan
Abstract:
Numerical approximations to the solutions of three different problem classes of singularly perturbed parabolic reaction-diffusion problems, each with a discontinuity in the bound\-ary-initial data, are generated. For each problem class, an analytical function associated with the discontinuity in the data, is identified. Parameter-uniform numerical approximations to the difference between the analy…
▽ More
Numerical approximations to the solutions of three different problem classes of singularly perturbed parabolic reaction-diffusion problems, each with a discontinuity in the bound\-ary-initial data, are generated. For each problem class, an analytical function associated with the discontinuity in the data, is identified. Parameter-uniform numerical approximations to the difference between the analytical function and the solution of the singularly perturbed problem are generated using piecewise-uniform Shishkin meshes. Numerical results are given to illustrate all the theoretical error bounds established in the paper.
△ Less
Submitted 19 February, 2019; v1 submitted 9 November, 2018;
originally announced November 2018.
-
Parameter-uniform numerical methods for singularly perturbed parabolic problems with incompatible boundary-initial data
Authors:
Jose Luis Gracia,
Eugene O'Riordan
Abstract:
Numerical approximations to the solution of a linear singularly perturbed parabolic reaction-diffusion problem with incompatible bound\-ary-initial data are generated, The method involves combining the computational solution of a classical finite difference operator on a tensor product of two piecewise-uniform Shishkin meshes with an analytical function that captures the local nature of the incomp…
▽ More
Numerical approximations to the solution of a linear singularly perturbed parabolic reaction-diffusion problem with incompatible bound\-ary-initial data are generated, The method involves combining the computational solution of a classical finite difference operator on a tensor product of two piecewise-uniform Shishkin meshes with an analytical function that captures the local nature of the incompatibility. A proof is given to show almost first order parameter-uniform convergence of these numerical/analytical approximations. Numerical results are given to illustrate the theoretical error bounds.
△ Less
Submitted 15 November, 2018; v1 submitted 27 June, 2018;
originally announced June 2018.
-
Two planetary systems with transiting Earth-size and super-Earth planets orbiting late-type dwarf stars
Authors:
E. Diez Alonso,
J. I. Gonzalez Hernandez,
S. L. Suarez Gomez,
D. S. Aguado,
C. Gonzalez Gutierrez,
A. Suarez Mascareno,
A. Cabrera-Lavers,
J. Gonzalez-Nuevo,
B. Toledo Padron,
J. Gracia,
F. J. de Cos Juez,
R. Rebolo
Abstract:
We present two new planetary systems found around cool dwarf stars with data from the K2 mission. The first system was found in K2-239 (EPIC 248545986), char- acterized in this work as M3.0V and observed in the 14th campaign of K2. It consists of three Earth-size transiting planets with radii of 1.1, 1.0 and 1.1 R Earth, showing a compact configuration with orbital periods of 5.24, 7.78 and 10.1 d…
▽ More
We present two new planetary systems found around cool dwarf stars with data from the K2 mission. The first system was found in K2-239 (EPIC 248545986), char- acterized in this work as M3.0V and observed in the 14th campaign of K2. It consists of three Earth-size transiting planets with radii of 1.1, 1.0 and 1.1 R Earth, showing a compact configuration with orbital periods of 5.24, 7.78 and 10.1 days, close to 2:3:4 resonance. The second was found in K2-240 (EPIC 249801827), characterized in this work as M0.5V and observed in the 15th campaign. It consists of two transiting super-Earths with radii 2.0 and 1.8 R Earth and orbital periods of 6.03 and 20.5 days. The equilibrium temperatures of the atmospheres of these planets are estimated to be in the range of 380-600 K and the amplitudes of signals in transmission spectroscopy are estimated at ~10 ppm.
△ Less
Submitted 5 June, 2018; v1 submitted 4 June, 2018;
originally announced June 2018.
-
Towards performance portability through locality-awareness for applications using one-sided communication primitives
Authors:
Huan Zhou,
Jose Gracia
Abstract:
MPI is the most widely used data transfer and communication model in High Performance Computing. The latest version of the standard, MPI-3, allows skilled programmers to exploit all hardware capabilities of the latest and future supercomputing systems. The revised asynchronous remote-memory-access model in combination with the shared-memory window extension, in particular, allow writing code that…
▽ More
MPI is the most widely used data transfer and communication model in High Performance Computing. The latest version of the standard, MPI-3, allows skilled programmers to exploit all hardware capabilities of the latest and future supercomputing systems. The revised asynchronous remote-memory-access model in combination with the shared-memory window extension, in particular, allow writing code that hides communication latencies and optimizes communication paths according to the locality of data origin and destination. The latter is particularly important for today's multi- and many-core systems. However, writing such efficient code is highly complex and error-prone. In this paper we evaluate a recent remote-memory-access model, namely DART-MPI. This model claims to hide the aforementioned complexities from the programmer, but deliver locality-aware remote-memory-access semantics which outperforms MPI-3 one-sided communication primitives on multi-core systems. Conceptually, the DART-MPI interface is simple; at the same time it takes care of the complexities of the underlying MPI-3 and system topology. This makes DART-MPI an interesting candidate for porting legacy applications. We evaluate these claims using a realistic scientific application, specifically a finite-difference stencil code which solves the heat diffusion equation, on a large-scale Cray XC40 installation.
△ Less
Submitted 29 September, 2016;
originally announced September 2016.
-
Asynchronous progress design for a MPI-based PGAS one-sided communication system
Authors:
Huan Zhou,
Jose Gracia
Abstract:
Remote-memory-access models, also known as one-sided communication models, are becoming an interesting alternative to traditional two-sided communication models in the field of High Performance Computing. In this paper we extend previous work on an MPI-based, locality-aware remote-memory-access model with a asynchronous progress-engine for non-blocking communication operations. Most previous relat…
▽ More
Remote-memory-access models, also known as one-sided communication models, are becoming an interesting alternative to traditional two-sided communication models in the field of High Performance Computing. In this paper we extend previous work on an MPI-based, locality-aware remote-memory-access model with a asynchronous progress-engine for non-blocking communication operations. Most previous related work suggests to drive progression on communication through an additional thread within the application process. In contrast, our scheme uses an arbitrary number of dedicated processes to drive asynchronous progression. Further, we describe a prototypical library implementation of our concepts, namely DART, which is used to quantitatively evaluate our design against a MPI-3 baseline reference. The evaluation consists of micro-benchmark to measure overlap of communication and computation and a scientific application kernel to assess total performance impact on realistic use-cases. Our benchmarks shows, that our asynchronous progression scheme can overlap computation and communication efficiently and lead to substantially shorter communication cost in real applications.
△ Less
Submitted 27 September, 2016;
originally announced September 2016.
-
A Bandwidth-saving Optimization for MPI Broadcast Collective Operation
Authors:
Huan Zhou,
Vladimir Marjanovic,
Christoph Niethammer,
José Gracia
Abstract:
The efficiency and scalability of MPI collective operations, in particular the broadcast operation, plays an integral part in high performance computing applications. MPICH, as one of the contemporary widely-used MPI software stacks, implements the broadcast operation based on point-to-point operation. Depending on the parameters, such as message size and process count, the library chooses to use…
▽ More
The efficiency and scalability of MPI collective operations, in particular the broadcast operation, plays an integral part in high performance computing applications. MPICH, as one of the contemporary widely-used MPI software stacks, implements the broadcast operation based on point-to-point operation. Depending on the parameters, such as message size and process count, the library chooses to use different algorithms, as for instance binomial dissemination, recursive-doubling exchange or ring all-to-all broadcast (allgather). However, the existing broadcast design in latest release of MPICH does not provide good performance for large messages (\textit{lmsg}) or medium messages with non-power-of-two process counts (\textit{mmsg-npof2}) due to the inner suboptimal ring allgather algorithm. In this paper, based on the native broadcast design in MPICH, we propose a tuned broadcast approach with bandwidth-saving in mind catering to the case of \textit{lmsg} and \textit{mmsg-npof2}. Several comparisons of the native and tuned broadcast designs are made for different data sizes and program sizes on Cray XC40 cluster. The results show that the performance of the tuned broadcast design can get improved by a range from 2\% to 54\% for \textit{lmsg} and \textit{mmsg-npof2} in terms of user-level testing.
△ Less
Submitted 22 March, 2016;
originally announced March 2016.
-
Leveraging MPI-3 Shared-Memory Extensions for Efficient PGAS Runtime Systems
Authors:
Huan Zhou,
Kamran Idrees,
José Gracia
Abstract:
The relaxed semantics and rich functionality of one-sided communication primitives of MPI-3 makes MPI an attractive candidate for the implementation of PGAS models. However, the performance of such implementation suffers from the fact, that current MPI RMA implementations typically have a large overhead when source and target of a communication request share a common, local physical memory. In thi…
▽ More
The relaxed semantics and rich functionality of one-sided communication primitives of MPI-3 makes MPI an attractive candidate for the implementation of PGAS models. However, the performance of such implementation suffers from the fact, that current MPI RMA implementations typically have a large overhead when source and target of a communication request share a common, local physical memory. In this paper, we present an optimized PGAS-like runtime system which uses the new MPI-3 shared-memory extensions to serve intra-node communication requests and MPI-3 one-sided communication primitives to serve inter-node communication requests. The performance of our runtime system is evaluated on a Cray XC40 system through low-level communication benchmarks, a random-access benchmark and a stencil kernel. The results of the experiments demonstrate that the performance of our hybrid runtime system matches the performance of low-level RMA libraries for intra-node transfers, and that of MPI-3 for inter-node transfers.
△ Less
Submitted 7 March, 2016;
originally announced March 2016.
-
A Micromegas-based low-background x-ray detector coupled to a slumped-glass telescope for axion research
Authors:
F. Aznar,
J. Castel,
F. E. Christensen,
T. Dafni,
T. A. Decker,
E. Ferrer-Ribas,
J. A. Garcia,
I. Giomataris,
J. G. Gracia,
C. J. Hailey,
R. M. Hill,
F. J. Iguaz,
I. G. Irastorza,
A. C. Jakobsen,
G. Luzon,
H. Mirallas,
T. Papaevangelou,
M. J. Pivovaroff,
J. Ruz,
T. Vafeiadis,
J. K. Vogel
Abstract:
We report on the design, construction and operation of a low background x-ray detection line composed of a shielded Micromegas (micromesh gaseous structure) detector of the microbulk technique. The detector is made from radiopure materials and is placed at the focal point of a $\sim$~5 cm diameter, 1.3 m focal-length, cone-approximation Wolter I x-ray telescope (XRT) comprised of thermally-formed…
▽ More
We report on the design, construction and operation of a low background x-ray detection line composed of a shielded Micromegas (micromesh gaseous structure) detector of the microbulk technique. The detector is made from radiopure materials and is placed at the focal point of a $\sim$~5 cm diameter, 1.3 m focal-length, cone-approximation Wolter I x-ray telescope (XRT) comprised of thermally-formed (or "slumped") glass substrates deposited with multilayer coatings. The system has been conceived as a technological pathfinder for the future International Axion Observatory (IAXO), as it combines two of the techniques (optic and detector) proposed in the conceptual design of the project. It is innovative for two reasons: it is the first time an x-ray optic has been designed and fabricated specifically for axion research, and the first time a Micromegas detector has been operated with an x-ray optic. The line has been installed at one end of the CERN Axion Solar Telescope (CAST) magnet and is currently looking for solar axions. The combination of the XRT and Micromegas detector provides the best signal-to-noise ratio obtained so far by any detection system of the CAST experiment with a background rate of 5.4$\times$10$^{-3}\;$counts per hour in the energy region-of-interest and signal spot area.
△ Less
Submitted 21 September, 2015;
originally announced September 2015.
-
Exploring 0.1-10$\,$eV axions with a new helioscope concept
Authors:
J. Galán,
T. Dafni,
E. Ferrer-Ribas,
I. Giomataris,
F. J. Iguaz,
I. G. Irastorza,
J. A. García,
J. Gracia,
G. Luzón,
T. Papaevangelou,
J. Redondo,
A. Tomás
Abstract:
We explore the possibility to develop a new axion helioscope type, sensitive to the higher axion mass region favored by axion models. We propose to use a low background large volume TPC immersed in an intense magnetic field. Contrary to traditional tracking helioscopes, this detection technique takes advantage of the capability to directly detect the photons converted on the buffer gas which defin…
▽ More
We explore the possibility to develop a new axion helioscope type, sensitive to the higher axion mass region favored by axion models. We propose to use a low background large volume TPC immersed in an intense magnetic field. Contrary to traditional tracking helioscopes, this detection technique takes advantage of the capability to directly detect the photons converted on the buffer gas which defines the axion mass sensitivity region, and does not require pointing the magnet to the Sun. The operation flexibility of a TPC to be used with different gas mixtures (He, Ne, Xe, etc) and pressures (from 10 mbar to 10 bar) will allow to enhance sensitivity for axion masses from few meV to several eV. We present different helioscope data taking scenarios, considering detection efficiency and axion absorption probability, and show the sensitivities reachable with this technique to be few $\times$ 10$^{-11}\,$GeV$^{-1}$ for a 5$\,$T$\,$m$^3$ scale TPC. We show that a few years program taking data with such setup would allow to probe the KSVZ axion model for axion masses above 100 meV.
△ Less
Submitted 7 September, 2015; v1 submitted 12 August, 2015;
originally announced August 2015.
-
DART-MPI: An MPI-based Implementation of a PGAS Runtime System
Authors:
Huan Zhou,
Yousri Mhedheb,
Kamran Idrees,
Colin W. Glass,
José Gracia,
Karl Fürlinger,
Jie Tao
Abstract:
A Partitioned Global Address Space (PGAS) approach treats a distributed system as if the memory were shared on a global level. Given such a global view on memory, the user may program applications very much like shared memory systems. This greatly simplifies the tasks of develo** parallel applications, because no explicit communication has to be specified in the program for data exchange between…
▽ More
A Partitioned Global Address Space (PGAS) approach treats a distributed system as if the memory were shared on a global level. Given such a global view on memory, the user may program applications very much like shared memory systems. This greatly simplifies the tasks of develo** parallel applications, because no explicit communication has to be specified in the program for data exchange between different computing nodes. In this paper we present DART, a runtime environment, which implements the PGAS paradigm on large-scale high-performance computing clusters. A specific feature of our implementation is the use of one-sided communication of the Message Passing Interface (MPI) version 3 (i.e. MPI-3) as the underlying communication substrate. We evaluated the performance of the implementation with several low-level kernels in order to determine overheads and limitations in comparison to the underlying MPI-3.
△ Less
Submitted 7 July, 2015;
originally announced July 2015.
-
Spread of pathogens in the patient transfer network of US hospitals
Authors:
Juan Fernández Gracia,
Jukka-Pekka Onnela,
Michael L. Barnett,
Víctor M. Eguíluz,
Nicholas A. Christakis
Abstract:
Emergent antibiotic-resistant bacterial infections are an increasingly significant source of morbidity and mortality. Antibiotic-resistant organisms have a natural reservoir in hospitals, and recent estimates suggest that almost 2 million people develop hospital-acquired infections each year in the US alone. We investigate a network induced by the transfer of Medicare patients across US hospitals…
▽ More
Emergent antibiotic-resistant bacterial infections are an increasingly significant source of morbidity and mortality. Antibiotic-resistant organisms have a natural reservoir in hospitals, and recent estimates suggest that almost 2 million people develop hospital-acquired infections each year in the US alone. We investigate a network induced by the transfer of Medicare patients across US hospitals over a 2-year period to learn about the possible role of hospital-to-hospital transfers of patients in the spread of infections. We analyze temporal, geographical, and topological properties of the transfer network and demonstrate, using C. Diff. as a case study, that this network may serve as a substrate for the spread of infections. Finally, we study different strategies for the early detection of incipient epidemics, finding that using approximately 2% of hospitals as sensors, chosen based on their network in-degree, results in optimal performance for this early warning system, enabling the early detection of 80% of the C. Diff. cases.
△ Less
Submitted 30 April, 2015;
originally announced April 2015.
-
Directional derivatives of the singular values of matrices depending on several real parameters
Authors:
Juan-Miguel Gracia
Abstract:
In this document I recapitulate some results by Hiriart-Urruty and Ye (1995) concerning the properties of differentiability and the existence of lateral directional derivatives of the multiple eigenvalues of a complex Hermitian matrix function of several real variables, where the eigenvalues are supposed in a decreasing order. Another version of these results was obtained by Ji-guang Sun (1988).…
▽ More
In this document I recapitulate some results by Hiriart-Urruty and Ye (1995) concerning the properties of differentiability and the existence of lateral directional derivatives of the multiple eigenvalues of a complex Hermitian matrix function of several real variables, where the eigenvalues are supposed in a decreasing order. Another version of these results was obtained by Ji-guang Sun (1988).
This 2020 version has been written following a remark raised by Miloud Sadkane about the presence or not of the value 1/2 in the formulas that give the lateral directional derivatives of multiple singular values. Therefore, I have added Theorem 9. I have also written in more detail the relationship of the reviewed results with those in the bibliography. Finally, I have also put a result of mine (Corollary 11 in [1]) (joint with Armentia and Velasco) which interprets a Lippert's Theorem (2005).
△ Less
Submitted 14 May, 2020; v1 submitted 7 April, 2015;
originally announced April 2015.
-
CppSs -- a C++ Library for Efficient Task Parallelism
Authors:
Steffen Brinkmann,
Jose Gracia
Abstract:
We present the C++ library CppSs (C++ super-scalar), which provides efficient task-parallelism without the need for special compilers or other software. Any C++ compiler that supports C++11 is sufficient. CppSs features different directionality clauses for defining data dependencies. While the variable argument lists of the taskified functions are evaluated at compile time, the resulting task depe…
▽ More
We present the C++ library CppSs (C++ super-scalar), which provides efficient task-parallelism without the need for special compilers or other software. Any C++ compiler that supports C++11 is sufficient. CppSs features different directionality clauses for defining data dependencies. While the variable argument lists of the taskified functions are evaluated at compile time, the resulting task dependencies are fixed by the runtime value of the arguments and are thus analysed at runtime. With CppSs, we provide task-parallelism using merely native C++.
△ Less
Submitted 26 February, 2015;
originally announced February 2015.
-
3D simulations of disc-winds extending radially self-similar MHD models
Authors:
Matthias Stute,
Jose Gracia,
Nektarios Vlahakis,
Kanaris Tsinganos,
Andrea Mignone,
Silvano Massaglia
Abstract:
Disc-winds originating from the inner parts of accretion discs are considered as the basic component of magnetically collimated outflows. The only available analytical MHD solutions to describe disc-driven jets are those characterized by the symmetry of radial self-similarity. However, radially self-similar MHD jet models, in general, have three geometrical shortcomings, (i) a singularity at the j…
▽ More
Disc-winds originating from the inner parts of accretion discs are considered as the basic component of magnetically collimated outflows. The only available analytical MHD solutions to describe disc-driven jets are those characterized by the symmetry of radial self-similarity. However, radially self-similar MHD jet models, in general, have three geometrical shortcomings, (i) a singularity at the jet axis, (ii) the necessary assumption of axisymmetry, and (iii) the non-existence of an intrinsic radial scale, i.e. the jets formally extend to radial infinity. Hence, numerical simulations are necessary to extend the analytical solutions towards the axis, by solving the full three-dimensional equations of MHD and impose a termination radius at finite radial distance. We focus here on studying the effects of relaxing the (ii) assumption of axisymmetry, i.e. of performing full 3D numerical simulations of a disc-wind crossing all magnetohydrodynamic critical surfaces. We compare the results of these runs with previous axisymmetric 2.5D simulations. The structure of the flow in all simulations shows strong similarities. The 3D runs reach a steady state and stay close to axisymmetry for most of the physical quantities, except for the poloidal magnetic field and the toroidal velocity which slightly deviate from axisymmetry. The latter quantities show signs of instabilities, which, however, are confined to the region inside the fast magnetosonic separatrix surface. The forces present in the flow, both of collimating and accelerating nature, are in good agreement in both the 2.5D and the 3D runs. We conclude that the analytical solution behaves well also after relaxing the basic assumption of axisymmetry.
△ Less
Submitted 31 January, 2014;
originally announced February 2014.
-
Avoiding Serialization Effects in Data-Dependency aware Task Parallel Algorithms for Spatial Decomposition
Authors:
Christoph Niethammer,
Colin W. Glass,
Jose Gracia
Abstract:
Spatial decomposition is a popular basis for parallelising code. Cast in the frame of task parallelism, calculations on a spatial domain can be treated as a task. If neighbouring domains interact and share results, access to the specific data needs to be synchronized to avoid race conditions. This is the case for a variety of applications, like most molecular dynamics and many computational fluid…
▽ More
Spatial decomposition is a popular basis for parallelising code. Cast in the frame of task parallelism, calculations on a spatial domain can be treated as a task. If neighbouring domains interact and share results, access to the specific data needs to be synchronized to avoid race conditions. This is the case for a variety of applications, like most molecular dynamics and many computational fluid dynamics codes. Here we present an unexpected problem which can occur in dependency-driven task parallelization models like StarSs: the tasks accessing a specific spatial domain are treated as interdependent, as dependencies are detected automatically via memory addresses. Thus, the order in which tasks are generated will have a severe impact on the dependency tree. In the worst case, a complete serialization is reached and no two tasks can be calculated in parallel. We present the problem in detail based on an example from molecular dynamics, and introduce a theoretical framework to calculate the degree of serialization. Furthermore, we present strategies to avoid this unnecessary problem. We recommend treating these strategies as best practice when using dependency-driven task parallel programming models like StarSs on such scenarios.
△ Less
Submitted 17 January, 2014;
originally announced January 2014.
-
A finite difference method for a two-point boundary value problem with a Caputo fractional derivative
Authors:
Martin Stynes,
José Luis Gracia
Abstract:
A two-point boundary value problem whose highest-order term is a Caputo fractional derivative of order $δ\in (1,2)$ is considered. Al-Refai's comparison principle is improved and modified to fit our problem. Sharp a priori bounds on derivatives of the solution $u$ of the boundary value problem are established, showing that $u''(x)$ may be unbounded at the interval endpoint $x=0$. These bounds and…
▽ More
A two-point boundary value problem whose highest-order term is a Caputo fractional derivative of order $δ\in (1,2)$ is considered. Al-Refai's comparison principle is improved and modified to fit our problem. Sharp a priori bounds on derivatives of the solution $u$ of the boundary value problem are established, showing that $u''(x)$ may be unbounded at the interval endpoint $x=0$. These bounds and a discrete comparison principle are used to prove pointwise convergence of a finite difference method for the problem, where the convective term is discretized using simple upwinding to yield stability on coarse meshes for all values of $δ$. Numerical results are presented to illustrate the performance of the method.
△ Less
Submitted 18 December, 2013;
originally announced December 2013.
-
Characterization of a medium size Xe/TMA TPC instrumented with microbulk Micromegas, using low-energy $γ$-rays
Authors:
The NEXT collaboration,
V. Alvarez,
F. I. G. M. Borges,
S. Carcel,
J. Castel,
S. Cebrian,
A. Cervera,
C. A. N. Conde,
T. Dafni,
T. H. V. T. Dias,
J. Diaz,
M. Egorov,
R. Esteve,
P. Evtoukhovitch,
L. M. P. Fernandes,
P. Ferrario,
A. L. Ferreira,
E. D. C. Freitas,
V. M. Gehman,
A. Gil,
A. Goldschmidt,
H. Gomez,
J. J. Gomez-Cadenas,
D. Gonzalez-Diaz,
R. M. Gutierrez
, et al. (65 additional authors not shown)
Abstract:
NEXT-MM is a general-purpose high pressure (10 bar, $\sim25$ l active volume) Xenon-based TPC, read out in charge mode with an 8 cm $\times$8 cm-segmented 700 cm$^2$ plane (1152 ch) of the latest microbulk-Micromegas technology. It has been recently commissioned at University of Zaragoza as part of the R&D of the NEXT $0νββ$ experiment, although the experiment's first stage is currently being buil…
▽ More
NEXT-MM is a general-purpose high pressure (10 bar, $\sim25$ l active volume) Xenon-based TPC, read out in charge mode with an 8 cm $\times$8 cm-segmented 700 cm$^2$ plane (1152 ch) of the latest microbulk-Micromegas technology. It has been recently commissioned at University of Zaragoza as part of the R&D of the NEXT $0νββ$ experiment, although the experiment's first stage is currently being built based on a SiPM/PMT-readout concept relying on electroluminescence. Around 2 million events were collected during the last months, stemming from the low energy $γ$-rays emitted by a $^{241}$Am source when interacting with the Xenon gas ($ε$ = 26, 30, 59.5 keV). The localized nature of such events above atmospheric pressure, the long drift times, as well as the possibility to determine their production time from the associated $α$ particle in coincidence, allow the extraction of primordial properties of the TPC filling gas, namely the drift velocity, diffusion and attachment coefficients. In this work we focus on the little explored combination of Xe and trimethylamine (TMA) for which, in particular, such properties are largely unknown. This gas mixture offers potential advantages over pure Xenon when aimed at Rare Event Searches, mainly due to its Penning characteristics, wave-length shifting properties and reduced diffusion, and it is being actively investigated by our collaboration. The chamber is currently operated at 2.7 bar, as an intermediate step towards the envisaged 10 bar. We report here its performance as well as a first implementation of the calibration procedures that have allowed the extension of the previously reported energy resolution to the whole readout plane (10.6%FWHM@30keV).
△ Less
Submitted 17 November, 2013; v1 submitted 14 November, 2013;
originally announced November 2013.
-
Cudagrind: A Valgrind Extension for CUDA
Authors:
Thomas M. Baumann,
Jose Gracia
Abstract:
Valgrind, and specifically the included tool Memcheck, offers an easy and reliable way for checking the correctness of memory operations in programs. This works in an unintrusive way where Valgrind translates the program into intermediate code and executes it on an emulated CPU. The heavy weight tool Memcheck uses this to keep a full shadow copy of the memory used by a program and tracking accesse…
▽ More
Valgrind, and specifically the included tool Memcheck, offers an easy and reliable way for checking the correctness of memory operations in programs. This works in an unintrusive way where Valgrind translates the program into intermediate code and executes it on an emulated CPU. The heavy weight tool Memcheck uses this to keep a full shadow copy of the memory used by a program and tracking accesses to it. This allows the detection of memory leaks and checking the validity of accesses.
Though suited for a wide variety of programs, this approach still fails when accelerator based programming models are involved. The code running on these devices is separate from the code running on the host. Access to memory on the device and starting of kernels is being handled by an API provided by the driver being used. Hence Valgrind is unable to understand and instrument operations being run on the device.
To circumvent this limitation a new set of wrapper functions have been introduced. These wrap a subset of the CUDA Driver API function that is responsible for (de-)allocation memory regions on the device and the respective memory copy operations. This allows to check whether memory is fully allocated during a transfer and, through the functionality provided by Valgrind, whether the memory transfered to the device from the host is defined and addressable. Through this technique it is possible to detect a number of common programming mistakes, which are very difficult to debug by other means. The combination of these wrappers together with the Valgrind tool Memcheck is being called Cudagrind.
△ Less
Submitted 3 October, 2013;
originally announced October 2013.
-
Hybrid MPI/StarSs - a case study
Authors:
Jose Gracia,
Christoph Niethammer,
Manuel Hasert,
Steffen Brinkmann,
Rainer Keller,
Colin W. Glass
Abstract:
Hybrid parallel programming models combining distributed and shared memory paradigms are well established in high-performance computing. The classical prototype of hybrid programming in HPC is MPI/OpenMP, but many other combinations are being investigated. Recently, the data-dependency driven, task parallel model for shared memory parallelisation named StarSs has been suggested for usage in combin…
▽ More
Hybrid parallel programming models combining distributed and shared memory paradigms are well established in high-performance computing. The classical prototype of hybrid programming in HPC is MPI/OpenMP, but many other combinations are being investigated. Recently, the data-dependency driven, task parallel model for shared memory parallelisation named StarSs has been suggested for usage in combination with MPI. In this paper we apply hybrid MPI/StarSs to a Lattice-Boltzmann code. In particular, we present the hybrid programming model, the benefits we expect, the challenges in porting, and finally a comparison of the performance of MPI/StarSs hybrid, MPI/OpenMP hybrid and the original MPI-only versions of the same code.
△ Less
Submitted 18 April, 2012;
originally announced April 2012.
-
TEMANEJO - a debugger for task based parallel programming models
Authors:
Steffen Brinkmann,
José Gracia,
Christoph Niethammer,
Rainer Keller
Abstract:
We present the program Temanejo, a debugger for task based parallelisation models such as StarSs. The challenge in debugging StarSs applications lies in the fact that tasks are scheduled at runtime, i.e dynamically in accordance to the data dependencies between them. Our tool assists the programmer in the debugging process by visualising the task dependency graph and allowing to control the schedu…
▽ More
We present the program Temanejo, a debugger for task based parallelisation models such as StarSs. The challenge in debugging StarSs applications lies in the fact that tasks are scheduled at runtime, i.e dynamically in accordance to the data dependencies between them. Our tool assists the programmer in the debugging process by visualising the task dependency graph and allowing to control the scheduling of tasks. The toolset consists of the library Ayudame which communicates with the StarSs runtime on one side and of the debugger Temanejo on the other side which communicates with Ayudame. Temanejo provides a graphical user interface with which the application can be analysed and controlled.
△ Less
Submitted 20 December, 2011;
originally announced December 2011.
-
Comparison of synthetic maps from truncated jet-formation models with YSO jet observations. II. The effect of varying inclinations
Authors:
Matthias Stute,
Jose Gracia
Abstract:
Analytical radially self-similar models are the best available solutions describing disk-winds but need several improvements. In a previous article, we introduced models of jets from truncated disks, i.e. evolved in time numerical simulations based on a radially self-similar MHD solution but including the effects of a finite radius of the jet-emitting disk and thus the outflow. In paper I of this…
▽ More
Analytical radially self-similar models are the best available solutions describing disk-winds but need several improvements. In a previous article, we introduced models of jets from truncated disks, i.e. evolved in time numerical simulations based on a radially self-similar MHD solution but including the effects of a finite radius of the jet-emitting disk and thus the outflow. In paper I of this series, we compared these models with available observational data varying the jet density and velocity, the mass of the protostar and the radius of the aforementioned truncation. In paper I, we assumed that the jet lies in the plane of the sky. In this paper, we investigate the effect of different inclinations of the jet. In order to compare our models with observed jet widths inferred from recent optical images taken with HST and AO, we create again emission maps in different forbidden lines and from such emission maps, we determine the jet width as the full-width half-maximum of the emission. We can reproduce the jet width of DG Tau and its variations very well and the derived inclination of 40$^\circ$ is in excellent agreement with literature values of 32--52$^\circ$. In CW Tau we overestimate the inclination in our best-fit model. In the other objects, we cannot find appropriate models which reproduce the variations of the observed jet widths, only the average jet width itself is well modeled as in paper I. We conclude that truncation -- i.e. taking into account the finite radius of the jet launching region -- is necessary to reproduce the observed jet widths and our simulations limit the possible range of truncation radii. The effects of inclination are important for modeling the intrinsic variations seen in observed jet widths. Our models can be used to infer independently the inclinations in the observed sample, however, a parameter study with a finer grid of parameters is needed.
△ Less
Submitted 30 November, 2011;
originally announced December 2011.
-
Comparison of synthetic maps from truncated jet-formation models with YSO jet observations
Authors:
Matthias Stute,
Jose Gracia,
Kanaris Tsinganos,
Nektarios Vlahakis
Abstract:
(abridged) Significant progress has been made in the last years in the understanding of the jet formation mechanism through a combination of numerical simulations and analytical MHD models for outflows characterized by the symmetry of self-similarity. In a previous article we introduced models of truncated jets from disks, i.e. evolved in time numerical simulations based on a radially self-similar…
▽ More
(abridged) Significant progress has been made in the last years in the understanding of the jet formation mechanism through a combination of numerical simulations and analytical MHD models for outflows characterized by the symmetry of self-similarity. In a previous article we introduced models of truncated jets from disks, i.e. evolved in time numerical simulations based on a radially self-similar MHD solution, but including the effects of a finite radius of the jet-emitting disk and thus the outflow. These models need now to be compared with available observational data. A direct comparison of the results of combined analytical theoretical models and numerical simulations with observations has not been performed as yet. In order to compare our models with observed jet widths inferred from recent optical images taken with HST and AO observations, we use a new set of tools to create emission maps in different forbidden lines, from which we determine the jet width as the FWHM of the emission. It is shown that the untruncated analytical disk outflow solution considered here cannot fit the small jet widths inferred by observations of several jets. Various truncated disk-wind models are examined, whose extracted jet widths range from higher to lower values compared to the observations. Thus we can fit the observed range of jet widths by tuning our models. We conclude that truncation is necessary to reproduce the observed jet widths and our simulations limit the possible range of truncation radii. We infer that the truncation radius, which is the radius on the disk mid-plane where the jet-emitting disk switches to a standard disk, must be between around 0.1 up to about 1 AU in the observed sample for the considered disk-wind solution. One disk-wind simulation with an inner truncation radius at about 0.11 AU also shows potential for reproducing the observations, but a parameter study is needed.
△ Less
Submitted 3 May, 2010;
originally announced May 2010.
-
Multiplicities of the structured pseudoeigenvalues
Authors:
Juan-Miguel Gracia
Abstract:
The structured pseudospectra of a matrix A are sets of complex numbers that are eigenvalues of matrices X which are near to A and have the same entries as A at a fixed set of places. The sum of multiplicities of the eigenvalues of X inside each connected component of the structured pseudospectra of A does not depend on X. This fact is known, but not so much as it should be. For this reason, we g…
▽ More
The structured pseudospectra of a matrix A are sets of complex numbers that are eigenvalues of matrices X which are near to A and have the same entries as A at a fixed set of places. The sum of multiplicities of the eigenvalues of X inside each connected component of the structured pseudospectra of A does not depend on X. This fact is known, but not so much as it should be. For this reason, we give here an elementary and detailed proof of the result.
△ Less
Submitted 20 August, 2009; v1 submitted 11 July, 2009;
originally announced July 2009.
-
Resistive MHD jet simulations with large resistivity
Authors:
Miljenko Cemeljic,
Jose Gracia,
Nektarios Vlahakis,
Kanaris Tsinganos
Abstract:
Axisymmetric resistive MHD simulations for radially self-similar initial conditions are performed, using the NIRVANA code. The magnetic diffusivity could occur in outflows above an accretion disk, being transferred from the underlying disk into the disk corona by MHD turbulence (anomalous turbulent diffusivity), or as a result of ambipolar diffusion in partially ionized flows. We introduce, in a…
▽ More
Axisymmetric resistive MHD simulations for radially self-similar initial conditions are performed, using the NIRVANA code. The magnetic diffusivity could occur in outflows above an accretion disk, being transferred from the underlying disk into the disk corona by MHD turbulence (anomalous turbulent diffusivity), or as a result of ambipolar diffusion in partially ionized flows. We introduce, in addition to the classical magnetic Reynolds number Rm, which measures the importance of resistive effects in the induction equation, a new number Rb, which measures the importance of the resistive effects in the energy equation. We find two distinct regimes of solutions in our simulations. One is the low-resistivity regime, in which results do not differ much from ideal-MHD solutions. In the high-resistivity regime, results seem to show some periodicity in time-evolution, and depart significantly from the ideal-MHD case. Whether this departure is caused by numerical or physical reasons is of considerable interest for numerical simulations and theory of astrophysical outflows and is currently investigated.
△ Less
Submitted 17 June, 2009;
originally announced June 2009.
-
Synthetic synchrotron emission maps from MHD models for the jet of M87
Authors:
J. Gracia,
N. Vlahakis,
I. Agudo,
K. Tsinganos,
S. V. Bogovalov
Abstract:
We present self-consistent global, steady-state MHD models and synthetic optically thin synchrotron emission maps for the jet of M87. The model consist of two distinct zones: an inner relativistic outflow, which we identify with the observed jet, and an outer cold disk-wind. While the former does not self-collimate efficiently due to its high effective inertia, the latter fulfills all the condit…
▽ More
We present self-consistent global, steady-state MHD models and synthetic optically thin synchrotron emission maps for the jet of M87. The model consist of two distinct zones: an inner relativistic outflow, which we identify with the observed jet, and an outer cold disk-wind. While the former does not self-collimate efficiently due to its high effective inertia, the latter fulfills all the conditions for efficient collimation by the magneto-centrifugal mechanism. Given the right balance between the effective inertia of the inner flow and the collimation efficiency of the outer disk wind, the relativistic flow is magnetically confined into a well collimated beam and matches the measurements of the opening angle of M87 over several orders of magnitude in spatial extent. The synthetic synchrotron maps reproduce the morphological structure of the jet of M87, i.e. center-bright profiles near the core and limb-bright profiles away from the core. At the same time, they also show a local increase of brightness at some distance along the axis associated to a recollimation shock in the MHD model. Its location coincides with the position of the optical knot HST-1. In addition our best fitting model is consistent with a number of observational constraints such as the magnetic field in the knot HST-1, and the jet-to-counterjet brightness ratio.
△ Less
Submitted 17 January, 2009;
originally announced January 2009.
-
Stability and structure of analytical MHD jet formation models with a finite outer disk radius
Authors:
Matthias Stute,
Kanaris Tsinganos,
Nektarios Vlahakis,
Titos Matsakos,
Jose Gracia
Abstract:
(Abridged) Finite radius accretion disks are a strong candidate for launching astrophysical jets from their inner parts and disk-winds are considered as the basic component of such magnetically collimated outflows. The only available analytical MHD solutions for describing disk-driven jets are those characterized by the symmetry of radial self-similarity. Radially self-similar MHD models, in gen…
▽ More
(Abridged) Finite radius accretion disks are a strong candidate for launching astrophysical jets from their inner parts and disk-winds are considered as the basic component of such magnetically collimated outflows. The only available analytical MHD solutions for describing disk-driven jets are those characterized by the symmetry of radial self-similarity. Radially self-similar MHD models, in general, have two geometrical shortcomings, a singularity at the jet axis and the non-existence of an intrinsic radial scale, i.e. the jets formally extend to radial infinity. Hence, numerical simulations are necessary to extend the analytical solutions towards the axis and impose a physical boundary at finite radial distance. We focus here on studying the effects of imposing an outer radius of the underlying accreting disk (and thus also of the outflow) on the topology, structure and variability of a radially self-similar analytical MHD solution. The initial condition consists of a hybrid of an unchanged and a scaled-down analytical solution, one for the jet and the other for its environment. In all studied cases, we find at the end steady two-component solutions.
△ Less
Submitted 9 September, 2008;
originally announced September 2008.
-
The effect of a stellar magnetic variation on the jet velocity
Authors:
Fabio De Colle,
Jose Gracia,
Gareth Murphy
Abstract:
Stellar jets are normally constituted by chains of knots with some periodicity in their spatial distribution, corresponding to a variability of order of several years in the ejection from the protostar/disk system. A widely accepted theory for the presence of knots is related to the generation of internal working surfaces due to variations in the jet ejection velocity. In this paper we study the…
▽ More
Stellar jets are normally constituted by chains of knots with some periodicity in their spatial distribution, corresponding to a variability of order of several years in the ejection from the protostar/disk system. A widely accepted theory for the presence of knots is related to the generation of internal working surfaces due to variations in the jet ejection velocity. In this paper we study the effect of variations in the inner disk-wind radius on the jet ejection velocity. We show that a small variation in the inner disk-wind radius produce a variation in the jet velocity large enough to generate the observed knots. We also show that the variation in the inner radius may be related to a variation of the stellar magnetic field.
△ Less
Submitted 14 August, 2008;
originally announced August 2008.
-
Resistive jet simulations extending radially self-similar magnetohydrodynamic models
Authors:
Miljenko Cemeljic,
Jose Gracia,
Nektarios Vlahakis,
Kanaris Tsinganos
Abstract:
Numerical simulations with self-similar initial and boundary conditions provide a link between theoretical and numerical investigations of jet dynamics. We perform axisymmetric resistive magnetohydrodynamic (MHD) simulations for a generalised solution of the Blandford & Payne type, and compare them with the corresponding analytical and numerical ideal-MHD solutions. We disentangle the effects of…
▽ More
Numerical simulations with self-similar initial and boundary conditions provide a link between theoretical and numerical investigations of jet dynamics. We perform axisymmetric resistive magnetohydrodynamic (MHD) simulations for a generalised solution of the Blandford & Payne type, and compare them with the corresponding analytical and numerical ideal-MHD solutions. We disentangle the effects of the numerical and physical diffusivity. The latter could occur in outflows above an accretion disk, being transferred from the underlying disk into the disk corona by MHD turbulence (anomalous turbulent diffusivity), or as a result of ambipolar diffusion in partially ionized flows. We conclude that while the classical magnetic Reynolds number $R_{\rm m}$ measures the importance of resistive effects in the induction equation, a new introduced number, $\rbeta=(β/2)R_{\rm m}$ with $β$ the plasma beta, measures the importance of the resistive effects in the energy equation. Thus, in magnetised jets with $β<2$, when $\rbeta \la 1$ resistive effects are non-negligible and affect mostly the energy equation. The presented simulations indeed show that for a range of magnetic diffusivities corresponding to $\rbeta \ga 1$ the flow remains close to the ideal-MHD self-similar solution.
△ Less
Submitted 20 June, 2008;
originally announced June 2008.
-
MHD models and synthetic synchrotron maps for the jet of M87
Authors:
J. Gracia,
S. Bogovalov,
K. Tsinganos
Abstract:
We present a self-consistent MHD model for the jet of M87. The model consist of two distinct zones: an inner relativistic outflow, which we identify with the observed jet, and an outer cold disk-wind. While the former does not self-collimate efficiently due to its high effective inertia, the latter fulfills all the conditions for efficient collimation by the magneto-centrifugal mechanism. Given…
▽ More
We present a self-consistent MHD model for the jet of M87. The model consist of two distinct zones: an inner relativistic outflow, which we identify with the observed jet, and an outer cold disk-wind. While the former does not self-collimate efficiently due to its high effective inertia, the latter fulfills all the conditions for efficient collimation by the magneto-centrifugal mechanism. Given the right balance between the effective inertia of the inner flow and the collimation efficiency of the outer disk wind, the relativistic flow is magnetically confined into a well collimated beam for a wide range of parameters and matches the measurements of the opening angle of M87 over several orders of magnitude in spatial extent.
In the second part of this work, we present synthetic synchrotron emission maps for our MHD models. In principle the two-zone model can reproduce the morphological structure seen in radio observations, as central-peaked profiles across the jet close the the source, limb-bright further down the jet, and a bright knot close to the position of HST-1. However it is difficult to reconcile all features into a single set of parameters.
△ Less
Submitted 17 December, 2007;
originally announced December 2007.
-
Testing the inverse-Compton catastrophe scenario in the intra-day variable blazar S5 0716+71: II. A search for intra-day variability at millimetre wavelengths with the IRAM 30 m telescope
Authors:
I. Agudo,
T. P. Krichbaum,
H. Ungerechts,
A. Kraus,
A. Witzel,
E. Angelakis,
L. Fuhrmann,
U. Bach,
S. Britzen,
J. A. Zensus,
S. J. Wagner,
L. Ostorero,
E. Ferrero,
J. Gracia,
M. Grewing
Abstract:
We report on a densely time sampled polarimetric flux density monitoring of the BL Lac object S5 0716+71 at 86 GHz and 229 GHz with the IRAM 30 m telescope within a coordinated broad spectral band campaign, centred around an INTEGRAL observation during November 10 to 16, 2003. Our aim was to search for signatures of inverse-Compton "catastrophes". At 86 GHz, making use of a new calibration strat…
▽ More
We report on a densely time sampled polarimetric flux density monitoring of the BL Lac object S5 0716+71 at 86 GHz and 229 GHz with the IRAM 30 m telescope within a coordinated broad spectral band campaign, centred around an INTEGRAL observation during November 10 to 16, 2003. Our aim was to search for signatures of inverse-Compton "catastrophes". At 86 GHz, making use of a new calibration strategy, we reach a relative rms accuracy of the flux density measurements of 1.2%. At this frequency, S5 0716+71 showed no intra-day variability, but showed remarkable inter-day variability with a flux density increase of 34% during the first four observing days, which can not be explained by source extrinsic causes. The 86 GHz linear polarization fraction of S5 0716+71 was unusually large 15.0+-1.8%. Inter-day variability in linear polarization at 86 GHz, with significance level >~95%; sigma_P/<P>=15% and sigma_chi=6 deg., was also observed. From the emission variations at the synchrotron turnover frequency (~86 GHz) we compute an apparent brightness temperature T_B,app>1.4x10^14K at a redshift of 0.3, which exceeds by two orders of magnitude the inverse-Compton limit. A relativistic correction for T_B,app with a Doppler factor delta > 7.8 brings the observed brightness temperature down to the inverse Compton limit. A more accurate lower limit of delta > 14.0, is obtained from the comparison of the 86 GHz synchrotron flux density and the upper limits for the synchrotron self-Compton flux density obtained from the INTEGRAL observations. The relativistic beaming of the emission by this high Doppler factor explains the non-detection of "catastrophic" inverse-Compton avalanches by INTEGRAL.
△ Less
Submitted 2 June, 2006;
originally announced June 2006.
-
Hard TeV spectra of blazars and the constraints to the IR intergalactic background
Authors:
K. Katarzynski,
G. Ghisellini,
F. Tavecchio,
J. Gracia,
L. Maraschi
Abstract:
Recent gamma-ray observations of the blazar 1ES 1101-232 (redshift z=0.186) reveal that the unabsorbed TeV spectrum is hard, with spectral index $α\lesssim 0.5$ [$F(ν) \propto ν^{-α}$]. We show that simple one-zone synchrotron self-Compton model can explain such hard spectra if we assume a power law energy distribution of the emitting electrons with a relatively high minimum energy. In this case…
▽ More
Recent gamma-ray observations of the blazar 1ES 1101-232 (redshift z=0.186) reveal that the unabsorbed TeV spectrum is hard, with spectral index $α\lesssim 0.5$ [$F(ν) \propto ν^{-α}$]. We show that simple one-zone synchrotron self-Compton model can explain such hard spectra if we assume a power law energy distribution of the emitting electrons with a relatively high minimum energy. In this case the intrinsic TeV spectrum can be as hard as $F(ν)\propto ν^{1/3}$, while the predicted X-ray spectrum can still be much softer. The observations of 1ES 1101-232 can therefore be reconciled with relatively high intensities of the infrared background, even if some extreme background levels can indeed be excluded. We show that the other TeV sources (Mrk 421, Mrk 501 and PKS 2155-304) can be interpreted in the same framework, with a somewhat larger minimum energy.
△ Less
Submitted 1 March, 2006;
originally announced March 2006.
-
Testing the inverse-Compton catastrophe scenario in the intra-day variable blazar S5 0716+71. I. Simultaneous broadband observations during November 2003
Authors:
L. Ostorero,
S. J. Wagner,
J. Gracia,
E. Ferrero,
T. P. Krichbaum,
S. Britzen,
A. Witzel,
K. Nilsson,
M. Villata
Abstract:
Some intra-day variable, compact extra-galactic radio sources show brightness temperatures severely exceeding 10^{12} K, the limit set by catastrophic inverse-Compton (IC) cooling in sources of incoherent synchrotron radiation. The violation of the IC limit, possible under non-stationary conditions, would lead to IC avalanches in the soft-gamma-ray energy band during transient periods. For the f…
▽ More
Some intra-day variable, compact extra-galactic radio sources show brightness temperatures severely exceeding 10^{12} K, the limit set by catastrophic inverse-Compton (IC) cooling in sources of incoherent synchrotron radiation. The violation of the IC limit, possible under non-stationary conditions, would lead to IC avalanches in the soft-gamma-ray energy band during transient periods. For the first time, broadband signatures of possible IC catastrophes were searched for in S5 0716+71. A multifrequency observing campaign targetting S5 0716+71 was carried out in November 2003 under the framework of the European Network for the Investigation of Galactic nuclei through Multifrequency Analysis (ENIGMA) together with a campaign by the Whole Earth Blazar Telescope (WEBT), involving a pointing by the soft-gamma-ray satellite INTEGRAL, optical, near-infrared, sub-millimeter, millimeter, radio, and Very Long Baseline Array (VLBA) monitoring. S5 0716+71 was very bright at radio frequencies and in a rather faint optical state during the INTEGRAL pointing; significant inter-day and low intra-day variability was recorded in the radio regime, while typical fast variability features were observed in the optical band. No correlation was found between the radio and optical emission. The source was not detected by INTEGRAL, neither by the X-ray monitor JEM-X nor by the gamma-ray imager ISGRI, but upper limits to the source emission in the 3-200 keV energy band were estimated. A brightness temperature Tb>2.1x10^{14} K was inferred from the radio variability, but no corresponding signatures of IC avalanches were recorded at higher energies. The absence of IC-catastrophe signatures provides either a lower limit delta>8 to the Doppler factor affecting the radio emission or strong constraints for modelling of the Compton catastrophes in S5 0716+71.
△ Less
Submitted 10 February, 2006;
originally announced February 2006.
-
The cyclo-synchrotron process and particle heating through the absorption of photons
Authors:
K. Katarzynski,
G. Ghisellini,
R. Svensson,
J. Gracia
Abstract:
We propose a new approximation for the cyclo-synchrotron emissivity of a single electron. In the second part of this work, we discuss a simple application for our approximation, and investigate the heating of electrons through the self-absorption process. Finally, we investigate the self-absorbed part of the spectrum produced by a power-law population of electrons. In comparison to earlier appro…
▽ More
We propose a new approximation for the cyclo-synchrotron emissivity of a single electron. In the second part of this work, we discuss a simple application for our approximation, and investigate the heating of electrons through the self-absorption process. Finally, we investigate the self-absorbed part of the spectrum produced by a power-law population of electrons. In comparison to earlier approximations, our formula provides a few significant advantages. Integration of the emissivity over the whole frequency range, starting from the proper minimal emitting frequency, gives the correct cooling rate for any energy particle. Further, the spectrum of the emission is well approximated over the whole frequency range, even for relatively low particle energies (beta << 0.1), where most of the power is emitted in the first harmonic. In order to test our continuous approximation, we compare it with a recently derived approximation of the first ten harmonics. Finally, our formula connects relatively smooth to the synchrotron emission at beta=0.9. We show that the self-absorption is a very efficient heating mechanism for low energy particles, independent of the shape of the particle distribution responsible for the self-absorbed synchrotron emission. We find that the energy gains for low energy particles are always higher than energy losses by cyclo-synchrotron emission. We show also that the spectral index of the self-absorbed part of the spectrum at very low frequencies differs significantly from the well known standard relation I(nu) ~ nu^(5/2).
△ Less
Submitted 8 February, 2006;
originally announced February 2006.