The A4 project: physics data processing using the Google protocol buffer library
Authors:
Johannes Ebke,
Peter Waller
Abstract:
In this paper, we present the High Energy Physics data format, processing toolset and analysis library a4, providing fast I/O of structured data using the Google protocol buffer library. The overall goal of a4 is to provide physicists with tools to work efficiently with billions of events, providing not only high speeds, but also automatic metadata handling, a set of UNIX-like tools to operate on a4 files, and powerful and fast histogramming capabilities. At present, a4 is an experimental project, but it has already been used by the authors in preparing physics publications. We give an overview of the individual modules of a4, provide examples of use, and supply a set of basic benchmarks. We compare a4 read performance with the common practice of storing unstructured data in ROOT trees. For the common case of storing a variable number of floating-point numbers per event, speedups in read speed of up to a factor of six are observed.
Submitted 8 August, 2012;
originally announced August 2012.
Electroweak di-boson production in ATLAS
Authors:
Johannes Ebke
Abstract:
In the LHC era, it is crucial to gain a good understanding of the electroweak sector of the Standard Model (SM). Determining the cross-sections of electroweak processes is therefore of great importance, both as prerequisites to Higgs-Boson searches as well as measurements in their own right. For this conference, several new measurements with the data from proton-proton collisions at \sqrt{s} = 7 TeV taken in 2010 and 2011 with the ATLAS detector have been prepared: A measurement of the isolated di-photon cross-section using 37 pb^{-1} collected in 2010 and a measurement of the WZ production cross-section using 205 pb^{-1} collected in 2011. In addition and for a complete view of the electroweak sector, the already published measurements of the Wγ and Zγ cross-sections and the WW cross-sections are presented as well.
Submitted 3 January, 2012;
originally announced January 2012.
Ganga: a tool for computational-task management and easy access to Grid resources
Authors:
J. T. Mościcki,
F. Brochu,
J. Ebke,
U. Egede,
J. Elmsheuser,
K. Harrison,
R. W. L. Jones,
H. C. Lee,
D. Liko,
A. Maier,
A. Muraru,
G. N. Patrick,
K. Pajchel,
W. Reece,
B. H. Samset,
M. W. Slater,
A. Soroko,
C. L. Tan,
D. C. Vanderster,
M. Williams
Abstract:
In this paper, we present the computational task-management tool Ganga, which allows for the specification, submission, bookkeeping and post-processing of computational tasks on a wide set of distributed resources. Ganga has been developed to solve a problem increasingly common in scientific projects, which is that researchers must regularly switch between different processing systems, each with its own command set, to complete their computational tasks. Ganga provides a homogeneous environment for processing data on heterogeneous resources. We give examples from High Energy Physics, demonstrating how an analysis can be developed on a local system and then transparently moved to a Grid system for processing of all available data. Ganga has an API that can be used via an interactive interface, in scripts, or through a GUI. Specific knowledge about types of tasks or computational resources is provided at run-time through a plugin system, making new developments easy to integrate. We give an overview of the Ganga architecture, give examples of current use, and demonstrate how Ganga can be used in many different areas of science.
Submitted 9 June, 2009; v1 submitted 16 February, 2009;
originally announced February 2009.