Skip to main content

Showing 1–34 of 34 results for author: Pivarski, J

.
  1. arXiv:2404.18170  [pdf, other

    cs.PL physics.data-an

    Bridging Worlds: Achieving Language Interoperability between Julia and Python in Scientific Computing

    Authors: Ianna Osborne, Jim Pivarski, Jerry Ling

    Abstract: In the realm of scientific computing, both Julia and Python have established themselves as powerful tools. Within the context of High Energy Physics (HEP) data analysis, Python has been traditionally favored, yet there exists a compelling case for migrating legacy software to Julia. This article focuses on language interoperability, specifically exploring how Awkward Array data structures can brid… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 8 pages, 1 figure, ACAT2024 workshop

  2. arXiv:2404.02100  [pdf, other

    hep-ex

    Analysis Facilities White Paper

    Authors: D. Ciangottini, A. Forti, L. Heinrich, N. Skidmore, C. Alpigiani, M. Aly, D. Benjamin, B. Bockelman, L. Bryant, J. Catmore, M. D'Alfonso, A. Delgado Peris, C. Doglioni, G. Duckeck, P. Elmer, J. Eschle, M. Feickert, J. Frost, R. Gardner, V. Garonne, M. Giffels, J. Gooding, E. Gramstad, L. Gray, B. Hegner , et al. (41 additional authors not shown)

    Abstract: This white paper presents the current status of the R&D for Analysis Facilities (AFs) and attempts to summarize the views on the future direction of these facilities. These views have been collected through the High Energy Physics (HEP) Software Foundation's (HSF) Analysis Facilities forum, established in March 2022, the Analysis Ecosystems II workshop, that took place in May 2022, and the WLCG/HS… ▽ More

    Submitted 15 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  3. arXiv:2310.01461  [pdf, ps, other

    cs.PL

    Awkward Just-In-Time (JIT) Compilation: A Developer's Experience

    Authors: Ianna Osborne, Jim Pivarski, Ioana Ifrim, Angus Hollands, Henry Schreiner

    Abstract: Awkward Array is a library for performing NumPy-like computations on nested, variable-sized data, enabling array-oriented programming on arbitrary data structures in Python. However, imperative (procedural) solutions can sometimes be easier to write or faster to run. Performant imperative programming requires compilation; JIT-compilation makes it convenient to compile in an interactive Python envi… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 7 pages

  4. arXiv:2306.03675  [pdf, other

    hep-ph cs.PL hep-ex physics.comp-ph

    Potential of the Julia programming language for high energy physics computing

    Authors: J. Eschle, T. Gal, M. Giordano, P. Gras, B. Hegner, L. Heinrich, U. Hernandez Acosta, S. Kluth, J. Ling, P. Mato, M. Mikhasenko, A. Moreno Briceño, J. Pivarski, K. Samaras-Tsakiris, O. Schulz, G. . A. Stewart, J. Strube, V. Vassilev

    Abstract: Research in high energy physics (HEP) requires huge amounts of computing and storage, putting strong constraints on the code speed and resource usage. To meet these requirements, a compiled high-performance language is typically used; while for physicists, who focus on the application when develo** the code, better research productivity pleads for a high-level programming language. A popular app… ▽ More

    Submitted 6 October, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 32 pages, 5 figures, 4 tables

    ACM Class: J.2

    Journal ref: Computing. Comput Softw Big Sci 7, 10 (2023)

  5. arXiv:2303.02205  [pdf, other

    cs.MS hep-ex

    The Awkward World of Python and C++

    Authors: Manasvi Goyal, Ianna Osborne, Jim Pivarski

    Abstract: There are undeniable benefits of binding Python and C++ to take advantage of the best features of both languages. This is especially relevant to the HEP and other scientific communities that have invested heavily in the C++ frameworks and are rapidly moving their data analyses to Python. Version 2 of Awkward Array, a Scikit-HEP Python library, introduces a set of header-only C++ libraries that do… ▽ More

    Submitted 1 May, 2024; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 6 pages, 2 figures; submitted to ACAT 2022 proceedings

  6. arXiv:2303.02202  [pdf, other

    hep-ex cs.PF

    Using a DSL to read ROOT TTrees faster in Uproot

    Authors: Aryan Roy, Jim Pivarski

    Abstract: Uproot reads ROOT TTrees using pure Python. For numerical and (singly) jagged arrays, this is fast because a whole block of data can be interpreted as an array without modifying the data. For other cases, such as arrays of std::vector<std::vector<float>>, numerical data are interleaved with structure, and the only way to deserialize them is with a sequential algorithm. When written in Python, such… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 6 pages, 3 figures; submitted to ACAT 2022 proceedings

  7. arXiv:2302.09860  [pdf, other

    hep-ex astro-ph.IM cs.CL physics.data-an

    Awkward to RDataFrame and back

    Authors: Ianna Osborne, Jim Pivarski

    Abstract: Awkward Arrays and RDataFrame provide two very different ways of performing calculations at scale. By adding the ability to zero-copy convert between them, users get the best of both. It gives users a better flexibility in mixing different packages and languages in their analysis. In Awkward Array version 2, the ak.to_rdataframe function presents a view of an Awkward Array as an RDataFrame source.… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 5 pages, 3 figures

  8. Second Analysis Ecosystem Workshop Report

    Authors: Mohamed Aly, Jackson Burzynski, Bryan Cardwell, Daniel C. Craik, Tal van Daalen, Tomas Dado, Ayanabha Das, Antonio Delgado Peris, Caterina Doglioni, Peter Elmer, Engin Eren, Martin B. Eriksen, Jonas Eschle, Giulio Eulisse, Conor Fitzpatrick, José Flix Molina, Alessandra Forti, Ben Galewsky, Sean Gasiorowski, Aman Goel, Loukas Gouskos, Enrico Guiraud, Kanhaiya Gupta, Stephan Hageboeck, Allison Reinsvold Hall , et al. (44 additional authors not shown)

    Abstract: The second workshop on the HEP Analysis Ecosystem took place 23-25 May 2022 at IJCLab in Orsay, to look at progress and continuing challenges in scaling up HEP analysis to meet the needs of HL-LHC and DUNE, as well as the very pressing needs of LHC Run 3 analysis. The workshop was themed around six particular topics, which were felt to capture key questions, opportunities and challenges. Each to… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Report number: HSF-DOC-2022-02

  9. arXiv:2202.03911  [pdf, other

    hep-ex cs.PL physics.comp-ph physics.data-an

    An array-oriented Python interface for FastJet

    Authors: Aryan Roy, Jim Pivarski, Chad Wells Freer

    Abstract: Analysis on HEP data is an iterative process in which the results of one step often inform the next. In an exploratory analysis, it is common to perform one computation on a collection of events, then view the results (often with histograms) to decide what to try next. Awkward Array is a Scikit-HEP Python package that enables data analysis with array-at-a-time operations to implement cuts as slice… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

    Comments: 5 pages, 2 figures, submitted to ACAT 2021 proceedings

    Journal ref: J. Phys.: Conf. Ser. 2438 012011 (2023)

  10. arXiv:2202.02194  [pdf, other

    physics.data-an hep-ex

    HL-LHC Computing Review Stage 2, Common Software Projects: Data Science Tools for Analysis

    Authors: Jim Pivarski, Eduardo Rodrigues, Kevin Pedro, Oksana Shadura, Benjamin Krikler, Graeme A. Stewart

    Abstract: This paper was prepared by the HEP Software Foundation (HSF) PyHEP Working Group as input to the second phase of the LHCC review of High-Luminosity LHC (HL-LHC) computing, which took place in November, 2021. It describes the adoption of Python and data science tools in HEP, discusses the likelihood of future scenarios, and recommendations for action by the HEP community.

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: 25 pages, 7 figures; presented at https://indico.cern.ch/event/1058274/ (LHCC Review of HL-LHC Computing)

    Report number: FERMILAB-CONF-22-061-SCD

  11. arXiv:2106.15783  [pdf, other

    physics.soc-ph hep-ex

    Learning from the Pandemic: the Future of Meetings in HEP and Beyond

    Authors: Mark S. Neubauer, Todd Adams, Jennifer Adelman-McCarthy, Gabriele Benelli, Tulika Bose, David Britton, Pat Burchat, Joel Butler, Timothy A. Cartwright, Tomáš Davídek, Jacques Dumarchez, Peter Elmer, Matthew Feickert, Ben Galewsky, Mandeep Gill, Maciej Gladki, Aman Goel, Jonathan E. Guyer, Bo Jayatilaka, Brendan Kiburg, Benjamin Krikler, David Lange, Claire Lee, Nick Manganelli, Giovanni Marchiori , et al. (14 additional authors not shown)

    Abstract: The COVID-19 pandemic has by-and-large prevented in-person meetings since March 2020. While the increasing deployment of effective vaccines around the world is a very positive development, the timeline and pathway to "normality" is uncertain and the "new normal" we will settle into is anyone's guess. Particle physics, like many other scientific fields, has more than a year of experience in holding… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: A report from the "Virtual Meetings" IRIS-HEP Blueprint Workshop: https://indico.cern.ch/event/1026363/

  12. AwkwardForth: accelerating Uproot with an internal DSL

    Authors: Jim Pivarski, Ianna Osborne, Pratyush Das, David Lange, Peter Elmer

    Abstract: File formats for generic data structures, such as ROOT, Avro, and Parquet, pose a problem for deserialization: it must be fast, but its code depends on the type of the data structure, not known at compile-time. Just-in-time compilation can satisfy both constraints, but we propose a more portable solution: specialized virtual machines. AwkwardForth is a Forth-driven virtual machine for deserializin… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: 11 pages, 2 figures, submitted to the 25th International Conference on Computing in High Energy & Nuclear Physics

  13. arXiv:2011.01950  [pdf, ps, other

    hep-ph physics.comp-ph

    Analysis Description Languages for the LHC

    Authors: Sezen Sekmen, Philippe Gras, Lindsey Gray, Benjamin Krikler, Jim Pivarski, Harrison B. Prosper, Andrea Rizzi, Gokhan Unel, Gordon Watts

    Abstract: An analysis description language is a domain specific language capable of describing the contents of an LHC analysis in a standard and unambiguous way, independent of any computing framework. It is designed for use by anyone with an interest in, and knowledge of, LHC physics, i.e., experimentalists, phenomenologists and other enthusiasts. Adopting analysis description languages would bring numerou… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted contribution to the proceedings of The 8th Annual Conference on Large Hadron Collider Physics, LHCP2020, 25-30 May, 2020, online

    Journal ref: Proceedings of Science, PoS(LHCP2020)065

  14. arXiv:2008.13636  [pdf, ps, other

    physics.comp-ph hep-ex

    HL-LHC Computing Review: Common Tools and Community Software

    Authors: HEP Software Foundation, :, Thea Aarrestad, Simone Amoroso, Markus Julian Atkinson, Joshua Bendavid, Tommaso Boccali, Andrea Bocci, Andy Buckley, Matteo Cacciari, Paolo Calafiura, Philippe Canal, Federico Carminati, Taylor Childers, Vitaliano Ciulli, Gloria Corti, Davide Costanzo, Justin Gage Dezoort, Caterina Doglioni, Javier Mauricio Duarte, Agnieszka Dziurda, Peter Elmer, Markus Elsing, V. Daniel Elvira, Giulio Eulisse , et al. (85 additional authors not shown)

    Abstract: Common and community software packages, such as ROOT, Geant4 and event generators have been a key part of the LHC's success so far and continued development and optimisation will be critical in the future. The challenges are driven by an ambitious physics programme, notably the LHC accelerator upgrade to high-luminosity, HL-LHC, and the corresponding detector upgrades of ATLAS and CMS. In this doc… ▽ More

    Submitted 31 August, 2020; originally announced August 2020.

    Comments: 40 pages contribution to Snowmass 2021

    Report number: HSF-DOC-2020-01

  15. Coffea -- Columnar Object Framework For Effective Analysis

    Authors: Nicholas Smith, Lindsey Gray, Matteo Cremonesi, Bo Jayatilaka, Oliver Gutsche, Allison Hall, Kevin Pedro, Maria Acosta, Andrew Melo, Stefano Belforte, Jim Pivarski

    Abstract: The coffea framework provides a new approach to High-Energy Physics analysis, via columnar operations, that improves time-to-insight, scalability, portability, and reproducibility of analysis. It is implemented with the Python programming language, the scientific python package ecosystem, and commodity big data technologies. To achieve this suite of improvements across many use cases, coffea takes… ▽ More

    Submitted 6 August, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: As presented at CHEP 2019

    Journal ref: EPJ Web of Conferences 245, 06012 (2020)

  16. arXiv:2007.03577  [pdf, other

    physics.comp-ph hep-ex

    The Scikit HEP Project -- overview and prospects

    Authors: Eduardo Rodrigues, Benjamin Krikler, Chris Burr, Dmitri Smirnov, Hans Dembinski, Henry Schreiner, Jaydeep Nandi, Jim Pivarski, Matthew Feickert, Matthieu Marinangeli, Nick Smith, Pratyush Das

    Abstract: Scikit-HEP is a community-driven and community-oriented project with the goal of providing an ecosystem for particle physics data analysis in Python. Scikit-HEP is a toolset of approximately twenty packages and a few "affiliated" packages. It expands the typical Python data analysis tools for particle physicists. Each package focuses on a particular topic, and interacts with other packages in the… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 6 pages, 3 figures, Proceedings of the 24th International Conference on Computing in High Energy and Nuclear Physics (CHEP 2019), Adelaide, Australia, 4-8 November 2019

  17. Awkward Arrays in Python, C++, and Numba

    Authors: Jim Pivarski, Peter Elmer, David Lange

    Abstract: The Awkward Array library has been an important tool for physics analysis in Python since September 2018. However, some interface and implementation issues have been raised in Awkward Array's first year that argue for a reimplementation in C++ and Numba. We describe those issues, the new architecture, and present some examples of how the new interface will look to users. Of particular importance i… ▽ More

    Submitted 2 July, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: To be published in CHEP 2019 proceedings, EPJ Web of Conferences; post-review update

  18. Using Big Data Technologies for HEP Analysis

    Authors: Matteo Cremonesi, Claudio Bellini, Bianny Bian, Luca Canali, Vasileios Dimakopoulos, Peter Elmer, Ian Fisk, Maria Girone, Oliver Gutsche, Siew-Yan Hoh, Bo Jayatilaka, Viktor Khristenko, Andrea Luiselli, Andrew Melo, Evangelos Evangelos, Dominick Olivito, Jacopo Pazzini, Jim Pivarski, Alexey Svyatkovskiy, Marco Zanetti

    Abstract: The HEP community is approaching an era were the excellent performances of the particle accelerators in delivering collision at high rate will force the experiments to record a large amount of information. The growing size of the datasets could potentially become a limiting factor in the capability to produce scientific results timely and efficiently. Recently, new technologies and new approaches… ▽ More

    Submitted 21 January, 2019; originally announced January 2019.

  19. arXiv:1812.00761  [pdf, ps, other

    physics.comp-ph

    HEP Software Foundation Community White Paper Working Group -- Data Organization, Management and Access (DOMA)

    Authors: Dario Berzano, Riccardo Maria Bianchi, Ian Bird, Brian Bockelman, Simone Campana, Kaushik De, Dirk Duellmann, Peter Elmer, Robert Gardner, Vincent Garonne, Claudio Grandi, Oliver Gutsche, Andrew Hanushevsky, Burt Holzman, Bodhitha Jayatilaka, Ivo Jimenez, Michel Jouvin, Oliver Keeble, Alexei Klimentov, Valentin Kuznetsov, Eric Lancon, Mario Lassnig, Miron Livny, Carlos Maltzahn, Shawn McKee , et al. (13 additional authors not shown)

    Abstract: Without significant changes to data organization, management, and access (DOMA), HEP experiments will find scientific output limited by how fast data can be accessed and digested by computational resources. In this white paper we discuss challenges in DOMA that HEP experiments, such as the HL-LHC, will face as well as potential ways to address them. A research and development timeline to assess th… ▽ More

    Submitted 30 November, 2018; originally announced December 2018.

    Comments: arXiv admin note: text overlap with arXiv:1712.06592

    Report number: HSF-CWP-2017-04

  20. arXiv:1811.10309  [pdf, other

    physics.comp-ph hep-ex

    HEP Software Foundation Community White Paper Working Group --- Visualization

    Authors: Matthew Bellis, Riccardo Maria Bianchi, Sebastien Binet, Ciril Bohak, Benjamin Couturier, Hadrien Grasland, Oliver Gutsche, Sergey Linev, Alex Martyniuk, Thomas McCauley, Edward Moyse, Alja Mrak Tadel, Mark Neubauer, Jeremi Niedziela, Leo Piilonen, Jim Pivarski, Martin Ritter, Tai Sakuma, Matevz Tadel, Barthélémy von Haller, Ilija Vukotic, Ben Waugh

    Abstract: In modern High Energy Physics (HEP) experiments visualization of experimental data has a key role in many activities and tasks across the whole data chain: from detector development to monitoring, from event generation to reconstruction of physics objects, from detector simulation to data analysis, and all the way to outreach and education. In this paper, the definition, status, and evolution of d… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Report number: HSF-CWP-2017-15

  21. arXiv:1810.01191  [pdf, other

    physics.comp-ph

    HEP Software Foundation Community White Paper Working Group - Data and Software Preservation to Enable Reuse

    Authors: M. D. Hildreth, A. Boehnlein, K. Cranmer, S. Dallmeier, R. Gardner, T. Hacker, L. Heinrich, I. Jimenez, M. Kane, D. S. Katz, T. Malik, C. Maltzahn, M. Neubauer, S. Neubert, Jim Pivarski, E. Sexton, J. Shiers, T. Simko, S. Smith, D. South, A. Verbytskyi, G. Watts, J. Wozniak

    Abstract: In this chapter of the High Energy Physics Software Foundation Community Whitepaper, we discuss the current state of infrastructure, best practices, and ongoing developments in the area of data and software preservation in high energy physics. A re-framing of the motivation for preservation to enable re-use is presented. A series of research and development goals in software and other cyberinfrast… ▽ More

    Submitted 2 October, 2018; originally announced October 2018.

    Report number: HSF-CWP-2017-06

  22. arXiv:1807.02876  [pdf, other

    physics.comp-ph cs.LG hep-ex stat.ML

    Machine Learning in High Energy Physics Community White Paper

    Authors: Kim Albertsson, Piero Altoe, Dustin Anderson, John Anderson, Michael Andrews, Juan Pedro Araque Espinosa, Adam Aurisano, Laurent Basara, Adrian Bevan, Wahid Bhimji, Daniele Bonacorsi, Bjorn Burkle, Paolo Calafiura, Mario Campanelli, Louis Capps, Federico Carminati, Stefano Carrazza, Yi-fan Chen, Taylor Childers, Yann Coadou, Elias Coniavitis, Kyle Cranmer, Claire David, Douglas Davis, Andrea De Simone , et al. (103 additional authors not shown)

    Abstract: Machine learning has been applied to several problems in particle physics research, beginning with applications to high-level physics analysis in the 1990s and 2000s, followed by an explosion of applications in particle and event identification and reconstruction in the 2010s. In this document we discuss promising future research and development areas for machine learning in particle physics. We d… ▽ More

    Submitted 16 May, 2019; v1 submitted 8 July, 2018; originally announced July 2018.

    Comments: Editors: Sergei Gleyzer, Paul Seyfert and Steven Schramm

  23. arXiv:1804.03983  [pdf, other

    physics.comp-ph hep-ex

    HEP Software Foundation Community White Paper Working Group - Data Analysis and Interpretation

    Authors: Lothar Bauerdick, Riccardo Maria Bianchi, Brian Bockelman, Nuno Castro, Kyle Cranmer, Peter Elmer, Robert Gardner, Maria Girone, Oliver Gutsche, Benedikt Hegner, José M. Hernández, Bodhitha Jayatilaka, David Lange, Mark S. Neubauer, Daniel S. Katz, Lukasz Kreczko, James Letts, Shawn McKee, Christoph Paus, Kevin Pedro, Jim Pivarski, Martin Ritter, Eduardo Rodrigues, Tai Sakuma, Elizabeth Sexton-Kennedy , et al. (4 additional authors not shown)

    Abstract: At the heart of experimental high energy physics (HEP) is the development of facilities and instrumentation that provide sensitivity to new phenomena. Our understanding of nature at its most fundamental level is advanced through the analysis and interpretation of data from sophisticated detectors in HEP experiments. The goal of data analysis systems is to realize the maximum possible scientific po… ▽ More

    Submitted 9 April, 2018; originally announced April 2018.

    Comments: arXiv admin note: text overlap with arXiv:1712.06592

    Report number: HSF-CWP-2017-05

  24. arXiv:1712.06982  [pdf, other

    physics.comp-ph hep-ex

    A Roadmap for HEP Software and Computing R&D for the 2020s

    Authors: Johannes Albrecht, Antonio Augusto Alves Jr, Guilherme Amadio, Giuseppe Andronico, Nguyen Anh-Ky, Laurent Aphecetche, John Apostolakis, Makoto Asai, Luca Atzori, Marian Babik, Giuseppe Bagliesi, Marilena Bandieramonte, Sunanda Banerjee, Martin Barisits, Lothar A. T. Bauerdick, Stefano Belforte, Douglas Benjamin, Catrin Bernius, Wahid Bhimji, Riccardo Maria Bianchi, Ian Bird, Catherine Biscarat, Jakob Blomer, Kenneth Bloom, Tommaso Boccali , et al. (285 additional authors not shown)

    Abstract: Particle physics has an ambitious and broad experimental programme for the coming decades. This programme requires large investments in detector hardware, either to build new facilities and experiments, or to upgrade existing ones. Similarly, it requires commensurate investment in the R&D of software to acquire, manage, process, and analyse the shear amounts of data to be recorded. In planning for… ▽ More

    Submitted 19 December, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

    Report number: HSF-CWP-2017-01

    Journal ref: Comput Softw Big Sci (2019) 3, 7

  25. arXiv:1711.02659  [pdf, other

    cs.DC

    Optimizing ROOT IO For Analysis

    Authors: Brian Bockelman, Zhe Zhang, Jim Pivarski

    Abstract: The ROOT I/O (RIO) subsystem is foundational to most HEP experiments - it provides a file format, a set of APIs/semantics, and a reference implementation in C++. It is often found at the base of an experiment's framework and is used to serialize the experiment's data; in the case of an LHC experiment, this may be hundreds of petabytes of files! Individual physicists will further use RIO to perform… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: 18th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT)

  26. arXiv:1711.01229  [pdf, other

    cs.DC

    Toward real-time data query systems in HEP

    Authors: Jim Pivarski, David Lange, Thanat Jatuphattharachat

    Abstract: Exploratory data analysis tools must respond quickly to a user's questions, so that the answer to one question (e.g. a visualized histogram or fit) can influence the next. In some SQL-based query systems used in industry, even very large (petabyte) datasets can be summarized on a human timescale (seconds), employing techniques such as columnar data representation, caching, indexing, and code gener… ▽ More

    Submitted 8 November, 2017; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: 6 pages, 2 figures, proceedings for ACAT 2017

  27. arXiv:1711.00375  [pdf, other

    cs.DC

    CMS Analysis and Data Reduction with Apache Spark

    Authors: Oliver Gutsche, Luca Canali, Illia Cremer, Matteo Cremonesi, Peter Elmer, Ian Fisk, Maria Girone, Bo Jayatilaka, Jim Kowalkowski, Viktor Khristenko, Evangelos Motesnitsalis, Jim Pivarski, Saba Sehrish, Kacper Surdy, Alexey Svyatkovskiy

    Abstract: Experimental Particle Physics has been at the forefront of analyzing the world's largest datasets for decades. The HEP community was among the first to develop suitable software and computing tools for this task. In recent times, new toolkits and systems for distributed data processing, collectively called "Big Data" technologies have emerged from industry and open source projects to support the a… ▽ More

    Submitted 31 October, 2017; originally announced November 2017.

    Comments: Proceedings for 18th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2017). arXiv admin note: text overlap with arXiv:1703.04171

  28. arXiv:1708.08319  [pdf, other

    cs.PL cs.DB cs.IR

    Fast Access to Columnar, Hierarchically Nested Data via Code Transformation

    Authors: Jim Pivarski, Peter Elmer, Brian Bockelman, Zhe Zhang

    Abstract: Big Data query systems represent data in a columnar format for fast, selective access, and in some cases (e.g. Apache Drill), perform calculations directly on the columnar data without row materialization, avoiding runtime costs. However, many analysis procedures cannot be easily or efficiently expressed as SQL. In High Energy Physics, the majority of data processing requires nested loops with c… ▽ More

    Submitted 3 November, 2017; v1 submitted 20 August, 2017; originally announced August 2017.

    Comments: 10 pages, 2 figures, submitted to IEEE Big Data

  29. Big Data in HEP: A comprehensive use case study

    Authors: Oliver Gutsche, Matteo Cremonesi, Peter Elmer, Bo Jayatilaka, Jim Kowalkowski, Jim Pivarski, Saba Sehrish, Cristina Mantilla Surez, Alexey Svyatkovskiy, Nhan Tran

    Abstract: Experimental Particle Physics has been at the forefront of analyzing the worlds largest datasets for decades. The HEP community was the first to develop suitable software and computing tools for this task. In recent times, new toolkits and systems collectively called Big Data technologies have emerged to support the analysis of Petabyte and Exabyte datasets in industry. While the principles of dat… ▽ More

    Submitted 12 March, 2017; originally announced March 2017.

    Comments: Proceedings for 22nd International Conference on Computing in High Energy and Nuclear Physics (CHEP 2016)

  30. The Matsu Wheel: A Cloud-based Framework for Efficient Analysis and Reanalysis of Earth Satellite Imagery

    Authors: Maria T Patterson, Nikolas Anderson, Collin Bennett, Jacob Bruggemann, Robert Grossman, Matthew Handy, Vuong Ly, Dan Mandl, Shane Pederson, Jim Pivarski, Ray Powell, Jonathan Spring, Walt Wells

    Abstract: Project Matsu is a collaboration between the Open Commons Consortium and NASA focused on develo** open source technology for the cloud-based processing of Earth satellite imagery. A particular focus is the development of applications for detecting fires and floods to help support natural disaster detection and relief. Project Matsu has developed an open source cloud-based infrastructure to proce… ▽ More

    Submitted 22 February, 2016; originally announced February 2016.

    Comments: 10 pages, accepted for presentation to IEEE BigDataService 2016

  31. LHC discovery potential of the lightest NMSSM Higgs in the h1 -> a1 a1 -> 4 muons channel

    Authors: Alexander Belyaev, Jim Pivarski, Alexei Safonov, Sergey Senkin, Aysen Tatarinov

    Abstract: We explore the potential of the Large Hadron Collider to observe the h1 -> a1 a1 -> 4 muons signal from the lightest scalar Higgs boson (h1) decaying into the two lightest pseudoscalar Higgs bosons (a1), followed by their decays into four muons in the Next-to-Minimal Supersymmetric Standard Model (NMSSM). The signature under study applies to the region of the NMSSM parameter space in which m_a1… ▽ More

    Submitted 3 March, 2010; v1 submitted 9 February, 2010; originally announced February 2010.

    Comments: 12 pages, 11 figures; added more discussion of collider constraints

    Journal ref: Phys.Rev.D81:075021,2010

  32. arXiv:hep-ex/0604026  [pdf, ps, other

    hep-ex

    A High-Precision Measurement of the Di-Electron Widths of the Upsilon(1S), Upsilon(2S), and Upsilon(3S) Mesons at CLEO-III

    Authors: J. Pivarski

    Abstract: The di-electron width of an Upsilon meson is the decay rate of the Upsilon into an electron-positron pair, expressed in units of energy. We measure the di-electron width of the Upsilon(1S) meson to be 1.354 +- 0.004 +- 0.020 keV (the first uncertainty is statistical and the second is systematic), the di-electron width of the Upsilon(2S) to be 0.619 +- 0.004 +- 0.010 keV and that of the Upsilon(3… ▽ More

    Submitted 17 August, 2007; v1 submitted 12 April, 2006; originally announced April 2006.

    Comments: 160 pages, 73 figures, Ph.D. dissertation, also available through http://www.lepp.cornell.edu/public/THESIS/2006/ and http://hdl.handle.net/1813/2672, see hep-ex/0512056; corrected numerical values in abstract

    Report number: Cornell University Laboratory of Elementary Particle Physics THESIS 06-1

  33. arXiv:hep-ph/0507214  [pdf, ps, other

    hep-ph

    Testing Cosmology at the ILC

    Authors: A. Birkedal, K. Matchev, J. Alexander, K. Ecklund, L. Fields, R. C. Gray, D. Hertz, C. D. Jones, J. Pivarski

    Abstract: We investigate the capabilities for the LHC and the ILC to perform measurements of new physics parameters relevant for the calculation of the cosmological relic abundance of the lightest neutralino in supersymmetry. Specifically, we delineate the range of values for the cold dark matter relic abundance $Ω_χ h^2$, which will be consistent with the expected precision measurements at the LHC, and,… ▽ More

    Submitted 18 July, 2005; originally announced July 2005.

    Comments: 6 pages, 4 figures. Based on talks given by A. Birkedal at Linear Collider workshops in 2004 and 2005

    Report number: UFIFT-HEP-05-11

    Journal ref: ECONF C050318:0708,2005

  34. arXiv:hep-ex/0507008  [pdf, ps, other

    hep-ex

    Measuring Mass and Cross Section Parameters at a Focus Point Region

    Authors: R. Gray, J. Alexander, K. M. Ecklund, L. Fields, D. Hertz, C. D. Jones, J. Pivarski, A. Birkedal, K. Matchev

    Abstract: The purpose of this study is to determine the experimental uncertainties in measuring mass and cross section parameters of SUSY particles at a 500 GeV Linear Collider. In this study SUSY is a point in the focus point region of mSUGRA parameter space that is compatible with WMAP constraints on dark matter relic density. At this study point the masses of the squarks and sleptons are very heavy, an… ▽ More

    Submitted 1 July, 2005; originally announced July 2005.

    Comments: Invited talk at 2005 International Linear Collider Workshop, Stanford Ca (LCWS05) 6 pages, LaTex, 2 eps figures

    Report number: CLNS 05/1924

    Journal ref: ECONF C050318:0711,2005