-
Performance Optimization using Multimodal Modeling and Heterogeneous GNN
Authors:
Akash Dutta,
Jordi Alcaraz,
Ali TehraniJamsaz,
Eduardo Cesar,
Anna Sikora,
Ali Jannesari
Abstract:
Growing heterogeneity and configurability in HPC architectures has made auto-tuning applications and runtime parameters on these systems very complex. Users are presented with a multitude of options to configure parameters. In addition to application specific solutions, a common approach is to use general purpose search strategies, which often might not identify the best configurations or their ti…
▽ More
Growing heterogeneity and configurability in HPC architectures has made auto-tuning applications and runtime parameters on these systems very complex. Users are presented with a multitude of options to configure parameters. In addition to application specific solutions, a common approach is to use general purpose search strategies, which often might not identify the best configurations or their time to convergence is a significant barrier. There is, thus, a need for a general purpose and efficient tuning approach that can be easily scaled and adapted to various tuning tasks. We propose a technique for tuning parallel code regions that is general enough to be adapted to multiple tasks. In this paper, we analyze IR-based programming models to make task-specific performance optimizations. To this end, we propose the Multimodal Graph Neural Network and Autoencoder (MGA) tuner, a multimodal deep learning based approach that adapts Heterogeneous Graph Neural Networks and Denoizing Autoencoders for modeling IR-based code representations that serve as separate modalities. This approach is used as part of our pipeline to model a syntax, semantics, and structure-aware IR-based code representation for tuning parallel code regions/kernels. We extensively experiment on OpenMP and OpenCL code regions/kernels obtained from PolyBench, Rodinia, STREAM, DataRaceBench, AMD SDK, NPB, NVIDIA SDK, Parboil, SHOC, and LULESH benchmarks. We apply our multimodal learning techniques to the tasks of i) optimizing the number of threads, scheduling policy and chunk size in OpenMP loops and, ii) identifying the best device for heterogeneous device map** of OpenCL kernels. Our experiments show that this multimodal learning based approach outperforms the state-of-the-art in all experiments.
△ Less
Submitted 27 April, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
CosmoHub: Interactive exploration and distribution of astronomical data on Hadoop
Authors:
Pau Tallada,
Jorge Carretero,
Jordi Casals,
Carles Acosta-Silva,
Santiago Serrano,
Marc Caubet,
Francisco J. Castander,
Eduardo César,
Martín Crocce,
Manuel Delfino,
Martin Eriksen,
Pablo Fosalba,
Enrique Gaztañaga,
Gonzalo Merino,
Christian Neissner,
Nadia Tonello
Abstract:
We present CosmoHub (https://cosmohub.pic.es), a web application based on Hadoop to perform interactive exploration and distribution of massive cosmological datasets. Recent Cosmology seeks to unveil the nature of both dark matter and dark energy map** the large-scale structure of the Universe, through the analysis of massive amounts of astronomical data, progressively increasing during the last…
▽ More
We present CosmoHub (https://cosmohub.pic.es), a web application based on Hadoop to perform interactive exploration and distribution of massive cosmological datasets. Recent Cosmology seeks to unveil the nature of both dark matter and dark energy map** the large-scale structure of the Universe, through the analysis of massive amounts of astronomical data, progressively increasing during the last (and future) decades with the digitization and automation of the experimental techniques.
CosmoHub, hosted and developed at the Port d'Informació Científica (PIC), provides support to a worldwide community of scientists, without requiring the end user to know any Structured Query Language (SQL). It is serving data of several large international collaborations such as the Euclid space mission, the Dark Energy Survey (DES), the Physics of the Accelerating Universe Survey (PAUS) and the Marenostrum Institut de Ciències de l'Espai (MICE) numerical simulations. While originally developed as a PostgreSQL relational database web frontend, this work describes the current version of CosmoHub, built on top of Apache Hive, which facilitates scalable reading, writing and managing huge datasets. As CosmoHub's datasets are seldomly modified, Hive it is a better fit.
Over 60 TiB of catalogued information and $50 \times 10^9$ astronomical objects can be interactively explored using an integrated visualization tool which includes 1D histogram and 2D heatmap plots. In our current implementation, online exploration of datasets of $10^9$ objects can be done in a timescale of tens of seconds. Users can also download customized subsets of data in standard formats generated in few minutes.
△ Less
Submitted 10 March, 2020; v1 submitted 4 March, 2020;
originally announced March 2020.
-
Extra phase noise from thermal fluctuations in nonlinear optical crystals
Authors:
J. E. S. Cesar,
A. S. Coelho,
K. N. Cassemiro,
A. S. Villar,
M. Lassen,
P. Nussenzveig,
M. Martinelli
Abstract:
We show theoretically and experimentally that scattered light by thermal phonons inside a second-order nonlinear crystal is the source of additional phase noise observed in Optical Parametric Oscillators. This additional phase noise reduces the quantum correlations and has hitherto hindered the direct production of multipartite entanglement in a single nonlinear optical system. We cooled the non…
▽ More
We show theoretically and experimentally that scattered light by thermal phonons inside a second-order nonlinear crystal is the source of additional phase noise observed in Optical Parametric Oscillators. This additional phase noise reduces the quantum correlations and has hitherto hindered the direct production of multipartite entanglement in a single nonlinear optical system. We cooled the nonlinear crystal and observed a reduction of the extra noise. Our treatment of this noise can be successfully applied to different systems in the literature.
△ Less
Submitted 22 June, 2009;
originally announced June 2009.