-
Report on 2023 CyberTraining PI Meeting, 26-27 September 2023
Authors:
Geoffrey Fox,
Mary P Thomas,
Sajal Bhatia,
Marisa Brazil,
Nicole M Gasparini,
Venkatesh Mohan Merwade,
Henry J. Neeman,
Jeff Carver,
Henri Casanova,
Vipin Chaudhary,
Dirk Colbry,
Lonnie Crosby,
Prasun Dewan,
Jessica Eisma,
Ahmed Irfan,
Kate Kaehey,
Qianqian Liu,
Zhen Ni,
Sushil Prasad,
Apan Qasem,
Erik Saule,
Prabha Sundaravadivel,
Karen Tomko
Abstract:
This document describes a two-day meeting held for the Principal Investigators (PIs) of NSF CyberTraining grants. The report covers invited talks, panels, and six breakout sessions. The meeting involved over 80 PIs and NSF program managers (PMs). The lessons recorded in detail in the report are a wealth of information that could help current and future PIs, as well as NSF PMs, understand the future directions suggested by the PI community. The meeting was held simultaneously with that of the PIs of the NSF Cyberinfrastructure for Sustained Scientific Innovation (CSSI) program. This co-location led to two joint sessions: one with NSF speakers and the other on broader impact. Further, the joint poster and refreshment sessions benefited from the interactions between CSSI and CyberTraining PIs.
Submitted 28 December, 2023; v1 submitted 20 December, 2023;
originally announced December 2023.
-
GPU-accelerated Parallel Solutions to the Quadratic Assignment Problem
Authors:
Clara Novoa,
Apan Qasem
Abstract:
The Quadratic Assignment Problem (QAP) is an important combinatorial optimization problem with applications in many areas, including logistics and manufacturing. QAP is known to be NP-hard, so finding acceptable solutions for most real-world data sets requires sophisticated heuristics.
In this paper, we present GPU-accelerated implementations of a 2opt and a tabu search algorithm for solving the QAP. For both algorithms, we extract parallelism at multiple levels and implement novel code optimization techniques that fully utilize the GPU hardware. In a series of experiments on the well-known QAPLIB data sets, our solutions run, on average, an order of magnitude faster than previous implementations and deliver up to a factor of 63 speedup on specific instances. The quality of the solutions produced by our implementations of 2opt and tabu is within 1.03% and 0.15% of the best known values, respectively. The experimental results also provide key insight into the performance characteristics of accelerated QAP solvers. In particular, the results reveal that both algorithmic choice and the shape of the input data sets are key factors in finding efficient implementations.
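For orientation, the following is a minimal sequential sketch of the QAP objective and a 2opt-style pairwise-exchange local search. It is not the authors' GPU implementation: the function names `qap_cost` and `two_opt`, the first-improvement acceptance rule, and the full cost recomputation after every swap are assumptions chosen for brevity.

```python
from itertools import combinations


def qap_cost(flow, dist, perm):
    """Total cost when facility i is placed at location perm[i]."""
    n = len(perm)
    return sum(
        flow[i][j] * dist[perm[i]][perm[j]]
        for i in range(n)
        for j in range(n)
    )


def two_opt(flow, dist, perm):
    """Pairwise-exchange local search (hypothetical sketch).

    Repeatedly tries swapping the locations of two facilities and keeps
    a swap only if it strictly lowers the cost. Stops when a full pass
    over all pairs yields no improvement.
    """
    perm = list(perm)
    best = qap_cost(flow, dist, perm)
    improved = True
    while improved:
        improved = False
        for i, j in combinations(range(len(perm)), 2):
            perm[i], perm[j] = perm[j], perm[i]
            cost = qap_cost(flow, dist, perm)
            if cost < best:
                best = cost
                improved = True
            else:
                # Undo a swap that did not improve the cost.
                perm[i], perm[j] = perm[j], perm[i]
    return perm, best
```

Each candidate swap can be evaluated independently of the others, which is one natural source of the parallelism that a GPU version could exploit. A practical solver would also update the cost incrementally rather than recomputing it from scratch.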
Submitted 20 July, 2023;
originally announced July 2023.
-
The Main Role of Thermal Annealing in Controlling the Structural and Optical Properties of ITO Thin Film Layer
Authors:
Moustafa Ahmed,
Ahmed Bakry,
Ammar Qasem,
Hamed Dalir
Abstract:
Here we report a study of the electronic and optical properties of the technologically relevant material indium tin oxide (ITO) as a function of thermal annealing. In this work, ITO powder has been prepared utilizing solid-state reaction methods. An electron beam gun has been used to prepare an ITO film (325 nm). The ITO window layer has been investigated at various temperatures. The effects of absolute temperature on the structural, optical, and electrical properties of the prepared ITO thin film layer are investigated. The energy band type corresponding to the orbital transitions has been determined, and the energies of the orbital transitions have been calculated in the Tauc region, HOMO/LUMO gap, and charge transfer gap. In addition, the exciton and Urbach energies have been computed. It has been found that these energies increase with increasing annealing temperature, except for the Urbach energy, which behaves differently. The thin-film quality coefficient, surface resistance, and thermal emission, as well as the angle of refraction as a function of wavelength, have been determined.
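For reference, the Tauc and Urbach quantities named in the abstract are conventionally defined by the textbook relations below. This is a sketch in standard notation, not taken from the paper: the symbols $\alpha$ (absorption coefficient), $h\nu$ (photon energy), $E_g$ (optical gap), $E_U$ (Urbach energy), and the constants $B$ and $\alpha_0$ are assumptions, and the paper may use a different form or notation.

```latex
% Tauc relation: the exponent r identifies the transition type
% (r = 1/2 for direct allowed, r = 2 for indirect allowed transitions).
(\alpha h\nu)^{1/r} = B\,(h\nu - E_g)

% Urbach tail: exponential absorption edge below the gap,
% with E_U measuring the width of the band-tail states.
\alpha(h\nu) = \alpha_0 \exp\!\left(\frac{h\nu}{E_U}\right)
```

In this convention, $E_g$ is read from the intercept of the linear part of a plot of $(\alpha h\nu)^{1/r}$ against $h\nu$, and $E_U$ from the inverse slope of $\ln\alpha$ against $h\nu$.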
Submitted 25 December, 2020;
originally announced April 2021.
-
A Tool for Automatically Suggesting Source-Code Optimizations for Complex GPU Kernels
Authors:
Saeed Taheri,
Apan Qasem,
Martin Burtscher
Abstract:
Future computing systems, from handhelds to supercomputers, will undoubtedly be more parallel and heterogeneous than today's systems to provide more performance and energy efficiency. Thus, GPUs are increasingly being used to accelerate general-purpose applications, including applications with data-dependent, irregular control flow and memory access patterns. However, the growing complexity, exposed memory hierarchy, incoherence, heterogeneity, and parallelism will make accelerator-based systems progressively more difficult to program. In the foreseeable future, the vast majority of programmers will no longer be able to extract additional performance or energy savings from next-generation systems because the programming will be too difficult. Automatic performance analysis and optimization recommendation tools have the potential to avert this situation. They embody expert knowledge and make it available to software developers when needed. In this paper, we describe and evaluate such a tool.
Submitted 17 October, 2019;
originally announced October 2019.