Showing 1–10 of 10 results for author: Rathnayake, T

  1. arXiv:2309.16381

    cs.DC cs.PF

    Nek5000/RS Performance on Advanced GPU Architectures

    Authors: Misun Min, Yu-Hsiang Lan, Paul Fischer, Thilina Rathnayake, John Holmen

    Abstract: We demonstrate NekRS performance results on various advanced GPU architectures. NekRS is a GPU-accelerated version of Nek5000 that targets high performance on exascale platforms. It is being developed in DOE's Center of Efficient Exascale Discretizations, which is one of the co-design centers under the Exascale Computing Project. In this paper, we consider Frontier, Crusher, Spock, Polaris, Perlmu…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 24 pages, 13 figures, 2 tables

    MSC Class: 35-04 ACM Class: D.0; F.2; G.2; G.4; I.6; J.2

  2. arXiv:2304.11868

    cs.CV

    A Benchmark for Cycling Close Pass Near Miss Event Detection from Video Streams

    Authors: Mingjie Li, Tharindu Rathnayake, Ben Beck, Lingheng Meng, Zijue Chen, Akansel Cosgun, Xiaojun Chang, Dana Kulić

    Abstract: Cycling is a healthy and sustainable mode of transport. However, interactions with motor vehicles remain a key barrier to increased cycling participation. The ability to detect potentially dangerous interactions from on-bike sensing could provide important information to riders and policy makers. Thus, automated detection of conflict between cyclists and drivers has attracted researchers from both…

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: 15 pages, 19 figures and 2 tables

  3. arXiv:2110.01716

    physics.comp-ph

    Highly Optimized Full-Core Reactor Simulations on Summit

    Authors: Paul Fischer, Elia Merzari, Misun Min, Stefan Kerkemeier, Yu-Hsiang Lan, Malachi Phillips, Thilina Rathnayake, April Novak, Derek Gaston, Noel Chalmers, Tim Warburton

    Abstract: Nek5000/RS is a highly-performant open-source spectral element code for simulation of incompressible and low-Mach fluid flow, heat transfer, and combustion with a particular focus on turbulent flows in complex domains. It is based on high-order discretizations that realize the same (or lower) cost per gridpoint as traditional low-order methods. State-of-the-art multilevel preconditioners, efficien…

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: 9 pages, 3 figures, 6 tables

    MSC Class: 35-04 ACM Class: D.0; F.2; G.2; G.4; I.6; J.2

  4. arXiv:2109.05072

    cs.DC cs.MS math.NA

    GPU Algorithms for Efficient Exascale Discretizations

    Authors: Ahmad Abdelfattah, Valeria Barra, Natalie Beams, Ryan Bleile, Jed Brown, Jean-Sylvain Camier, Robert Carson, Noel Chalmers, Veselin Dobrev, Yohann Dudouit, Paul Fischer, Ali Karakus, Stefan Kerkemeier, Tzanio Kolev, Yu-Hsiang Lan, Elia Merzari, Misun Min, Malachi Phillips, Thilina Rathnayake, Robert Rieben, Thomas Stitt, Ananias Tomboulides, Stanimire Tomov, Vladimir Tomov, Arturo Vargas, et al. (2 additional authors not shown)

    Abstract: In this paper we describe the research and development activities in the Center for Efficient Exascale Discretization within the US Exascale Computing Project, targeting state-of-the-art high-order finite-element algorithms for high-order applications on GPU-accelerated platforms. We discuss the GPU developments in several components of the CEED software stack, including the libCEED, MAGMA, MFEM,…

    Submitted 10 September, 2021; originally announced September 2021.

  5. arXiv:2109.04996

    cs.DC cs.MS math.NA

    Efficient Exascale Discretizations: High-Order Finite Element Methods

    Authors: Tzanio Kolev, Paul Fischer, Misun Min, Jack Dongarra, Jed Brown, Veselin Dobrev, Tim Warburton, Stanimire Tomov, Mark S. Shephard, Ahmad Abdelfattah, Valeria Barra, Natalie Beams, Jean-Sylvain Camier, Noel Chalmers, Yohann Dudouit, Ali Karakus, Ian Karlin, Stefan Kerkemeier, Yu-Hsiang Lan, David Medina, Elia Merzari, Aleksandr Obabko, Will Pazner, Thilina Rathnayake, Cameron W. Smith, et al. (5 additional authors not shown)

    Abstract: Efficient exploitation of exascale architectures requires rethinking of the numerical algorithms used in many large-scale applications. These architectures favor algorithms that expose ultra fine-grain parallelism and maximize the ratio of floating point operations to energy intensive data movement. One of the few viable approaches to achieve high efficiency in the area of PDE discretizations on u…

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: 22 pages, 18 figures

  6. arXiv:2104.05829

    cs.PF cs.DC

    NekRS, a GPU-Accelerated Spectral Element Navier-Stokes Solver

    Authors: Paul Fischer, Stefan Kerkemeier, Misun Min, Yu-Hsiang Lan, Malachi Phillips, Thilina Rathnayake, Elia Merzari, Ananias Tomboulides, Ali Karakus, Noel Chalmers, Tim Warburton

    Abstract: The development of NekRS, a GPU-oriented thermal-fluids simulation code based on the spectral element method (SEM) is described. For performance portability, the code is based on the open concurrent compute abstraction and leverages scalable developments in the SEM code Nek5000 and in libParanumal, which is a library of high-performance kernels for high-order discretizations and PDE-based miniapps…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 14 pages, 8 figures

    MSC Class: 35-04 ACM Class: D.0; F.2; G.2; G.4; I.6

  7. arXiv:2004.06722

    cs.PF cs.DC

    Scalability of High-Performance PDE Solvers

    Authors: Paul Fischer, Misun Min, Thilina Rathnayake, Som Dutta, Tzanio Kolev, Veselin Dobrev, Jean-Sylvain Camier, Martin Kronbichler, Tim Warburton, Kasia Swirydowicz, Jed Brown

    Abstract: Performance tests and analyses are critical to effective HPC software development and are central components in the design and implementation of computational algorithms for achieving faster simulations on existing and future computing architectures for large-scale application problems. In this paper, we explore performance and space-time trade-offs for important compute-intensive kernels of large…

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: 25 pages, 54 figures

    MSC Class: 35-04 ACM Class: D.0; F.2; G.2; G.4; I.6

  8. arXiv:1702.08641

    eess.SY

    Statistical Information Fusion for Multiple-View Sensor Data in Multi-Object Tracking

    Authors: Xiaoying Wang, Reza Hoseinnezhad, Amirali K. Gostar, Tharindu Rathnayake, Benlian Xu, Alireza Bab-Hadiashar

    Abstract: This paper presents a novel statistical information fusion method to integrate multiple-view sensor data in multi-object tracking applications. The proposed method overcomes the drawbacks of the commonly used Generalized Covariance Intersection method, which considers constant weights allocated for sensors. Our method is based on enhancing the Generalized Covariance Intersection with adaptive weig…

    Submitted 27 February, 2017; originally announced February 2017.

    Comments: 28 pages, 7 figures

  9. Multi-Sensor Control for Multi-Object Bayes Filters

    Authors: Xiaoying Wang, Reza Hoseinnezhad, Amirali K. Gostar, Tharindu Rathnayake, Benlian Xu, Alireza Bab-Hadiashar

    Abstract: Sensor management in multi-object stochastic systems is a theoretically and computationally challenging problem. This paper presents a novel approach to the multi-target multi-sensor control problem within the partially observed Markov decision process (POMDP) framework. We model the multi-object state as a labeled multi-Bernoulli random finite set (RFS), and use the labeled multi-Bernoulli filter…

    Submitted 20 February, 2017; originally announced February 2017.

    Comments: 21 pages, 8 figures

    Journal ref: Signal Processing Volume 142, January 2018, Pages 260-270

  10. arXiv:1604.05966

    cs.CV

    Labeled Multi-Bernoulli Tracking for Industrial Mobile Platform Safety

    Authors: Tharindu Rathnayake, Reza Hoseinnezhad, Ruwan Tennakoon, Alireza Bab-Hadiashar

    Abstract: This paper presents a track-before-detect labeled multi-Bernoulli filter tailored for industrial mobile platform safety applications. We derive two application-specific separable likelihood functions that capture the geometric shape and colour information of human targets who are wearing a high-visibility vest. These likelihoods are then used in a labeled multi-Bernoulli filter with a novel two s…

    Submitted 10 May, 2016; v1 submitted 20 April, 2016; originally announced April 2016.

    Comments: The conference to which this paper was submitted rejected it. We are therefore enhancing the content of the paper and will submit it to another conference or journal.