Showing 1–22 of 22 results for author: Karp, M

  1. arXiv:2405.05640  [pdf, other]

    cs.DC cs.MS physics.flu-dyn

    Experience and Analysis of Scalable High-Fidelity Computational Fluid Dynamics on Modular Supercomputing Architectures

    Authors: Martin Karp, Estela Suarez, Jan H. Meinke, Måns I. Andersson, Philipp Schlatter, Stefano Markidis, Niclas Jansson

    Abstract: The never-ending computational demand from simulations of turbulence makes computational fluid dynamics (CFD) a prime application use case for current and future exascale systems. High-order finite element methods, such as the spectral element method, have been gaining traction as they offer high performance on both multicore CPUs and modern GPU-based accelerators. In this work, we assess how high…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 13 pages, 5 figures, 3 tables, preprint

    ACM Class: J.2; C.1.4; G.4

  2. arXiv:2405.05639  [pdf, other]

    cs.DC

    Supercomputers as a Continous Medium

    Authors: Martin Karp, Niclas Jansson, Philipp Schlatter, Stefano Markidis

    Abstract: As supercomputers' complexity has grown, the traditional boundaries between processor, memory, network, and accelerators have blurred, making a homogeneous computer model, in which the overall computer system is modeled as a continuous medium with homogeneously distributed computational power, memory, and data movement transfer capabilities, an intriguing and powerful abstraction. By applying a ho…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 10 pages, 8 figures, 3 tables

    ACM Class: F.1; F.2; I.6

  3. arXiv:2306.08522  [pdf, other]

    cs.RO

    Challenges of Indoor SLAM: A multi-modal multi-floor dataset for SLAM evaluation

    Authors: Pushyami Kaveti, Aniket Gupta, Dennis Giaya, Madeline Karp, Colin Keil, Jagatpreet Nir, Zhiyong Zhang, Hanumant Singh

    Abstract: Robustness in Simultaneous Localization and Mapping (SLAM) remains one of the key challenges for the real-world deployment of autonomous systems. SLAM research has seen significant progress in the last two and a half decades, yet many state-of-the-art (SOTA) algorithms still struggle to perform reliably in real-world environments. There is a general consensus in the research community that we need…

    Submitted 14 June, 2023; originally announced June 2023.

  4. Anisotropic Satellite Galaxy Quenching: A Unique Signature of Energetic Feedback by Supermassive Black Holes?

    Authors: Juliana S. M. Karp, Johannes U. Lange, Risa H. Wechsler

    Abstract: The quenched fraction of satellite galaxies is aligned with the orientation of the halo's central galaxy, such that on average, satellites form stars at a lower rate along the major axis of the central. This effect, called anisotropic satellite galaxy quenching (ASGQ), has been found in observational data and cosmological simulations. Analyzing the IllustrisTNG simulation, Martín-Navarro et al. (2…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 figures; Submitted to ApJL; Comments welcome!

  5. arXiv:2207.07098  [pdf, other]

    cs.MS cs.CE cs.DC physics.flu-dyn

    Large-Scale Direct Numerical Simulations of Turbulence Using GPUs and Modern Fortran

    Authors: Martin Karp, Daniele Massaro, Niclas Jansson, Alistair Hart, Jacob Wahlgren, Philipp Schlatter, Stefano Markidis

    Abstract: We present our approach to making direct numerical simulations of turbulence with applications in sustainable shipping. We use modern Fortran and the spectral element method to leverage and scale on supercomputers powered by the Nvidia A100 and the recent AMD Instinct MI250X GPUs, while still providing support for user software developed in Fortran. We demonstrate the efficiency of our approach by…

    Submitted 23 June, 2022; originally announced July 2022.

    Comments: 13 pages, 7 figures

    ACM Class: G.4; J.2

  6. arXiv:2204.12526  [pdf, other]

    q-bio.QM cs.LG stat.ML

    Identification of feasible pathway information for c-di-GMP binding proteins in cellulose production

    Authors: Syeda Sakira Hassan, Rahul Mangayil, Tommi Aho, Olli Yli-Harja, Matti Karp

    Abstract: In this paper, we utilize a machine learning approach to identify the significant pathways for c-di-GMP signaling proteins. The dataset involves gene counts from 12 pathways and 5 essential c-di-GMP binding domains for 1024 bacterial genomes. Two novel approaches, Least absolute shrinkage and selection operator (Lasso) and Random forests, have been applied for analyzing and modeling the dataset. B…

    Submitted 26 April, 2022; originally announced April 2022.

    Journal ref: EMBEC & NBC 2017. EMBEC NBC 2017 2017. IFMBE Proceedings, vol 65. Springer, Singapore

  7. arXiv:2109.03592  [pdf, ps, other]

    cs.DC

    Strong Scaling of OpenACC enabled Nek5000 on several GPU based HPC systems

    Authors: Jonathan Vincent, Jing Gong, Martin Karp, Adam Peplinski, Niclas Jansson, Artur Podobas, Andreas Jocksch, Jie Yao, Fazle Hussain, Stefano Markidis, Matts Karlsson, Dirk Pleiter, Erwin Laure, Philipp Schlatter

    Abstract: We present new results on the strong parallel scaling for the OpenACC-accelerated implementation of the high-order spectral element fluid dynamics solver Nek5000. The test case considered consists of a direct numerical simulation of fully-developed turbulent flow in a straight pipe, at two different Reynolds numbers $Re_τ=360$ and $Re_τ=550$, based on friction velocity and pipe radius. The strong…

    Submitted 4 November, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 9 pages, 8 figures. Submitted to HPC-Asia 2022 conference, updated to address reviewers comments

    ACM Class: G.4; J.2; C.1

  8. A High-Fidelity Flow Solver for Unstructured Meshes on Field-Programmable Gate Arrays

    Authors: Martin Karp, Artur Podobas, Tobias Kenter, Niclas Jansson, Christian Plessl, Philipp Schlatter, Stefano Markidis

    Abstract: The impending termination of Moore's law motivates the search for new forms of computing to continue the performance scaling we have grown accustomed to. Among the many emerging Post-Moore computing candidates, perhaps none is as salient as the Field-Programmable Gate Array (FPGA), which offers the means of specializing and customizing the hardware to the computation at hand. In this work, we de…

    Submitted 2 November, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: 12 pages, 3 figures, 3 tables, Accepted to HPC Asia 2022

    ACM Class: G.4; J.2; C.1

  9. arXiv:2107.01243  [pdf]

    cs.MS

    Neko: A Modern, Portable, and Scalable Framework for High-Fidelity Computational Fluid Dynamics

    Authors: Niclas Jansson, Martin Karp, Artur Podobas, Stefano Markidis, Philipp Schlatter

    Abstract: Recent trends and advancement in including more diverse and heterogeneous hardware in High-Performance Computing is challenging software developers in their pursuit for good performance and numerical stability. The well-known maxim "software outlives hardware" may no longer necessarily hold true, and developers are today forced to re-factor their codebases to leverage these powerful new systems. C…

    Submitted 2 July, 2021; originally announced July 2021.

  10. arXiv:2010.13463  [pdf]

    cs.DC

    High-Performance Spectral Element Methods on Field-Programmable Gate Arrays

    Authors: Martin Karp, Artur Podobas, Niclas Jansson, Tobias Kenter, Christian Plessl, Philipp Schlatter, Stefano Markidis

    Abstract: Improvements in computer systems have historically relied on two well-known observations: Moore's law and Dennard's scaling. Today, both these observations are ending, forcing computer users, researchers, and practitioners to abandon the general-purpose architectures' comforts in favor of emerging post-Moore systems. Among the most salient of these post-Moore systems is the Field-Programmable Gate…

    Submitted 4 May, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: 10 pages, IEEE International Parallel and Distributed Processing Symposium 2021 (IPDPS'21)

    ACM Class: G.4; J.2; C.1

  11. arXiv:2010.10571  [pdf, other]

    physics.flu-dyn

    Shock-induced heating and transition to turbulence in a hypersonic boundary layer

    Authors: Lin Fu, Michael Karp, Sanjeeb T. Bose, Parviz Moin, Javier Urzay

    Abstract: The interaction between an incident shock wave and a Mach-6 undisturbed hypersonic laminar boundary layer over a cold wall is addressed using direct numerical simulations (DNS) and wall-modeled large-eddy simulations (WMLES) at different angles of incidence. At sufficiently high shock-incidence angles, the boundary layer transitions to turbulence via breakdown of near-wall streaks shortly downstre…

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: 48 pages, 36 figures

    MSC Class: 76K05; 76N06; 76N20; 76F06; 76F40; 76F50; 76F65; 76F02

  12. arXiv:2005.13425  [pdf]

    cs.DC

    Optimization of Tensor-product Operations in Nekbone on GPUs

    Authors: Martin Karp, Niclas Jansson, Artur Podobas, Philipp Schlatter, Stefano Markidis

    Abstract: In the CFD solver Nek5000, the computation is dominated by the evaluation of small tensor operations. Nekbone is a proxy app for Nek5000 and has previously been ported to GPUs with a mixed OpenACC and CUDA approach. In this work, we continue this effort and optimize the main tensor-product operation in Nekbone further. Our optimization is done in CUDA and uses a different, 2D, thread structure to…

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: 4 pages, 4 figures

    ACM Class: G.4; J.2

  13. arXiv:2005.05303  [pdf, other]

    physics.flu-dyn

    Cause-and-effect of linear mechanisms sustaining wall turbulence

    Authors: Adrián Lozano-Durán, Navid C. Constantinou, Marios-Andreas Nikolaidis, Michael Karp

    Abstract: Despite the nonlinear nature of turbulence, there is evidence that part of the energy-transfer mechanisms sustaining wall turbulence can be ascribed to linear processes. The different scenarios stem from linear stability theory and comprise exponential instabilities, neutral modes, transient growth from non-normal operators, and parametric instabilities from temporal mean-flow variations, among ot…

    Submitted 7 October, 2020; v1 submitted 9 May, 2020; originally announced May 2020.

    Journal ref: J. Fluid Mech., vol. 914, A8, 2021

  14. Alternative physics to understand wall turbulence: Navier-Stokes equations with modified linear dynamics

    Authors: Adrián Lozano-Durán, Marios-Andreas Nikolaidis, Navid C. Constantinou, Michael Karp

    Abstract: Despite the nonlinear nature of wall turbulence, there is evidence that the energy-injection mechanisms sustaining wall turbulence can be ascribed to linear processes. The different scenarios stem from linear stability theory and comprise exponential instabilities from mean-flow inflection points, transient growth from non-normal operators, and parametric instabilities from temporal mean-flow vari…

    Submitted 13 December, 2019; originally announced December 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1909.05490

  15. arXiv:1909.05490  [pdf, ps, other]

    physics.flu-dyn physics.comp-ph

    Wall turbulence without modal instability of the streaks

    Authors: Adrián Lozano-Durán, Marios-Andreas Nikolaidis, Navid C. Constantinou, Michael Karp

    Abstract: Despite the nonlinear nature of wall turbulence, there is evidence that the mechanism underlying the energy transfer from the mean flow to the turbulent fluctuations can be ascribed to linear processes. One of the most acclaimed linear instabilities for this energy transfer is the modal growth of perturbations with respect to the streamwise-averaged flow (or streaks). Here, we devise a numerical e…

    Submitted 12 September, 2019; originally announced September 2019.

  16. arXiv:1902.01914  [pdf, ps, other]

    physics.flu-dyn

    Wall turbulence with constrained energy extraction from mean flow

    Authors: Adrián Lozano-Durán, Michael Karp, Navid C. Constantinou

    Abstract: We study the mechanism of energy injection from the mean flow to the fluctuating velocity necessary to maintain wall turbulence. This process is believed to be correctly represented by the linearized Navier--Stokes equations, and three potential linear mechanisms have been considered, namely, modal instability of the streamwise mean cross-flow $U(y,z,t)$, non-modal transient growth, and non-modal…

    Submitted 13 February, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: Corrected typo in author name

    Journal ref: Center for Turbulence Research Annual Research Briefs 2018

  17. arXiv:1807.06701  [pdf, other]

    cs.DC cs.DS

    Massively Parallel Symmetry Breaking on Sparse Graphs: MIS and Maximal Matching

    Authors: Soheil Behnezhad, Mahsa Derakhshan, MohammadTaghi Hajiaghayi, Richard M. Karp

    Abstract: The success of modern parallel paradigms such as MapReduce, Hadoop, or Spark, has attracted a significant attention to the Massively Parallel Computation (MPC) model over the past few years, especially on graph problems. In this work, we consider symmetry breaking problems of maximal independent set (MIS) and maximal matching (MM), which are among the most intensively studied problems in distribut…

    Submitted 6 May, 2019; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: A merger of this paper and the independent and concurrent paper [arxiv:1807.05374] appeared at PODC 2019

  18. arXiv:1111.5572  [pdf, other]

    cs.DS q-bio.GN

    Faster and More Accurate Sequence Alignment with SNAP

    Authors: Matei Zaharia, William J. Bolosky, Kristal Curtis, Armando Fox, David Patterson, Scott Shenker, Ion Stoica, Richard M. Karp, Taylor Sittler

    Abstract: We present the Scalable Nucleotide Alignment Program (SNAP), a new short and long read aligner that is both more accurate (i.e., aligns more reads with fewer errors) and 10-100x faster than state-of-the-art tools such as BWA. Unlike recent aligners based on the Burrows-Wheeler transform, SNAP uses a simple hash index of short seed sequences from the genome, similar to BLAST's. However, SNAP greatl…

    Submitted 23 November, 2011; originally announced November 2011.

  19. arXiv:1009.0909  [pdf, other]

    cs.DS

    Comparing Pedigree Graphs

    Authors: Bonnie Kirkpatrick, Yakir Reshef, Hilary Finucane, Haitao Jiang, Binhai Zhu, Richard M. Karp

    Abstract: Pedigree graphs, or family trees, are typically constructed by an expensive process of examining genealogical records to determine which pairs of individuals are parent and child. New methods to automate this process take as input genetic data from a set of extant individuals and reconstruct ancestral individuals. There is a great need to evaluate the quality of these methods by comparing the esti…

    Submitted 18 October, 2011; v1 submitted 5 September, 2010; originally announced September 2010.

  20. arXiv:0707.1532  [pdf, ps, other]

    cs.DS cs.DM

    Sorting and Selection in Posets

    Authors: Constantinos Daskalakis, Richard M. Karp, Elchanan Mossel, Samantha Riesenfeld, Elad Verbin

    Abstract: Classical problems of sorting and searching assume an underlying linear ordering of the objects being compared. In this paper, we study a more general setting, in which some pairs of objects are incomparable. This generalization is relevant in applications related to rankings in sports, college admissions, or conference submissions. It also has potential applications in biology, such as comparin…

    Submitted 10 July, 2007; originally announced July 2007.

    Comments: 24 pages

    ACM Class: F.2.2; G.2.1; G.2.2

  21. arXiv:q-bio/0702001  [pdf, ps, other]

    q-bio.MN

    Comparing Protein Interaction Networks via a Graph Match-and-Split Algorithm

    Authors: Manikandan Narayanan, Richard M. Karp

    Abstract: We present a method that compares the protein interaction networks of two species to detect functionally similar (conserved) protein modules between them. The method is based on an algorithm we developed to identify matching subgraphs between two graphs. Unlike previous network comparison methods, our algorithm has provable guarantees on correctness and efficiency. Our algorithm framework also a…

    Submitted 1 February, 2007; originally announced February 2007.

    Comments: 15 pages, 4 figures, 6 tables. Supplemental text available at http://www.cs.berkeley.edu/~nmani/mas-supplement.pdf

  22. Probabilistic Analysis of Linear Programming Decoding

    Authors: Constantinos Daskalakis, Alexandros G. Dimakis, Richard M. Karp, Martin J. Wainwright

    Abstract: We initiate the probabilistic analysis of linear programming (LP) decoding of low-density parity-check (LDPC) codes. Specifically, we show that for a random LDPC code ensemble, the linear programming decoder of Feldman et al. succeeds in correcting a constant fraction of errors with high probability. The fraction of correctable errors guaranteed by our analysis surpasses previous non-asymptotic…

    Submitted 10 March, 2008; v1 submitted 2 February, 2007; originally announced February 2007.

    Comments: To appear, IEEE Transactions on Information Theory, (replaces shorter version that appeared in SODA'07)