Showing 1–11 of 11 results for author: Franchetti, F

  1. arXiv:2307.07931  [pdf, other]

    cs.MS

    ProtoX: A First Look

    Authors: Het Mankad, Sanil Rao, Brian Van Straalen, Phillip Colella, Franz Franchetti

    Abstract: We present a first look at ProtoX, a code generation framework for stencil and pointwise operations that occur frequently in the numerical solution of partial differential equations. ProtoX has Proto as its library frontend and SPIRAL as the backend. Proto is a C++ based domain specific library which optimizes the algorithms used to compute the numerical solution of partial differential equations.…

    Submitted 15 July, 2023; originally announced July 2023.

  2. arXiv:2304.05237  [pdf]

    cs.CR cs.AR cs.DC cs.PF

    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation

    Authors: David Bruce Cousins, Yuriy Polyakov, Ahmad Al Badawi, Matthew French, Andrew Schmidt, Ajey Jacob, Benedict Reynwar, Kellie Canida, Akhilesh Jaiswal, Clynn Mathew, Homer Gamil, Negar Neda, Deepraj Soni, Michail Maniatakos, Brandon Reagen, Naifeng Zhang, Franz Franchetti, Patrick Brinich, Jeremy Johnson, Patrick Broderick, Mike Franusich, Bo Zhang, Zeming Cheng, Massoud Pedram

    Abstract: Secure computation is of critical importance to not only the DoD, but across financial institutions, healthcare, and anywhere personally identifiable information (PII) is accessed. Traditional security techniques require data to be decrypted before performing any computation. When processed on untrusted systems the decrypted data is vulnerable to attacks to extract the sensitive information. To ad…

    Submitted 18 April, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: 6 pages, 5 figures and 2 tables

  3. arXiv:2303.17118  [pdf, other]

    cs.AR cs.CR

    RPU: The Ring Processing Unit

    Authors: Deepraj Soni, Negar Neda, Naifeng Zhang, Benedict Reynwar, Homer Gamil, Benjamin Heyman, Mohammed Nabeel, Ahmad Al Badawi, Yuriy Polyakov, Kellie Canida, Massoud Pedram, Michail Maniatakos, David Bruce Cousins, Franz Franchetti, Matthew French, Andrew Schmidt, Brandon Reagen

    Abstract: Ring-Learning-with-Errors (RLWE) has emerged as the foundation of many important techniques for improving security and privacy, including homomorphic encryption and post-quantum cryptography. While promising, these techniques have received limited use due to their extreme overheads of running on general-purpose machines. In this paper, we present a novel vector Instruction Set Architecture (ISA) a…

    Submitted 13 April, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

  4. arXiv:2012.15647  [pdf, other]

    q-bio.QM

    Indirect Measurement of Hepatic Drug Clearance by Fitting Dynamical Models

    Authors: Yoko Franchetti, Thomas D. Nolin, Franz Franchetti

    Abstract: We present an indirect signal processing-based measurement method for biological quantities in humans that cannot be directly measured. We develop the method by focusing on estimating hepatic enzyme and drug transporter activity through breath-biopsy samples clinically obtained via the erythromycin breath test (EBT): a small dose of radio-labeled drug is injected and the subsequent content of radi…

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: This preprint is based on Chapter 2 of the PhD dissertation of Y Franchetti. The dissertation thesis is available at http://d-scholarship.pitt.edu/id/eprint/39885

    Report number: University of Pittsburgh ETD 39885

  5. arXiv:1904.10119  [pdf, other]

    cs.MS cs.DC

    A Flexible Framework for Parallel Multi-Dimensional DFTs

    Authors: Doru Thom Popovici, Martin D. Schatz, Franz Franchetti, Tze Meng Low

    Abstract: Multi-dimensional discrete Fourier transforms (DFT) are typically decomposed into multiple 1D transforms. Hence, parallel implementations of any multi-dimensional DFT focus on parallelizing within or across the 1D DFT. Existing DFT packages exploit the inherent parallelism across the 1D DFTs and offer rigid frameworks that cannot be extended to incorporate both forms of parallelism and various da…

    Submitted 22 December, 2019; v1 submitted 22 April, 2019; originally announced April 2019.

  6. Fast and accurate object detection in high resolution 4K and 8K video using GPUs

    Authors: Vít Růžička, Franz Franchetti

    Abstract: Machine learning has celebrated a lot of achievements on computer vision tasks such as object detection, but the traditionally used models work with relatively low resolution images. The resolution of recording devices is gradually increasing and there is a rising need for new methods of processing high resolution data. We propose an attention pipeline method which uses two-stage evaluation of ea…

    Submitted 24 October, 2018; originally announced October 2018.

    Comments: 6 pages, 12 figures, Best Paper Finalist at IEEE High Performance Extreme Computing Conference (HPEC) 2018; copyright 2018 IEEE; (DOI will be filled when known)

    Journal ref: 2018 IEEE High Performance extreme Computing Conference (HPEC)

  7. arXiv:1809.10170  [pdf, other]

    cs.LG cs.DC stat.ML

    High Performance Zero-Memory Overhead Direct Convolutions

    Authors: Jiyuan Zhang, Franz Franchetti, Tze Meng Low

    Abstract: The computation of convolution layers in deep neural networks typically relies on high performance routines that trade space for time by using additional memory (either for packing purposes or required as part of the algorithm) to improve performance. The problems with such an approach are two-fold. First, these routines incur additional memory overhead which reduces the overall size of the network…

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: The 35th International Conference on Machine Learning (ICML 2018), camera ready

  8. arXiv:1611.08035  [pdf, other]

    cs.MS

    Automating the Last-Mile for High Performance Dense Linear Algebra

    Authors: Richard Michael Veras, Tze Meng Low, Tyler Michael Smith, Robert van de Geijn, Franz Franchetti

    Abstract: High performance dense linear algebra (DLA) libraries often rely on a general matrix multiply (Gemm) kernel that is implemented using assembly or with vector intrinsics. In particular, the real-valued Gemm kernels provide the overwhelming fraction of performance for the complex-valued Gemm kernels, along with the entire level-3 BLAS and many of the real and complex LAPACK routines. Thus, achieving…

    Submitted 28 April, 2017; v1 submitted 23 November, 2016; originally announced November 2016.

  9. arXiv:1606.05790  [pdf, other]

    cs.MS astro-ph.IM cs.DC cs.DS

    Mathematical Foundations of the GraphBLAS

    Authors: Jeremy Kepner, Peter Aaltonen, David Bader, Aydın Buluc, Franz Franchetti, John Gilbert, Dylan Hutchison, Manoj Kumar, Andrew Lumsdaine, Henning Meyerhenke, Scott McMillan, Jose Moreira, John D. Owens, Carl Yang, Marcin Zalewski, Timothy Mattson

    Abstract: The GraphBLAS standard (GraphBlas.org) is being developed to bring the potential of matrix based graph algorithms to the broadest possible audience. Mathematically the GraphBLAS defines a core set of matrix-based graph operations that can be used to implement a wide class of graph algorithms in a wide range of programming environments. This paper provides an introduction to the mathematics of th…

    Submitted 13 July, 2016; v1 submitted 18 June, 2016; originally announced June 2016.

    Comments: 9 pages; 11 figures; accepted to IEEE High Performance Extreme Computing (HPEC) conference 2016. arXiv admin note: text overlap with arXiv:1504.01039

  10. arXiv:1201.2934  [pdf, ps, other]

    math.OC cs.DS cs.IT

    An Information-Theoretic Approach to PMU Placement in Electric Power Systems

    Authors: Qiao Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti, Marija D. Ilic

    Abstract: This paper presents an information-theoretic approach to address the phasor measurement unit (PMU) placement problem in electric power systems. Different from the conventional 'topological observability' based approaches, this paper advocates a much more refined, information-theoretic criterion, namely the mutual information (MI) between the PMU measurements and the power system states. The propos…

    Submitted 13 January, 2012; originally announced January 2012.

    Comments: 8 pages, 7 figures

  11. arXiv:1106.5063  [pdf, ps, other]

    math.OC eess.SY

    On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems

    Authors: Qiao Li, Tao Cui, Rohit Negi, Franz Franchetti, Marija D. Ilic

    Abstract: The concept of plug-in electric vehicles (PEV) is gaining increasing popularity in recent years, due to the growing societal awareness of reducing greenhouse gas (GHG) emissions, and gaining independence from foreign oil or petroleum. Large-scale deployment of PEVs currently faces many challenges. One particular concern is that the PEV charging can potentially cause significant impacts on the exist…

    Submitted 18 November, 2011; v1 submitted 24 June, 2011; originally announced June 2011.

    Comments: 12 pages, 9 figures