Skip to main content

Showing 1–15 of 15 results for author: Pouchet, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.12304  [pdf, other

    cs.AR

    Automatic Hardware Pragma Insertion in High-Level Synthesis: A Non-Linear Programming Approach

    Authors: Stéphane Pouget, Louis-Noël Pouchet, Jason Cong

    Abstract: High-Level Synthesis enables the rapid prototy** of hardware accelerators, by combining a high-level description of the functional behavior of a kernel with a set of micro-architecture optimizations as inputs. Such optimizations can be described by inserting pragmas e.g. pipelining and replication of units, or even higher level transformations for HLS such as automatic data caching using the AMD… ▽ More

    Submitted 30 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2405.03058  [pdf, other

    cs.SE cs.PL

    Enhancing High-Level Synthesis with Automated Pragma Insertion and Code Transformation Framework

    Authors: Stéphane Pouget, Louis-Noël Pouchet, Jason Cong

    Abstract: High-level synthesis, source-to-source compilers, and various Design Space Exploration techniques for pragma insertion have significantly improved the Quality of Results of generated designs. These tools offer benefits such as reduced development time and enhanced performance. However, achieving high-quality results often requires additional manual code transformations and tiling selections, which… ▽ More

    Submitted 21 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  3. Self-Supervised Learning to Prove Equivalence Between Straight-Line Programs via Rewrite Rules

    Authors: Steve Kommrusch, Martin Monperrus, Louis-Noël Pouchet

    Abstract: We target the problem of automatically synthesizing proofs of semantic equivalence between two programs made of sequences of statements. We represent programs using abstract syntax trees (AST), where a given set of semantics-preserving rewrite rules can be applied on a specific AST pattern to generate a transformed and semantically equivalent program. In our system, two programs are equivalent if… ▽ More

    Submitted 8 July, 2023; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: 30 pages including appendix

    Journal ref: IEEE Transactions on Software Engineering, 2023

  4. arXiv:2106.02452  [pdf, other

    cs.PL cs.LG

    Proving Equivalence Between Complex Expressions Using Graph-to-Sequence Neural Models

    Authors: Steve Kommrusch, Théo Barollet, Louis-Noël Pouchet

    Abstract: We target the problem of provably computing the equivalence between two complex expression trees. To this end, we formalize the problem of equivalence between two such programs as finding a set of semantics-preserving rewrite rules from one into the other, such that after the rewrite the two programs are structurally identical, and therefore trivially equivalent.We then develop a graph-to-sequence… ▽ More

    Submitted 8 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 10 pages (24 including references and appendices), 8 figures, 17 tables. arXiv admin note: substantial text overlap with arXiv:2002.06799. Updated to include funding acknowledgement

  5. arXiv:2012.11473  [pdf, other

    cs.AR cs.PF

    PALMED: Throughput Characterization for Superscalar Architectures -- Extended Version

    Authors: Nicolas Derumigny, Fabian Gruber, Théophile Bastian, Guillaume Iooss, Christophe Guillon, Louis-Noël Pouchet, Fabrice Rastello

    Abstract: In a super-scalar architecture, the scheduler dynamically assigns micro-operations ($μ$OPs) to execution ports. The port map** of an architecture describes how an instruction decomposes into $μ$OPs and lists for each $μ$OP the set of ports it can be mapped to. It is used by compilers and performance debugging tools to characterize the performance throughput of a sequence of instructions repeated… ▽ More

    Submitted 18 January, 2022; v1 submitted 21 December, 2020; originally announced December 2020.

  6. arXiv:2011.05422  [pdf, other

    cs.DC cs.AR

    Coherence Traffic in Manycore Processors with Opaque Distributed Directories

    Authors: Steve Kommrusch, Marcos Horro, Louis-Noël Pouchet, Gabriel Rodríguez, Juan Touriño

    Abstract: Manycore processors feature a high number of general-purpose cores designed to work in a multithreaded fashion. Recent manycore processors are kept coherent using scalable distributed directories. A paramount example is the Intel Mesh interconnect, which consists of a network-on-chip interconnecting "tiles", each of which contains computation cores, local caches, and coherence masters. The distrib… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: 17 pages, 13 figures, submitted to IEEE for possible publication

  7. arXiv:2002.06799  [pdf, other

    cs.LG cs.FL stat.ML

    Equivalence of Dataflow Graphs via Rewrite Rules Using a Graph-to-Sequence Neural Model

    Authors: Steve Kommrusch, Théo Barollet, Louis-Noël Pouchet

    Abstract: In this work we target the problem of provably computing the equivalence between two programs represented as dataflow graphs. To this end, we formalize the problem of equivalence between two programs as finding a set of semantics-preserving rewrite rules from one into the other, such that after the rewrite the two programs are structurally identical, and therefore trivially equivalent. We then dev… ▽ More

    Submitted 3 June, 2021; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: 20 pages including references and appendices, 10 figures, updated to include acknowledgement

  8. arXiv:1911.06664  [pdf, other

    cs.CC

    Automated Derivation of Parametric Data Movement Lower Bounds for Affine Programs

    Authors: Auguste Olivry, Julien Langou, Louis-Noël Pouchet, P. Sadayappan, Fabrice Rastello

    Abstract: For most relevant computation, the energy and time needed for data movement dominates that for performing arithmetic operations on all computing systems today. Hence it is of critical importance to understand the minimal total data movement achievable during the execution of an algorithm. The achieved total data movement for different schedules of an algorithm can vary widely depending on how effi… ▽ More

    Submitted 15 November, 2019; originally announced November 2019.

  9. arXiv:1901.01808  [pdf, other

    cs.SE cs.LG stat.ML

    SequenceR: Sequence-to-Sequence Learning for End-to-End Program Repair

    Authors: Zimin Chen, Steve Kommrusch, Michele Tufano, Louis-Noël Pouchet, Denys Poshyvanyk, Martin Monperrus

    Abstract: This paper presents a novel end-to-end approach to program repair based on sequence-to-sequence learning. We devise, implement, and evaluate a system, called SequenceR, for fixing bugs based on sequence-to-sequence learning on source code. This approach uses the copy mechanism to overcome the unlimited vocabulary problem that occurs with big code. Our system is data-driven; we train it on 35,578 s… ▽ More

    Submitted 9 September, 2019; v1 submitted 24 December, 2018; originally announced January 2019.

    Comments: 21 pages, 15 figures

    Journal ref: IEEE Transactions on Software Engineering, 2019

  10. arXiv:1811.07999  [pdf, other

    cs.CV cs.LG stat.ML

    Synthetic Lung Nodule 3D Image Generation Using Autoencoders

    Authors: Steve Kommrusch, Louis-Noël Pouchet

    Abstract: One of the challenges of using machine learning techniques with medical data is the frequent dearth of source image data on which to train. A representative example is automated lung cancer diagnosis, where nodule images need to be classified as suspicious or benign. In this work we propose an automatic synthetic lung nodule image generator. Our 3D shape generator is designed to augment the variet… ▽ More

    Submitted 9 September, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: 19 pages, 12 figures, full paper for work initially presented at IJCAI 2018

    Report number: CS-18-101

  11. arXiv:1811.06043  [pdf, other

    cs.DC

    A Performance Vocabulary for Affine Loop Transformations

    Authors: Martin Kong, Louis-Noël Pouchet

    Abstract: Modern polyhedral compilers excel at aggressively optimizing codes with static control parts, but the state-of-practice to find high-performance polyhedral transformations especially for different hardware targets still largely involves auto-tuning. In this work we propose a novel polyhedral scheduling technique, with the aim to reduce the need for auto-tuning while allowing to build customizable… ▽ More

    Submitted 9 April, 2019; v1 submitted 14 November, 2018; originally announced November 2018.

    MSC Class: 68N20 ACM Class: D.2.6; D.3.0

  12. On Characterizing the Data Access Complexity of Programs

    Authors: Venmugil Elango, Fabrice Rastello, Louis-Noel Pouchet, J. Ramanujam, P. Sadayappan

    Abstract: Technology trends will cause data movement to account for the majority of energy expenditure and execution time on emerging computers. Therefore, computational complexity will no longer be a sufficient metric for comparing algorithms, and a fundamental characterization of data access complexity will be increasingly important. The problem of develo** lower bounds for data access complexity has be… ▽ More

    Submitted 9 November, 2014; originally announced November 2014.

    ACM Class: F.2; D.2.8

  13. arXiv:1404.4767  [pdf, other

    cs.DC cs.DS

    On Characterizing the Data Movement Complexity of Computational DAGs for Parallel Execution

    Authors: Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan

    Abstract: Technology trends are making the cost of data movement increasingly dominant, both in terms of energy and time, over the cost of performing arithmetic operations in computer systems. The fundamental ratio of aggregate data movement bandwidth to the total computational power (also referred to the machine balance parameter) in parallel computer systems is decreasing. It is there- fore of considerabl… ▽ More

    Submitted 18 April, 2014; originally announced April 2014.

    Report number: RR-8522

  14. arXiv:1401.5024  [pdf, other

    cs.OH

    Beyond Reuse Distance Analysis: Dynamic Analysis for Characterization of Data Locality Potential

    Authors: Naznin Fauzia, Venmugil Elango, Mahesh Ravishankar, J. Ramanujam, Fabrice Rastello, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan

    Abstract: Emerging computer architectures will feature drastically decreased flops/byte (ratio of peak processing rate to memory bandwidth) as highlighted by recent studies on Exascale architectural trends. Further, flops are getting cheaper while the energy cost of data movement is increasingly dominant. The understanding and characterization of data locality properties of computations is critical in order… ▽ More

    Submitted 21 December, 2013; originally announced January 2014.

    Comments: Transaction on Architecture and Code Optimization (2014)

  15. arXiv:1111.6756  [pdf, ps, other

    cs.DC cs.PF cs.PL

    The Potential of Synergistic Static, Dynamic and Speculative Loop Nest Optimizations for Automatic Parallelization

    Authors: Riyadh Baghdadi, Albert Cohen, Cedric Bastoul, Louis-Noel Pouchet, Lawrence Rauchwerger

    Abstract: Research in automatic parallelization of loop-centric programs started with static analysis, then broadened its arsenal to include dynamic inspection-execution and speculative execution, the best results involving hybrid static-dynamic schemes. Beyond the detection of parallelism in a sequential program, scalable parallelization on many-core processors involves hard and interesting parallelism ada… ▽ More

    Submitted 29 November, 2011; originally announced November 2011.