Showing 1–16 of 16 results for author: Lumsdaine, A

  1. arXiv:2307.11684  [pdf, other]

    cs.LG

    Minibatching Offers Improved Generalization Performance for Second Order Optimizers

    Authors: Eric Silk, Swarnita Chakraborty, Nairanjana Dasgupta, Anand D. Sarwate, Andrew Lumsdaine, Tony Chiang

    Abstract: Training deep neural networks (DNNs) used in modern machine learning is computationally expensive. Machine learning scientists, therefore, rely on stochastic first-order methods for training, coupled with significant hand-tuning, to obtain good performance. To better understand performance variability of different stochastic algorithms, including second-order methods, we conduct an empirical study…

    Submitted 25 May, 2023; originally announced July 2023.

    Comments: 14 pages, 6 figures, 5 tables

  2. arXiv:2201.11326  [pdf, other]

    cs.DC cs.DS

    High-order Line Graphs of Non-uniform Hypergraphs: Algorithms, Applications, and Experimental Analysis

    Authors: Xu T. Liu, Jesun Firoz, Sinan Aksoy, Ilya Amburg, Andrew Lumsdaine, Cliff Joslyn, Assefaw H. Gebremedhin, Brenda Praggastis

    Abstract: Hypergraphs offer flexible and robust data representations for many applications, but methods that work directly on hypergraphs are not readily available and tend to be prohibitively expensive. Much of the current analysis of hypergraphs relies on first performing a graph expansion -- either based on the nodes (clique expansion), or on the edges (line graph) -- and then running standard graph anal…

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted at "36th IEEE International Parallel & Distributed Processing Symposium (IPDPS '22)"

    Report number: PNNL-SA-167812

    MSC Class: 05C65 (Primary); 05C85 (Secondary); 68W10 (Secondary)

    ACM Class: G.2.2

  3. arXiv:2010.11448  [pdf, other]

    cs.DM

    Parallel Algorithms and Heuristics for Efficient Computation of High-Order Line Graphs of Hypergraphs

    Authors: Xu T. Liu, Jesun Firoz, Andrew Lumsdaine, Cliff Joslyn, Sinan Aksoy, Brenda Praggastis, Assefaw Gebremedhin

    Abstract: This paper considers structures of systems beyond dyadic (pairwise) interactions and investigates mathematical modeling of multi-way interactions and connections as hypergraphs, where captured relationships among system entities are set-valued. To date, in most situations, entities in a hypergraph are considered connected as long as there is at least one common "neighbor". However, minimal commona…

    Submitted 15 July, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 11 pages

    Report number: PNNL-SA-164086

    MSC Class: 05C65

    ACM Class: G.2.2

  4. Real-Time Refocusing using an FPGA-based Standard Plenoptic Camera

    Authors: Christopher Hahne, Andrew Lumsdaine, Amar Aggoun, Vladan Velisavljevic

    Abstract: Plenoptic cameras are receiving increasing attention in scientific and commercial applications because they capture the entire structure of light in a scene, enabling optical transforms (such as focusing) to be applied computationally after the fact, rather than once and for all at the time a picture is taken. In many settings, real-time interactive performance is also desired, which in turn requi…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: IEEE Trans. on Industrial Electronics

    Journal ref: Volume: 65, Issue: 12, Dec. 2018, Pages: 9757 - 9766

  5. arXiv:2005.11214  [pdf]

    physics.plasm-ph

    PISCES-RF: a liquid-cooled high-power steady-state helicon plasma device

    Authors: Saikat Chakraborty Thakur, Michael J. Simmonds, Juan F. Caneses, Fengjen Chang, Eric M. Hollmann, Russell P. Doerner, Richard Goulding, Arnold Lumsdaine, Juergen Rapp, George R. Tynan

    Abstract: Radio-frequency (RF) driven helicon plasma sources can produce relatively high-density plasmas (n > 10^19 m^-3) at relatively moderate powers (< 2 kW) in argon. However, to produce similar high-density plasmas for fusion relevant gases such as hydrogen, deuterium and helium, much higher RF powers are needed. For very high RF powers, thermal issues of the RF-transparent dielectric window, used in th…

    Submitted 29 December, 2020; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: 13 pages, 22 figures

  6. arXiv:1904.00518  [pdf, other]

    cs.DC

    A Comparative Study of Asynchronous Many-Tasking Runtimes: Cilk, Charm++, ParalleX and AM++

    Authors: Abhishek Kulkarni, Andrew Lumsdaine

    Abstract: We evaluate and compare four contemporary and emerging runtimes for high-performance computing (HPC) applications: Cilk, Charm++, ParalleX and AM++. We compare along three bases: programming model, execution model and the implementation on an underlying machine model. The comparison study includes a survey of each runtime system's programming models, their corresponding execution models, their stat…

    Submitted 31 March, 2019; originally announced April 2019.

  7. arXiv:1706.05760  [pdf, other]

    cs.DC

    Families of Distributed Memory Parallel Graph Algorithms from Self-Stabilizing Kernels-An SSSP Case Study

    Authors: Thejaka Kanewala, Marcin Zalewski, Martina Barnas, Andrew Lumsdaine

    Abstract: Self-stabilizing algorithms are important because of their robustness and guaranteed convergence. Starting from any arbitrary state, a self-stabilizing algorithm is guaranteed to converge to a legitimate state. Those algorithms are not directly amenable to solving distributed graph processing problems when performance and scalability are important. In this paper, we show the "Abstract Graph Mach…

    Submitted 18 June, 2017; originally announced June 2017.

    Comments: 10 pages, including references

    ACM Class: D.1.3

  8. arXiv:1611.06334  [pdf, other]

    cs.DC

    A Survey of Methods for Collective Communication Optimization and Tuning

    Authors: Udayanga Wickramasinghe, Andrew Lumsdaine

    Abstract: New developments in HPC technology in terms of increasing computing power on multi/many core processors, high-bandwidth memory/IO subsystems and communication interconnects, pose a direct impact on software and runtime system development. These advancements have become useful in producing high-performance collective communication interfaces that integrate efficiently on a wide variety of platforms…

    Submitted 19 November, 2016; originally announced November 2016.

  9. arXiv:1606.05790  [pdf, other]

    cs.MS astro-ph.IM cs.DC cs.DS

    Mathematical Foundations of the GraphBLAS

    Authors: Jeremy Kepner, Peter Aaltonen, David Bader, Aydın Buluc, Franz Franchetti, John Gilbert, Dylan Hutchison, Manoj Kumar, Andrew Lumsdaine, Henning Meyerhenke, Scott McMillan, Jose Moreira, John D. Owens, Carl Yang, Marcin Zalewski, Timothy Mattson

    Abstract: The GraphBLAS standard (GraphBlas.org) is being developed to bring the potential of matrix based graph algorithms to the broadest possible audience. Mathematically the GraphBLAS defines a core set of matrix-based graph operations that can be used to implement a wide class of graph algorithms in a wide range of programming environments. This paper provides an introduction to the mathematics of th…

    Submitted 13 July, 2016; v1 submitted 18 June, 2016; originally announced June 2016.

    Comments: 9 pages; 11 figures; accepted to IEEE High Performance Extreme Computing (HPEC) conference 2016. arXiv admin note: text overlap with arXiv:1504.01039

  10. arXiv:1604.04772  [pdf, other]

    cs.DC

    Abstract Graph Machine

    Authors: Thejaka Amila Kanewala, Marcin Zalewski, Andrew Lumsdaine

    Abstract: An Abstract Graph Machine (AGM) is an abstract model for distributed memory parallel stabilizing graph algorithms. A stabilizing algorithm starts from a particular initial state and goes through a series of different state changes until it converges. The AGM adds work dependency to the stabilizing algorithm. The work is processed within the processing function. All processes in the system execute the…

    Submitted 28 April, 2016; v1 submitted 16 April, 2016; originally announced April 2016.

    Comments: 10 pages, including Appendix and References

  11. arXiv:1507.06702  [pdf, other]

    cs.DC cs.DS cs.PF cs.SE

    The Anatomy of Large-Scale Distributed Graph Algorithms

    Authors: Jesun Sahariar Firoz, Thejaka Amila Kanewala, Marcin Zalewski, Martina Barnas, Andrew Lumsdaine

    Abstract: The increasing complexity of the software/hardware stack of modern supercomputers results in an explosion of parameters. Performance analysis becomes a truly experimental science, even more challenging in the presence of massive irregularity and data dependency. We analyze how the existing body of research handles the experimental aspect in the context of distributed graph algorithms (DGAs). We d…

    Submitted 23 July, 2015; originally announced July 2015.

    ACM Class: D.1.3

  12. arXiv:1408.0393  [pdf]

    cs.MS cs.DM cs.DS

    Standards for Graph Algorithm Primitives

    Authors: Tim Mattson, David Bader, Jon Berry, Aydin Buluc, Jack Dongarra, Christos Faloutsos, John Feo, John Gilbert, Joseph Gonzalez, Bruce Hendrickson, Jeremy Kepner, Charles Leiserson, Andrew Lumsdaine, David Padua, Stephen Poole, Steve Reinhardt, Mike Stonebraker, Steve Wallach, Andrew Yoo

    Abstract: It is our view that the state of the art in constructing a large collection of graph algorithms in terms of linear algebraic operations is mature enough to support the emergence of a standard set of primitive building blocks. This paper is a position paper defining the problem and announcing our intention to launch an open effort to define this standard.

    Submitted 2 August, 2014; originally announced August 2014.

    Comments: 2 pages, IEEE HPEC 2013

  13. arXiv:1304.5257  [pdf, ps, other]

    cs.SE

    What Makes Code Hard to Understand?

    Authors: Michael Hansen, Robert L. Goldstone, Andrew Lumsdaine

    Abstract: What factors impact the comprehensibility of code? Previous research suggests that expectation-congruent programs should take less time to understand and be less prone to errors. We present an experiment in which participants with programming experience predict the exact output of ten small Python programs. We use subtle differences between program versions to demonstrate that seemingly insignific…

    Submitted 26 April, 2013; v1 submitted 18 April, 2013; originally announced April 2013.

    Comments: 19 pages, 2 figures

    ACM Class: H.1.2

  14. arXiv:1211.1658  [pdf, ps, other]

    cs.DC

    Extending Task Parallelism for Frequent Pattern Mining

    Authors: Prabhanjan Kambadur, Amol Ghoting, Anshul Gupta, Andrew Lumsdaine

    Abstract: Algorithms for frequent pattern mining, a popular informatics application, have unique requirements that are not met by any of the existing parallel tools. In particular, such applications operate on extremely large data sets and have irregular memory access patterns. For efficient parallelization of such applications, it is necessary to support dynamic load balancing along with scheduling mechani…

    Submitted 7 November, 2012; originally announced November 2012.

  15. Lazy Evaluation and Delimited Control

    Authors: Ronald Garcia, Andrew Lumsdaine, Amr Sabry

    Abstract: The call-by-need lambda calculus provides an equational framework for reasoning syntactically about lazy evaluation. This paper examines its operational characteristics. By a series of reasoning steps, we systematically unpack the standard-order reduction relation of the calculus and discover a novel abstract machine definition which, like the calculus, goes "under lambdas." We prove that machine…

    Submitted 11 July, 2010; v1 submitted 26 March, 2010; originally announced March 2010.

    ACM Class: D.3.1

    Journal ref: Logical Methods in Computer Science, Volume 6, Issue 3 (July 11, 2010) lmcs:1013

  16. arXiv:0708.2255  [pdf, other]

    cs.PL cs.SE

    A Language for Generic Programming in the Large

    Authors: Jeremy G. Siek, Andrew Lumsdaine

    Abstract: Generic programming is an effective methodology for developing reusable software libraries. Many programming languages provide generics and have features for describing interfaces, but none completely support the idioms used in generic programming. To address this need we developed the language G. The central feature of G is the concept, a mechanism for organizing constraints on generics that is…

    Submitted 16 August, 2007; originally announced August 2007.

    Comments: 50 pages

    ACM Class: D.3.3