Showing 1–26 of 26 results for author: Jananthan, H

  1. arXiv:2407.01481  [pdf, other]

    cs.DC cs.PF

    LLload: Simplifying Real-Time Job Monitoring for HPC Users

    Authors: Chansup Byun, Julia Mullen, Albert Reuther, William Arcand, William Bergeron, David Bestor, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Peter Michaleas, Guillermo Morales, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner, Lauren Milechin

    Abstract: One of the more complex tasks for researchers using HPC systems is performance monitoring and tuning of their applications. Developing a practice of continuous performance improvement, both for speed-up and efficient use of resources, is essential to the long term success of both the HPC practitioner and the research project. Profiling tools provide a nice view of the performance of an application…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2404.14643  [pdf, other]

    cs.CR cs.CY cs.GR cs.NI cs.SI

    Teaching Network Traffic Matrices in an Interactive Game Environment

    Authors: Chasen Milner, Hayden Jananthan, Jeremy Kepner, Vijay Gadepally, Michael Jones, Peter Michaleas, Ritesh Patel, Sandeep Pisharody, Gabriel Wachman, Alex Pentland

    Abstract: The Internet has become a critical domain for modern society that requires ongoing efforts for its improvement and protection. Network traffic matrices are a powerful tool for understanding and analyzing networks and are broadly taught in online graph theory educational resources. Network traffic matrix concepts are rarely available in online computer network and cybersecurity educational resource…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 9 pages, 10 figures, 52 references; accepted to IEEE GrAPL

  3. arXiv:2311.03609  [pdf, other]

    cs.LG

    Testing RadiX-Nets: Advances in Viable Sparse Topologies

    Authors: Kevin Kwak, Zack West, Hayden Jananthan, Jeremy Kepner

    Abstract: The exponential growth of data has sparked computational demands on ML research and industry use. Sparsification of hyper-parametrized deep neural networks (DNNs) creates simpler representations of complex data. Past research has shown that some sparse networks achieve similar performance as dense ones, reducing runtime and storage. RadiX-Nets, a subgroup of sparse DNNs, maintain uniformity which…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 5 pages, 8 figures, accepted to IEEE URTC 2023

  4. arXiv:2311.03574  [pdf, ps, other]

    cs.DB

    Fuzzy Relational Databases via Associative Arrays

    Authors: Kevin Min, Hayden Jananthan, Jeremy Kepner

    Abstract: The increasing rise in artificial intelligence has made the use of imprecise language in computer programs like ChatGPT more prominent. Fuzzy logic addresses this form of imprecise language by introducing the concept of fuzzy sets, where elements belong to the set with a certain membership value (called the fuzzy value). This paper combines fuzzy data with relational algebra to provide the mathema…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 5 pages, accepted to IEEE URTC 2023

  5. arXiv:2311.03562  [pdf, other]

    cs.SI

    From Bits to Insights: Exploring Network Traffic, Traffic Matrices, and Heavy-Tailed Data

    Authors: Christopher Howard, Hayden Jananthan, Jeremy Kepner

    Abstract: With the Internet a central component of modern society, entire industries and fields have developed both in support of and against cybersecurity. For cyber operators to best understand their networks, they must conduct detailed traffic analyses. A growing recognition is the ubiquity of heavy-tailed characteristics in network traffic. However, a thorough analysis of cybersecurity programs suggests li…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 5 pages, 5 figures, accepted to IEEE URTC 2023

  6. arXiv:2311.03559  [pdf, other]

    cs.DM

    Algebraic Conditions on One-Step Breadth-First Search

    Authors: Emma Fu, Hayden Jananthan, Jeremy Kepner

    Abstract: The GraphBLAS community has demonstrated the power of linear algebra-leveraged graph algorithms, such as matrix-vector products for breadth-first search (BFS) traversals. This paper investigates the algebraic conditions needed for such computations when working with directed hypergraphs, represented by incidence arrays with entries from an arbitrary value set with binary addition and multiplicatio…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 5 pages, 2 figures, accepted to IEEE URTC 2023

  7. arXiv:2310.00522  [pdf, other]

    cs.SI

    Mapping of Internet "Coastlines" via Large Scale Anonymized Network Source Correlations

    Authors: Hayden Jananthan, Jeremy Kepner, Michael Jones, William Arcand, David Bestor, William Bergeron, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, et al. (3 additional authors not shown)

    Abstract: Expanding the scientific tools available to protect computer networks can be aided by a deeper understanding of the underlying statistical distributions of network traffic and their potential geometric interpretations. Analyses of large scale network observations provide a unique window into studying those underlying statistics. Newly developed GraphBLAS hypersparse matrices and D4M associative ar…

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 9 pages, 7 figures, IEEE HPEC 2023 (accepted)

  8. pPython Performance Study

    Authors: Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner

    Abstract: pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. pPython follows a SPMD (single program multiple data) model of computation. pPython runs on a single-node (e.g., a laptop) running Window…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.14908

  9. Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays

    Authors: Michael Jones, Jeremy Kepner, Andrew Prout, Timothy Davis, William Arcand, David Bestor, William Bergeron, Chansup Byun, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Sandeep Pisharody, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas

    Abstract: Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires int…

    Submitted 8 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 8 pages, 8 figures, 1 table, 69 references. arXiv admin note: text overlap with arXiv:2203.13934, arXiv:2309.01806

  10. Focusing and Calibration of Large Scale Network Sensors using GraphBLAS Anonymized Hypersparse Matrices

    Authors: Jeremy Kepner, Michael Jones, Phil Dykstra, Chansup Byun, Timothy Davis, Hayden Jananthan, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Charles Yee, et al. (1 additional author not shown)

    Abstract: Defending community-owned cyber space requires community-based efforts. Large-scale network observations that uphold the highest regard for privacy are key to protecting our shared cyberspace. Deployment of the necessary network sensors requires careful sensor placement, focusing, and calibration with significant volumes of network observations. This paper demonstrates novel focusing and calibrati…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 9 pages, 12 figures, 1 table, 63 references, 2 appendices

  11. arXiv:2209.05725  [pdf, other]

    cs.NI cs.DC

    Hypersparse Network Flow Analysis of Packets with GraphBLAS

    Authors: Tyler Trigg, Chad Meiners, Sandeep Pisharody, Hayden Jananthan, Michael Jones, Adam Michaleas, Timothy Davis, Erik Welch, William Arcand, David Bestor, William Bergeron, Chansup Byun, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Doug Stetson, Charles Yee, et al. (1 additional author not shown)

    Abstract: Internet analysis is a major challenge due to the volume and rate of network traffic. In lieu of analyzing traffic as raw packets, network analysts often rely on compressed network flows (netflows) that contain the start time, stop time, source, destination, and number of packets in each direction. However, many traffic analyses benefit from temporal aggregation of multiple simultaneous netflows,…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:2203.13934, arXiv:2108.06653, arXiv:2008.00307

  12. Large Scale Enrichment and Statistical Cyber Characterization of Network Traffic (Enriquecimiento a gran escala y caracterización cibernética estadística del tráfico de red)

    Authors: Ivan Kawaminami, Arminda Estrada, Youssef Elsakkary, Hayden Jananthan, Aydın Buluç, Tim Davis, Daniel Grant, Michael Jones, Chad Meiners, Andrew Morris, Sandeep Pisharody, Jeremy Kepner

    Abstract: Modern network sensors continuously produce enormous quantities of raw data that are beyond the capacity of human analysts. Cross-correlation of network sensors increases this challenge by enriching every network event with additional metadata. These large volumes of enriched network data present opportunities to statistically characterize network traffic and quickly answer a key question: "What a…

    Submitted 1 December, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 17 pages, 16 figures, HPEC, Spanish version

  13. Python Implementation of the Dynamic Distributed Dimensional Data Model

    Authors: Hayden Jananthan, Lauren Milechin, Michael Jones, William Arcand, William Bergeron, David Bestor, Chansup Byun, Michael Houle, Matthew Hubbell, Vijay Gadepally, Anna Klein, Peter Michaleas, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner

    Abstract: Python has become a standard scientific computing language with fast-growing support of machine learning and data analysis modules, as well as an increasing usage of big data. The Dynamic Distributed Dimensional Data Model (D4M) offers a highly composable, unified data model with strong performance built to handle big data fast and efficiently. In this work we present an implementation of D4M in P…

    Submitted 22 November, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: 8 pages, 7 figures, accepted to HPEC 2022

  14. pPython for Parallel Python Programming

    Authors: Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Kurt Keville, Anna Klein, Peter Michaleas, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner

    Abstract: pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. The core data structure in pPython is a distributed numerical array whose distribution onto multiple processors is specified with a map c…

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:astro-ph/0606464

  15. arXiv:2204.11289  [pdf, ps, other]

    math.LO cs.CL

    Complexity and Avoidance

    Authors: Hayden Jananthan

    Abstract: In this dissertation we examine the relationships between several hierarchies, including the complexity, $\mathrm{LUA}$ (Linearly Universal Avoidance), and shift complexity hierarchies, with an eye towards quantitative bounds on growth rates therein. We show that for suitable $f$ and $p$, there are $q$ and $g$ such that $\mathrm{LUA}(q) \leq_\mathrm{s} \mathrm{COMPLEX}(f)$ and…

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: Dissertation under the direction of Professor Stephen G. Simpson, submitted to the faculty of the Graduate School of Vanderbilt University in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Mathematics. viii+157 pages. 5 figures. ORCID: 0000-0001-6877-0923

    MSC Class: 03D75 ACM Class: F.4.1

  16. arXiv:2203.13934  [pdf, other]

    cs.NI cs.DC cs.OS cs.SI

    GraphBLAS on the Edge: Anonymized High Performance Streaming of Network Traffic

    Authors: Michael Jones, Jeremy Kepner, Daniel Andersen, Aydin Buluc, Chansup Byun, K Claffy, Timothy Davis, William Arcand, Jonathan Bernays, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Chad Meiners, Lauren Milechin, Julie Mullen, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Jon Sreekanth, et al. (3 additional authors not shown)

    Abstract: Long range detection is a cornerstone of defense in many operating domains (land, sea, undersea, air, space, ...). In the cyber domain, long range detection requires the analysis of significant network traffic from a variety of observatories and outposts. Construction of anonymized hypersparse traffic matrices on edge network devices can be a key enabler by providing significant data compression i…

    Submitted 5 September, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted to IEEE HPEC, Outstanding Paper Award, 8 pages, 8 figures, 1 table, 70 references. arXiv admin note: text overlap with arXiv:2108.06653, arXiv:2008.00307, arXiv:2203.10230

  17. Temporal Correlation of Internet Observatories and Outposts

    Authors: Jeremy Kepner, Michael Jones, Daniel Andersen, Aydın Buluç, Chansup Byun, K Claffy, Timothy Davis, William Arcand, Jonathan Bernays, David Bestor, William Bergeron, Vijay Gadepally, Daniel Grant, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Chad Meiners, Lauren Milechin, Andrew Morris, Julie Mullen, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, et al. (4 additional authors not shown)

    Abstract: The Internet has become a critical component of modern civilization requiring scientific exploration akin to endeavors to understand the land, sea, air, and space environments. Understanding the baseline statistical distributions of traffic is essential to the scientific understanding of the Internet. Correlating data from different Internet observatories and outposts can be a useful tool for gai…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: 8 pages, 8 figures, 2 tables, 59 references; accepted to GrAPL 2022. arXiv admin note: substantial text overlap with arXiv:2108.06653

  18. arXiv:2103.15203  [pdf, other]

    cs.MS cs.DB cs.DM cs.NE math.RA

    Mathematics of Digital Hyperspace

    Authors: Jeremy Kepner, Timothy Davis, Vijay Gadepally, Hayden Jananthan, Lauren Milechin

    Abstract: Social media, e-commerce, streaming video, e-mail, cloud documents, web pages, traffic flows, and network packets fill vast digital lakes, rivers, and oceans that we each navigate daily. This digital hyperspace is an amorphous flow of data supported by continuous streams that stretch standard concepts of type and dimension. The unstructured data of digital hyperspace can be elegantly represented,…

    Submitted 28 March, 2021; originally announced March 2021.

    Comments: 9 pages, 8 figures, 2 tables, accepted to GrAPL 2021. arXiv admin note: text overlap with arXiv:1807.03165, arXiv:2004.01181, arXiv:1909.05631, arXiv:1708.02937

  19. arXiv:2001.06731  [pdf, other]

    cs.DB

    AI Data Wrangling with Associative Arrays

    Authors: Jeremy Kepner, Vijay Gadepally, Hayden Jananthan, Lauren Milechin, Siddharth Samsi

    Abstract: The AI revolution is data driven. AI "data wrangling" is the process by which unusable data is transformed to support AI algorithm development (training) and deployment (inference). Significant time is devoted to translating diverse data representations supporting the many query and analysis steps found in an AI pipeline. Rigorous mathematical representations of these data enable data translation…

    Submitted 18 January, 2020; originally announced January 2020.

    Comments: 3 pages, 2 figures, 23 references, accepted for Northeast Database day (NEDB) 2020. arXiv admin note: text overlap with arXiv:1907.04217

  20. arXiv:1809.06009  [pdf, other]

    cs.LG math.NA stat.ML

    Uncertainty Propagation in Deep Neural Networks Using Extended Kalman Filtering

    Authors: Jessica S. Titensky, Hayden Jananthan, Jeremy Kepner

    Abstract: Extended Kalman Filtering (EKF) can be used to propagate and quantify input uncertainty through a Deep Neural Network (DNN) assuming mild hypotheses on the input distribution. This methodology yields results comparable to existing methods of uncertainty propagation for DNNs while lowering the computational overhead considerably. Additionally, EKF allows model error to be naturally incorporated int…

    Submitted 16 September, 2018; originally announced September 2018.

    Comments: 4 Pages, 8 figures. Accepted at MIT IEEE Undergraduate Research Technology Conference 2018. Publication pending

  21. arXiv:1807.05308  [pdf, other]

    cs.DC cs.DB cs.OS cs.PF

    TabulaROSA: Tabular Operating System Architecture for Massively Parallel Heterogeneous Compute Engines

    Authors: Jeremy Kepner, Ron Brightwell, Alan Edelman, Vijay Gadepally, Hayden Jananthan, Michael Jones, Sam Madden, Peter Michaleas, Hamed Okhravi, Kevin Pedretti, Albert Reuther, Thomas Sterling, Mike Stonebraker

    Abstract: The rise in computing hardware choices is driving a reevaluation of operating systems. The traditional role of an operating system controlling the execution of its own hardware is evolving toward a model whereby the controlling processor is distinct from the compute engines that are performing most of the computations. In this context, an operating system can be viewed as software that brokers and…

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: 8 pages, 6 figures, accepted at IEEE HPEC 2018

  22. arXiv:1807.03165  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Sparse Deep Neural Network Exact Solutions

    Authors: Jeremy Kepner, Vijay Gadepally, Hayden Jananthan, Lauren Milechin, Sid Samsi

    Abstract: Deep neural networks (DNNs) have emerged as key enablers of machine learning. Applying larger DNNs to more diverse applications is an important challenge. The computations performed during DNN training and inference are dominated by operations on the weight matrices describing the DNN. As DNNs incorporate more layers and more neurons per layer, these weight matrices may be required to be sparse b…

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: 8 pages, 10 figures, accepted to IEEE HPEC 2018. arXiv admin note: text overlap with arXiv:1708.02937

  23. arXiv:1803.01281  [pdf, other]

    cs.DC cs.DM cs.DS cs.PF math.CO

    Design, Generation, and Validation of Extreme Scale Power-Law Graphs

    Authors: Jeremy Kepner, Siddharth Samsi, William Arcand, David Bestor, Bill Bergeron, Tim Davis, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Anna Klein, Peter Michaleas, Roger Pearce, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Geoff Sanders, Charles Yee, Albert Reuther

    Abstract: Massive power-law graphs drive many fields: metagenomics, brain mapping, Internet-of-things, cybersecurity, and sparse machine learning. The development of novel algorithms and systems to process these data requires the design, generation, and validation of enormous graphs with exactly known properties. Such graphs accelerate the proper testing of new algorithms and systems and are a prerequisite…

    Submitted 3 March, 2018; originally announced March 2018.

    Comments: 8 pages, 6 figures, IEEE IPDPS 2018 Graph Algorithm Building Blocks (GABB) workshop

  24. Polystore Mathematics of Relational Algebra

    Authors: Hayden Jananthan, Ziqi Zhou, Vijay Gadepally, Dylan Hutchison, Suna Kim, Jeremy Kepner

    Abstract: Financial transactions, internet search, and data analysis are all placing increasing demands on databases. SQL, NoSQL, and NewSQL databases have been developed to meet these demands and each offers unique benefits. SQL, NoSQL, and NewSQL databases also rely on different underlying mathematical models. Polystores seek to provide a mechanism to allow applications to transparently achieve the benefi…

    Submitted 3 December, 2017; originally announced December 2017.

    Comments: 10 pages, 2 figures, accepted to Big Data 2017 2nd International Workshop on Methods to Manage Heterogeneous Big Data and Polystore Databases

  25. arXiv:1702.07832  [pdf, other]

    cs.DS cs.DM math.CO

    Constructing Adjacency Arrays from Incidence Arrays

    Authors: Hayden Jananthan, Karia Dibert, Jeremy Kepner

    Abstract: Graph construction, a fundamental operation in a data processing pipeline, is typically done by multiplying the incidence array representations of a graph, $\mathbf{E}_\mathrm{in}$ and $\mathbf{E}_\mathrm{out}$, to produce an adjacency array of the graph, $\mathbf{A}$, that can be processed with a variety of algorithms. This paper provides the mathematical criteria to determine if the product…

    Submitted 24 February, 2017; originally announced February 2017.

    Comments: 8 pages, 5 figures, accepted to IEEE IPDPS 2017 Workshop on Graph Algorithm Building Blocks

  26. arXiv:1606.05797  [pdf]

    cs.DB cs.DC cs.PL

    Associative Array Model of SQL, NoSQL, and NewSQL Databases

    Authors: Jeremy Kepner, Vijay Gadepally, Dylan Hutchison, Hayden Jananthan, Timothy Mattson, Siddharth Samsi, Albert Reuther

    Abstract: The success of SQL, NoSQL, and NewSQL databases is a reflection of their ability to provide significant functionality and performance benefits for specific domains, such as financial transactions, internet search, and data analysis. The BigDAWG polystore seeks to provide a mechanism to allow applications to transparently achieve the benefits of diverse databases while insulating applications from…

    Submitted 18 June, 2016; originally announced June 2016.

    Comments: 9 pages; 6 figures; accepted to IEEE High Performance Extreme Computing (HPEC) conference 2016