Skip to main content

Showing 1–27 of 27 results for author: Purvine, E

.
  1. arXiv:2312.00023  [pdf, other

    cs.CR

    Hypergraph Topological Features for Autoencoder-Based Intrusion Detection for Cybersecurity Data

    Authors: Bill Kay, Sinan G. Aksoy, Molly Baird, Daniel M. Best, Helen Jenne, Cliff Joslyn, Christopher Potvin, Gregory Henselman-Petrusek, Garret Seppala, Stephen J. Young, Emilie Purvine

    Abstract: In this position paper, we argue that when hypergraphs are used to capture multi-way local relations of data, their resulting topological features describe global behaviour. Consequently, these features capture complex correlations that can then serve as high fidelity inputs to autoencoder-driven anomaly detection pipelines. We propose two such potential pipelines for cybersecurity data, one that… ▽ More

    Submitted 9 November, 2023; originally announced December 2023.

    MSC Class: 55N31

  2. arXiv:2311.16154  [pdf

    cs.CR

    Step** out of Flatland: Discovering Behavior Patterns as Topological Structures in Cyber Hypergraphs

    Authors: Helen Jenne, Sinan G. Aksoy, Daniel Best, Alyson Bittner, Gregory Henselman-Petrusek, Cliff Joslyn, Bill Kay, Audun Myers, Garret Seppala, Jackson Warley, Stephen J. Young, Emilie Purvine

    Abstract: Data breaches and ransomware attacks occur so often that they have become part of our daily news cycle. This is due to a myriad of factors, including the increasing number of internet-of-things devices, shift to remote work during the pandemic, and advancement in adversarial techniques, which all contribute to the increase in both the complexity of data captured and the challenge of protecting our… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 18 pages, 11 figures. This paper is written for a general audience

    MSC Class: 55N31

  3. arXiv:2310.11626  [pdf, other

    cs.MS

    HyperNetX: A Python package for modeling complex network data as hypergraphs

    Authors: Brenda Praggastis, Sinan Aksoy, Dustin Arendt, Mark Bonicillo, Cliff Joslyn, Emilie Purvine, Madelyn Shapiro, Ji Young Yun

    Abstract: HyperNetX (HNX) is an open source Python library for the analysis and visualization of complex network data modeled as hypergraphs. Initially released in 2019, HNX facilitates exploratory data analysis of complex networks using algebraic topology, combinatorics, and generalized hypergraph and graph theoretical methods on structured data inputs. With its 2023 release, the library supports attaching… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 3 pages, 2 figures

  4. arXiv:2309.08010  [pdf, other

    cs.CG

    Malicious Cyber Activity Detection Using Zigzag Persistence

    Authors: Audun Myers, Alyson Bittner, Sinan Aksoy, Daniel M. Best, Gregory Henselman-Petrusek, Helen Jenne, Cliff Joslyn, Bill Kay, Garret Seppala, Stephen J. Young, Emilie Purvine

    Abstract: In this study we synthesize zigzag persistence from topological data analysis with autoencoder-based approaches to detect malicious cyber activity and derive analytic insights. Cybersecurity aims to safeguard computers, networks, and servers from various forms of malicious attacks, including network damage, data theft, and activity monitoring. Here we focus on the detection of malicious activity u… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  5. arXiv:2309.06634  [pdf, other

    cs.LG math.AT stat.ML

    $G$-Mapper: Learning a Cover in the Mapper Construction

    Authors: Enrique Alvarado, Robin Belton, Emily Fischer, Kang-Ju Lee, Sourabh Palande, Sarah Percival, Emilie Purvine

    Abstract: The Mapper algorithm is a visualization technique in topological data analysis (TDA) that outputs a graph reflecting the structure of a given dataset. However, the Mapper algorithm requires tuning several parameters in order to generate a ``nice" Mapper graph. This paper focuses on selecting the cover parameter. We present an algorithm that optimizes the cover of a Mapper graph by splitting a cove… ▽ More

    Submitted 4 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  6. arXiv:2302.02857  [pdf, other

    cs.CG math.AT

    Topological Analysis of Temporal Hypergraphs

    Authors: Audun Myers, Cliff Joslyn, Bill Kay, Emilie Purvine, Gregory Roek, Madelyn Shapiro

    Abstract: In this work we study the topological properties of temporal hypergraphs. Hypergraphs provide a higher dimensional generalization of a graph that is capable of capturing multi-way connections. As such, they have become an integral part of network science. A common use of hypergraphs is to model events as hyperedges in which the event can involve many elements as nodes. This provides a more complet… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  7. arXiv:2212.00222  [pdf, other

    cs.LG cs.CG

    Experimental Observations of the Topology of Convolutional Neural Network Activations

    Authors: Emilie Purvine, Davis Brown, Brett Jefferson, Cliff Joslyn, Brenda Praggastis, Archit Rathore, Madelyn Shapiro, Bei Wang, Youjia Zhou

    Abstract: Topological data analysis (TDA) is a branch of computational mathematics, bridging algebraic topology and data science, that provides compact, noise-robust representations of complex structures. Deep neural networks (DNNs) learn millions of parameters associated with a series of transformations defined by the model architecture, resulting in high-dimensional, difficult-to-interpret internal repres… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: Accepted at AAAI 2023. This version includes supplementary material

  8. arXiv:2208.06894  [pdf, other

    cs.CV cs.AI cs.LG

    The SVD of Convolutional Weights: A CNN Interpretability Framework

    Authors: Brenda Praggastis, Davis Brown, Carlos Ortiz Marrero, Emilie Purvine, Madelyn Shapiro, Bei Wang

    Abstract: Deep neural networks used for image classification often use convolutional filters to extract distinguishing features before passing them to a linear classifier. Most interpretability literature focuses on providing semantic meaning to convolutional filters to explain a model's reasoning process and confirm its use of relevant information from the input domain. Fully connected layers can be studie… ▽ More

    Submitted 14 August, 2022; originally announced August 2022.

    MSC Class: 68T07; 68T01; 05C65

  9. arXiv:2204.01142   

    math.AT cs.CG cs.LG math.CO

    Proceedings of TDA: Applications of Topological Data Analysis to Data Science, Artificial Intelligence, and Machine Learning Workshop at SDM 2022

    Authors: R. W. R. Darling, John A. Emanuello, Emilie Purvine, Ahmad Ridley

    Abstract: Topological Data Analysis (TDA) is a rigorous framework that borrows techniques from geometric and algebraic topology, category theory, and combinatorics in order to study the "shape" of such complex high-dimensional data. Research in this area has grown significantly over the last several years bringing a deeply rooted theory to bear on practical applications in areas such as genomics, natural la… ▽ More

    Submitted 14 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

  10. arXiv:2105.10414  [pdf, other

    cs.LG cs.CV math.AT math.CT

    Sheaves as a Framework for Understanding and Interpreting Model Fit

    Authors: Henry Kvinge, Brett Jefferson, Cliff Joslyn, Emilie Purvine

    Abstract: As data grows in size and complexity, finding frameworks which aid in interpretation and analysis has become critical. This is particularly true when data comes from complex systems where extensive structure is available, but must be drawn from peripheral sources. In this paper we argue that in such situations, sheaves can provide a natural framework to analyze how well a statistical model fits at… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: 12 page

  11. arXiv:2104.11214  [pdf, other

    cs.HC cs.CG math.AT

    Topological Simplifications of Hypergraphs

    Authors: Youjia Zhou, Archit Rathore, Emilie Purvine, Bei Wang

    Abstract: We study hypergraph visualization via its topological simplification. We explore both vertex simplification and hyperedge simplification of hypergraphs using tools from topological data analysis. In particular, we transform a hypergraph to its graph representations known as the line graph and clique expansion. A topological simplification of such a graph representation induces a simplification of… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

  12. arXiv:2011.08952  [pdf, other

    cs.CL cs.CG

    Argumentative Topology: Finding Loop(holes) in Logic

    Authors: Sarah Tymochko, Zachary New, Lucius Bynum, Emilie Purvine, Timothy Doster, Julien Chaput, Tegan Emerson

    Abstract: Advances in natural language processing have resulted in increased capabilities with respect to multiple tasks. One of the possible causes of the observed performance gains is the introduction of increasingly sophisticated text representations. While many of the new word embedding techniques can be shown to capture particular notions of sentiment or associative structures, we explore the ability o… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  13. arXiv:2010.03068  [pdf, other

    q-bio.QM math.CO

    Hypergraph Models of Biological Networks to Identify Genes Critical to Pathogenic Viral Response

    Authors: Song Feng, Emily Heath, Brett Jefferson, Cliff Joslyn, Henry Kvinge, Hugh D. Mitchell, Brenda Praggastis, Amie J. Eisfeld, Amy C. Sims, Larissa B. Thackray, Shufang Fan, Kevin B. Walters, Peter J. Halfmann, Danielle Westhoff-Smith, Qing Tan, Vineet D. Menachery, Timothy P. Sheahan, Adam S. Cockrell, Jacob F. Kocher, Kelly G. Stratton, Natalie C. Heller, Lisa M. Bramer, Michael S. Diamond, Ralph S. Baric, Katrina M. Waters , et al. (3 additional authors not shown)

    Abstract: Background: Representing biological networks as graphs is a powerful approach to reveal underlying patterns, signatures, and critical components from high-throughput biomolecular data. However, graphs do not natively capture the multi-way relationships present among genes and proteins in biological systems. Hypergraphs are generalizations of graphs that naturally model multi-way relationships and… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    MSC Class: 92C42; 92-08; 05C65

  14. arXiv:2008.04357  [pdf, other

    cs.SI cs.CR cs.DM

    Directional Laplacian Centrality for Cyber Situational Awareness

    Authors: Sinan G. Aksoy, Emilie Purvine, Stephen J. Young

    Abstract: Cyber operations is drowning in diverse, high-volume, multi-source data. In order to get a full picture of current operations and identify malicious events and actors analysts must see through data generated by a mix of human activity and benign automated processes. Although many monitoring and alert systems exist, they typically use signature-based detection methods. We introduce a general method… ▽ More

    Submitted 23 March, 2021; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: 25 pages, 15 figures

  15. arXiv:2003.11782  [pdf, other

    cs.DM

    Hypernetwork Science: From Multidimensional Networks to Computational Topology

    Authors: Cliff A. Joslyn, Sinan Aksoy, Tiffany J. Callahan, Lawrence E. Hunter, Brett Jefferson, Brenda Praggastis, Emilie A. H. Purvine, Ignacio J. Tripodi

    Abstract: As data structures and mathematical objects used for complex systems modeling, hypergraphs sit nicely poised between on the one hand the world of network models, and on the other that of higher-order mathematical abstractions from algebra, lattice theory, and topology. They are able to represent complex systems interactions more faithfully than graphs and networks, while also being some of the sim… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.

    Report number: PNNL-SA-152208 MSC Class: 05C65; ACM Class: G.2.2

  16. arXiv:1912.05487  [pdf, other

    physics.data-an

    A Sheaf Theoretical Approach to Uncertainty Quantification of Heterogeneous Geolocation Information

    Authors: Cliff Joslyn, Lauren Charles, Chris DePerno, Nicholas Gould, Kathleen Nowak, Brenda Praggastis, Emilie Purvine, Michael Robinson, Jennifer Strules, Paul Whitney

    Abstract: Integration of heterogeneous sensors is a challenging problem across a range of applications. Prominent among these are multi-target tracking, where one must combine observations from different sensor types in a meaningful way to track multiple targets. Because sensors have differing error models, we seek a theoretically-justified quantification of the agreement among ensembles of sensors, both ov… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Submitted

  17. arXiv:1906.11295  [pdf, other

    physics.soc-ph cs.SI physics.data-an

    Hypernetwork Science via High-Order Hypergraph Walks

    Authors: Sinan G. Aksoy, Cliff Joslyn, Carlos Ortiz Marrero, Brenda Praggastis, Emilie Purvine

    Abstract: We propose high-order hypergraph walks as a framework to generalize graph-based network science techniques to hypergraphs. Edge incidence in hypergraphs is quantitative, yielding hypergraph walks with both length and width. Graph methods which then generalize to hypergraphs include connected component analyses, graph distance-based metrics such as closeness centrality, and motif-based measures suc… ▽ More

    Submitted 8 June, 2020; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: Updated to address referee comments, to appear in EPJ Data Science

  18. arXiv:1906.04936  [pdf, other

    cs.DM cs.LG cs.SI

    Relative Hausdorff Distance for Network Analysis

    Authors: Sinan G. Aksoy, Kathleen E. Nowak, Emilie Purvine, Stephen J. Young

    Abstract: Similarity measures are used extensively in machine learning and data science algorithms. The newly proposed graph Relative Hausdorff (RH) distance is a lightweight yet nuanced similarity measure for quantifying the closeness of two graphs. In this work we study the effectiveness of RH distance as a tool for detecting anomalies in time-evolving graph sequences. We apply RH to cyber data with given… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: 20 pages

  19. arXiv:1903.08298  [pdf, other

    math.AT cs.CG

    Local Versus Global Distances for Zigzag Persistence Modules

    Authors: Ellen Gasparovic, Maria Gommel, Emilie Purvine, Radmila Sazdanovic, Bei Wang, Yusu Wang, Lori Ziegelmeier

    Abstract: This short note establishes explicit and broadly applicable relationships between persistence-based distances computed locally and globally. In particular, we show that the bottleneck distance between two zigzag persistence modules restricted to an interval is always bounded above by the distance between the unrestricted versions. While this result is not surprising, it could have different practi… ▽ More

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: 9 pages, 1 figure

  20. arXiv:1812.05282  [pdf, other

    math.AT cs.CG

    The Relationship Between the Intrinsic Cech and Persistence Distortion Distances for Metric Graphs

    Authors: Ellen Gasparovic, Maria Gommel, Emilie Purvine, Radmila Sazdanovic, Bei Wang, Yusu Wang, Lori Ziegelmeier

    Abstract: Metric graphs are meaningful objects for modeling complex structures that arise in many real-world applications, such as road networks, river systems, earthquake faults, blood vessels, and filamentary structures in galaxies. To study metric graphs in the context of comparison, we are interested in determining the relative discriminative capabilities of two topology-based distances between a pair o… ▽ More

    Submitted 13 December, 2018; originally announced December 2018.

    Comments: 18 pages, 6 figures

    MSC Class: 57M15

  21. arXiv:1805.11547  [pdf, other

    math.AT

    Local homology of abstract simplicial complexes

    Authors: Michael Robinson, Chris Capraro, Cliff Joslyn, Emilie Purvine, Brenda Praggastis, Stephen Ranshous, Arun Sathanur

    Abstract: This survey describes some useful properties of the local homology of abstract simplicial complexes. Although the existing literature on local homology is somewhat dispersed, it is largely dedicated to the study of manifolds, submanifolds, or samplings thereof. While this is a vital perspective, the focus of this survey is squarely on the local homology of abstract simplicial complexes. Our motiva… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: 38 pages

    MSC Class: 55N25

  22. On Homotopy Types of Vietoris-Rips Complexes of Metric Gluings

    Authors: Michal Adamaszek, Henry Adams, Ellen Gasparovic, Maria Gommel, Emilie Purvine, Radmila Sazdanovic, Bei Wang, Yusu Wang, Lori Ziegelmeier

    Abstract: We study Vietoris-Rips complexes of metric wedge sums and metric gluings. We show that the Vietoris-Rips complex of a wedge sum, equipped with a natural metric, is homotopy equivalent to the wedge sum of the Vietoris-Rips complexes. We also provide generalizations for when two metric spaces are glued together along a common isometric subset. As our main example, we deduce the homotopy type of the… ▽ More

    Submitted 12 August, 2019; v1 submitted 17 December, 2017; originally announced December 2017.

    MSC Class: 05E45 ACM Class: F.2.2

  23. arXiv:1711.11098  [pdf, ps, other

    physics.soc-ph cs.SI math.CO

    A generative graph model for electrical infrastructure networks

    Authors: Sinan G. Aksoy, Emilie Purvine, Eduardo Cotilla-Sanchez, Mahantesh Halappanavar

    Abstract: We propose a generative graph model for electrical infrastructure networks that accounts for heterogeneity in both node and edge type. To inform the model design, we analyze the properties of power grid graphs derived from the U.S. Eastern Interconnection, Texas Interconnection, and Poland transmission system power grids. Across these datasets, we find subgraphs induced by nodes of the same voltag… ▽ More

    Submitted 19 July, 2018; v1 submitted 29 November, 2017; originally announced November 2017.

  24. arXiv:1702.07379  [pdf, other

    math.AT

    A Complete Characterization of the 1-Dimensional Intrinsic Cech Persistence Diagrams for Metric Graphs

    Authors: Ellen Gasparovic, Maria Gommel, Emilie Purvine, Radmila Sazdanovic, Bei Wang, Yusu Wang, Lori Ziegelmeier

    Abstract: Metric graphs are special types of metric spaces used to model and represent simple, ubiquitous, geometric relations in data such as biological networks, social networks, and road networks. We are interested in giving a qualitative description of metric graphs using topological summaries. In particular, we provide a complete characterization of the 1-dimensional intrinsic Cech persistence diagrams… ▽ More

    Submitted 7 July, 2017; v1 submitted 23 February, 2017; originally announced February 2017.

    Comments: 24 pages, 10 figures

    MSC Class: 57M15

  25. arXiv:1609.02883  [pdf, other

    math.CT

    A Category Theoretical Investigation of the Type Hierarchy for Heterogeneous Sensor Integration

    Authors: Emilie Purvine, Cliff Joslyn, Michael Robinson

    Abstract: Consider the case of many sensors, each returning very different types of data (e.g., a camera returning images, a thermometer returning probability distributions, a newspaper returning articles, a traffic counter returning numbers). Additionally we have a set of questions, or variables, that we wish to use these sensors to inform (e.g., temperature, location, crowd size, topic). Rather than using… ▽ More

    Submitted 9 September, 2016; originally announced September 2016.

    Report number: PNNL-25784

  26. Energy Minimization of Discrete Protein Titration State Models Using Graph Theory

    Authors: Emilie Purvine, Kyle Monson, Elizabeth Jurrus, Keith Star, Nathan A. Baker

    Abstract: There are several applications in computational biophysics which require the optimization of discrete interacting states; e.g., amino acid titration states, ligand oxidation states, or discrete rotamer angles. Such optimization can be very time-consuming as it scales exponentially in the number of sites to be optimized. In this paper, we describe a new polynomial-time algorithm for optimization of… ▽ More

    Submitted 16 April, 2016; v1 submitted 24 July, 2015; originally announced July 2015.

  27. arXiv:1501.00943  [pdf, other

    physics.soc-ph eess.SY

    Comparative Study of Clustering Techniques for Real-Time Dynamic Model Reduction

    Authors: Emilie Purvine, Eduardo Cotilla-Sanchez, Mahantesh Halappanavar, Zhenyu Huang, Guang Lin, Shuai Lu, Shaobu Wang

    Abstract: Dynamic model reduction in power systems is necessary for improving computational efficiency. Traditional model reduction using linearized models or online analysis is not adequate to capture dynamic behaviors of the power system, especially with the new mix of intermittent generation and intelligent consumption making the power system more dynamic and non-linear. Real-time dynamic model reduction… ▽ More

    Submitted 18 July, 2017; v1 submitted 5 January, 2015; originally announced January 2015.

    Comments: Statistical Analysis and Data Mining, in press, 2017