Showing 1–17 of 17 results for author: Holder, L

Searching in archive cs.
  1. arXiv:2312.13487  [pdf, other]

    cs.AI

    Understanding and Estimating Domain Complexity Across Domains

    Authors: Katarina Doctor, Mayank Kejriwal, Lawrence Holder, Eric Kildebeck, Emma Resmini, Christopher Pereyda, Robert J. Steininger, Daniel V. Olivença

    Abstract: Artificial Intelligence (AI) systems, trained in controlled environments, often struggle in real-world complexities. We propose a general framework for estimating domain complexity across diverse environments, like open-world learning and real-world applications. This framework distinguishes between intrinsic complexity (inherent to the domain) and extrinsic complexity (dependent on the AI agent).…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 34 pages, 13 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2303.04141

  2. arXiv:2308.10781  [pdf, other]

    cs.LG

    Mixed-Integer Projections for Automated Data Correction of EMRs Improve Predictions of Sepsis among Hospitalized Patients

    Authors: Mehak Arora, Hassan Mortagy, Nathan Dwarshius, Swati Gupta, Andre L. Holder, Rishikesan Kamaleswaran

    Abstract: Machine learning (ML) models are increasingly pivotal in automating clinical decisions. Yet, a glaring oversight in prior research has been the lack of proper processing of Electronic Medical Record (EMR) data in the clinical context for errors and outliers. Addressing this oversight, we introduce an innovative projections-based method that seamlessly integrates clinical expertise as domain constr…

    Submitted 21 August, 2023; originally announced August 2023.

    MSC Class: 90; 92

  3. arXiv:2303.04141  [pdf, other]

    cs.AI

    Toward Defining a Domain Complexity Measure Across Domains

    Authors: Katarina Doctor, Christine Task, Eric Kildebeck, Mayank Kejriwal, Lawrence Holder, Russell Leong

    Abstract: Artificial Intelligence (AI) systems planned for deployment in real-world applications frequently are researched and developed in closed simulation environments where all variables are controlled and known to the simulator or labeled benchmark datasets are used. Transition from these simulators, testbeds, and benchmark datasets to more open-world domains poses significant challenges to AI systems,…

    Submitted 7 March, 2023; originally announced March 2023.

  4. arXiv:2012.04226  [pdf, other]

    cs.AI cs.CV cs.LG

    A Unifying Framework for Formal Theories of Novelty: Framework, Examples and Discussion

    Authors: T. E. Boult, P. A. Grabowicz, D. S. Prijatelj, R. Stern, L. Holder, J. Alspector, M. Jafarzadeh, T. Ahmad, A. R. Dhamija, C. Li, S. Cruz, A. Shrivastava, C. Vondrick, W. J. Scheirer

    Abstract: Managing inputs that are novel, unknown, or out-of-distribution is critical as an agent moves from the lab to the open world. Novelty-related problems include being tolerant to novel perturbations of the normal input, detecting when the input includes novel items, and adapting to novel inputs. While significant research has been undertaken in these areas, a noticeable gap exists in the lack of a f…

    Submitted 8 December, 2020; originally announced December 2020.

    Comments: Extended version/preprint of an AAAI 2021 paper

  5. arXiv:2010.01985  [pdf, other]

    cs.AI

    Measuring the Complexity of Domains Used to Evaluate AI Systems

    Authors: Christopher Pereyda, Lawrence Holder

    Abstract: There is currently a rapid increase in the number of challenge problems, benchmarking datasets and algorithmic optimization tests for evaluating AI systems. However, there does not currently exist an objective measure to determine the complexity between these newly created domains. This lack of cross-domain examination creates an obstacle to effectively research more general AI systems. We propose…

    Submitted 18 September, 2020; originally announced October 2020.

  6. arXiv:2002.08312  [pdf, other]

    cs.SI cs.AI

    ITeM: Independent Temporal Motifs to Summarize and Compare Temporal Networks

    Authors: Sumit Purohit, Lawrence B. Holder, George Chin

    Abstract: Networks are a fundamental and flexible way of representing various complex systems. Many domains such as communication, citation, procurement, biology, social media, and transportation can be modeled as a set of entities and their relationships. Temporal networks are a specialization of general networks where the temporal evolution of the system is as important to understand as the structure of t…

    Submitted 5 August, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

  7. arXiv:1703.08614  [pdf, other]

    cs.SI cs.DB

    GraphZip: Dictionary-based Compression for Mining Graph Streams

    Authors: Charles A. Packer, Lawrence B. Holder

    Abstract: A massive amount of data generated today on platforms such as social networks, telecommunication networks, and the internet in general can be represented as graph streams. Activity in a network's underlying graph generates a sequence of edges in the form of a stream; for example, a social network may generate a graph stream based on the interactions (edges) between different users (nodes) over tim…

    Submitted 24 March, 2017; originally announced March 2017.

  8. arXiv:1503.00849  [pdf, other]

    cs.DB

    A Selectivity based approach to Continuous Pattern Detection in Streaming Graphs

    Authors: Sutanay Choudhury, Lawrence Holder, George Chin, Khushbu Agarwal, John Feo

    Abstract: Cyber security is one of the most significant technical challenges in current times. Detecting adversarial activities, prevention of theft of intellectual properties and customer data is a high priority for corporations and government agencies around the world. Cyber defenders need to analyze massive-scale, high-resolution network flows to identify, categorize, and mitigate attacks involving netwo…

    Submitted 3 March, 2015; originally announced March 2015.

    Comments: in 18th International Conference on Extending Database Technology (EDBT) (2015)

  9. arXiv:1410.1783   

    cs.SI physics.soc-ph

    Feature Engineering for Supervised Link Prediction on Dynamic Social Networks

    Authors: Jeyanthi Narasimhan, Lawrence Holder

    Abstract: Link prediction is an important network science problem in many domains such as social networks, chem/bio-informatics, etc. Most of these networks are dynamic in nature with patterns evolving over time. In such cases, it is necessary to incorporate time in the mining process in a seamless manner to aid in better prediction performance. We propose a two-step solution strategy to the link prediction…

    Submitted 17 September, 2015; v1 submitted 7 October, 2014; originally announced October 2014.

    Comments: 7 pages, 12 figures, the 10th international conference on Data Mining, DMIN'14. The paper is withdrawn by the author owing to change in results

  10. arXiv:1407.3745  [pdf, other]

    cs.DB

    Query Optimization for Dynamic Graphs

    Authors: Sutanay Choudhury, Lawrence Holder, George Chin, Patrick Mackey, Khushbu Agarwal, John Feo

    Abstract: Given a query graph that represents a pattern of interest, the emerging pattern detection problem can be viewed as a continuous query problem on a dynamic graph. We present an incremental algorithm for continuous query processing on dynamic graphs. The algorithm is based on the concept of query decomposition; we decompose a query graph into smaller subgraphs and assemble the result of sub-queries…

    Submitted 14 July, 2014; originally announced July 2014.

    Report number: PNNL-SA-103238, Pacific Northwest National Laboratory, Richland, WA

  11. arXiv:1406.5161  [pdf, other]

    cs.DC cs.LG

    Fast Support Vector Machines Using Parallel Adaptive Shrinking on Distributed Systems

    Authors: Jeyanthi Narasimhan, Abhinav Vishnu, Lawrence Holder, Adolfy Hoisie

    Abstract: Support Vector Machines (SVM), a popular machine learning technique, has been applied to a wide range of domains such as science, finance, and social networks for supervised learning. Whether it is identifying high-risk patients by health-care professionals, or potential high-school students to enroll in college by school districts, SVMs can play a major role for social good. This paper undertakes…

    Submitted 19 June, 2014; originally announced June 2014.

    Comments: 10 pages, 9 figures, 3 tables

  12. arXiv:1306.2460  [pdf, other]

    cs.DB

    StreamWorks - A system for Dynamic Graph Search

    Authors: Sutanay Choudhury, Lawrence Holder, George Chin, Abhik Ray, Sherman Beus, John Feo

    Abstract: Acting on time-critical events by processing ever growing social media, news or cyber data streams is a major technical challenge. Many of these data sources can be modeled as multi-relational graphs. Mining and searching for subgraph patterns in a continuous setting requires an efficient approach to incremental graph search. The goal of our work is to enable real-time search capabilities for grap…

    Submitted 11 June, 2013; originally announced June 2013.

    Comments: SIGMOD 2013: International Conference on Management of Data

    ACM Class: H.2.4

  13. arXiv:1306.2459  [pdf, other]

    cs.DB

    Fast Search for Dynamic Multi-Relational Graphs

    Authors: Sutanay Choudhury, Lawrence Holder, George Chin, John Feo

    Abstract: Acting on time-critical events by processing ever growing social media or news streams is a major technical challenge. Many of these data sources can be modeled as multi-relational graphs. Continuous queries or techniques to search for rare events that typically arise in monitoring applications have been studied extensively for relational databases. This work is dedicated to answer the question th…

    Submitted 11 June, 2013; originally announced June 2013.

    Comments: SIGMOD Workshop on Dynamic Networks Management and Mining (DyNetMM), 2013

    ACM Class: H.2.4

  14. arXiv:1304.6761  [pdf, other]

    cs.CR cs.NI cs.SI

    Towards a Networks-of-Networks Framework for Cyber Security

    Authors: Mahantesh Halappanavar, Sutanay Choudhury, Emilie Hogan, Peter Hui, John R. Johnson, Indrajit Ray, Lawrence Holder

    Abstract: Networks-of-networks (NoN) is a graph-theoretic model of interdependent networks that have distinct dynamics at each network (layer). By adding special edges to represent relationships between nodes in different layers, NoN provides a unified mechanism to study interdependent systems intertwined in a complex relationship. While NoN based models have been proposed for cyber-physical systems, in thi…

    Submitted 24 April, 2013; originally announced April 2013.

    Comments: A shorter (3-page) version of this paper will appear in the Proceedings of the IEEE Intelligence and Security Informatics 2013, Seattle Washington, USA, June 4-7, 2013

  15. arXiv:1209.2178   

    cs.DB cs.SI

    Continuous Queries for Multi-Relational Graphs

    Authors: Sutanay Choudhury, Lawrence B. Holder, Abhik Ray, George Chin Jr., John T. Feo

    Abstract: Acting on time-critical events by processing ever growing social media or news streams is a major technical challenge. Many of these data sources can be modeled as multi-relational graphs. Continuous queries or techniques to search for rare events that typically arise in monitoring applications have been studied extensively for relational databases. This work is dedicated to answer the question th…

    Submitted 8 March, 2013; v1 submitted 10 September, 2012; originally announced September 2012.

    Comments: Withdrawn for information disclosure considerations

    Report number: PNNL-SA-90326

  16. Large-scale continuous subgraph queries on streams

    Authors: Sutanay Choudhury, Lawrence Holder, George Chin, John Feo

    Abstract: Graph pattern matching involves finding exact or approximate matches for a query subgraph in a larger graph. It has been studied extensively and has strong applications in domains such as computer vision, computational biology, social networks, security and finance. The problem of exact graph pattern matching is often described in terms of subgraph isomorphism which is NP-complete. The exponential…

    Submitted 31 July, 2012; originally announced August 2012.

    Journal ref: In Proceedings of the first annual workshop on High performance computing meets databases (HPCDB 2011). ACM, New York, NY, USA, 29-32

  17. arXiv:cs/9402102  [pdf, ps]

    cs.AI

    Substructure Discovery Using Minimum Description Length and Background Knowledge

    Authors: D. J. Cook, L. B. Holder

    Abstract: The ability to identify interesting and repetitive substructures is an essential component to discovering knowledge in structural data. We describe a new version of our SUBDUE substructure discovery system based on the minimum description length principle. The SUBDUE system discovers substructures that compress the original data and represent structural concepts in the data. By replacing previou…

    Submitted 31 January, 1994; originally announced February 1994.

    Comments: See http://www.jair.org/ for an online appendix and other files accompanying this article

    Journal ref: Journal of Artificial Intelligence Research, Vol 1, (1994), 231-255