Skip to main content

Showing 1–15 of 15 results for author: Hines, K

.
  1. arXiv:2403.14720  [pdf, other

    cs.CR cs.CL cs.LG

    Defending Against Indirect Prompt Injection Attacks With Spotlighting

    Authors: Keegan Hines, Gary Lopez, Matthew Hall, Federico Zarfati, Yonatan Zunger, Emre Kiciman

    Abstract: Large Language Models (LLMs), while powerful, are built and trained to process a single text input. In common applications, multiple inputs can be processed by concatenating them together into a single stream of text. However, the LLM is unable to distinguish which sections of prompt belong to various input sources. Indirect prompt injection attacks take advantage of this vulnerability by embeddin… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2303.13299  [pdf, other

    cs.LG cs.AI

    Reckoning with the Disagreement Problem: Explanation Consensus as a Training Objective

    Authors: Avi Schwarzschild, Max Cembalest, Karthik Rao, Keegan Hines, John Dickerson

    Abstract: As neural networks increasingly make critical decisions in high-stakes settings, monitoring and explaining their behavior in an understandable and trustworthy manner is a necessity. One commonly used type of explainer is post hoc feature attribution, a family of methods for giving each feature in an input a score corresponding to its influence on a model's output. A major limitation of this family… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  3. Equalizing Credit Opportunity in Algorithms: Aligning Algorithmic Fairness Research with U.S. Fair Lending Regulation

    Authors: I. Elizabeth Kumar, Keegan E. Hines, John P. Dickerson

    Abstract: Credit is an essential component of financial wellbeing in America, and unequal access to it is a large factor in the economic disparities between demographic groups that exist today. Today, machine learning algorithms, sometimes trained on alternative data, are increasingly being used to determine access to credit, yet research has shown that machine learning can encode many different versions of… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Journal ref: AIES '22: Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society

  4. arXiv:2203.07490  [pdf, other

    cs.LG cs.CY

    Repairing Regressors for Fair Binary Classification at Any Decision Threshold

    Authors: Kweku Kwegyir-Aggrey, A. Feder Cooper, Jessica Dai, John Dickerson, Keegan Hines, Suresh Venkatasubramanian

    Abstract: We study the problem of post-processing a supervised machine-learned regressor to maximize fair binary classification at all decision thresholds. By decreasing the statistical distance between each group's score distributions, we show that we can increase fair performance across all thresholds at once, and that we can do so without a large decrease in accuracy. To this end, we introduce a formal m… ▽ More

    Submitted 10 December, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

  5. arXiv:2106.07756  [pdf, other

    cs.LG cs.AI

    Counterfactual Explanations for Machine Learning: Challenges Revisited

    Authors: Sahil Verma, John Dickerson, Keegan Hines

    Abstract: Counterfactual explanations (CFEs) are an emerging technique under the umbrella of interpretability of machine learning (ML) models. They provide ``what if'' feedback of the form ``if an input datapoint were $x'$ instead of $x$, then an ML model's output would be $y'$ instead of $y$.'' Counterfactual explainability for ML models has yet to see widespread adoption in industry. In this short paper,… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: Presented at CHI HCXAI 2021 workshop

  6. arXiv:2106.03962  [pdf, other

    cs.LG cs.AI

    Amortized Generation of Sequential Algorithmic Recourses for Black-box Models

    Authors: Sahil Verma, Keegan Hines, John P. Dickerson

    Abstract: Explainable machine learning (ML) has gained traction in recent years due to the increasing adoption of ML-based systems in many sectors. Algorithmic Recourses (ARs) provide "what if" feedback of the form "if an input datapoint were x' instead of x, then an ML-based system's output would be y' instead of y." ARs are attractive due to their actionable feedback, amenability to existing legal framewo… ▽ More

    Submitted 16 December, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted at AAAI 2022

  7. arXiv:2010.10596  [pdf, other

    cs.LG cs.AI stat.ML

    Counterfactual Explanations and Algorithmic Recourses for Machine Learning: A Review

    Authors: Sahil Verma, Varich Boonsanong, Minh Hoang, Keegan E. Hines, John P. Dickerson, Chirag Shah

    Abstract: Machine learning plays a role in many deployed decision systems, often in ways that are difficult or impossible to understand by human stakeholders. Explaining, in a human-understandable way, the relationship between the input and output of machine learning models is essential to the development of trustworthy machine learning based systems. A burgeoning body of research seeks to define the goals… ▽ More

    Submitted 15 November, 2022; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 23 pages (8 pages of references)

  8. Low-light Environment Neural Surveillance

    Authors: Michael Potter, Henry Gridley, Noah Lichtenstein, Kevin Hines, John Nguyen, Jacob Walsh

    Abstract: We design and implement an end-to-end system for real-time crime detection in low-light environments. Unlike Closed-Circuit Television, which performs reactively, the Low-Light Environment Neural Surveillance provides real time crime alerts. The system uses a low-light video feed processed in real-time by an optical-flow network, spatial and temporal networks, and a Support Vector Machine to ident… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: Pre-print, accepted to IEEE International Workshop on Machine Learning for Signal Processing 2020 Conference Proceedings. Code and dataset are available at https://github.com/mcgridles/

    ACM Class: I.4.9; I.5.4

  9. arXiv:2006.10252  [pdf, other

    cs.LG stat.ML

    Quantifying Challenges in the Application of Graph Representation Learning

    Authors: Antonia Gogoglou, C. Bayan Bruss, Brian Nguyen, Reza Sarshogh, Keegan E. Hines

    Abstract: Graph Representation Learning (GRL) has experienced significant progress as a means to extract structural information in a meaningful way for subsequent learning tasks. Current approaches including shallow embeddings and Graph Neural Networks have mostly been tested with node classification and link prediction tasks. In this work, we provide an application oriented perspective to a set of popular… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  10. arXiv:1910.03081  [pdf, ps, other

    cs.LG stat.ML

    On the Interpretability and Evaluation of Graph Representation Learning

    Authors: Antonia Gogoglou, C. Bayan Bruss, Keegan E. Hines

    Abstract: With the rising interest in graph representation learning, a variety of approaches have been proposed to effectively capture a graph's properties. While these approaches have improved performance in graph machine learning tasks compared to traditional graph techniques, they are still perceived as techniques with limited insight into the information encoded in these representations. In this work, w… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 Graph Representation Learning workshop

  11. Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

    Authors: Anh Truong, Austin Walters, Jeremy Goodsitt, Keegan Hines, C. Bayan Bruss, Reza Farivar

    Abstract: There has been considerable growth and interest in industrial applications of machine learning (ML) in recent years. ML engineers, as a consequence, are in high demand across the industry, yet improving the efficiency of ML engineers remains a fundamental challenge. Automated machine learning (AutoML) has emerged as a way to save time and effort on repetitive tasks in ML pipelines, such as data pr… ▽ More

    Submitted 3 September, 2019; v1 submitted 15 August, 2019; originally announced August 2019.

  12. arXiv:1907.07225  [pdf, other

    cs.LG cs.SI stat.ML

    DeepTrax: Embedding Graphs of Financial Transactions

    Authors: C. Bayan Bruss, Anish Khazane, Jonathan Rider, Richard Serpe, Antonia Gogoglou, Keegan E. Hines

    Abstract: Financial transactions can be considered edges in a heterogeneous graph between entities sending money and entities receiving money. For financial institutions, such a graph is likely large (with millions or billions of edges) while also sparsely connected. It becomes challenging to apply machine learning to such large and sparse graphs. Graph representation learning seeks to embed the nodes of a… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

  13. arXiv:1907.01705  [pdf, other

    cs.LG stat.ML

    Graph Embeddings at Scale

    Authors: C. Bayan Bruss, Anish Khazane, Jonathan Rider, Richard Serpe, Saurabh Nagrecha, Keegan E. Hines

    Abstract: Graph embedding is a popular algorithmic approach for creating vector representations for individual vertices in networks. Training these algorithms at scale is important for creating embeddings that can be used for classification, ranking, recommendation and other common applications in industry. While industrial systems exist for training graph embeddings on large datasets, many of these distrib… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: Workshop on Mining and Learning with Graphs 2019

  14. arXiv:1906.09266  [pdf, other

    cs.CL cs.CV

    A Multitask Network for Localization and Recognition of Text in Images

    Authors: Mohammad Reza Sarshogh, Keegan E. Hines

    Abstract: We present an end-to-end trainable multi-task network that addresses the problem of lexicon-free text extraction from complex documents. This network simultaneously solves the problems of text localization and text recognition and text segments are identified with no post-processing, crop**, or word grou**. A convolutional backbone and Feature Pyramid Network are combined to provide a shared r… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: ICDAR 2019

  15. arXiv:1808.10742  [pdf, other

    cs.CR

    Anomaly Detection in Cyber Network Data Using a Cyber Language Approach

    Authors: Bartley D. Richardson, Benjamin J. Radford, Shawn E. Davis, Keegan Hines, David Pekarek

    Abstract: As the amount of cyber data continues to grow, cyber network defenders are faced with increasing amounts of data they must analyze to ensure the security of their networks. In addition, new types of attacks are constantly being created and executed globally. Current rules-based approaches are effective at characterizing and flagging known attacks, but they typically fail when presented with a new… ▽ More

    Submitted 15 August, 2018; originally announced August 2018.