Skip to main content

Showing 1–22 of 22 results for author: Ackerman, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.13020  [pdf, other

    cs.CL cs.AI

    Using Combinatorial Optimization to Design a High quality LLM Solution

    Authors: Samuel Ackerman, Eitan Farchi, Rami Katan, Orna Raz

    Abstract: We introduce a novel LLM based solution design approach that utilizes combinatorial optimization and sampling. Specifically, a set of factors that influence the quality of the solution are identified. They typically include factors that represent prompt types, LLM inputs alternatives, and parameters governing the generation and design alternatives. Identifying the factors that govern the LLM solut… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  2. arXiv:2402.05388  [pdf, other

    cs.HC cs.SI

    Form-From: A Design Space of Social Media Systems

    Authors: Amy X. Zhang, Michael S. Bernstein, David R. Karger, Mark S. Ackerman

    Abstract: Social media systems are as varied as they are pervasive. They have been almost universally adopted for a broad range of purposes including work, entertainment, activism, and decision making. As a result, they have also diversified, with many distinct designs differing in content type, organization, delivery mechanism, access control, and many other dimensions. In this work, we aim to characterize… ▽ More

    Submitted 23 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Journal ref: Proc. ACM Hum.-Comput. Interact. 8, CSCW1, Article 167 (April 2024), 47 pages

  3. arXiv:2311.01152  [pdf, other

    cs.CL

    Predicting Question-Answering Performance of Large Language Models through Semantic Consistency

    Authors: Ella Rabinovich, Samuel Ackerman, Orna Raz, Eitan Farchi, Ateret Anaby-Tavor

    Abstract: Semantic consistency of a language model is broadly defined as the model's ability to produce semantically-equivalent outputs, given semantically-equivalent inputs. We address the task of assessing question-answering (QA) semantic consistency of contemporary large language models (LLMs) by manually creating a benchmark dataset with high-quality paraphrases for factual questions, and release the da… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: EMNLP2023 GEM workshop, 17 pages

  4. arXiv:2310.14893  [pdf, other

    cs.LG eess.SY stat.AP

    Data Drift Monitoring for Log Anomaly Detection Pipelines

    Authors: Dipak Wani, Samuel Ackerman, Eitan Farchi, Xiaotong Liu, Hau-wen Chang, Sarasi Lalithsena

    Abstract: Logs enable the monitoring of infrastructure status and the performance of associated applications. Logs are also invaluable for diagnosing the root causes of any problems that may arise. Log Anomaly Detection (LAD) pipelines automate the detection of anomalies in logs, providing assistance to site reliability engineers (SREs) in system diagnosis. Log patterns change over time, necessitating updat… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  5. arXiv:2310.14829  [pdf

    cs.CL stat.AP

    Characterizing how 'distributional' NLP corpora distance metrics are

    Authors: Samuel Ackerman, George Kour, Eitan Farchi

    Abstract: A corpus of vector-embedded text documents has some empirical distribution. Given two corpora, we want to calculate a single metric of distance (e.g., Mauve, Frechet Inception) between them. We describe an abstract quality, called `distributionality', of such metrics. A non-distributional metric tends to use very local measurements, or uses global measurements in a way that does not fully reflect… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Published in the August 2023 Joint Statistical Meetings proceedings

  6. arXiv:2305.17750  [pdf, other

    cs.CL

    Reliable and Interpretable Drift Detection in Streams of Short Texts

    Authors: Ella Rabinovich, Matan Vetzler, Samuel Ackerman, Ateret Anaby-Tavor

    Abstract: Data drift is the change in model input data that is one of the key factors leading to machine learning models performance degradation over time. Monitoring drift helps detecting these issues and preventing their harmful consequences. Meaningful drift interpretation is a fundamental step towards effective re-training of the model. In this study we propose an end-to-end framework for reliable model… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: ACL2023 industry track (9 pages)

  7. arXiv:2305.08115  [pdf, other

    cs.LG stat.AP

    Automatic Generation of Attention Rules For Containment of Machine Learning Model Errors

    Authors: Samuel Ackerman, Axel Bendavid, Eitan Farchi, Orna Raz

    Abstract: Machine learning (ML) solutions are prevalent in many applications. However, many challenges exist in making these solutions business-grade. For instance, maintaining the error rate of the underlying ML models at an acceptably low level. Typically, the true relationship between feature inputs and the target feature to be predicted is uncertain, and hence statistical in nature. The approach we prop… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

  8. arXiv:2211.16259  [pdf, other

    cs.CL

    Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

    Authors: George Kour, Samuel Ackerman, Orna Raz, Eitan Farchi, Boaz Carmeli, Ateret Anaby-Tavor

    Abstract: The ability to compare the semantic similarity between text corpora is important in a variety of natural language processing applications. However, standard methods for evaluating these metrics have yet to be established. We propose a set of automatic and interpretable measures for assessing the characteristics of corpus-level semantic similarity metrics, allowing sensible comparison of their beha… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Published at GEM (https://gem-benchmark.com/workshop) workshop at the Empirical Methods in Natural Language Processing (EMNLP) conference in 2022

  9. arXiv:2204.13043  [pdf, other

    cs.HC stat.AP

    High-quality Conversational Systems

    Authors: Samuel Ackerman, Ateret Anaby-Tavor, Eitan Farchi, Esther Goldbraich, George Kour, Ella Rabinovich, Orna Raz, Saritha Route, Marcel Zalmanovici, Naama Zwerdling

    Abstract: Conversational systems or chatbots are an example of AI-Infused Applications (AIIA). Chatbots are especially important as they are often the first interaction of clients with a business and are the entry point of a business into the AI (Artificial Intelligence) world. The quality of the chatbot is, therefore, key. However, as is the case in general with AIIAs, it is especially challenging to asses… ▽ More

    Submitted 28 April, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

  10. arXiv:2201.00355  [pdf, other

    cs.LG cs.SE

    Theory and Practice of Quality Assurance for Machine Learning Systems An Experiment Driven Approach

    Authors: Samuel Ackerman, Guy Barash, Eitan Farchi, Orna Raz, Onn Shehory

    Abstract: The crafting of machine learning (ML) based systems requires statistical control throughout its life cycle. Careful quantification of business requirements and identification of key factors that impact the business requirements reduces the risk of a project failure. The quantification of business requirements results in the definition of random variables representing the system key performance ind… ▽ More

    Submitted 12 April, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

  11. arXiv:2112.11832  [pdf, other

    cs.LG

    Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation

    Authors: George Kour, Marcel Zalmanovici, Orna Raz, Samuel Ackerman, Ateret Anaby-Tavor

    Abstract: Testing Machine Learning (ML) models and AI-Infused Applications (AIIAs), or systems that contain ML models, is highly challenging. In addition to the challenges of testing classical software, it is acceptable and expected that statistical ML models sometimes output incorrect results. A major challenge is to determine when the level of incorrectness, e.g., model accuracy or F1 score for classifier… ▽ More

    Submitted 27 October, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

    Comments: Accepted to EDSMLS workshop at AAAI conference

  12. arXiv:2111.05672  [pdf, other

    cs.LG

    Automatically detecting data drift in machine learning classifiers

    Authors: Samuel Ackerman, Orna Raz, Marcel Zalmanovici, Aviad Zlotnick

    Abstract: Classifiers and other statistics-based machine learning (ML) techniques generalize, or learn, based on various statistical properties of the training data. The assumption underlying statistical ML resulting in theoretical or empirical performance guarantees is that the distribution of the training data is representative of the production data distribution. This assumption often breaks; for instanc… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Journal ref: Originally published in proceedings of Engineering Dependable and Secure Machine Learning Systems (EDSMLS) workshop at AAAI 2019 conference

  13. arXiv:2111.05136  [pdf, other

    stat.AP cs.LG

    Using sequential drift detection to test the API economy

    Authors: Samuel Ackerman, Parijat Dube, Eitan Farchi

    Abstract: The API economy refers to the widespread integration of API (advanced programming interface) microservices, where software applications can communicate with each other, as a crucial element in business models and functions. The number of possible ways in which such a system could be used is huge. It is thus desirable to monitor the usage patterns and identify when the system is used in a way that… ▽ More

    Submitted 25 November, 2021; v1 submitted 9 November, 2021; originally announced November 2021.

  14. arXiv:2110.12506  [pdf, other

    cs.LG cs.AI

    Detecting model drift using polynomial relations

    Authors: Eliran Roffe, Samuel Ackerman, Orna Raz, Eitan Farchi

    Abstract: Machine learning models serve critical functions, such as classifying loan applicants as good or bad risks. Each model is trained under the assumption that the data used in training and in the field come from the same underlying unknown distribution. Often, this assumption is broken in practice. It is desirable to identify when this occurs, to minimize the impact on model performance. We suggest… ▽ More

    Submitted 22 December, 2021; v1 submitted 24 October, 2021; originally announced October 2021.

  15. arXiv:2110.05430  [pdf, other

    cs.LG stat.AP

    Density-based interpretable hypercube region partitioning for mixed numeric and categorical data

    Authors: Samuel Ackerman, Eitan Farchi, Orna Raz, Marcel Zalmanovici, Maya Zohar

    Abstract: Consider a structured dataset of features, such as $\{\textrm{SEX}, \textrm{INCOME}, \textrm{RACE}, \textrm{EXPERIENCE}\}$. A user may want to know where in the feature space observations are concentrated, and where it is sparse or empty. The existence of large sparse or empty regions can provide domain knowledge of soft or hard feature constraints (e.g., what is the typical income range, or that… ▽ More

    Submitted 8 November, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

  16. arXiv:2109.02540  [pdf, other

    cs.AI cs.DC cs.SE

    Towards API Testing Across Cloud and Edge

    Authors: Samuel Ackerman, Sanjib Choudhury, Nirmit Desai, Eitan Farchi, Dan Gisolfi, Andrew Hicks, Saritha Route, Diptikalyan Saha

    Abstract: API economy is driving the digital transformation of business applications across the hybrid Cloud and edge environments. For such transformations to succeed, end-to-end testing of the application API composition is required. Testing of API compositions, even in centralized Cloud environments, is challenging as it requires coverage of functional as well as reliability requirements. The combinatori… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  17. FreaAI: Automated extraction of data slices to test machine learning models

    Authors: Samuel Ackerman, Orna Raz, Marcel Zalmanovici

    Abstract: Machine learning (ML) solutions are prevalent. However, many challenges exist in making these solutions business-grade. One major challenge is to ensure that the ML solution provides its expected business value. In order to do that, one has to bridge the gap between the way ML model performance is measured and the solution requirements. In previous work (Barash et al, "Bridging the gap...") we dem… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Journal ref: International Workshop on Engineering Dependable and Secure Machine Learning Systems, at EDSMLS 2020

  18. arXiv:2108.05319  [pdf, other

    cs.LG stat.AP

    Machine Learning Model Drift Detection Via Weak Data Slices

    Authors: Samuel Ackerman, Parijat Dube, Eitan Farchi, Orna Raz, Marcel Zalmanovici

    Abstract: Detecting drift in performance of Machine Learning (ML) models is an acknowledged challenge. For ML models to become an integral part of business applications it is essential to detect when an ML model drifts away from acceptable operation. However, it is often the case that actual labels are difficult and expensive to get, for example, because they require expert judgment. Therefore, there is a n… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

    Journal ref: DeepTest workshop of ICSE, 2021

  19. arXiv:2012.09258  [pdf, other

    stat.AP cs.LG stat.ML

    Detection of data drift and outliers affecting machine learning model performance over time

    Authors: Samuel Ackerman, Eitan Farchi, Orna Raz, Marcel Zalmanovici, Parijat Dube

    Abstract: A trained ML model is deployed on another `test' dataset where target feature values (labels) are unknown. Drift is distribution change between the training and deployment data, which is concerning if model performance changes. For a cat/dog image classifier, for instance, drift during deployment could be rabbit images (new class) or cat/dog images with changed characteristics (change in distribut… ▽ More

    Submitted 6 September, 2022; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: In: JSM Proceedings, Nonparametric Statistics Section, 20202. Philadelphia, PA: American Statistical Association. 144--160

  20. arXiv:2007.16109  [pdf, other

    stat.AP cs.LG stat.ML

    Sequential Drift Detection in Deep Learning Classifiers

    Authors: Samuel Ackerman, Parijat Dube, Eitan Farchi

    Abstract: We utilize neural network embeddings to detect data drift by formulating the drift detection within an appropriate sequential decision framework. This enables control of the false alarm rate although the statistical tests are repeatedly applied. Since change detection algorithms naturally face a tradeoff between avoiding false alarms and quick correct detection, we introduce a loss function which… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 11 pages + appendix, 7 figures

  21. arXiv:1805.08320  [pdf, other

    cs.MA cs.RO

    The Swarmathon: An Autonomous Swarm Robotics Competition

    Authors: Sarah M. Ackerman, G. Matthew Fricke, Joshua P. Hecker, Kastro M. Hamed, Samantha R. Fowler, Antonio D. Griego, Jarett C. Jones, J. Jake Nichol, Kurt W. Leucht, Melanie E. Moses

    Abstract: The Swarmathon is a swarm robotics programming challenge that engages college students from minority-serving institutions in NASA's Journey to Mars. Teams compete by programming a group of robots to search for, pick up, and drop off resources in a collection zone. The Swarmathon produces prototypes for robot swarms that would collect resources on the surface of Mars. Robots operate completely auto… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: Paper presented May 2018 at ICRA 2018 Workshop: "Swarms: From Biology to Robotics and Back"

  22. arXiv:cs/9904010  other

    cs.CY cs.HC

    Beyond Concern: Understanding Net Users' Attitudes About Online Privacy

    Authors: Lorrie Faith Cranor, Joseph Reagle, Mark S. Ackerman

    Abstract: People are concerned about privacy, particularly on the Internet. While many studies have provided evidence of this concern, few have explored the nature of the concern in detail, especially for the online environment. With this study, we have tried to better understand the nature of online privacy concerns; we look beyond the fact that people are concerned and attempt to understand how they are… ▽ More

    Submitted 18 April, 1999; originally announced April 1999.

    Comments: 5 figures and appendix

    Report number: AT&T Labs-Research Technical Report TR 99.4.3 ACM Class: K.4.1