Skip to main content

Showing 1–30 of 30 results for author: Farchi, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02355  [pdf, ps, other

    cs.SE

    Effective Technical Reviews

    Authors: Scott Ballentine, Eitan Farchi

    Abstract: There are two ways to check if a program is correct, namely execute it or review it. While executing a program is the ultimate test for its correctness reviewing the program can occur earlier in its development and find problems if done effectively. This work focuses on review techniques. It enables the programmer to effectively review a program and find a range of problems from concurrency to int… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2405.13020  [pdf, other

    cs.CL cs.AI

    Using Combinatorial Optimization to Design a High quality LLM Solution

    Authors: Samuel Ackerman, Eitan Farchi, Rami Katan, Orna Raz

    Abstract: We introduce a novel LLM based solution design approach that utilizes combinatorial optimization and sampling. Specifically, a set of factors that influence the quality of the solution are identified. They typically include factors that represent prompt types, LLM inputs alternatives, and parameters governing the generation and design alternatives. Identifying the factors that govern the LLM solut… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  3. arXiv:2403.09704  [pdf, other

    cs.CL cs.AI cs.LG

    Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations

    Authors: Swapnaja Achintalwar, Ioana Baldini, Djallel Bouneffouf, Joan Byamugisha, Maria Chang, Pierre Dognin, Eitan Farchi, Ndivhuwo Makondo, Aleksandra Mojsilovic, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Inkit Padhi, Orna Raz, Jesus Rios, Prasanna Sattigeri, Moninder Singh, Siphiwe Thwala, Rosario A. Uceda-Sosa, Kush R. Varshney

    Abstract: The alignment of large language models is usually done by model providers to add or control behaviors that are common or universally understood across use cases and contexts. In contrast, in this article, we present an approach and architecture that empowers application developers to tune a model to their particular values, social norms, laws and other regulations, and orchestrate between potentia… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures

  4. arXiv:2403.06009  [pdf, other

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy , et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen… ▽ More

    Submitted 13 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  5. arXiv:2311.04124  [pdf, other

    cs.CL cs.AI cs.LG

    Unveiling Safety Vulnerabilities of Large Language Models

    Authors: George Kour, Marcel Zalmanovici, Naama Zwerdling, Esther Goldbraich, Ora Nova Fandina, Ateret Anaby-Tavor, Orna Raz, Eitan Farchi

    Abstract: As large language models become more prevalent, their possible harmful or inappropriate responses are a cause for concern. This paper introduces a unique dataset containing adversarial examples in the form of questions, which we call AttaQ, designed to provoke such harmful or inappropriate responses. We assess the efficacy of our dataset by analyzing the vulnerabilities of various models when subj… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: To be published in GEM workshop. Conference on Empirical Methods in Natural Language Processing (EMNLP). 2023

    ACM Class: I.2.7

  6. arXiv:2311.01152  [pdf, other

    cs.CL

    Predicting Question-Answering Performance of Large Language Models through Semantic Consistency

    Authors: Ella Rabinovich, Samuel Ackerman, Orna Raz, Eitan Farchi, Ateret Anaby-Tavor

    Abstract: Semantic consistency of a language model is broadly defined as the model's ability to produce semantically-equivalent outputs, given semantically-equivalent inputs. We address the task of assessing question-answering (QA) semantic consistency of contemporary large language models (LLMs) by manually creating a benchmark dataset with high-quality paraphrases for factual questions, and release the da… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: EMNLP2023 GEM workshop, 17 pages

  7. arXiv:2310.14893  [pdf, other

    cs.LG eess.SY stat.AP

    Data Drift Monitoring for Log Anomaly Detection Pipelines

    Authors: Dipak Wani, Samuel Ackerman, Eitan Farchi, Xiaotong Liu, Hau-wen Chang, Sarasi Lalithsena

    Abstract: Logs enable the monitoring of infrastructure status and the performance of associated applications. Logs are also invaluable for diagnosing the root causes of any problems that may arise. Log Anomaly Detection (LAD) pipelines automate the detection of anomalies in logs, providing assistance to site reliability engineers (SREs) in system diagnosis. Log patterns change over time, necessitating updat… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  8. arXiv:2310.14829  [pdf

    cs.CL stat.AP

    Characterizing how 'distributional' NLP corpora distance metrics are

    Authors: Samuel Ackerman, George Kour, Eitan Farchi

    Abstract: A corpus of vector-embedded text documents has some empirical distribution. Given two corpora, we want to calculate a single metric of distance (e.g., Mauve, Frechet Inception) between them. We describe an abstract quality, called `distributionality', of such metrics. A non-distributional metric tends to use very local measurements, or uses global measurements in a way that does not fully reflect… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Published in the August 2023 Joint Statistical Meetings proceedings

  9. arXiv:2305.08115  [pdf, other

    cs.LG stat.AP

    Automatic Generation of Attention Rules For Containment of Machine Learning Model Errors

    Authors: Samuel Ackerman, Axel Bendavid, Eitan Farchi, Orna Raz

    Abstract: Machine learning (ML) solutions are prevalent in many applications. However, many challenges exist in making these solutions business-grade. For instance, maintaining the error rate of the underlying ML models at an acceptably low level. Typically, the true relationship between feature inputs and the target feature to be predicted is uncertain, and hence statistical in nature. The approach we prop… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

  10. arXiv:2303.01713  [pdf, other

    cs.LG

    Convex Bounds on the Softmax Function with Applications to Robustness Verification

    Authors: Dennis Wei, Haoze Wu, Min Wu, Pin-Yu Chen, Clark Barrett, Eitan Farchi

    Abstract: The softmax function is a ubiquitous component at the output of neural networks and increasingly in intermediate layers as well. This paper provides convex lower bounds and concave upper bounds on the softmax function, which are compatible with convex optimization formulations for characterizing neural networks and other ML models. We derive bounds using both a natural exponential-reciprocal decom… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: AISTATS 2023

  11. arXiv:2302.03651  [pdf, other

    cs.SE

    Quality Engineering for Agile and DevOps on the Cloud and Edge

    Authors: Eitan Farchi, Saritha Route

    Abstract: Today's software projects include enhancements, fixes, and patches need to be delivered almost on a daily basis to clients. Weekly and daily releases are pretty much the norm and sit alongside larger feature upgrades and quarterly releases. Software delivery has to be more agile now than ever before. Companies that were, in the past, experimenting with agile based delivery models, are now looking… ▽ More

    Submitted 16 February, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

  12. arXiv:2211.16259  [pdf, other

    cs.CL

    Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

    Authors: George Kour, Samuel Ackerman, Orna Raz, Eitan Farchi, Boaz Carmeli, Ateret Anaby-Tavor

    Abstract: The ability to compare the semantic similarity between text corpora is important in a variety of natural language processing applications. However, standard methods for evaluating these metrics have yet to be established. We propose a set of automatic and interpretable measures for assessing the characteristics of corpus-level semantic similarity metrics, allowing sensible comparison of their beha… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Published at GEM (https://gem-benchmark.com/workshop) workshop at the Empirical Methods in Natural Language Processing (EMNLP) conference in 2022

  13. arXiv:2207.13143  [pdf, other

    cs.SE

    Random Test Generation of Application Programming Interfaces

    Authors: Eitan Farchi, Krithika Prakash, Vitali Sokhin

    Abstract: Cloud high quality API (Application Programming Interface) testing is essential for supporting the API economy. Autotest is a random test generator that addresses this need. It reads the API specification and deduces a model used in the test generation. This paper describes Autotest. It also address the topic of API specification pitfalls which Autotest may reveal when reading the specification. A… ▽ More

    Submitted 6 November, 2022; v1 submitted 26 July, 2022; originally announced July 2022.

  14. arXiv:2207.06820  [pdf, other

    cs.DB

    Using Fuzzy Matching of Queries to optimize Database workloads

    Authors: Sweta Singh, Vaibhav Kulkarni, Mario Briggs, Deepak Mahajan, Eitan Farchi

    Abstract: Directed Acyclic Graphs (DAGs) are commonly used in Databases and Big Data computational engines like Apache Spark for representing the execution plan of queries. We refer to such graphs as Query Directed Acyclic Graphs (QDAGs). This paper uses similarity hashing to arrive at a fingerprint such that the fingerprint embodies the compute requirements of the query for QDAGs. The fingerprint, thus obt… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: 9 pages, 5 figures

  15. arXiv:2204.13043  [pdf, other

    cs.HC stat.AP

    High-quality Conversational Systems

    Authors: Samuel Ackerman, Ateret Anaby-Tavor, Eitan Farchi, Esther Goldbraich, George Kour, Ella Rabinovich, Orna Raz, Saritha Route, Marcel Zalmanovici, Naama Zwerdling

    Abstract: Conversational systems or chatbots are an example of AI-Infused Applications (AIIA). Chatbots are especially important as they are often the first interaction of clients with a business and are the entry point of a business into the AI (Artificial Intelligence) world. The quality of the chatbot is, therefore, key. However, as is the case in general with AIIAs, it is especially challenging to asses… ▽ More

    Submitted 28 April, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

  16. Generalized Coverage Criteria for Combinatorial Sequence Testing

    Authors: Achiya Elyasaf, Eitan Farchi, Oded Margalit, Gera Weiss, Yeshayahu Weiss

    Abstract: We present a new model-based approach for testing systems that use sequences of actions and assertions as test vectors. Our solution includes a method for quantifying testing quality, a tool for generating high-quality test suites based on the coverage criteria we propose, and a framework for assessing risks. For testing quality, we propose a method that specifies generalized coverage criteria ove… ▽ More

    Submitted 31 October, 2023; v1 submitted 3 January, 2022; originally announced January 2022.

    Comments: 12 pages, 5 tables, 5 figures, and 2 listing

    Journal ref: in IEEE Transactions on Software Engineering, vol. 49, no. 8, pp. 4023-4034, 24 May 2023

  17. arXiv:2201.00355  [pdf, other

    cs.LG cs.SE

    Theory and Practice of Quality Assurance for Machine Learning Systems An Experiment Driven Approach

    Authors: Samuel Ackerman, Guy Barash, Eitan Farchi, Orna Raz, Onn Shehory

    Abstract: The crafting of machine learning (ML) based systems requires statistical control throughout its life cycle. Careful quantification of business requirements and identification of key factors that impact the business requirements reduces the risk of a project failure. The quantification of business requirements results in the definition of random variables representing the system key performance ind… ▽ More

    Submitted 12 April, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

  18. arXiv:2111.05136  [pdf, other

    stat.AP cs.LG

    Using sequential drift detection to test the API economy

    Authors: Samuel Ackerman, Parijat Dube, Eitan Farchi

    Abstract: The API economy refers to the widespread integration of API (advanced programming interface) microservices, where software applications can communicate with each other, as a crucial element in business models and functions. The number of possible ways in which such a system could be used is huge. It is thus desirable to monitor the usage patterns and identify when the system is used in a way that… ▽ More

    Submitted 25 November, 2021; v1 submitted 9 November, 2021; originally announced November 2021.

  19. arXiv:2110.12506  [pdf, other

    cs.LG cs.AI

    Detecting model drift using polynomial relations

    Authors: Eliran Roffe, Samuel Ackerman, Orna Raz, Eitan Farchi

    Abstract: Machine learning models serve critical functions, such as classifying loan applicants as good or bad risks. Each model is trained under the assumption that the data used in training and in the field come from the same underlying unknown distribution. Often, this assumption is broken in practice. It is desirable to identify when this occurs, to minimize the impact on model performance. We suggest… ▽ More

    Submitted 22 December, 2021; v1 submitted 24 October, 2021; originally announced October 2021.

  20. arXiv:2110.05430  [pdf, other

    cs.LG stat.AP

    Density-based interpretable hypercube region partitioning for mixed numeric and categorical data

    Authors: Samuel Ackerman, Eitan Farchi, Orna Raz, Marcel Zalmanovici, Maya Zohar

    Abstract: Consider a structured dataset of features, such as $\{\textrm{SEX}, \textrm{INCOME}, \textrm{RACE}, \textrm{EXPERIENCE}\}$. A user may want to know where in the feature space observations are concentrated, and where it is sparse or empty. The existence of large sparse or empty regions can provide domain knowledge of soft or hard feature constraints (e.g., what is the typical income range, or that… ▽ More

    Submitted 8 November, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

  21. arXiv:2109.02540  [pdf, other

    cs.AI cs.DC cs.SE

    Towards API Testing Across Cloud and Edge

    Authors: Samuel Ackerman, Sanjib Choudhury, Nirmit Desai, Eitan Farchi, Dan Gisolfi, Andrew Hicks, Saritha Route, Diptikalyan Saha

    Abstract: API economy is driving the digital transformation of business applications across the hybrid Cloud and edge environments. For such transformations to succeed, end-to-end testing of the application API composition is required. Testing of API compositions, even in centralized Cloud environments, is challenging as it requires coverage of functional as well as reliability requirements. The combinatori… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  22. arXiv:2108.05319  [pdf, other

    cs.LG stat.AP

    Machine Learning Model Drift Detection Via Weak Data Slices

    Authors: Samuel Ackerman, Parijat Dube, Eitan Farchi, Orna Raz, Marcel Zalmanovici

    Abstract: Detecting drift in performance of Machine Learning (ML) models is an acknowledged challenge. For ML models to become an integral part of business applications it is essential to detect when an ML model drifts away from acceptable operation. However, it is often the case that actual labels are difficult and expensive to get, for example, because they require expert judgment. Therefore, there is a n… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

    Journal ref: DeepTest workshop of ICSE, 2021

  23. arXiv:2105.01560  [pdf, other

    cs.LG

    Broadly Applicable Targeted Data Sample Omission Attacks

    Authors: Guy Barash, Eitan Farchi, Sarit Kraus, Onn Shehory

    Abstract: We introduce a novel clean-label targeted poisoning attack on learning mechanisms. While classical poisoning attacks typically corrupt data via addition, modification and omission, our attack focuses on data omission only. Our attack misclassifies a single, targeted test sample of choice, without manipulating that sample. We demonstrate the effectiveness of omission attacks against a large variety… ▽ More

    Submitted 5 May, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

  24. arXiv:2012.09258  [pdf, other

    stat.AP cs.LG stat.ML

    Detection of data drift and outliers affecting machine learning model performance over time

    Authors: Samuel Ackerman, Eitan Farchi, Orna Raz, Marcel Zalmanovici, Parijat Dube

    Abstract: A trained ML model is deployed on another `test' dataset where target feature values (labels) are unknown. Drift is distribution change between the training and deployment data, which is concerning if model performance changes. For a cat/dog image classifier, for instance, drift during deployment could be rabbit images (new class) or cat/dog images with changed characteristics (change in distribut… ▽ More

    Submitted 6 September, 2022; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: In: JSM Proceedings, Nonparametric Statistics Section, 20202. Philadelphia, PA: American Statistical Association. 144--160

  25. arXiv:2011.13300  [pdf, ps, other

    cs.GT

    A Game Theoretic Model for Strategic Coopetition in Business Networks

    Authors: Segev Wasserkrug, Eitan Farchi

    Abstract: Private blockchain is driving the creation of business networks, resulting in the creation of new value or new business models to the enterprises participating in the network. Such business networks form when enterprises come together to derive value through a network which is greater than the value that can be derived solely by any single company. This results in a setting that combines both comp… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

  26. arXiv:2007.16109  [pdf, other

    stat.AP cs.LG stat.ML

    Sequential Drift Detection in Deep Learning Classifiers

    Authors: Samuel Ackerman, Parijat Dube, Eitan Farchi

    Abstract: We utilize neural network embeddings to detect data drift by formulating the drift detection within an appropriate sequential decision framework. This enables control of the false alarm rate although the statistical tests are repeatedly applied. Since change detection algorithms naturally face a tradeoff between avoiding false alarms and quick correct detection, we introduce a loss function which… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 11 pages + appendix, 7 figures

  27. arXiv:1910.12582  [pdf

    cs.CY

    Engineering Reliable Deep Learning Systems

    Authors: P. Santhanam, Eitan Farchi, Victor Pankratius

    Abstract: Recent progress in artificial intelligence (AI) using deep learning techniques has triggered its wide-scale use across a broad range of applications. These systems can already perform tasks such as natural language processing of voice and text, visual recognition, question-answering, recommendations and decision support. However, at the current level of maturity, the use of an AI component in miss… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: Presented at AAAI FSS-19: Artificial Intelligence in Government and Public Sector, Arlington, Virginia, USA

  28. arXiv:1904.00737  [pdf, other

    cs.CR cs.LG stat.ML

    Defending via strategic ML selection

    Authors: Eitan Farchi, Onn Shehory, Guy Barash

    Abstract: The results of a learning process depend on the input data. There are cases in which an adversary can strategically tamper with the input data to affect the outcome of the learning process. While some datasets are difficult to attack, many others are susceptible to manipulation. A resourceful attacker can tamper with large portions of the dataset and affect them. An attacker can additionally strat… ▽ More

    Submitted 16 January, 2019; originally announced April 2019.

    Comments: EDSMLS 2019 @ AAAI workshop

  29. arXiv:1601.06222  [pdf, other

    cs.SE

    Towards a Human-Centred Approach in Modelling and Testing of Cyber-Physical Systems

    Authors: Maria Spichkova, Anna Zamansky, Eitan Farchi

    Abstract: The ability to capture different levels of abstraction in a system model is especially important for remote integration, testing/verification, and manufacturing of cyber-physical systems (CPSs). However, the complexity of modelling and testing of CPSs makes these processes extremely prone to human error. In this paper we present our ongoing work on introducing human-centred considerations into mod… ▽ More

    Submitted 22 January, 2016; originally announced January 2016.

    Comments: Preprint. Accepted to the Workshop on Automated Testing for Cyber-Physical Systems in the Cloud at ICPADS 2015

  30. arXiv:1507.03687  [pdf, other

    cs.CY cs.LO

    Teaching Logic to Information Systems Students: Challenges and Opportunities

    Authors: Anna Zamansky, Eitan Farchi

    Abstract: In contrast to Computer Science, where the fundamental role of Logic is widely recognized, it plays a practically non-existent role in Information Systems curricula. In this paper we argue that instead of Logic's exclusion from the IS curriculum, a significant adaptation of the contents, as well as teaching methodologies, is required for an alignment with the needs of IS practitioners. We present… ▽ More

    Submitted 13 July, 2015; originally announced July 2015.

    Comments: Proceedings of the Fourth International Conference on Tools for Teaching Logic (TTL2015), Rennes, France, June 9-12, 2015. Editors: M. Antonia Huertas, João Marcos, María Manzano, Sophie Pinchinat, François Schwarzentruber