Skip to main content

Showing 1–9 of 9 results for author: Moskovitch, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17786  [pdf, other

    cs.DB

    Query Refinement for Diverse Top-$k$ Selection

    Authors: Felix S. Campbell, Alon Silberstein, Julia Stoyanovich, Yuval Moskovitch

    Abstract: Database queries are often used to select and rank items as decision support for many applications. As automated decision-making tools become more prevalent, there is a growing recognition of the need to diversify their outcomes. In this paper, we define and study the problem of modifying the selection conditions of an ORDER BY query so that the result of the modified query closely fits some user-… ▽ More

    Submitted 27 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: v2 corrects author order

  2. arXiv:2301.00719  [pdf, other

    cs.LG cs.DB

    Detection of Groups with Biased Representation in Ranking

    Authors: **yang Li, Yuval Moskovitch, H. V. Jagadish

    Abstract: Real-life tools for decision-making in many critical domains are based on ranking results. With the increasing awareness of algorithmic fairness, recent works have presented measures for fairness in ranking. Many of those definitions consider the representation of different ``protected groups'', in the top-$k$ ranked items, for any reasonable $k$. Given the protected groups, confirming algorithmic… ▽ More

    Submitted 6 July, 2023; v1 submitted 30 December, 2022; originally announced January 2023.

  3. arXiv:2210.02943  [pdf, other

    cs.DB

    On Explaining Confounding Bias

    Authors: Brit Youngmann, Michael Cafarella, Yuval Moskovitch, Babak Salimi

    Abstract: When analyzing large datasets, analysts are often interested in the explanations for surprising or unexpected results produced by their queries. In this work, we focus on aggregate SQL queries that expose correlations in the data. A major challenge that hinders the interpretation of such queries is confounding bias, which can lead to an unexpected correlation. We generate explanations in terms of… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  4. arXiv:2103.00288  [pdf, other

    cs.DB

    On Optimizing the Trade-off between Privacy and Utility in Data Provenance

    Authors: Daniel Deutch, Ariel Frankenthal, Amir Gilad, Yuval Moskovitch

    Abstract: Organizations that collect and analyze data may wish or be mandated by regulation to justify and explain their analysis results. At the same time, the logic that they have followed to analyze the data, i.e., their queries, may be proprietary and confidential. Data provenance, a record of the transformations that data underwent, was extensively studied as means of explanations. In contrast, only a… ▽ More

    Submitted 27 February, 2021; originally announced March 2021.

  5. arXiv:2010.16340  [pdf, other

    cs.DB

    Patterns Count-Based Labels for Datasets

    Authors: Yuval Moskovitch, H. V. Jagadish

    Abstract: Counts of attribute-value combinations are central to the profiling of a dataset, particularly in determining fitness for use and in eliminating bias and unfairness. While counts of individual attribute values may be stored in some dataset profiles, there are too many combinations of attributes for it to be practical to store counts for each combination. In this paper, we develop the notion of sto… ▽ More

    Submitted 7 November, 2020; v1 submitted 30 October, 2020; originally announced October 2020.

    Comments: ICDE2021

  6. Towards Inferring Queries from Simple and Partial Provenance Examples

    Authors: Amir Gilad, Yuval Moskovitch

    Abstract: The field of query-by-example aims at inferring queries from output examples given by non-expert users, by finding the underlying logic that binds the examples. However, for a very small set of examples, it is difficult to correctly infer such logic. To bridge this gap, previous work suggested attaching explanations to each output example, modeled as provenance, allowing users to explain the reaso… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

  7. arXiv:2007.05463  [pdf, other

    cs.DB

    Equivalence-Invariant Algebraic Provenance for Hyperplane Update Queries

    Authors: Pierre Bourhis, Daniel Deutch, Yuval Moskovitch

    Abstract: The algebraic approach for provenance tracking, originating in the semiring model of Green et. al, has proven useful as an abstract way of handling metadata. Commutative Semirings were shown to be the "correct" algebraic structure for Union of Conjunctive Queries, in the sense that its use allows provenance to be invariant under certain expected query equivalence axioms. In this paper we present… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Journal ref: Proceedings of the 2020 International Conference on Management of Data, SIGMOD Conference 2020, online conference [Portland, OR, USA], pages: 415--429

  8. arXiv:2007.05400  [pdf, other

    cs.DB

    Hypothetical Reasoning via Provenance Abstraction

    Authors: Daniel Deutch, Yuval Moskovitch, Noam Rinetzky

    Abstract: Data analytics often involves hypothetical reasoning: repeatedly modifying the data and observing the induced effect on the computation result of a data-centric application. Previous work has shown that fine-grained data provenance can help make such an analysis more efficient: instead of a costly re-execution of the underlying application, hypothetical scenarios are applied to a pre-computed prov… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Journal ref: Proceedings of the 2019 International Conference on Management of Data, SIGMOD Conference 2019, Amsterdam, The Netherlands, pages 537--554

  9. arXiv:2007.05389  [pdf, other

    cs.DB

    COBRA: Compression via Abstraction of Provenance for Hypothetical Reasoning

    Authors: Daniel Deutch, Yuval Moskovitch, Noam Rinetzky

    Abstract: Data analytics often involves hypothetical reasoning: repeatedly modifying the data and observing the induced effect on the computation result of a data-centric application. Recent work has proposed to leverage ideas from data provenance tracking towards supporting efficient hypothetical reasoning: instead of a costly re-execution of the underlying application, one may assign values to a pre-compu… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Journal ref: 2019 IEEE 35th International Conference on Data Engineering (ICDE), Macao, Macao, 2019, pp. 2016--2019