Skip to main content

Showing 1–16 of 16 results for author: Szlichta, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.13000  [pdf, other

    cs.CL cs.AI cs.IR

    RAGE Against the Machine: Retrieval-Augmented LLM Explanations

    Authors: Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: This paper demonstrates RAGE, an interactive tool for explaining Large Language Models (LLMs) augmented with retrieval capabilities; i.e., able to query external sources and pull relevant information into their input context. Our explanations are counterfactual in the sense that they identify parts of the input context that, when removed, change the answer to the question posed to the LLM. RAGE in… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Accepted by ICDE 2024 (Demonstration Track)

  2. arXiv:2405.12881  [pdf, other

    cs.DB cs.AI

    Explaining Expert Search and Team Formation Systems with ExES

    Authors: Kiarash Golzadeh, Lukasz Golab, Jaroslaw Szlichta

    Abstract: Expert search and team formation systems operate on collaboration networks, with nodes representing individuals, labeled with their skills, and edges denoting collaboration relationships. Given a keyword query corresponding to the desired skills, these systems identify experts that best match the query. However, state-of-the-art solutions to this problem lack transparency. To address this issue, w… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. CREDENCE: Counterfactual Explanations for Document Ranking

    Authors: Joel Rorseth, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: Towards better explainability in the field of information retrieval, we present CREDENCE, an interactive tool capable of generating counterfactual explanations for document rankers. Embracing the unique properties of the ranking problem, we present counterfactual explanations in terms of document perturbations, query perturbations, and even other documents. Additionally, users may build and test t… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted by ICDE 2023 (Demonstration Track)

  4. arXiv:2105.08105  [pdf, other

    cs.DB

    Discovery and Contextual Data Cleaning with Ontology Functional Dependencies

    Authors: Zheng Zheng, Longtao Zheng, Morteza Alipour Langouri, Fei Chiang, Lukasz Golab, Jaroslaw Szlichta

    Abstract: Functional Dependencies (FDs) define attribute relationships based on syntactic equality, and, when usedin data cleaning, they erroneously label syntactically different but semantically equivalent values as errors. We explore dependency-based data cleaning with Ontology Functional Dependencies(OFDs), which express semantic attribute relationships such as synonyms and is-a hierarchies defined by an… ▽ More

    Submitted 12 March, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

  5. arXiv:2101.02174  [pdf, other

    cs.DB

    Efficient Discovery of Approximate Order Dependencies

    Authors: Reza Karegar, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: Order dependencies (ODs) capture relationships between ordered domains of attributes. Approximate ODs (AODs) capture such relationships even when there exist exceptions in the data. During automated discovery of ODs, validation is the process of verifying whether an OD holds. We present an algorithm for validating approximate ODs with significantly improved runtime performance over existing method… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

  6. arXiv:2008.07577  [pdf, other

    cs.LG cs.IR stat.ML

    Joint Variational Autoencoders for Recommendation with Implicit Feedback

    Authors: Bahare Askari, Jaroslaw Szlichta, Amirali Salehi-Abari

    Abstract: Variational Autoencoders (VAEs) have recently shown promising performance in collaborative filtering with implicit feedback. These existing recommendation models learn user representations to reconstruct or predict user preferences. We introduce joint variational autoencoders (JoVA), an ensemble of two VAEs, in which VAEs jointly learn both user and item representations and collectively reconstruc… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

  7. arXiv:2005.14068  [pdf, other

    cs.DB

    Discovering Domain Orders through Order Dependencies

    Authors: Reza Karegar, Melicaalsadat Mirsafian, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: Much real-world data come with explicitly defined domain orders; e.g., lexicographic order for strings, numeric for integers, and chronological for time. Our goal is to discover implicit domain orders that we do not already know; for instance, that the order of months in the Chinese Lunar calendar is Corner < Apricot < Peach. To do so, we enhance data profiling methods by discovering implicit doma… ▽ More

    Submitted 7 September, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

  8. arXiv:1905.11948  [pdf, other

    cs.DB

    ABC of Order Dependencies

    Authors: Pei Li, Michael Bohlen, Jaroslaw Szlichta, Divesh Srivastava

    Abstract: We enhance constrained-based data quality with approximate band conditional order dependencies (abcODs). Band ODs model the semantics of attributes that are monotonically related with small variations without there being an intrinsic violation of semantics. The class of abcODs generalizes band ODs to make them more relevant to real-world applications by relaxing them to hold approximately (abODs)… ▽ More

    Submitted 28 February, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  9. arXiv:1905.02010  [pdf, other

    cs.DB

    Errata Note: Discovering Order Dependencies through Order Compatibility

    Authors: Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta

    Abstract: A number of extensions to the classical notion of functional dependencies have been proposed to express and enforce application semantics. One of these extensions is that of order dependencies (ODs), which express rules involving order. The article entitled "Discovering Order Dependencies through Order Compatibility" by Consonni et al., published in the EDBT conference proceedings in March 2019, i… ▽ More

    Submitted 6 May, 2019; originally announced May 2019.

    Comments: 5

  10. arXiv:1901.02049  [pdf, other

    cs.DB

    Guided Automated Learning for query workload re-Optimization

    Authors: Guilherme Damasio, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Alexandar Mihaylov, Jaroslaw Szlichta, Calisto Zuzarte

    Abstract: Query optimization is a hallmark of database systems enabling complex SQL queries of today's applications to be run efficiently. The query optimizer often fails to find the best plan, when logical subtleties in business queries and schemas circumvent it. When a query runs more expensively than is viable or warranted, determination of the performance issues is usually performed manually in consulta… ▽ More

    Submitted 22 May, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

    Comments: 12 pages, 15 figures

  11. arXiv:1611.02992  [pdf, other

    cs.SI

    Authority-based Team Discovery in Social Networks

    Authors: Morteza Zihayat, Aijun An, Lukasz Golab, Mehdi Kargar, Jaroslaw Szlichta

    Abstract: Given a social network of experts, we address the problem of discovering a team of experts that collectively holds a set of skills required to complete a given project. Most prior work ranks possible solutions by communication cost, represented by edge weights in the expert network. Our contribution is to take experts authority into account, represented by node weights. We formulate several proble… ▽ More

    Submitted 15 November, 2016; v1 submitted 8 November, 2016; originally announced November 2016.

    Comments: 6 pages

  12. arXiv:1611.02737  [pdf, other

    cs.DB

    Efficient Discovery of Ontology Functional Dependencies

    Authors: Sridevi Baskaran, Alexander Keller, Fei Chiang, Golab Lukasz, Jaroslaw Szlichta

    Abstract: Poor data quality has become a pervasive issue due to the increasing complexity and size of modern datasets. Constraint based data cleaning techniques rely on integrity constraints as a benchmark to identify and correct errors. Data values that do not satisfy the given set of constraints are flagged as dirty, and data updates are made to re-align the data and the constraints. However, many errors… ▽ More

    Submitted 23 May, 2017; v1 submitted 8 November, 2016; originally announced November 2016.

    Comments: 12 pages

  13. arXiv:1608.06169  [pdf, other

    cs.DB

    Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization

    Authors: Jaroslaw Szlichta, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava

    Abstract: Integrity constraints (ICs) provide a valuable tool for expressing and enforcing application semantics. However, formulating constraints manually requires domain expertise, is prone to human errors, and may be excessively time consuming, especially on large datasets. Hence, proposals for automatic discovery have been made for some classes of ICs, such as functional dependencies (FDs), and recently… ▽ More

    Submitted 23 August, 2016; v1 submitted 22 August, 2016; originally announced August 2016.

    Comments: 14 pages

  14. arXiv:1512.06395  [pdf, other

    cs.DB

    Effective Keyword Search in Graphs

    Authors: Mehdi Kargar, Lukasz Golab, Jaroslaw Szlichta

    Abstract: In a node-labeled graph, keyword search finds subtrees of the graph whose nodes contain all of the query keywords. This provides a way to query graph databases that neither requires mastery of a query language such as SPARQL, nor a deep knowledge of the database schema. Previous work ranks answer trees using combinations of structural and content-based metrics, such as path lengths between keyword… ▽ More

    Submitted 29 March, 2016; v1 submitted 20 December, 2015; originally announced December 2015.

    Comments: 7 pages, 9 figures

  15. arXiv:1510.03302  [pdf

    cs.DB

    OptImatch: Semantic Web System with Knowledge Base for Query Performance Problem Determination

    Authors: Guilherme Damasio, Piotr Mierzejewski, Jaroslaw Szlichta, Calisto Zuzarte

    Abstract: Database query performance problem determination is often performed by analyzing query execution plans (QEPs) in addition to other performance data. As the query workloads that organizations run, have become larger and more complex, analyzing QEPs manually even by experts has become a very time consuming. Most performance diagnostic tools help with identifying problematic queries and most query tu… ▽ More

    Submitted 12 October, 2015; originally announced October 2015.

    Comments: 12 pages

  16. arXiv:1208.0084  [pdf, other

    cs.DB

    Fundamentals of Order Dependencies

    Authors: Jaroslaw Szlichta, Parke Godfrey, Jarek Gryz

    Abstract: Dependencies have played a significant role in database design for many years. They have also been shown to be useful in query optimization. In this paper, we discuss dependencies between lexicographically ordered sets of tuples. We introduce formally the concept of order dependency and present a set of axioms (inference rules) for them. We show how query rewrites based on these axioms can be used… ▽ More

    Submitted 31 July, 2012; originally announced August 2012.

    Comments: VLDB2012

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 11, pp. 1220-1231 (2012)