Skip to main content

Showing 1–44 of 44 results for author: Stoyanovich, J

.
  1. arXiv:2403.17786  [pdf, other

    cs.DB

    Query Refinement for Diverse Top-$k$ Selection

    Authors: Felix S. Campbell, Alon Silberstein, Julia Stoyanovich, Yuval Moskovitch

    Abstract: Database queries are often used to select and rank items as decision support for many applications. As automated decision-making tools become more prevalent, there is a growing recognition of the need to diversify their outcomes. In this paper, we define and study the problem of modifying the selection conditions of an ORDER BY query so that the result of the modified query closely fits some user-… ▽ More

    Submitted 27 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: v2 corrects author order

  2. arXiv:2401.16744  [pdf, other

    cs.AI cs.CY

    ShaRP: Explaining Rankings with Shapley Values

    Authors: Venetia Pliatsika, Joao Fonseca, Tilun Wang, Julia Stoyanovich

    Abstract: Algorithmic decisions in critical domains such as hiring, college admissions, and lending are often based on rankings. Because of the impact these decisions have on individuals, organizations, and population groups, there is a need to understand them: to know whether the decisions are abiding by the law, to help individuals improve their rankings, and to design better ranking procedures. In this… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 8 pages

  3. arXiv:2401.16088  [pdf, other

    cs.LG cs.CY

    Fairness in Algorithmic Recourse Through the Lens of Substantive Equality of Opportunity

    Authors: Andrew Bell, Joao Fonseca, Carlo Abrate, Francesco Bonchi, Julia Stoyanovich

    Abstract: Algorithmic recourse -- providing recommendations to those affected negatively by the outcome of an algorithmic system on how they can take action and change that outcome -- has gained attention as a means of giving persons agency in their interactions with artificial intelligence (AI) systems. Recent work has shown that even if an AI decision-making classifier is ``fair'' (according to some reaso… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  4. arXiv:2401.13935  [pdf, other

    cs.AI cs.CY stat.ML

    A New Paradigm for Counterfactual Reasoning in Fairness and Recourse

    Authors: Lucius E. J. Bynum, Joshua R. Loftus, Julia Stoyanovich

    Abstract: Counterfactuals and counterfactual reasoning underpin numerous techniques for auditing and understanding artificial intelligence (AI) systems. The traditional paradigm for counterfactual reasoning in this literature is the interventional counterfactual, where hypothetical interventions are imagined and simulated. For this reason, the starting point for causal reasoning about legal protections and… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  5. arXiv:2312.11712  [pdf, other

    cs.CR cs.LG

    A Simple and Practical Method for Reducing the Disparate Impact of Differential Privacy

    Authors: Lucas Rosenblatt, Julia Stoyanovich, Christopher Musco

    Abstract: Differentially private (DP) mechanisms have been deployed in a variety of high-impact social settings (perhaps most notably by the U.S. Census). Since all DP mechanisms involve adding noise to results of statistical queries, they are expected to impact our ability to accurately analyze and learn from data, in effect trading off privacy with utility. Alarmingly, the impact of DP on utility can vary… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  6. arXiv:2309.06969  [pdf, other

    cs.LG cs.AI cs.CY

    Setting the Right Expectations: Algorithmic Recourse Over Time

    Authors: Joao Fonseca, Andrew Bell, Carlo Abrate, Francesco Bonchi, Julia Stoyanovich

    Abstract: Algorithmic systems are often called upon to assist in high-stakes decision making. In light of this, algorithmic recourse, the principle wherein individuals should be able to take action against an undesirable outcome made by an algorithmic system, is receiving growing attention. The bulk of the literature on algorithmic recourse to-date focuses primarily on how to provide recourse to a single in… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  7. arXiv:2302.08704  [pdf, other

    cs.LG cs.CY

    The Unbearable Weight of Massive Privilege: Revisiting Bias-Variance Trade-Offs in the Context of Fair Prediction

    Authors: Falaah Arif Khan, Julia Stoyanovich

    Abstract: In this paper we revisit the bias-variance decomposition of model error from the perspective of designing a fair classifier: we are motivated by the widely held socio-technical belief that noise variance in large datasets in social domains tracks demographic characteristics such as gender, race, disability, etc. We propose a conditional-iid (ciid) model built from group-specific classifiers that s… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  8. arXiv:2302.06347  [pdf, other

    cs.LG

    The Possibility of Fairness: Revisiting the Impossibility Theorem in Practice

    Authors: Andrew Bell, Lucius Bynum, Nazarii Drushchak, Tetiana Herasymova, Lucas Rosenblatt, Julia Stoyanovich

    Abstract: The ``impossibility theorem'' -- which is considered foundational in algorithmic fairness literature -- asserts that there must be trade-offs between common notions of fairness and performance when fitting statistical models, except in two special cases: when the prevalence of the outcome being predicted is equal across groups, or when a perfectly accurate predictor is used. However, theory does n… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 14 pages, 3 figures, 1 table

  9. arXiv:2302.04525  [pdf, other

    cs.LG cs.AI cs.CY

    On Fairness and Stability: Is Estimator Variance a Friend or a Foe?

    Authors: Falaah Arif Khan, Denys Herasymuk, Julia Stoyanovich

    Abstract: The error of an estimator can be decomposed into a (statistical) bias term, a variance term, and an irreducible noise term. When we do bias analysis, formally we are asking the question: "how good are the predictions?" The role of bias in the error decomposition is clear: if we trust the labels/targets, then we would want the estimator to have as low bias as possible, in order to minimize error. F… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  10. arXiv:2212.03974  [pdf, other

    cs.AI cs.CY cs.LG stat.ME stat.ML

    Counterfactuals for the Future

    Authors: Lucius E. J. Bynum, Joshua R. Loftus, Julia Stoyanovich

    Abstract: Counterfactuals are often described as 'retrospective,' focusing on hypothetical alternatives to a realized past. This description relates to an often implicit assumption about the structure and stability of exogenous variables in the system being modeled -- an assumption that is reasonable in many settings where counterfactuals are used. In this work, we consider cases where we might reasonably m… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  11. arXiv:2211.02932  [pdf, other

    cs.HC

    Rankers, Rankees, & Rankings: Peeking into the Pandora's Box from a Socio-Technical Perspective

    Authors: Jun Yuan, Julia Stoyanovich, Aritra Dasgupta

    Abstract: Algorithmic rankers have a profound impact on our increasingly data-driven society. From leisurely activities like the movies that we watch, the restaurants that we patronize; to highly consequential decisions, like making educational and occupational choices or getting hired by companies -- these are all driven by sophisticated yet mostly inaccessible rankers. A small change to how these algorith… ▽ More

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: Accepted for Interrogating Human-Centered Data Science workshop at CHI'22

  12. arXiv:2208.12700  [pdf, other

    cs.CR cs.CY

    Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy

    Authors: Lucas Rosenblatt, Bernease Herman, Anastasia Holovenko, Wonkwon Lee, Joshua Loftus, Elizabeth McKinnie, Taras Rumezhak, Andrii Stadnik, Bill Howe, Julia Stoyanovich

    Abstract: Differential privacy (DP) data synthesizers support public release of sensitive information, offering theoretical guarantees for privacy but limited evidence of utility in practical settings. Utility is typically measured as the error on representative proxy tasks, such as descriptive statistics, accuracy of trained classifiers, or performance over a query workload. The ability for these results t… ▽ More

    Submitted 31 May, 2023; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: Preprint. 14 pages

  13. arXiv:2207.02912  [pdf, other

    cs.CY cs.AI cs.LG

    Towards Substantive Conceptions of Algorithmic Fairness: Normative Guidance from Equal Opportunity Doctrines

    Authors: Falaah Arif Khan, Eleni Manis, Julia Stoyanovich

    Abstract: In this work we use Equal Oppportunity (EO) doctrines from political philosophy to make explicit the normative judgements embedded in different conceptions of algorithmic fairness. We contrast formal EO approaches that narrowly focus on fair contests at discrete decision points, with substantive EO doctrines that look at people's fair life chances more holistically over the course of a lifetime. W… ▽ More

    Submitted 10 July, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

  14. arXiv:2207.01482  [pdf, other

    cs.CY cs.HC cs.LG

    Think About the Stakeholders First! Towards an Algorithmic Transparency Playbook for Regulatory Compliance

    Authors: Andrew Bell, Oded Nov, Julia Stoyanovich

    Abstract: Increasingly, laws are being proposed and passed by governments around the world to regulate Artificial Intelligence (AI) systems implemented into the public and private sectors. Many of these regulations address the transparency of AI systems, and related citizen-aware issues like allowing individuals to have the right to an explanation about how an AI system makes a decision that impacts them. Y… ▽ More

    Submitted 10 June, 2022; originally announced July 2022.

  15. arXiv:2205.14269  [pdf, other

    cs.DB

    Temporal graph patterns by timed automata

    Authors: Amir Pouya Aghasadeghi, Jan Van den Bussche, Julia Stoyanovich

    Abstract: Temporal graphs represent graph evolution over time, and have been receiving considerable research attention. Work on expressing temporal graph patterns or discovering temporal motifs typically assumes relatively simple temporal constraints, such as journeys or, more generally, existential constraints, possibly with finite delays. In this paper we propose to use timed automata to express temporal… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  16. arXiv:2204.12903  [pdf, other

    cs.LG cs.CR

    Spending Privacy Budget Fairly and Wisely

    Authors: Lucas Rosenblatt, Joshua Allen, Julia Stoyanovich

    Abstract: Differentially private (DP) synthetic data generation is a practical method for improving access to data as a means to encourage productive partnerships. One issue inherent to DP is that the "privacy budget" is generally "spent" evenly across features in the data set. This leads to good statistical parity with the real data, but can undervalue the conditional probabilities and marginals that are c… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

  17. arXiv:2201.09151  [pdf, other

    cs.CY cs.AI cs.LG

    An External Stability Audit Framework to Test the Validity of Personality Prediction in AI Hiring

    Authors: Alene K. Rhea, Kelsey Markey, Lauren D'Arinzo, Hilke Schellmann, Mona Sloane, Paul Squires, Falaah Arif Kahn, Julia Stoyanovich

    Abstract: Automated hiring systems are among the fastest-develo** of all high-stakes AI systems. Among these are algorithmic personality tests that use insights from psychometric testing, and promise to surface personality traits indicative of future success based on job seekers' resumes or social media profiles. We interrogate the validity of such systems using stability of the outputs they produce, noti… ▽ More

    Submitted 11 April, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

  18. arXiv:2107.01241  [pdf, other

    cs.DB

    Temporal Regular Path Queries

    Authors: Marcelo Arenas, Pedro Bahamondes, Amir Aghasadeghi, Julia Stoyanovich

    Abstract: In the last decade, substantial progress has been made towards standardizing the syntax of graph query languages, and towards understanding their semantics and complexity of evaluation. In this paper, we consider temporal property graphs (TPGs) and propose temporal regular path queries (TRPQs) that incorporate time into TPG navigation. Starting with design principles, we propose a natural syntacti… ▽ More

    Submitted 9 March, 2022; v1 submitted 2 July, 2021; originally announced July 2021.

  19. arXiv:2107.00593  [pdf, other

    cs.LG cs.AI cs.CY stat.AP stat.ML

    Disaggregated Interventions to Reduce Inequality

    Authors: Lucius E. J. Bynum, Joshua R. Loftus, Julia Stoyanovich

    Abstract: A significant body of research in the data sciences considers unfair discrimination against social categories such as race or gender that could occur or be amplified as a result of algorithmic decisions. Simultaneously, real-world disparities continue to exist, even before algorithmic decisions are made. In this work, we draw on insights from the social sciences brought into the realm of causal mo… ▽ More

    Submitted 7 December, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

  20. arXiv:2106.08259  [pdf, other

    cs.CY cs.AI cs.LG

    Fairness as Equality of Opportunity: Normative Guidance from Political Philosophy

    Authors: Falaah Arif Khan, Eleni Manis, Julia Stoyanovich

    Abstract: Recent interest in codifying fairness in Automated Decision Systems (ADS) has resulted in a wide range of formulations of what it means for an algorithmic system to be fair. Most of these propositions are inspired by, but inadequately grounded in, political philosophy scholarship. This paper aims to correct that deficit. We introduce a taxonomy of fairness ideals using doctrines of Equality of Opp… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

  21. Most Expected Winner: An Interpretation of Winners over Uncertain Voter Preferences

    Authors: Haoyue **, Julia Stoyanovich

    Abstract: It remains an open question how to determine the winner of an election when voter preferences are incomplete or uncertain. One option is to assume some probability space over the voting profile and select the Most Probable Winner (MPW) -- the candidate or candidates with the best chance of winning. In this paper, we propose an alternative winner interpretation, selecting the Most Expected Winner (… ▽ More

    Submitted 25 April, 2023; v1 submitted 30 April, 2021; originally announced May 2021.

    Comments: This is the technical report of the following paper: Haoyue ** and Julia Stoyanovich. 2023. Most Expected Winner: An Interpretation of Winners over Uncertain Voter Preferences. Proc. ACM Manag. Data, 1, N1, Article 22 (May 2023), 33 pages. https://doi.org/10.1145/3588702

    Journal ref: Proc. ACM Manag. Data, 1, N1, Article 22 (May 2023), 33 pages (2023)

  22. Fairness in Ranking: A Survey

    Authors: Meike Zehlike, Ke Yang, Julia Stoyanovich

    Abstract: In the past few years, there has been much work on incorporating fairness requirements into algorithmic rankers, with contributions coming from the data management, algorithms, information retrieval, and recommender systems communities. In this survey we give a systematic overview of this work, offering a broad perspective that connects formalizations and algorithmic approaches across subfields. A… ▽ More

    Submitted 12 August, 2022; v1 submitted 25 March, 2021; originally announced March 2021.

    Comments: 72 pages. ACM CSUR (2022)

    ACM Class: I.2.6; I.2.8; H.3.3

  23. arXiv:2006.08688  [pdf, other

    cs.LG cs.AI stat.AP stat.ML

    Causal intersectionality for fair ranking

    Authors: Ke Yang, Joshua R. Loftus, Julia Stoyanovich

    Abstract: In this paper we propose a causal modeling approach to intersectional fairness, and a flexible, task-specific method for computing intersectionally fair rankings. Rankings are used in many contexts, ranging from Web search results to college admissions, but causal inference for fair rankings has received limited attention. Additionally, the growing literature on causal fairness has directed little… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  24. arXiv:2005.06779  [pdf, other

    cs.GT

    Algorithmic Techniques for Necessary and Possible Winners

    Authors: Vishal Chakraborty, Theo Delemazure, Benny Kimelfeld, Phokion G. Kolaitis, Kunal Relia, Julia Stoyanovich

    Abstract: We investigate the practical aspects of computing the necessary and possible winners in elections over incomplete voter preferences. In the case of the necessary winners, we show how to implement and accelerate the polynomial-time algorithm of Xia and Conitzer. In the case of the possible winners, where the problem is NP-hard, we give a natural reduction to Integer Linear Programming (ILP) for all… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

  25. arXiv:2003.06984  [pdf, other

    cs.DB

    Supporting Hard Queries over Probabilistic Preferences

    Authors: Haoyue **, Julia Stoyanovich, Benny Kimelfeld

    Abstract: Preference analysis is widely applied in various domains such as social choice and e-commerce. A recently proposed framework augments the relational database with a preference relation that represents uncertain preferences in the form of statistical ranking models, and provides methods to evaluate Conjunctive Queries (CQs) that express preferences among item attributes. In this paper, we explore t… ▽ More

    Submitted 15 March, 2020; originally announced March 2020.

    Comments: This is the technical report of the following paper: Supporting Hard Queries over Probabilistic Preferences. PVLDB, 13(7): 1134-1146, 2019. DOI: https://doi.org/10.14778/3384345.3384359

  26. arXiv:1912.10564  [pdf, other

    cs.CY cs.AI cs.LG

    Teaching Responsible Data Science: Charting New Pedagogical Territory

    Authors: Julia Stoyanovich, Armanda Lewis

    Abstract: Although numerous ethics courses are available, with many focusing specifically on technology and computer ethics, pedagogical approaches employed in these courses rely exclusively on texts rather than on software development or data analysis. Technical students often consider these courses unimportant and a distraction from the "real" material. To develop instructional materials and methodologies… ▽ More

    Submitted 22 December, 2019; originally announced December 2019.

  27. arXiv:1911.12587  [pdf, other

    cs.LG cs.CY cs.DB stat.ML

    FairPrep: Promoting Data to a First-Class Citizen in Studies on Fairness-Enhancing Interventions

    Authors: Sebastian Schelter, Yuxuan He, Jatin Khilnani, Julia Stoyanovich

    Abstract: The importance of incorporating ethics and legal compliance into machine-assisted decision-making is broadly recognized. Further, several lines of recent work have argued that critical opportunities for improving data quality and representativeness, controlling for bias, and allowing humans to oversee and impact computational processes are missed if we do not consider the lifecycle stages upstream… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

  28. arXiv:1906.01747  [pdf, other

    cs.AI cs.CY

    Balanced Ranking with Diversity Constraints

    Authors: Ke Yang, Vasilis Gkatzelis, Julia Stoyanovich

    Abstract: Many set selection and ranking algorithms have recently been enhanced with diversity constraints that aim to explicitly increase representation of historically disadvantaged populations, or to improve the overall representativeness of the selected set. An unintended consequence of these constraints, however, is reduced in-group fairness: the selected candidates from a given group may not be the be… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: to appear in IJCAI 2019

  29. arXiv:1903.03683  [pdf, ps, other

    cs.DB cs.CY

    Transparency, Fairness, Data Protection, Neutrality: Data Management Challenges in the Face of New Regulation

    Authors: Serge Abiteboul, Julia Stoyanovich

    Abstract: The data revolution continues to transform every sector of science, industry and government. Due to the incredible impact of data-driven technology on society, we are becoming increasingly aware of the imperative to use data and algorithms responsibly -- in accordance with laws and ethical norms. In this article we discuss three recent regulatory frameworks: the European Union's General Data Prote… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

    Comments: To appear in the ACM Journal of Data and Information Quality (JDIQ)

  30. MobilityMirror: Bias-Adjusted Transportation Datasets

    Authors: Luke Rodriguez, Babak Salimi, Haoyue **, Julia Stoyanovich, Bill Howe

    Abstract: We describe customized synthetic datasets for publishing mobility data. Private companies are providing new transportation modalities, and their data is of high value for integrative transportation research, policy enforcement, and public accountability. However, these companies are disincentivized from sharing data not only to protect the privacy of individuals (drivers and/or passengers), but al… ▽ More

    Submitted 24 January, 2019; v1 submitted 21 August, 2018; originally announced August 2018.

    Comments: Presented at BIDU 2018 workshop and published in Springer Communications in Computer and Information Science vol 926

    Journal ref: Big Social Data and Urban Computing. BiDU 2018. Communications in Computer and Information Science, vol 926. Springer, Cham

  31. arXiv:1805.04156  [pdf, ps, other

    cs.DB cs.AI

    Computational Social Choice Meets Databases

    Authors: Benny Kimelfeld, Phokion G. Kolaitis, Julia Stoyanovich

    Abstract: We develop a novel framework that aims to create bridges between the computational social choice and the database management communities. This framework enriches the tasks currently supported in computational social choice with relational database context, thus making it possible to formulate sophisticated queries about voting rules, candidates, voters, issues, and positions. At the conceptual lev… ▽ More

    Submitted 10 May, 2018; originally announced May 2018.

    Comments: This is an extended version of "Computational Social Choice Meets Databases" by Kimelfeld, Kolaitis and Stoyanovich, to appear in IJCAI 2018

  32. On Obtaining Stable Rankings

    Authors: Abolfazl Asudeh, H. V. Jagadish, Gerome Miklau, Julia Stoyanovich

    Abstract: Decision making is challenging when there is more than one criterion to consider. In such cases, it is common to assign a goodness score to each item as a weighted sum of its attribute values and rank them accordingly. Clearly, the ranking obtained depends on the weights used for this summation. Ideally, one would want the ranked order not to change if the weights are changed slightly. We call thi… ▽ More

    Submitted 18 December, 2018; v1 submitted 29 April, 2018; originally announced April 2018.

    Journal ref: Abolfazl Asudeh, H. V. Jagadish, Gerome Miklau, Julia Stoyanovich. On Obtaining Stable Rankings. PVLDB , 12(3): 237-250, 2018

  33. arXiv:1804.07890  [pdf, other

    cs.CY cs.DB cs.HC

    A Nutritional Label for Rankings

    Authors: Ke Yang, Julia Stoyanovich, Abolfazl Asudeh, Bill Howe, HV Jagadish, Gerome Miklau

    Abstract: Algorithmic decisions often result in scoring and ranking individuals to determine credit worthiness, qualifications for college admissions and employment, and compatibility as dating partners. While automatic and seemingly objective, ranking algorithms can discriminate against individuals and protected groups, and exhibit low diversity. Furthermore, ranked results are often unstable --- small cha… ▽ More

    Submitted 21 April, 2018; originally announced April 2018.

    Comments: 4 pages, SIGMOD demo, 3 figuress, ACM SIGMOD 2018

    MSC Class: 68U01; 68P01 ACM Class: H.2, H.2.8, K.4.1

  34. Designing Fair Ranking Schemes

    Authors: Abolfazl Asudeh, H. V. Jagadish, Julia Stoyanovich, Gautam Das

    Abstract: Items from a database are often ranked based on a combination of multiple criteria. A user may have the flexibility to accept combinations that weigh these criteria differently, within limits. On the other hand, this choice of weights can greatly affect the fairness of the produced ranking. In this paper, we develop a system that helps users choose criterion weights that lead to greater fairness.… ▽ More

    Submitted 4 January, 2018; v1 submitted 27 December, 2017; originally announced December 2017.

  35. arXiv:1710.08874  [pdf, other

    cs.CY

    Synthetic Data for Social Good

    Authors: Bill Howe, Julia Stoyanovich, Haoyue **, Bernease Herman, Matt Gee

    Abstract: Data for good implies unfettered access to data. But data owners must be conservative about how, when, and why they share data or risk violating the trust of the people they aim to help, losing their funding, or breaking the law. Data sharing agreements can help prevent privacy violations, but require a level of specificity that is premature during preliminary discussions, and can take over a year… ▽ More

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: Presented at the Data For Good Exchange 2017

  36. arXiv:1709.06176  [pdf, other

    cs.DB

    Zooming in on NYC taxi data with Portal

    Authors: Julia Stoyanovich, Matthew Gilbride, Vera Zaychik Moffitt

    Abstract: In this paper we develop a methodology for analyzing transportation data at different levels of temporal and geographic granularity, and apply our methodology to the TLC Trip Record Dataset, made publicly available by the NYC Taxi & Limousine Commission. This data is naturally represented by a set of trajectories, annotated with time and with additional information such as passenger count and cost… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: Presented at Data Science for Social Good (DSSG) 2017: https://dssg.uchicago.edu/data-science-for-social-good-conference-2017/

  37. arXiv:1701.09007  [pdf, other

    cs.DB

    Research Directions for Principles of Data Management (Dagstuhl Perspectives Workshop 16151)

    Authors: Serge Abiteboul, Marcelo Arenas, Pablo Barceló, Meghyn Bienvenu, Diego Calvanese, Claire David, Richard Hull, Eyke Hüllermeier, Benny Kimelfeld, Leonid Libkin, Wim Martens, Tova Milo, Filip Murlak, Frank Neven, Magdalena Ortiz, Thomas Schwentick, Julia Stoyanovich, Jianwen Su, Dan Suciu, Victor Vianu, Ke Yi

    Abstract: In April 2016, a community of researchers working in the area of Principles of Data Management (PDM) joined in a workshop at the Dagstuhl Castle in Germany. The workshop was organized jointly by the Executive Committee of the ACM Symposium on Principles of Database Systems (PODS) and the Council of the International Conference on Database Theory (ICDT). The mission of this workshop was to identify… ▽ More

    Submitted 31 January, 2017; originally announced January 2017.

  38. arXiv:1610.08559  [pdf, other

    cs.DB

    Measuring Fairness in Ranked Outputs

    Authors: Ke Yang, Julia Stoyanovich

    Abstract: Ranking and scoring are ubiquitous. We consider the setting in which an institution, called a ranker, evaluates a set of individuals based on demographic, behavioral or other characteristics. The final output is a ranking that represents the relative quality of the individuals. While automatic and therefore seemingly objective, rankers can, and often do, discriminate against individuals and system… ▽ More

    Submitted 26 October, 2016; originally announced October 2016.

    Comments: 5 pages, 7 figures, FATML 2016

  39. arXiv:1602.00773  [pdf, other

    cs.DB cs.DC

    Querying Evolving Graphs with Portal

    Authors: Vera Zaychik Moffitt, Julia Stoyanovich

    Abstract: Graphs are used to represent a plethora of phenomena, from the Web and social networks, to biological pathways, to semantic knowledge bases. Arguably the most interesting and important questions one can ask about graphs have to do with their evolution. Which Web pages are showing an increasing popularity trend? How does influence propagate in social networks? How does knowledge evolve? This pape… ▽ More

    Submitted 12 December, 2016; v1 submitted 1 February, 2016; originally announced February 2016.

    Comments: 12 pages plus appendix. Submitted to SIGMOD 2017

  40. arXiv:1307.8269  [pdf, ps, other

    cs.DB

    Introducing Access Control in Webdamlog

    Authors: Serge Abiteboul, Émilien Antoine, Gerome Miklau, Julia Stoyanovich, Vera Zaychik Moffitt

    Abstract: We survey recent work on the specification of an access control mechanism in a collaborative environment. The work is presented in the context of the WebdamLog language, an extension of datalog to a distributed context. We discuss a fine-grained access control mechanism for intentional data based on provenance as well as a control mechanism for delegation, i.e., for deploying rules at remote peers… ▽ More

    Submitted 31 July, 2013; originally announced July 2013.

    Comments: Proceedings of the 14th International Symposium on Database Programming Languages (DBPL 2013), August 30, 2013, Riva del Garda, Trento, Italy

  41. arXiv:1305.4195  [pdf, other

    cs.DB

    Search and Result Presentation in Scientific Workflow Repositories

    Authors: Susan B. Davidson, Xiaocheng Huang, Julia Stoyanovich, Xiaojie Yuan

    Abstract: We study the problem of searching a repository of complex hierarchical workflows whose component modules, both composite and atomic, have been annotated with keywords. Since keyword search does not use the graph structure of a workflow, we develop a model of workflows using context-free bag grammars. We then give efficient polynomial-time algorithms that, given a workflow and a keyword query, dete… ▽ More

    Submitted 9 July, 2013; v1 submitted 17 May, 2013; originally announced May 2013.

  42. arXiv:1305.3058  [pdf, ps, other

    cs.DB

    Rule-Based Application Development using Webdamlog

    Authors: Serge Abiteboul, Émilien Antoine, Gerome Miklau, Julia Stoyanovich, Jules Testard

    Abstract: We present the WebdamLog system for managing distributed data on the Web in a peer-to-peer manner. We demonstrate the main features of the system through an application called Wepic for sharing pictures between attendees of the sigmod conference. Using Wepic, the attendees will be able to share, download, rate and annotate pictures in a highly decentralized manner. We show how WebdamLog handles he… ▽ More

    Submitted 14 May, 2013; originally announced May 2013.

    Comments: SIGMOD - Special Interest Group on Management Of Data (2013)

  43. arXiv:1304.4187  [pdf, ps, other

    cs.DB

    The Webdamlog System Managing Distributed Knowledge on the Web

    Authors: Serge Abiteboul, Émilien Antoine, Julia Stoyanovich

    Abstract: We study the use of WebdamLog, a declarative high-level lan- guage in the style of datalog, to support the distribution of both data and knowledge (i.e., programs) over a network of au- tonomous peers. The main novelty of WebdamLog compared to datalog is its use of delegation, that is, the ability for a peer to communicate a program to another peer. We present results of a user study, showing that… ▽ More

    Submitted 15 April, 2013; originally announced April 2013.

  44. arXiv:1201.0231  [pdf, other

    cs.DB

    Putting Lipstick on Pig: Enabling Database-style Workflow Provenance

    Authors: Yael Amsterdamer, Susan B. Davidson, Daniel Deutch, Tova Milo, Julia Stoyanovich, Val Tannen

    Abstract: Workflow provenance typically assumes that each module is a "black-box", so that each output depends on all inputs (coarse-grained dependencies). Furthermore, it does not model the internal state of a module, which can change between repeated executions. In practice, however, an output may depend on only a small subset of the inputs (fine-grained dependencies) as well as on the internal state of t… ▽ More

    Submitted 31 December, 2011; originally announced January 2012.

    Comments: VLDB2012

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 4, pp. 346-357 (2011)