Showing 1–38 of 38 results for author: Kenthapadi, K

  1. arXiv:2312.02337  [pdf, other]

    cs.CL

    Measuring Distributional Shifts in Text: The Advantage of Language Model-Based Embeddings

    Authors: Gyandev Gupta, Bashir Rastegarpanah, Amalendu Iyer, Joshua Rubin, Krishnaram Kenthapadi

    Abstract: An essential part of monitoring machine learning models in production is measuring input and output data drift. In this paper, we present a system for measuring distributional shifts in natural language data and highlight and investigate the potential advantage of using large language models (LLMs) for this problem. Recent advancements in LLMs and their successful adoption in different domains ind…

    Submitted 4 December, 2023; originally announced December 2023.

  2. arXiv:2305.19864  [pdf, other]

    cs.HC cs.LG

    Designing Closed-Loop Models for Task Allocation

    Authors: Vijay Keswani, L. Elisa Celis, Krishnaram Kenthapadi, Matthew Lease

    Abstract: Automatically assigning tasks to people is challenging because human performance can vary across tasks for many reasons. This challenge is further compounded in real-life settings in which no oracle exists to assess the quality of human decisions and task assignments made. Instead, we find ourselves in a "closed" decision-making loop in which the same fallible human decisions we rely on in practic…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted for publication in the International Conference on Hybrid Human-Artificial Intelligence (HHAI) 2023

  3. arXiv:2207.02726  [pdf, other]

    cs.LG cs.AI cs.HC eess.SP

    Towards the Use of Saliency Maps for Explaining Low-Quality Electrocardiograms to End Users

    Authors: Ana Lucic, Sheeraz Ahmad, Amanda Furtado Brinhosa, Vera Liao, Himani Agrawal, Umang Bhatt, Krishnaram Kenthapadi, Alice Xiang, Maarten de Rijke, Nicholas Drabowski

    Abstract: When using medical images for diagnosis, either by clinicians or artificial intelligence (AI) systems, it is important that the images are of high quality. When an image is of low quality, the medical exam that produced the image often needs to be redone. In telemedicine, a common problem is that the quality issue is only flagged once the patient has left the clinic, meaning they must return in or…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted to ICML 2022 Workshop on Interpretable ML in Healthcare

  4. arXiv:2206.12540  [pdf, other]

    cs.HC cs.LG

    Visual Auditor: Interactive Visualization for Detection and Summarization of Model Biases

    Authors: David Munechika, Zijie J. Wang, Jack Reidy, Josh Rubin, Krishna Gade, Krishnaram Kenthapadi, Duen Horng Chau

    Abstract: As machine learning (ML) systems become increasingly widespread, it is necessary to audit these systems for biases prior to their deployment. Recent research has developed algorithms for effectively identifying intersectional bias in the form of interpretable, underperforming subsets (or slices) of the data. However, these solutions and their insights are limited without a tool for visually unders…

    Submitted 24 June, 2022; originally announced June 2022.

  5. arXiv:2206.02868  [pdf, other]

    cs.LG cs.HC

    A Human-Centric Take on Model Monitoring

    Authors: Murtuza N Shergadwala, Himabindu Lakkaraju, Krishnaram Kenthapadi

    Abstract: Predictive models are increasingly used to make various consequential decisions in high-stakes domains such as healthcare, finance, and policy. It becomes critical to ensure that these models make accurate predictions, are robust to shifts in the data, do not rely on spurious features, and do not unduly discriminate against minority groups. To this end, several approaches spanning various areas su…

    Submitted 20 September, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

  6. arXiv:2204.04440  [pdf, other]

    cs.LG

    Are Two Heads the Same as One? Identifying Disparate Treatment in Fair Neural Networks

    Authors: Michael Lohaus, Matthäus Kleindessner, Krishnaram Kenthapadi, Francesco Locatello, Chris Russell

    Abstract: We show that deep networks trained to satisfy demographic parity often do so through a form of race or gender awareness, and that the more we force a network to be fair, the more accurately we can recover race or gender from the internal state of the network. Based on this observation, we investigate an alternative fairness approach: we add a second classification head to the network to explicitly…

    Submitted 19 November, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

    Comments: Accepted at NeurIPS 2022

  7. arXiv:2203.11103  [pdf, other]

    cs.LG stat.ML

    Diverse Counterfactual Explanations for Anomaly Detection in Time Series

    Authors: Deborah Sulem, Michele Donini, Muhammad Bilal Zafar, Francois-Xavier Aubet, Jan Gasthaus, Tim Januschowski, Sanjiv Das, Krishnaram Kenthapadi, Cedric Archambeau

    Abstract: Data-driven methods that detect anomalies in time series data are ubiquitous in practice, but they are in general unable to provide helpful explanations for the predictions they make. In this work we propose a model-agnostic algorithm that generates counterfactual ensemble explanations for time series anomaly detection models. Our method generates a set of diverse counterfactual examples, i.e., mu…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 24 pages, 11 figures

  8. arXiv:2203.08398  [pdf, other]

    cs.LG cs.CR

    COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks

    Authors: Fan Wu, Linyi Li, Chejian Xu, Huan Zhang, Bhavya Kailkhura, Krishnaram Kenthapadi, Ding Zhao, Bo Li

    Abstract: As reinforcement learning (RL) has achieved near human-level performance in a variety of tasks, its robustness has raised great attention. While a vast body of research has explored test-time (evasion) attacks in RL and corresponding defenses, its robustness against training-time (poisoning) attacks remains largely unanswered. In this work, we focus on certifying the robustness of offline RL in th…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Published as a conference paper at ICLR 2022

  9. arXiv:2202.08821  [pdf, other]

    cs.CY cs.HC cs.LG

    Human-Algorithm Collaboration: Achieving Complementarity and Avoiding Unfairness

    Authors: Kate Donahue, Alexandra Chouldechova, Krishnaram Kenthapadi

    Abstract: Much of machine learning research focuses on predictive accuracy: given a task, create a machine learning model (or algorithm) that maximizes accuracy. In many settings, however, the final prediction or decision of a system is under the control of a human, who uses an algorithm's output along with their own personal expertise in order to produce a combined prediction. One ultimate goal of such col…

    Submitted 1 June, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: Accepted to ACM FAccT 2022. Preliminary version accepted (panel presentation) at NeurIPS workshop on Human-Centered AI

  10. arXiv:2202.04718  [pdf, other]

    cs.HC cs.LG

    Designing Closed Human-in-the-loop Deferral Pipelines

    Authors: Vijay Keswani, Matthew Lease, Krishnaram Kenthapadi

    Abstract: In hybrid human-machine deferral frameworks, a classifier can defer uncertain cases to human decision-makers (who are often themselves fallible). Prior work on simultaneous training of such classifier and deferral models has typically assumed access to an oracle during training to obtain true class labels for training samples, but in practice there often is no such oracle. In contrast, we consider…

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: A shorter version of this paper appeared in BHCC 2021

  11. arXiv:2112.12444  [pdf, other]

    cs.CL

    More Than Words: Towards Better Quality Interpretations of Text Classifiers

    Authors: Muhammad Bilal Zafar, Philipp Schmidt, Michele Donini, Cédric Archambeau, Felix Biessmann, Sanjiv Ranjan Das, Krishnaram Kenthapadi

    Abstract: The large size and complex decision mechanisms of state-of-the-art text classifiers make it difficult for humans to understand their predictions, leading to a potential lack of trust by the users. These issues have led to the adoption of methods like SHAP and Integrated Gradients to explain classification decisions by assigning importance scores to input tokens. However, prior work, using differen…

    Submitted 23 December, 2021; originally announced December 2021.

  12. arXiv:2111.13657  [pdf, other]

    cs.LG cs.AI stat.ML

    Amazon SageMaker Model Monitor: A System for Real-Time Insights into Deployed Machine Learning Models

    Authors: David Nigenda, Zohar Karnin, Muhammad Bilal Zafar, Raghu Ramesha, Alan Tan, Michele Donini, Krishnaram Kenthapadi

    Abstract: With the increasing adoption of machine learning (ML) models and systems in high-stakes settings across different industries, guaranteeing a model's performance after deployment has become crucial. Monitoring models in production is a critical aspect of ensuring their continued performance and reliability. We present Amazon SageMaker Model Monitor, a fully managed service that continuously monitor…

    Submitted 5 August, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

  13. Amazon SageMaker Clarify: Machine Learning Bias Detection and Explainability in the Cloud

    Authors: Michaela Hardt, Xiaoguang Chen, Xiaoyi Cheng, Michele Donini, Jason Gelman, Satish Gollaprolu, John He, Pedro Larroy, Xinyu Liu, Nick McCarthy, Ashish Rathi, Scott Rees, Ankit Siva, ErhYuan Tsai, Keerthan Vasist, Pinar Yilmaz, Muhammad Bilal Zafar, Sanjiv Das, Kevin Haas, Tyler Hill, Krishnaram Kenthapadi

    Abstract: Understanding the predictions made by machine learning (ML) models and their potential biases remains a challenging and labor-intensive task that depends on the application, the dataset, and the specific model. We present Amazon SageMaker Clarify, an explainability feature for Amazon SageMaker that launched in December 2020, providing insights into data and ML models by identifying biases and expl…

    Submitted 7 September, 2021; originally announced September 2021.

    Journal ref: In Proc. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2974-2983 (2021)

  14. arXiv:2107.04423  [pdf, other]

    cs.LG cs.DS

    Multiaccurate Proxies for Downstream Fairness

    Authors: Emily Diana, Wesley Gill, Michael Kearns, Krishnaram Kenthapadi, Aaron Roth, Saeed Sharifi-Malvajerdi

    Abstract: We study the problem of training a model that must obey demographic fairness conditions when the sensitive features are not available at training time -- in other words, how can we train a model to be fair by race when we don't have data about race? We adopt a fairness pipeline perspective, in which an "upstream" learner that does have access to the sensitive features will learn a proxy model for…

    Submitted 25 January, 2022; v1 submitted 9 July, 2021; originally announced July 2021.

  15. arXiv:2106.04631  [pdf, other]

    cs.CL cs.LG

    On the Lack of Robust Interpretability of Neural Text Classifiers

    Authors: Muhammad Bilal Zafar, Michele Donini, Dylan Slack, Cédric Archambeau, Sanjiv Das, Krishnaram Kenthapadi

    Abstract: With the ever-increasing complexity of neural language models, practitioners have turned to methods for understanding the predictions of these models. One of the most well-adopted approaches for model interpretability is feature-based interpretability, i.e., ranking the features in terms of their impact on model predictions. Several prior studies have focused on assessing the fidelity of feature-b…

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: Appearing at ACL Findings 2021

  16. arXiv:2105.06929  [pdf, other]

    cs.SI cs.AI

    On Measuring the Diversity of Organizational Networks

    Authors: Zeinab S. Jalali, Krishnaram Kenthapadi, Sucheta Soundarajan

    Abstract: The interaction patterns of employees in social and professional networks play an important role in the success of employees and organizations as a whole. However, in many fields there is a severe under-representation of minority groups; moreover, minority individuals may be segregated from the rest of the network or isolated from one another. While the problem of increasing the representation of…

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: 12 pages, 3 figures, accepted in CompleNet 2021

  17. arXiv:2105.03153  [pdf, other]

    stat.ML cs.LG

    Pairwise Fairness for Ordinal Regression

    Authors: Matthäus Kleindessner, Samira Samadi, Muhammad Bilal Zafar, Krishnaram Kenthapadi, Chris Russell

    Abstract: We initiate the study of fairness for ordinal regression. We adapt two fairness notions previously considered in fair ranking and propose a strategy for training a predictor that is approximately fair according to either notion. Our predictor has the form of a threshold model, composed of a scoring function and a set of thresholds, and our strategy is based on a reduction to fair binary classifica…

    Submitted 11 February, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

  18. arXiv:2103.06641  [pdf, other]

    cs.LG cs.CR

    Differentially Private Query Release Through Adaptive Projection

    Authors: Sergul Aydore, William Brown, Michael Kearns, Krishnaram Kenthapadi, Luca Melis, Aaron Roth, Ankit Siva

    Abstract: We propose, implement, and evaluate a new algorithm for releasing answers to very large numbers of statistical queries like $k$-way marginals, subject to differential privacy. Our algorithm makes adaptive use of a continuous relaxation of the Projection Mechanism, which answers queries on the private dataset using simple perturbation, and then attempts to find the synthetic dataset that most close…

    Submitted 23 June, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

  19. arXiv:2102.13004  [pdf, other]

    cs.LG cs.HC stat.ML

    Towards Unbiased and Accurate Deferral to Multiple Experts

    Authors: Vijay Keswani, Matthew Lease, Krishnaram Kenthapadi

    Abstract: Machine learning models are often implemented in cohort with humans in the pipeline, with the model having an option to defer to a domain expert in cases where it has low confidence in its inference. Our goal is to design mechanisms for ensuring accuracy and fairness in such prediction systems that combine machine learning model inferences and domain expert predictions. Prior work on "deferral sys…

    Submitted 18 May, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: This paper has been accepted for publication at the AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (AIES 2021)

    Journal ref: In Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES), pages 154--165, 2021

  20. arXiv:2102.06162  [pdf, other]

    cs.LG

    Defuse: Harnessing Unrestricted Adversarial Examples for Debugging Models Beyond Test Accuracy

    Authors: Dylan Slack, Nathalie Rauschmayr, Krishnaram Kenthapadi

    Abstract: We typically compute aggregate statistics on held-out test data to assess the generalization of machine learning models. However, statistics on test data often overstate model generalization, and thus, the performance of deployed machine learning models can be variable and untrustworthy. Motivated by these concerns, we develop methods to automatically discover and correct model errors beyond those…

    Submitted 11 February, 2021; originally announced February 2021.

  21. arXiv:2012.08489  [pdf, other]

    cs.LG cs.AI stat.ML

    Amazon SageMaker Automatic Model Tuning: Scalable Gradient-Free Optimization

    Authors: Valerio Perrone, Huibin Shen, Aida Zolic, Iaroslav Shcherbatyi, Amr Ahmed, Tanya Bansal, Michele Donini, Fela Winkelmolen, Rodolphe Jenatton, Jean Baptiste Faddoul, Barbara Pogorzelska, Miroslav Miladinovic, Krishnaram Kenthapadi, Matthias Seeger, Cédric Archambeau

    Abstract: Tuning complex machine learning systems is challenging. Machine learning typically requires setting hyperparameters, be it regularization, architecture, or optimization parameters, whose tuning is critical to achieve good predictive performance. To democratize access to machine learning systems, it is essential to automate the tuning. This paper presents Amazon SageMaker Automatic Model Tuning (AMT…

    Submitted 18 June, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

  22. arXiv:2011.03108  [pdf, other]

    cs.LG

    Minimax Group Fairness: Algorithms and Experiments

    Authors: Emily Diana, Wesley Gill, Michael Kearns, Krishnaram Kenthapadi, Aaron Roth

    Abstract: We consider a recently introduced framework in which fairness is measured by worst-case outcomes across groups, rather than by the more standard differences between group outcomes. In this framework we provide provably convergent oracle-efficient learning algorithms (or equivalently, reductions to non-fair learning) for minimax group fairness. Here the goal is that of minimizing the maximum loss a…

    Submitted 7 March, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

  23. arXiv:2008.07433  [pdf, other]

    cs.LG cs.AI cs.CY cs.SE

    LiFT: A Scalable Framework for Measuring Fairness in ML Applications

    Authors: Sriram Vasudevan, Krishnaram Kenthapadi

    Abstract: Many internet applications are powered by machine learned models, which are usually trained on labeled datasets obtained through either implicit / explicit user feedback signals or human judgments. Since societal biases may be present in the generation of such datasets, it is possible for the trained models to be biased, thereby resulting in potential discrimination and harms for disadvantaged gro…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Accepted for publication in CIKM 2020

  24. arXiv:2007.15270  [pdf, other]

    cs.AI cs.LG stat.ML

    Fairness-Aware Online Personalization

    Authors: G Roshan Lal, Sahin Cem Geyik, Krishnaram Kenthapadi

    Abstract: Decision making in crucial applications such as lending, hiring, and college admissions has witnessed increasing use of algorithmic models and techniques as a result of a confluence of factors such as ubiquitous connectivity, ability to collect, aggregate, and process large amounts of fine-grained data using cloud computing, and ease of access to applying sophisticated machine learning models. Qui…

    Submitted 6 September, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Accepted in RecSys 2020, FAccTRec Workshop: Responsible Recommendation

  25. arXiv:2007.05124  [pdf, other]

    stat.AP

    Evaluating Fairness Using Permutation Tests

    Authors: Cyrus DiCiccio, Sriram Vasudevan, Kinjal Basu, Krishnaram Kenthapadi, Deepak Agarwal

    Abstract: Machine learning models are central to people's lives and impact society in ways as fundamental as determining how people access information. The gravity of these models imparts a responsibility to model developers to ensure that they are treating users in a fair and equitable manner. Before deploying a model into production, it is crucial to examine the extent to which its predictions demonstrate…

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: 11 pages, 5 figures

  26. arXiv:2006.05109  [pdf, other]

    stat.ML cs.LG

    Fair Bayesian Optimization

    Authors: Valerio Perrone, Michele Donini, Muhammad Bilal Zafar, Robin Schmucker, Krishnaram Kenthapadi, Cédric Archambeau

    Abstract: Given the increasing importance of machine learning (ML) in our lives, several algorithmic fairness techniques have been proposed to mitigate biases in the outcomes of the ML models. However, most of these techniques are specialized to cater to a single family of ML models and a specific definition of fairness, limiting their adaptability in practice. We introduce a general constrained Bayesian op…

    Submitted 18 June, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  27. Fairness-Aware Ranking in Search & Recommendation Systems with Application to LinkedIn Talent Search

    Authors: Sahin Cem Geyik, Stuart Ambler, Krishnaram Kenthapadi

    Abstract: We present a framework for quantifying and mitigating algorithmic bias in mechanisms designed for ranking individuals, typically used as part of web-scale search and recommendation systems. We first propose complementary measures to quantify bias with respect to protected attributes such as gender and age. We then present algorithms for computing fairness-aware re-ranking of results. For a given s…

    Submitted 24 July, 2019; v1 submitted 30 April, 2019; originally announced May 2019.

    Comments: This paper has been accepted for publication at ACM KDD 2019

  28. arXiv:1904.05233  [pdf, other]

    cs.LG cs.CL stat.ML

    What's in a Name? Reducing Bias in Bios without Access to Protected Attributes

    Authors: Alexey Romanov, Maria De-Arteaga, Hanna Wallach, Jennifer Chayes, Christian Borgs, Alexandra Chouldechova, Sahin Geyik, Krishnaram Kenthapadi, Anna Rumshisky, Adam Tauman Kalai

    Abstract: There is a growing body of work that proposes methods for mitigating bias in machine learning systems. These methods typically rely on access to protected attributes such as race, gender, or age. However, this raises two significant challenges: (1) protected attributes may not be available or it may not be legal to use them, and (2) it is often desirable to simultaneously consider multiple protect…

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted at NAACL 2019; Best Thematic Paper

  29. arXiv:1901.09451  [pdf, other]

    cs.IR cs.LG stat.ML

    Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting

    Authors: Maria De-Arteaga, Alexey Romanov, Hanna Wallach, Jennifer Chayes, Christian Borgs, Alexandra Chouldechova, Sahin Geyik, Krishnaram Kenthapadi, Adam Tauman Kalai

    Abstract: We present a large-scale study of gender bias in occupation classification, a task where the use of machine learning may lead to negative outcomes on people's lives. We analyze the potential allocation harms that can result from semantic representation bias. To do so, we study the impact on occupation classification of including explicit gender indicators---such as first names and pronouns---in di…

    Submitted 27 January, 2019; originally announced January 2019.

    Comments: Accepted at ACM Conference on Fairness, Accountability, and Transparency (ACM FAT*), 2019

  30. arXiv:1809.07754  [pdf, other]

    cs.CR cs.IR cs.SI

    PriPeARL: A Framework for Privacy-Preserving Analytics and Reporting at LinkedIn

    Authors: Krishnaram Kenthapadi, Thanh T. L. Tran

    Abstract: Preserving privacy of users is a key requirement of web-scale analytics and reporting applications, and has witnessed a renewed focus in light of recent data breaches and new regulations such as GDPR. We focus on the problem of computing robust, reliable analytics in a privacy-preserving manner, while satisfying product requirements. We present PriPeARL, a framework for privacy-preserving analytic…

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: Conference information: ACM International Conference on Information and Knowledge Management (CIKM 2018)

  31. arXiv:1809.06481  [pdf, other]

    cs.AI

    Talent Search and Recommendation Systems at LinkedIn: Practical Challenges and Lessons Learned

    Authors: Sahin Cem Geyik, Qi Guo, Bo Hu, Cagri Ozcaglar, Ketan Thakkar, Xianren Wu, Krishnaram Kenthapadi

    Abstract: LinkedIn Talent Solutions business contributes to around 65% of LinkedIn's annual revenue, and provides tools for job providers to reach out to potential candidates and for job seekers to find suitable career opportunities. LinkedIn's job ecosystem has been designed as a platform to connect job providers and job seekers, and to serve as a marketplace for efficient matching between potential candid…

    Submitted 17 September, 2018; originally announced September 2018.

    Comments: This paper has been accepted for publication at ACM SIGIR 2018

  32. arXiv:1809.06473  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Deep and Representation Learning for Talent Search at LinkedIn

    Authors: Rohan Ramanath, Hakan Inan, Gungor Polatkan, Bo Hu, Qi Guo, Cagri Ozcaglar, Xianren Wu, Krishnaram Kenthapadi, Sahin Cem Geyik

    Abstract: Talent search and recommendation systems at LinkedIn strive to match the potential candidates to the hiring needs of a recruiter or a hiring manager expressed in terms of a search query or a job posting. Recent work in this domain has mainly focused on linear models, which do not take complex relationships between features into account, as well as ensemble tree models, which introduce non-linearit…

    Submitted 17 September, 2018; originally announced September 2018.

    Comments: This paper has been accepted for publication in ACM CIKM 2018

  33. How LinkedIn Economic Graph Bonds Information and Product: Applications in LinkedIn Salary

    Authors: Xi Chen, Yiqun Liu, Liang Zhang, Krishnaram Kenthapadi

    Abstract: The LinkedIn Salary product was launched in late 2016 with the goal of providing insights on compensation distribution to job seekers, so that they can make more informed decisions when discovering and assessing career opportunities. The compensation insights are provided based on data collected from LinkedIn members and aggregated in a privacy-preserving manner. Given the simultaneous desire for…

    Submitted 23 June, 2018; originally announced June 2018.

    Comments: 10 pages, 5 figures

  34. arXiv:1706.03849  [pdf, other]

    cs.SI cs.IR

    Dionysius: A Framework for Modeling Hierarchical User Interactions in Recommender Systems

    Authors: Jian Wang, Krishnaram Kenthapadi, Kaushik Rangadurai, David Hardtke

    Abstract: We address the following problem: How do we incorporate user item interaction signals as part of the relevance model in a large-scale personalized recommendation system such that, (1) the ability to interpret the model and explain recommendations is retained, and (2) the existing infrastructure designed for the (user profile) content-based model can be leveraged? We propose Dionysius, a hierarchic…

    Submitted 12 June, 2017; originally announced June 2017.

  35. arXiv:1705.06976  [pdf, other]

    cs.SI cs.CR

    LinkedIn Salary: A System for Secure Collection and Presentation of Structured Compensation Insights to Job Seekers

    Authors: Krishnaram Kenthapadi, Ahsan Chudhary, Stuart Ambler

    Abstract: Online professional social networks such as LinkedIn have enhanced the ability of job seekers to discover and assess career opportunities, and the ability of job providers to discover and assess potential candidates. For most job seekers, salary (or broadly compensation) is a crucial consideration in choosing a new job. At the same time, job seekers face challenges in learning the compensation ass…

    Submitted 17 July, 2017; v1 submitted 17 May, 2017; originally announced May 2017.

    Comments: Conference information: IEEE Symposium on Privacy-Aware Computing (IEEE PAC 2017)

  36. arXiv:1703.09845  [pdf, other]

    cs.SI cs.AI cs.IR

    Bringing Salary Transparency to the World: Computing Robust Compensation Insights via LinkedIn Salary

    Authors: Krishnaram Kenthapadi, Stuart Ambler, Liang Zhang, Deepak Agarwal

    Abstract: The recently launched LinkedIn Salary product has been designed with the goal of providing compensation insights to the world's professionals and thereby helping them optimize their earning potential. We describe the overall design and architecture of the statistical modeling system underlying this product. We focus on the unique data mining challenges while designing and implementing the system,…

    Submitted 1 September, 2017; v1 submitted 28 March, 2017; originally announced March 2017.

    Comments: Conference information: ACM International Conference on Information and Knowledge Management (CIKM 2017)

  37. arXiv:1204.2606  [pdf, ps, other]

    cs.DS cs.CY cs.DB cs.SI

    Privacy via the Johnson-Lindenstrauss Transform

    Authors: Krishnaram Kenthapadi, Aleksandra Korolova, Ilya Mironov, Nina Mishra

    Abstract: Suppose that party A collects private information about its users, where each user's data is represented as a bit vector. Suppose that party B has a proprietary data mining algorithm that requires estimating the distance between users, such as clustering or nearest neighbors. We ask if it is possible for party A to publish some information about each user so that B can estimate the distance betwee…

    Submitted 11 April, 2012; originally announced April 2012.

    Comments: 24 pages

    ACM Class: K.4.1; F.2; H.3.5; G.3; I.5.3; H.3.3; H.2.8; E.1; G.1.3

    Journal ref: Journal of Privacy and Confidentiality, Volume 5, Issue 1, Pages 39-71, 2013

  38. arXiv:cs/0510086  [pdf, ps, other]

    cs.DS

    Balanced Allocation on Graphs

    Authors: K. Kenthapadi, R. Panigrahy

    Abstract: In this paper, we study the two choice balls and bins process when balls are not allowed to choose any two random bins, but only bins that are connected by an edge in an underlying graph. We show that for $n$ balls and $n$ bins, if the graph is almost regular with degree $n^ε$, where $ε$ is not too small, the previous bounds on the maximum load continue to hold. Precisely, the maximum load is…

    Submitted 27 October, 2005; originally announced October 2005.