Skip to main content

Showing 1–6 of 6 results for author: Charles, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2110.14794  [pdf, other

    cs.CR cs.LG stat.ML

    Masked LARk: Masked Learning, Aggregation and Reporting worKflow

    Authors: Joseph J. Pfeiffer III, Denis Charles, Davis Gilton, Young Hun Jung, Mehul Parsana, Erik Anderson

    Abstract: Today, many web advertising data flows involve passive cross-site tracking of users. Enabling such a mechanism through the usage of third party tracking cookies (3PC) exposes sensitive user data to a large number of parties, with little oversight on how that data can be used. Thus, most browsers are moving towards removal of 3PC in subsequent browser iterations. In order to substantially improve e… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Microsoft Journal of Applied Research (MSJAR Volume 16)

    MSC Class: 68T07

  2. arXiv:2010.08710  [pdf, other

    cs.LG stat.ML

    Causal Transfer Random Forest: Combining Logged Data and Randomized Experiments for Robust Prediction

    Authors: Shuxi Zeng, Murat Ali Bayir, Joesph J. Pfeiffer III, Denis Charles, Emre Kiciman

    Abstract: It is often critical for prediction models to be robust to distributional shifts between training and testing data. From a causal perspective, the challenge is to distinguish the stable causal relationships from the unstable spurious correlations across shifts. We describe a causal transfer random forest (CTRF) that combines existing training data with a small amount of data from a randomized expe… ▽ More

    Submitted 14 January, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: 9 pages, 7 figures, 2 tables, accepted to WSDM 2021

  3. arXiv:2003.08485  [pdf, other

    cs.CV cs.LG stat.ML

    Self-Supervised Contextual Bandits in Computer Vision

    Authors: Aniket Anand Deshmukh, Abhimanu Kumar, Levi Boyles, Denis Charles, Eren Manavoglu, Urun Dogan

    Abstract: Contextual bandits are a common problem faced by machine learning practitioners in domains as diverse as hypothesis testing to product recommendations. There have been a lot of approaches in exploiting rich data representations for contextual bandit problems with varying degree of success. Self-supervised learning is a promising approach to find rich data representations without explicit labels. I… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

  4. arXiv:2002.07384  [pdf, other

    stat.ML cs.LG math.ST

    Data Transformation Insights in Self-supervision with Clustering Tasks

    Authors: Abhimanu Kumar, Aniket Anand Deshmukh, Urun Dogan, Denis Charles, Eren Manavoglu

    Abstract: Self-supervision is key to extending use of deep learning for label scarce domains. For most of self-supervised approaches data transformations play an important role. However, up until now the impact of transformations have not been studied. Furthermore, different transformations may have different impact on the system. We provide novel insights into the use of data transformation in self-supervi… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

  5. arXiv:1809.04673  [pdf, other

    cs.LG cs.AI stat.ML

    A Unified Batch Online Learning Framework for Click Prediction

    Authors: Rishabh Iyer, Nimit Acharya, Tanuja Bompada, Denis Charles, Eren Manavoglu

    Abstract: We present a unified framework for Batch Online Learning (OL) for Click Prediction in Search Advertisement. Machine Learning models once deployed, show non-trivial accuracy and calibration degradation over time due to model staleness. It is therefore necessary to regularly update models, and do so automatically. This paper presents two paradigms of Batch Online Learning, one which incrementally up… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

  6. arXiv:1804.06909  [pdf, other

    cs.LG stat.ML

    Modeling and Simultaneously Removing Bias via Adversarial Neural Networks

    Authors: John Moore, Joel Pfeiffer, Kai Wei, Rishabh Iyer, Denis Charles, Ran Gilad-Bachrach, Levi Boyles, Eren Manavoglu

    Abstract: In real world systems, the predictions of deployed Machine Learned models affect the training data available to build subsequent models. This introduces a bias in the training data that needs to be addressed. Existing solutions to this problem attempt to resolve the problem by either casting this in the reinforcement learning framework or by quantifying the bias and re-weighting the loss functions… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.