Skip to main content

Showing 1–7 of 7 results for author: Manavoglu, E

.
  1. arXiv:2212.08136  [pdf, other

    cs.CL cs.LG

    Efficient Long Sequence Modeling via State Space Augmented Transformer

    Authors: Simiao Zuo, Xiaodong Liu, Jian Jiao, Denis Charles, Eren Manavoglu, Tuo Zhao, Jianfeng Gao

    Abstract: Transformer models have achieved superior performance in various natural language processing tasks. However, the quadratic computational cost of the attention mechanism limits its practicality for long sequences. There are existing attention variants that improve the computational efficiency, but they have limited ability to effectively compute global information. In parallel to Transformer models… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  2. arXiv:2105.01289  [pdf, other

    cs.CV cs.LG

    Representation Learning for Clustering via Building Consensus

    Authors: Aniket Anand Deshmukh, Jayanth Reddy Regatti, Eren Manavoglu, Urun Dogan

    Abstract: In this paper, we focus on unsupervised representation learning for clustering of images. Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must be close in the representation space (exemplar consistency), and/or similar images must have similar cluster assignment… ▽ More

    Submitted 25 April, 2022; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: Paper is accepted at Springer Machine Learning Journal 2022. The code and the trained models are available at https://github.com/JayanthRR/ConCURL_NCE

  3. arXiv:2010.01245  [pdf, other

    cs.CV cs.LG

    Consensus Clustering With Unsupervised Representation Learning

    Authors: Jayanth Reddy Regatti, Aniket Anand Deshmukh, Eren Manavoglu, Urun Dogan

    Abstract: Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must either be closer in the representation space, or have a similar cluster assignment. Bootstrap Your Own Latent (BYOL) is one such representation learning algorithm that has achieved state-of-the-art results in… ▽ More

    Submitted 8 July, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted by the 2021 International Joint Conference on Neural Networks (IJCNN 2021)

  4. arXiv:2003.08485  [pdf, other

    cs.CV cs.LG stat.ML

    Self-Supervised Contextual Bandits in Computer Vision

    Authors: Aniket Anand Deshmukh, Abhimanu Kumar, Levi Boyles, Denis Charles, Eren Manavoglu, Urun Dogan

    Abstract: Contextual bandits are a common problem faced by machine learning practitioners in domains as diverse as hypothesis testing to product recommendations. There have been a lot of approaches in exploiting rich data representations for contextual bandit problems with varying degree of success. Self-supervised learning is a promising approach to find rich data representations without explicit labels. I… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

  5. arXiv:2002.07384  [pdf, other

    stat.ML cs.LG math.ST

    Data Transformation Insights in Self-supervision with Clustering Tasks

    Authors: Abhimanu Kumar, Aniket Anand Deshmukh, Urun Dogan, Denis Charles, Eren Manavoglu

    Abstract: Self-supervision is key to extending use of deep learning for label scarce domains. For most of self-supervised approaches data transformations play an important role. However, up until now the impact of transformations have not been studied. Furthermore, different transformations may have different impact on the system. We provide novel insights into the use of data transformation in self-supervi… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

  6. arXiv:1809.04673  [pdf, other

    cs.LG cs.AI stat.ML

    A Unified Batch Online Learning Framework for Click Prediction

    Authors: Rishabh Iyer, Nimit Acharya, Tanuja Bompada, Denis Charles, Eren Manavoglu

    Abstract: We present a unified framework for Batch Online Learning (OL) for Click Prediction in Search Advertisement. Machine Learning models once deployed, show non-trivial accuracy and calibration degradation over time due to model staleness. It is therefore necessary to regularly update models, and do so automatically. This paper presents two paradigms of Batch Online Learning, one which incrementally up… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

  7. arXiv:1804.06909  [pdf, other

    cs.LG stat.ML

    Modeling and Simultaneously Removing Bias via Adversarial Neural Networks

    Authors: John Moore, Joel Pfeiffer, Kai Wei, Rishabh Iyer, Denis Charles, Ran Gilad-Bachrach, Levi Boyles, Eren Manavoglu

    Abstract: In real world systems, the predictions of deployed Machine Learned models affect the training data available to build subsequent models. This introduces a bias in the training data that needs to be addressed. Existing solutions to this problem attempt to resolve the problem by either casting this in the reinforcement learning framework or by quantifying the bias and re-weighting the loss functions… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.