Skip to main content

Showing 1–16 of 16 results for author: Killamsetty, K

.
  1. arXiv:2310.00165  [pdf, other

    cs.LG cs.CV

    SCoRe: Submodular Combinatorial Representation Learning

    Authors: Anay Majee, Suraj Kothawade, Krishnateja Killamsetty, Rishabh Iyer

    Abstract: In this paper we introduce the SCoRe (Submodular Combinatorial Representation Learning) framework, a novel approach in representation learning that addresses inter-class bias and intra-class variance. SCoRe provides a new combinatorial viewpoint to representation learning, by introducing a family of loss functions based on set-based submodular information measures. We develop two novel combinatori… ▽ More

    Submitted 6 June, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted to ICML 2024

  2. arXiv:2306.01277  [pdf, other

    cs.LG cs.HC

    Beyond Active Learning: Leveraging the Full Potential of Human Interaction via Auto-Labeling, Human Correction, and Human Verification

    Authors: Nathan Beck, Krishnateja Killamsetty, Suraj Kothawade, Rishabh Iyer

    Abstract: Active Learning (AL) is a human-in-the-loop framework to interactively and adaptively label data instances, thereby enabling significant gains in model performance compared to random sampling. AL approaches function by selecting the hardest instances to label, often relying on notions of diversity and uncertainty. However, we believe that these current paradigms of AL do not leverage the full pote… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 14 pages, 8 figures

  3. arXiv:2305.06677  [pdf, other

    cs.CL cs.AI cs.LG

    INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models

    Authors: H S V N S Kowndinya Renduchintala, Krishnateja Killamsetty, Sumit Bhatia, Milan Aggarwal, Ganesh Ramakrishnan, Rishabh Iyer, Balaji Krishnamurthy

    Abstract: A salient characteristic of pre-trained language models (PTLMs) is a remarkable improvement in their generalization capability and emergence of new capabilities with increasing model capacity and pre-training dataset size. Consequently, we are witnessing the development of enormous models pushing the state-of-the-art. It is, however, imperative to realize that this inevitably leads to prohibitivel… ▽ More

    Submitted 19 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  4. arXiv:2301.13287  [pdf, other

    cs.LG cs.AI

    MILO: Model-Agnostic Subset Selection Framework for Efficient Model Training and Tuning

    Authors: Krishnateja Killamsetty, Alexandre V. Evfimievski, Tejaswini Pedapati, Kiran Kate, Lucian Popa, Rishabh Iyer

    Abstract: Training deep networks and tuning hyperparameters on large datasets is computationally intensive. One of the primary research directions for efficient training is to reduce training costs by selecting well-generalizable subsets of training data. Compared to simple adaptive random subset selection baselines, existing intelligent subset selection approaches are not competitive due to the time-consum… ▽ More

    Submitted 16 June, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  5. arXiv:2203.08212  [pdf, other

    cs.LG

    AUTOMATA: Gradient Based Data Subset Selection for Compute-Efficient Hyper-parameter Tuning

    Authors: Krishnateja Killamsetty, Guttu Sai Abhishek, Aakriti, Alexandre V. Evfimievski, Lucian Popa, Ganesh Ramakrishnan, Rishabh Iyer

    Abstract: Deep neural networks have seen great success in recent years; however, training a deep model is often challenging as its performance heavily depends on the hyper-parameters used. In addition, finding the optimal hyper-parameter configuration, even with state-of-the-art (SOTA) hyper-parameter optimization (HPO) algorithms, can be time-consuming, requiring multiple training runs over the entire data… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  6. arXiv:2201.05471  [pdf, other

    physics.app-ph physics.optics

    Multimodal Anti-Reflective Coatings for Perfecting Anomalous Reflection from Arbitrary Periodic Structures

    Authors: Sherman W. Marcus, Vinay K. Killamsetty, Ariel Epstein

    Abstract: Metasurfaces possess vast wave-manipulation capabilities, including reflection and refraction of a plane wave into non-standard directions. This requires meticulously-designed sub-wavelength meta-atoms in each period of the metasurface which guarantee unitary coupling to the desired Floquet-Bloch mode or, equivalently, suppression of the coupling to other modes. Herein, we propose an entirely diff… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: 6 pages, 6 figures

  7. arXiv:2111.11210  [pdf, other

    cs.LG cs.AI

    GCR: Gradient Coreset Based Replay Buffer Selection For Continual Learning

    Authors: Rishabh Tiwari, Krishnateja Killamsetty, Rishabh Iyer, Pradeep Shenoy

    Abstract: Continual learning (CL) aims to develop techniques by which a single model adapts to an increasing number of tasks encountered sequentially, thereby potentially leveraging learnings across tasks in a resource-efficient manner. A major challenge for CL systems is catastrophic forgetting, where earlier tasks are forgotten while learning a new task. To address this, replay-based CL approaches maintai… ▽ More

    Submitted 15 April, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: Published at CVPR 2022 | Project Page: https://gradientcoreset.github.io/

  8. arXiv:2109.11410  [pdf, other

    cs.LG

    Learning to Robustly Aggregate Labeling Functions for Semi-supervised Data Programming

    Authors: Ayush Maheshwari, Krishnateja Killamsetty, Ganesh Ramakrishnan, Rishabh Iyer, Marina Danilevsky, Lucian Popa

    Abstract: A critical bottleneck in supervised machine learning is the need for large amounts of labeled data which is expensive and time consuming to obtain. However, it has been shown that a small amount of labeled data, while insufficient to re-train a model, can be effectively used to generate human-interpretable labeling functions (LFs). These LFs, in turn, have been used to generate a large amount of a… ▽ More

    Submitted 10 March, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: Findings of ACL, 2022

  9. arXiv:2107.00717  [pdf, other

    cs.LG cs.CV

    SIMILAR: Submodular Information Measures Based Active Learning In Realistic Scenarios

    Authors: Suraj Kothawade, Nathan Beck, Krishnateja Killamsetty, Rishabh Iyer

    Abstract: Active learning has proven to be useful for minimizing labeling costs by selecting the most informative samples. However, existing active learning methods do not work well in realistic scenarios such as imbalance or rare classes, out-of-distribution data in the unlabeled set, and redundancy. In this work, we propose SIMILAR (Submodular Information Measures based actIve LeARning), a unified active… ▽ More

    Submitted 3 November, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: To Appear In Thirty-fifth Conference on Neural Information Processing Systems, NeurIPS 2021

  10. arXiv:2106.07760  [pdf, other

    cs.LG cs.AI

    RETRIEVE: Coreset Selection for Efficient and Robust Semi-Supervised Learning

    Authors: Krishnateja Killamsetty, Xujiang Zhao, Feng Chen, Rishabh Iyer

    Abstract: Semi-supervised learning (SSL) algorithms have had great success in recent years in limited labeled data regimes. However, the current state-of-the-art SSL algorithms are computationally expensive and entail significant compute time and energy requirements. This can prove to be a huge limitation for many smaller companies and academic groups. Our main insight is that training on a subset of unlabe… ▽ More

    Submitted 27 October, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: To appear in NeurIPS21

  11. arXiv:2103.10774  [pdf, other

    physics.app-ph physics.optics

    Metagratings for Perfect Mode Conversion in Rectangular Waveguides: Theory and Experiment

    Authors: Vinay Kumar Killamsetty, Ariel Epstein

    Abstract: We present a complete design scheme, from theoretical formulation to experimental validation, exploiting the versatility of metagratings (MGs) for designing a rectangular waveguide (RWG) $\mbox{TE}_{10}$ - $\mbox{TE}_{20}$ mode converter (MC). MG devices, formed by sparse periodically positioned polarizable particles (meta-atoms), were mostly used to date for beam manipulation applications. In thi… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: 17 pages, 10 figures

    Journal ref: Phys. Rev. Applied 16, 014038 (2021)

  12. arXiv:2103.04362  [pdf, other

    physics.app-ph

    Semianalyitcal synthesis scheme for multifunctional metasurfaces on demand

    Authors: Vinay K. Killamsetty, Ariel Epstein

    Abstract: We propose a comprehensive field-based semianalytical method for designing fabrication-ready multifunctional periodic metasurfaces (MSs). Harnessing recent work on multielement metagratings based on capacitively-loaded strips, we have extended our previous meta-atom design formulation to generate realistic substrate-supported printed-circuit-board layouts for anomalous refraction MSs. Subsequently… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: 3 pages, 1 figure

  13. arXiv:2103.00123  [pdf, other

    cs.LG

    GRAD-MATCH: Gradient Matching based Data Subset Selection for Efficient Deep Model Training

    Authors: Krishnateja Killamsetty, Durga Sivasubramanian, Ganesh Ramakrishnan, Abir De, Rishabh Iyer

    Abstract: The great success of modern machine learning models on large datasets is contingent on extensive computational resources with high financial and environmental costs. One way to address this is by extracting subsets that generalize on par with the full data. In this work, we propose a general framework, GRAD-MATCH, which finds subsets that closely match the gradient of the training or validation se… ▽ More

    Submitted 11 June, 2021; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: To appear in Proceedings of the 38 th International Conference on Machine Learning, PMLR 139, 2021

  14. arXiv:2012.10630  [pdf, other

    cs.LG cs.AI

    GLISTER: Generalization based Data Subset Selection for Efficient and Robust Learning

    Authors: Krishnateja Killamsetty, Durga Sivasubramanian, Ganesh Ramakrishnan, Rishabh Iyer

    Abstract: Large scale machine learning and deep models are extremely data-hungry. Unfortunately, obtaining large amounts of labeled data is expensive, and training state-of-the-art models (with hyperparameter tuning) requires significant computing resources and time. Secondly, real-world data is noisy and imbalanced. As a result, several recent papers try to make the training process more efficient and robu… ▽ More

    Submitted 11 June, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence 35. 9(2021): 8110-8118

  15. arXiv:2011.06782  [pdf, other

    cs.LG

    A Nested Bi-level Optimization Framework for Robust Few Shot Learning

    Authors: Krishnateja Killamsetty, Changbin Li, Chen Zhao, Rishabh Iyer, Feng Chen

    Abstract: Model-Agnostic Meta-Learning (MAML), a popular gradient-based meta-learning framework, assumes that the contribution of each task or instance to the meta-learner is equal. Hence, it fails to address the domain shift between base and novel classes in few-shot learning. In this work, we propose a novel robust meta-learning algorithm, NestedMAML, which learns to assign weights to training tasks or in… ▽ More

    Submitted 1 December, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: To appear in the proceedings of AAAI 2022

  16. arXiv:2008.09887  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Data Programming with Subset Selection

    Authors: Ayush Maheshwari, Oishik Chatterjee, KrishnaTeja Killamsetty, Ganesh Ramakrishnan, Rishabh Iyer

    Abstract: The paradigm of data programming, which uses weak supervision in the form of rules/labelling functions, and semi-supervised learning, which augments small amounts of labelled data with a large unlabelled dataset, have shown great promise in several text classification scenarios. In this work, we argue that by not using any labelled data, data programming based approaches can yield sub-optimal perf… ▽ More

    Submitted 12 June, 2021; v1 submitted 22 August, 2020; originally announced August 2020.

    Comments: Findings of ACL, 2021