Skip to main content

Showing 1–2 of 2 results for author: Sohmshetty, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2212.00261  [pdf, other

    cs.LG

    Task Discovery: Finding the Tasks that Neural Networks Generalize on

    Authors: Andrei Atanov, Andrei Filatov, Teresa Yeo, Ajay Sohmshetty, Amir Zamir

    Abstract: When develo** deep learning models, we usually decide what task we want to solve then search for a model that generalizes well on the task. An intriguing question would be: what if, instead of fixing the task and searching in the model space, we fix the model and search in the task space? Can we find tasks that the model generalizes on? How do they look, or do they indicate anything? These are t… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: NeurIPS 2022, Project page at https://taskdiscovery.epfl.ch

  2. arXiv:1703.03939  [pdf

    cs.CL cs.LG cs.NE

    Ask Me Even More: Dynamic Memory Tensor Networks (Extended Model)

    Authors: Govardana Sachithanandam Ramachandran, Ajay Sohmshetty

    Abstract: We examine Memory Networks for the task of question answering (QA), under common real world scenario where training examples are scarce and under weakly supervised scenario, that is only extrinsic labels are available for training. We propose extensions for the Dynamic Memory Network (DMN), specifically within the attention mechanism, we call the resulting Neural Architecture as Dynamic Memory Ten… ▽ More

    Submitted 11 March, 2017; originally announced March 2017.