Skip to main content

Showing 1–9 of 9 results for author: Mondal, S S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.03458  [pdf, other

    cs.CV cs.LG

    Slot Abstractors: Toward Scalable Abstract Visual Reasoning

    Authors: Shanka Subhra Mondal, Jonathan D. Cohen, Taylor W. Webb

    Abstract: Abstract visual reasoning is a characteristically human ability, allowing the identification of relational patterns that are abstracted away from object features, and the systematic generalization of those patterns to unseen problems. Recent work has demonstrated strong systematic generalization in visual reasoning tasks involving multi-object inputs, through the integration of slot-based methods… ▽ More

    Submitted 2 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 18 pages, 9 figures

  2. arXiv:2310.00194  [pdf, other

    cs.AI cs.NE

    A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

    Authors: Taylor Webb, Shanka Subhra Mondal, Chi Wang, Brian Krabach, Ida Momennejad

    Abstract: Large language models (LLMs) demonstrate impressive performance on a wide variety of tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed planning. To address this, we take inspiration from the human brain, in which planning is accomplished via the recurrent interaction of specialized modules in the prefrontal cortex (PFC). These modules perform functions su… ▽ More

    Submitted 5 March, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

  3. arXiv:2306.02500  [pdf, other

    cs.CV

    Systematic Visual Reasoning through Object-Centric Relational Abstraction

    Authors: Taylor W. Webb, Shanka Subhra Mondal, Jonathan D. Cohen

    Abstract: Human visual reasoning is characterized by an ability to identify abstract patterns from only a small number of examples, and to systematically generalize those patterns to novel inputs. This capacity depends in large part on our ability to represent complex visual inputs in terms of both objects and relations. Recent work in computer vision has introduced models with the capacity to extract objec… ▽ More

    Submitted 10 November, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

  4. arXiv:2305.18417  [pdf, other

    cs.LG q-bio.NC

    Determinantal Point Process Attention Over Grid Cell Code Supports Out of Distribution Generalization

    Authors: Shanka Subhra Mondal, Steven Frankland, Taylor Webb, Jonathan D. Cohen

    Abstract: Deep neural networks have made tremendous gains in emulating human-like intelligence, and have been used increasingly as ways of understanding how the brain may solve the complex computational problems on which this relies. However, these still fall short of, and therefore fail to provide insight into how the brain supports strong forms of generalization of which humans are capable. One such case… ▽ More

    Submitted 23 January, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 29 pages (including Appendix), 21 figures

  5. arXiv:2303.02260  [pdf, other

    cs.CV cs.CL

    Learning to reason over visual objects

    Authors: Shanka Subhra Mondal, Taylor Webb, Jonathan D. Cohen

    Abstract: A core component of human intelligence is the ability to identify abstract patterns inherent in complex, high-dimensional perceptual data, as exemplified by visual reasoning tasks such as Raven's Progressive Matrices (RPM). Motivated by the goal of designing AI systems with this capacity, recent work has focused on evaluating whether neural networks can learn to solve RPM-like problems. Previous w… ▽ More

    Submitted 26 October, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  6. arXiv:1907.12916  [pdf, other

    cs.DC cs.LG stat.ML

    DeepPlace: Learning to Place Applications in Multi-Tenant Clusters

    Authors: Subrata Mitra, Shanka Subhra Mondal, Nikhil Sheoran, Neeraj Dhake, Ravinder Nehra, Ramanuja Simha

    Abstract: Large multi-tenant production clusters often have to handle a variety of jobs and applications with a variety of complex resource usage characteristics. It is non-trivial and non-optimal to manually create placement rules for scheduling that would decide which applications should co-locate. In this paper, we present DeepPlace, a scheduler that learns to exploits various temporal resource usage pat… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: APSys 2019

  7. arXiv:1906.08636  [pdf, other

    q-fin.ST cs.LG

    Investment Ranking Challenge: Identifying the best performing stocks based on their semi-annual returns

    Authors: Shanka Subhra Mondal, Sharada Prasanna Mohanty, Benjamin Harlander, Mehmet Koseoglu, Lance Rane, Kirill Romanov, Wei-Kai Liu, Pranoot Hatwar, Marcel Salathe, Joe Byrum

    Abstract: In the IEEE Investment ranking challenge 2018, participants were asked to build a model which would identify the best performing stocks based on their returns over a forward six months window. Anonymized financial predictors and semi-annual returns were provided for a group of anonymized stocks from 1996 to 2017, which were divided into 42 non-overlap** six months period. The second half of 2017… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

  8. arXiv:1906.01363  [pdf, other

    cs.CV cs.LG

    KarNet: An Efficient Boolean Function Simplifier

    Authors: Shanka Subhra Mondal, Abhilash Nandy, Ritesh Agrawal, Debashis Sen

    Abstract: Many approaches such as Quine-McCluskey algorithm, Karnaugh map solving, Petrick's method and McBoole's method have been devised to simplify Boolean expressions in order to optimize hardware implementation of digital circuits. However, the algorithmic implementations of these methods are hard-coded and also their computation time is proportional to the number of minterms involved in the expression… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: 5 pages, 8 figures

  9. arXiv:1905.08315  [pdf, other

    eess.IV cs.CV cs.LG

    Multitask Learning of Temporal Connectionism in Convolutional Networks using a Joint Distribution Loss Function to Simultaneously Identify Tools and Phase in Surgical Videos

    Authors: Shanka Subhra Mondal, Rachana Sathish, Debdoot Sheet

    Abstract: Surgical workflow analysis is of importance for understanding onset and persistence of surgical phases and individual tool usage across surgery and in each phase. It is beneficial for clinical quality control and to hospital administrators for understanding surgery planning. Video acquired during surgery typically can be leveraged for this task. Currently, a combination of convolutional neural net… ▽ More

    Submitted 25 May, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

    Comments: 15 pages, 8 figures, 5th MedImage Workshop of 11th Indian Conference on Computer Vision, Graphics and Image Processing, Hyderabad, India, 2018