Showing 1–14 of 14 results for author: Poudel, P

  1. arXiv:2312.09056  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    ReCoRe: Regularized Contrastive Representation Learning of World Model

    Authors: Rudra P. K. Poudel, Harit Pandya, Stephan Liwicki, Roberto Cipolla

    Abstract: While recent model-free Reinforcement Learning (RL) methods have demonstrated human-level effectiveness in gaming environments, their success in everyday tasks like visual navigation has been limited, particularly under significant appearance variations. This limitation arises from (i) poor sample efficiency and (ii) over-fitting to training scenarios. To address these challenges, we present a wor…

    Submitted 3 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted at CVPR 2024. arXiv admin note: text overlap with arXiv:2209.14932

  2. arXiv:2311.17593  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.RO

    LanGWM: Language Grounded World Model

    Authors: Rudra P. K. Poudel, Harit Pandya, Chao Zhang, Roberto Cipolla

    Abstract: Recent advances in deep reinforcement learning have showcased its potential in tackling complex tasks. However, experiments on visual control tasks have revealed that state-of-the-art reinforcement learning models struggle with out-of-distribution generalization. Conversely, expressing higher-level concepts and global contexts is relatively easy using language. Building upon recent success of th…

    Submitted 29 November, 2023; originally announced November 2023.

  3. arXiv:2310.09650  [pdf]

    cs.LG cs.AI

    Multimodal Federated Learning in Healthcare: a Review

    Authors: Jacob Thrasher, Alina Devkota, Prasiddha Siwakotai, Rohit Chivukula, Pranav Poudel, Chaunbo Hu, Binod Bhattarai, Prashnna Gyawali

    Abstract: Recent advancements in multimodal machine learning have empowered the development of accurate and robust AI systems in the medical domain, especially within centralized database systems. Simultaneously, Federated Learning (FL) has progressed, providing a decentralized mechanism where data need not be consolidated, thereby enhancing the privacy and security of sensitive healthcare data. The integra…

    Submitted 27 February, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: 28 pages, 5 figures

  4. arXiv:2306.13203  [pdf, other]

    cs.CV

    Neural Network Pruning for Real-time Polyp Segmentation

    Authors: Suman Sapkota, Pranav Poudel, Sudarshan Regmi, Bibek Panthi, Binod Bhattarai

    Abstract: Computer-assisted treatment has emerged as a viable application of medical imaging, owing to the efficacy of deep learning models. Real-time inference speed remains a key requirement for such applications to help medical personnel. Even though there generally exists a trade-off between performance and model size, impressive efforts have been made to retain near-original performance by compromising…

    Submitted 22 June, 2023; originally announced June 2023.

  5. arXiv:2302.06294  [pdf, other]

    eess.IV cs.CV cs.LG

    CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection

    Authors: Chinedu Innocent Nwoye, Tong Yu, Saurav Sharma, Aditya Murali, Deepak Alapatt, Armine Vardazaryan, Kun Yuan, Jonas Hajek, Wolfgang Reiter, Amine Yamlahi, Finn-Henri Smidt, Xiaoyang Zou, Guoyan Zheng, Bruno Oliveira, Helena R. Torres, Satoshi Kondo, Satoshi Kasai, Felix Holm, Ege Özsoy, Shuangchun Gui, Han Li, Sista Raviteja, Rachana Sathish, Pranav Poudel, Binod Bhattarai, et al. (24 additional authors not shown)

    Abstract: Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier effor…

    Submitted 14 July, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: MICCAI EndoVis CholecTriplet2022 challenge report. Published at Elsevier journal of Medical Image Analysis. 25 pages, 15 figures, 8 tables

    Journal ref: Medical Image Analysis, Volume 89, 2023, 102888, ISSN 1361-8415

  6. arXiv:2209.14932  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Contrastive Unsupervised Learning of World Model with Invariant Causal Features

    Authors: Rudra P. K. Poudel, Harit Pandya, Roberto Cipolla

    Abstract: In this paper we present a world model, which learns causal features using the invariance principle. In particular, we use contrastive unsupervised learning to learn the invariant causal features, which enforces invariance across augmentations of irrelevant parts or styles of the observation. The world-model-based reinforcement learning methods independently optimize representation learning and th…

    Submitted 29 September, 2022; originally announced September 2022.

  7. arXiv:2208.07359  [pdf, ps, other]

    cs.DC

    Stable Scheduling in Transactional Memory

    Authors: Costas Busch, Bogdan S. Chlebus, Dariusz R. Kowalski, Pavan Poudel

    Abstract: We study computer systems with transactions executed on a set of shared objects. Transactions arrive continually subject to constraints that are framed as an adversarial model and impose limits on the average rate of transaction generation and the number of objects that transactions use. We show that no deterministic distributed scheduler in the queue-free model of transaction autonomy can provide…

    Submitted 15 August, 2022; originally announced August 2022.

  8. arXiv:2204.03440  [pdf, other]

    cs.CV

    Task-Aware Active Learning for Endoscopic Image Analysis

    Authors: Shrawan Kumar Thapa, Pranav Poudel, Binod Bhattarai, Danail Stoyanov

    Abstract: Semantic segmentation of polyps and depth estimation are two important research problems in endoscopic image analysis. One of the main obstacles to conducting research on these problems is the lack of annotated data. Endoscopic annotations necessitate the specialist knowledge of expert endoscopists and due to this, it can be difficult to organise, expensive and time consuming. To address this pr…

    Submitted 7 April, 2022; originally announced April 2022.

  9. arXiv:2201.12678  [pdf, ps, other]

    cs.LG cs.CV

    A Stochastic Bundle Method for Interpolating Networks

    Authors: Alasdair Paren, Leonard Berrada, Rudra P. K. Poudel, M. Pawan Kumar

    Abstract: We propose a novel method for training deep neural networks that are capable of interpolation, that is, driving the empirical loss to zero. At each iteration, our method constructs a stochastic approximation of the learning objective. The approximation, known as a bundle, is a pointwise maximum of linear functions. Our bundle contains a constant function that lower bounds the empirical loss. This…

    Submitted 29 January, 2022; originally announced January 2022.

  10. arXiv:2009.05429  [pdf, other]

    cs.RO cs.AI cs.CV

    Embodied Visual Navigation with Automatic Curriculum Learning in Real Environments

    Authors: Steven D. Morad, Roberto Mecca, Rudra P. K. Poudel, Stephan Liwicki, Roberto Cipolla

    Abstract: We present NavACL, a method of automatic curriculum learning tailored to the navigation task. NavACL is simple to train and efficiently selects relevant tasks using geometric features. In our experiments, deep reinforcement learning agents trained using NavACL significantly outperform state-of-the-art agents trained with uniform sampling -- the current standard. Furthermore, our agents can navigat…

    Submitted 6 January, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

  11. arXiv:1902.04502  [pdf, other]

    cs.CV

    Fast-SCNN: Fast Semantic Segmentation Network

    Authors: Rudra P K Poudel, Stephan Liwicki, Roberto Cipolla

    Abstract: The encoder-decoder framework is state-of-the-art for offline semantic image segmentation. Since the rise in autonomous systems, real-time computation is increasingly desirable. In this paper, we introduce fast segmentation convolutional neural network (Fast-SCNN), an above real-time semantic segmentation model on high resolution image data (1024x2048px) suited to efficient computation on embedded…

    Submitted 12 February, 2019; originally announced February 2019.

  12. arXiv:1805.04554  [pdf, other]

    cs.CV

    ContextNet: Exploring Context and Detail for Semantic Segmentation in Real-time

    Authors: Rudra P K Poudel, Ujwal Bonde, Stephan Liwicki, Christopher Zach

    Abstract: Modern deep learning architectures produce highly accurate results on many challenging semantic segmentation datasets. State-of-the-art methods are, however, not directly transferable to real-time applications or embedded devices, since naive adaptation of such systems to reduce computational cost (speed, memory and energy) causes a significant drop in accuracy. We propose ContextNet, a new deep n…

    Submitted 5 November, 2018; v1 submitted 11 May, 2018; originally announced May 2018.

    Comments: Published as a conference paper at British Machine Vision Conference (BMVC), 2018

  13. arXiv:1612.02572  [pdf]

    stat.ML cs.CV cs.LG q-bio.NC

    Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker

    Authors: James H Cole, Rudra PK Poudel, Dimosthenis Tsagkrasoulis, Matthan WA Caan, Claire Steves, Tim D Spector, Giovanni Montana

    Abstract: Machine learning analysis of neuroimaging data can accurately predict chronological age in healthy people and deviations from healthy brain ageing have been associated with cognitive impairment and disease. Here we sought to further establish the credentials of "brain-predicted age" as a biomarker of individual differences in the brain ageing process, using a predictive modelling approach based on…

    Submitted 8 December, 2016; originally announced December 2016.

  14. arXiv:1608.03974  [pdf, other]

    stat.ML cs.CV cs.LG

    Recurrent Fully Convolutional Neural Networks for Multi-slice MRI Cardiac Segmentation

    Authors: Rudra P K Poudel, Pablo Lamata, Giovanni Montana

    Abstract: In cardiac magnetic resonance imaging, fully-automatic segmentation of the heart enables precise structural and functional measurements to be taken, e.g. from short-axis MR images of the left-ventricle. In this work we propose a recurrent fully-convolutional network (RFCN) that learns image representations from the full stack of 2D slices and has the ability to leverage inter-slice spatial depende…

    Submitted 13 August, 2016; originally announced August 2016.

    Comments: MICCAI Workshop RAMBO 2016