Skip to main content

Showing 1–44 of 44 results for author: Shrestha, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01467  [pdf, other

    cs.GR cs.CV

    RaDe-GS: Rasterizing Depth in Gaussian Splatting

    Authors: Baowen Zhang, Chuan Fang, Rakesh Shrestha, Yixun Liang, Xiaoxiao Long, ** Tan

    Abstract: Gaussian Splatting (GS) has proven to be highly effective in novel view synthesis, achieving high-quality and real-time rendering. However, its potential for reconstructing detailed 3D shapes has not been fully explored. Existing methods often suffer from limited shape accuracy due to the discrete and unstructured nature of Gaussian splats, which complicates the shape extraction. While recent tech… ▽ More

    Submitted 24 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2403.19964  [pdf, other

    cs.CV cs.CY cs.LG

    FairRAG: Fair Human Generation via Fair Retrieval Augmentation

    Authors: Robik Shrestha, Yang Zou, Qiuyu Chen, Zhiheng Li, Yusheng Xie, Siqi Deng

    Abstract: Existing text-to-image generative models reflect or even amplify societal biases ingrained in their training data. This is especially concerning for human image generation where models are biased against certain demographic groups. Existing attempts to rectify this issue are hindered by the inherent limitations of the pre-trained models and fail to substantially improve demographic diversity. In t… ▽ More

    Submitted 5 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  3. arXiv:2403.14356  [pdf, other

    cs.LG cs.SE

    DomainLab: A modular Python package for domain generalization in deep learning

    Authors: Xudong Sun, Carla Feistner, Alexej Gossmann, George Schwarz, Rao Muhammad Umer, Lisa Beer, Patrick Rockenschaub, Rahul Babu Shrestha, Armin Gruber, Nutan Chen, Sayedali Shetab Boushehri, Florian Buettner, Carsten Marr

    Abstract: Poor generalization performance caused by distribution shifts in unseen domains often hinders the trustworthy deployment of deep neural networks. Many domain generalization techniques address this problem by adding a domain invariant regularization loss terms during training. However, there is a lack of modular software that allows users to combine the advantages of different methods with minimal… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  4. arXiv:2402.07395  [pdf, other

    cs.SI physics.soc-ph

    Comparing the willingness to share for human-generated vs. AI-generated fake news

    Authors: Amirsiavosh Bashardoust, Stefan Feuerriegel, Yash Raj Shrestha

    Abstract: Generative artificial intelligence (AI) presents large risks for society when it is used to create fake news. A crucial factor for fake news to go viral on social media is that users share such content. Here, we aim to shed light on the sharing behavior of users across human-generated vs. AI-generated fake news. Specifically, we study: (1) What is the perceived veracity of human-generated fake new… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  5. arXiv:2312.15471  [pdf, other

    cs.CV cs.RO

    Residual Learning for Image Point Descriptors

    Authors: Rashik Shrestha, Ajad Chhatkuli, Menelaos Kanakis, Luc Van Gool

    Abstract: Local image feature descriptors have had a tremendous impact on the development and application of computer vision methods. It is therefore unsurprising that significant efforts are being made for learning-based image point descriptors. However, the advantage of learned methods over handcrafted methods in real applications is subtle and more nuanced than expected. Moreover, handcrafted descriptors… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  6. arXiv:2312.15242  [pdf, other

    cs.CV

    CaLDiff: Camera Localization in NeRF via Pose Diffusion

    Authors: Rashik Shrestha, Bishad Koju, Abhigyan Bhusal, Danda Pani Paudel, François Rameau

    Abstract: With the widespread use of NeRF-based implicit 3D representation, the need for camera localization in the same representation becomes manifestly apparent. Doing so not only simplifies the localization process -- by avoiding an outside-the-NeRF-based localization -- but also has the potential to offer the benefit of enhanced localization. This paper studies the problem of localizing cameras in NeRF… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  7. arXiv:2312.13253  [pdf, other

    cs.CV cs.AI cs.LG

    Conditional Image Generation with Pretrained Generative Model

    Authors: Rajesh Shrestha, Bowen Xie

    Abstract: In recent years, diffusion models have gained popularity for their ability to generate higher-quality images in comparison to GAN models. However, like any other large generative models, these models require a huge amount of data, computational resources, and meticulous tuning for successful training. This poses a significant challenge, rendering it infeasible for most individuals. As a result, th… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  8. arXiv:2312.12716  [pdf, other

    cs.CV cs.CL cs.LG

    BloomVQA: Assessing Hierarchical Multi-modal Comprehension

    Authors: Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran

    Abstract: We propose a novel VQA dataset, BloomVQA, to facilitate comprehensive evaluation of large vision-language models on comprehension tasks. Unlike current benchmarks that often focus on fact-based memorization and simple reasoning tasks without theoretical grounding, we collect multiple-choice samples based on picture stories that reflect different levels of comprehension, as laid out in Bloom's Taxo… ▽ More

    Submitted 10 June, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by ACL Findings (2024). Dataset available at https://huggingface.co/datasets/ygong/BloomVQA

  9. arXiv:2310.15106  [pdf, other

    cs.IT eess.SP

    Theoretical Analysis of the Radio Map Estimation Problem

    Authors: Daniel Romero, Tien Ngoc Ha, Raju Shrestha, Massimo Franceschetti

    Abstract: Radio maps provide radio frequency metrics, such as the received signal strength, at every location of a geographic area. These maps, which are estimated using a set of measurements collected at multiple positions, find a wide range of applications in wireless communications, including the prediction of coverage holes, network planning, resource allocation, and path planning for mobile robots. Alt… ▽ More

    Submitted 23 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  10. arXiv:2310.11036  [pdf, other

    eess.SP cs.AI physics.app-ph

    Radio Map Estimation: Empirical Validation and Analysis

    Authors: Raju Shrestha, Tien Ngoc Ha, Pham Q. Viet, Daniel Romero

    Abstract: Radio maps quantify magnitudes such as the received signal strength at every location of a geographical region. Although the estimation of radio maps has attracted widespread interest, the vast majority of works rely on simulated data and, therefore, cannot establish the effectiveness and relative performance of existing algorithms in practice. To fill this gap, this paper presents the first compr… ▽ More

    Submitted 22 January, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 13 pages, Journal version, submitted to the IEEE Transactions on Wireless Communications

  11. arXiv:2310.05990  [pdf, other

    eess.IV cs.CV cs.LG

    Data Augmentation through Pseudolabels in Automatic Region Based Coronary Artery Segmentation for Disease Diagnosis

    Authors: Sandesh Pokhrel, Sanjay Bhandari, Eduard Vazquez, Yash Raj Shrestha, Binod Bhattarai

    Abstract: Coronary Artery Diseases(CADs) though preventable are one of the leading causes of death and disability. Diagnosis of these diseases is often difficult and resource intensive. Segmentation of arteries in angiographic images has evolved as a tool for assistance, hel** clinicians in making accurate diagnosis. However, due to the limited amount of data and the difficulty in curating a dataset, the… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2310.04749

  12. arXiv:2310.04749  [pdf, other

    cs.CV cs.AI

    ConvNeXtv2 Fusion with Mask R-CNN for Automatic Region Based Coronary Artery Stenosis Detection for Disease Diagnosis

    Authors: Sandesh Pokhrel, Sanjay Bhandari, Eduard Vazquez, Yash Raj Shrestha, Binod Bhattarai

    Abstract: Coronary Artery Diseases although preventable are one of the leading cause of mortality worldwide. Due to the onerous nature of diagnosis, tackling CADs has proved challenging. This study addresses the automation of resource-intensive and time-consuming process of manually detecting stenotic lesions in coronary arteries in X-ray coronary angiography images. To overcome this challenge, we employ a… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  13. arXiv:2310.03602  [pdf, other

    cs.CV

    Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints

    Authors: Chuan Fang, Yuan Dong, Kunming Luo, Xiaotao Hu, Rakesh Shrestha, ** Tan

    Abstract: Text-driven 3D indoor scene generation is useful for gaming, the film industry, and AR/VR applications. However, existing methods cannot faithfully capture the room layout, nor do they allow flexible editing of individual objects in the room. To address these problems, we present Ctrl-Room, which can generate convincing 3D rooms with designer-style layouts and high-fidelity textures from just a te… ▽ More

    Submitted 1 July, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  14. arXiv:2309.11594  [pdf, other

    cs.RO

    Development of a Feeding Assistive Robot Using a Six Degree of Freedom Robotic Arm

    Authors: Md Esharuzzaman Emu, Samarjith Biswas, Rajendra Shrestha

    Abstract: This project introduces a Feeding Assistive Robot tailored to individuals with physical disabilities, including those with limited arm function or hand control. The core component is a precise 6-degree freedom robotic arm, operated seamlessly through voice commands. Integration of an Arduino-based Braccio Arm, a distance sensor, and Bluetooth module enables voice-controlled movements. The primary… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 5 pages, 6 figures

    MSC Class: 15A22

  15. arXiv:2309.05142  [pdf, other

    cs.CL cs.AI

    Large Language Models for Difficulty Estimation of Foreign Language Content with Application to Language Learning

    Authors: Michalis Vlachos, Mircea Lungu, Yash Raj Shrestha, Johannes-Rudolf David

    Abstract: We use large language models to aid learners enhance proficiency in a foreign language. This is accomplished by identifying content on topics that the user is interested in, and that closely align with the learner's proficiency level in that foreign language. Our work centers on French content, but our approach is readily transferable to other languages. Our solution offers several distinctive cha… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  16. arXiv:2308.06434  [pdf, other

    cs.CV

    Distributionally Robust Optimization and Invariant Representation Learning for Addressing Subgroup Underrepresentation: Mechanisms and Limitations

    Authors: Nilesh Kumar, Ruby Shrestha, Zhiyuan Li, Linwei Wang

    Abstract: Spurious correlation caused by subgroup underrepresentation has received increasing attention as a source of bias that can be perpetuated by deep neural networks (DNNs). Distributionally robust optimization has shown success in addressing this bias, although the underlying working mechanism mostly relies on upweighting under-performing samples as surrogates for those underrepresented in data. At t… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted at FAIMI-2023

  17. arXiv:2304.05339  [pdf, other

    eess.IV cs.CV cs.LG

    Deep-learning assisted detection and quantification of (oo)cysts of Giardia and Cryptosporidium on smartphone microscopy images

    Authors: Suprim Nakarmi, Sanam Pudasaini, Safal Thapaliya, Pratima Upretee, Retina Shrestha, Basant Giri, Bhanu Bhakta Neupane, Bishesh Khanal

    Abstract: The consumption of microbial-contaminated food and water is responsible for the deaths of millions of people annually. Smartphone-based microscopy systems are portable, low-cost, and more accessible alternatives for the detection of Giardia and Cryptosporidium than traditional brightfield microscopes. However, the images from smartphone microscopes are noisier and require manual cyst identificatio… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 18 pages (including supplementary information), 4 figures, 7 tables, submitting to Journal of Machine Learning for Biomedical Imaging

  18. arXiv:2303.05473  [pdf, other

    cs.LG

    Natural Gradient Methods: Perspectives, Efficient-Scalable Approximations, and Analysis

    Authors: Rajesh Shrestha

    Abstract: Natural Gradient Descent, a second-degree optimization method motivated by the information geometry, makes use of the Fisher Information Matrix instead of the Hessian which is typically used. However, in many cases, the Fisher Information Matrix is equivalent to the Generalized Gauss-Newton Method, that both approximate the Hessian. It is an appealing method to be used as an alternative to stochas… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

    Comments: 14 pages

  19. arXiv:2212.04133  [pdf, other

    cs.CR

    Tumult Analytics: a robust, easy-to-use, scalable, and expressive framework for differential privacy

    Authors: Skye Berghel, Philip Bohannon, Damien Desfontaines, Charles Estes, Sam Haney, Luke Hartman, Michael Hay, Ashwin Machanavajjhala, Tom Magerlein, Gerome Miklau, Amritha Pai, William Sexton, Ruchit Shrestha

    Abstract: In this short paper, we outline the design of Tumult Analytics, a Python framework for differential privacy used at institutions such as the U.S. Census Bureau, the Wikimedia Foundation, or the Internal Revenue Service.

    Submitted 8 December, 2022; originally announced December 2022.

  20. arXiv:2211.06114  [pdf, ps, other

    eess.IV cs.CV

    Treatment classification of posterior capsular opacification (PCO) using automated ground truths

    Authors: Raisha Shrestha, Waree Kongprawechnon, Teesid Leelasawassuk, Nattapon Wongcumchang, Oliver Findl, Nino Hirnschall

    Abstract: Determination of treatment need of posterior capsular opacification (PCO)-- one of the most common complication of cataract surgery -- is a difficult process due to its local unavailability and the fact that treatment is provided only after PCO occurs in the central visual axis. In this paper we propose a deep learning (DL)-based method to first segment PCO images then classify the images into \te… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  21. arXiv:2210.04573  [pdf, other

    cs.CL cs.LG

    HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response

    Authors: Selim Fekih, Nicolò Tamagnone, Benjamin Minixhofer, Ranjan Shrestha, Ximena Contla, Ewan Oglethorpe, Navid Rekabsaz

    Abstract: Timely and effective response to humanitarian crises requires quick and accurate analysis of large amounts of text data - a process that can highly benefit from expert-assisted NLP systems trained on validated and annotated data in the humanitarian response domain. To enable creation of such NLP systems, we introduce and release HumSet, a novel and rich multilingual dataset of humanitarian respons… ▽ More

    Submitted 6 November, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Published at Findings of EMNLP 2022

  22. arXiv:2208.05907  [pdf, other

    cs.IT cs.CR

    Absolute Security in High-Frequency Wireless Links

    Authors: Alejandro Cohen, Rafael G. L. D'Oliveira, Chia-Yi Yeh, Hichem Guerboukha, Rabi Shrestha, Zhaoji Fang, Edward Knightly, Muriel Médard, Daniel M. Mittleman

    Abstract: Security against eavesdrop** is one of the key concerns in the design of any communication system. Many common considerations of the security of a wireless communication channel rely on comparing the signal level measured by Bob (the intended receiver) to that accessible to Eve (an eavesdropper). Frameworks such as Wyner's wiretap model ensure the security of a link, in an average sense, when Bo… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

  23. arXiv:2207.13793  [pdf, other

    cs.CR

    Precision-based attacks and interval refining: how to break, then fix, differential privacy on finite computers

    Authors: Samuel Haney, Damien Desfontaines, Luke Hartman, Ruchit Shrestha, Michael Hay

    Abstract: Despite being raised as a problem over ten years ago, the imprecision of floating point arithmetic continues to cause privacy failures in the implementations of differentially private noise mechanisms. In this paper, we highlight a new class of vulnerabilities, which we call \emph{precision-based attacks}, and which affect several open source libraries. To address this vulnerability and implement… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  24. arXiv:2204.10785  [pdf, other

    cs.NI

    Localizing Router Configuration Errors Using Minimal Correction Sets

    Authors: Aaron Gember-Jacobson, Ruchit Shrestha, Xiaolin Sun

    Abstract: Router configuration errors are unfortunately common and difficult to localize using current network verifiers. We introduce a novel configuration error localizer (CEL) that precisely identifies which configuration segments contribute to the violation of forwarding requirements. In particular, CEL generates a system of satisfiability modulo theories (SMT) constraints-which encode a network's confi… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

  25. arXiv:2204.02426  [pdf, other

    cs.LG

    OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses

    Authors: Robik Shrestha, Kushal Kafle, Christopher Kanan

    Abstract: Dataset bias and spurious correlations can significantly impair generalization in deep neural networks. Many prior efforts have addressed this problem using either alternative loss functions or sampling strategies that focus on rare patterns. We propose a new direction: modifying the network architecture to impose inductive biases that make the network robust to dataset bias. Specifically, we prop… ▽ More

    Submitted 14 April, 2024; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: ECCV 2022

  26. arXiv:2203.11397  [pdf, other

    cs.CV cs.AI

    A Real World Dataset for Multi-view 3D Reconstruction

    Authors: Rakesh Shrestha, Siqi Hu, Minghao Gou, Ziyuan Liu, ** Tan

    Abstract: We present a dataset of 998 3D models of everyday tabletop objects along with their 847,000 real world RGB and depth images. Accurate annotations of camera poses and object poses for each image are performed in a semi-automated fashion to facilitate the use of the dataset for myriad 3D applications like shape reconstruction, object pose estimation, shape retrieval etc. We primarily focus on learne… ▽ More

    Submitted 8 August, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  27. arXiv:2201.04125  [pdf, other

    eess.SP cs.LG

    Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

    Authors: Raju Shrestha, Daniel Romero, Sundeep Prabhakar Chepuri

    Abstract: Radio maps find numerous applications in wireless communications and mobile robotics tasks, including resource allocation, interference coordination, and mission planning. Although numerous techniques have been proposed to construct radio maps from spatially distributed measurements, the locations of such measurements are assumed predetermined beforehand. In contrast, this paper proposes spectrum… ▽ More

    Submitted 13 January, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: 30 pages, 10 figures, submitted to the IEEE Transactions on Wireless Communications

  28. arXiv:2109.10697  [pdf, other

    cs.AI cs.LG

    Towards Automatic Bias Detection in Knowledge Graphs

    Authors: Daphna Keidar, Mian Zhong, Ce Zhang, Yash Raj Shrestha, Bibek Paudel

    Abstract: With the recent surge in social applications relying on knowledge graphs, the need for techniques to ensure fairness in KG based methods is becoming increasingly evident. Previous works have demonstrated that KGs are prone to various social biases, and have proposed multiple methods for debiasing them. However, in such studies, the focus has been on debiasing techniques, while the relations to be… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

    Comments: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: Findings (EMNLP 2021). Nov 7--11, 2021

  29. arXiv:2104.00170  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Are Bias Mitigation Techniques for Deep Learning Effective?

    Authors: Robik Shrestha, Kushal Kafle, Christopher Kanan

    Abstract: A critical problem in deep learning is that systems learn inappropriate biases, resulting in their inability to perform well on minority groups. This has led to the creation of multiple algorithms that endeavor to mitigate bias. However, it is not clear how effective these methods are. This is because study protocols differ among papers, systems are tested on datasets that fail to test many forms… ▽ More

    Submitted 23 April, 2024; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: Published in WACV 2022 under the title "An Investigation of Critical Issues in Bias Mitigation Techniques"

  30. arXiv:2103.03048  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Detecting Spurious Correlations with Sanity Tests for Artificial Intelligence Guided Radiology Systems

    Authors: Usman Mahmood, Robik Shrestha, David D. B. Bates, Lorenzo Mannelli, Giuseppe Corrias, Yusuf Erdi, Christopher Kanan

    Abstract: Artificial intelligence (AI) has been successful at solving numerous problems in machine perception. In radiology, AI systems are rapidly evolving and show progress in guiding treatment decisions, diagnosing, localizing disease on medical images, and improving radiologists' efficiency. A critical component to deploying AI in radiology is to gain confidence in a developed system's efficacy and safe… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

  31. arXiv:2103.01752  [pdf, other

    cs.CY

    Morning or Evening? An Examination of Circadian Rhythms of CS1 Students

    Authors: Albina Zavgorodniaia, Raj Shrestha, Juho Leinonen, Arto Hellas, John Edwards

    Abstract: Circadian rhythms are the cycles of our internal clock that play a key role in governing when we sleep and when we are active. A related concept is chronotype, which is a person's natural tendency toward activity at certain times of day and typically governs when the individual is most alert and productive. In this work we investigate chronotypes in the setting of an Introductory Computer Programm… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

  32. arXiv:2011.02834  [pdf

    cs.LG

    Augmenting Organizational Decision-Making with Deep Learning Algorithms: Principles, Promises, and Challenges

    Authors: Yash Raj Shrestha, Vaibhav Krishna, Georg von Krogh

    Abstract: The current expansion of theory and research on artificial intelligence in management and organization studies has revitalized the theory and research on decision-making in organizations. In particular, recent advances in deep learning (DL) algorithms promise benefits for decision-making within organizations, such as assisting employees with information processing, thereby augment their analytical… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Journal ref: Journal of Business Research 2020

  33. arXiv:2010.08682  [pdf, other

    cs.CV cs.LG eess.IV

    MeshMVS: Multi-View Stereo Guided Mesh Reconstruction

    Authors: Rakesh Shrestha, Zhiwen Fan, Qingkun Su, Zuozhuo Dai, Siyu Zhu, ** Tan

    Abstract: Deep learning based 3D shape generation methods generally utilize latent features extracted from color images to encode the semantics of objects and guide the shape generation process. These color image semantics only implicitly encode 3D information, potentially limiting the accuracy of the generated shapes. In this paper we propose a multi-view mesh generation method which incorporates geometry… ▽ More

    Submitted 11 April, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

  34. arXiv:2007.13714  [pdf

    cs.CY

    IoT Based Smart Home using Blynk Framework

    Authors: Bharat Bohara, Sunil Maharjan, Bibek Raj Shrestha

    Abstract: The project discussed in this paper is targeted at solving sundry problems faced by Nepalese people in their daily life. It is designed to control and monitor appliances via smartphone using Wi-Fi as communication protocol and raspberry pi as private server. All the appliances and sensors are connected to the internet via NodeMcu microcontroller, which serves as the gateway to the internet. Even i… ▽ More

    Submitted 27 July, 2020; originally announced July 2020.

    Comments: 5 pages, 6 figures, presented in 13th National Technological Festival, Locus-2016, Tribhuvan University, Nepal

    Journal ref: ZERONE SCHOLAR, VOL. 1, (2016) 26-30

  35. arXiv:2006.16926  [pdf, other

    cs.CY cs.CL cs.LG

    A Deep Learning Pipeline for Patient Diagnosis Prediction Using Electronic Health Records

    Authors: Leopold Franz, Yash Raj Shrestha, Bibek Paudel

    Abstract: Augmentation of disease diagnosis and decision-making in healthcare with machine learning algorithms is gaining much impetus in recent years. In particular, in the current epidemiological situation caused by COVID-19 pandemic, swift and accurate prediction of disease diagnosis with machine learning algorithms could facilitate identification and care of vulnerable clusters of population, such as th… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Journal ref: BIOKDD 2020 at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2020

  36. arXiv:2006.16309  [pdf, other

    cs.LG cs.AI cs.SI stat.ML

    Adversarial Learning for Debiasing Knowledge Graph Embeddings

    Authors: Mario Arduini, Lorenzo Noci, Federico Pirovano, Ce Zhang, Yash Raj Shrestha, Bibek Paudel

    Abstract: Knowledge Graphs (KG) are gaining increasing attention in both academia and industry. Despite their diverse benefits, recent research have identified social and cultural biases embedded in the representations learned from KGs. Such biases can have detrimental consequences on different population and minority groups as applications of KG begin to intersect and interact with social spheres. This pap… ▽ More

    Submitted 17 February, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: MLG 2020 at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2020

    Journal ref: MLG 2020 at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2020

  37. arXiv:2005.09241  [pdf, other

    cs.CV cs.LG

    On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law

    Authors: Damien Teney, Kushal Kafle, Robik Shrestha, Ehsan Abbasnejad, Christopher Kanan, Anton van den Hengel

    Abstract: Out-of-distribution (OOD) testing is increasingly popular for evaluating a machine learning system's ability to generalize beyond the biases of a training set. OOD benchmarks are designed to present a different joint distribution of data and labels between training and test time. VQA-CP has become the standard OOD benchmark for visual question answering, but we discovered three troubling practices… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  38. arXiv:2004.05704  [pdf, other

    cs.CV cs.AI cs.CL

    Visual Grounding Methods for VQA are Working for the Wrong Reasons!

    Authors: Robik Shrestha, Kushal Kafle, Christopher Kanan

    Abstract: Existing Visual Question Answering (VQA) methods tend to exploit dataset biases and spurious statistical correlations, instead of producing right answers for the right reasons. To address this issue, recent bias mitigation methods for VQA propose to incorporate visual cues (e.g., human attention maps) to better ground the VQA models, showcasing impressive gains. However, we show that the performan… ▽ More

    Submitted 23 April, 2024; v1 submitted 12 April, 2020; originally announced April 2020.

    Comments: Published in ACL 2020 under the title "A negative case analysis of visual grounding methods for VQA"

  39. arXiv:1910.02509  [pdf, other

    cs.LG cs.CV cs.NE

    REMIND Your Neural Network to Prevent Catastrophic Forgetting

    Authors: Tyler L. Hayes, Kushal Kafle, Robik Shrestha, Manoj Acharya, Christopher Kanan

    Abstract: People learn throughout life. However, incrementally updating conventional neural networks leads to catastrophic forgetting. A common remedy is replay, which is inspired by how the brain consolidates memory. Replay involves fine-tuning a network on a mixture of new and old instances. While there is neuroscientific evidence that the brain replays compressed memories, existing methods for convolutio… ▽ More

    Submitted 13 July, 2020; v1 submitted 6 October, 2019; originally announced October 2019.

    Comments: To appear in the European Conference on Computer Vision (ECCV-2020)

  40. arXiv:1908.01801  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Answering Questions about Data Visualizations using Efficient Bimodal Fusion

    Authors: Kushal Kafle, Robik Shrestha, Brian Price, Scott Cohen, Christopher Kanan

    Abstract: Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e.g. bar charts, pie charts, and line graphs. CQA requires capabilities that natural-image VQA algorithms lack: fine-grained measurements, optical character recognition, and handling out-of-vocabulary words in both questions and answers. Withou… ▽ More

    Submitted 22 July, 2020; v1 submitted 5 August, 2019; originally announced August 2019.

    Comments: Presented at WACV, 2020

  41. arXiv:1904.09317  [pdf, other

    cs.LG cs.CL cs.CV cs.NE stat.ML

    Challenges and Prospects in Vision and Language Research

    Authors: Kushal Kafle, Robik Shrestha, Christopher Kanan

    Abstract: Language grounded image understanding tasks have often been proposed as a method for evaluating progress in artificial intelligence. Ideally, these tasks should test a plethora of capabilities that integrate computer vision, reasoning, and natural language understanding. However, rather than behaving as visual Turing tests, recent studies have demonstrated state-of-the-art systems are achieving go… ▽ More

    Submitted 24 May, 2019; v1 submitted 19 April, 2019; originally announced April 2019.

  42. arXiv:1903.00366  [pdf, other

    cs.CV

    Answer Them All! Toward Universal Visual Question Answering Models

    Authors: Robik Shrestha, Kushal Kafle, Christopher Kanan

    Abstract: Visual Question Answering (VQA) research is split into two camps: the first focuses on VQA datasets that require natural image understanding and the second focuses on synthetic datasets that test reasoning. A good VQA algorithm should be capable of both, but only a few VQA algorithms are tested in this manner. We compare five state-of-the-art VQA algorithms across eight VQA datasets covering both… ▽ More

    Submitted 5 April, 2019; v1 submitted 1 March, 2019; originally announced March 2019.

    Comments: 8 pages

  43. arXiv:1404.2464  [pdf, ps, other

    cs.MA cs.GT

    How Credible is the Prediction of a Party-Based Election?

    Authors: Jiong Guo, Yash Raj Shrestha, Yongjie Yang

    Abstract: In a party-based election system, the voters are grouped into parties and all voters of a party are assumed to vote according to the party preferences over the candidates. Hence, once the party preferences are declared the outcome of the election can be determined. However, in the actual election, the members of some "instable" parties often leave their own party to join other parties. We introduc… ▽ More

    Submitted 9 April, 2014; originally announced April 2014.

  44. arXiv:1401.2532  [pdf, ps, other

    cs.CC cs.DS

    Parameterized Complexity of Edge Interdiction Problems

    Authors: Jiong Guo, Yash Raj Shrestha

    Abstract: We study the parameterized complexity of interdiction problems in graphs. For an optimization problem on graphs, one can formulate an interdiction problem as a game consisting of two players, namely, an interdictor and an evader, who compete on an objective with opposing interests. In edge interdiction problems, every edge of the input graph has an interdiction cost associated with it and the inte… ▽ More

    Submitted 11 January, 2014; originally announced January 2014.