Showing 1–50 of 342 results for author: Patel, V

  1. arXiv:2406.17396  [pdf, other

    cs.CV

    SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing

    Authors: Ruihuang Li, Liyi Chen, Zhengqiang Zhang, Varun Jampani, Vishal M. Patel, Lei Zhang

    Abstract: Text-based 2D diffusion models have demonstrated impressive capabilities in image generation and editing. Meanwhile, the 2D diffusion models also exhibit substantial potentials for 3D editing tasks. However, how to achieve consistent edits across multiple viewpoints remains a challenge. While the iterative dataset update method is capable of achieving global consistency, it suffers from slow conve… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 16 pages, 13 figures

  2. arXiv:2406.13237  [pdf, other

    cs.CV

    ModelMix: A New Model-Mixup Strategy to Minimize Vicinal Risk across Tasks for Few-scribble based Cardiac Segmentation

    Authors: Ke Zhang, Vishal M. Patel

    Abstract: Pixel-level dense labeling is both resource-intensive and time-consuming, whereas weak labels such as scribble present a more feasible alternative to full annotations. However, training segmentation networks with weak supervision from scribbles remains challenging. Inspired by the fact that different segmentation tasks can be correlated with each other, we introduce a new approach to few-scribble… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 10 pages, 3 figures

  3. arXiv:2406.10373  [pdf, other

    cs.CV cs.GR

    Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections

    Authors: Jiacong Xu, Yiqun Mei, Vishal M. Patel

    Abstract: Photographs captured in unstructured tourist environments frequently exhibit variable appearances and transient occlusions, challenging accurate scene reconstruction and inducing artifacts in novel view synthesis. Although prior approaches have integrated the Neural Radiance Field (NeRF) with additional learnable modules to handle the dynamic appearances and eliminate transient objects, their exte… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 15 pages, 7 figures

  4. arXiv:2406.02549  [pdf, other

    cs.CV

    Dreamguider: Improved Training free Diffusion-based Conditional Generation

    Authors: Nithin Gopalakrishnan Nair, Vishal M Patel

    Abstract: Diffusion models have emerged as a formidable tool for training-free conditional generation. However, a key hurdle in inference-time guidance techniques is the need for compute-heavy backpropagation through the diffusion network for estimating the guidance direction. Moreover, these techniques often require handcrafted parameter tuning on a case-by-case basis. Although some recent works have introd… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2405.11708  [pdf, other

    cs.LG cs.CV

    Adaptive Batch Normalization Networks for Adversarial Robustness

    Authors: Shao-Yuan Lo, Vishal M. Patel

    Abstract: Deep networks are vulnerable to adversarial examples. Adversarial Training (AT) has been a standard foundation of modern adversarial defense approaches due to its remarkable effectiveness. However, AT is extremely time-consuming, which hinders its wide deployment in practical applications. In this paper, we aim at a non-AT defense: How to design a defense method that gets rid of AT but is still r… ▽ More

    Submitted 26 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted at IEEE International Conference on Advanced Video and Signal-based Surveillance (AVSS) 2024

  6. arXiv:2405.10913  [pdf, other

    cs.CV

    Blackbox Adaptation for Medical Image Segmentation

    Authors: Jay N. Paranjape, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

    Abstract: In recent years, various large foundation models have been proposed for image segmentation. These models are often trained on large amounts of data corresponding to general computer vision tasks. Hence, these models do not perform well on medical data. There have been some attempts in the literature to perform parameter-efficient finetuning of such foundation models for medical image segmentation.… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted early at MICCAI 2024

  7. arXiv:2405.05033  [pdf, other

    cs.CE cs.LG stat.ML

    Multi-fidelity Hamiltonian Monte Carlo

    Authors: Dhruv V. Patel, Jonghyun Lee, Matthew W. Farthing, Peter K. Kitanidis, Eric F. Darve

    Abstract: Numerous applications in biology, statistics, science, and engineering require generating samples from high-dimensional probability distributions. In recent years, the Hamiltonian Monte Carlo (HMC) method has emerged as a state-of-the-art Markov chain Monte Carlo technique, exploiting the shape of such high-dimensional target distributions to efficiently generate samples. Despite its impressive em… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  8. arXiv:2404.14406  [pdf, other

    cs.CV

    Hyp-OC: Hyperbolic One Class Classification for Face Anti-Spoofing

    Authors: Kartik Narayan, Vishal M. Patel

    Abstract: Face recognition technology has become an integral part of modern security systems and user authentication processes. However, these systems are vulnerable to spoofing attacks and can easily be circumvented. Most prior research in face anti-spoofing (FAS) approaches it as a two-class classification task where models are trained on real samples and known spoof attacks and tested for detection perfo… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted in FG2024, Project Page - https://kartik-3004.github.io/hyp-oc/

  9. arXiv:2404.12368  [pdf, other

    cs.CV cs.LG

    Gradient-Regularized Out-of-Distribution Detection

    Authors: Sina Sharifi, Taha Entesari, Bardia Safaei, Vishal M. Patel, Mahyar Fazlyab

    Abstract: One of the challenges for neural networks in real-life applications is the overconfident errors these models make when the data is not from the original training distribution. Addressing this issue is known as Out-of-Distribution (OOD) detection. Many state-of-the-art OOD methods employ an auxiliary dataset as a surrogate for OOD data during training to achieve improved performance. However,… ▽ More

    Submitted 22 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Under review

  10. arXiv:2404.11764  [pdf, other

    cs.CV

    Multimodal 3D Object Detection on Unseen Domains

    Authors: Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel

    Abstract: LiDAR datasets for autonomous driving exhibit biases in properties such as point cloud density, range, and object dimensions. As a result, object detection networks trained and evaluated in different environments often experience performance degradation. Domain adaptation approaches assume access to unannotated samples from the test distribution to address this problem. However, in the real world,… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: technical report

  11. arXiv:2404.11737  [pdf, other

    cs.CV

    Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection

    Authors: Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel

    Abstract: Popular representation learning methods encourage feature invariance under transformations applied at the input. However, in 3D perception tasks like object localization and segmentation, outputs are naturally equivariant to some transformations, such as rotation. Using pre-training loss functions that encourage equivariance of features under certain transformations provides a strong self-supervis… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: technical report

  12. arXiv:2404.09977  [pdf, other

    cs.CV

    MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

    Authors: Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M Patel

    Abstract: Large diffusion-based Text-to-Image (T2I) models have shown impressive generative powers for text-to-image generation as well as spatially conditioned image generation. For most applications, we can train the model end-to-end with paired data to obtain photorealistic generation quality. However, to add an additional task, one often needs to retrain the model from scratch using paired data across al… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  13. arXiv:2404.09976  [pdf, other

    cs.CV

    Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers

    Authors: Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Recently, diffusion transformers have gained wide attention for their excellent performance in text-to-image and text-to-video models, emphasizing the need for transformers as backbone for diffusion models. Transformer-based models have shown better generalization capability compared to CNN-based models for general vision tasks. However, much less has been explored in the existing literature regard… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  14. arXiv:2404.09810  [pdf, other

    math.OC stat.CO

    The Challenges of Optimization For Data Science

    Authors: Christian Varner, Vivak Patel

    Abstract: Optimization problems arising in data science have given rise to a number of new derivative-based optimization methods. Such methods often use standard smoothness assumptions -- namely, global Lipschitz continuity of the gradient function -- to establish a convergence theory. Unfortunately, in this work, we show that common optimization problems from data science applications are not globally Lips… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 24 pages, 3 tables, 2 figures, 10 algorithms

    MSC Class: 90C30; 65K05; 68T09

  15. arXiv:2404.01562  [pdf

    quant-ph

    Efficient, indistinguishable telecom C-band photons using a tapered nanobeam

    Authors: Mohammad Habibur Rahaman, Samuel Harper, Chang-Min Lee, Kyu-Young Kim, Mustafa Atabey Buyukkaya, Victor J. Patel, Samuel D. Hawkins, Je-Hyung Kim, Sadhvikas Addamane, Edo Waks

    Abstract: Telecom C-band single photons exhibit the lowest attenuation in optical fibers, enabling long-haul quantum-secured communication. However, efficient coupling with optical fibers is crucial for these single photons to be effective carriers in long-distance transmission. In this work, we demonstrate an efficient fiber-coupled single photon source at the telecom C-band using InAs/InP quantum dots cou… ▽ More

    Submitted 5 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  16. arXiv:2404.01367  [pdf, other

    cs.CV cs.LG

    Bigger is not Always Better: Scaling Properties of Latent Diffusion Models

    Authors: Kangfu Mei, Zhengzhong Tu, Mauricio Delbracio, Hossein Talebi, Vishal M. Patel, Peyman Milanfar

    Abstract: We study the scaling properties of latent diffusion models (LDMs) with an emphasis on their sampling efficiency. While improved network architecture and inference algorithms have been shown to effectively boost sampling efficiency of diffusion models, the role of model size -- a critical determinant of sampling efficiency -- has not been thoroughly examined. Through empirical analysis of established te… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  17. arXiv:2404.00744  [pdf, other

    astro-ph.CO math.ST

    The distribution of Bayes' ratio

    Authors: Luca Amendola, Vrund Patel, Ziad Sakr, Elena Sellentin, Kevin Wolz

    Abstract: The ratio of Bayesian evidences is a popular tool in cosmology to compare different models. There are however several issues with this method: Bayes' ratio depends on the prior even in the limit of non-informative priors, and Jeffreys' scale, used to assess the test, is arbitrary. Moreover, the standard use of Bayes' ratio is often criticized for being unable to reject models. In this paper, we ad… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 20 pages

  18. arXiv:2403.19593  [pdf, other

    cs.CV

    Frame by Familiar Frame: Understanding Replication in Video Diffusion Models

    Authors: Aimon Rahman, Malsha V. Perera, Vishal M. Patel

    Abstract: Building on the momentum of image generation diffusion models, there is an increasing interest in video-based diffusion models. However, video generation poses greater challenges due to its higher-dimensional nature, the scarcity of training data, and the complex spatiotemporal relationships involved. Image generation models, due to their extensive data requirements, have already strained computat… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  19. arXiv:2403.14513  [pdf, other

    cs.CV

    View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network

    Authors: Quan Zhang, Lei Wang, Vishal M. Patel, Xiaohua Xie, Jianhuang Lai

    Abstract: Existing person re-identification methods have achieved remarkable advances in appearance-based identity association across homogeneous cameras, such as ground-ground matching. However, as a more practical scenario, aerial-ground person re-identification (AGPReID) among heterogeneous cameras has received minimal attention. To alleviate the disruption of discriminative identity representation by dr… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  20. arXiv:2403.14053  [pdf, other

    cs.CV cs.GR

    Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions

    Authors: Jiacong Xu, Mingqian Liao, K Ram Prabhakar, Vishal M. Patel

    Abstract: Neural Radiance Fields (NeRF) accomplishes photo-realistic novel view synthesis by learning the implicit volumetric representation of a scene from multi-view images, which faithfully convey the colorimetric information. However, sensor noises will contaminate low-value pixel signals, and the lossy camera image signal processor will further remove near-zero intensities in extremely dark situations,… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 25 pages, 13 figures

  21. arXiv:2403.12960  [pdf, other

    cs.CV

    FaceXFormer: A Unified Transformer for Facial Analysis

    Authors: Kartik Narayan, Vibashan VS, Rama Chellappa, Vishal M. Patel

    Abstract: In this work, we introduce FaceXFormer, an end-to-end unified transformer model for a comprehensive range of facial analysis tasks such as face parsing, landmark detection, head pose estimation, attributes recognition, and estimation of age, gender, race, and landmarks visibility. Conventional methods in face analysis have often relied on task-specific designs and preprocessing techniques, which l… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project page: https://kartik-3004.github.io/facexformer_web/

  22. arXiv:2403.09632  [pdf, other

    cs.CV

    Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image

    Authors: Yiqun Mei, Yu Zeng, He Zhang, Zhixin Shu, Xuaner Zhang, Sai Bi, Jianming Zhang, HyunJoon Jung, Vishal M. Patel

    Abstract: At the core of portrait photography is the search for ideal lighting and viewpoint. The process often requires advanced knowledge in photography and an elaborate studio setup. In this work, we propose Holo-Relighting, a volumetric relighting method that is capable of synthesizing novel viewpoints, and novel lighting from a single image. Holo-Relighting leverages the pretrained 3D GAN (EG3D) to rec… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: CVPR2024

  23. arXiv:2403.09620  [pdf, other

    cs.CV

    PosSAM: Panoptic Open-vocabulary Segment Anything

    Authors: Vibashan VS, Shubhankar Borse, Hyojin Park, Debasmit Das, Vishal Patel, Munawar Hayat, Fatih Porikli

    Abstract: In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP model in an end-to-end framework. While SAM excels in generating spatially-aware masks, its decoder falls short in recognizing object class information and tends to oversegment without additional guidance. Existing appr… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  24. arXiv:2403.06978  [pdf, other

    cs.CV

    Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling

    Authors: Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: In this paper, we introduce Attention Prompt Tuning (APT) - a computationally efficient variant of prompt tuning for video-based applications such as action recognition. Prompt tuning approaches involve injecting a set of learnable prompts along with data tokens during fine-tuning while keeping the backbone frozen. This approach greatly reduces the number of learnable parameters compared to full t… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted at 18th IEEE International Conference on Automatic Face and Gesture Recognition (FG'24). Code available at: https://github.com/wgcban/apt. 12 pages, 8 figures, 6 tables

  25. arXiv:2402.17207  [pdf, other

    cs.CV

    Deployment Prior Injection for Run-time Calibratable Object Detection

    Authors: Mo Zhou, Yiding Yang, Haoxiang Li, Vishal M. Patel, Gang Hua

    Abstract: With a strong alignment between the training and test distributions, object relation as a context prior facilitates object detection. Yet, it turns into a harmful but inevitable training set bias upon test distributions that shift differently across space and time. Nevertheless, the existing detectors cannot incorporate deployment context prior during the test phase without parameter update. Such… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  26. arXiv:2402.02263  [pdf, other

    cs.LG cs.AI cs.CV

    MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers

    Authors: Yatong Bai, Mo Zhou, Vishal M. Patel, Somayeh Sojoudi

    Abstract: Adversarial robustness often comes at the cost of degraded accuracy, impeding the real-life application of robust classification models. Training-based solutions for better trade-offs are limited by incompatibilities with already-trained high-performance large models, necessitating the exploration of training-free ensemble approaches. Observing that robust models are more confident in correct pred… ▽ More

    Submitted 12 April, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    MSC Class: 68T07

  27. arXiv:2401.02158  [pdf, other

    cs.CL cs.AI

    Shayona@SMM4H23: COVID-19 Self diagnosis classification using BERT and LightGBM models

    Authors: Rushi Chavda, Darshan Makwana, Vraj Patel, Anupam Shukla

    Abstract: This paper describes approaches and results for Shared Tasks 1 and 4 of SMM4H-23 by Team Shayona. Shared Task 1 was binary classification of English tweets self-reporting a COVID-19 diagnosis, and Shared Task 4 was binary classification of English Reddit posts self-reporting a social anxiety disorder diagnosis. Our team has achieved the highest F1-score of 0.94 in Task 1 among all participants. We hav… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  28. arXiv:2312.14126  [pdf, other

    cs.CV

    Entropic Open-set Active Learning

    Authors: Bardia Safaei, Vibashan VS, Celso M. de Melo, Vishal M. Patel

    Abstract: Active Learning (AL) aims to enhance the performance of deep models by selecting the most informative samples for annotation from a pool of unlabeled data. Despite impressive performance in closed-set settings, most AL methods fail in real-world scenarios where the unlabeled data contains unknown categories. Recently, a few studies have attempted to tackle the AL problem for the open-set setting.… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted in AAAI 2024

  29. arXiv:2312.02156  [pdf, other

    cs.CV cs.AI

    Latent Feature-Guided Diffusion Models for Shadow Removal

    Authors: Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, Vishal M. Patel

    Abstract: Recovering textures under shadows has remained a challenging problem due to the difficulty of inferring shadow-free scenes from shadow images. In this paper, we propose the use of diffusion models as they offer a promising approach to gradually refine the details of shadow regions during the diffusion process. Our method improves this process by conditioning on a learned latent feature space that… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: project page see https://kfmei.page/shadow-diffusion/index.html

  30. arXiv:2312.02151  [pdf, other

    cs.CV cs.AI cs.LG

    Guarding Barlow Twins Against Overfitting with Mixed Samples

    Authors: Wele Gedara Chaminda Bandara, Celso M. De Melo, Vishal M. Patel

    Abstract: Self-supervised Learning (SSL) aims to learn transferable feature representations for downstream applications without relying on labeled data. The Barlow Twins algorithm, renowned for its widespread adoption and straightforward implementation compared to its counterparts like contrastive learning methods, minimizes feature redundancy while maximizing invariance to common corruptions. Optimizing fo… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Code and checkpoints are available at: https://github.com/wgcban/mix-bt.git

  31. arXiv:2311.05574  [pdf, ps, other

    math.CO cs.DM cs.DS math-ph

    A near-optimal zero-free disk for the Ising model

    Authors: Viresh Patel, Guus Regts, Ayla Stam

    Abstract: The partition function of the Ising model of a graph $G=(V,E)$ is defined as $Z_{\text{Ising}}(G;b)=\sum_{σ:V\to \{0,1\}} b^{m(σ)}$, where $m(σ)$ denotes the number of edges $e=\{u,v\}$ such that $σ(u)=σ(v)$. We show that for any positive integer $Δ$ and any graph $G$ of maximum degree at most $Δ$, $Z_{\text{Ising}}(G;b)\neq 0$ for all $b\in \mathbb{C}$ satisfying… ▽ More

    Submitted 23 April, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 12 pages; we have added a few propositions in Section 2 and reorganized the section to clarify the proof of Lemma 3.1. Some other small modifications have also been made as per suggestion of two referees

  32. arXiv:2310.06212  [pdf, other

    physics.med-ph

    Comparison of deep-learning data fusion strategies in mandibular osteoradionecrosis prediction modelling using clinical variables and radiation dose distribution volumes

    Authors: Laia Humbert-Vidan, Vinod Patel, Andrew P King, Teresa Guerrero Urbano

    Abstract: Purpose. NTCP modelling is rapidly embracing DL methods as the need to include spatial dose information is acknowledged. Finding the most appropriate way of combining radiation dose distribution images and clinical data involves technical challenges and requires domain knowledge. We propose different data fusion strategies that we hope will serve as a starting point for future DL NTCP studies. Met… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 10 pages, 4 figures, 3 tables

  33. arXiv:2310.04690  [pdf, other

    cs.CE

    A dimension-reduced variational approach for solving physics-based inverse problems using generative adversarial network priors and normalizing flows

    Authors: Agnimitra Dasgupta, Dhruv V Patel, Deep Ray, Erik A Johnson, Assad A Oberai

    Abstract: We propose a novel modular inference approach combining two different generative models -- generative adversarial networks (GAN) and normalizing flows -- to approximate the posterior distribution of physics-based Bayesian inverse problems framed in high-dimensional ambient spaces. We dub the proposed framework GAN-Flow. The proposed method leverages the intrinsic dimension reduction and superior s… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  34. arXiv:2310.01407  [pdf, other

    cs.CV cs.AI cs.LG

    CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation

    Authors: Kangfu Mei, Mauricio Delbracio, Hossein Talebi, Zhengzhong Tu, Vishal M. Patel, Peyman Milanfar

    Abstract: Large generative diffusion models have revolutionized text-to-image generation and offer immense potential for conditional generation tasks such as image enhancement, restoration, editing, and compositing. However, their widespread adoption is hindered by the high computational cost, which limits their real-time application. To address this challenge, we introduce a novel method dubbed CoDi, that… ▽ More

    Submitted 17 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  35. arXiv:2310.00224  [pdf, other

    cs.CV cs.AI cs.LG

    Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

    Authors: Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

    Abstract: Conditional generative models typically demand large annotated training sets to achieve high-quality synthesis. As a result, there has been significant interest in designing models that perform plug-and-play generation, i.e., to use a predefined or pretrained model, which is not explicitly trained on the generative task, to guide the generative process (e.g., using language). However, such guidanc… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted at ICCV 2023

  36. arXiv:2309.11677  [pdf, ps, other

    math.CO

    Cycle Partitions in Dense Regular Digraphs and Oriented Graphs

    Authors: Allan Lo, Viresh Patel, Mehmet Akif Yıldız

    Abstract: A conjecture of Jackson from 1981 states that every $d$-regular oriented graph on $n$ vertices with $n\leq 4d+1$ is Hamiltonian. We prove this conjecture for sufficiently large $n$. In fact we prove a more general result that for all $α>0$, there exists $n_0=n_0(α)$ such that every $d$-regular digraph on $n\geq n_0$ vertices with $d \geq αn $ can be covered by at most $n/(d+1)$ vertex-disjoint cyc… ▽ More

    Submitted 7 June, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: 33 pages, 1 figure

    MSC Class: 05C35; 05C38; 05C20; 05C70

  37. arXiv:2309.10928  [pdf, ps, other

    math.CO cs.DM cs.DS

    Improved bounds for the zeros of the chromatic polynomial via Whitney's Broken Circuit Theorem

    Authors: Matthew Jenssen, Viresh Patel, Guus Regts

    Abstract: We prove that for any graph $G$ of maximum degree at most $Δ$, the zeros of its chromatic polynomial $χ_G(x)$ (in $\mathbb{C}$) lie inside the disc of radius $5.94 Δ$ centered at $0$. This improves on the previously best known bound of approximately $6.91Δ$. We also obtain improved bounds for graphs of high girth. We prove that for every $g$ there is a constant $K_g$ such that for any graph $G$… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 16 pages

  38. arXiv:2309.10894  [pdf, other

    math.OC stat.CO

    A Novel Gradient Methodology with Economical Objective Function Evaluations for Data Science Applications

    Authors: Christian Varner, Vivak Patel

    Abstract: Gradient methods are experiencing a growth in methodological and theoretical developments owing to the challenges posed by optimization problems arising in data science. However, such gradient methods face diverging optimality gaps or exploding objective evaluations when applied to optimization problems with realistic properties for data science applications. In this work, we address this gap by d… ▽ More

    Submitted 16 April, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: 24 pages, 7 figures, 3 tables, 3 algorithms

    MSC Class: 90C30; 65K05; 68T09

  39. arXiv:2309.05213  [pdf, other

    cs.LG cs.AI cs.DC

    Towards Federated Learning Under Resource Constraints via Layer-wise Training and Depth Dropout

    Authors: Pengfei Guo, Warren Richard Morningstar, Raviteja Vemulapalli, Karan Singhal, Vishal M. Patel, Philip Andrew Mansfield

    Abstract: Large machine learning models trained on diverse data have recently seen unprecedented success. Federated learning enables training on private data that may otherwise be inaccessible, such as domain-specific datasets decentralized across many clients. However, federated learning can be difficult to scale to large models when clients have limited resources. This challenge often results in a trade-o… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  40. arXiv:2308.15615  [pdf

    physics.ins-det

    Nitrogen Precooling Heat Exchanger replacement and control system upgrade in Superfluid Cryoplant at CMTF

    Authors: J. Subedi, B. Hansen, M. White, V. Patel, J. Makara, O. Atassi, G. Johnson

    Abstract: Liquid Nitrogen precooling is used in most Cryoplants to achieve cooldown to 80 K temperature range. In one such system at Fermilab's CMTF Superfluid Cryoplant, where the Helium supply directly exchanges heat with liquid Nitrogen, freezing of Nitrogen occurred inside the heat exchanger due to heat exchanger imbalance during a Cryoplant trip. Trapped vapor pockets of N2 within the frozen heat excha… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Cryogenic Eng Conf and Intnl Cryo Materials Conf (CEC/ICMC 2023)

    Report number: FERMILAB-CONF-23-379-TD

  41. arXiv:2308.04035  [pdf, other

    cs.CV

    Cross-Dataset Adaptation for Instrument Classification in Cataract Surgery Videos

    Authors: Jay N. Paranjape, Shameema Sikder, Vishal M. Patel, S. Swaroop Vedula

    Abstract: Surgical tool presence detection is an important part of the intra-operative and post-operative analysis of a surgery. State-of-the-art models, which perform this task well on a particular dataset, however, perform poorly when tested on another dataset. This occurs due to a significant domain shift between the datasets resulting from the use of different tools, sensors, data resolution etc. In thi… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: MICCAI 2023

  42. arXiv:2308.03726  [pdf, other

    cs.CV

    AdaptiveSAM: Towards Efficient Tuning of SAM for Surgical Scene Segmentation

    Authors: Jay N. Paranjape, Nithin Gopalakrishnan Nair, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

    Abstract: Segmentation is a fundamental problem in surgical scene analysis using artificial intelligence. However, the inherent data scarcity in this domain makes it challenging to adapt traditional segmentation techniques for this task. To tackle this issue, current research employs pretrained models and finetunes them on the given data. Even so, these require training deep networks with millions of parame… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 10 pages, 6 figures, 5 tables

  43. arXiv:2307.16896  [pdf, other

    cs.CV

    Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training

    Authors: Jeya Maria Jose Valanarasu, Yucheng Tang, Dong Yang, Ziyue Xu, Can Zhao, Wenqi Li, Vishal M. Patel, Bennett Landman, Daguang Xu, Yufan He, Vishwesh Nath

    Abstract: Harnessing the power of pre-training on large-scale datasets like ImageNet forms a fundamental building block for the progress of representation learning-driven solutions in computer vision. Medical images are inherently different from natural images as they are acquired in the form of many modalities (CT, MR, PET, Ultrasound etc.) and contain granulated information like tissue, lesion, organs etc…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Preprint

  44. arXiv:2307.11081  [pdf, other]

    cs.CV cs.LG

    GLSFormer: Gated - Long, Short Sequence Transformer for Step Recognition in Surgical Videos

    Authors: Nisarg A. Shah, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

    Abstract: Automated surgical step recognition is an important task that can significantly improve patient safety and decision-making during surgeries. Existing state-of-the-art methods for surgical step recognition either rely on separate, multi-stage modeling of spatial and temporal information or operate on short-range temporal resolution when learned jointly. However, the benefits of joint modeling of sp…

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted to MICCAI 2023 (Early Accept)

  45. arXiv:2307.01815  [pdf, ps, other]

    math.NT

    On perfect powers that are sums of cubes of a nine term arithmetic progression

    Authors: Nirvana Coppola, Mar Curcó-Iranzo, Maleeha Khawaja, Vandita Patel, Özge Ülkem

    Abstract: We study the equation $(x-4r)^3 + (x-3r)^3 + (x-2r)^3+(x-r)^3 + x^3 + (x+r)^3+(x+2r)^3 + (x+3r)^3 + (x+4r)^3 = y^p$, which is a natural continuation of previous works carried out by A. Argáez-García and the fourth author (perfect powers that are sums of cubes of a three, five and seven term arithmetic progression). Under the assumptions $0 < r \leq 10^6$, $p \geq 5 $ a prime and $\gcd(x, r) = 1$,…

    Submitted 19 September, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 12 pages

    MSC Class: Primary 11D61; Secondary 11D41; 11D59; 11J86; 14H52

  46. arXiv:2306.16654  [pdf, other]

    eess.IV cs.CV

    Self-Supervised MRI Reconstruction with Unrolled Diffusion Models

    Authors: Yilmaz Korkmaz, Tolga Cukur, Vishal M. Patel

    Abstract: Magnetic Resonance Imaging (MRI) produces excellent soft tissue contrast, albeit it is an inherently slow imaging modality. Promising deep learning methods have recently been proposed to reconstruct accelerated MRI scans. However, existing methods still suffer from various limitations regarding image fidelity, contextual sensitivity, and reliance on fully-sampled acquisitions for model training. T…

    Submitted 15 April, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  47. arXiv:2306.05168  [pdf, ps, other]

    math.NT

    Power values of power sums: a survey

    Authors: Nirvana Coppola, Mar Curcó-Iranzo, Maleeha Khawaja, Vandita Patel, Özge Ülkem

    Abstract: Research on power values of power sums has gained much attention of late, partially due to the explosion of refinements in multiple advanced tools in (computational) Number Theory in recent years. In this survey, we present the key tools and techniques employed thus far in the (explicit) resolution of Diophantine problems, as well as an overview of existing results. We also state some open problem…

    Submitted 27 July, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: Added additional references and open problems. This collaboration was formed from the Women in Numbers Europe 4 workshop

  48. arXiv:2305.16310  [pdf, other]

    cs.CV

    Securing Deep Generative Models with Universal Adversarial Signature

    Authors: Yu Zeng, Mo Zhou, Yuan Xue, Vishal M. Patel

    Abstract: Recent advances in deep generative models have led to the development of methods capable of synthesizing high-quality, realistic images. These models pose threats to society due to their potential misuse. Prior research attempted to mitigate these threats by detecting generated images, but the varying traces left by different generative models make it challenging to create a universal detector cap…

    Submitted 25 May, 2023; originally announced May 2023.

  49. arXiv:2305.14674  [pdf, other]

    cs.CV

    T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified Visual Modalities

    Authors: Kangfu Mei, Mo Zhou, Vishal M. Patel

    Abstract: Diffusion Probabilistic Field (DPF) models the distribution of continuous functions defined over metric spaces. While DPF shows great potential for unifying data generation of various modalities including images, videos, and 3D geometry, it does not scale to a higher data resolution. This can be attributed to the ``scaling property'', where it is difficult for the model to capture local structures…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: for project page, see https://t1-diffusion-model.github.io

  50. arXiv:2305.06402  [pdf, ps, other]

    cs.CV

    Analyzing Bias in Diffusion-based Face Generation Models

    Authors: Malsha V. Perera, Vishal M. Patel

    Abstract: Diffusion models are becoming increasingly popular in synthetic data generation and image editing applications. However, these models can amplify existing biases and propagate them to downstream applications. Therefore, it is crucial to understand the sources of bias in their outputs. In this paper, we investigate the presence of bias in diffusion-based face generation models with respect to attri…

    Submitted 10 May, 2023; originally announced May 2023.