Skip to main content

Showing 1–50 of 255 results for author: Roy, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16209  [pdf, other

    cs.CG

    Covering Simple Orthogonal Polygons with Rectangles

    Authors: Aniket Basu Roy

    Abstract: We study the problem of Covering Orthogonal Polygons with Rectangles. For polynomial-time algorithms, the best-known approximation factor is $O(\sqrt{\log n})$ when the input polygon may have holes [Kumar and Ramesh, STOC '99, SICOMP '03], and there is a $2$-factor approximation algorithm known when the polygon is hole-free [Franzblau, SIDMA '89]. Arguably, an easier problem is the Boundary Cover… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 29 pages, 19 figures

  2. arXiv:2406.15128  [pdf, other

    eess.IV cs.AI cs.CV

    A Wavelet Guided Attention Module for Skin Cancer Classification with Gradient-based Feature Fusion

    Authors: Ayush Roy, Sujan Sarkar, Sohom Ghosal, Dmitrii Kaplun, Asya Lyanova, Ram Sarkar

    Abstract: Skin cancer is a highly dangerous type of cancer that requires an accurate diagnosis from experienced physicians. To help physicians diagnose skin cancer more efficiently, a computer-aided diagnosis (CAD) system can be very helpful. In this paper, we propose a novel model, which uses a novel attention mechanism to pinpoint the differences in features across the spatial dimensions and symmetry of t… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.15117  [pdf, other

    eess.IV cs.AI cs.CV

    FA-Net: A Fuzzy Attention-aided Deep Neural Network for Pneumonia Detection in Chest X-Rays

    Authors: Ayush Roy, Anurag Bhattacharjee, Diego Oliva, Oscar Ramos-Soto, Francisco J. Alvarez-Padilla, Ram Sarkar

    Abstract: Pneumonia is a respiratory infection caused by bacteria, fungi, or viruses. It affects many people, particularly those in develo** or underdeveloped nations with high pollution levels, unhygienic living conditions, overcrowding, and insufficient medical infrastructure. Pneumonia can cause pleural effusion, where fluids fill the lungs, leading to respiratory difficulty. Early diagnosis is crucial… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2406.15113  [pdf, other

    eess.IV cs.AI cs.CV

    A Dual Attention-aided DenseNet-121 for Classification of Glaucoma from Fundus Images

    Authors: Soham Chakraborty, Ayush Roy, Payel Pramanik, Daria Valenkova, Ram Sarkar

    Abstract: Deep learning and computer vision methods are nowadays predominantly used in the field of ophthalmology. In this paper, we present an attention-aided DenseNet-121 for classifying normal and glaucomatous eyes from fundus images. It involves the convolutional block attention module to highlight relevant spatial and channel features extracted by DenseNet-121. The channel recalibration module further… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  5. arXiv:2406.10108  [pdf, other

    cs.LG cs.AI

    Precipitation Nowcasting Using Physics Informed Discriminator Generative Models

    Authors: Junzhe Yin, Cristian Meo, Ankush Roy, Zeineh Bou Cher, Yanbo Wang, Ruben Imhoff, Remko Uijlenhoet, Justin Dauwels

    Abstract: Nowcasting leverages real-time atmospheric conditions to forecast weather over short periods. State-of-the-art models, including PySTEPS, encounter difficulties in accurately forecasting extreme weather events because of their unpredictable distribution patterns. In this study, we design a physics-informed neural network to perform precipitation nowcasting using the precipitation and meteorologica… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  6. arXiv:2406.08604  [pdf, other

    eess.IV cs.CV cs.LG

    GRU-Net for breast histopathology image segmentation

    Authors: Ayush Roy, Payel Pramanik, Sohom Ghosal, Daria Valenkova, Dmitrii Kaplun, Ram Sarkar

    Abstract: Breast cancer is a major global health concern. Pathologists face challenges in analyzing complex features from pathological images, which is a time-consuming and labor-intensive task. Therefore, efficient computer-based diagnostic tools are needed for early detection and treatment planning. This paper presents a modified version of MultiResU-Net for histopathology image segmentation, which is sel… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  7. arXiv:2406.08425  [pdf, other

    cs.CV cs.AI

    AWGUNET: Attention-Aided Wavelet Guided U-Net for Nuclei Segmentation in Histopathology Images

    Authors: Ayush Roy, Payel Pramanik, Dmitrii Kaplun, Sergei Antonov, Ram Sarkar

    Abstract: Accurate nuclei segmentation in histopathological images is crucial for cancer diagnosis. Automating this process offers valuable support to clinical experts, as manual annotation is time-consuming and prone to human errors. However, automating nuclei segmentation presents challenges due to uncertain cell boundaries, intricate staining, and diverse structures. In this paper, we present a segmentat… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  8. arXiv:2406.02255  [pdf, other

    eess.AS cs.LG cs.MM cs.SD

    MidiCaps -- A large-scale MIDI dataset with text captions

    Authors: Jan Melechovsky, Abhinaba Roy, Dorien Herremans

    Abstract: Generative models guided by text prompts are increasingly becoming more popular. However, no text-to-MIDI models currently exist, mostly due to the lack of a captioned MIDI dataset. This work aims to enable research that combines LLMs with symbolic music by presenting the first large-scale MIDI dataset with text captions that is openly available: MidiCaps. MIDI (Musical Instrument Digital Interfac… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Under review

  9. arXiv:2405.19463  [pdf, other

    stat.ML cs.LG econ.EM math.OC

    Stochastic Optimization Algorithms for Instrumental Variable Regression with Streaming Data

    Authors: Xuxing Chen, Abhishek Roy, Yifan Hu, Krishnakumar Balasubramanian

    Abstract: We develop and analyze algorithms for instrumental variable regression by viewing the problem as a conditional stochastic optimization problem. In the context of least-squares instrumental variable regression, our algorithms neither require matrix inversions nor mini-batches and provides a fully online approach for performing instrumental variable regression with streaming data. When the true mode… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  10. arXiv:2405.15868  [pdf, other

    cs.NE cs.AI cs.LG

    LLS: Local Learning Rule for Deep Neural Networks Inspired by Neural Activity Synchronization

    Authors: Marco Paul E. Apolinario, Arani Roy, Kaushik Roy

    Abstract: Training deep neural networks (DNNs) using traditional backpropagation (BP) presents challenges in terms of computational complexity and energy consumption, particularly for on-device learning where computational resources are limited. Various alternatives to BP, including random feedback alignment, forward-forward, and local classifiers, have been explored to address these challenges. These metho… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 14 pages, 4 figures

  11. arXiv:2405.01421  [pdf, ps, other

    cs.IT

    Systematic Construction of Golay Complementary Sets of Arbitrary Lengths and Alphabet Sizes

    Authors: Abhishek Roy, Sudhan Majhi, Subhabrata Paul

    Abstract: One of the important applications of Golay complementary sets (GCSs) is the reduction of peak-to-mean envelope power ratio (PMEPR) in orthogonal frequency division multiplexing (OFDM) systems. OFDM has played a major role in modern wireless systems such as long-term-evolution (LTE), 5th generation (5G) wireless standards, etc. This paper searches for systematic constructions of GCSs of arbitrary l… ▽ More

    Submitted 8 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    MSC Class: 94A55; 94A15; 94D10

  12. arXiv:2404.16887  [pdf, other

    cs.LG cs.AI

    Anomaly Detection for Incident Response at Scale

    Authors: Hanzhang Wang, Gowtham Kumar Tangirala, Gilkara Pranav Naidu, Charles Mayville, Arighna Roy, Joanne Sun, Ramesh Babu Mandava

    Abstract: We present a machine learning-based anomaly detection product, AI Detect and Respond (AIDR), that monitors Walmart's business and system health in real-time. During the validation over 3 months, the product served predictions from over 3000 models to more than 25 application, platform, and operation teams, covering 63\% of major incidents and reducing the mean-time-to-detect (MTTD) by more than 7… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: ASPLOS 2024 AIOps workshop

  13. arXiv:2404.00665  [pdf, ps, other

    cs.IT

    On cumulative and relative cumulative past information generating function

    Authors: Santosh Kumar Chaudhary, Nitin Gupta, Achintya Roy

    Abstract: In this paper, we introduce the cumulative past information generating function (CPIG) and relative cumulative past information generating function (RCPIG). We study its properties. We establish its relation with generalized cumulative past entropy (GCPE). We defined CPIG stochastic order and its relation with dispersive order. We provide the results for the CPIG measure of the convoluted random v… ▽ More

    Submitted 22 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  14. arXiv:2403.20317  [pdf, other

    cs.CV

    Convolutional Prompting meets Language Models for Continual Learning

    Authors: Anurag Roy, Riddhiman Moulick, Vinay K. Verma, Saptarshi Ghosh, Abir Das

    Abstract: Continual Learning (CL) enables machine learning models to learn from continuously shifting new training data in absence of data from old tasks. Recently, pretrained vision transformers combined with prompt tuning have shown promise for overcoming catastrophic forgetting in CL. These approaches rely on a pool of learnable prompts which can be inefficient in sharing knowledge across tasks leading t… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: CVPR 2024 Camera Ready

  15. arXiv:2403.19837  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.LO

    Concept-based Analysis of Neural Networks via Vision-Language Models

    Authors: Ravi Mangal, Nina Narodytska, Divya Gopinath, Boyue Caroline Hu, Anirban Roy, Susmit Jha, Corina Pasareanu

    Abstract: The analysis of vision-based deep neural networks (DNNs) is highly desirable but it is very challenging due to the difficulty of expressing formal specifications for vision tasks and the lack of efficient verification procedures. In this paper, we propose to leverage emerging multimodal, vision-language, foundation models (VLMs) as a lens through which we can reason about vision models. VLMs have… ▽ More

    Submitted 10 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  16. How do Older Adults Set Up Voice Assistants? Lessons Learned from a Deployment Experience for Older Adults to Set Up Standalone Voice Assistants

    Authors: Chen Chen, Ella T. Lifset, Yichen Han, Arkajyoti Roy, Michael Hogarth, Alison A. Moore, Emilia Farcas, Nadir Weibel

    Abstract: While standalone Voice Assistants (VAs) are promising to support older adults' daily routine and wellbeing management, onboarding and setting up these devices can be challenging. Although some older adults choose to seek assistance from technicians and adult children, easy set up processes that facilitate independent use are still critical, especially for those who do not have access to external r… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 5 pages, 1 figure, 1 table, Companion Publication of the 2023 ACM Designing Interactive Systems Conference, July 2023, Pages 164-168

    ACM Class: J.0; J.3; J.4

  17. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  18. arXiv:2403.03929  [pdf, other

    cs.LG cs.AI

    Extreme Precipitation Nowcasting using Transformer-based Generative Models

    Authors: Cristian Meo, Ankush Roy, Mircea Lică, Junzhe Yin, Zeineb Bou Che, Yanbo Wang, Ruben Imhoff, Remko Uijlenhoet, Justin Dauwels

    Abstract: This paper presents an innovative approach to extreme precipitation nowcasting by employing Transformer-based generative models, namely NowcastingGPT with Extreme Value Loss (EVL) regularization. Leveraging a comprehensive dataset from the Royal Netherlands Meteorological Institute (KNMI), our study focuses on predicting short-term precipitation with high accuracy. We introduce a novel method for… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  19. arXiv:2403.01550  [pdf, other

    math.CO cs.DM math.NT math.SP

    Spectral antisymmetry of twisted graph adjacency

    Authors: Ye Luo, Arindam Roy

    Abstract: We address a prime counting problem across the homology classes of a graph, presenting a graph-theoretical Dirichlet-type analogue of the prime number theorem. The main machinery we have developed and employed is a spectral antisymmetry theorem, revealing that the spectra of the twisted graph adjacency matrices have an antisymmetric distribution over the character group of the graph. Additionally,… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 5 figures

    MSC Class: 05C50; 05C38; 11M41

  20. arXiv:2402.04541  [pdf, other

    cs.CV

    BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception

    Authors: Aniket Roy, Anirban Roy, Soma Mitra, Kuntal Ghosh

    Abstract: Visual illusions play a significant role in understanding visual perception. Current methods in understanding and evaluating visual illusions are mostly deterministic filtering based approach and they evaluate on a handful of visual illusions, and the conclusions therefore, are not generic. To this end, we generate a large-scale dataset of 22,366 images (BRI3L: BRightness Illusion Image dataset fo… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  21. arXiv:2401.12032  [pdf, other

    cs.HC cs.AI

    MINT: A wrapper to make multi-modal and multi-image AI models interactive

    Authors: Jan Freyberg, Abhijit Guha Roy, Terry Spitz, Beverly Freeman, Mike Schaekermann, Patricia Strachan, Eva Schnider, Renee Wong, Dale R Webster, Alan Karthikesalingam, Yun Liu, Krishnamurthy Dvijotham, Umesh Telang

    Abstract: During the diagnostic process, doctors incorporate multimodal information including imaging and the medical history - and similarly medical AI development has increasingly become multimodal. In this paper we tackle a more subtle challenge: doctors take a targeted medical history to obtain only the most pertinent pieces of information; how do we enable AI to do the same? We develop a wrapper method… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 15 pages, 7 figures

  22. arXiv:2401.11735  [pdf, other

    cs.CR

    zkLogin: Privacy-Preserving Blockchain Authentication with Existing Credentials

    Authors: Foteini Baldimtsi, Konstantinos Kryptos Chalkias, Yan Ji, Jonas Lindstrøm, Deepak Maram, Ben Riva, Arnab Roy, Mahdi Sedaghat, Joy Wang

    Abstract: For many users, a private key based wallet serves as the primary entry point to blockchains. Commonly recommended wallet authentication methods, such as mnemonics or hardware wallets, can be cumbersome. This difficulty in user onboarding has significantly hindered the adoption of blockchain-based applications. We develop zkLogin, a novel technique that leverages identity tokens issued by popular… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  23. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  24. arXiv:2311.10571  [pdf, other

    stat.ML cs.LG stat.CO

    Direct Amortized Likelihood Ratio Estimation

    Authors: Adam D. Cobb, Brian Matejek, Daniel Elenius, Anirban Roy, Susmit Jha

    Abstract: We introduce a new amortized likelihood ratio estimator for likelihood-free simulation-based inference (SBI). Our estimator is simple to train and estimates the likelihood ratio using a single forward pass of the neural estimator. Our approach directly computes the likelihood ratio between two competing parameter sets which is different from the previous approach of comparing two neural network ou… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 12 Pages, 10 Figures, GitHub: https://github.com/SRI-CSL/dnre

  25. arXiv:2311.09753  [pdf, other

    cs.CV

    DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics

    Authors: Aniket Roy, Maiterya Suin, Anshul Shah, Ketul Shah, Jiang Liu, Rama Chellappa

    Abstract: Diffusion models have advanced generative AI significantly in terms of editing and creating naturalistic images. However, efficiently improving generated image quality is still of paramount interest. In this context, we propose a generic "naturalness" preserving loss function, viz., kurtosis concentration (KC) loss, which can be readily applied to any standard diffusion model pipeline to elevate t… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  26. Stacked Autoencoder Based Feature Extraction and Superpixel Generation for Multifrequency PolSAR Image Classification

    Authors: Tushar Gadhiya, Sumanth Tangirala, Anil K. Roy

    Abstract: In this paper we are proposing classification algorithm for multifrequency Polarimetric Synthetic Aperture Radar (PolSAR) image. Using PolSAR decomposition algorithms 33 features are extracted from each frequency band of the given image. Then, a two-layer autoencoder is used to reduce the dimensionality of input feature vector while retaining useful features of the input. This reduced dimensional… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Journal ref: Pattern Recognition and Machine Intelligence: 8th International Conference, PReMI 2019, Tezpur, India, December 17-20, 2019, Proceedings, Part II, Dec 2019, Pages 331-339

  27. arXiv:2310.15055  [pdf, other

    cs.CL cs.AI cs.HC

    Towards Conceptualization of "Fair Explanation": Disparate Impacts of anti-Asian Hate Speech Explanations on Content Moderators

    Authors: Tin Nguyen, Jiannan Xu, Aayushi Roy, Hal Daumé III, Marine Carpuat

    Abstract: Recent research at the intersection of AI explainability and fairness has focused on how explanations can improve human-plus-AI task performance as assessed by fairness measures. We propose to characterize what constitutes an explanation that is itself "fair" -- an explanation that does not adversely impact specific populations. We formulate a novel evaluation method of "fair explanations" using n… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference (Long Paper)

  28. arXiv:2310.13746  [pdf, other

    cs.LG cs.CY

    FairBranch: Fairness Conflict Correction on Task-group Branches for Fair Multi-Task Learning

    Authors: Arjun Roy, Christos Koutlis, Symeon Papadopoulos, Eirini Ntoutsi

    Abstract: The generalization capacity of Multi-Task Learning (MTL) becomes limited when unrelated tasks negatively impact each other by updating shared parameters with conflicting gradients, resulting in negative transfer and a reduction in MTL accuracy compared to single-task learning (STL). Recently, there has been an increasing focus on the fairness of MTL models, necessitating the optimization of both a… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  29. arXiv:2310.11049  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation

    Authors: Shubham Kumar Nigam, Aniket Deroy, Noel Shallum, Ayush Kumar Mishra, Anup Roy, Shubham Kumar Mishra, Arnab Bhattacharya, Saptarshi Ghosh, Kripabandhu Ghosh

    Abstract: This paper describes our submission to the SemEval-2023 for Task 6 on LegalEval: Understanding Legal Texts. Our submission concentrated on three subtasks: Legal Named Entity Recognition (L-NER) for Task-B, Legal Judgment Prediction (LJP) for Task-C1, and Court Judgment Prediction with Explanation (CJPE) for Task-C2. We conducted various experiments on these subtasks and presented the results in de… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Journal ref: https://aclanthology.org/2023.semeval-1.180

  30. arXiv:2310.00116  [pdf, other

    cs.LG cs.AI

    Certified Robustness via Dynamic Margin Maximization and Improved Lipschitz Regularization

    Authors: Mahyar Fazlyab, Taha Entesari, Aniket Roy, Rama Chellappa

    Abstract: To improve the robustness of deep classifiers against adversarial perturbations, many approaches have been proposed, such as designing new architectures with better robustness properties (e.g., Lipschitz-capped networks), or modifying the training process itself (e.g., min-max optimization, constrained learning, or regularization). These approaches, however, might not be effective at increasing th… ▽ More

    Submitted 12 March, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  31. arXiv:2308.11357  [pdf, other

    cs.CV

    Exemplar-Free Continual Transformer with Convolutions

    Authors: Anurag Roy, Vinay Kumar Verma, Sravan Voonna, Kripabandhu Ghosh, Saptarshi Ghosh, Abir Das

    Abstract: Continual Learning (CL) involves training a machine learning model in a sequential manner to learn new information while retaining previously learned tasks without the presence of previous training data. Although there has been significant interest in CL, most recent CL approaches in computer vision have focused on convolutional architectures only. However, with the recent success of vision transf… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted in ICCV 2023

  32. arXiv:2308.06338  [pdf, other

    cs.LG cs.CC math.AP math.NA

    Size Lowerbounds for Deep Operator Networks

    Authors: Anirbit Mukherjee, Amartya Roy

    Abstract: Deep Operator Networks are an increasingly popular paradigm for solving regression in infinite dimensions and hence solve families of PDEs in one shot. In this work, we aim to establish a first-of-its-kind data-dependent lowerbound on the size of DeepONets required for them to be able to reduce empirical error on noisy data. In particular, we show that for low training errors to be obtained on… ▽ More

    Submitted 23 February, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: 25 pages, 13 figures

    Journal ref: Published in Transactions on Machine Learning Research (TMLR) in February 2024

  33. arXiv:2308.03906  [pdf, other

    cs.CV

    TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models

    Authors: Indranil Sur, Karan Sikka, Matthew Walmer, Kaushik Koneripalli, Anirban Roy, Xiao Lin, Ajay Divakaran, Susmit Jha

    Abstract: We present a Multimodal Backdoor Defense technique TIJO (Trigger Inversion using Joint Optimization). Recent work arXiv:2112.07668 has demonstrated successful backdoor attacks on multimodal models for the Visual Question Answering task. Their dual-key backdoor trigger is split across two modalities (image and text), such that the backdoor is activated if and only if the trigger is present in both… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Published as conference paper at ICCV 2023. 13 pages, 6 figures, 7 tables

  34. arXiv:2308.02145  [pdf, other

    math.OC cs.LG

    Optimization on Pareto sets: On a theory of multi-objective optimization

    Authors: Abhishek Roy, Geelon So, Yi-An Ma

    Abstract: In multi-objective optimization, a single decision vector must balance the trade-offs between many objectives. Solutions achieving an optimal trade-off are said to be Pareto optimal: these are decision vectors for which improving any one objective must come at a cost to another. But as the set of Pareto optimal vectors can be very large, we further consider a more practically significant Pareto-co… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  35. arXiv:2308.01481  [pdf, other

    math.ST cs.LG math.OC stat.ML

    Online covariance estimation for stochastic gradient descent under Markovian sampling

    Authors: Abhishek Roy, Krishnakumar Balasubramanian

    Abstract: We investigate the online overlap** batch-means covariance estimator for Stochastic Gradient Descent (SGD) under Markovian sampling. Convergence rates of order $O\big(\sqrt{d}\,n^{-1/8}(\log n)^{1/4}\big)$ and $O\big(\sqrt{d}\,n^{-1/8}\big)$ are established under state-dependent and state-independent Markovian sampling, respectively, where $d$ is the dimensionality and $n$ denotes observations o… ▽ More

    Submitted 5 November, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  36. arXiv:2308.00215  [pdf, other

    cs.CY

    From Talent Shortage to Workforce Excellence in the CHIPS Act Era: Harnessing Industry 4.0 Paradigms for a Sustainable Future in Domestic Chip Production

    Authors: Aida Damanpak Rizi, Antika Roy, Rouhan Noor, Hyo Kang, Nitin Varshney, Katja Jacob, Sindia Rivera-Jimenez, Nathan Edwards, Volker J. Sorger, Hamed Dalir, Navid Asadizanjani

    Abstract: The CHIPS Act is driving the U.S. towards a self-sustainable future in domestic chip production. Decades of outsourced manufacturing, assembly, testing, and packaging has diminished the workforce ecosystem, imposing major limitations on semiconductor companies racing to build new fabrication sites as part of the CHIPS Act. In response, a systemic alliance between academic institutions, the industr… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: 18 pages, 8 figures

  37. arXiv:2307.10060  [pdf, other

    physics.flu-dyn cs.AI cs.LG cs.NE physics.comp-ph

    Accurate deep learning sub-grid scale models for large eddy simulations

    Authors: Rikhi Bose, Arunabha M. Roy

    Abstract: We present two families of sub-grid scale (SGS) turbulence models developed for large-eddy simulation (LES) purposes. Their development required the formulation of physics-informed robust and efficient Deep Learning (DL) algorithms which, unlike state-of-the-art analytical modeling techniques can produce high-order complex non-linear relations between inputs and outputs. Explicit filtering of data… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  38. arXiv:2307.09302  [pdf, other

    cs.LG cs.CV stat.ME stat.ML

    Conformal prediction under ambiguous ground truth

    Authors: David Stutz, Abhijit Guha Roy, Tatiana Matejovicova, Patricia Strachan, Ali Taylan Cemgil, Arnaud Doucet

    Abstract: Conformal Prediction (CP) allows to perform rigorous uncertainty quantification by constructing a prediction set $C(X)$ satisfying $\mathbb{P}(Y \in C(X))\geq 1-α$ for a user-chosen $α\in [0,1]$ by relying on calibration data $(X_1,Y_1),...,(X_n,Y_n)$ from $\mathbb{P}=\mathbb{P}^{X} \otimes \mathbb{P}^{Y|X}$. It is typically implicitly assumed that $\mathbb{P}^{Y|X}$ is the "true" posterior label… ▽ More

    Submitted 24 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  39. Screen or No Screen? Lessons Learnt from a Real-World Deployment Study of Using Voice Assistants With and Without Touchscreen for Older Adults

    Authors: Chen Chen, Ella T. Lifset, Yichen Han, Arkajyoti Roy, Michael Hogarth, Alison A. Moore, Emilia Farcas, Nadir Weibel

    Abstract: While voice user interfaces offer increased accessibility due to hands-free and eyes-free interactions, older adults often have challenges such as constructing structured requests and perceiving how such devices operate. Voice-first user interfaces have the potential to address these challenges by enabling multimodal interactions. Standalone voice + touchscreen Voice Assistants (VAs), such as Echo… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 25 pages

    ACM Class: J.3; J.4; K.4

  40. arXiv:2307.06691  [pdf, other

    cs.FL cs.LO

    A Local-Time Semantics for Negotiations

    Authors: Madhavan Mukund, Adwitee Roy, B Srivathsan

    Abstract: Negotiations, introduced by Esparza et al., are a model for concurrent systems where computations involving a set of agents are described in terms of their interactions. In many situations, it is natural to impose timing constraints between interactions -- for instance, to limit the time available to enter the PIN after inserting a card into an ATM. To model this, we introduce a real-time aspect t… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: A shorter version appears in FORMATS 2023

    MSC Class: 68Q60 ACM Class: F.4.3

  41. arXiv:2307.02191  [pdf, other

    cs.LG cs.CV stat.ME stat.ML

    Evaluating AI systems under uncertain ground truth: a case study in dermatology

    Authors: David Stutz, Ali Taylan Cemgil, Abhijit Guha Roy, Tatiana Matejovicova, Melih Barsbey, Patricia Strachan, Mike Schaekermann, Jan Freyberg, Rajeev Rikhye, Beverly Freeman, Javier Perez Matos, Umesh Telang, Dale R. Webster, Yuan Liu, Greg S. Corrado, Yossi Matias, Pushmeet Kohli, Yun Liu, Arnaud Doucet, Alan Karthikesalingam

    Abstract: For safety, AI systems in health undergo thorough evaluations before deployment, validating their predictions against a ground truth that is assumed certain. However, this is actually not the case and the ground truth may be uncertain. Unfortunately, this is largely ignored in standard evaluation of AI models but can have severe consequences such as overestimating the future performance. To avoid… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  42. arXiv:2306.05562  [pdf, other

    cs.RO cs.AI cs.CE

    AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs

    Authors: Adam D. Cobb, Anirban Roy, Daniel Elenius, F. Michael Heim, Brian Swenson, Sydney Whittington, James D. Walker, Theodore Bapty, Joseph Hite, Karthik Ramani, Christopher McComb, Susmit Jha

    Abstract: We present AircraftVerse, a publicly available aerial vehicle design dataset. Aircraft design encompasses different physics domains and, hence, multiple modalities of representation. The evaluation of these cyber-physical system (CPS) designs requires the use of scientific analytical and simulation models ranging from computer-aided design tools for structural and manufacturing analysis, computati… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: The dataset is hosted at https://zenodo.org/record/6525446, baseline models and code at https://github.com/SRI-CSL/AircraftVerse, and the dataset description at https://aircraftverse.onrender.com/

  43. GAD-NR: Graph Anomaly Detection via Neighborhood Reconstruction

    Authors: Amit Roy, Juan Shu, Jia Li, Carl Yang, Olivier Elshocht, Jeroen Smeets, Pan Li

    Abstract: Graph Anomaly Detection (GAD) is a technique used to identify abnormal nodes within graphs, finding applications in network security, fraud detection, social media spam detection, and various other domains. A common method for GAD is Graph Auto-Encoders (GAEs), which encode graph data into node representations and identify anomalies by assessing the reconstruction quality of the graphs based on th… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted at the 17th ACM International Conference on Web Search and Data Mining (WSDM-2024)

    Journal ref: The 17th ACM International Conference on Web Search and Data Mining (WSDM-2024)

  44. arXiv:2305.10403  [pdf, other

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yan** Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yu**g Zhang, Gustavo Hernandez Abrego , et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  45. arXiv:2305.09034  [pdf, other

    cs.DC

    Blizzard: Adding True Persistence to Main Memory Data Structures

    Authors: Pradeep Fernando, Daniel Zahka, Ada Gavrilovska, Amitabha Roy, Subramanya R. Dulloor

    Abstract: Persistent memory (PMEM) devices present an opportunity to retain the flexibility of main memory data structures and algorithms, but augment them with reliability and persistence. The challenge in doing this is to combine replication (for reliability) and failure atomicity (for persistence) with concurrency (for fully utilizing persistent memory bandwidth). These requirements are at odds due to th… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 14 pages. 10 figures. Blizzard code repo : https://github.com/pradeepfn/Blizzard

  46. arXiv:2305.04698  [pdf, ps, other

    cs.IT

    Direct Constructions of Multiple Shift Complementary Sets of Flexible Lengths

    Authors: Abhishek Roy, Sudhan Majhi

    Abstract: Golay complementary set (GCS) plays a vital role in reducing peak-to-mean envelope power ratio (PMEPR) in orthogonal frequency division multiplexing (OFDM). A more general version of GCS is a multiple shift complementary set (MSCS), where by relaxing the condition of zero auto-correlation sum throughout all the non-zero time shifts to the integer multiples of some fixed time shift, more sequence s… ▽ More

    Submitted 10 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

  47. arXiv:2305.03180  [pdf, other

    cs.CG

    On Range Summary Queries

    Authors: Peyman Afshani, **an Cheng, Aniket Basu Roy, Zhewei Wei

    Abstract: We study the query version of the approximate heavy hitter and quantile problems. In the former problem, the input is a parameter $\varepsilon$ and a set $P$ of $n$ points in $\mathbb{R}^d$ where each point is assigned a color from a set $C$, and we want to build a structure s.t. given any geometric range $γ$, we can efficiently find a list of approximate heavy hitters in $γ\cap P$, i.e., colors t… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: To appear in ICALP'23

  48. arXiv:2304.13950  [pdf, other

    stat.ML cs.CY cs.LG

    Fairness Uncertainty Quantification: How certain are you that the model is fair?

    Authors: Abhishek Roy, Prasant Mohapatra

    Abstract: Fairness-aware machine learning has garnered significant attention in recent years because of extensive use of machine learning in sensitive applications like judiciary systems. Various heuristics, and optimization frameworks have been proposed to enforce fairness in classification \cite{del2020review} where the later approaches either provides empirical results or provides fairness guarantee for… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  49. arXiv:2304.09218  [pdf, other

    cs.CV

    Generative models improve fairness of medical classifiers under distribution shifts

    Authors: Ira Ktena, Olivia Wiles, Isabela Albuquerque, Sylvestre-Alvise Rebuffi, Ryutaro Tanno, Abhijit Guha Roy, Shekoofeh Azizi, Danielle Belgrave, Pushmeet Kohli, Alan Karthikesalingam, Taylan Cemgil, Sven Gowal

    Abstract: A ubiquitous challenge in machine learning is the problem of domain generalisation. This can exacerbate bias against groups or labels that are underrepresented in the datasets used for model development. Model bias can lead to unintended harms, especially in safety-critical applications like healthcare. Furthermore, the challenge is compounded by the difficulty of obtaining labelled data due to hi… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  50. arXiv:2304.07555  [pdf, other

    cs.DC

    SerPyTor: A distributed context-aware computational graph execution framework for durable execution

    Authors: Anuran Roy, Sridhar Raj S

    Abstract: Distributed computation is always a tricky topic to deal with, especially in context of various requirements in various scenarios. A popular solution is to use Apache Spark with a setup of multiple systems forming a cluster. However, the prerequisite setup for a Spark cluster often induces an additional overhead, often limiting usage in constrained scenarios, especially in scenarios requiring cont… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: 5 pages, 2 figures