Showing 1–50 of 192 results for author: Sharma, P

Searching in archive cs.
  1. arXiv:2407.00186  [pdf]

    eess.IV cs.CV cs.LG

    DCSM 2.0: Deep Conditional Shape Models for Data Efficient Segmentation

    Authors: Athira J Jacob, Puneet Sharma, Daniel Rueckert

    Abstract: Segmentation is often the first step in many medical image analysis workflows. Deep learning approaches, while giving state-of-the-art accuracies, are data intensive and do not scale well to low data regimes. We introduce Deep Conditional Shape Models 2.0, which uses an edge detector, along with an implicit shape function conditioned on edge maps, to leverage cross-modality shape information. The…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Best oral paper award at ISBI 2024

  2. arXiv:2406.13248  [pdf, other]

    cs.IT eess.SP

    Overlay Space-Air-Ground Integrated Networks with SWIPT-Empowered Aerial Communications

    Authors: Anuradha Verma, Pankaj Kumar Sharma, Pawan Kumar, Dong In Kim

    Abstract: In this article, we consider overlay space-air-ground integrated networks (OSAGINs) where a low earth orbit (LEO) satellite communicates with ground users (GUs) with the assistance of an energy-constrained coexisting air-to-air (A2A) network. Particularly, a non-linear energy harvester with a hybrid SWIPT utilizing both power-splitting and time-switching energy harvesting (EH) techniques is employ…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 36 pages, 14 figures, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2406.06221  [pdf, other]

    cs.PL

    Synchronous Programming with Refinement Types

    Authors: Jiawei Chen, José Luiz Vargas de Mendonça, Bereket Shimels Ayele, Bereket Ngussie Bekele, Shayan Jalili, Pranjal Sharma, Nicholas Wohlfeil, Yicheng Zhang, Jean-Baptiste Jeannin

    Abstract: Cyber-Physical Systems (CPS) consist of software interacting with the physical world, such as robots, vehicles, and industrial processes. CPS are frequently responsible for the safety of lives, property, or the environment, and so software correctness must be determined with a high degree of certainty. To that end, simply testing a CPS is insufficient, as its interactions with the physical world m…

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2406.00302  [pdf, other]

    cs.LG cs.DC

    FedAST: Federated Asynchronous Simultaneous Training

    Authors: Baris Askin, Pranay Sharma, Carlee Joe-Wong, Gauri Joshi

    Abstract: Federated Learning (FL) enables edge devices or clients to collaboratively train machine learning (ML) models without sharing their private data. Much of the existing work in FL focuses on efficiently learning a model for a single task. In this paper, we study simultaneous training of multiple FL models using a common set of clients. The few existing simultaneous training methods employ synchronou…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted to UAI 2024

  5. arXiv:2405.15548  [pdf, other]

    cs.NI cs.ET

    UAV-assisted C-RAN for On-demand Cellular Coverage: Opportunities and Challenges

    Authors: Byomakesh Mahapatra, Deepika Gupta, Pankaj Kumar Sharma

    Abstract: The deployment of beyond fifth-generation (5G) infrastructure over disaster-affected regions, temporary hotspot situations (e.g., massive gatherings), and complex terrains (e.g., sea, hills, marshes) poses numerous challenges for cellular service providers. Recently, unmanned aerial vehicles (UAVs) have emerged as potential candidates to overcome the aforementioned technical issues based o…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 15 pages, 4 figures, 2 Tables, Submitted for possible publication as a magazine article

  6. arXiv:2405.01409  [pdf, other]

    cs.CV cs.AI

    Goal-conditioned reinforcement learning for ultrasound navigation guidance

    Authors: Abdoul Aziz Amadou, Vivek Singh, Florin C. Ghesu, Young-Ho Kim, Laura Stanciulescu, Harshitha P. Sai, Puneet Sharma, Alistair Young, Ronak Rajani, Kawal Rhode

    Abstract: Transesophageal echocardiography (TEE) plays a pivotal role in cardiology for diagnostic and interventional procedures. However, using it effectively requires extensive training due to the intricate nature of image acquisition and interpretation. To enhance the efficiency of novice sonographers and reduce variability in scan acquisitions, we propose a novel ultrasound (US) navigation assistance me…

    Submitted 22 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 11 pages, 3 figures

    ACM Class: I.4.0; I.5.0

  7. arXiv:2405.01156  [pdf, other]

    cs.CV cs.AI

    Self-Supervised Learning for Interventional Image Analytics: Towards Robust Device Trackers

    Authors: Saahil Islam, Venkatesh N. Murthy, Dominik Neumann, Badhan Kumar Das, Puneet Sharma, Andreas Maier, Dorin Comaniciu, Florin C. Ghesu

    Abstract: Accurate detection and tracking of devices such as guiding catheters in live X-ray image acquisitions is an essential prerequisite for endovascular cardiac interventions. This information is leveraged for procedural guidance, e.g., directing stent placements. To ensure procedural safety and efficacy, there is a need for high robustness, with no failures during tracking. To achieve that, one needs to…

    Submitted 2 May, 2024; originally announced May 2024.

  8. arXiv:2404.13242  [pdf, other]

    cs.NI

    5G-WAVE: A Core Network Framework with Decentralized Authorization for Network Slices

    Authors: Pragya Sharma, Tolga Atalay, Hans-Andrew Gibbs, Dragoslav Stojadinovic, Angelos Stavrou, Haining Wang

    Abstract: 5G mobile networks leverage Network Function Virtualization (NFV) to offer services in the form of network slices. Each network slice is a logically isolated fragment constructed by service chaining a set of Virtual Network Functions (VNFs). The Network Repository Function (NRF) acts as a central OpenAuthorization (OAuth) 2.0 server to secure inter-VNF communications resulting in a single point of…

    Submitted 19 April, 2024; originally announced April 2024.

  9. arXiv:2404.06425  [pdf, other]

    cs.CV

    ZeST: Zero-Shot Material Transfer from a Single Image

    Authors: Ta-Ying Cheng, Prafull Sharma, Andrew Markham, Niki Trigoni, Varun Jampani

    Abstract: We propose ZeST, a method for zero-shot material transfer to an object in the input image given a material exemplar image. ZeST leverages existing diffusion adapters to extract an implicit material representation from the exemplar image. This representation is used to transfer the material using a pre-trained inpainting diffusion model on the object in the input image using depth estimates as geometry…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Project Page: https://ttchengab.github.io/zest

  10. arXiv:2403.19708  [pdf, other]

    cs.CL cs.LG

    Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention

    Authors: Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo

    Abstract: Interacting with humans through multi-turn conversations is a fundamental feature of large language models (LLMs). However, existing LLM serving engines executing multi-turn conversations are inefficient due to the need to repeatedly compute the key-value (KV) caches of historical tokens, incurring high serving costs. To address the problem, this paper proposes CachedAttention, a new attention mec…

    Submitted 30 June, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted to USENIX Annual Technical Conference (ATC) 2024

  11. arXiv:2402.18102  [pdf, other]

    eess.IV cs.CV

    Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging

    Authors: Bhargav Ghanekar, Salman Siddique Khan, Pranav Sharma, Shreyas Singh, Vivek Boominathan, Kaushik Mitra, Ashok Veeraraghavan

    Abstract: Passive, compact, single-shot 3D sensing is useful in many application areas such as microscopy, medical imaging, surgical navigation, and autonomous driving where form factor, time, and power constraints can exist. Obtaining RGB-D scene information over a short imaging distance, in an ultra-compact form factor, and in a passive, snapshot manner is challenging. Dual-pixel (DP) sensors are a potent…

    Submitted 30 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  12. arXiv:2402.16354  [pdf, other]

    cs.LG cs.AI cs.CL

    Language-guided Skill Learning with Temporal Variational Inference

    Authors: Haotian Fu, Pratyusha Sharma, Elias Stengel-Eskin, George Konidaris, Nicolas Le Roux, Marc-Alexandre Côté, Xingdi Yuan

    Abstract: We present an algorithm for skill discovery from expert demonstrations. The algorithm first utilizes Large Language Models (LLMs) to propose an initial segmentation of the trajectories. Following that, a hierarchical variational inference framework incorporates the LLM-generated segmentation information to discover reusable skills by merging trajectory segments. To further control the trade-off be…

    Submitted 27 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  13. arXiv:2402.09553  [pdf, other]

    cs.AI cs.LG stat.ML

    Statistical and Machine Learning Models for Predicting Fire and Other Emergency Events

    Authors: Dilli Prasad Sharma, Nasim Beigi-Mohammadi, Hongxiang Geng, Dawn Dixon, Rob Madro, Phil Emmenegger, Carlos Tobar, Jeff Li, Alberto Leon-Garcia

    Abstract: Emergency events in a city cause considerable economic loss to individuals, their families, and the community. Accurate and timely prediction of events can help the emergency fire and rescue services in preparing for and mitigating the consequences of emergency events. In this paper, we present a systematic development of predictive models for various types of emergency events in the City of Edmon…

    Submitted 14 February, 2024; originally announced February 2024.

    Journal ref: IEEE Access 12 (2024) 56880-56909

  14. arXiv:2401.09856  [pdf, other]

    cs.NI

    EDAF: An End-to-End Delay Analytics Framework for 5G-and-Beyond Networks

    Authors: Samie Mostafavi, Marius Tillner, Gourav Prateek Sharma, James Gross

    Abstract: Supporting applications in emerging domains like cyber-physical systems and human-in-the-loop scenarios typically requires adherence to strict end-to-end delay guarantees. Contributions of many tandem processes unfolding layer by layer within the wireless network result in violations of delay constraints, thereby severely degrading application performance. Meeting the application's stringent requi…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Submitted to the 11th International Workshop on Computer and Networking Experimental Research using Testbeds (CNERT 2024)

  15. arXiv:2401.01862  [pdf, other]

    cs.CV cs.CL cs.LG

    A Vision Check-up for Language Models

    Authors: Pratyusha Sharma, Tamar Rott Shaham, Manel Baradad, Stephanie Fu, Adrian Rodriguez-Munoz, Shivam Duggal, Phillip Isola, Antonio Torralba

    Abstract: What does learning to model relationships between strings teach large language models (LLMs) about the visual world? We systematically evaluate LLMs' abilities to generate and recognize an assortment of visual concepts of increasing complexity and then demonstrate how a preliminary visual representation learning system can be trained using models of text. As language models lack the ability to con…

    Submitted 3 January, 2024; originally announced January 2024.

  16. arXiv:2312.13558  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

    Authors: Pratyusha Sharma, Jordan T. Ash, Dipendra Misra

    Abstract: Transformer-based Large Language Models (LLMs) have become a fixture in modern machine learning. Correspondingly, significant resources are allocated towards research that aims to further advance this technology, typically resulting in models of increasing size that are trained on increasing amounts of data. This work, however, demonstrates the surprising result that it is often possible to signif…

    Submitted 20 December, 2023; originally announced December 2023.

  17. arXiv:2312.11593  [pdf, other]

    cs.CV

    Towards Establishing Dense Correspondence on Multiview Coronary Angiography: From Point-to-Point to Curve-to-Curve Query Matching

    Authors: Yifan Wu, Rohit Jena, Mehmet Gulsun, Vivek Singh, Puneet Sharma, James C. Gee

    Abstract: Coronary angiography is the gold standard imaging technique for studying and diagnosing coronary artery disease. However, the resulting 2D X-ray projections lose 3D information and exhibit visual ambiguities. In this work, we aim to establish dense correspondence in multi-view angiography, serving as a fundamental basis for various clinical applications and downstream tasks. To overcome the challe…

    Submitted 18 December, 2023; originally announced December 2023.

  18. arXiv:2312.08566  [pdf, other]

    cs.AI cs.CL cs.RO

    Learning adaptive planning representations with natural language guidance

    Authors: Lionel Wong, Jiayuan Mao, Pratyusha Sharma, Zachary S. Siegel, Jiahai Feng, Noa Korneev, Joshua B. Tenenbaum, Jacob Andreas

    Abstract: Effective planning in the real world requires not only world knowledge, but the ability to leverage that knowledge to build the right representation of the task at hand. Decades of hierarchical planning techniques have used domain-specific temporal action abstractions to support efficient and accurate planning, almost always relying on human priors and domain knowledge to decompose hard tasks into…

    Submitted 13 December, 2023; originally announced December 2023.

  19. arXiv:2312.08109  [pdf, ps, other]

    cs.IT

    Construction of $(σ,δ)$-cyclic codes over a non-chain ring and their applications in DNA codes

    Authors: Ashutosh Singh, Priyanka Sharma, Om Prakash

    Abstract: For a prime $p$ and a positive integer $m$, let $\mathbb{F}_{p^m}$ be the finite field of characteristic $p$, and $\mathfrak{R}_l:=\mathbb{F}_{p^m}[v]/\langle v^l-v\rangle$ be a non-chain ring. In this paper, we study the $(σ,δ)$-cyclic codes over $\mathfrak{R}_l$. Further, we study the application of these codes in finding DNA codes. Towards this, we first define a Gray map to find classical code…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 12 pages

    MSC Class: 94B05; 94B15; 94B60

  20. arXiv:2312.02970  [pdf, other]

    cs.CV cs.AI cs.GR

    Alchemist: Parametric Control of Material Properties with Diffusion Models

    Authors: Prafull Sharma, Varun Jampani, Yuanzhen Li, Xuhui Jia, Dmitry Lagun, Fredo Durand, William T. Freeman, Mark Matthews

    Abstract: We propose a method to control material attributes of objects like roughness, metallic, albedo, and transparency in real images. Our method capitalizes on the generative prior of text-to-image models known for photorealism, employing a scalar value and instructions to alter low-level material properties. Addressing the lack of datasets with controlled material attributes, we generated an object-ce…

    Submitted 5 December, 2023; originally announced December 2023.

  21. arXiv:2311.06323  [pdf, ps, other]

    cs.IR cs.AI cs.LG

    Reviewing Developments of Graph Convolutional Network Techniques for Recommendation Systems

    Authors: Haojun Zhu, Vikram Kapoor, Priya Sharma

    Abstract: The recommender system is a vital information service on today's Internet. Recently, graph neural networks have emerged as the leading approach for recommender systems. We try to review recent literature on graph neural network-based recommender systems, covering the background and development of both recommender systems and graph neural networks. Then categorizing recommender systems by their set…

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2103.08976 by other authors

  22. ExPECA: An Experimental Platform for Trustworthy Edge Computing Applications

    Authors: Samie Mostafavi, Vishnu Narayanan Moothedath, Stefan Rönngren, Neelabhro Roy, Gourav Prateek Sharma, Sangwon Seo, Manuel Olguín Muñoz, James Gross

    Abstract: This paper presents ExPECA, an edge computing and wireless communication research testbed designed to tackle two pressing challenges: comprehensive end-to-end experimentation and high levels of experimental reproducibility. Leveraging OpenStack-based Chameleon Infrastructure (CHI) framework for its proven flexibility and ease of operation, ExPECA is located in a unique, isolated underground facili…

    Submitted 2 November, 2023; originally announced November 2023.

  23. arXiv:2310.19089  [pdf, other]

    cs.CL

    Pushdown Layers: Encoding Recursive Structure in Transformer Language Models

    Authors: Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher D. Manning

    Abstract: Recursion is a prominent feature of human language, and fundamentally challenging for self-attention due to the lack of an explicit recursive-state tracking mechanism. Consequently, Transformer language models poorly capture long-tail recursive structure and exhibit sample-inefficient syntactic generalization. This work introduces Pushdown Layers, a new self-attention layer that models recursive s…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 (Long Papers)

  24. arXiv:2310.18868  [pdf, other]

    cs.DC cs.LG

    Correlation Aware Sparsified Mean Estimation Using Random Projection

    Authors: Shuli Jiang, Pranay Sharma, Gauri Joshi

    Abstract: We study the problem of communication-efficient distributed vector mean estimation, a commonly used subroutine in distributed optimization and Federated Learning (FL). Rand-$k$ sparsification is a commonly used technique to reduce communication cost, where each client sends $k < d$ of its coordinates to the server. However, Rand-$k$ is agnostic to any correlations that might exist between clients…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: 32 pages, 13 figures. Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, USA

  25. arXiv:2310.18784  [pdf, other]

    cs.LG math.OC math.ST stat.ML

    High-probability Convergence Bounds for Nonlinear Stochastic Gradient Descent Under Heavy-tailed Noise

    Authors: Aleksandar Armacki, Pranay Sharma, Gauri Joshi, Dragana Bajovic, Dusan Jakovetic, Soummya Kar

    Abstract: We study high-probability convergence guarantees of learning on streaming data in the presence of heavy-tailed noise. In the proposed scenario, the model is updated in an online fashion, as new information is observed, without storing any additional data. To combat the heavy-tailed noise, we consider a general framework of nonlinear stochastic gradient descent (SGD), providing several strong resul…

    Submitted 30 April, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 30 pages, 3 figures

  26. arXiv:2310.17164  [pdf, other]

    cs.CV

    Bridging Phylogeny and Taxonomy with Protein-protein Interaction Networks

    Authors: Long-Huei Chen, Mohana Prasad Sathya Moorthy, Pratyaksh Sharma

    Abstract: The protein-protein interaction (PPI) network provides an overview of the complex biological reactions vital to an organism's metabolism and survival. Even though PPI networks have been compared across organisms in detail in the past, there has not been large-scale research on how individual PPI networks reflect the species relationships. In this study we aim to increase our understanding of the tree…

    Submitted 26 October, 2023; originally announced October 2023.

  27. arXiv:2310.12135  [pdf, other]

    cs.CL

    Pseudointelligence: A Unifying Framework for Language Model Evaluation

    Authors: Shikhar Murty, Orr Paradise, Pratyusha Sharma

    Abstract: With large language models surpassing human performance on an increasing number of benchmarks, we must take a principled approach for targeted evaluation of model capabilities. Inspired by pseudorandomness, we propose pseudointelligence, which captures the maxim that "(perceived) intelligence lies in the eye of the beholder". That is, claims of intelligence are meaningful only when their eval…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  28. arXiv:2310.10756  [pdf]

    eess.IV cs.CV cs.LG

    Deep Conditional Shape Models for 3D cardiac image segmentation

    Authors: Athira J Jacob, Puneet Sharma, Daniel Ruckert

    Abstract: Delineation of anatomical structures is often the first step of many medical image analysis workflows. While convolutional neural networks achieve high performance, these do not incorporate anatomical shape information. We introduce a novel segmentation algorithm that uses Deep Conditional Shape models (DCSMs) as a core component. Using deep implicit shape representations, the algorithm learns a m…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted and presented as oral presentation at Statistical Atlases and Computational Modeling of the Heart (STACOM) workshop at MICCAI 2023

  29. arXiv:2309.14856  [pdf, other]

    cs.PF cs.NI

    PTPerf: On the performance evaluation of Tor Pluggable Transports

    Authors: Zeya Umayya, Dhruv Malik, Devashish Gosain, Piyush Kumar Sharma

    Abstract: Tor, one of the most popular censorship circumvention systems, faces regular blocking attempts by censors. Thus, to facilitate access, it relies on "pluggable transports" (PTs) that disguise Tor's traffic and make it hard for the adversary to block Tor. However, these are not yet well studied and compared for the performance they provide to the users. Thus, we conduct a first comparative performan…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 25 pages, 12 figures

  30. arXiv:2309.14393  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

    Authors: Ahmad Faiz, Sotaro Kaneda, Ruhan Wang, Rita Osi, Prateek Sharma, Fan Chen, Lei Jiang

    Abstract: The carbon footprint associated with large language models (LLMs) is a significant concern, encompassing emissions from their training, inference, experimentation, and storage processes, including operational and embodied carbon emissions. An essential aspect is accurately estimating the carbon impact of emerging LLMs even before their training, which heavily relies on GPU usage. Existing studies…

    Submitted 19 January, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: 15 pages, 8 figures

    Journal ref: published in ICLR 2024

  31. arXiv:2309.13457  [pdf, other]

    cs.LG cs.CV physics.comp-ph physics.flu-dyn

    Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data

    Authors: Wai Tong Chung, Bassem Akoush, Pushan Sharma, Alex Tamkin, Ki Sung Jung, Jacqueline H. Chen, Jack Guo, Davy Brouzet, Mohsen Talei, Bruno Savard, Alexei Y. Poludnenko, Matthias Ihme

    Abstract: Analysis of compressible turbulent flows is essential for applications related to propulsion, energy generation, and the environment. Here, we present BLASTNet 2.0, a 2.2 TB network-of-datasets containing 744 full-domain samples from 34 high-fidelity direct numerical simulations, which addresses the current limited availability of 3D high-fidelity reacting and non-reacting compressible turbulent f…

    Submitted 27 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted in Adv. in Neural Information Processing Systems 36 (NeurIPS 2023). Link: https://nips.cc/virtual/2023/poster/73433 . 55 pages, 21 figures. Keywords: Super-resolution, 3D, Neural Scaling, Physics-informed Loss, Computational Fluid Dynamics, Partial Differential Equations, Turbulent Reacting Flows, Direct Numerical Simulation, Fluid Mechanics, Combustion, Computer Vision

  32. arXiv:2308.10015  [pdf, other]

    cs.CV

    DyFFPAD: Dynamic Fusion of Convolutional and Handcrafted Features for Fingerprint Presentation Attack Detection

    Authors: Anuj Rai, Parsheel Kumar Tiwari, Jyotishna Baishya, Ram Prakash Sharma, Somnath Dey

    Abstract: Automatic fingerprint recognition systems suffer from the threat of presentation attacks due to their wide range of applications in areas including national borders and commercial applications. Presentation attacks can be performed by fabricating the fake fingerprint of a user with or without the intention of the subject. This paper presents a dynamic ensemble of deep learning and handcrafted feat…

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.09397

  33. arXiv:2307.10648  [pdf, other]

    cs.NI cs.LG

    Data-Driven Latency Probability Prediction for Wireless Networks: Focusing on Tail Probabilities

    Authors: Samie Mostafavi, Gourav Prateek Sharma, James Gross

    Abstract: With the emergence of new application areas, such as cyber-physical systems and human-in-the-loop applications, there is a need to guarantee a certain level of end-to-end network latency with extremely high reliability, e.g., 99.999%. While mechanisms specified under IEEE 802.1as time-sensitive networking (TSN) can be used to achieve these requirements for switched Ethernet networks, implementing…

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Submitted to IEEE Global Communications (GLOBECOM) 2023 conference

  34. arXiv:2307.07062  [pdf, other]

    eess.AS cs.LG cs.SD

    Controllable Emphasis with zero data for text-to-speech

    Authors: Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova

    Abstract: We present a scalable method to produce high quality emphasis for text-to-speech (TTS) that does not require recordings or annotations. Many TTS models include a phoneme duration model. A simple but effective method to achieve emphasized speech consists in increasing the predicted duration of the emphasized word. We show that this is significantly better than spectrogram modification techniques im…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: In Proceedings of the 12th Speech Synthesis Workshop (SSW) 2023

  35. arXiv:2307.01909  [pdf, other]

    cs.LG cs.AI

    ClimateLearn: Benchmarking Machine Learning for Weather and Climate Modeling

    Authors: Tung Nguyen, Jason Jewik, Hritik Bansal, Prakhar Sharma, Aditya Grover

    Abstract: Modeling weather and climate is an essential endeavor to understand the near- and long-term impacts of climate change, as well as inform technology and policymaking for adaptation and mitigation efforts. In recent years, there has been a surging interest in applying data-driven methods based on machine learning for solving core problems such as weather forecasting and climate downscaling. Despite…

    Submitted 4 July, 2023; originally announced July 2023.

  36. arXiv:2307.01666  [pdf]

    cs.HC cs.CV

    Sensors and Systems for Monitoring Mental Fatigue: A systematic review

    Authors: Prabin Sharma, Joanna C. Justus, Megha Thapa, Govinda R. Poudel

    Abstract: Mental fatigue is a leading cause of motor vehicle accidents, medical errors, loss of workplace productivity, and student disengagement in e-learning environments. Development of sensors and systems that can reliably track mental fatigue can prevent accidents, reduce errors, and help increase workplace productivity. This review provides a critical summary of theoretical models of mental fatigue, a…

    Submitted 10 September, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 19 Pages, 3 Figures

  37. arXiv:2307.01470  [pdf, other]

    cs.CV cs.HC cs.LG

    A Review of Driver Gaze Estimation and Application in Gaze Behavior Understanding

    Authors: Pavan Kumar Sharma, Pranamesh Chakraborty

    Abstract: Driver gaze plays an important role in different gaze-based applications such as driver attentiveness detection, visual distraction detection, gaze behavior understanding, and building driver assistance systems. The main objective of this study is to perform a comprehensive summary of driver gaze fundamentals, methods to estimate driver gaze, and its applications in real world driving scenarios. W…

    Submitted 21 February, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

  38. arXiv:2307.00112  [pdf]

    cs.CY cs.AI

    Performance of ChatGPT on USMLE: Unlocking the Potential of Large Language Models for AI-Assisted Medical Education

    Authors: Prabin Sharma, Kisan Thapa, Dikshya Thapa, Prastab Dhakal, Mala Deep Upadhaya, Santosh Adhikari, Salik Ram Khanal

    Abstract: Artificial intelligence is gaining traction in more ways than ever before. The popularity of language models and AI-based businesses has soared since ChatGPT was made available to the general public via OpenAI. It is becoming increasingly common for people to use ChatGPT both professionally and personally. Considering the widespread use of ChatGPT and the reliance people place on it, this study de…

    Submitted 27 July, 2023; v1 submitted 30 June, 2023; originally announced July 2023.

    Comments: 12 pages, 4 figures, 4 tables

  39. arXiv:2306.17636  [pdf, other]

    cs.CV cs.AI cs.LG

    Achieving RGB-D level Segmentation Performance from a Single ToF Camera

    Authors: Pranav Sharma, Jigyasa Singh Katrolia, Jason Rambach, Bruno Mirbach, Didier Stricker, Juergen Seiler

    Abstract: Depth is a very important modality in computer vision, typically used as complementary information to RGB, provided by RGB-D cameras. In this work, we show that it is possible to obtain the same level of accuracy as RGB-D cameras on a semantic segmentation task using infrared (IR) and depth images from a single Time-of-Flight (ToF) camera. In order to fuse the IR and depth modalities of the ToF ca…

    Submitted 30 June, 2023; originally announced June 2023.

  40. arXiv:2306.14300  [pdf]

    cs.CV cs.AI

    Screening Autism Spectrum Disorder in childrens using Deep Learning Approach : Evaluating the classification model of YOLOv8 by comparing with other models

    Authors: Subash Gautam, Prabin Sharma, Kisan Thapa, Mala Deep Upadhaya, Dikshya Thapa, Salik Ram Khanal, Vítor Manuel de Jesus Filipe

    Abstract: Autism spectrum disorder (ASD) is a developmental condition that presents significant challenges in social interaction, communication, and behavior. Early intervention plays a pivotal role in enhancing cognitive abilities and reducing autistic symptoms in children with ASD. Numerous clinical studies have highlighted distinctive facial characteristics that distinguish ASD children from typically de…

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: 17 pages, 12 figures

  41. arXiv:2306.01648  [pdf, other]

    cs.LG cs.DC

    Federated Multi-Sequence Stochastic Approximation with Local Hypergradient Estimation

    Authors: Davoud Ataee Tarzanagh, Mingchen Li, Pranay Sharma, Samet Oymak

    Abstract: Stochastic approximation with multiple coupled sequences (MSA) has found broad applications in machine learning as it encompasses a rich class of problems including bilevel optimization (BLO), multi-level compositional optimization (MCO), and reinforcement learning (specifically, actor-critic methods). However, designing provably-efficient federated algorithms for MSA has been an elusive question…

    Submitted 2 June, 2023; originally announced June 2023.

  42. arXiv:2305.18741  [pdf, other]

    cs.CL

    Grokking of Hierarchical Structure in Vanilla Transformers

    Authors: Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher D. Manning

    Abstract: For humans, language production and comprehension is sensitive to the hierarchical structure of sentences. In natural language processing, past work has questioned how effectively neural sequence models like transformers capture this hierarchical structure when generalizing to structurally novel inputs. We show that transformer language models can learn to generalize hierarchically after training…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  43. arXiv:2305.13291  [pdf, other]

    cs.CV cs.GR cs.LG

    Materialistic: Selecting Similar Materials in Images

    Authors: Prafull Sharma, Julien Philip, Michaël Gharbi, William T. Freeman, Fredo Durand, Valentin Deschaintre

    Abstract: Separating an image into meaningful underlying components is a crucial first step for both editing and understanding images. We present a method capable of selecting the regions of a photograph exhibiting the same material as an artist-chosen area. Our proposed approach is robust to shading, specular highlights, and cast shadows, enabling selection in real images. As we do not rely on semantic seg…

    Submitted 22 May, 2023; originally announced May 2023.

  44. Taming Resource Heterogeneity In Distributed ML Training With Dynamic Batching

    Authors: Sahil Tyagi, Prateek Sharma

    Abstract: Current techniques and systems for distributed model training mostly assume that clusters are comprised of homogeneous servers with a constant resource availability. However, cluster heterogeneity is pervasive in computing infrastructure, and is a fundamental characteristic of low-cost transient resources (such as EC2 spot instances). In this paper, we develop a dynamic batching technique for dist…

    Submitted 20 May, 2023; originally announced May 2023.

    Journal ref: https://2020.acsos.org/

  45. arXiv:2305.04889  [pdf]

    cs.IR cs.LG

    Improving Real-Time Bidding in Online Advertising Using Markov Decision Processes and Machine Learning Techniques

    Authors: Parikshit Sharma

    Abstract: Real-time bidding has emerged as an effective online advertising technique. With real-time bidding, advertisers can position ads per impression, enabling them to optimise ad campaigns by targeting specific audiences in real-time. This paper proposes a novel method for real-time bidding that combines deep learning and reinforcement learning techniques to enhance the efficiency and precision of the…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 12 pages

  46. arXiv:2304.04934  [pdf, other]

    cs.LG

    Model Sparsity Can Simplify Machine Unlearning

    Authors: Jinghan Jia, Jiancheng Liu, Parikshit Ram, Yuguang Yao, Gaowen Liu, Yang Liu, Pranay Sharma, Sijia Liu

    Abstract: In response to recent data regulation requirements, machine unlearning (MU) has emerged as a critical process to remove the influence of specific examples from a given model. Although exact unlearning can be achieved through complete model retraining using the remaining dataset, the associated computational costs have driven the development of efficient, approximate unlearning techniques. Moving b…

    Submitted 27 January, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: NeurIPS'23 spotlight

  47. arXiv:2304.01299  [pdf]

    cs.NI

    Towards Deterministic Communications in 6G Networks: State of the Art, Open Challenges and the Way Forward

    Authors: Gourav Prateek Sharma, Dhruvin Patel, Joachim Sachs, Marilet De Andrade, Janos Farkas, Janos Harmatos, Balazs Varga, Hans-Peter Bernhard, Raheeb Muzaffar, Mahin K. Atiq, Frank Duerr, Dietmar Bruckner, Edgardo Montesdeoca, Drissa Houatra, Hongwei Zhang, James Gross

    Abstract: Over the last decade, society and industries are undergoing rapid digitization that is expected to lead to the evolution of the cyber-physical continuum. End-to-end deterministic communications infrastructure is the essential glue that will bridge the digital and physical worlds of the continuum. We describe the state of the art and open challenges with respect to contemporary deterministic commun…

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 22 pages, 8 figures

  48. arXiv:2304.01052  [pdf, other]

    cs.RO

    Investigation of risk-aware MDP and POMDP contingency management autonomy for UAS

    Authors: Prashin Sharma, Benjamin Kraske, Joseph Kim, Zakariya Laouar, Zachary Sunberg, Ella Atkins

    Abstract: Unmanned aircraft systems (UAS) are being increasingly adopted for various applications. The risk UAS poses to people and property must be kept to acceptable levels. This paper proposes risk-aware contingency management autonomy to prevent an accident in the event of component malfunction, specifically propulsion unit failure and/or battery degradation. The proposed autonomy is modeled as a Markov…

    Submitted 3 April, 2023; originally announced April 2023.

  49. Anatomically aware dual-hop learning for pulmonary embolism detection in CT pulmonary angiograms

    Authors: Florin Condrea, Saikiran Rapaka, Lucian Itu, Puneet Sharma, Jonathan Sperl, A Mohamed Ali, Marius Leordeanu

    Abstract: Pulmonary Embolisms (PE) represent a leading cause of cardiovascular death. While medical imaging, through computed tomographic pulmonary angiography (CTPA), represents the gold standard for PE diagnosis, it is still susceptible to misdiagnosis or significant diagnosis delays, which may be fatal for critical cases. Despite the recently demonstrated power of deep learning to bring a significant boo…

    Submitted 17 May, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted to Computers in Biology and Medicine journal

  50. arXiv:2303.12974  [pdf]

    cs.CV

    Performance Analysis and Evaluation of Cloud Vision Emotion APIs

    Authors: Salik Ram Khanal, Prabin Sharma, Hugo Fernandes, João Barroso, Vítor Manuel de Jesus Filipe

    Abstract: Facial expression is a way of communication that can be used to interact with computers or other electronic devices and the recognition of emotion from faces is an emerging practice with application in many fields. There are many cloud-based vision application programming interfaces available that recognize emotion from facial images and video. In this article, the performances of two well-known A…

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 10 pages, 6 figures