Showing 1–50 of 3,387 results for author: Krishna

  1. arXiv:2407.06077  [pdf, other]

    cs.RO cs.AI cs.CV

    Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots

    Authors: Siva Krishna Ravipati, Ehsan Latif, Ramviyas Parasuraman, Suchendra M. Bhandarkar

    Abstract: Classification of different object surface material types can play a significant role in the decision-making algorithms for mobile robots and autonomous vehicles. RGB-based scene-level semantic segmentation has been well-addressed in the literature. However, improving material recognition using the depth modality and its integration with SLAM algorithms for 3D semantic mapping could unlock new pot…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted to IROS 2024

  2. arXiv:2407.05931  [pdf, other]

    cond-mat.stat-mech cond-mat.mtrl-sci cond-mat.soft physics.chem-ph physics.comp-ph

    Competing nucleation pathways in nanocrystal formation

    Authors: Carlos R. Salazar, Akshay Krishna Ammothum Kandy, Jean Furstoss, Quentin Gromoff, Jacek Goniakowski, Julien Lam

    Abstract: Despite numerous efforts from numerical approaches to complement experimental measurements, several fundamental challenges have still hindered one's ability to truly provide an atomistic picture of the nucleation process in nanocrystals. Among them, our study resolves three obstacles: (1) Machine-learning force fields including long-range interactions able to capture the finesse of the underlying…

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2407.05528  [pdf, other]

    cs.CV

    An accurate detection is not all you need to combat label noise in web-noisy datasets

    Authors: Paul Albert, Jack Valmadre, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Training a classifier on web-crawled data demands learning algorithms that are robust to annotation errors and irrelevant examples. This paper builds upon the recent empirical observation that applying unsupervised contrastive learning to noisy, web-crawled datasets yields a feature representation under which the in-distribution (ID) and out-of-distribution (OOD) samples are linearly separable. We…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted in the European Conference on Computer Vision (ECCV) 2024

  4. arXiv:2407.05266  [pdf, other]

    cs.CV cs.AI eess.IV

    CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

    Authors: Akshat Ramachandran, Souvik Kundu, Tushar Krishna

    Abstract: We present CLAMP-ViT, a data-free post-training quantization method for vision transformers (ViTs). We identify the limitations of recent techniques, notably their inability to leverage meaningful inter-patch relationships, leading to the generation of simplistic and semantically vague data, impacting quantization accuracy. CLAMP-ViT employs a two-stage approach, cyclically adapting between data g…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  5. arXiv:2407.05185  [pdf]

    math.NA physics.geo-ph

    Sequential hybrid finite element and material point method to simulate slope failures

    Authors: Brent Sordo, Ellen Rathje, Krishna Kumar

    Abstract: Numerical modeling of slope failures seeks to predict two key phenomena: the initiation of failure and the post-failure runout. Currently, most modeling methods for slope failure analysis excel at one of these two but are deficient in the other. For example, the Finite Element Method (FEM) models the initiation of instability well but quickly loses accuracy when modeling large deformations because…

    Submitted 6 July, 2024; originally announced July 2024.

    Journal ref: Computers and Geotechnics (2024)

  6. arXiv:2407.04953  [pdf, other]

    eess.IV cs.CV

    Effective-LDAM: An Effective Loss Function To Mitigate Data Imbalance for Robust Chest X-Ray Disease Classification

    Authors: Sree Rama Vamsidhar S, Bhargava Satya, Rama Krishna Gorthi

    Abstract: Deep Learning (DL) approaches have gained prominence in medical imaging for disease diagnosis. Among these methodologies, Chest X-ray (CXR) classification has proven to be an effective approach for detecting and analyzing various diseases. However, the reliable performance of DL classification algor…

    Submitted 6 July, 2024; originally announced July 2024.

  7. arXiv:2407.04865  [pdf, other]

    physics.bio-ph

    A differentiable Gillespie algorithm for simulating chemical kinetics, parameter estimation, and designing synthetic biological circuits

    Authors: Krishna Rijal, Pankaj Mehta

    Abstract: The Gillespie algorithm is commonly used to simulate and analyze complex chemical reaction networks. Here, we leverage recent breakthroughs in deep learning to develop a fully differentiable variant of the Gillespie algorithm. The differentiable Gillespie algorithm (DGA) approximates discontinuous operations in the exact Gillespie algorithm using smooth functions, allowing for the calculation of g…

    Submitted 5 July, 2024; originally announced July 2024.

  8. arXiv:2407.04815  [pdf, other]

    cs.CV

    NSD-DIL: Null-Shot Deblurring Using Deep Identity Learning

    Authors: Sree Rama Vamsidhar S, Rama Krishna Gorthi

    Abstract: In this paper, we propose to reformulate the blind image deblurring task to directly learn an inverse of the degradation model using a deep linear network. We introduce Deep Identity Learning (DIL), a novel learning strategy that includes a dedicated regularization term based on the properties of linear systems, to exploit the identity relation between the degradation and inverse degradation model…

    Submitted 5 July, 2024; originally announced July 2024.

  9. arXiv:2407.04325  [pdf, other]

    cs.LG

    Understanding the Role of Invariance in Transfer Learning

    Authors: Till Speicher, Vedant Nanda, Krishna P. Gummadi

    Abstract: Transfer learning is a powerful technique for knowledge-sharing between different tasks. Recent work has found that the representations of models with certain invariances, such as to adversarial input perturbations, achieve higher performance on downstream tasks. These findings suggest that invariance may be an important property in the context of transfer learning. However, the relationship of in…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Published at TMLR 2024

  10. arXiv:2407.03462  [pdf, other]

    astro-ph.IM astro-ph.CO

    An Update on the External Calibrator for Hydrogen Observatories (ECHO)

    Authors: Yifan Zhao, Daniel C. Jacobs, Titu Samson, Mrudula Gopal Krishna, Michael Horn, Marc-Olivier R. Lalonde, Raven Braithwaite, Logan Skabelund

    Abstract: Precision measurements of the beam pattern response are needed to predict the response of a radio telescope. Mapping the beam of a low frequency radio array presents a unique challenge and science cases such as the observation of the 21 cm line at high redshift have demanding requirements. Drone-based systems offer the unique potential for a measurement which is entirely under experimenter contro…

    Submitted 3 July, 2024; originally announced July 2024.

  11. arXiv:2407.03267  [pdf]

    cond-mat.mtrl-sci

    Insulator-to-Metal Transition and Isotropic Gigantic Magnetoresistance in Layered Magnetic Semiconductors

    Authors: Gokul Acharya, Bimal Neupane, Chia-Hsiu Hsu, Xian P. Yang, David Graf, Eun Sang Choi, Krishna Pandey, Md Rafique Un Nabi, Santosh Karki Chhetri, Rabindra Basnet, Sumaya Rahman, Jian Wang, Zhengxin Hu, Bo Da, Hugh Churchill, Guoqing Chang, M. Zahid Hasan, Yuanxi Wang, ** Hu

    Abstract: Magnetotransport, the response of electrical conduction to external magnetic field, acts as an important tool to reveal fundamental concepts behind exotic phenomena and plays a key role in enabling spintronic applications. Magnetotransport is generally sensitive to magnetic field orientations. In contrast, efficient and isotropic modulation of electronic transport, which is useful in technology ap…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 44 pages, 18 figures

  12. arXiv:2407.03093  [pdf, other]

    cs.SE cs.AI cs.CR cs.LG

    Revisiting the Performance of Deep Learning-Based Vulnerability Detection on Realistic Datasets

    Authors: Partha Chakraborty, Krishna Kanth Arumugam, Mahmoud Alfadel, Meiyappan Nagappan, Shane McIntosh

    Abstract: The impact of software vulnerabilities on everyday software systems is significant. Despite deep learning models being proposed for vulnerability detection, their reliability is questionable. Prior evaluations show high recall/F1 scores of up to 99%, but these models underperform in practical scenarios, particularly when assessed on entire codebases rather than just the fixing commit. This paper i…

    Submitted 3 July, 2024; originally announced July 2024.

    ACM Class: D.2; I.2

    Journal ref: 10.1109/TSE.2024.3423712

  13. arXiv:2407.02960  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    ObfuscaTune: Obfuscated Offsite Fine-tuning and Inference of Proprietary LLMs on Private Datasets

    Authors: Ahmed Frikha, Nassim Walha, Ricardo Mendes, Krishna Kanth Nakka, Xue Jiang, Xuebing Zhou

    Abstract: This work addresses the timely yet underexplored problem of performing inference and finetuning of a proprietary LLM owned by a model provider entity on the confidential/private data of another data owner entity, in a way that ensures the confidentiality of both the model and the data. Hereby, the finetuning is conducted offsite, i.e., on the computation infrastructure of a third-party cloud provi…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Preprint

  14. arXiv:2407.02956  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    IncogniText: Privacy-enhancing Conditional Text Anonymization via LLM-based Private Attribute Randomization

    Authors: Ahmed Frikha, Nassim Walha, Krishna Kanth Nakka, Ricardo Mendes, Xue Jiang, Xuebing Zhou

    Abstract: In this work, we address the problem of text anonymization where the goal is to prevent adversaries from correctly inferring private attributes of the author, while keeping the text utility, i.e., meaning and semantics. We propose IncogniText, a technique that anonymizes the text to mislead a potential adversary into predicting a wrong private attribute value. Our empirical evaluation shows a redu…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Preprint

  15. arXiv:2407.02943  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    PII-Compass: Guiding LLM training data extraction prompts towards the target PII via grounding

    Authors: Krishna Kanth Nakka, Ahmed Frikha, Ricardo Mendes, Xue Jiang, Xuebing Zhou

    Abstract: The latest and most impactful advances in large models stem from their increased size. Unfortunately, this translates into an improved memorization capacity, raising data privacy concerns. Specifically, it has been shown that models can output personally identifiable information (PII) contained in their training data. However, reported PII extraction performance varies widely, and there is no conse…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024

  16. arXiv:2407.02543  [pdf, other]

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Towards the Next Frontier in Speech Representation Learning Using Disentanglement

    Authors: Varun Krishna, Sriram Ganapathy

    Abstract: The popular frameworks for self-supervised learning of speech representations have largely focused on frame-level masked prediction of speech regions. While this has shown promising downstream task performance for speech recognition and related tasks, this has largely ignored factors of speech that are encoded at a coarser level, like characteristics of the speaker or channel that remain consistent…

    Submitted 2 July, 2024; originally announced July 2024.

  17. arXiv:2407.02013  [pdf, other]

    cs.LG

    DiGRAF: Diffeomorphic Graph-Adaptive Activation Function

    Authors: Krishna Sri Ipsit Mantri, Xinzhi Wang, Carola-Bibiane Schönlieb, Bruno Ribeiro, Beatrice Bevilacqua, Moshe Eliasof

    Abstract: In this paper, we propose a novel activation function tailored specifically for graph data in Graph Neural Networks (GNNs). Motivated by the need for graph-adaptive and flexible activation functions, we introduce DiGRAF, leveraging Continuous Piecewise-Affine Based (CPAB) transformations, which we augment with an additional GNN to learn a graph-adaptive diffeomorphic activation function in an end-…

    Submitted 2 July, 2024; originally announced July 2024.

  18. arXiv:2407.01732  [pdf, other]

    cs.CY cs.HC cs.IR

    Investigating Nudges toward Related Sellers on E-commerce Marketplaces: A Case Study on Amazon

    Authors: Abhisek Dash, Abhijnan Chakraborty, Saptarshi Ghosh, Animesh Mukherjee, Krishna P. Gummadi

    Abstract: E-commerce marketplaces provide business opportunities to millions of sellers worldwide. Some of these sellers have special relationships with the marketplace by virtue of using their subsidiary services (e.g., fulfillment and/or shipping services provided by the marketplace) -- we refer to such sellers collectively as Related Sellers. When multiple sellers offer to sell the same product, the mark…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: This work has been accepted for presentation at the ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW) 2024. It will appear in Proceedings of the ACM on Human-Computer Interaction

  19. arXiv:2407.01549  [pdf, other]

    eess.SP

    FFT and Linear Convolution Implementation with Bit Slicing Multiplier: A Novel Approach

    Authors: Aravind Kumar N, Hari Krishna S, Anita Angeline A

    Abstract: This paper presents a comprehensive exploration of Fast Fourier Transform (FFT) and linear convolution implementations, integrating both conventional methods and novel approaches leveraging the Bit Slicing Multiplier (BSM) technique. The Bit Slicing Multiplier utilizes Look-Up Tables (LUTs) to execute bitwise operations in parallel, offering efficient arithmetic operations ideally suited for digit…

    Submitted 25 April, 2024; originally announced July 2024.

  20. arXiv:2407.00385  [pdf, other]

    eess.SY

    Sparse Actuator Scheduling for Discrete-Time Linear Dynamical Systems

    Authors: Krishna Praveen V. S. Kondapi, Chandrasekhar Sriram, Geethu Joseph, Chandra R. Murthy

    Abstract: We consider the control of discrete-time linear dynamical systems using sparse inputs where we limit the number of active actuators at every time step. We develop an algorithm for determining a sparse actuator schedule that ensures the existence of a sparse control input sequence, following the schedule, that takes the system from any given initial state to any desired final state. Since such an a…

    Submitted 29 June, 2024; originally announced July 2024.

  21. arXiv:2407.00167  [pdf, other]

    cs.CL cs.AI cs.ET cs.HC cs.SI

    Can GPT-4 Help Detect Quit Vaping Intentions? An Exploration of Automatic Data Annotation Approach

    Authors: Sai Krishna Revanth Vuruma, Dezhi Wu, Saborny Sen Gupta, Lucas Aust, Valerie Lookingbill, Wyatt Bellamy, Yang Ren, Erin Kasson, Li-Shiun Chen, Patricia Cavazos-Rehg, Dian Hu, Ming Huang

    Abstract: In recent years, the United States has witnessed a significant surge in the popularity of vaping or e-cigarette use, leading to a notable rise in cases of e-cigarette and vaping use-associated lung injury (EVALI) that caused hospitalizations and fatalities during the EVALI outbreak in 2019, highlighting the urgency to comprehend vaping behaviors and develop effective strategies for cessation. Due…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Accepted for the AI Applications in Public Health and Social Services workshop at the 22nd International Conference on Artificial Intelligence in Medicine (AIME 2024)

  22. arXiv:2406.19954  [pdf, other]

    cs.CL cs.HC cs.SD eess.AS

    BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5

    Authors: Zhehuai Chen, He Huang, Oleksii Hrinchuk, Krishna C. Puvvada, Nithin Rao Koluguri, Piotr Żelasko, Jagadeesh Balam, Boris Ginsburg

    Abstract: Incorporating speech understanding capabilities into pretrained large-language models has become a vital research direction (SpeechLLM). The previous architectures can be categorized as: i) GPT-style, prepend speech prompts to the text prompts as a sequence of LLM inputs like a decoder-only model; ii) T5-style, introduce speech cross-attention to each layer of the pretrained LLMs. We propose BESTO…

    Submitted 28 June, 2024; originally announced June 2024.

    MSC Class: 68T10; ACM Class: I.2.7

  23. arXiv:2406.19738  [pdf, other]

    quant-ph cs.AI cs.LG

    Classical Bandit Algorithms for Entanglement Detection in Parameterized Qubit States

    Authors: Bharati. K, Vikesh Siddhu, Krishna Jagannathan

    Abstract: Entanglement is a key resource for a wide range of tasks in quantum information and computing. Thus, verifying availability of this quantum resource is essential. Extensive research on entanglement detection has led to no-go theorems (Lu et al. [Phys. Rev. Lett., 116, 230501 (2016)]) that highlight the need for full state tomography (FST) in the absence of adaptive or joint measurements. Recent ad…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 20 pages, 5 figures

  24. arXiv:2406.19674  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

    Authors: Krishna C. Puvvada, Piotr Żelasko, He Huang, Oleksii Hrinchuk, Nithin Rao Koluguri, Kunal Dhawan, Somshubra Majumdar, Elena Rastorgueva, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg

    Abstract: Recent advances in speech recognition and translation rely on hundreds of thousands of hours of Internet speech data. We argue that state-of-the-art accuracy can be reached without relying on web-scale data. Canary, a multilingual ASR and speech translation model, outperforms current state-of-the-art models (Whisper, OWSM, and Seamless-M4T) on English, French, Spanish, and German languages, while b…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech-2024

  25. arXiv:2406.19580  [pdf, other]

    cs.AR cs.LG

    FRED: Flexible REduction-Distribution Interconnect and Communication Implementation for Wafer-Scale Distributed Training of DNN Models

    Authors: Saeed Rashidi, William Won, Sudarshan Srinivasan, Puneet Gupta, Tushar Krishna

    Abstract: Distributed Deep Neural Network (DNN) training is a technique to reduce the training overhead by distributing the training tasks into multiple accelerators, according to a parallelization strategy. However, high-performance compute and interconnects are needed for maximum speed-up and linear scaling of the system. Wafer-scale systems are a promising technology that allows for tightly integrating h…

    Submitted 27 June, 2024; originally announced June 2024.

  26. arXiv:2406.18915  [pdf, other]

    cs.RO cs.CV

    Manipulate-Anything: Automating Real-World Robots using Vision-Language Models

    Authors: Jiafei Duan, Wentao Yuan, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna

    Abstract: Large-scale endeavors like RT-1 and widespread community efforts such as Open-X-Embodiment have contributed to growing the scale of robot demonstration data. However, there is still an opportunity to improve the quality, quantity, and diversity of robot demonstration data. Although vision-language models have been shown to automatically generate demonstration data, their utility has been limited t…

    Submitted 27 June, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Project page: https://robot-ma.github.io/

  27. arXiv:2406.18095  [pdf, other]

    astro-ph.CO

    Observational Evidence to Logistic Dark Energy Driving the Accelerating Universe

    Authors: Sarath Nelleri, Gopi Krishna, Navaneeth Poonthottathil

    Abstract: We present the logistic dark energy model (LDEM), where the dark energy density follows a logistic function for the scale factor. The equation of state parameter of dark energy ($w_D$) transitioned from $-1$ in the distant past to its current value of $-0.76$, closely resembling the $Λ$CDM model in the early epoch and showing significant deviation in the late phase. The evolution of the deceleration p…

    Submitted 26 June, 2024; originally announced June 2024.

  28. arXiv:2406.17968  [pdf, other]

    cs.IR cs.AI cs.LG stat.ML

    Efficient Document Ranking with Learnable Late Interactions

    Authors: Ziwei Ji, Himanshu Jain, Andreas Veit, Sashank J. Reddi, Sadeep Jayasumana, Ankit Singh Rawat, Aditya Krishna Menon, Felix Yu, Sanjiv Kumar

    Abstract: Cross-Encoder (CE) and Dual-Encoder (DE) models are two fundamental approaches for query-document relevance in information retrieval. To predict relevance, CE models use joint query-document embeddings, while DE models maintain factorized query and document embeddings; usually, the former has higher quality while the latter benefits from lower latency. Recently, late-interaction models have been p…

    Submitted 25 June, 2024; originally announced June 2024.

  29. arXiv:2406.17774  [pdf, other]

    cs.CV cs.GR

    Fast and Uncertainty-Aware SVBRDF Recovery from Multi-View Capture using Frequency Domain Analysis

    Authors: Ruben Wiersma, Julien Philip, Miloš Hašan, Krishna Mullia, Fujun Luan, Elmar Eisemann, Valentin Deschaintre

    Abstract: Relightable object acquisition is a key challenge in simplifying digital asset creation. Complete reconstruction of an object typically requires capturing hundreds to thousands of photographs under controlled illumination, with specialized equipment. The recent progress in differentiable rendering improved the quality and accessibility of inverse rendering optimization. Nevertheless, under uncontr…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Project page: https://brdf-uncertainty.github.io

  30. arXiv:2406.17562  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Low Excess Noise, High Quantum Efficiency Avalanche Photodiodes for Beyond 2 μm Wavelength Detection

    Authors: Hyemin Jung, Seunghyun Lee, Xiao **, Yifan Liu, Theodore J. Ronningen, Christoph H. Grein, John P. R. David, Sanjay Krishna

    Abstract: The increasing concentration of greenhouse gases, notably CH4 and CO2, has fueled global temperature increases, intensifying concerns regarding the prevailing climate crisis. Effectively monitoring these gases demands a detector spanning the extended short-wavelength infrared (~2.4 μm) range, covering wavelengths of CH4 (1.65 μm) and CO2 (2.05 μm). The state-of-the-art HgCdTe avalanche photodetect…

    Submitted 25 June, 2024; originally announced June 2024.

  31. arXiv:2406.17377  [pdf, other]

    cs.CL

    A Three-Pronged Approach to Cross-Lingual Adaptation with Multilingual LLMs

    Authors: Vaibhav Singh, Amrith Krishna, Karthika NJ, Ganesh Ramakrishnan

    Abstract: Low-resource languages, by their very definition, tend to be underrepresented in the pre-training corpora of Large Language Models. In this work, we investigate three low-resource cross-lingual approaches that enable an LLM to adapt to tasks in previously unseen languages. Llama-2 is an LLM where Indic languages, among many other language families, contribute to less than $0.005\%$ of the total $2$ tr…

    Submitted 25 June, 2024; originally announced June 2024.

  32. arXiv:2406.16820  [pdf]

    stat.ME

    EFECT -- A Method and Metric to Assess the Reproducibility of Stochastic Simulation Studies

    Authors: T. J. Sego, Matthias König, Luis L. Fonseca, Baylor Fain, Adam C. Knapp, Krishna Tiwari, Henning Hermjakob, Herbert M. Sauro, James A. Glazier, Reinhard C. Laubenbacher, Rahuman S. Malik-Sheriff

    Abstract: Reproducibility is a foundational standard for validating scientific claims in computational research. Stochastic computational models are employed across diverse fields such as systems biology, financial modelling and environmental sciences. Existing infrastructure and software tools support various aspects of reproducible model development, application, and dissemination, but do not adequately a…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 25 pages, 4 figures

  33. arXiv:2406.16532  [pdf]

    cond-mat.mes-hall

    Terahertz photocurrent probe of quantum geometry and interactions in magic-angle twisted bilayer graphene

    Authors: Roshan Krishna Kumar, Geng Li, Riccardo Bertini, Swati Chaudhary, Krystian Nowakowski, Jeong Min Park, Sebastian Castilla, Zhen Zhan, Pierre A. Pantaleón, Hitesh Agarwal, Sergi Battle-Porro, Eike Icking, Matteo Ceccanti, Antoine Reserbat-Plantey, Giulia Piccinini, Julien Barrier, Ekaterina Khestanova, Takashi Taniguchi, Kenji Watanabe, Christoph Stampfer, Gil Refael, Francisco Guinea, Pablo Jarillo-Herrero, Justin C. W. Song, Petr Stepanov, et al. (2 additional authors not shown)

    Abstract: Moiré materials represent strongly interacting electron systems bridging topological and correlated physics. Despite significant advances, decoding wavefunction properties underlying the quantum geometry remains challenging. Here, we utilize polarization-resolved photocurrent measurements to probe magic-angle twisted bilayer graphene, leveraging its sensitivity to the Berry connection that encompa…

    Submitted 24 June, 2024; originally announced June 2024.

  34. arXiv:2406.16008  [pdf, other]

    cs.CL cs.AI cs.LG

    Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization

    Authors: Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister

    Abstract: Large language models (LLMs), even when specifically trained to process long input contexts, struggle to capture relevant information located in the middle of their input. This phenomenon has been known as the lost-in-the-middle problem. In this work, we make three contributions. First, we set out to understand the factors that cause this phenomenon. In doing so, we establish a connection between…

    Submitted 3 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: ACL Findings 2024

  35. arXiv:2406.14517  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR

    PostMark: A Robust Blackbox Watermark for Large Language Models

    Authors: Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Wieting, Mohit Iyyer

    Abstract: The most effective techniques to detect LLM-generated text rely on inserting a detectable signature -- or watermark -- during the model's decoding process. Most existing watermarking methods require access to the underlying LLM's logits, which LLM API providers are loath to share due to fears of model distillation. As such, these watermarks must be implemented independently by each LLM provider. I…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: preprint; 18 pages, 5 figures

  36. arXiv:2406.14486  [pdf, other]

    eess.IV

    Rule-based outlier detection of AI-generated anatomy segmentations

    Authors: Deepa Krishnaswamy, Vamsi Krishna Thiriveedhi, Cosmin Ciausu, David Clunie, Steve Pieper, Ron Kikinis, Andrey Fedorov

    Abstract: There is a dire need for medical imaging datasets with accompanying annotations to perform downstream patient analysis. However, it is difficult to generate these annotations manually, because the work is time-consuming and clinical conventions vary. Artificial intelligence has been adopted in the field as a potential method to annotate these large datasets; however, a lack of expert…

    Submitted 20 June, 2024; originally announced June 2024.

  37. arXiv:2406.14458  [pdf, other]

    cs.LG cs.AI cs.IT eess.SP

    Centimeter Positioning Accuracy using AI/ML for 6G Applications

    Authors: Sai Prasanth Kotturi, Radha Krishna Ganti

    Abstract: This research looks at using AI/ML to achieve centimeter-level user positioning in 6G applications such as the Industrial Internet of Things (IIoT). Initial results show that our AI/ML-based method can estimate user positions with an accuracy of 17 cm in an indoor factory environment. In this proposal, we highlight our approaches and future directions.

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 2 Pages, 2 Figures, ICMLCN Conference, Stockholm, Sweden

  38. arXiv:2406.14433  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Structural and Electrical Properties of Grafted Si/GaAsSb Heterojunction

    Authors: Haris Naeem Abbasi, Seunghyun Lee, Hyemin Jung, Nathan Gajowski, Yi Lu, Linus Wang, Donghyeok Kim, Jie Zhou, Jiarui Gong, Chris Chae, **woo Hwang, Manisha Muduli, Subramanya Nookala, Zhenqiang Ma, Sanjay Krishna

    Abstract: The short-wave infrared (SWIR) wavelength, especially 1.55 μm, has attracted significant attention in various areas such as high-speed optical communication and LiDAR systems. Avalanche photodiodes (APDs) are a critical component as a receiver in these systems due to their internal gain which enhances the system performance. Silicon-based APDs are promising since they are CMOS compatible, but they…

    Submitted 24 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 14 pages, 6 figures

  39. arXiv:2406.14072  [pdf, other]

    astro-ph.EP

    IGRINS observations of WASP-127 b: H$_2$O, CO, and super-Solar atmospheric metallicity in the inflated sub-Saturn

    Authors: Krishna Kanumalla, Michael R. Line, Megan Weiner Mansfield, Luis Welbanks, Peter C. B. Smith, Jacob L. Bean, Lorenzo Pino, Matteo Brogi, Vatsal Panwar

    Abstract: High resolution spectroscopy of exoplanet atmospheres provides insights into their composition and dynamics from the resolved line shape and depth of thousands of spectral lines. WASP-127 b is an extremely inflated sub-Saturn (R$_\mathrm{p}$= 1.311 R$_\mathrm{Jup}$, M$_\mathrm{p}$= 0.16 M$_\mathrm{Jup}$) with previously reported detections of H$_2$O, CO$_2$, and Na. However, the seeming absence of…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 18 pages, 15 figures, submitted to AJ, poster at Exo5 conference area-A

  40. arXiv:2406.13868  [pdf, other]

    cs.LG cs.AI

    SDQ: Sparse Decomposed Quantization for LLM Inference

    Authors: Geonhwa Jeong, Po-An Tsai, Stephen W. Keckler, Tushar Krishna

    Abstract: Recently, large language models (LLMs) have shown surprising performance in task-specific workloads as well as general tasks with the given prompts. However, to achieve unprecedented performance, recent LLMs use billions to trillions of parameters, which hinder the wide adaptation of those models due to their extremely large compute and memory requirements. To resolve the issue, various model comp…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Preprint

  41. arXiv:2406.13129  [pdf, other]

    cs.CV cs.LG

    M3T: Multi-Modal Medical Transformer to bridge Clinical Context with Visual Insights for Retinal Image Medical Description Generation

    Authors: Nagur Shareef Shaik, Teja Krishna Cherukuri, Dong Hye Ye

    Abstract: Automated retinal image medical description generation is crucial for streamlining medical diagnosis and treatment planning. Existing challenges include the reliance on learned retinal image representations, difficulties in handling multiple imaging modalities, and the lack of clinical context in visual representations. Addressing these issues, we propose the Multi-Modal Medical Transformer (M3T),…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted for presentation at the IEEE International Conference on Image Processing (ICIP 2024)

  42. arXiv:2406.13126  [pdf, other]

    cs.CV cs.LG

    Guided Context Gating: Learning to leverage salient lesions in retinal fundus images

    Authors: Teja Krishna Cherukuri, Nagur Shareef Shaik, Dong Hye Ye

    Abstract: Effectively representing medical images, especially retinal images, presents a considerable challenge due to variations in appearance, size, and contextual information of pathological signs called lesions. Precise discrimination of these lesions is crucial for diagnosing vision-threatening issues such as diabetic retinopathy. While visual attention-based neural networks have been introduced to lea…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted for presentation at the IEEE International Conference on Image Processing (ICIP 2024)

  43. arXiv:2406.12997  [pdf, other]

    cs.CL

    Suitability of CCA for Generating Latent State/ Variables in Multi-View Textual Data

    Authors: Akanksha Mehndiratta, Krishna Asawa

    Abstract: The probabilistic interpretation of Canonical Correlation Analysis (CCA) for learning low-dimensional real vectors, called as latent variables, has been exploited immensely in various fields. This study takes a step further by demonstrating the potential of CCA in discovering a latent state that captures the contextual information within the textual data under a two-view setting. The interpretatio…

    Submitted 18 June, 2024; originally announced June 2024.

  44. arXiv:2406.12818  [pdf, other]

    econ.TH cs.SI

    Optimal Bailouts in Diversified Financial Networks

    Authors: Krishna Dasaratha, Santosh Venkatesh, Rakesh Vohra

    Abstract: Widespread default involves substantial deadweight costs which could be countered by injecting capital into failing firms. Injections have positive spillovers that can trigger a repayment cascade. But which firms should a regulator bailout so as to minimize the total injection of capital while ensuring solvency of all firms? While the problem is, in general, NP-hard, for a wide range of networks t…

    Submitted 18 June, 2024; originally announced June 2024.

  45. arXiv:2406.12683  [pdf, other]

    cs.CV cs.LG

    Spatial Sequence Attention Network for Schizophrenia Classification from Structural Brain MR Images

    Authors: Nagur Shareef Shaik, Teja Krishna Cherukuri, Vince Calhoun, Dong Hye Ye

    Abstract: Schizophrenia is a debilitating, chronic mental disorder that significantly impacts an individual's cognitive abilities, behavior, and social interactions. It is characterized by subtle morphological changes in the brain, particularly in the gray matter. These changes are often imperceptible through manual observation, demanding an automated approach to diagnosis. This study introduces a deep lear…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted for the 21st IEEE International Symposium on Biomedical Imaging (ISBI 2024)

  46. arXiv:2406.12336  [pdf, other]

    cs.CL cs.LG

    A Compass for Navigating the World of Sentence Embeddings for the Telecom Domain

    Authors: Sujoy Roychowdhury, Sumit Soman, H. G. Ranjani, Vansh Chhabra, Neeraj Gunda, Subhadip Bandyopadhyay, Sai Krishna Bala

    Abstract: A plethora of sentence embedding models makes it challenging to choose one, especially for domains such as telecom, rich with specialized vocabulary. We evaluate multiple embeddings obtained from publicly available models and their domain-adapted variants, on both point retrieval accuracies as well as their (95\%) confidence intervals. We establish a systematic method to obtain thresholds for simi…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 3 figures, 4 tables

    MSC Class: 68T50; ACM Class: I.2.7

  47. arXiv:2406.11930  [pdf, other]

    cs.SE cs.AI cs.CL

    A Critical Study of What Code-LLMs (Do Not) Learn

    Authors: Abhinav Anand, Shweta Verma, Krishna Narasimhan, Mira Mezini

    Abstract: Large Language Models trained on code corpora (code-LLMs) have demonstrated impressive performance in various coding assistance tasks. However, despite their increased size and training dataset, code-LLMs still have limitations such as suggesting codes with syntactic errors, variable misuse etc. Some studies argue that code-LLMs perform well on coding tasks because they use self-attention and hidd…

    Submitted 17 June, 2024; originally announced June 2024.

  48. arXiv:2406.11877  [pdf]

    physics.ao-ph cs.LG

    Solar Power Prediction Using Satellite Data in Different Parts of Nepal

    Authors: Raj Krishna Nepal, Bibek Khanal, Vibek Ghimire, Kismat Neupane, Atul Pokharel, Kshitij Niraula, Baburam Tiwari, Nawaraj Bhattarai, Khem N. Poudyal, Nawaraj Karki, Mohan B Dangi, John Biden

    Abstract: Due to the unavailability of solar irradiance data for many potential sites of Nepal, the paper proposes predicting solar irradiance based on alternative meteorological parameters. The study focuses on five distinct regions in Nepal and utilizes a dataset spanning almost ten years, obtained from CERES SYN1deg and MERRA-2. Machine learning models such as Random Forest, XGBoost, K-Nearest Neighbors,…

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 20 pages, 12 figures, 5 tables

  49. arXiv:2406.11775  [pdf, other]

    cs.CV cs.AI

    Task Me Anything

    Authors: Jieyu Zhang, Weikai Huang, Zixian Ma, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna

    Abstract: Benchmarks for large multimodal language models (MLMs) now serve to simultaneously assess the general capabilities of models instead of evaluating for a specific capability. As a result, when a developer wants to identify which models to use for their application, they are overwhelmed by the number of benchmarks and remain uncertain about which benchmark's results are most reflective of their spec…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: website: https://www.task-me-anything.org

  50. arXiv:2406.11488  [pdf, other]

    cs.FL

    Reversible Transducers over Infinite Words

    Authors: Luc Dartois, Paul Gastin, Loïc Germerie Guizouarn, R. Govind, Shankaranarayanan Krishna

    Abstract: Deterministic two-way transducers capture the class of regular functions. The efficiency of composing two-way transducers has a direct implication in algorithmic problems related to reactive synthesis, where transformation specifications are converted into equivalent transducers. These specifications are presented in a modular way, and composing the resultant machines simulates the full specificat…

    Submitted 28 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.