Skip to main content

Showing 51–100 of 3,387 results for author: Krishna

.
  1. arXiv:2406.10721  [pdf, other

    cs.RO cs.AI cs.CV

    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics

    Authors: Wentao Yuan, Jiafei Duan, Valts Blukis, Wilbert Pumacay, Ranjay Krishna, Adithyavairavan Murali, Arsalan Mousavian, Dieter Fox

    Abstract: From rearranging objects on a table to putting groceries into shelves, robots must plan precise action points to perform tasks accurately and reliably. In spite of the recent adoption of vision language models (VLMs) to control robot behavior, VLMs struggle to precisely articulate robot actions using language. We introduce an automatic synthetic data generation pipeline that instruction-tunes VLMs… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  2. arXiv:2406.09617  [pdf, other

    cs.CL cs.HC eess.AS

    Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection

    Authors: Shruti Palaskar, Oggi Rudovic, Sameer Dharur, Florian Pesce, Gautam Krishna, Aswin Sivaraman, Jack Berkowitz, Ahmed Hussen Abdelaziz, Saurabh Adya, Ahmed Tewfik

    Abstract: Although Large Language Models (LLMs) have shown promise for human-like conversations, they are primarily pre-trained on text data. Incorporating audio or video improves performance, but collecting large-scale multimodal data and pre-training multimodal LLMs is challenging. To this end, we propose a Fusion Low Rank Adaptation (FLoRA) technique that efficiently adapts a pre-trained unimodal LLM to… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  3. arXiv:2406.09403  [pdf, other

    cs.CV cs.CL

    Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

    Authors: Yushi Hu, Weijia Shi, Xingyu Fu, Dan Roth, Mari Ostendorf, Luke Zettlemoyer, Noah A Smith, Ranjay Krishna

    Abstract: Humans draw to facilitate reasoning: we draw auxiliary lines when solving geometry problems; we mark and circle when reasoning on maps; we use sketches to amplify our ideas and relieve our limited-capacity working memory. However, such actions are missing in current multimodal language models (LMs). Current chain-of-thought and tool-use paradigms only use text as intermediate reasoning steps. In t… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 26 pages

  4. arXiv:2406.09264  [pdf, other

    cs.HC cs.AI cs.CL

    Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions

    Authors: Hua Shen, Tiffany Knearem, Reshmi Ghosh, Kenan Alkiek, Kundan Krishna, Yachuan Liu, Ziqiao Ma, Savvas Petridis, Yi-Hao Peng, Li Qiwei, Sushrita Rakshit, Chenglei Si, Yutong Xie, Jeffrey P. Bigham, Frank Bentley, Joyce Chai, Zachary Lipton, Qiaozhu Mei, Rada Mihalcea, Michael Terry, Diyi Yang, Meredith Ringel Morris, Paul Resnick, David Jurgens

    Abstract: Recent advancements in general-purpose AI have highlighted the importance of guiding AI systems towards the intended goals, ethical principles, and values of individuals and groups, a concept broadly recognized as alignment. However, the lack of clarified definitions and scopes of human-AI alignment poses a significant obstacle, hampering collaborative efforts across research domains to achieve th… ▽ More

    Submitted 17 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 56 pages

  5. arXiv:2406.08803  [pdf, other

    quant-ph math-ph

    Asymptotic Birkhoff-Violation in Operational Theories: Thermodynamic Implications and Information Processing

    Authors: Ananya Chakraborty, Sahil Gopalkrishna Naik, Samrat Sen, Ram Krishna Patra, Pratik Ghosal, Mir Alimuddin, Manik Banik

    Abstract: In accordance with the entropy principle of thermodynamics, under spontaneous evolutions, physical systems always evolve towards states with equal or greater randomness. But, where does this randomness originate? Renowned Birkhoff-von Neumann theorem, often referred to as Birkhoff theorem, identifies source of this randomness to be the stochastic application of reversible operations on the system… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: (4.25 + 7) Pages, 6 Figures, Comments are welcome

  6. arXiv:2406.08714  [pdf, other

    eess.SP

    Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator

    Authors: Mandovi Mukherjee, Xiangyu Mao, Nael Rahman, Coleman DeLude, Joe Driscoll, Sudarshan Sharma, Payman Behnam, Uday Kamal, Jongseok Woo, Daehyun Kim, Sharjeel Khan, Jianming Tong, Jamin Seo, Prachi Sinha, Madhavan Swaminathan, Tushar Krishna, Santosh Pande, Justin Romberg, Saibal Mukhopadhyay

    Abstract: A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  7. arXiv:2406.08504  [pdf, ps, other

    math.OA math.FA

    Noncommutative Donoho-Stark-Elad-Bruckstein-Ricaud-Torrésani Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $\{τ_n\}_{n=1}^\infty$ and $\{ω_m\}_{m=1}^\infty$ be two modular Parseval frames for a Hilbert C*-module $\mathcal{E}$. Then for every $x \in \mathcal{E}\setminus\{0\}$, we show that \begin{align} (1) \quad \quad \quad \quad \|θ_τx \|_0 \|θ_ωx \|_0 \geq \frac{1}{\sup_{n, m \in \mathbb{N}} \|\langle τ_n, ω_m\rangle \|^2}. \end{align} We call Inequality (1) as \textbf{Noncommutative Donoho-Stark… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 5 pages, 0 figures

    MSC Class: 42C15; 46L08

  8. arXiv:2406.07892  [pdf, ps, other

    cs.LG cs.AI

    Finite Time Analysis of Temporal Difference Learning for Mean-Variance in a Discounted MDP

    Authors: Tejaram Sangadi, L. A. Prashanth, Krishna Jagannathan

    Abstract: Motivated by risk-sensitive reinforcement learning scenarios, we consider the problem of policy evaluation for variance in a discounted reward Markov decision process (MDP). For this problem, a temporal difference (TD) type learning algorithm with linear function approximation (LFA) exists in the literature, though only asymptotic guarantees are available for this algorithm. We derive finite sampl… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  9. arXiv:2406.07332  [pdf, other

    cs.CV

    Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach

    Authors: Challapalli Phanindra Revanth, Sumohana S. Channappayya, C Krishna Mohan

    Abstract: Computing the loss gradient via backpropagation consumes considerable energy during deep learning (DL) model training. In this paper, we propose a novel approach to efficiently compute DL models' gradients to mitigate the substantial energy overhead associated with backpropagation. Exploiting the over-parameterized nature of DL models and the smoothness of their loss landscapes, we propose a metho… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  10. arXiv:2406.07246  [pdf, other

    cs.LG

    Marginalization Consistent Mixture of Separable Flows for Probabilistic Irregular Time Series Forecasting

    Authors: Vijaya Krishna Yalavarthi, Randolf Scholz, Kiran Madhusudhanan, Stefan Born, Lars Schmidt-Thieme

    Abstract: Probabilistic forecasting models for joint distributions of targets in irregular time series are a heavily under-researched area in machine learning with, to the best of our knowledge, only three models researched so far: GPR, the Gaussian Process Regression model~\citep{Durichen2015.Multitask}, TACTiS, the Transformer-Attentional Copulas for Time Series~\cite{Drouin2022.Tactis, ashok2024tactis} a… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  11. arXiv:2406.06712  [pdf, ps, other

    math.RT

    Classification of Non-Degenerate Symmetric Bilinear and Quadratic Forms in the Verlinde Category $\mathrm{Ver}_4^+$

    Authors: Iz Chen, Arun S. Kannan, Krishna Pothapragada

    Abstract: Although Deligne's theorem classifies all symmetric tensor categories (STCs) with moderate growth over algebraically closed fields of characteristic zero, the classification does not extend to positive characteristic. At the forefront of the study of STCs is the search for an analog to Deligne's theorem in positive characteristic, and it has become increasingly apparent that the Verlinde categorie… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  12. arXiv:2406.06361  [pdf, ps, other

    quant-ph math.OC

    Challenges with Differentiable Quantum Dynamics

    Authors: Sri Hari Krishna Narayanan, Michael Perlin, Robert Lewis-Swan, Jeffrey Larson, Matt Menickelly, Jan Hückelheim, Paul Hovland

    Abstract: Differentiable quantum dynamics require automatic differentiation of a complex-valued initial value problem, which numerically integrates a system of ordinary differential equations from a specified initial condition, as well as the eigendecomposition of a matrix. We explored several automatic differentiation frameworks for these tasks, finding that no framework natively supports our application r… ▽ More

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  13. arXiv:2406.05342  [pdf

    eess.SY

    Compensation for reactive power and harmonic current drawn by a non-linear load in a pv-micro hydro grid

    Authors: Raj Krishna Nepal, Bibek Khanal, Sanket Khatiwada, Nirajan Bhandari, Bishal Rijal, Raisha Karmacharya, Ajay Thapa

    Abstract: This paper presents a simulation approach to enhance the power quality of a PV-micro hydro grid supplying both linear consumer load and non-linear industrial load by integrating Shunt Active Power Filter (SAPF), utilizing instantaneous PQ theory and hysteresis current control band logic. The non-linear load draws reactive power and harmonic current from the source thereby affecting the power quali… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 5 pages, 21 figures, submitted on IEEE powercon 2024 conference

  14. arXiv:2406.05184  [pdf, other

    cs.CV

    The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better

    Authors: Scott Geng, Cheng-Yu Hsieh, Vivek Ramanujan, Matthew Wallingford, Chun-Liang Li, Pang Wei Koh, Ranjay Krishna

    Abstract: Generative text-to-image models enable us to synthesize unlimited amounts of images in a controllable manner, spurring many recent efforts to train vision models with synthetic data. However, every synthetic image ultimately originates from the upstream data used to train the generator. What additional value does the intermediate generator provide over directly training on relevant parts of the up… ▽ More

    Submitted 3 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Correspondence to sgeng at cs dot washington dot edu. RK and PWK equally advised the project

  15. arXiv:2406.05112  [pdf

    cond-mat.mes-hall physics.optics

    Ohms law lost and regained: observation and impact of zeros and poles

    Authors: Krishna Joshi, Israel Kurtz, Zhou Shi, Azriel Z. Genack

    Abstract: The quantum conductance and its classical wave analogue, the transmittance, are given by the sum of the eigenvalues of the transmission matrix. The lowest transmission eigenvalue in diffusive media might be expected to play a negligible role in the conductance, and, in any case, to be too small to be observed. Here, we observe the lowest transmission eigenchannel in microwave waveguides, though it… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  16. arXiv:2406.04548  [pdf, other

    cs.LG cs.IR cs.SI

    GNNAnatomy: Systematic Generation and Evaluation of Multi-Level Explanations for Graph Neural Networks

    Authors: Hsiao-Ying Lu, Yiran Li, Ujwal Pratap Krishna Kaluvakolanu Thyagarajan, Kwan-Liu Ma

    Abstract: Graph Neural Networks (GNNs) have proven highly effective in various machine learning (ML) tasks involving graphs, such as node/graph classification and link prediction. However, explaining the decisions made by GNNs poses challenges because of the aggregated relational information based on graph structure, leading to complex data transformations. Existing methods for explaining GNNs often face li… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  17. arXiv:2406.03093  [pdf, other

    astro-ph.SR

    Modelling the propagation of slow magneto-acoustic waves in a multi-stranded coronal loop

    Authors: S. Krishna Prasad, T. Van Doorsselaere

    Abstract: We study the propagation properties of slow magneto-acoustic waves in a multi-thermal coronal loop using a 3D MHD model, for the first time. A bundle of 33 vertical cylinders, each of 100{\,}km radius, randomly distributed over a circular region of radius 1{\,}Mm is considered to represent the coronal loop. The slow waves are driven by perturbing the vertical velocity ($v_z$) at the base of the lo… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in ApJ

  18. arXiv:2406.01859  [pdf, other

    quant-ph

    Variational quantum state preparation for quantum-enhanced metrology in noisy systems

    Authors: Juan C. Zuñiga Castro, Jeffrey Larson, Sri Hari Krishna Narayanan, Victor E. Colussi, Michael A. Perlin, Robert J. Lewis-Swan

    Abstract: We investigate optimized quantum state preparation for quantum metrology applications in noisy environments. We simulate a low-depth variational quantum circuit (VQC) composed of a sequence of global rotations and entangling operations applied to a chain of qubits that are subject to dephasing noise. The parameters controlling the VQC are numerically optimized to maximize the quantum Fisher inform… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures

  19. arXiv:2406.01698  [pdf, other

    cs.AR cs.AI cs.DC cs.LG

    Demystifying Platform Requirements for Diverse LLM Inference Use Cases

    Authors: Abhimanyu Bambhaniya, Ritik Raj, Geonhwa Jeong, Souvik Kundu, Sudarshan Srinivasan, Midhilesh Elavazhagan, Madhu Kumar, Tushar Krishna

    Abstract: Large language models (LLMs) have shown remarkable performance across a wide range of applications, often outperforming human experts. However, deploying these parameter-heavy models efficiently for diverse inference use cases requires carefully designed hardware platforms with ample computing, memory, and network resources. With LLM deployment scenarios and models evolving at breakneck speed, the… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 12 Pages, https://github.com/abhibambhaniya/GenZ-LLM-Analyzer

  20. Seasonal variation in nighttime NO radiative cooling as observed by TIMED/SABER in lower thermosphere during solar maximum and solar minimum

    Authors: Alok Kumar Ranjan, MV Sunil Krishna, Akash Kumar, Dayakrishna Nailwal, Sumanta Sarkhel

    Abstract: Both composition and temperature play a crucial role in determining the NO radiative cooling in lower thermosphere as observed by TIMED/SABER. In this work, we present a detailed investigation of seasonal variation in thermospheric NO radiative cooling. We have carried forward the investigation of \cite{li2018} regarding the variations in local nighttime peak NO radiative cooling and its altitude… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 19 pages, 10 figures

  21. arXiv:2406.00060  [pdf, other

    cs.CL cs.LG

    Cascade-Aware Training of Language Models

    Authors: Congchao Wang, Sean Augenstein, Keith Rush, Wittawat Jitkrittum, Harikrishna Narasimhan, Ankit Singh Rawat, Aditya Krishna Menon, Alec Go

    Abstract: Reducing serving cost and latency is a fundamental concern for the deployment of language models (LMs) in business applications. To address this, cascades of LMs offer an effective solution that conditionally employ smaller models for simpler queries. Cascaded systems are typically built with independently trained models, neglecting the advantages of considering inference-time interactions of the… ▽ More

    Submitted 29 May, 2024; originally announced June 2024.

    Comments: 22 pages, 13 figures

  22. arXiv:2405.20933  [pdf, ps, other

    cs.LG stat.ML

    Concentration Bounds for Optimized Certainty Equivalent Risk Estimation

    Authors: Ayon Ghosh, L. A. Prashanth, Krishna Jagannathan

    Abstract: We consider the problem of estimating the Optimized Certainty Equivalent (OCE) risk from independent and identically distributed (i.i.d.) samples. For the classic sample average approximation (SAA) of OCE, we derive mean-squared error as well as concentration bounds (assuming sub-Gaussianity). Further, we analyze an efficient stochastic approximation-based OCE estimator, and derive finite sample b… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  23. arXiv:2405.20654  [pdf, other

    cs.CL cs.IR

    Passage-specific Prompt Tuning for Passage Reranking in Question Answering with Large Language Models

    Authors: Xuyang Wu, Zhiyuan Peng, Krishna Sravanthi Rajanala Sai, Hsin-Tai Wu, Yi Fang

    Abstract: Effective passage retrieval and reranking methods have been widely utilized to identify suitable candidates in open-domain question answering tasks, recent studies have resorted to LLMs for reranking the retrieved passages by the log-likelihood of the question conditioned on each passage. Although these methods have demonstrated promising results, the performance is notably sensitive to the human-… ▽ More

    Submitted 20 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted at Gen-IR@SIGIR24

  24. arXiv:2405.20617  [pdf, other

    eess.SP

    Large-scale Outdoor Cell-free mMIMO Channel Measurement in an Urban Scenario at 3.5 GHz

    Authors: Yuning Zhang, Thomas Choi, Zihang Cheng, Issei Kanno, Masaaki Ito, Jorge Gomez-Ponce, Hussein Hammoud, Bowei Wu, Ashwani Pradhan, Kelvin Arana, Pramod Krishna, Tianyi Yang, Tyler Chen, Ishita Vasishtha, Haoyu Xie, Linyu Sun, Andreas F. Molisch

    Abstract: The design of cell-free massive MIMO (CF-mMIMO) systems requires accurate, measurement-based channel models. This paper provides the first results from the by far most extensive outdoor measurement campaign for CF-mMIMO channels in an urban environment. We measured impulse responses between over 20,000 potential access point (AP) locations and 80 user equipments (UEs) at 3.5 GHz with 350 MHz bandw… ▽ More

    Submitted 6 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Submitted to: VTC 2024-Fall

  25. arXiv:2405.20457  [pdf, other

    cs.SI cs.CY cs.HC

    Online network topology shapes personal narratives and hashtag generation

    Authors: J. Hunter Priniski, Bryce Linford, Sai Krishna, Fred Morstatter, Jeff Brantingham, Hong**g Lu

    Abstract: While narratives have shaped cognition and cultures for centuries, digital media and online social networks have introduced new narrative phenomena. With increased narrative agency, networked groups of individuals can directly contribute and steer narratives that center our collective discussions of politics, science, and morality. We report the results of an online network experiment on narrative… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Will be published in the 2024 Proceedings of the Cognitive Science Society

  26. arXiv:2405.19801  [pdf, other

    physics.space-ph

    Modeling of Nitric Oxide Infrared radiative flux in lower thermosphere: a machine learning perspective

    Authors: Dayakrishna Nailwal, MV Sunil Krishna, Alok Kumar Ranjan, Jia Yue

    Abstract: Nitric Oxide (NO) significantly impacts energy distribution and chemical processes in the mesosphere and lower thermosphere (MLT). During geomagnetic storms, a substantial influx of energy in the thermosphere leads to an increase in NO infrared emissions. Accurately predicting the radiative flux of Nitric Oxide is crucial for understanding the thermospheric energy budget, particularly during extre… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 18 pages, 7 figures

    Journal ref: Under review in Advances in Space Research 2024

  27. arXiv:2405.19597  [pdf, other

    cs.LG cs.AI cs.CL

    SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors

    Authors: Vijay Lingam, Atula Tejaswi, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Alex Dimakis, Eunsol Choi, Aleksandar Bojchevski, Sujay Sanghavi

    Abstract: Popular parameter-efficient fine-tuning (PEFT) methods, such as LoRA and its variants, freeze pre-trained model weights \(W\) and inject learnable matrices \(ΔW\). These \(ΔW\) matrices are structured for efficient parameterization, often using techniques like low-rank approximations or scaling vectors. However, these methods typically show a performance gap compared to full fine-tuning. Although… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 17 pages, 5 figures, 14 tables

  28. arXiv:2405.19261  [pdf, other

    cs.CL cs.AI cs.LG

    Faster Cascades via Speculative Decoding

    Authors: Harikrishna Narasimhan, Wittawat Jitkrittum, Ankit Singh Rawat, Seungyeon Kim, Neha Gupta, Aditya Krishna Menon, Sanjiv Kumar

    Abstract: Cascades and speculative decoding are two common approaches to improving language models' inference efficiency. Both approaches involve interleaving models of different sizes, but via fundamentally distinct mechanisms: cascades employ a deferral rule that invokes the larger model only for "hard" inputs, while speculative decoding uses speculative execution to primarily invoke the larger model in p… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  29. arXiv:2405.18400  [pdf, other

    cs.CL cs.LG

    Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass

    Authors: Ethan Shen, Alan Fan, Sarah M. Pratt, Jae Sung Park, Matthew Wallingford, Sham M. Kakade, Ari Holtzman, Ranjay Krishna, Ali Farhadi, Aditya Kusupati

    Abstract: Many applications today provide users with multiple auto-complete drafts as they type, including GitHub's code completion, Gmail's smart compose, and Apple's messaging auto-suggestions. Under the hood, language models support this by running an autoregressive inference pass to provide a draft. Consequently, providing $k$ drafts to the user requires running an expensive language model $k$ times. To… ▽ More

    Submitted 24 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 22 pages, 15 figures

  30. arXiv:2405.17731  [pdf, other

    cs.DB

    Evaluating NoSQL Databases for OLAP Workloads: A Benchmarking Study of MongoDB, Redis, Kudu and ArangoDB

    Authors: Rishi Kesav Mohan, Risheek Rakshit Sukumar Kanmani, Krishna Anandan Ganesan, Nisha Ramasubramanian

    Abstract: In the era of big data, conventional RDBMS models have become impractical for handling colossal workloads. Consequently, NoSQL databases have emerged as the preferred storage solutions for executing processing-intensive Online Analytical Processing (OLAP) tasks. Within the realm of NoSQL databases, various classifications exist based on their data storage mechanisms, making it challenging to selec… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  31. arXiv:2405.17545  [pdf, other

    hep-ph nucl-th

    Adiabatic Hydrodynamization and the Emergence of Attractors: a Unified Description of Hydrodynamization in Kinetic Theory

    Authors: Krishna Rajagopal, Bruno Scheihing-Hitschfeld, Rachel Steinhorst

    Abstract: "Attractor" solutions for the pre-hydrodynamic, far-from-equilibrium, evolution of the matter produced in relativistic heavy ion collisions have emerged as crucial descriptors of the rapid hydrodynamization of quark-gluon plasma (QGP). Adiabatic Hydrodynamization (AH) has been proposed as a framework with which to describe, explain, and predict attractor behavior that draws upon an analogy to the… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 63 pages, 20 figures

    Report number: MIT-CTP/5724

  32. arXiv:2405.17309  [pdf, other

    cs.LG cs.NI

    Survey of Graph Neural Network for Internet of Things and NextG Networks

    Authors: Sabarish Krishna Moorthy, Jithin Jagannath

    Abstract: The exponential increase in Internet of Things (IoT) devices coupled with 6G pushing towards higher data rates and connected devices has sparked a surge in data. Consequently, harnessing the full potential of data-driven machine learning has become one of the important thrusts. In addition to the advancement in wireless technology, it is important to efficiently use the resources available and mee… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  33. arXiv:2405.16915  [pdf, other

    cs.CV cs.LG

    Multilingual Diversity Improves Vision-Language Representations

    Authors: Thao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt, Pang Wei Koh, Ranjay Krishna

    Abstract: Massive web-crawled image-text datasets lay the foundation for recent progress in multimodal learning. These datasets are designed with the goal of training a model to do well on standard computer vision benchmarks, many of which, however, have been shown to be English-centric (e.g., ImageNet). Consequently, existing data curation techniques gravitate towards using predominantly English image-text… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  34. arXiv:2405.16500  [pdf, other

    math.DS

    Optimal Intervention Strategies and Cost-effectiveness Analysis study of Tuberculosis with reference to TPT, Malnutrition and Diabetes Management

    Authors: Sushil Chhetri, Krishna Kiran Vamsi Dasu, K N Kavya, Sharath B N, Uma Shankar S, Somashekar N, Vineet Kumar Chadda

    Abstract: Tuberculosis remains a significant global health challenge, with millions of new cases reported annually. Recent studies suggest that expanding the accessibility of TB intervention programs can lead to a substantial decrease in both TB incidence and prevalence. This paper initiates by examining a deterministic mathematical model for TB transmission, aiming to analyze the underlying dynamics. Subse… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  35. arXiv:2405.16389  [pdf, ps, other

    math.SP math-ph

    Decorrelation in Local Statistics for random operators

    Authors: M. Krishna

    Abstract: In this paper we study the local spectral statistics in the localised region of various random operator models, including the $d$-dimensional the Anderson model and random Schrödinger operators. It is already established, in the above models, that at an energy $E$, in the localised energy region of the spectrum, where the density of states $n(E) > 0$, the local eigenvalue statistics $X_E$ is a Poi… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  36. arXiv:2405.15839  [pdf, ps, other

    math.NT

    Balancing and Lucas-balancing numbers as difference of two repdigits

    Authors: Monalisa Mohapatra, Pritam Kumar Bhoi, Gopal Krishna Panda

    Abstract: Positive integers with all digits equal are called repdigits. In this paper, we find all balancing and Lucas-balancing numbers, which can be expressed as the difference of two repdigits. The method of proof involves the application of Baker's theory for linear forms in logarithms of algebraic numbers and the Baker-Davenport reduction procedure.

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 13 pages. arXiv admin note: text overlap with arXiv:2405.04801

    MSC Class: Primary 11B39; Secondary 11J86; 11D61

  37. arXiv:2405.15590  [pdf, ps, other

    cs.CL

    Profiling checkpointing schedules in adjoint ST-AD

    Authors: Laurent Hascoët, Jean-Luc Bouchot, Shreyas Sunil Gaikwad, Sri Hari Krishna Narayanan, Jan Hückelheim

    Abstract: Checkpointing is a cornerstone of data-flow reversal in adjoint algorithmic differentiation. Checkpointing is a storage/recomputation trade-off that can be applied at different levels, one of which being the call tree. We are looking for good placements of checkpoints onto the call tree of a given application, to reduce run time and memory footprint of its adjoint. There is no known optimal soluti… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  38. arXiv:2405.13762  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation

    Authors: Gwanghyun Kim, Alonso Martinez, Yu-Chuan Su, Brendan Jou, José Lezama, Agrim Gupta, Lijun Yu, Lu Jiang, Aren Jansen, Jacob Walker, Krishna Somandepalli

    Abstract: Training diffusion models for audiovisual sequences allows for a range of generation tasks by learning conditional distributions of various input-output combinations of the two modalities. Nevertheless, this strategy often requires training a separate model for each task which is expensive. Here, we propose a novel training approach to effectively learn arbitrary conditional distributions in the a… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  39. arXiv:2405.13181  [pdf, other

    cs.CL cs.LG

    Comparative Analysis of Different Efficient Fine Tuning Methods of Large Language Models (LLMs) in Low-Resource Setting

    Authors: Krishna Prasad Varadarajan Srinivasan, Prasanth Gumpena, Madhusudhana Yattapu, Vishal H. Brahmbhatt

    Abstract: In the domain of large language models (LLMs), arXiv:2305.16938 showed that few-shot full-model fine-tuning -- namely Vanilla Fine Tuning (FT) and Pattern-Based Fine Tuning (PBFT) --, and In-Context Learning (ICL) generalize similarly on Out-Of-Domain (OOD) datasets, but vary in terms of task adaptation. However, they both pose challenges, especially in term of memory requirements. In this paper,… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 9 pages of main paper, 1 page of references, 6 appendix pages, 11 figures, 18 tables

  40. arXiv:2405.13170  [pdf, other

    cs.AR

    FEATHER: A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching

    Authors: Jianming Tong, Anirudh Itagi, Prasanth Chatarasi, Tushar Krishna

    Abstract: The inference of ML models composed of diverse structures, types, and sizes boils down to the execution of different dataflows (i.e. different tiling, ordering, parallelism, and shapes). Using the optimal dataflow for every layer of workload can reduce latency by up to two orders of magnitude over a suboptimal dataflow. Unfortunately, reconfiguring hardware for different dataflows involves on-chip… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 17 pages, 14 figures. International Symposium on Computer Architecture (ISCA), Jun 2024

  41. arXiv:2405.12983  [pdf, other

    eess.AS cs.AI cs.CV cs.MM cs.SD

    Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer

    Authors: Maxime Burchi, Krishna C. Puvvada, Jagadeesh Balam, Boris Ginsburg, Radu Timofte

    Abstract: Humans are adept at leveraging visual cues from lip movements for recognizing speech in adverse listening conditions. Audio-Visual Speech Recognition (AVSR) models follow similar approach to achieve robust speech recognition in noisy conditions. In this work, we present a multilingual AVSR model incorporating several enhancements to improve performance and audio noise robustness. Notably, we adapt… ▽ More

    Submitted 13 March, 2024; originally announced May 2024.

  42. arXiv:2405.12011  [pdf, ps, other

    math.CO cs.IT

    Higher weight spectra of ternary codes associated to the quadratic Veronese $3$-fold

    Authors: Krishna Kaipa, Puspendu Pradhan

    Abstract: The problem studied in this work is to determine the higher weight spectra of the Projective Reed-Muller codes associated to the Veronese $3$-fold $\mathcal V$ in $PG(9,q)$, which is the image of the quadratic Veronese embedding of $PG(3,q)$ in $PG(9,q)$. We reduce the problem to the following combinatorial problem in finite geometry: For each subset $S$ of $\mathcal V$, determine the dimension of… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    MSC Class: 94B27; 51E20; 05B25

  43. arXiv:2405.09792  [pdf

    physics.app-ph cond-mat.mes-hall cond-mat.mtrl-sci

    CMOS-compatible Strain Engineering for High-Performance Monolayer Semiconductor Transistors

    Authors: Marc Jaikissoon, Çağıl Köroğlu, Jerry A. Yang, Kathryn M. Neilson, Krishna C. Saraswat, Eric Pop

    Abstract: Strain engineering has played a key role in modern silicon electronics, having been introduced as a mobility booster in the 1990s and commercialized in the early 2000s. Achieving similar advances with two-dimensional (2D) semiconductors in a CMOS (complementary metal oxide semiconductor) compatible manner would radically improve the industrial viability of 2D transistors. Here, we show silicon nit… ▽ More

    Submitted 29 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  44. Motion Prediction with Gaussian Processes for Safe Human-Robot Interaction in Virtual Environments

    Authors: Stanley Mugisha, Vamsi Krishna Guda, Christine Chevallereau, Damien Chablat, Matteo Zoppi

    Abstract: Humans use collaborative robots as tools for accomplishing various tasks. The interaction between humans and robots happens in tight shared workspaces. However, these machines must be safe to operate alongside humans to minimize the risk of accidental collisions. Ensuring safety imposes many constraints, such as reduced torque and velocity limits during operation, thus increasing the time to accom… ▽ More

    Submitted 18 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: 16 pages

    ACM Class: I.2.6; I.2.9; I.3.2; H.5.2

  45. arXiv:2405.08003  [pdf, ps, other

    math.FA cs.IT math.OA math.QA

    Continuous Krishna-Parthasarathy Entropic Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: In 2002, Krishna and Parthasarathy [\textit{Sankhyā Ser. A}] derived discrete quantum version of Maassen-Uffink [\textit{Phys. Rev. Lett., 1988}] entropic uncertainty principle. In this paper, using the notion of continuous operator-valued frames, we derive an entropic uncertainty principle for arbitrary family of operators indexed by measure spaces having finite measure. We give an application to… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 7 pages, 0 Figures

    MSC Class: 81P15; 94A17; 42C15

    Journal ref: Special issue of Infinite Dimensional Analysis, Quantum Probability and Related Topics in honour of Prof. K. R. Parthasarathy, 18 March 2024

  46. arXiv:2405.05572  [pdf, other

    cs.CL cs.AI

    From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences

    Authors: Prashant Kodali, Anmol Goel, Likhith Asapu, Vamshi Krishna Bonagiri, Anirudh Govil, Monojit Choudhury, Manish Shrivastava, Ponnurangam Kumaraguru

    Abstract: Current computational approaches for analysing or generating code-mixed sentences do not explicitly model "naturalness" or "acceptability" of code-mixed sentences, but rely on training corpora to reflect distribution of acceptable code-mixed sentences. Modelling human judgement for the acceptability of code-mixed text can help in distinguishing natural code-mixed text and enable quality-controlled… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  47. arXiv:2405.04801  [pdf, ps, other

    math.GM

    Repdigits as difference of two balancing or Lucas-balancing numbers

    Authors: Monalisa Mohapatra, Pritam Kumar Bhoi, Gopal Krishna Panda

    Abstract: Repdigits are natural numbers formed by the repetition of a single digit. In this paper, we study the problem of writing repdigits as the difference of two balancing or Lucas-balancing numbers. The method of proof involves the application of Baker's theory for linear forms in logarithms of algebraic numbers and the Baker-Davenport reduction procedure. Computations are done with the help of a simpl… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 12

    MSC Class: Primary 11B39; Secondary 11J86; 11D61

  48. arXiv:2405.04619  [pdf, other

    hep-th cond-mat.str-el math-ph math.QA

    Non-anomalous non-invertible symmetries in 1+1D from gapped boundaries of SymTFTs

    Authors: Pavel Putrov, Rajath Radhakrishnan

    Abstract: We study the anomalies of non-invertible symmetries in 1+1D QFTs using gapped boundaries of its SymTFT. We establish the explicit relation between Lagrangian algebras which determine gapped boundaries of the SymTFT, and algebras which determine non-anomalous/gaugeable topological line operators in the 1+1D QFT. If the Lagrangian algebras in the SymTFT are known, this provides a method to compute a… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 72 pages, 24 figures

  49. arXiv:2405.03582  [pdf, other

    cs.LG

    Functional Latent Dynamics for Irregularly Sampled Time Series Forecasting

    Authors: Christian Klötergens, Vijaya Krishna Yalavarthi, Maximilian Stubbemann, Lars Schmidt-Thieme

    Abstract: Irregularly sampled time series with missing values are often observed in multiple real-world applications such as healthcare, climate and astronomy. They pose a significant challenge to standard deep learn- ing models that operate only on fully observed and regularly sampled time series. In order to capture the continuous dynamics of the irreg- ular time series, many models rely on solving an Ord… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  50. arXiv:2405.03348  [pdf, other

    cs.IT

    Evolution of the 5G New Radio Two-Step Random Access towards 6G Unsourced MAC

    Authors: Patrick Agostini, Jean-Francois Chamberland, Federico Clazzer, Johannes Dommel, Gianluigi Liva, Andrea Munari, Krishna Narayanan, Yury Polyanskiy, Slawomir Stanczak, Zoran Utkovski

    Abstract: This report summarizes some considerations on possible evolutions of grant-free random access in the next generation of the 3GPP wireless cellular standard. The analysis is carried out by map** the problem to the recently-introduced unsourced multiple access channel (UMAC) setup. By doing so, the performance of existing solutions can be benchmarked with information-theoretic bounds, assessing th… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Version 1.0 of the report