Search | arXiv e-print repository

doi 10.1109/TRO.2024.3392077

Complete and Near-Optimal Robotic Crack Coverage and Filling in Civil Infrastructure

Authors: Vishnu Veeraraghavan, Kyle Hunte, Jingang Yi, Kaiyan Yu

Abstract: We present a simultaneous sensor-based inspection and footprint coverage (SIFC) planning and control design with applications to autonomous robotic crack mapping and filling. The main challenge of the SIFC problem lies in the coupling of complete sensing (for mapping) and robotic footprint (for filling) coverage tasks. Initially, we assume known target information (e.g., crack) and employ classic cell decomposition methods to achieve complete sensing coverage of the workspace and complete robotic footprint coverage using the least-cost route. Subsequently, we generalize the algorithm to handle unknown target information, allowing the robot to scan and incrementally construct the target graph online while conducting robotic footprint coverage. The online polynomial-time SIFC planning algorithm minimizes the total robot traveling distance, guarantees complete sensing coverage of the entire workspace, and achieves near-optimal robotic footprint coverage, as demonstrated through empirical experiments. For the demonstrated application, we design coordinated nozzle motion control with the planned robot trajectory to efficiently fill all cracks within the robot's footprint. Experimental results are presented to illustrate the algorithm's design, performance, and comparisons. The SIFC algorithm offers a high-efficiency motion planning solution for various robotic applications requiring simultaneous sensing and actuation coverage.

Submitted 1 March, 2024; originally announced March 2024.

Journal ref: in IEEE Transactions on Robotics, vol. 40, pp. 2850-2867, 2024

arXiv:2402.18116 [pdf, other]

Block and Detail: Scaffolding Sketch-to-Image Generation

Authors: Vishnu Sarukkai, Lu Yuan, Mia Tang, Maneesh Agrawala, Kayvon Fatahalian

Abstract: We introduce a novel sketch-to-image tool that aligns with the iterative refinement process of artists. Our tool lets users sketch blocking strokes to coarsely represent the placement and form of objects and detail strokes to refine their shape and silhouettes. We develop a two-pass algorithm for generating high-fidelity images from such sketches at any point in the iterative process. In the first pass we use a ControlNet to generate an image that strictly follows all the strokes (blocking and detail) and in the second pass we add variation by renoising regions surrounding blocking strokes. We also present a dataset generation scheme that, when used to train a ControlNet architecture, allows regions that do not contain strokes to be interpreted as not-yet-specified regions rather than empty space. We show that this partial-sketch-aware ControlNet can generate coherent elements from partial sketches that only contain a small number of strokes. The high-fidelity images produced by our approach serve as scaffolds that can help the user adjust the shape and proportions of objects or add additional elements to the composition. We demonstrate the effectiveness of our approach with a variety of examples and evaluative comparisons.

Submitted 28 February, 2024; originally announced February 2024.

Comments: 12 pages, 13 figures

arXiv:2402.15492 [pdf, other]

Mechanics-Informed Autoencoder Enables Automated Detection and Localization of Unforeseen Structural Damage

Authors: Xuyang Li, Hamed Bolandi, Mahdi Masmoudi, Talal Salem, Nizar Lajnef, Vishnu Naresh Boddeti

Abstract: Structural health monitoring (SHM) is vital for ensuring the safety and longevity of structures like buildings and bridges. As the volume and scale of structures and the impact of their failure continue to grow, there is a dire need for SHM techniques that are scalable, inexpensive, operate passively without human intervention, and customized for each mechanical structure without the need for complex baseline models. We present a novel "deploy-and-forget" approach for automated detection and localization of damages in structures. It is based on a synergistic combination of fully passive measurements from inexpensive sensors and a mechanics-informed autoencoder. Once deployed, our solution continuously learns and adapts a bespoke baseline model for each structure, learning from its undamaged state's response characteristics. After learning from just 3 hours of data, it can autonomously detect and localize different types of unforeseen damage. Results from numerical simulations and experiments indicate that incorporating the mechanical characteristics into the variational autoencoder allows for up to 35% earlier detection and localization of damage over a standard autoencoder. Our approach holds substantial promise for a significant reduction in human intervention and inspection costs and enables proactive and preventive maintenance strategies, thus extending the lifespan, reliability, and sustainability of civil infrastructures.

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.13757 [pdf, other]

Critical Behavior and Collective Modes at the Superfluid Transition in Amorphous Systems

Authors: Vishnu Pulloor Kuttanikkad, Martin Puschmann, Rajesh Narayanan, Thomas Vojta

Abstract: We investigate the critical behavior and the dynamics of the amplitude (Higgs) mode close to the superfluid-insulator quantum phase transition in an amorphous system (i.e., a system subject to topological randomness). In particular, we map the two-dimensional Bose-Hubbard Hamiltonian defined on a random Voronoi-Delaunay lattice onto a (2+1)-dimensional layered classical XY model with correlated topological disorder. We study the resulting model by laying recourse to classical Monte Carlo simulations. We specifically focus on the scalar susceptibility of the order parameter to study the dynamics of the amplitude mode. To do so, we harness the maximum entropy method to perform the analytic continuation of the scalar susceptibility to real frequencies. Our analysis shows that the amplitude mode remains delocalized in the presence of such topological disorder, quite at odds with its behavior in generic disordered systems, where the randomness localizes the Higgs mode. Furthermore, we show that the critical behavior of the topologically disordered system is identical to that of its translationally invariant counterpart, consistent with a modified Harris criterion. This suggests that the localization of the collective excitations in the presence of disorder is tied to the critical behavior of the quantum phase transition rather than a simple Anderson-localization-type interference mechanism.

Submitted 21 February, 2024; originally announced February 2024.

Comments: 13 pages, 12 figures

arXiv:2402.11582 [pdf, other]

doi 10.1109/CSF61375.2024.00032

Publicly auditable privacy-preserving electoral rolls

Authors: Prashant Agrawal, Mahabir Prasad Jhanwar, Subodh Vishnu Sharma, Subhashis Banerjee

Abstract: While existing literature on electronic voting has extensively addressed verifiability of voting protocols, the vulnerability of electoral rolls in large public elections remains a critical concern. To ensure integrity of electoral rolls, the current practice is to either make electoral rolls public or share them with the political parties. However, this enables construction of detailed voter profiles and selective targeting and manipulation of voters, thereby undermining the fundamental principle of free and fair elections. In this paper, we study the problem of designing publicly auditable yet privacy-preserving electoral rolls. We first formulate a threat model and provide formal security definitions. We then present a protocol for creation, maintenance and usage of electoral rolls that mitigates the threats. Eligible voters can verify their inclusion, whereas political parties and auditors can statistically audit the electoral roll. Further, the audit can also detect polling-day ballot stuffing and denials to eligible voters by malicious polling officers. The entire electoral roll is never revealed, which prevents any large-scale systematic voter targeting and manipulation.

Submitted 2 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

Report number: CSF 2024

Journal ref: 2024 IEEE 37th Computer Security Foundations Symposium (CSF)

arXiv:2402.09607 [pdf, other]

Numerical Study of a Strongly Coupled Two-scale System with Nonlinear Dispersion

Authors: Surendra Nepal, Vishnu Raveendran, Michael Eden, Rainey Lyons, Adrian Muntean

Abstract: Thinking of flows crossing through regular porous media, we numerically explore the behavior of weak solutions to a two-scale elliptic-parabolic system that is strongly coupled by means of a suitable nonlinear dispersion term. The two-scale system of interest originates from the fast-drift periodic homogenization of a nonlinear convective-diffusion-reaction problem, where the structure of the non-linearity in the drift fits to the hydrodynamic limit of a totally asymmetric simple exclusion process for a population of particles. In this article, we focus exclusively on numerical simulations that employ two decoupled approximation schemes, viz. 'scheme 1' - a Picard-type iteration - and 'scheme 2' - a time discretization decoupling. Additionally, we describe a computational strategy which helps to drastically improve computation times. Finally, we provide several numerical experiments to illustrate what dispersion effects are introduced by a specific choice of microstructure and model ingredients.

Submitted 14 February, 2024; originally announced February 2024.

Comments: 27 pages, 10 figures, 3 tables

MSC Class: 65M60; 47J25; 35M30; 35G55

arXiv:2402.08648 [pdf, other]

Generating Universal Adversarial Perturbations for Quantum Classifiers

Authors: Gautham Anil, Vishnu Vinod, Apurva Narayan

Abstract: Quantum Machine Learning (QML) has emerged as a promising field of research, aiming to leverage the capabilities of quantum computing to enhance existing machine learning methodologies. Recent studies have revealed that, like their classical counterparts, QML models based on Parametrized Quantum Circuits (PQCs) are also vulnerable to adversarial attacks. Moreover, the existence of Universal Adversarial Perturbations (UAPs) in the quantum domain has been demonstrated theoretically in the context of quantum classifiers. In this work, we introduce QuGAP: a novel framework for generating UAPs for quantum classifiers. We conceptualize the notion of additive UAPs for PQC-based classifiers and theoretically demonstrate their existence. We then utilize generative models (QuGAP-A) to craft additive UAPs and experimentally show that quantum classifiers are susceptible to such attacks. Moreover, we formulate a new method for generating unitary UAPs (QuGAP-U) using quantum generative models and a novel loss function based on fidelity constraints. We evaluate the performance of the proposed framework and show that our method achieves state-of-the-art misclassification rates, while maintaining high fidelity between legitimate and adversarial samples.

Submitted 13 February, 2024; originally announced February 2024.

Comments: Accepted at AAAI 2024

arXiv:2402.04986 [pdf, other]

Heat transport through an open coupled scalar field theory hosting stability-to-instability transition

Authors: T. R. Vishnu, Dibyendu Roy

Abstract: We investigate heat transport through a one-dimensional open coupled scalar field theory, depicted as a network of harmonic oscillators connected to thermal baths at the boundaries. The non-Hermitian dynamical matrix of the network undergoes a stability-to-instability transition at the exceptional points as the coupling strength between the scalar fields increases. The open network in the unstable regime, marked by the emergence of inverted oscillator modes, does not acquire a steady state, and the heat conduction is then unbounded for general bath couplings. In this work, we engineer a unique bath coupling where a single bath is connected to two fields at each edge with the same strength. This configuration leads to a finite steady-state heat conduction in the network, even in the unstable regime. We also study general bath couplings, e.g., connecting two fields to two separate baths at each boundary, which shows an exciting signature of approaching the unstable regime for massive fields. We derive analytical expressions for high-temperature classical heat current through the network for different bath couplings at the edges and compare them. Furthermore, we determine the temperature dependence of low-temperature quantum heat current in different cases. Our study will help to probe topological phases and phase transitions in various quadratic Hermitian bosonic models whose dynamical matrices resemble non-Hermitian Hamiltonians, hosting exciting topological phases.

Submitted 7 February, 2024; originally announced February 2024.

Comments: 19 pages, 4 figures

arXiv:2402.04495 [pdf, other]

Using bi-fluxon tunneling to protect the Fluxonium qubit

Authors: Waël Ardati, Sébastien Léger, Shelender Kumar, Vishnu Narayanan Suresh, Dorian Nicolas, Cyril Mori, Francesca D'Esposito, Tereza Vakhtel, Olivier Buisson, Quentin Ficheux, Nicolas Roch

Abstract: Encoding quantum information in quantum states with disjoint wave-function support and noise insensitive energies is the key behind the idea of qubit protection. While fully protected qubits are expected to offer exponential protection against both energy relaxation and pure dephasing, simpler circuits may grant partial protection with currently achievable parameters. Here, we study a fluxonium circuit in which the wave-functions are engineered to minimize their overlap while benefiting from a first-order-insensitive flux sweet spot. Taking advantage of a large superinductance ($L\sim 1~μ\rm{H}$), our circuit incorporates a resonant tunneling mechanism at zero external flux that couples states with the same fluxon parity, thus enabling bifluxon tunneling. The states $|0\rangle$ and $|1\rangle$ are encoded in wave-functions with parities 0 and 1, respectively, ensuring a minimal form of protection against relaxation. Two-tone spectroscopy reveals the energy level structure of the circuit and the presence of $4π$ quantum-phase slips between different potential wells corresponding to $m=\pm 1$ fluxons, which can be precisely described by a simple fluxonium Hamiltonian or by an effective bifluxon Hamiltonian. Despite suboptimal fabrication, the measured relaxation ($T_1 = 177\pm 3~μ\rm{s}$) and dephasing ($T_2^E = 75\pm 5~μ\rm{s}$) times not only demonstrate the relevance of our approach but also open an alternative direction towards quantum computing using partially-protected fluxonium qubits.

Submitted 6 February, 2024; originally announced February 2024.

Comments: 14 pages, 12 figures

arXiv:2402.02603 [pdf]

A Review of Full-Sized Autonomous Racing Vehicle Sensor Architecture

Authors: Manuel Mar, Vishnu Chellapandi, Liangqi Yuan, Ziran Wang, Eric Dietz

Abstract: In the landscape of technological innovation, autonomous racing is a dynamic and challenging domain that not only pushes the limits of technology, but also plays a crucial role in advancing and fostering a greater acceptance of autonomous systems. This paper thoroughly explores challenges and advances in autonomous racing vehicle design and performance, focusing on Roborace and the Indy Autonomous Challenge (IAC). This review provides a detailed analysis of sensor setups, architectural nuances, and test metrics on these cutting-edge platforms. In Roborace, the evolution from Devbot 1.0 to Robocar and Devbot 2.0 is detailed, revealing insights into sensor configurations and performance outcomes. The examination extends to the IAC, which is dedicated to high-speed self-driving vehicles, emphasizing developmental trajectories and sensor adaptations. By reviewing these platforms, the analysis provides valuable insight into autonomous driving racing, contributing to a broader understanding of sensor architectures and the challenges faced. This review supports future advances in full-scale autonomous racing technology.

Submitted 4 February, 2024; originally announced February 2024.

arXiv:2402.01711 [pdf, other]

LLM on FHIR -- Demystifying Health Records

Authors: Paul Schmiedmayer, Adrit Rao, Philipp Zagar, Vishnu Ravi, Aydin Zahedivash, Arash Fereydooni, Oliver Aalami

Abstract: Objective: To enhance health literacy and accessibility of health information for a diverse patient population by developing a patient-centered artificial intelligence (AI) solution using large language models (LLMs) and Fast Healthcare Interoperability Resources (FHIR) application programming interfaces (APIs). Materials and Methods: The research involved developing LLM on FHIR, an open-source mobile application allowing users to interact with their health records using LLMs. The app is built on Stanford's Spezi ecosystem and uses OpenAI's GPT-4. A pilot study was conducted with the SyntheticMass patient dataset and evaluated by medical experts to assess the app's effectiveness in increasing health literacy. The evaluation focused on the accuracy, relevance, and understandability of the LLM's responses to common patient questions. Results: LLM on FHIR demonstrated varying but generally high degrees of accuracy and relevance in providing understandable health information to patients. The app effectively translated medical data into patient-friendly language and was able to adapt its responses to different patient profiles. However, challenges included variability in LLM responses and the need for precise filtering of health data. Discussion and Conclusion: LLMs offer significant potential in improving health literacy and making health records more accessible. LLM on FHIR, as a pioneering application in this field, demonstrates the feasibility and challenges of integrating LLMs into patient care. While promising, the implementation and pilot also highlight risks such as inconsistent responses and the importance of replicable output. Future directions include better resource identification mechanisms and executing LLMs on-device to enhance privacy and reduce costs.

Submitted 25 January, 2024; originally announced February 2024.

Comments: Pre-print of the paper submitted to the Call for Papers for the Special Focus Issue on ChatGPT and Large Language Models (LLMs) in Biomedicine and Health at the Journal of the American Medical Informatics Association: https://academic.oup.com/jamia/pages/call-for-papers-for-special-focus-issue

arXiv:2402.00197 [pdf, other]

doi 10.1021/acs.est.3c06447

Determination of Trace Organic Contaminant Concentration via Machine Classification of Surface-Enhanced Raman Spectra

Authors: Vishnu Jayaprakash, Jae Bem You, Chiranjeevi Kanike, Jinfeng Liu, Christopher McCallum, Xuehua Zhang

Abstract: Accurate detection and analysis of traces of persistent organic pollutants in water is important in many areas, including environmental monitoring and food quality control, due to their long environmental stability and potential bioaccumulation. While conventional analysis of organic pollutants requires expensive equipment, surface-enhanced Raman spectroscopy (SERS) has demonstrated great potential for accurate detection of these contaminants. However, SERS analytical difficulties, such as spectral preprocessing, denoising, and substrate-based spectral variation, have hindered widespread use of the technique. Here, we demonstrate an approach for predicting the concentration of sample pollutants from messy, unprocessed Raman data using machine learning. Frequency domain transform methods, including the Fourier and Walsh-Hadamard transforms, are applied to sets of Raman spectra of three model micropollutants in water (rhodamine 6G, chlorpyrifos, and triclosan), which are then used to train machine learning algorithms. Using standard machine learning models, the concentration of sample pollutants is predicted with more than 80 percent cross-validation accuracy from raw Raman data. Cross-validation accuracy of 85 percent was achieved using deep learning for a moderately sized dataset (100 spectra), and 70 to 80 percent cross-validation accuracy was achieved even for very small datasets (50 spectra). Additionally, standard models were shown to accurately identify characteristic peaks via analysis of their importance scores. The approach shown here has the potential to be applied to facilitate accurate detection and analysis of persistent organic pollutants by surface-enhanced Raman spectroscopy.

Submitted 31 January, 2024; originally announced February 2024.

arXiv:2402.00090 [pdf]

Classification of attention performance post-longitudinal tDCS via functional connectivity and machine learning methods

Authors: Akash K Rao, Vishnu K Menon, Arnav Bhavsar, Shubhajit Roy Chowdhury, Ramsingh Negi, Varun Dutt

Abstract: Attention is the brain's mechanism for selectively processing specific stimuli while filtering out irrelevant information. Characterizing changes in attention following long-term interventions (such as transcranial direct current stimulation (tDCS)) has seldom been emphasized in the literature. To classify attention performance post-tDCS, this study uses functional connectivity and machine learning algorithms. Fifty individuals were split into experimental and control conditions. On Day 1, EEG data was obtained as subjects executed an attention task. From Day 2 through Day 8, the experimental group was administered 1mA tDCS, while the control group received sham tDCS. On Day 10, subjects repeated the task mentioned on Day 1. Functional connectivity metrics were used to classify attention performance using various machine learning methods. Results revealed that combining the Adaboost model and recursive feature elimination yielded a classification accuracy of 91.84%. We discuss the implications of our results in developing neurofeedback frameworks to assess attention.

Submitted 31 January, 2024; originally announced February 2024.

Comments: 6 pages, to be presented in the IEEE 9th International Conference for Convergence in Technology (I2CT), Pune, April 2024. arXiv admin note: substantial text overlap with arXiv:2401.17700

arXiv:2401.17745 [pdf]

Gesture Controlled Robot For Human Detection

Authors: Athira T. S, Honey Manoj, R S Vishnu Priya, Vishnu K Menon, Srilekshmi M

Abstract: It is very important to locate survivors from collapsed buildings so that rescue operations can be arranged. Many lives are lost due to lack of competent systems to detect people in these collapsed buildings at the right time. So here we have designed a hand gesture controlled robot which is capable of detecting humans under these collapsed building parts. The proposed work can be used to access specific locations that are not humanly possible, and detect those humans trapped under the rubble of collapsed buildings. This information is then used to notify the rescue team to take adequate measures and initiate rescue operations accordingly.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 6 pages, presented at the 2nd International Conference on IoT Based Control Networks and Intelligent Systems (ICICNIS 2021)

Journal ref: Proceedings of International Conference on IoT Based Control Networks & Intelligent Systems - ICICNIS 2021, 6 pages, 2021

arXiv:2401.17711 [pdf]

Prediction of multitasking performance post-longitudinal tDCS via EEG-based functional connectivity and machine learning methods

Authors: Akash K Rao, Shashank Uttrani, Vishnu K Menon, Darshil Shah, Arnav Bhavsar, Shubhajit Roy Chowdhury, Varun Dutt

Abstract: Predicting and understanding the changes in cognitive performance, especially after a longitudinal intervention, is a fundamental goal in neuroscience. Longitudinal brain stimulation-based interventions like transcranial direct current stimulation (tDCS) induce short-term changes in the resting membrane potential and influence cognitive processes. However, very little research has been conducted on predicting these changes in cognitive performance post-intervention. In this research, we intend to address this gap in the literature by employing different EEG-based functional connectivity analyses and machine learning algorithms to predict changes in cognitive performance in a complex multitasking task. Forty subjects were divided into experimental and active-control conditions. On Day 1, all subjects executed a multitasking task with simultaneous 32-channel EEG being acquired. From Day 2 to Day 7, subjects in the experimental condition undertook 15 minutes of 2mA anodal tDCS stimulation during task training. Subjects in the active-control condition undertook 15 minutes of sham stimulation during task training. On Day 10, all subjects again executed the multitasking task with EEG acquisition. Source-level functional connectivity metrics, namely phase lag index and directed transfer function, were extracted from the EEG data on Day 1 and Day 10. Various machine learning models were employed to predict changes in cognitive performance. Results revealed that the multi-layer perceptron and directed transfer function recorded a cross-validation training RMSE of 5.11% and a test RMSE of 4.97%. We discuss the implications of our results in developing real-time cognitive state assessors for accurately predicting cognitive performance in dynamic and complex tasks post-tDCS intervention.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 16 pages, presented at the 30th International Conference on Neural Information Processing (ICONIP2023), Changsha, China, November 2023

arXiv:2401.17705 [pdf]

Predicting suicidal behavior among Indian adults using childhood trauma, mental health questionnaires and machine learning cascade ensembles

Authors: Akash K Rao, Gunjan Y Trivedi, Riri G Trivedi, Anshika Bajpai, Gajraj Singh Chauhan, Vishnu K Menon, Kathirvel Soundappan, Hemalatha Ramani, Neha Pandya, Varun Dutt

Abstract: Among young adults, suicide is India's leading cause of death, accounting for an alarming national suicide rate of around 16%. In recent years, machine learning algorithms have emerged to predict suicidal behavior using various behavioral traits. But to date, the efficacy of machine learning algorithms in predicting suicidal behavior in the Indian context has not been explored in the literature. In this study, different machine learning algorithms and ensembles were developed to predict suicide behavior based on childhood trauma, different mental health parameters, and other behavioral factors. The dataset was acquired from 391 individuals from a wellness center in India. Information regarding their childhood trauma, psychological wellness, and other mental health issues was acquired through standardized questionnaires. Results revealed that cascade ensemble learning methods using a support vector machine, decision trees, and random forest were able to classify suicidal behavior with an accuracy of 95.04% using data from childhood trauma and mental health questionnaires. The study highlights the potential of using these machine learning ensembles to identify individuals with suicidal tendencies so that targeted interventions could be provided efficiently.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 11 pages, presented at the 4th International Conference on Frontiers in Computing and Systems (COMSYS 2023), Himachal Pradesh, October 2023

arXiv:2401.17700 [pdf]

Classification of executive functioning performance post-longitudinal tDCS using functional connectivity and machine learning methods

Authors: Akash K Rao, Vishnu K Menon, Shashank Uttrani, Ayushman Dixit, Dipanshu Verma, Varun Dutt

Abstract: Executive functioning is a cognitive process that enables humans to plan, organize, and regulate their behavior in a goal-directed manner. Understanding and classifying the changes in executive functioning after longitudinal interventions (like transcranial direct current stimulation (tDCS)) has not been explored in the literature. This study employs functional connectivity and machine learning algorithms to classify executive functioning performance post-tDCS. Fifty subjects were divided into experimental and placebo control groups. EEG data was collected while subjects performed an executive functioning task on Day 1. The experimental group received tDCS during task training from Day 2 to Day 8, while the control group received sham tDCS. On Day 10, subjects repeated the tasks specified on Day 1. Different functional connectivity metrics were extracted from EEG data and eventually used for classifying executive functioning performance using different machine learning algorithms. Results revealed that a novel combination of partial directed coherence and multi-layer perceptron (along with recursive feature elimination) resulted in a high classification accuracy of 95.44%. We discuss the implications of our results in developing real-time neurofeedback systems for assessing and enhancing executive functioning performance post-tDCS administration.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 7 pages, presented at the IEEE 20th India Council International Conference (INDICON 2023), Hyderabad, India, December 2023

arXiv:2401.15078 [pdf, other]

Physical Yukawa Couplings in Heterotic String Compactifications

Authors: Giorgi Butbaia, Damián Mayorga Peña, Justin Tan, Per Berglund, Tristan Hübsch, Vishnu Jejjala, Challenger Mishra

Abstract: One of the challenges of heterotic compactification on a Calabi-Yau threefold is to determine the physical $(\mathbf{27})^3$ Yukawa couplings of the resulting four-dimensional $\mathcal{N}=1$ theory. In general, the calculation necessitates knowledge of the Ricci-flat metric. However, in the standard embedding, which references the tangent bundle, we can compute normalized Yukawa couplings from the Weil-Petersson metric on the moduli space of complex structure deformations of the Calabi-Yau manifold. In various examples (the Fermat quintic, the intersection of two cubics in $\mathbb{P}^5$, and the Tian-Yau manifold), we calculate the normalized Yukawa couplings for $(2,1)$-forms using the Weil-Petersson metric obtained from the Kodaira-Spencer map. In cases where $h^{1,1}=1$, this is compared to a complementary calculation based on performing period integrals. A third expression for the normalized Yukawa couplings is obtained from a machine learned approximate Ricci-flat metric making use of explicit harmonic representatives. The excellent agreement between the different approaches opens the door to precision string phenomenology.

Submitted 1 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Comments: 33 pages, 11 figures, 2 tables, 3 lemmas, 1 theorem. v2: Minor edits

arXiv:2401.14292 [pdf, other]

Single and bi-layered 2-D acoustic soft tactile skin (AST2)

Authors: Vishnu Rajendran, Simon Parsons, Amir Ghalamzan E

Abstract: This paper aims to present an innovative and cost-effective design for Acoustic Soft Tactile (AST) Skin, with the primary goal of significantly enhancing the accuracy of 2-D tactile feature estimation. The existing challenge lies in achieving precise tactile feature estimation, especially concerning contact geometry characteristics, using cost-effective solutions. We hypothesise that by harnessing acoustic energy through dedicated acoustic channels in 2 layers beneath the sensing surface and analysing amplitude modulation, we can effectively decode interactions on the sensory surface, thereby improving tactile feature estimation. Our approach involves the distinct separation of hardware components responsible for emitting and receiving acoustic signals, resulting in a modular and highly customizable skin design. Practical tests demonstrate the effectiveness of this novel design, achieving remarkable precision in estimating contact normal forces (MAE < 0.8 N), 2D contact localisation (MAE < 0.7 mm), and contact surface diameter (MAE < 0.3 mm). In conclusion, the AST skin, with its innovative design and modular architecture, successfully addresses the challenge of tactile feature estimation. The presented results showcase its ability to precisely estimate various tactile features, making it a practical and cost-effective solution for robotic applications.

Submitted 29 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

Comments: IEEE Robosoft conference 2024 (accepted)

arXiv:2401.10377 [pdf, other]

doi 10.3847/PSJ/acf298

Grain Size Effects on UV-MIR (0.2-14 micron) Spectra of Carbonaceous Chondrite Groups

Authors: David C. Cantillo, Vishnu Reddy, Adam Battle, Benjamin N. L. Sharkey, Neil C. Pearson, Tanner Campbell, Akash Satpathy, Mario De Florio, Roberto Furfaro, Juan Sanchez

Abstract: Carbonaceous chondrites are among the most important meteorite types and have played a vital role in deciphering the origin and evolution of our solar system. They have been linked to low-albedo C-type asteroids, but due to subdued absorption bands, definitive asteroid-meteorite linkages remain elusive. A majority of these existing linkages rely on fine-grained (typically < 45 micron) powders across a limited wavelength range in the visible to near-infrared (0.35-2.5 microns). While this is useful in interpreting the fine-grained regolith of larger main-belt objects like Ceres, recent spacecraft missions to smaller near-Earth asteroids (NEAs), such as Bennu and Ryugu, have shown that their surfaces are dominated by larger grain size material. To better interpret the surfaces of these smaller, carbonaceous NEAs, we obtained laboratory reflectance spectra of seven carbonaceous chondrite meteorite groups (CI, CM, CO, CV, CR, CK, C2-ungrouped) over the ultraviolet to mid-infrared range (0.2-14 microns). Each meteorite contained five grain size bins (45-1000 microns) to help constrain spectral grain size effects. We find a correlation between grain size and absolute reflectance, spectral slope, band depth, and the Christiansen feature band center. Principal component analysis of grain size variation illustrates a similar trend to lunar-style space weathering. We also show that the Bus-DeMeo asteroid taxonomic classification of our samples is affected by grain size, specifically shifting CM2 Aguas Zarcas from a Ch-type to B-type with increasing grain size. This has implications for the parent body of the OSIRIS-REx target, Bennu. With Aguas Zarcas, we present results from Hapke modeling.

Submitted 18 January, 2024; originally announced January 2024.

Comments: 40 pages, 15 figures, published in the Planetary Science Journal

Journal ref: Planet. Sci. J. 4 177 (2023)

arXiv:2401.03815 [pdf, other]

doi 10.1007/s11071-023-08976-9

Degenerate soliton solutions and their interactions in coupled Hirota equation with trivial and nontrivial background

Authors: S. Monisha, N. Vishnu Priya, M. Senthilvelan

Abstract: We construct two kinds of degenerate soliton solutions, one on the zero background and another on the plane wave background, for the coupled Hirota equation. In the case of zero background field, we derive positon solutions of various orders. We also study interaction dynamics between positon solutions through asymptotic analysis and show that the positons exhibit a time-dependent phase shift during collision. We also construct hybrid solutions which are composed of positons and solitons and examine the interaction between higher order positons and multi-solitons in detail. From the interaction, we demonstrate the occurrence of elastic and inelastic interaction between multi-solitons and higher order positons. Further, we construct bound states among solitons and positons for the coupled Hirota equation. In the case of plane wave background, we construct breather-positon solutions. For the coupled Hirota equation, the breather-positon solutions are reported for the first time in the literature. From the breather-positon solutions, we bring out certain interesting collision dynamics between breather-positons and positons.

Submitted 8 January, 2024; originally announced January 2024.

Comments: 33 pages, 12 figures

Journal ref: Nonlinear Dynamics, 111, 21877-21894, 2023

arXiv:2401.02677 [pdf, other]

Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

Authors: Yatharth Gupta, Vishnu V. Jaddipal, Harish Prabhala, Sayak Paul, Patrick Von Platen

Abstract: Stable Diffusion XL (SDXL) has become the best open source text-to-image model (T2I) for its versatility and top-notch image quality. Efficiently addressing the computational demands of SDXL models is crucial for wider reach and applicability. In this work, we introduce two scaled-down variants, Segmind Stable Diffusion (SSD-1B) and Segmind-Vega, with 1.3B and 0.74B parameter UNets, respectively, achieved through progressive removal using layer-level losses focusing on reducing the model size while preserving generative quality. We release these model weights at https://hf.co/Segmind. Our methodology involves the elimination of residual networks and transformer blocks from the U-Net structure of SDXL, resulting in significant reductions in parameters and latency. Our compact models effectively emulate the original SDXL by capitalizing on transferred knowledge, achieving competitive results against larger multi-billion parameter SDXL. Our work underscores the efficacy of knowledge distillation coupled with layer-level losses in reducing model size while preserving the high-quality generative capabilities of SDXL, thus facilitating more accessible deployment in resource-constrained environments.

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2312.13122 [pdf, other]

doi 10.1016/j.physd.2024.134170

Screwon spectral statistics and dispersion relation in the quantum Rajeev-Ranken model

Authors: Govind S. Krishnaswami, T. R. Vishnu

Abstract: The Rajeev-Ranken (RR) model is a Hamiltonian system describing screw-type nonlinear waves (screwons) of wavenumber $k$ in a scalar field theory pseudodual to the 1+1D SU(2) principal chiral model. Classically, the RR model based on a quadratic Hamiltonian on a nilpotent/Euclidean Poisson algebra is Liouville integrable. Upon adopting canonical variables in a slightly extended phase space, the model was interpreted as a novel 3D cylindrically symmetric quartic oscillator with a rotational energy. Here, we examine the spectral statistics and dispersion relation of quantized screwons via numerical diagonalization validated by variational and perturbative approximations. We also derive a semiclassical estimate for the cumulative level distribution which compares favorably with the one from numerical diagonalization. The spectrum shows level crossings typical of an integrable system. The $i^{\rm th}$ unfolded nearest neighbor spacings are found to follow Poisson statistics for small $i$. Nonoverlapping spacing ratios also indicate that successive spectral gaps are independently distributed. After displaying universal linear behavior over energy windows of short lengths, the spectral rigidity saturates at a length and value that scales with the square-root of energy. For strong coupling $λ$ and intermediate $k$, we argue that reduced screwon energies can depend only on the product $λk$. Numerically, we find power law dependences on $λ$ and $k$ with an approximately common exponent $2/3$ provided the angular momentum quantum number $l$ is small compared to the number of nodes $n$ in the radial wavefunction. On the other hand, for the ground state $n = l = 0$, the common exponent becomes 1.

Submitted 26 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 17 pages, 21 figure files, Discussion section expanded

Journal ref: Physica D, 463 (2024) 134170

arXiv:2312.11682 [pdf, other]

doi 10.1109/ACCESS.2022.3190418

Joint Phase-Time Arrays: A Paradigm for Frequency-Dependent Analog Beamforming in 6G

Authors: Vishnu V. Ratnam, Jianhua Mo, Ahmad AlAmmouri, Boon L. Ng, Jianzhong Zhang, Andreas F. Molisch

Abstract: Hybrid beamforming is an attractive solution to build cost-effective and energy-efficient transceivers for millimeter-wave and terahertz systems. However, conventional hybrid beamforming techniques rely on analog components that generate a frequency-flat response, such as phase-shifters and switches, which limits the flexibility of the achievable beam patterns. As a novel alternative, this paper proposes a new class of hybrid beamforming called joint phase-time arrays (JPTA), which additionally use true-time delay elements in the analog beamforming to create frequency-dependent analog beams. Using as an example two important frequency-dependent beam behaviors, the numerous benefits of such flexibility are exemplified. Subsequently, the JPTA beamformer design problem to generate any desired beam behavior is formulated and near-optimal algorithms to the problem are proposed. Simulations show that the proposed algorithms can outperform heuristic solutions for JPTA beamformer update. Furthermore, it is shown that JPTA can achieve the two exemplified beam behaviors with one radio-frequency chain, while conventional hybrid beamforming requires the radio-frequency chains to scale with the number of antennas to achieve similar performance. Finally, a wide range of problems to further tap into the potential of JPTA are also listed as future directions.

Submitted 18 December, 2023; originally announced December 2023.

Comments: The paper is a revised version of the IEEE Access paper, that includes the full operation of Algorithms 1-3 to help curtail incorrect implementations

Journal ref: IEEE Access, vol. 10, pp. 73364-73377, 2022

arXiv:2312.08453 [pdf, other]

Integrating Particle Flavor into Deep Learning Models for Hadronization

Authors: Jay Chan, Xiangyang Ju, Adam Kania, Benjamin Nachman, Vishnu Sangli, Andrzej Siodmok

Abstract: Hadronization models used in event generators are physics-inspired functions with many tunable parameters. Since we do not understand hadronization from first principles, there have been multiple proposals to improve the accuracy of hadronization models by utilizing more flexible parameterizations based on neural networks. These recent proposals have focused on the kinematic properties of hadrons, but a full model must also include particle flavor. In this paper, we show how to build a deep learning-based hadronization model that includes both kinematic (continuous) and flavor (discrete) degrees of freedom. Our approach is based on Generative Adversarial Networks and we show the performance within the context of the cluster hadronization model within the Herwig event generator.

Submitted 13 December, 2023; originally announced December 2023.

Comments: 9 pages, 4 figures

arXiv:2312.08442 [pdf, other]

Learning holographic horizons

Authors: Vishnu Jejjala, Sukrut Mondkar, Ayan Mukhopadhyay, Rishi Raj

Abstract: We apply machine learning to understand fundamental aspects of holographic duality, specifically the entropies obtained from the apparent and event horizon areas. We show that simple features of only the time series of the pressure anisotropy, namely the values and half-widths of the maxima and minima, the times these are attained, and the times of the first zeroes can predict the areas of the apparent and event horizons in the dual bulk geometry at all times with a fixed maximum length (30) of the input vector. Given that simple Vaidya-type metrics constructed just from the apparent and event horizon areas can be used to approximately obtain unequal time correlation functions, we argue that the corresponding entropy functions are the measures of information that need to be extracted from simple one-point functions to reconstruct specific aspects of correlation functions of the dual state with the best possible approximations.

Submitted 3 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

Comments: 10+10 pages, 1 Figure; Discussion improved, k-fold cross validation added

arXiv:2312.07098 [pdf, ps, other]

On near orthogonality of certain $k$-vectors involving generalized Ramanujan sums

Authors: Neha Elizabeth Thomas, K Vishnu Namboothiri

Abstract: The near orthogonality of certain $k$-vectors involving the Ramanujan sums was studied by E. Alkan in [J. Number Theory, 140:147--168 (2014)]. Here we undertake the study of similar vectors involving a generalization of the Ramanujan sums defined by E. Cohen in [Duke Math. J., 16(2):85--90 (1949)]. We also prove that the weighted average $\frac{1}{k^{s(r+1)}}\sum \limits_{j=1}^{k^s}j^rc_k^{(s)}(j)$ remains positive for all $r\geq 1$. Further, we give a lower bound for $\max\limits_{N}\left|\sum \limits_{j=1}^{N^s}c_k^{(s)}(j) \right|$.

Submitted 12 December, 2023; originally announced December 2023.

MSC Class: 11L03; 11N37; 11N64

arXiv:2312.05938 [pdf, ps, other]

A reciprocity theorem for the Cohen-Ramanujan sums and its application to Cohen-Ramanujan expansions in the second variable

Authors: K Vishnu Namboothiri, Vinod Sivadasan

Abstract: For an arithmetical function $f$, its Ramanujan expansion is a series expansion in the form $f(n)=\sum\limits_{k=1}^{\infty}a(k) c_k(n)$ where $a(k)$ are complex numbers and $c_k(n):= \sum\limits_{\substack{m=1\\(m, k)=1}}^{k}e^{\frac{2πimn}{k}}$ is the Ramanujan sum. Here we prove a reciprocity result on Cohen-Ramanujan sums $c_k^s(n) :=\sum\limits_{\substack{{h=1}\\(h,k^s)_s=1}}^{k^s}e^{\frac{2πi n h}{k^s}}$ to change the position of $k$ and $n$ in a twisted function and use it to prove that for certain arithmetical functions $f$, Cohen-Ramanujan series expansions in the form $\sum\limits_{k=1}^{\infty}a(k) c_k^{(s)}(n)$ exist if and only if expansions in the form $\sum\limits_{k=1}^{\infty}b(k/n) c_n^{(s)}(k)$ exist.

Submitted 10 December, 2023; originally announced December 2023.

MSC Class: 11A25; 11L03

arXiv:2312.05936 [pdf, ps, other]

On an identity of Delange and its application to Cohen-Ramanujan expansions

Authors: Vinod Sivadasan, K Vishnu Namboothiri

Abstract: Srinivasa Ramanujan provided Fourier series expansions of certain arithmetical functions in terms of the exponential sum defined by $c_q(n)=\sum\limits_{\substack{{m=1}\\(m,q)=1}}^{q}e^{\frac{2 πimn}{q}}$. Later, H. Delange derived the bound $\sum\limits_{q|k}|c_q(n)|\leq n\, 2^{ω(k)}$ and gave a sufficient condition for such expansions to exist. A. Grytczuk gave an exact value for this bound, and derived a converse implication of the absolute convergence stated by H. Delange. We here show that these results have natural generalizations in terms of the Cohen-Ramanujan sum $c_q^{(s)}(n)$ defined by E. Cohen in [\emph{Duke Mathematical Journal, 16(85-90):2, 1949}]. We derive a bound as well as exact value for $\sum\limits_{q|k}|c_q^{(s)}(n)|$ and provide a sufficient condition for the Cohen-Ramanujan expansions to exist.

Submitted 10 December, 2023; originally announced December 2023.

MSC Class: 11A25; 11L03

arXiv:2312.05299 [pdf, other]

Learning to be Simple

Authors: Yang-Hui He, Vishnu Jejjala, Challenger Mishra, Em Sharnoff

Abstract: In this work we employ machine learning to understand structured mathematical data involving finite groups and derive a theorem about necessary properties of generators of finite simple groups. We create a database of all 2-generated subgroups of the symmetric group on n-objects and conduct a classification of finite simple groups among them using shallow feed-forward neural networks. We show that this neural network classifier can decipher the property of simplicity with varying accuracies depending on the features. Our neural network model leads to a natural conjecture concerning the generators of a finite simple group. We subsequently prove this conjecture. This new toy theorem comments on the necessary properties of generators of finite simple groups. We show this explicitly for a class of sporadic groups for which the result holds. Our work further makes the case for a machine motivated study of algebraic structures in pure mathematics and highlights the possibility of generating new conjectures and theorems in mathematics with the aid of machine learning.

Submitted 8 December, 2023; originally announced December 2023.

Comments: 25 pages, 6 figures and 5 tables

arXiv:2312.01874 [pdf, ps, other]

Fair Division via Quantile Shares

Authors: Yakov Babichenko, Michal Feldman, Ron Holzman, Vishnu V. Narayan

Abstract: We consider the problem of fair division, where a set of indivisible goods should be distributed fairly among a set of agents with combinatorial valuations. To capture fairness, we adopt the notion of shares, where each agent is entitled to a fair share, based on some fairness criterion, and an allocation is considered fair if the value of every agent (weakly) exceeds her fair share. A share-based notion is considered universally feasible if it admits a fair allocation for every profile of monotone valuations. A major question arises: is there a non-trivial share-based notion that is universally feasible? The most well-known share-based notions, namely proportionality and maximin share, are not universally feasible, nor are any constant approximations of them. We propose a novel share notion, where an agent assesses the fairness of a bundle by comparing it to her valuation in a random allocation. In this framework, a bundle is considered $q$-quantile fair, for $q\in[0,1]$, if it is at least as good as a bundle obtained in a uniformly random allocation with probability at least $q$. Our main question is whether there exists a constant value of $q$ for which the $q$-quantile share is universally feasible. Our main result establishes a strong connection between the feasibility of quantile shares and the classical Erdős Matching Conjecture. Specifically, we show that if a version of this conjecture is true, then the $\frac{1}{2e}$-quantile share is universally feasible. Furthermore, we provide unconditional feasibility results for additive, unit-demand and matroid-rank valuations for constant values of $q$. Finally, we discuss the implications of our results for other share notions.

Submitted 4 December, 2023; originally announced December 2023.

Comments: 23 pages, no figures

arXiv:2311.17713 [pdf, other]

Ab-initio tensile tests applied to BCC refractory alloys

Authors: Vishnu Raghuraman, Saro San, Michael C. Gao, Michael Widom

Abstract: Refractory metals exhibit high strength at high temperature, but often lack ductility. Multiprinciple element alloys such as high entropy alloys offer the potential to improve ductility while maintaining strength, but we don't know a priori what compositions will be suitable. A number of measures have been proposed to predict the ductility of metals, notably the Pugh ratio and the Rice-Thomson D-parameter, among others. Here we examine direct ab initio simulation of deformation under tensile strain, and we apply this to a variety of Nb- and Mo-based binary alloys and to several quaternary alloy systems. Our results exhibit peak stresses for elastic deformation, beyond which defects such as lattice slip, stacking faults, transformation, and twinning relieve the stress. The peak stress grows strongly with increasing valence electron count. Correlations are examined among several physical properties, including the above-mentioned ductility parameters.

Submitted 5 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

Comments: 22 pages, 13 figures, accepted for publication in Physical Review Materials

arXiv:2311.16484 [pdf, other]

Eye vs. AI: Human Gaze and Model Attention in Video Memorability

Authors: Prajneya Kumar, Eshika Khandelwal, Makarand Tapaswi, Vishnu Sreekumar

Abstract: Understanding the factors that determine video memorability has important applications in areas such as educational technology and advertising. Towards this goal, we investigate the semantic and temporal attention mechanisms underlying video memorability. We propose a Transformer-based model with spatio-temporal attention that matches SoTA performance on video memorability prediction on a large naturalistic video dataset. More importantly, the self-attention patterns show us where the model looks to predict memorability. We compare model attention against human gaze fixation density maps collected through a small-scale eye-tracking experiment where humans perform a video memory task. Quantitative saliency metrics show that the model attention and human gaze follow similar patterns. Furthermore, while panoptic segmentation confirms that the model and humans attend more to thing classes, stuff classes that receive increased/decreased attention tend to have higher memorability scores. We also observe that the model assigns greater importance to the initial frames, mimicking temporal attention patterns found in humans.

Submitted 26 November, 2023; originally announced November 2023.

arXiv:2311.12251 [pdf, other]

Strongly Coupled Two-scale System with Nonlinear Dispersion: Weak Solvability and Numerical Simulation

Authors: Vishnu Raveendran, Surendra Nepal, Rainey Lyons, Michael Eden, Adrian Muntean

Abstract: We investigate a two-scale system featuring an upscaled parabolic dispersion-reaction equation intimately linked to a family of elliptic cell problems. The system is strongly coupled through a dispersion tensor, which depends on the solutions to the cell problems, and via the cell problems themselves, where the solution of the parabolic problem interacts nonlinearly with the drift term. This particular mathematical structure is motivated by a rigorously derived upscaled reaction-diffusion-convection model that describes the evolution of a population of interacting particles pushed by a large drift through an array of periodically placed obstacles (i.e., through a regular porous medium). We prove the existence and uniqueness of weak solutions to our system by means of an iterative scheme, where particular care is needed to ensure the uniform positivity of the dispersion tensor. Additionally, we use finite element-based approximations for the same iteration scheme to perform multiple simulation studies. Finally, we highlight how the choice of micro-geometry (building the regular porous medium) and of the nonlinear drift coupling affects the macroscopic dispersion of particles.

Submitted 13 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

MSC Class: 35G55; 35A01; 35M30; 47J25; 65M60

arXiv:2311.12134 [pdf, other]

First principles residual resistivity using locally self-consistent multiple scattering method

Authors: Vishnu Raghuraman, Markus Eisenbach, Michael Widom, Yang Wang

Abstract: The locally self-consistent multiple scattering (LSMS) method can perform efficient first-principles calculations of systems with a large number of atoms. In this work, we combine the Kubo-Greenwood equation with LSMS, enabling us to calculate the first-principles residual resistivity of large systems. This has been implemented in the open-source code lsms. We apply this method to selected pure elements and binary random alloys. The results compare well with experiment, and with values obtained from a first-principles effective medium technique (KKR-CPA). We discuss future applications of this method to complex systems where other methods are not applicable.

Submitted 20 November, 2023; originally announced November 2023.

Comments: 16 pages, 5 figures

arXiv:2311.08123 [pdf, other]

Memory-efficient Stochastic methods for Memory-based Transformers

Authors: Vishwajit Kumar Vishnu, C. Chandra Sekhar

Abstract: Training memory-based transformers can require a large amount of memory and can be quite inefficient. We propose a novel two-phase training mechanism and a novel regularization technique to improve the training efficiency of memory-based transformers, which are often used for long-range context problems. For our experiments, we consider Transformer-XL, one of the memory-based transformer models, as our baseline. We show that our resultant model, Skip Cross-head TransformerXL, outperforms the baseline on a character-level language modeling task with a similar number of parameters and outperforms the baseline on a word-level language modeling task with almost 20% fewer parameters. Our proposed methods do not require any additional memory. We also demonstrate the effectiveness of our regularization mechanism on BERT, which shows similar performance with a reduction in the standard deviation of scores of around 30% on multiple GLUE tasks.

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2311.07695 [pdf, other]

Co-Buchi Barrier Certificates for Discrete-time Dynamical Systems

Authors: Vishnu Murali, Ashutosh Trivedi, Majid Zamani

Abstract: Barrier certificates provide functional overapproximations for the reachable set of dynamical systems and provide inductive guarantees on the safe evolution of the system. Formally a barrier certificate is a real-valued function over the state set that is required to be non-positive for the initial states, positive over the set of unsafe states and nonincreasing along the state transitions. These conditions together provide an inductive argument that the system will not reach an unsafe state even once as the barrier certificate remains non-positive for all reachable states. In the automata-theoretic approach to verification, a key query is to determine whether the system visits a given predicate over the states finitely often, typically resulting from the complement of the traditional Buchi acceptance condition. This paper proposes a barrier certificate approach to answer such queries by developing a notion of co-Buchi barrier certificates (CBBCs) that generalize classic barrier certificates to ensure that the traces of a system visit a given predicate a fixed number of times. Our notion of CBBC is inspired from bounded synthesis paradigm to LTL realizability, where the LTL specifications are converted to safety automata via universal co-Buchi automata with a bound on final state visitations provided as a hyperparameter. Our application of CBBCs in verification is analogous in nature: we fix a bound and search for a suitable barrier certificate, increasing the bound if no suitable function can be found. We then use these CBBCs to verify our system against properties specified by co-Buchi automata and demonstrate their effectiveness via some case studies. We also show that the present approach strictly generalizes performant barrier certificate based approaches that rely on cutting the paths of the automata that start from an initial state and reach some accepting states.
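The three conditions that this abstract states for a classic barrier certificate can be checked directly on a small finite-state system. The sketch below does only that; it is not the co-Buchi construction of the paper, and the function name `is_barrier_certificate`, the deterministic `step` map, and the toy system are all hypothetical choices made for illustration.

```python
def is_barrier_certificate(B, states, initial, unsafe, step):
    """Check the classic barrier-certificate conditions on a finite system.

    B       -- candidate certificate, a real-valued function on states
    states  -- all states of the (finite, deterministic) system
    initial -- initial states, where B must be non-positive
    unsafe  -- unsafe states, where B must be positive
    step    -- transition function mapping a state to its successor
    """
    if any(B(x) > 0 for x in initial):
        return False  # violated: B <= 0 on initial states
    if any(B(x) <= 0 for x in unsafe):
        return False  # violated: B > 0 on unsafe states
    # B must not increase along any transition.
    return all(B(step(x)) <= B(x) for x in states)


# Toy system (hypothetical): a counter on 0..10 that decays toward 0.
states = range(11)
initial = [0, 1, 2]
unsafe = [8, 9, 10]
print(is_barrier_certificate(lambda x: x - 5, states, initial, unsafe,
                             lambda x: max(x - 1, 0)))  # True
```

If the counter instead moved upward, for example with `step = lambda x: min(x + 1, 10)`, the same candidate `B` would fail the nonincreasing condition, which matches the intuition that the system could then drift into the unsafe set.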

Submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.03449 [pdf, other]

Into the LAIONs Den: Investigating Hate in Multimodal Datasets

Authors: Abeba Birhane, Vinay Prabhu, Sang Han, Vishnu Naresh Boddeti, Alexandra Sasha Luccioni

Abstract: 'Scale the model, scale the data, scale the compute' is the reigning sentiment in the world of generative AI today. While the impact of model scaling has been extensively studied, we are only beginning to scratch the surface of data scaling and its consequences. This is especially of critical importance in the context of vision-language datasets such as LAION. These datasets are continually growing in size and are built based on large-scale internet dumps such as the Common Crawl, which is known to have numerous drawbacks ranging from quality, legality, and content. The datasets then serve as the backbone for large generative models, contributing to the operationalization and perpetuation of harmful societal and historical biases and stereotypes. In this paper, we investigate the effect of scaling datasets on hateful content through a comparative audit of two datasets: LAION-400M and LAION-2B. Our results show that hate content increased by nearly 12% with dataset scale, measured both qualitatively and quantitatively using a metric that we term as Hate Content Rate (HCR). We also found that filtering dataset contents based on Not Safe For Work (NSFW) values calculated based on images alone does not exclude all the harmful content in alt-text. Instead, we found that trace amounts of hateful, targeted, and aggressive text remain even when carrying out conservative filtering. We end with a reflection and a discussion of the significance of our results for dataset curation and usage in the AI community. Code and the meta-data assets curated in this paper are publicly available at https://github.com/vinayprabhu/hate_scaling. Content warning: This paper contains examples of hateful text that might be disturbing, distressing, and/or offensive.

Submitted 6 November, 2023; originally announced November 2023.

Comments: To appear at 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Datasets and Benchmarks Track. arXiv admin note: substantial text overlap with arXiv:2306.13141

arXiv:2311.03363 [pdf, ps, other]

The Path to a Modular and Standards-based Digital Health Ecosystem

Authors: Paul Schmiedmayer, Vishnu Ravi, Oliver Aalami

Abstract: Software engineering for digital health applications entails several challenges, including heterogeneous data acquisition, data standardization, software reuse, security, and privacy considerations. We explore these challenges and how our Stanford Spezi ecosystem addresses these challenges by providing a modular and standards-based open-source digital health ecosystem. Spezi enables developers to select and integrate modules according to their needs and facilitates an open-source community to democratize access to building digital health innovations.

Submitted 28 September, 2023; originally announced November 2023.

Comments: IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI'23) - Workshop - Unraveling Challenges in Time Series Analysis with Open Source Tools for Digital Health Applications

arXiv:2311.01279 [pdf, other]

doi 10.1145/3583740.3626819

ExPECA: An Experimental Platform for Trustworthy Edge Computing Applications

Authors: Samie Mostafavi, Vishnu Narayanan Moothedath, Stefan Rönngren, Neelabhro Roy, Gourav Prateek Sharma, Sangwon Seo, Manuel Olguín Muñoz, James Gross

Abstract: This paper presents ExPECA, an edge computing and wireless communication research testbed designed to tackle two pressing challenges: comprehensive end-to-end experimentation and high levels of experimental reproducibility. Leveraging OpenStack-based Chameleon Infrastructure (CHI) framework for its proven flexibility and ease of operation, ExPECA is located in a unique, isolated underground facility, providing a highly controlled setting for wireless experiments. The testbed is engineered to facilitate integrated studies of both communication and computation, offering a diverse array of Software-Defined Radios (SDR) and Commercial Off-The-Shelf (COTS) wireless and wired links, as well as containerized computational environments. We exemplify the experimental possibilities of the testbed using OpenRTiST, a latency-sensitive, bandwidth-intensive application, and analyze its performance. Lastly, we highlight an array of research domains and experimental setups that stand to gain from ExPECA's features, including closed-loop applications and time-sensitive networking.

Submitted 2 November, 2023; originally announced November 2023.

arXiv:2311.00421 [pdf, ps, other]

Near-IR Spectral Observations of the Didymos System -- Daily Evolution Before and After the DART Impact, Indicates Dimorphos Originated from Didymos

Authors: David Polishook, Francesca E. DeMeo, Brian J. Burt, Cristina A. Thomas, Andrew S. Rivkin, Juan A. Sanchez, Vishnu Reddy

Abstract: Ejecta from Dimorphos following the DART mission impact significantly increased the brightness of the Didymos-Dimorphos system, allowing us to examine sub-surface material. We report daily near-IR spectroscopic observations of the Didymos system using NASA's IRTF that follow the evolution of the spectral signature of the ejecta cloud over one week, from one day before the impact. Overall, the spectral features remained fixed (S-type classification) while the ejecta dissipated, confirming both Didymos and Dimorphos are constructed from the same silicate material. This novel result strongly supports binary asteroid formation models that include breaking up of a single body, due to rotational breakup of km-wide bodies. At impact time +14 and +38 hours, the spectral slope decreased, but following nights presented increasing spectral slope that almost returned to the pre-impact slope. However, the parameters of the $1~μm$ band remained fixed, and no "fresh" / Q-type-like spectrum was measured. We interpret these as follows: (1) the ejecta cloud is the main contributor ($60-70\%$) to the overall light during the $\sim40$ hours after impact; (2) coarser debris ($\geq 100~μm$) dominated the ejecta cloud, decreasing the spectral slope (after radiation pressure removed the fine grains at $\leq10$ hours after impact); (3) after approximately one week, the ejecta cloud dispersed enough to make the fine grains on Didymos' surface the dominating part of the light, increasing the spectral slope to the pre-impact level; (4) a negligible amount of non-weathered material was ejected from Dimorphos' sub-surface, suggesting Dimorphos was accumulated from weathered material ejected from Didymos' surface.

Submitted 1 November, 2023; originally announced November 2023.

Comments: 10 pages, 8 figures, accepted for publication in PSJ

arXiv:2311.00226 [pdf, other]

Transformers are Provably Optimal In-context Estimators for Wireless Communications

Authors: Vishnu Teja Kunde, Vicram Rajagopalan, Chandra Shekhara Kaushik Valmeekam, Krishna Narayanan, Srinivas Shakkottai, Dileep Kalathil, Jean-Francois Chamberland

Abstract: Pre-trained transformers exhibit the capability of adapting to new tasks through in-context learning (ICL), where they efficiently utilize a limited set of prompts without explicit model optimization. The canonical communication problem of estimating transmitted symbols from received observations can be modelled as an in-context learning problem: received observations are essentially a noisy function of transmitted symbols, and this function can be represented by an unknown parameter whose statistics depend on an (also unknown) latent context. This problem, which we term in-context estimation (ICE), has significantly greater complexity than the extensively studied linear regression problem. The optimal solution to the ICE problem is a non-linear function of the underlying context. In this paper, we prove that, for a subclass of such problems, a single-layer softmax attention transformer (SAT) computes the optimal solution of the above estimation problem in the limit of large prompt length. We also prove that the optimal configuration of such a transformer is indeed the minimizer of the corresponding training loss. Further, we empirically demonstrate the proficiency of multi-layer transformers in efficiently solving broader in-context estimation problems. Through extensive simulations, we show that solving ICE problems using transformers significantly outperforms standard approaches. Moreover, with just a few context examples, it achieves the same performance as an estimator with perfect knowledge of the latent context.

Submitted 14 June, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

Comments: 13 pages, 2 figures, 2 tables, preprint; abstract, references, theory updated

arXiv:2310.19934 [pdf, other]

Spectroscopic Links Among Giant Planet Irregular Satellites and Trojans

Authors: Benjamin N. L. Sharkey, Vishnu Reddy, Olga Kuhn, Juan A. Sanchez, William F. Bottke

Abstract: We collect near-infrared spectra ($\sim0.75-2.55\ μm$) of four Jovian irregular satellites and visible spectra ($\sim0.32-1.00\ μm$) of two Jovian irregular satellites, two Uranian irregular satellites, and four Neptune Trojans. We find close similarities between observed Jovian irregular satellites and previously characterized Jovian Trojans. However, irregular satellites' unique collisional histories complicate comparisons to other groups. Laboratory studies of CM and CI chondrites show that grain size and regolith packing conditions strongly affect spectra of dark, carbonaceous materials. We hypothesize that different activity histories of these objects, which may have originally contained volatile ices that subsequently sublimated, could cause differences in regolith grain-size or packing properties and therefore drive spectral variation. The Uranian satellites Sycorax and Caliban appear similar to TNOs. However, we detect a feature near 0.7 $μm$ on Sycorax, suggesting the presence of hydrated materials. While the sample of Neptune Trojans has more neutral spectra than the Uranian satellites we observe, they remain consistent with the broad color distribution of the Kuiper belt. We detect a possible feature near 0.65-0.70 $μm$ on Neptune Trojan 2006 RJ103, suggesting that hydrated material may also be present in this population. Characterizing hydrated materials in the outer solar system may provide critical context regarding the origins of hydrated CI and CM chondrite meteorites. We discuss how the hydration state(s) of the irregular satellites constrains the thermal histories of the interiors of their parent bodies, which may have formed among the primordial Kuiper belt.

Submitted 30 October, 2023; originally announced October 2023.

Comments: 4 Tables, 8 Figures. Accepted to PSJ

arXiv:2310.16152 [pdf, other]

FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering

Authors: Md Rafi Ur Rashid, Vishnu Asutosh Dasu, Kang Gu, Najrin Sultana, Shagufta Mehnaz

Abstract: Federated learning (FL) has become a key component in various language modeling applications such as machine translation, next-word prediction, and medical record analysis. These applications are trained on datasets from many FL participants that often include privacy-sensitive data, such as healthcare records, phone/credit card numbers, login credentials, etc. Although FL enables computation without necessitating clients to share their raw data, determining the extent of privacy leakage in federated language models is challenging and not straightforward. Moreover, existing attacks aim to extract data regardless of how sensitive or naive it is. To fill this research gap, we introduce two novel findings with regard to leaking privacy-sensitive user data from federated large language models. Firstly, we make a key observation that model snapshots from the intermediate rounds in FL can cause greater privacy leakage than the final trained model. Secondly, we identify that privacy leakage can be aggravated by tampering with a model's selective weights that are specifically responsible for memorizing the sensitive training data. We show how a malicious client can leak the privacy-sensitive data of some other users in FL even without any cooperation from the server. Our best-performing method improves the membership inference recall by 29% and achieves up to 71% private data reconstruction, evidently outperforming existing attacks with stronger assumptions of adversary capabilities.

Submitted 25 May, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

Comments: 20 pages (including bibliography and Appendix), Submitted to ACM CCS '24

arXiv:2310.14482 [pdf, other]

An Inexact Frank-Wolfe Algorithm for Composite Convex Optimization Involving a Self-Concordant Function

Authors: Nimita Shinde, Vishnu Narayanan, James Saunderson

Abstract: In this paper, we consider Frank-Wolfe-based algorithms for composite convex optimization problems with objective involving a logarithmically-homogeneous, self-concordant function. Recent Frank-Wolfe-based methods for this class of problems assume an oracle that returns exact solutions of a linearized subproblem. We relax this assumption and propose a variant of the Frank-Wolfe method with inexact oracle for this class of problems. We show that our inexact variant enjoys similar convergence guarantees to the exact case, while allowing considerably more flexibility in approximately solving the linearized subproblem. In particular, our approach can be applied if the subproblem can be solved to prespecified additive error or to prespecified relative error (even though the optimal value of the subproblem may not be uniformly bounded). Furthermore, our approach can also handle the situation where the subproblem is solved via a randomized algorithm that fails with positive probability. Our inexact oracle model is motivated by certain large-scale semidefinite programs where the subproblem reduces to computing an extreme eigenvalue-eigenvector pair, and we demonstrate the practical performance of our algorithm with numerical experiments on problems of this form.

Submitted 22 October, 2023; originally announced October 2023.

arXiv:2310.12502 [pdf, ps, other]

Comparison of empirical and particle force-based density segregation models

Authors: Soniya Kumawat, Vishnu Kumar Sahu, Anurag Tripathi

Abstract: The empirical and particle force-based models of granular segregation due to density differences among the species are compared in this work. The dependence of the empirical segregation parameters on the initial configuration, the observation time duration, inclination angle, and mixture composition is discussed in detail. The parameters obtained from empirical models are used to predict the steady-state concentration profiles for different density ratios and compositions. In addition, we utilize the predictions from the particle force-based segregation model and compare them with the predictions of the empirical segregation models. Our results show that the linear empirical segregation model predictions agree well with the simulation results for mixtures rich in light species, whereas the quadratic empirical segregation model works better for mixtures rich in heavy species. The particle force-based segregation model, on the other hand, seems to be in very good agreement with the DEM simulation data across all mixture compositions.

Submitted 19 October, 2023; originally announced October 2023.

Comments: 20 pages, 15 figures

arXiv:2310.08012 [pdf, other]

AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE

Authors: Wei Ao, Vishnu Naresh Boddeti

Abstract: Secure inference of deep convolutional neural networks (CNNs) under RNS-CKKS involves polynomial approximation of unsupported non-linear activation functions. However, existing approaches have three main limitations: 1) Inflexibility: The polynomial approximation and associated homomorphic evaluation architecture are customized manually for each CNN architecture and do not generalize to other networks. 2) Suboptimal Approximation: Each activation function is approximated instead of the function represented by the CNN. 3) Restricted Design: Either high-degree or low-degree polynomial approximations are used. The former retains high accuracy but slows down inference due to bootstrapping operations, while the latter accelerates ciphertext inference but compromises accuracy. To address these limitations, we present AutoFHE, which automatically adapts standard CNNs for secure inference under RNS-CKKS. The key idea is to adopt layerwise mixed-degree polynomial activation functions, which are optimized jointly with the homomorphic evaluation architecture in terms of the placement of bootstrapping operations. The problem is modeled within a multi-objective optimization framework to maximize accuracy and minimize the number of bootstrapping operations. AutoFHE can be applied flexibly on any CNN architecture, and it provides diverse solutions that span the trade-off between accuracy and latency. Experimental evaluation over RNS-CKKS encrypted CIFAR datasets shows that AutoFHE accelerates secure inference by $1.32\times$ to $1.8\times$ compared to methods employing high-degree polynomials. It also improves accuracy by up to 2.56% compared to methods using low-degree polynomials. Lastly, AutoFHE accelerates inference and improves accuracy by $103\times$ and 3.46%, respectively, compared to CNNs under TFHE.

Submitted 11 October, 2023; originally announced October 2023.

Comments: USENIX Security Symposium 2024

arXiv:2310.08004 [pdf, other]

On the Rational Degree of Boolean Functions and Applications

Authors: Vishnu Iyer, Siddhartha Jain, Matt Kovacs-Deak, Vinayak M. Kumar, Luke Schaeffer, Daochen Wang, Michael Whitmeyer

Abstract: We study a natural complexity measure of Boolean functions known as the (exact) rational degree. For total functions $f$, it is conjectured that $\mathrm{rdeg}(f)$ is polynomially related to $\mathrm{deg}(f)$, where $\mathrm{deg}(f)$ is the Fourier degree. Towards this conjecture, we show that symmetric functions have rational degree at least $\mathrm{deg}(f)/2$ and monotone functions have rational degree at least $\sqrt{\mathrm{deg}(f)}$. We observe that both of these lower bounds are tight. In addition, we show that all read-once depth-$d$ Boolean formulae have rational degree at least $Ω(\mathrm{deg}(f)^{1/d})$. Furthermore, we show that almost every Boolean function on $n$ variables has rational degree at least $n/2 - O(\sqrt{n})$. In contrast to total functions, we exhibit partial functions that witness unbounded separations between rational and approximate degree, in both directions. As a consequence, we show that for quantum computers, post-selection and bounded-error are incomparable resources in the black-box model.

Submitted 11 October, 2023; originally announced October 2023.

Comments: 17 pages, 3 figures

arXiv:2310.07048 [pdf, other]

FedMFS: Federated Multimodal Fusion Learning with Selective Modality Communication

Authors: Liangqi Yuan, Dong-Jun Han, Vishnu Pandi Chellapandi, Stanislaw H. Żak, Christopher G. Brinton

Abstract: Multimodal federated learning (FL) aims to enrich model training in FL settings where devices are collecting measurements across multiple modalities (e.g., sensors measuring pressure, motion, and other types of data). However, key challenges to multimodal FL remain unaddressed, particularly in heterogeneous network settings: (i) the set of modalities collected by each device will be diverse, and (ii) communication limitations prevent devices from uploading all their locally trained modality models to the server. In this paper, we propose Federated Multimodal Fusion learning with Selective modality communication (FedMFS), a new multimodal fusion FL methodology that can tackle the above-mentioned challenges. The key idea is the introduction of a modality selection criterion for each device, which weighs (i) the impact of the modality, gauged by Shapley value analysis, against (ii) the modality model size as a gauge for communication overhead. This enables FedMFS to flexibly balance performance against communication costs, depending on resource constraints and application requirements. Experiments on the real-world ActionSense dataset demonstrate the ability of FedMFS to achieve comparable accuracy to several baselines while reducing the communication overhead by over 4x.

Submitted 12 February, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

Comments: ICC 2024

arXiv:2310.07021 [pdf, other]

Pre-Trained Masked Image Model for Mobile Robot Navigation

Authors: Vishnu Dutt Sharma, Anukriti Singh, Pratap Tokekar

Abstract: 2D top-down maps are commonly used for the navigation and exploration of mobile robots through unknown areas. Typically, the robot builds the navigation maps incrementally from local observations using onboard sensors. Recent works have shown that predicting the structural patterns in the environment through learning-based approaches can greatly enhance task efficiency. While many such works build task-specific networks using limited datasets, we show that the existing foundational vision networks can accomplish the same without any fine-tuning. Specifically, we use Masked Autoencoders, pre-trained on street images, to present novel applications for field-of-view expansion, single-agent topological exploration, and multi-agent exploration for indoor mapping, across different input modalities. Our work motivates the use of foundational vision models for generalized structure prediction-driven applications, especially in the dearth of training data. For more qualitative results see https://raaslab.org/projects/MIM4Robots.

Submitted 25 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

Comments: Accepted at ICRA 2024
