Search | arXiv e-print repository

doi 10.1007/978-3-031-26354-5_1

Quantifying the Effect of Image Similarity on Diabetic Foot Ulcer Classification

Authors: Imran Chowdhury Dipto, Bill Cassidy, Connah Kendrick, Neil D. Reeves, Joseph M. Pappachan, Vishnu Chandrabalan, Moi Hoon Yap

Abstract: This research conducts an investigation on the effect of visually similar images within a publicly available diabetic foot ulcer dataset when training deep learning classification networks. The presence of binary-identical duplicate images in datasets used to train deep learning algorithms is a well known issue that can introduce unwanted bias which can degrade network performance. However, the ef… ▽ More This research conducts an investigation on the effect of visually similar images within a publicly available diabetic foot ulcer dataset when training deep learning classification networks. The presence of binary-identical duplicate images in datasets used to train deep learning algorithms is a well known issue that can introduce unwanted bias which can degrade network performance. However, the effect of visually similar non-identical images is an under-researched topic, and has so far not been investigated in any diabetic foot ulcer studies. We use an open-source fuzzy algorithm to identify groups of increasingly similar images in the Diabetic Foot Ulcers Challenge 2021 (DFUC2021) training dataset. Based on each similarity threshold, we create new training sets that we use to train a range of deep learning multi-class classifiers. We then evaluate the performance of the best performing model on the DFUC2021 test set. Our findings show that the model trained on the training set with the 80\% similarity threshold images removed achieved the best performance using the InceptionResNetV2 network. This model showed improvements in F1-score, precision, and recall of 0.023, 0.029, and 0.013, respectively. These results indicate that highly similar images can contribute towards the presence of performance degrading bias within the Diabetic Foot Ulcers Challenge 2021 dataset, and that the removal of images that are 80\% similar from the training set can help to boost classification performance. △ Less

Submitted 25 April, 2023; originally announced April 2023.

arXiv:2304.12657 [pdf, other]

Vertical convection regimes in a two-dimensional rectangular cavity: Prandtl and aspect ratio dependance

Authors: Arman Khoubani, Ashwin Vishnu Mohanan, Pierre Augier, Jan-Bert Flór

Abstract: Vertical convection is the fluid motion that is induced by the heating and cooling of two opposed vertical boundaries of a rectangular cavity (see e.g. Wang et al. 2021). We consider the linear stability of the steady two-dimensional flow reached at Rayleigh numbers of O($10^8$). As a function of the Prandtl number, $Pr$, and the height-to-width aspect ratio of the domain, $A$, the base flow of… ▽ More Vertical convection is the fluid motion that is induced by the heating and cooling of two opposed vertical boundaries of a rectangular cavity (see e.g. Wang et al. 2021). We consider the linear stability of the steady two-dimensional flow reached at Rayleigh numbers of O($10^8$). As a function of the Prandtl number, $Pr$, and the height-to-width aspect ratio of the domain, $A$, the base flow of each case is computed numerically and linear simulations are used to obtain the properties of the leading linear instability mode. Flow regimes depend on the presence of a circulation in the entire cavity, detachment of the thermal layer from the boundary or the corner regions, and on the oscillation frequency relative to the natural frequency of oscillation in the stably temperature-stratified interior, allowing for the presence of internal waves or not. Accordingly the regime is called slow or fast, respectively. Either the global circulation or internal waves in the interior may couple the top and bottom buoyancy currents, while their absence implies asymmetry in their perturbation amplitude. Six flow regimes are found in the range of $0.1 \leq Pr \leq 4$ and $0.5 \leq A \leq 2$. For $Pr \lessapprox 0.4 $ and $A>1$ the base flow is driven by a large circulation in the entire cavity. For $Pr \gtrapprox 0.7$ the thermal boundary layers are thin and the instability is driven by the motion along the wall and the detached boundary layer. A transition between these regimes is marked by a dramatic change in oscillation frequency at $Pr = 0.55 \pm0.15$ and $A <2$. △ Less

Submitted 13 December, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: Accepted to the Journal for Fluid Mechanics

arXiv:2304.12001 [pdf, other]

doi 10.1007/978-3-031-26354-5_10

Diabetic Foot Ulcer Grand Challenge 2022 Summary

Authors: Connah Kendrick, Bill Cassidy, Neil D. Reeves, Joseph M. Pappachan, Claire O'Shea, Vishnu Chandrabalan, Moi Hoon Yap

Abstract: The Diabetic Foot Ulcer Challenge 2022 focused on the task of diabetic foot ulcer segmentation, based on the work completed in previous DFU challenges. The challenge provided 4000 images of full-view foot ulcer images together with corresponding delineation of ulcer regions. This paper provides an overview of the challenge, a summary of the methods proposed by the challenge participants, the resul… ▽ More The Diabetic Foot Ulcer Challenge 2022 focused on the task of diabetic foot ulcer segmentation, based on the work completed in previous DFU challenges. The challenge provided 4000 images of full-view foot ulcer images together with corresponding delineation of ulcer regions. This paper provides an overview of the challenge, a summary of the methods proposed by the challenge participants, the results obtained from each technique, and a comparison of the challenge results. The best-performing network was a modified HarDNet-MSEG, with a Dice score of 0.7287. △ Less

Submitted 24 April, 2023; originally announced April 2023.

arXiv:2304.11763 [pdf, other]

The Case for Hierarchical Deep Learning Inference at the Network Edge

Authors: Ghina Al-Atat, Andrea Fresa, Adarsh Prasad Behera, Vishnu Narayanan Moothedath, James Gross, Jaya Prakash Champati

Abstract: Resource-constrained Edge Devices (EDs), e.g., IoT sensors and microcontroller units, are expected to make intelligent decisions using Deep Learning (DL) inference at the edge of the network. Toward this end, there is a significant research effort in develo** tinyML models - Deep Learning (DL) models with reduced computation and memory storage requirements - that can be embedded on these devices… ▽ More Resource-constrained Edge Devices (EDs), e.g., IoT sensors and microcontroller units, are expected to make intelligent decisions using Deep Learning (DL) inference at the edge of the network. Toward this end, there is a significant research effort in develo** tinyML models - Deep Learning (DL) models with reduced computation and memory storage requirements - that can be embedded on these devices. However, tinyML models have lower inference accuracy. On a different front, DNN partitioning and inference offloading techniques were studied for distributed DL inference between EDs and Edge Servers (ESs). In this paper, we explore Hierarchical Inference (HI), a novel approach proposed by Vishnu et al. 2023, arXiv:2304.00891v1 , for performing distributed DL inference at the edge. Under HI, for each data sample, an ED first uses a local algorithm (e.g., a tinyML model) for inference. Depending on the application, if the inference provided by the local algorithm is incorrect or further assistance is required from large DL models on edge or cloud, only then the ED offloads the data sample. At the outset, HI seems infeasible as the ED, in general, cannot know if the local inference is sufficient or not. Nevertheless, we present the feasibility of implementing HI for machine fault detection and image classification applications. We demonstrate its benefits using quantitative analysis and argue that using HI will result in low latency, bandwidth savings, and energy savings in edge AI systems. △ Less

Submitted 23 April, 2023; originally announced April 2023.

Comments: This paper consists of 9 pages, with 6 tables and 8 figures

arXiv:2304.11465 [pdf, other]

Pred-NBV: Prediction-guided Next-Best-View for 3D Object Reconstruction

Authors: Harnaik Dhami, Vishnu D. Sharma, Pratap Tokekar

Abstract: Prediction-based active perception has shown the potential to improve the navigation efficiency and safety of the robot by anticipating the uncertainty in the unknown environment. The existing works for 3D shape prediction make an implicit assumption about the partial observations and therefore cannot be used for real-world planning and do not consider the control effort for next-best-view plannin… ▽ More Prediction-based active perception has shown the potential to improve the navigation efficiency and safety of the robot by anticipating the uncertainty in the unknown environment. The existing works for 3D shape prediction make an implicit assumption about the partial observations and therefore cannot be used for real-world planning and do not consider the control effort for next-best-view planning. We present Pred-NBV, a realistic object shape reconstruction method consisting of PoinTr-C, an enhanced 3D prediction model trained on the ShapeNet dataset, and an information and control effort-based next-best-view method to address these issues. Pred-NBV shows an improvement of 25.46% in object coverage over the traditional methods in the AirSim simulator, and performs better shape completion than PoinTr, the state-of-the-art shape completion model, even on real data obtained from a Velodyne 3D LiDAR mounted on DJI M600 Pro. △ Less

Submitted 7 August, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

Comments: 6 pages, 4 figures, 2 tables. Accepted to IROS 2023

arXiv:2304.09617 [pdf, other]

Towards Autonomous Selective Harvesting: A Review of Robot Perception, Robot Design, Motion Planning and Control

Authors: Vishnu Rajendran S, Bappaditya Debnath, Bappaditya Debnath, Sariah Mghames, Willow Mandil, Soran Parsa, Simon Parsons, Amir Ghalamzan-E

Abstract: This paper provides an overview of the current state-of-the-art in selective harvesting robots (SHRs) and their potential for addressing the challenges of global food production. SHRs have the potential to increase productivity, reduce labour costs, and minimise food waste by selectively harvesting only ripe fruits and vegetables. The paper discusses the main components of SHRs, including percepti… ▽ More This paper provides an overview of the current state-of-the-art in selective harvesting robots (SHRs) and their potential for addressing the challenges of global food production. SHRs have the potential to increase productivity, reduce labour costs, and minimise food waste by selectively harvesting only ripe fruits and vegetables. The paper discusses the main components of SHRs, including perception, gras**, cutting, motion planning, and control. It also highlights the challenges in develo** SHR technologies, particularly in the areas of robot design, motion planning and control. The paper also discusses the potential benefits of integrating AI and soft robots and data-driven methods to enhance the performance and robustness of SHR systems. Finally, the paper identifies several open research questions in the field and highlights the need for further research and development efforts to advance SHR technologies to meet the challenges of global food production. Overall, this paper provides a starting point for researchers and practitioners interested in develo** SHRs and highlights the need for more research in this field. △ Less

Submitted 19 April, 2023; originally announced April 2023.

Comments: Preprint: to be appeared in Journal of Field Robotics

arXiv:2304.06856 [pdf, other]

Application of the Bell polynomials for the solution of some differential-algebraic equations

Authors: Hari Mohan Srivastava, Giriraj Methi, Anil Kumar, Mohammad Izadi, Vishnu Narayan Mishra, Brahim Benhammouda

Abstract: The differential transform method is used to find numerical approximation of solution to a class of certain nonlinear differential algebraic equations. The method is based on Taylor's theorem. Coefficients of the Taylor series are determined by constructing a recurrence relation. To deal with nonlinearity of the problems, the Faà di Bruno's formula containing the partial ordinary Bell polynomials… ▽ More The differential transform method is used to find numerical approximation of solution to a class of certain nonlinear differential algebraic equations. The method is based on Taylor's theorem. Coefficients of the Taylor series are determined by constructing a recurrence relation. To deal with nonlinearity of the problems, the Faà di Bruno's formula containing the partial ordinary Bell polynomials is applied within the differential transform to avoid computation of symbolic derivatives. The error estimation results are presented too. Four concrete problems are studied to show efficiency and reliability of the method. The obtained results are compared to other methods. △ Less

Submitted 13 April, 2023; originally announced April 2023.

arXiv:2304.06177 [pdf, other]

Visual based Tomato Size Measurement System for an Indoor Farming Environment

Authors: Andy Kweon, Vishnu Hu, Jong Yoon Lim, Trevor Gee, Edmond Liu, Henry Williams, Bruce A. MacDonald, Mahla Nejati, Inkyu Sa, Ho Seok Ahn

Abstract: As technology progresses, smart automated systems will serve an increasingly important role in the agricultural industry. Current existing vision systems for yield estimation face difficulties in occlusion and scalability as they utilize a camera system that is large and expensive, which are unsuitable for orchard environments. To overcome these problems, this paper presents a size measurement met… ▽ More As technology progresses, smart automated systems will serve an increasingly important role in the agricultural industry. Current existing vision systems for yield estimation face difficulties in occlusion and scalability as they utilize a camera system that is large and expensive, which are unsuitable for orchard environments. To overcome these problems, this paper presents a size measurement method combining a machine learning model and depth images captured from three low cost RGBD cameras to detect and measure the height and width of tomatoes. The performance of the presented system is evaluated on a lab environment with real tomato fruits and fake leaves to simulate occlusion in the real farm environment. To improve accuracy by addressing fruit occlusion, our three-camera system was able to achieve a height measurement accuracy of 0.9114 and a width accuracy of 0.9443. △ Less

Submitted 12 April, 2023; originally announced April 2023.

Comments: 10 Pages, 12 Figures

arXiv:2304.05621 [pdf, other]

doi 10.1063/5.0153797

Hydrodynamic aggregation of membrane inclusions due to non-Newtonian surface rheology

Authors: Vishnu Vig, Harishankar Manikantan

Abstract: Biological membranes are self-assembled complex fluid interfaces that host proteins, molecular motors and other macromolecules essential for cellular function. These membranes have a distinct in-plane fluid response with a surface viscosity that has been well characterized. The resulting quasi-2D fluid dynamical problem describes the motion of embedded proteins or particles. However, the viscous r… ▽ More Biological membranes are self-assembled complex fluid interfaces that host proteins, molecular motors and other macromolecules essential for cellular function. These membranes have a distinct in-plane fluid response with a surface viscosity that has been well characterized. The resulting quasi-2D fluid dynamical problem describes the motion of embedded proteins or particles. However, the viscous response of biological membranes is often non-Newtonian: in particular, the surface shear viscosity of phospholipids that comprise the membrane depends strongly on the surface pressure. We use the Lorentz reciprocal theorem to extract the effective long-ranged hydrodynamic interaction among membrane inclusions that arises due to such non-trivial rheology. We show that the corrective force that emerges ties back to the interplay between membrane flow and non-constant viscosity, which suggests a mechanism for biologically favorable protein aggregation within membranes. We quantify and describe the mechanism for such a large-scale concentration instability using a mean-field model. Finally, we employ numerical simulations to demonstrate the formation of hexatic crystals due to the effective hydrodynamic interactions within the membrane. △ Less

Submitted 12 April, 2023; originally announced April 2023.

arXiv:2304.03706 [pdf, other]

Breaking the Envy Cycle: Best-of-Both-Worlds Guarantees for Subadditive Valuations

Authors: Michal Feldman, Simon Mauras, Vishnu V. Narayan, Tomasz Ponitka

Abstract: We study best-of-both-worlds guarantees for the fair division of indivisible items among agents with subadditive valuations. Our main result establishes the existence of a random allocation that is simultaneously ex-ante $\frac{1}{2}$-envy-free, ex-post $\frac{1}{2}$-EFX and ex-post EF1, for every instance with subadditive valuations. We achieve this result by a novel polynomial-time algorithm tha… ▽ More We study best-of-both-worlds guarantees for the fair division of indivisible items among agents with subadditive valuations. Our main result establishes the existence of a random allocation that is simultaneously ex-ante $\frac{1}{2}$-envy-free, ex-post $\frac{1}{2}$-EFX and ex-post EF1, for every instance with subadditive valuations. We achieve this result by a novel polynomial-time algorithm that randomizes the well-established envy cycles procedure in a way that provides ex-ante fairness. Notably, this is the first best-of-both-worlds fairness guarantee for subadditive valuations, even when considering only EF1 without EFX. △ Less

Submitted 7 April, 2023; originally announced April 2023.

Comments: 33 pages, 4 figures

arXiv:2304.01693 [pdf, other]

Performance of 802.11be Wi-Fi 7 with Multi-Link Operation on AR Applications

Authors: Molham Alsakati, Charlie Pettersson, Sebastian Max, Vishnu Narayanan Moothedath, James Gross

Abstract: Since its first release in the late 1990s, Wi-Fi has been updated to keep up with evolving user needs. Recently, Wi-Fi and other radio access technologies have been pushed to their edge when serving Augmented Reality (AR) applications. AR applications require high throughput, low latency, and high reliability to ensure a high-quality user experience. The 802.11be amendment, which will be marketed… ▽ More Since its first release in the late 1990s, Wi-Fi has been updated to keep up with evolving user needs. Recently, Wi-Fi and other radio access technologies have been pushed to their edge when serving Augmented Reality (AR) applications. AR applications require high throughput, low latency, and high reliability to ensure a high-quality user experience. The 802.11be amendment, which will be marketed as Wi-Fi 7, introduces several features that aim to enhance its capabilities to support challenging applications like AR. One of the main features introduced in this amendment is Multi-Link Operation (MLO) which allows nodes to transmit and receive over multiple links concurrently. When using MLO, traffic is distributed among links using an implementation-specific traffic-to-link allocation policy. This paper aims to evaluate the performance of MLO, using different policies, in serving AR applications compared to Single-Link (SL). Experimental simulations using an event-based Wi-Fi simulator have been conducted. Our results show the general superiority of MLO when serving AR applications. MLO achieves lower latency and serves a higher number of AR users compared to SL with the same frequency resources. In addition, increasing the number of links can improve the performance of MLO. Regarding traffic-to-link allocation policies, we found that policies can be more susceptible to channel blocking, resulting in possible performance degradation. △ Less

Submitted 4 April, 2023; originally announced April 2023.

arXiv:2304.00891 [pdf, ps, other]

Online Algorithms for Hierarchical Inference in Deep Learning applications at the Edge

Authors: Vishnu Narayanan Moothedath, Jaya Prakash Champati, James Gross

Abstract: We consider a resource-constrained Edge Device (ED), such as an IoT sensor or a microcontroller unit, embedded with a small-size ML model (S-ML) for a generic classification application and an Edge Server (ES) that hosts a large-size ML model (L-ML). Since the inference accuracy of S-ML is lower than that of the L-ML, offloading all the data samples to the ES results in high inference accuracy, bu… ▽ More We consider a resource-constrained Edge Device (ED), such as an IoT sensor or a microcontroller unit, embedded with a small-size ML model (S-ML) for a generic classification application and an Edge Server (ES) that hosts a large-size ML model (L-ML). Since the inference accuracy of S-ML is lower than that of the L-ML, offloading all the data samples to the ES results in high inference accuracy, but it defeats the purpose of embedding S-ML on the ED and deprives the benefits of reduced latency, bandwidth savings, and energy efficiency of doing local inference. In order to get the best out of both worlds, i.e., the benefits of doing inference on the ED and the benefits of doing inference on ES, we explore the idea of Hierarchical Inference (HI), wherein S-ML inference is only accepted when it is correct, otherwise the data sample is offloaded for L-ML inference. However, the ideal implementation of HI is infeasible as the correctness of the S-ML inference is not known to the ED. We propose an online meta-learning framework that the ED can use to predict the correctness of the S-ML inference. In particular, we propose to use the maximum softmax value output by S-ML for a data sample and decide whether to offload it or not. The resulting online learning problem turns out to be a Prediction with Expert Advice (PEA) problem with continuous expert space. We propose two different algorithms and prove sublinear regret bounds for them without any assumption on the smoothness of the loss function. We evaluate and benchmark the performance of the proposed algorithms for image classification application using four datasets, namely, Imagenette and Imagewoof, MNIST, and CIFAR-10. △ Less

Submitted 15 February, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

Comments: The original version was submitted to a journal and was later revised. The updated version was accepted in a journal and will be published soon. The 'Journal reference' will be updated as and when the information is available

arXiv:2303.17355 [pdf, other]

Acoustic Soft Tactile Skin (AST Skin)

Authors: Vishnu Rajendran S, Willow Mandil, Simon Parsons, Amir Ghalamzan E

Abstract: This paper presents a novel soft tactile skin (STS) technology operating with sound waves. In this innovative approach, the sound waves generated by a speaker travel in channels embedded in a soft membrane and get modulated due to a deformation of the channel when pressed by an external force and received by a microphone at the end of the channel. The sensor leverages regression and classification… ▽ More This paper presents a novel soft tactile skin (STS) technology operating with sound waves. In this innovative approach, the sound waves generated by a speaker travel in channels embedded in a soft membrane and get modulated due to a deformation of the channel when pressed by an external force and received by a microphone at the end of the channel. The sensor leverages regression and classification methods for estimating the normal force and its contact location. Our sensor can be affixed to any robot part, e.g., end effectors or arm. We tested several regression and classifier methods to learn the relation between sound wave modulation, the applied force, and its location, respectively and picked the best-performing models for force and location predictions. Our novel tactile sensor yields 93% of the force estimation within 1.5 N tolerances for a range of 0-30+1 N and estimates contact locations with over 96% accuracy. We also demonstrated the performance of STS technology for a real-time grip** force control application. △ Less

Submitted 29 February, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

Comments: IEEE International Conference on Robotics and Automation (ICRA) 2024 (accepted)

arXiv:2303.14451 [pdf, other]

Near Equilibrium Constraints on Bulk Viscous Models in $f(R,T)=R+2λT$ Gravity

Authors: Vishnu A Pai, Titus K Mathew

Abstract: Recent studies indicate that, near equilibrium condition could not be maintained for bulk viscous matter models during the accelerated expansion of the universe in the context of Einstein's gravity, without including the cosmological constant. But from our investigation in $f(R,T)$ gravity, it is observed that, this condition can be satisfied in this modified gravity regime by properly constrainin… ▽ More Recent studies indicate that, near equilibrium condition could not be maintained for bulk viscous matter models during the accelerated expansion of the universe in the context of Einstein's gravity, without including the cosmological constant. But from our investigation in $f(R,T)$ gravity, it is observed that, this condition can be satisfied in this modified gravity regime by properly constraining the coupling and viscous parameters. Accordingly, strict constraints are developed for free parameters in bulk viscous models in $f(R,T)=R+2λT$ gravity based on fulfillment near equilibrium condition. Then, for assessing the validity of NEC during different stages of evolution, two cosmological models are studied for each case based on the developed constraints. Initially, the data analysis of the models is performed using the Observational Hubble Data (OHD) and then later, model showing the best result is analyzed using combined OHD+SNe Ia data sets. From the obtained best fit values of model parameters, inferences are made regarding the possibilities of achieving recent acceleration for viscous models in $R+2λT$ gravity while simultaneously satisfying the required conditions both in the presence and absence of cosmological constant. △ Less

Submitted 25 March, 2023; originally announced March 2023.

Comments: 13 pages and 6 figures

arXiv:2303.10853 [pdf, ps, other]

On an exponential power sum

Authors: Neha Elizabeth Thomas, K Vishnu Namboothiri

Abstract: Using combinatorial techniques, we derive a recurrence identity that expresses an exponential power sum with negative powers in terms of another exponential power sum with positive powers. Consequently, we derive a formula for the power sum of the first $k$ natural numbers when the power is odd, which when used in combination with Faulhaber's formula for computing power sums helps us to retrieve t… ▽ More Using combinatorial techniques, we derive a recurrence identity that expresses an exponential power sum with negative powers in terms of another exponential power sum with positive powers. Consequently, we derive a formula for the power sum of the first $k$ natural numbers when the power is odd, which when used in combination with Faulhaber's formula for computing power sums helps us to retrieve the Bernoulli numbers in certain cases. △ Less

Submitted 21 November, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

MSC Class: 05A18; 11B37; 11B75; 11L03

arXiv:2303.10695 [pdf, other]

On the Convergence of Decentralized Federated Learning Under Imperfect Information Sharing

Authors: Vishnu Pandi Chellapandi, Antesh Upadhyay, Abolfazl Hashemi, Stanislaw H /. Zak

Abstract: Decentralized learning and optimization is a central problem in control that encompasses several existing and emerging applications, such as federated learning. While there exists a vast literature on this topic and most methods centered around the celebrated average-consensus paradigm, less attention has been devoted to scenarios where the communication between the agents may be imperfect. To thi… ▽ More Decentralized learning and optimization is a central problem in control that encompasses several existing and emerging applications, such as federated learning. While there exists a vast literature on this topic and most methods centered around the celebrated average-consensus paradigm, less attention has been devoted to scenarios where the communication between the agents may be imperfect. To this end, this paper presents three different algorithms of Decentralized Federated Learning (DFL) in the presence of imperfect information sharing modeled as noisy communication channels. The first algorithm, Federated Noisy Decentralized Learning (FedNDL1), comes from the literature, where the noise is added to their parameters to simulate the scenario of the presence of noisy communication channels. This algorithm shares parameters to form a consensus with the clients based on a communication graph topology through a noisy communication channel. The proposed second algorithm (FedNDL2) is similar to the first algorithm but with added noise to the parameters, and it performs the gossip averaging before the gradient optimization. The proposed third algorithm (FedNDL3), on the other hand, shares the gradients through noisy communication channels instead of the parameters. Theoretical and experimental results demonstrate that under imperfect information sharing, the third scheme that mixes gradients is more robust in the presence of a noisy channel compared with the algorithms from the literature that mix the parameters. △ Less

Submitted 19 March, 2023; originally announced March 2023.

Comments: 24 pages, 2 figures

arXiv:2303.10677 [pdf, other]

A Survey of Federated Learning for Connected and Automated Vehicles

Authors: Vishnu Pandi Chellapandi, Liangqi Yuan, Stanislaw H /. Zak, Ziran Wang

Abstract: Connected and Automated Vehicles (CAVs) are one of the emerging technologies in the automotive domain that has the potential to alleviate the issues of accidents, traffic congestion, and pollutant emissions, leading to a safe, efficient, and sustainable transportation system. Machine learning-based methods are widely used in CAVs for crucial tasks like perception, motion planning, and motion contr… ▽ More Connected and Automated Vehicles (CAVs) are one of the emerging technologies in the automotive domain that has the potential to alleviate the issues of accidents, traffic congestion, and pollutant emissions, leading to a safe, efficient, and sustainable transportation system. Machine learning-based methods are widely used in CAVs for crucial tasks like perception, motion planning, and motion control, where machine learning models in CAVs are solely trained using the local vehicle data, and the performance is not certain when exposed to new environments or unseen conditions. Federated learning (FL) is an effective solution for CAVs that enables a collaborative model development with multiple vehicles in a distributed learning framework. FL enables CAVs to learn from a wide range of driving environments and improve their overall performance while ensuring the privacy and security of local vehicle data. In this paper, we review the progress accomplished by researchers in applying FL to CAVs. A broader view of the various data modalities and algorithms that have been implemented on CAVs is provided. Specific applications of FL are reviewed in detail, and an analysis of the challenges and future scope of research are presented. △ Less

Submitted 19 March, 2023; originally announced March 2023.

Comments: 8 pages, 1 figure

arXiv:2303.09363 [pdf, ps, other]

Some asymptotic formulae involving Cohen-Ramanujan expansions

Authors: Arya Chandran, K Vishnu Namboothiri

Abstract: Some necessary and sufficient conditions for the existence of Cohen-Ramanujan expansions for arithmetical functions were provided by these authors in [\textit{arXive preprint arXive:2205.08466}, 2022]. Given two arithmetical functions $f$ and $g$ with absolutely convergent Cohen-Ramanujan expansions, we derive an asymptotic formula for $\sum_{n\leq N}f(n)g(n+h)$ where $h$ is a fixed positive integ… ▽ More Some necessary and sufficient conditions for the existence of Cohen-Ramanujan expansions for arithmetical functions were provided by these authors in [\textit{arXive preprint arXive:2205.08466}, 2022]. Given two arithmetical functions $f$ and $g$ with absolutely convergent Cohen-Ramanujan expansions, we derive an asymptotic formula for $\sum_{n\leq N}f(n)g(n+h)$ where $h$ is a fixed positive integer. We also provide Cohen-Ramanujan expansions for certain functions to illustrate some of the results we prove consequently. △ Less

Submitted 1 January, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

MSC Class: 11A25; 11L03; 11N05; 11N37

arXiv:2303.09231 [pdf, other]

doi 10.1093/mnras/stad1900

The MPIfR-MeerKAT Galactic Plane survey I -- System setup and early results

Authors: P. V. Padmanabh, E. D. Barr, S. S. Sridhar, M. R. Rugel, A. Damas-Segovia, A. M. Jacob, V. Balakrishnan, M. Berezina, M. C. i Bernadich, A. Brunthaler, D. J. Champion, P. C. C. Freire, S. Khan, H. -R. Klöckner, M. Kramer, Y. K. Ma, S. A. Mao, Y. P. Men, K. M. Menten, S. Sengupta, V. Venkatraman Krishnan, O. Wucknitz, F. Wyrowski, M. C. Bezuidenhout, S. Buchner , et al. (8 additional authors not shown)

Abstract: Galactic plane radio surveys play a key role in improving our understanding of a wide range of astrophysical phenomena. Performing such a survey using the latest interferometric telescopes produces large data rates necessitating a shift towards fully or quasi-real-time data analysis with data being stored for only the time required to process them. We present here the overview and setup for the 30… ▽ More Galactic plane radio surveys play a key role in improving our understanding of a wide range of astrophysical phenomena. Performing such a survey using the latest interferometric telescopes produces large data rates necessitating a shift towards fully or quasi-real-time data analysis with data being stored for only the time required to process them. We present here the overview and setup for the 3000 hour Max-Planck-Institut fuer Radioastronomie (MPIfR) MeerKAT Galactic Plane survey (MMGPS). The survey is unique by operating in a commensal mode, addressing key science objectives of the survey including the discovery of new pulsars and transients as well as studies of Galactic magnetism, the interstellar medium and star formation rates. We explain the strategy coupled with the necessary hardware and software infrastructure needed for data reduction in the imaging, spectral and time domains. We have so far discovered 78 new pulsars including 17 confirmed binary systems of which two are potential double neutron star systems. We have also developed an imaging pipeline sensitive to the order of a few tens of micro-Jansky with a spatial resolution of a few arcseconds. Further science operations with an in-house built S-Band receiver operating between 1.7-3.5 GHz are about to commence. Early spectral line commissioning observations conducted at S-Band, targeting transitions of the key molecular gas tracer CH at 3.3 GHz already illustrate the spectroscopic capabilities of this instrument. These results lay a strong foundation for future surveys with telescopes like the Square Kilometre Array (SKA). △ Less

Submitted 21 June, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

Comments: 25 pages, 10 figures, Accepted in MNRAS

arXiv:2303.07476 [pdf, other]

Challenges and Practices of Deep Learning Model Reengineering: A Case Study on Computer Vision

Authors: Wenxin Jiang, Vishnu Banna, Naveen Vivek, Abhinav Goel, Nicholas Synovic, George K. Thiruvathukal, James C. Davis

Abstract: Many engineering organizations are reimplementing and extending deep neural networks from the research community. We describe this process as deep learning model reengineering. Deep learning model reengineering - reusing, reproducing, adapting, and enhancing state-of-the-art deep learning approaches - is challenging for reasons including under-documented reference models, changing requirements, an… ▽ More Many engineering organizations are reimplementing and extending deep neural networks from the research community. We describe this process as deep learning model reengineering. Deep learning model reengineering - reusing, reproducing, adapting, and enhancing state-of-the-art deep learning approaches - is challenging for reasons including under-documented reference models, changing requirements, and the cost of implementation and testing. In addition, individual engineers may lack expertise in software engineering, yet teams must apply knowledge of software engineering and deep learning to succeed. Prior work has examined on DL systems from a "product" view, examining defects from projects regardless of the engineers' purpose. Our study is focused on reengineering activities from a "process" view, and focuses on engineers specifically engaged in the reengineering process. Our goal is to understand the characteristics and challenges of deep learning model reengineering. We conducted a case study of this phenomenon, focusing on the context of computer vision. Our results draw from two data sources: defects reported in open-source reeengineering projects, and interviews conducted with open-source project contributors and the leaders of a reengineering team. Our results describe how deep learning-based computer vision techniques are reengineered, analyze the distribution of defects in this process, and discuss challenges and practices. Integrating our quantitative and qualitative data, we proposed a novel reengineering workflow. Our findings inform several future directions, including: measuring additional unknown aspects of model reengineering; standardizing engineering practices to facilitate reengineering; and develo** tools to support model reengineering and model reuse. △ Less

Submitted 25 August, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

Comments: Under submission to EMSE

arXiv:2303.07147 [pdf, other]

doi 10.1093/pnasnexus/pgad289

Enhanced Vibrational Stability in Glass Droplets

Authors: Surajit Chakraborty, Vishnu V. Krishnan, Kabir Ramola, Smarajit Karmakar

Abstract: We show through simulations of amorphous solids prepared in open boundary conditions that they possess significantly fewer low-frequency vibrational modes compared to their periodic boundary counterparts. Specifically, using measurements of the vibrational density of states, we find that the $D(ω) \sim ω^4$ law changes to $D(ω) \sim ω^δ$ with $δ\approx 5$ in two dimensions and $δ\approx 4.5$ in th… ▽ More We show through simulations of amorphous solids prepared in open boundary conditions that they possess significantly fewer low-frequency vibrational modes compared to their periodic boundary counterparts. Specifically, using measurements of the vibrational density of states, we find that the $D(ω) \sim ω^4$ law changes to $D(ω) \sim ω^δ$ with $δ\approx 5$ in two dimensions and $δ\approx 4.5$ in three dimensions. Crucially, this enhanced stability is achieved when utilizing slow annealing protocols to generate solid configurations. We perform an anharmonic analysis of the minima corresponding to the lowest-frequency modes in such open-boundary systems and discuss their correlation with the density of states. A study of various system sizes further reveals that small systems display a higher degree of localization in vibrations. Lastly, we confine open-boundary solids in order to introduce macroscopic stresses in the system which are absent in the unconfined system, and find that the $D(ω) \sim ω^4$ behavior is recovered. △ Less

Submitted 10 October, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

Comments: 12 pages, 11 figures

Journal ref: PNAS Nexus, Volume 2, Issue 9, September 2023, pgad289

arXiv:2303.05393 [pdf, other]

Deep Functional Predictive Control for Strawberry Cluster Manipulation using Tactile Prediction

Authors: Kiyanoush Nazari, Gabriele Gandolfi, Zeynab Talebpour, Vishnu Rajendran, Paolo Rocco, Amir Ghalamzan E.

Abstract: This paper introduces a novel approach to address the problem of Physical Robot Interaction (PRI) during robot pushing tasks. The approach uses a data-driven forward model based on tactile predictions to inform the controller about potential future movements of the object being pushed, such as a strawberry stem, using a robot tactile finger. The model is integrated into a Deep Functional Predictiv… ▽ More This paper introduces a novel approach to address the problem of Physical Robot Interaction (PRI) during robot pushing tasks. The approach uses a data-driven forward model based on tactile predictions to inform the controller about potential future movements of the object being pushed, such as a strawberry stem, using a robot tactile finger. The model is integrated into a Deep Functional Predictive Control (d-FPC) system to control the displacement of the stem on the tactile finger during pushes. Pushing an object with a robot finger along a desired trajectory in 3D is a highly nonlinear and complex physical robot interaction, especially when the object is not stably grasped. The proposed approach controls the stem movements on the tactile finger in a prediction horizon. The effectiveness of the proposed FPC is demonstrated in a series of tests involving a real robot pushing a strawberry in a cluster. The results indicate that the d-FPC controller can successfully control PRI in robotic manipulation tasks beyond the handling of strawberries. The proposed approach offers a promising direction for addressing the challenging PRI problem in robotic manipulation tasks. Future work will explore the generalisation of the approach to other objects and tasks. △ Less

Submitted 9 March, 2023; originally announced March 2023.

Comments: Submitted to IEEE IROS 2023

arXiv:2303.03480 [pdf, other]

doi 10.1109/LRA.2023.3346800

Can an Embodied Agent Find Your "Cat-shaped Mug"? LLM-Guided Exploration for Zero-Shot Object Navigation

Authors: Vishnu Sashank Dorbala, James F. Mullen Jr., Dinesh Manocha

Abstract: We present LGX (Language-guided Exploration), a novel algorithm for Language-Driven Zero-Shot Object Goal Navigation (L-ZSON), where an embodied agent navigates to a uniquely described target object in a previously unseen environment. Our approach makes use of Large Language Models (LLMs) for this task by leveraging the LLM's commonsense reasoning capabilities for making sequential navigational de… ▽ More We present LGX (Language-guided Exploration), a novel algorithm for Language-Driven Zero-Shot Object Goal Navigation (L-ZSON), where an embodied agent navigates to a uniquely described target object in a previously unseen environment. Our approach makes use of Large Language Models (LLMs) for this task by leveraging the LLM's commonsense reasoning capabilities for making sequential navigational decisions. Simultaneously, we perform generalized target object detection using a pre-trained Vision-Language grounding model. We achieve state-of-the-art zero-shot object navigation results on RoboTHOR with a success rate (SR) improvement of over 27% over the current baseline of the OWL-ViT CLIP on Wheels (OWL CoW). Furthermore, we study the usage of LLMs for robot navigation and present an analysis of various prompting strategies affecting the model output. Finally, we showcase the benefits of our approach via \textit{real-world} experiments that indicate the superior performance of LGX in detecting and navigating to visually unique objects. △ Less

Submitted 5 November, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

Comments: 10 pages

Journal ref: IEEE Robotics and Automation Letters 9.5 (2024) 4083-4090

arXiv:2303.03286 [pdf]

Giant electromechanical response from defective non-ferroelectric epitaxial BaTiO3 integrated on Si 100

Authors: Sandeep Vura, Shubham Kumar Parate, Subhajit Pal, Upanya Khandelwal, Rajeev Kumar Rai, Sri Harsha Molleti, Vishnu Kumar, Rama Satya Sandilya Ventrapragada, Girish Patil, Mudit Jain, Ambresh Mallya, Majid Ahmadi, Bart Kooi, Sushobhan Avasthi, Rajeev Ranjan, Srinivasan Raghavan, Saurabh Chandorkar, Pavan Nukala

Abstract: Lead free, silicon compatible materials showing large electromechanical responses comparable to, or better than conventional relaxor ferroelectrics, are desirable for various nanoelectromechanical devices and applications. Defect-engineered electrostriction has recently been gaining popularity to obtain enhanced electromechanical responses at sub 100 Hz frequencies. Here, we report record values o… ▽ More Lead free, silicon compatible materials showing large electromechanical responses comparable to, or better than conventional relaxor ferroelectrics, are desirable for various nanoelectromechanical devices and applications. Defect-engineered electrostriction has recently been gaining popularity to obtain enhanced electromechanical responses at sub 100 Hz frequencies. Here, we report record values of electrostrictive strain coefficients (M31) at frequencies as large as 5 kHz (1.04 x 10-14 m2 per V2 at 1 kHz, and 3.87 x 10-15 m2 per V2 at 5 kHz) using A-site and oxygen-deficient barium titanate thin-films, epitaxially integrated onto Si. The effect is robust and retained even after cycling the devices >5000 times. Our perovskite films are non-ferroelectric, exhibit a different symmetry compared to stoichiometric BaTiO3 and are characterized by twin boundaries and nano polar-like regions. We show that the dielectric relaxation arising from the defect-induced features correlates very well with the observed giant electrostrictive response. These films show large coefficient of thermal expansion (2.36 x 10-5/K), which along with the giant M31 implies a considerable increase in the lattice anharmonicity induced by the defects. Our work provides a crucial step forward towards formulating guidelines to engineer large electromechanical responses even at higher frequencies in lead-free thin films. △ Less

Submitted 6 March, 2023; originally announced March 2023.

Comments: 26 pages, 4 figures, 8 supplementary figures

arXiv:2303.01693 [pdf, other]

doi 10.1109/ICRA48891.2023.10160662

Cross-domain Transfer Learning and State Inference for Soft Robots via a Semi-supervised Sequential Variational Bayes Framework

Authors: Shageenderan Sapai, Junn Yong Loo, Ze Yang Ding, Chee Pin Tan, Raphael CW Phan, Vishnu Monn Baskaran, Surya Girinatha Nurzaman

Abstract: Recently, data-driven models such as deep neural networks have shown to be promising tools for modelling and state inference in soft robots. However, voluminous amounts of data are necessary for deep models to perform effectively, which requires exhaustive and quality data collection, particularly of state labels. Consequently, obtaining labelled state data for soft robotic systems is challenged f… ▽ More Recently, data-driven models such as deep neural networks have shown to be promising tools for modelling and state inference in soft robots. However, voluminous amounts of data are necessary for deep models to perform effectively, which requires exhaustive and quality data collection, particularly of state labels. Consequently, obtaining labelled state data for soft robotic systems is challenged for various reasons, including difficulty in the sensorization of soft robots and the inconvenience of collecting data in unstructured environments. To address this challenge, in this paper, we propose a semi-supervised sequential variational Bayes (DSVB) framework for transfer learning and state inference in soft robots with missing state labels on certain robot configurations. Considering that soft robots may exhibit distinct dynamics under different robot configurations, a feature space transfer strategy is also incorporated to promote the adaptation of latent features across multiple configurations. Unlike existing transfer learning approaches, our proposed DSVB employs a recurrent neural network to model the nonlinear dynamics and temporal coherence in soft robot data. The proposed framework is validated on multiple setup configurations of a pneumatic-based soft robot finger. Experimental results on four transfer scenarios demonstrate that DSVB performs effective transfer learning and accurate state inference amidst missing state labels. The data and code are available at https://github.com/shageenderan/DSVB. △ Less

Submitted 25 August, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

Comments: Accepted at the International Conference on Robotics and Automation (ICRA) 2023

arXiv:2303.00444 [pdf, other]

Insights into air cushion dynamics during drop impact on heated substrate at low impact energy

Authors: Durbar Roy, Srinivas Rao S, Vishnu Hariharan, Saptarshi Basu

Abstract: We study the air layer dynamics beneath a drop im**ing a heated surface at low impact energy using high-speed reflection interferometry imaging and theoretical analysis. The air film has been subdivided into two distinct disjoint regions, the central dimple and the peripheral disc. We decipher that a gaussian profile can approximate the dynamic shape evolution of the central air dimple. We furth… ▽ More We study the air layer dynamics beneath a drop im**ing a heated surface at low impact energy using high-speed reflection interferometry imaging and theoretical analysis. The air film has been subdivided into two distinct disjoint regions, the central dimple and the peripheral disc. We decipher that a gaussian profile can approximate the dynamic shape evolution of the central air dimple. We further observe that the dimple geometry is a function of impact energy and its dependence on surface temperature is relatively weak. The air layer rupture time and rupture radius increases with increase in substrate temperature. We characterize the air layer profile as a 2D Knudsen field and show that a unified treatment, including continuum and non-continuum mechanics, is required to comprehend the air layer dynamics coherently. The airflow dynamics in the central dimple region falls within the purview of continuum stokes regime. In contrast, the peripheral air disc falls within the non-continuum (gas kinetic effects) slip flow and transition regime characterized by a high Knudsen number. However, the initial average air disc expansion dynamics could be understood in terms of stokes approximation. In non-continuum regimes of the peripheral air disc, we discover intriguing asymmetric interface perturbations. The asymmetric wetting of the substrate initiates at the edge of the peripheral disc region.These perturbative structures cause asymmetric wetting/contact between the droplet and the substrate. Due to the asymptotic effects of capillary and van der Waals interaction in the disc region, the sub-micron spatial structures can exist at short time scales. △ Less

Submitted 15 September, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

arXiv:2303.00262 [pdf, other]

Collage Diffusion

Authors: Vishnu Sarukkai, Linden Li, Arden Ma, Christopher Ré, Kayvon Fatahalian

Abstract: We seek to give users precise control over diffusion-based image generation by modeling complex scenes as sequences of layers, which define the desired spatial arrangement and visual attributes of objects in the scene. Collage Diffusion harmonizes the input layers to make objects fit together -- the key challenge involves minimizing changes in the positions and key visual attributes of the input l… ▽ More We seek to give users precise control over diffusion-based image generation by modeling complex scenes as sequences of layers, which define the desired spatial arrangement and visual attributes of objects in the scene. Collage Diffusion harmonizes the input layers to make objects fit together -- the key challenge involves minimizing changes in the positions and key visual attributes of the input layers while allowing other attributes to change in the harmonization process. We ensure that objects are generated in the correct locations by modifying text-image cross-attention with the layers' alpha masks. We preserve key visual attributes of input layers by learning specialized text representations per layer and by extending ControlNet to operate on layers. Layer input allows users to control the extent of image harmonization on a per-object basis, and users can even iteratively edit individual objects in generated images while kee** other objects fixed. By leveraging the rich information present in layer input, Collage Diffusion generates globally harmonized images that maintain desired object characteristics better than prior approaches. △ Less

Submitted 31 August, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

arXiv:2302.13721 [pdf, other]

Wireless End-to-End Image Transmission System using Semantic Communications

Authors: Maheshi Lokumarambage, Vishnu Gowrisetty, Hossein Rezaei, Thushan Sivalingam, Nandana Rajatheva, Anil Fernando

Abstract: Semantic communication is considered the future of mobile communication, which aims to transmit data beyond Shannon's theorem of communications by transmitting the semantic meaning of the data rather than the bit-by-bit reconstruction of the data at the receiver's end. The semantic communication paradigm aims to bridge the gap of limited bandwidth problems in modern high-volume multimedia applicat… ▽ More Semantic communication is considered the future of mobile communication, which aims to transmit data beyond Shannon's theorem of communications by transmitting the semantic meaning of the data rather than the bit-by-bit reconstruction of the data at the receiver's end. The semantic communication paradigm aims to bridge the gap of limited bandwidth problems in modern high-volume multimedia application content transmission. Integrating AI technologies with the 6G communications networks paved the way to develop semantic communication-based end-to-end communication systems. In this study, we have implemented a semantic communication-based end-to-end image transmission system, and we discuss potential design considerations in develo** semantic communication systems in conjunction with physical channel characteristics. A Pre-trained GAN network is used at the receiver as the transmission task to reconstruct the realistic image based on the Semantic segmented image at the receiver input. The semantic segmentation task at the transmitter (encoder) and the GAN network at the receiver (decoder) is trained on a common knowledge base, the COCO-Stuff dataset. The research shows that the resource gain in the form of bandwidth saving is immense when transmitting the semantic segmentation map through the physical channel instead of the ground truth image in contrast to conventional communication systems. Furthermore, the research studies the effect of physical channel distortions and quantization noise on semantic communication-based multimedia content transmission. △ Less

Submitted 10 April, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

Comments: Accepted for IEEE Access

arXiv:2302.11530 [pdf, ps, other]

Fair Chore Division under Binary Supermodular Costs

Authors: Siddharth Barman, Vishnu V. Narayan, Paritosh Verma

Abstract: We study the problem of dividing indivisible chores among agents whose costs (for the chores) are supermodular set functions with binary marginals. Such functions capture complementarity among chores, i.e., they constitute an expressive class wherein the marginal disutility of each chore is either one or zero, and the marginals increase with respect to supersets. In this setting, we study the broa… ▽ More We study the problem of dividing indivisible chores among agents whose costs (for the chores) are supermodular set functions with binary marginals. Such functions capture complementarity among chores, i.e., they constitute an expressive class wherein the marginal disutility of each chore is either one or zero, and the marginals increase with respect to supersets. In this setting, we study the broad landscape of finding fair and efficient chore allocations. In particular, we establish the existence of $(i)$ EF1 and Pareto efficient chore allocations, $(ii)$ MMS-fair and Pareto efficient allocations, and $(iii)$ Lorenz dominating chore allocations. Furthermore, we develop polynomial-time algorithms--in the value oracle model--for computing the chore allocations for each of these fairness and efficiency criteria. Complementing these existential and algorithmic results, we show that in this chore division setting, the aforementioned fairness notions, namely EF1, MMS, and Lorenz domination are incomparable: an allocation that satisfies any one of these notions does not necessarily satisfy the others. Additionally, we study EFX chore division. In contrast to the above-mentioned positive results, we show that, for binary supermodular costs, Pareto efficient allocations that are even approximately EFX do not exist, for any arbitrarily small approximation constant. Focusing on EFX fairness alone, when the cost functions are identical we present an algorithm (Add-and-Fix) that computes an EFX allocation. For binary marginals, we show that Add-and-Fix runs in polynomial time. △ Less

Submitted 22 February, 2023; originally announced February 2023.

Comments: 25 pages

arXiv:2302.11361 [pdf, other]

HDR image watermarking using saliency detection and quantization index modulation

Authors: Ahmed Khan, Minoru Kuribayashi, KokSheik Wong, Vishnu Monn Baskaran

Abstract: High-dynamic range (HDR) images are circulated rapidly over the internet with risks of being exploited for unauthorized usage. To protect these images, some HDR image based watermarking (HDR-IW) methods were put forward. However, they inherited the same problem faced by conventional IW methods for standard dynamic range (SDR) images, where only trade-offs among conflicting requirements are managed… ▽ More High-dynamic range (HDR) images are circulated rapidly over the internet with risks of being exploited for unauthorized usage. To protect these images, some HDR image based watermarking (HDR-IW) methods were put forward. However, they inherited the same problem faced by conventional IW methods for standard dynamic range (SDR) images, where only trade-offs among conflicting requirements are managed instead of simultaneous improvement. In this paper, a novel saliency (eye-catching object) detection based trade-off independent HDR-IW is proposed, to simultaneously improve robustness, imperceptibility and payload. First, the host image goes through our proposed salient object detection model to produce a saliency map, which is, in turn, exploited to segment the foreground and background of the host image. Next, the binary watermark is partitioned into the foregrounds and backgrounds using the same mask and scrambled using a random permutation algorithm. Finally, the watermark segments are embedded into selected bit-plane of the corresponding host segments using quantized indexed modulation. Experimental results suggest that the proposed work outperforms state-of-the-art methods in terms of improving the conflicting requirements. △ Less

Submitted 23 February, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

arXiv:2302.11156 [pdf, other]

doi 10.1145/3555641

Multi-Objective Personalization in Multi-Stakeholder Organizational Bulk E-mail: A Field Experiment

Authors: Ruoyan Kong, Chuankai Zhang, Ruixuan Sun, Vishnu Chhabra, Tanushsrisai Nadimpalli, Joseph A. Konstan

Abstract: Bulk email is often used in organizations to communicate ``important-to-organization'' messages such as policy changes, organizational plans, and administrative updates. However, normal employees may prefer messages more relevant to their jobs or interests. Organizations face the challenge of balancing prioritizing the messages they prefer employees to know (tactical goals) while maintaining emplo… ▽ More Bulk email is often used in organizations to communicate ``important-to-organization'' messages such as policy changes, organizational plans, and administrative updates. However, normal employees may prefer messages more relevant to their jobs or interests. Organizations face the challenge of balancing prioritizing the messages they prefer employees to know (tactical goals) while maintaining employees' positive experiences with these bulk emails, then they continue to read these emails in the future (strategic goals). Could personalization help organizations achieve these tactical and strategic goals? In an 8-week field experiment with a university newsletter, we implemented a 4x5x5 factorial design on personalizing subject lines, top news, and message order based on both the employees' and the organization's preferences. We measured these designs' influences on the open/interest/recognition/read-in-detail rate of the whole newsletter and the single messages within it. We found that ``important-to-organization'' messages only got higher recognition rates when being put on subject lines / top news (tactical goal). Mixing them with employee-preferred messages in top news did not bring further improvement to their own recognition rates but could improve the whole newsletter's recognition rate. Only when the top news solely contained the employee-preferred messages were the employees slightly more interested in the newsletter (strategic goal). We further analyze on which topics the employees and the organization's preferences conflicted. Finally, we discuss the design suggestions on personalization and recommendation for organizational bulk email. △ Less

Submitted 17 July, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

Comments: This is a pre-print version of a paper accepted to CSCW 2022, The 25th ACM Conference On Computer-Supported Cooperative Work And Social Computing; Ruoyan Kong et al. 2022. Multi-Objective Personalization in Multi-Stakeholder Organizational Bulk E-mail: A Field Experiment. Proc. ACM Hum.-Comput. Interact. 6, CSCW2, Article 528 (November 2022)

arXiv:2302.09124 [pdf, other]

doi 10.1145/3544548.3581302

ImageAssist: Tools for Enhancing Touchscreen-Based Image Exploration Systems for Blind and Low Vision Users

Authors: Vishnu Nair, Hanxiu 'Hazel' Zhu, Brian A. Smith

Abstract: Blind and low vision (BLV) users often rely on alt text to understand what a digital image is showing. However, recent research has investigated how touch-based image exploration on touchscreens can supplement alt text. Touchscreen-based image exploration systems allow BLV users to deeply understand images while granting a strong sense of agency. Yet, prior work has found that these systems requir… ▽ More Blind and low vision (BLV) users often rely on alt text to understand what a digital image is showing. However, recent research has investigated how touch-based image exploration on touchscreens can supplement alt text. Touchscreen-based image exploration systems allow BLV users to deeply understand images while granting a strong sense of agency. Yet, prior work has found that these systems require a lot of effort to use, and little work has been done to explore these systems' bottlenecks on a deeper level and propose solutions to these issues. To address this, we present ImageAssist, a set of three tools that assist BLV users through the process of exploring images by touch -- scaffolding the exploration process. We perform a series of studies with BLV users to design and evaluate ImageAssist, and our findings reveal several implications for image exploration tools for BLV users. △ Less

Submitted 17 February, 2023; originally announced February 2023.

Journal ref: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23), April 2023

arXiv:2302.07734 [pdf, other]

TFormer: A Transmission-Friendly ViT Model for IoT Devices

Authors: Zhichao Lu, Chuntao Ding, Felix Juefei-Xu, Vishnu Naresh Boddeti, Shangguang Wang, Yun Yang

Abstract: Deploying high-performance vision transformer (ViT) models on ubiquitous Internet of Things (IoT) devices to provide high-quality vision services will revolutionize the way we live, work, and interact with the world. Due to the contradiction between the limited resources of IoT devices and resource-intensive ViT models, the use of cloud servers to assist ViT model training has become mainstream. H… ▽ More Deploying high-performance vision transformer (ViT) models on ubiquitous Internet of Things (IoT) devices to provide high-quality vision services will revolutionize the way we live, work, and interact with the world. Due to the contradiction between the limited resources of IoT devices and resource-intensive ViT models, the use of cloud servers to assist ViT model training has become mainstream. However, due to the larger number of parameters and floating-point operations (FLOPs) of the existing ViT models, the model parameters transmitted by cloud servers are large and difficult to run on resource-constrained IoT devices. To this end, this paper proposes a transmission-friendly ViT model, TFormer, for deployment on resource-constrained IoT devices with the assistance of a cloud server. The high performance and small number of model parameters and FLOPs of TFormer are attributed to the proposed hybrid layer and the proposed partially connected feed-forward network (PCS-FFN). The hybrid layer consists of nonlearnable modules and a pointwise convolution, which can obtain multitype and multiscale features with only a few parameters and FLOPs to improve the TFormer performance. The PCS-FFN adopts group convolution to reduce the number of parameters. The key idea of this paper is to propose TFormer with few model parameters and FLOPs to facilitate applications running on resource-constrained IoT devices to benefit from the high performance of the ViT models. Experimental results on the ImageNet-1K, MS COCO, and ADE20K datasets for image classification, object detection, and semantic segmentation tasks demonstrate that the proposed model outperforms other state-of-the-art models. Specifically, TFormer-S achieves 5% higher accuracy on ImageNet-1K than ResNet18 with 1.4$\times$ fewer parameters and FLOPs. △ Less

Submitted 15 February, 2023; originally announced February 2023.

Comments: IEEE Transactions on Parallel and Distributed Systems

arXiv:2302.06924 [pdf, other]

doi 10.1007/s12036-023-09944-w

Solar Mean Magnetic Field of the Chromosphere

Authors: M. Vishnu, K. Nagaraju, Harsh Mathur

Abstract: The Solar Mean Magnetic Field (SMMF) is the mean value of the line of sight (LOS) component of the solar vector magnetic field averaged over the visible hemisphere of the Sun. So far, the studies on SMMF have mostly been confined to the magnetic field measurements at the photosphere. In this study, we calculate and analyse the SMMF using magnetic field measurements at the chromosphere, in conjunct… ▽ More The Solar Mean Magnetic Field (SMMF) is the mean value of the line of sight (LOS) component of the solar vector magnetic field averaged over the visible hemisphere of the Sun. So far, the studies on SMMF have mostly been confined to the magnetic field measurements at the photosphere. In this study, we calculate and analyse the SMMF using magnetic field measurements at the chromosphere, in conjunction with that of photospheric measurements. For this purpose, we have used full disk LOS magnetograms derived from spectropolarimetric observations carried out in Fe I 630.15 nm and Ca II 854.2 nm by the Synoptic Optical Long term Investigations of the Sun (SOLIS)/Vector Spectromagnetograph (VSM) instrument during 2010 to 2017. It is found from this study that the SMMF at the chromosphere is weaker by a factor of 0.60 compared to the SMMF at the upper photosphere. The correlation analysis between them gives a Pearson correlation coefficient of 0.80. The similarity and reduced intensity of the chromospheric SMMF with respect to the photospheric SMMF corroborate the idea that it is the source of the Interplanetary Magnetic Field (IMF). △ Less

Submitted 22 February, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

Comments: 11 pages, 6 figure, 3 tables

arXiv:2302.00478 [pdf, ps, other]

Energy-Optimal Sampling for Edge Computing Feedback Systems: Aperiodic Case

Authors: Vishnu Narayanan Moothedath

Abstract: We study the problem of optimal sampling in an edge-based video analytics system (VAS), where sensor samples collected at a terminal device are offloaded to a back-end server that processes them and generates feedback for a user. Sampling the system with the maximum allowed frequency results in the timely detection of relevant events with minimum delay. However, it incurs high energy costs and cau… ▽ More We study the problem of optimal sampling in an edge-based video analytics system (VAS), where sensor samples collected at a terminal device are offloaded to a back-end server that processes them and generates feedback for a user. Sampling the system with the maximum allowed frequency results in the timely detection of relevant events with minimum delay. However, it incurs high energy costs and causes unnecessary usage of network and compute resources via communication and processing of redundant samples. On the other hand, an infrequent sampling result in a higher delay in detecting the relevant event, thus increasing the idle energy usage and degrading the quality of experience in terms of responsiveness of the system. We quantify this sampling frequency trade-off as a weighted function between the number of samples and the responsiveness. We propose an energy-optimal aperiodic sampling policy that improves over the state-of-the-art optimal periodic sampling policy. Numerically, we show the proposed policy provides a consistent improvement of more than 10$\mathbf{\%}$ over the state-of-the-art. △ Less

Submitted 21 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

arXiv:2301.09467 [pdf]

doi 10.1364/OE.496855

High-field THz source centered at 2.6 THz

Authors: Wei Cui, Eeswar Kumar Yalavarthi, Aswin Vishnu Radhan, Mohammad Bashirpour, Angela Gamouras, Jean-Michel Ménard

Abstract: We demonstrate a table-top high-field terahertz (THz) source based on optical rectification of a collimated near-infrared pulse in gallium phosphide (GaP) to produce peak fields exceeding 300 kV/cm with a spectrum centered at 2.6 THz. The experimental configuration, based on tilted-pulse-front phase matching, is implemented with a phase grating etched directly onto the front surface of the GaP cry… ▽ More We demonstrate a table-top high-field terahertz (THz) source based on optical rectification of a collimated near-infrared pulse in gallium phosphide (GaP) to produce peak fields exceeding 300 kV/cm with a spectrum centered at 2.6 THz. The experimental configuration, based on tilted-pulse-front phase matching, is implemented with a phase grating etched directly onto the front surface of the GaP crystal. Although the THz generation efficiency starts showing a saturation onset as the near-infrared pulse energy reaches 0.57 mJ, we can expect our configuration to yield THz peak fields up to 866 kV/cm when a 5 mJ generation NIR pulse is used. This work paves the way towards broadband, high-field THz sources able to access a new class of THz coherent control and nonlinear phenomena driven at frequencies above 2 THz. △ Less

Submitted 18 December, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

Comments: 8 pages, 3 figures

Journal ref: Optics Express 31, 32468 (2023)

arXiv:2301.04983 [pdf, other]

doi 10.3847/2041-8213/acae99

Missing for 20 years: MeerKAT re-detects the elusive binary pulsar M30B

Authors: Vishnu Balakrishnan, Paulo Freire, Scott Ransom, Alessandro Ridolfi, Ewan Barr, Weiwei Chen, Vivek Venkatraman Krishnan, David J. Champion, Michael Kramer, Tasha Gautam, Prajwal Padmanabh, Yunpeng Men, Federico Abbate, Benjamin Stappers, Ingrid Stairs, Evan Keane, Andrea Possenti

Abstract: PSR J2140$-$2311B is a 13-ms pulsar discovered in 2001 in a 7.8-hour Green Bank Telescope (GBT) observation of the core-collapsed globular cluster M30 and predicted to be in a highly eccentric binary orbit. This pulsar has eluded detection since then, therefore its precise orbital parameters have remained a mystery until now. In this work, we present the confirmation of this pulsar using observati… ▽ More PSR J2140$-$2311B is a 13-ms pulsar discovered in 2001 in a 7.8-hour Green Bank Telescope (GBT) observation of the core-collapsed globular cluster M30 and predicted to be in a highly eccentric binary orbit. This pulsar has eluded detection since then, therefore its precise orbital parameters have remained a mystery until now. In this work, we present the confirmation of this pulsar using observations taken with the UHF receivers of the MeerKAT telescope as part of the TRAPUM Large Survey Project. Taking advantage of the beamforming capability of our backends, we have localized it, placing it $1.2(1)^\prime$ from the cluster centre. Our observations have enabled the determination of its orbit: it is highly eccentric ($e = 0.879$) with an orbital period of $6.2$ days. We also measured the rate of periastron advance, $\dotω = 0.078 \pm 0.002\, \rm deg \, yr^{-1}$. Assuming that this effect is fully relativistic, general relativity provides an estimate of the total mass of the system, $M_{\rm TOT} = 2.53 \pm 0.08$ M$_{\odot}$, consistent with the lightest double neutron star systems known. Combining this with the mass function of the system gives the pulsar and companion masses of $m_p < 1.43 \, \rm M_{\odot}$ and $m_c > 1.10 \, \rm M_{\odot}$ respectively. The massive, undetected companion could either be a massive WD or a NS. M30B likely formed as a result of a secondary exchange encounter. Future timing observations will allow the determination of a phase-coherent timing solution, vastly improving our uncertainty in $\dotω$ and likely enabling the detection of additional relativistic effects which will determine $m_p$ and $m_c$. △ Less

Submitted 12 January, 2023; originally announced January 2023.

Comments: Accepted for publication in the Astrophysical Journal Letters (ApJL)

arXiv:2301.04035 [pdf, other]

doi 10.1016/j.icarus.2023.115426

Constraints on the lunar core viscosity from tidal deformation

Authors: Arthur Briaud, Agnès Fienga, Daniele Melini, Nicolas Rambaux, Anthony Mémin, Giorgio Spada, Christelle Saliby, Hauke Hussmann, Alexander Stark, Vishnu Viswanathan, Daniel Baguet

Abstract: We use the tidal deformations of the Moon induced by the Earth and the Sun as a tool for studying the inner structure of our satellite. Based on measurements of the degree-two tidal Love numbers k2 and h2 and dissipation coefficients from the GRAIL mission, Lunar Laser Ranging and Laser Altimetry on board of the LRO spacecraft, we perform Monte Carlo samplings for 120,000 possible combinations of… ▽ More We use the tidal deformations of the Moon induced by the Earth and the Sun as a tool for studying the inner structure of our satellite. Based on measurements of the degree-two tidal Love numbers k2 and h2 and dissipation coefficients from the GRAIL mission, Lunar Laser Ranging and Laser Altimetry on board of the LRO spacecraft, we perform Monte Carlo samplings for 120,000 possible combinations of thicknesses and viscosities for two classes of the lunar models. The first one includes a uniform core, a low viscosity zone (LVZ) at the core-mantle boundary, a mantle and a crust. The second one has an additional inner core. All models are consistent with the lunar total mass as well as its moment of inertia. By comparing predicted and observed parameters for the tidal deformations we find that the existence of an inner core cannot be ruled out. Furthermore, by deducing temperature profiles for the LVZ and an Earth-like mantle, we obtain stringent constraints on the radius (500 +- 1) km, viscosity,21 (4.5 +- 0.8) x 10^16 Pa.s and the density (3400 +- 10) kg/m^3 of the LVZ. We also infer the first estimation for the outer core viscosity, (2.07 +- 1.03) x 10^17 Pa.s, for two different possible structures: a Moon with a 70 km thick outer core and a large inner core (290 km radius with a density of 6000 kg/m3), and a Moon with a thicker outer core (169 km thick) but a denser and smaller inner core (219 km radius for 8000 kg/m^3). △ Less

Submitted 10 January, 2023; originally announced January 2023.

arXiv:2301.03864 [pdf, other]

doi 10.1093/mnras/stad029

MeerKAT discovery of 13 new pulsars in Omega Centauri

Authors: W. Chen, P. C. C. Freire, A. Ridolfi, E. D. Barr, B. Stappers, M. Kramer, A. Possenti, S. M. Ransom, L. Levin, R. P. Breton, M. Burgay, F. Camilo, S. Buchner, D. J. Champion, F. Abbate, V. Venkatraman Krishnan, P. V. Padmanabh, T. Gautam, L. Vleeschower, M. Geyer, J-M. Grießmeier, Y. P. Men, V. Balakrishnan, M. C. Bezuidenhout

Abstract: The most massive globular cluster in our Galaxy, Omega Centauri, is an interesting target for pulsar searches, because of its multiple stellar populations and the intriguing possibility that it was once the nucleus of a galaxy that was absorbed into the Milky Way. The recent discoveries of pulsars in this globular cluster and their association with known X-ray sources was a hint that, given the la… ▽ More The most massive globular cluster in our Galaxy, Omega Centauri, is an interesting target for pulsar searches, because of its multiple stellar populations and the intriguing possibility that it was once the nucleus of a galaxy that was absorbed into the Milky Way. The recent discoveries of pulsars in this globular cluster and their association with known X-ray sources was a hint that, given the large number of known X-ray sources, there is a much larger undiscovered pulsar population. We used the superior sensitivity of the MeerKAT radio telescope to search for pulsars in Omega Centauri. In this paper, we present some of the first results of this survey, including the discovery of 13 new pulsars; the total number of known pulsars in this cluster currently stands at 18. At least half of them are in binary systems and preliminary orbital constraints suggest that most of the binaries have light companions. We also discuss the ratio between isolated and binaries pulsars and how they were formed in this cluster. △ Less

Submitted 10 January, 2023; originally announced January 2023.

arXiv:2301.02580 [pdf, other]

Neuro-DynaStress: Predicting Dynamic Stress Distributions in Structural Components

Authors: Hamed Bolandi, Gautam Sreekumar, Xuyang Li, Nizar Lajnef, Vishnu Naresh Boddeti

Abstract: Structural components are typically exposed to dynamic loading, such as earthquakes, wind, and explosions. Structural engineers should be able to conduct real-time analysis in the aftermath or during extreme disaster events requiring immediate corrections to avoid fatal failures. As a result, it is crucial to predict dynamic stress distributions during highly disruptive events in real-time. Curren… ▽ More Structural components are typically exposed to dynamic loading, such as earthquakes, wind, and explosions. Structural engineers should be able to conduct real-time analysis in the aftermath or during extreme disaster events requiring immediate corrections to avoid fatal failures. As a result, it is crucial to predict dynamic stress distributions during highly disruptive events in real-time. Currently available high-fidelity methods, such as Finite Element Models (FEMs), suffer from their inherent high complexity and are computationally prohibitive. Therefore, to reduce computational cost while preserving accuracy, a deep learning model, Neuro-DynaStress, is proposed to predict the entire sequence of stress distribution based on finite element simulations using a partial differential equation (PDE) solver. The model was designed and trained to use the geometry, boundary conditions and sequence of loads as input and predict the sequences of high-resolution stress contours. The performance of the proposed framework is compared to finite element simulations using a PDE solver. △ Less

Submitted 18 December, 2022; originally announced January 2023.

Comments: 16 pages, 12 figures. arXiv admin note: text overlap with arXiv:2211.16190

arXiv:2212.11005 [pdf, other]

Revisiting Residual Networks for Adversarial Robustness: An Architectural Perspective

Authors: Shihua Huang, Zhichao Lu, Kalyanmoy Deb, Vishnu Naresh Boddeti

Abstract: Efforts to improve the adversarial robustness of convolutional neural networks have primarily focused on develo** more effective adversarial training methods. In contrast, little attention was devoted to analyzing the role of architectural elements (such as topology, depth, and width) on adversarial robustness. This paper seeks to bridge this gap and present a holistic study on the impact of arc… ▽ More Efforts to improve the adversarial robustness of convolutional neural networks have primarily focused on develo** more effective adversarial training methods. In contrast, little attention was devoted to analyzing the role of architectural elements (such as topology, depth, and width) on adversarial robustness. This paper seeks to bridge this gap and present a holistic study on the impact of architectural design on adversarial robustness. We focus on residual networks and consider architecture design at the block level, i.e., topology, kernel size, activation, and normalization, as well as at the network scaling level, i.e., depth and width of each block in the network. In both cases, we first derive insights through systematic ablative experiments. Then we design a robust residual block, dubbed RobustResBlock, and a compound scaling rule, dubbed RobustScaling, to distribute depth and width at the desired FLOP count. Finally, we combine RobustResBlock and RobustScaling and present a portfolio of adversarially robust residual networks, RobustResNets, spanning a broad spectrum of model capacities. Experimental validation across multiple datasets and adversarial attacks demonstrate that RobustResNets consistently outperform both the standard WRNs and other existing robust architectures, achieving state-of-the-art AutoAttack robust accuracy of 61.1% without additional data and 63.7% with 500K external data while being $2\times$ more compact in terms of parameters. Code is available at \url{ https://github.com/zhichao-lu/robust-residual-network} △ Less

Submitted 21 December, 2022; originally announced December 2022.

arXiv:2212.07425 [pdf, other]

Robust and Explainable Identification of Logical Fallacies in Natural Language Arguments

Authors: Zhivar Sourati, Vishnu Priya Prasanna Venkatesh, Darshan Deshpande, Himanshu Rawlani, Filip Ilievski, Hông-Ân Sandlin, Alain Mermoud

Abstract: The spread of misinformation, propaganda, and flawed argumentation has been amplified in the Internet era. Given the volume of data and the subtlety of identifying violations of argumentation norms, supporting information analytics tasks, like content moderation, with trustworthy methods that can identify logical fallacies is essential. In this paper, we formalize prior theoretical work on logical… ▽ More The spread of misinformation, propaganda, and flawed argumentation has been amplified in the Internet era. Given the volume of data and the subtlety of identifying violations of argumentation norms, supporting information analytics tasks, like content moderation, with trustworthy methods that can identify logical fallacies is essential. In this paper, we formalize prior theoretical work on logical fallacies into a comprehensive three-stage evaluation framework of detection, coarse-grained, and fine-grained classification. We adapt existing evaluation datasets for each stage of the evaluation. We employ three families of robust and explainable methods based on prototype reasoning, instance-based reasoning, and knowledge injection. The methods combine language models with background knowledge and explainable mechanisms. Moreover, we address data sparsity with strategies for data augmentation and curriculum learning. Our three-stage framework natively consolidates prior datasets and methods from existing tasks, like propaganda detection, serving as an overarching evaluation testbed. We extensively evaluate these methods on our datasets, focusing on their robustness and explainability. Our results provide insight into the strengths and weaknesses of the methods on different components and fallacy classes, indicating that fallacy identification is a challenging task that may require specialized forms of reasoning to capture various classes. We share our open-source code and data on GitHub to support further work on logical fallacy identification. △ Less

Submitted 25 September, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

arXiv:2212.06100 [pdf, other]

Realistic Modeling of Human Timings for Wearable Cognitive Assistance

Authors: Manuel O. J. Olguín Muñoz, Vishnu N. Moothedath, Jaya Prakash Champati, Roberta Klatzky, Mahadev Satyanarayanan, James Gross

Abstract: Wearable Cognitive Assistance (WCA) applications present a challenge to benchmark and characterize due to their human-in-the-loop nature. Employing user testing to optimize system parameters is generally not feasible, given the scope of the problem and the number of observations needed to detect small but important effects in controlled experiments. Considering the intended mass-scale deployment o… ▽ More Wearable Cognitive Assistance (WCA) applications present a challenge to benchmark and characterize due to their human-in-the-loop nature. Employing user testing to optimize system parameters is generally not feasible, given the scope of the problem and the number of observations needed to detect small but important effects in controlled experiments. Considering the intended mass-scale deployment of WCA applications in the future, there exists a need for tools enabling human-independent benchmarking. We present in this paper the first model for the complete end-to-end emulation of humans in WCA. We build this model through statistical analysis of data collected from previous work in this field, and demonstrate its utility by studying application task durations. Compared to first-order approximations, our model shows a ~36% larger gap between step execution times at high system impairment versus low. We further introduce a novel framework for stochastic optimization of resource consumption-responsiveness tradeoffs in WCA, and show that by combining this framework with our realistic model of human behavior, significant reductions of up to 50% in number processed frame samples and 20% in energy consumption can be achieved with respect to the state-of-the-art. △ Less

Submitted 12 December, 2022; originally announced December 2022.

Comments: 16 total pages. 12 figures, 2 tables, 1 appendix. Main document body by Manuel Olguín Muñoz and Vishnu N. Moothedath; appendix by Vishu N. Moothedath and Jaya Prakash Champati; editing and feedback by all authors; funding by James Gross and Mahadev Satyanarayanan. Submitted to IEEE Transactions on Mobile Computing

arXiv:2211.16808 [pdf, other]

doi 10.1109/PRDC59308.2023.00013

Efficient Adversarial Input Generation via Neural Net Patching

Authors: Tooba Khan, Kumar Madhukar, Subodh Vishnu Sharma

Abstract: The generation of adversarial inputs has become a crucial issue in establishing the robustness and trustworthiness of deep neural nets, especially when they are used in safety-critical application domains such as autonomous vehicles and precision medicine. However, the problem poses multiple practical challenges, including scalability issues owing to large-sized networks, and the generation of adv… ▽ More The generation of adversarial inputs has become a crucial issue in establishing the robustness and trustworthiness of deep neural nets, especially when they are used in safety-critical application domains such as autonomous vehicles and precision medicine. However, the problem poses multiple practical challenges, including scalability issues owing to large-sized networks, and the generation of adversarial inputs that lack important qualities such as naturalness and output-impartiality. This problem shares its end goal with the task of patching neural nets where small changes in some of the network's weights need to be discovered so that upon applying these changes, the modified net produces the desirable output for a given set of inputs. We exploit this connection by proposing to obtain an adversarial input from a patch, with the underlying observation that the effect of changing the weights can also be brought about by changing the inputs instead. Thus, this paper presents a novel way to generate input perturbations that are adversarial for a given network by using an efficient network patching technique. We note that the proposed method is significantly more effective than the prior state-of-the-art techniques. △ Less

Submitted 28 September, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

arXiv:2211.16649 [pdf, other]

CLIP-Nav: Using CLIP for Zero-Shot Vision-and-Language Navigation

Authors: Vishnu Sashank Dorbala, Gunnar Sigurdsson, Robinson Piramuthu, Jesse Thomason, Gaurav S. Sukhatme

Abstract: Household environments are visually diverse. Embodied agents performing Vision-and-Language Navigation (VLN) in the wild must be able to handle this diversity, while also following arbitrary language instructions. Recently, Vision-Language models like CLIP have shown great performance on the task of zero-shot object recognition. In this work, we ask if these models are also capable of zero-shot la… ▽ More Household environments are visually diverse. Embodied agents performing Vision-and-Language Navigation (VLN) in the wild must be able to handle this diversity, while also following arbitrary language instructions. Recently, Vision-Language models like CLIP have shown great performance on the task of zero-shot object recognition. In this work, we ask if these models are also capable of zero-shot language grounding. In particular, we utilize CLIP to tackle the novel problem of zero-shot VLN using natural language referring expressions that describe target objects, in contrast to past work that used simple language templates describing object classes. We examine CLIP's capability in making sequential navigational decisions without any dataset-specific finetuning, and study how it influences the path that an agent takes. Our results on the coarse-grained instruction following task of REVERIE demonstrate the navigational capability of CLIP, surpassing the supervised baseline in terms of both success rate (SR) and success weighted by path length (SPL). More importantly, we quantitatively show that our CLIP-based zero-shot approach generalizes better to show consistent performance across environments when compared to SOTA, fully supervised learning approaches when evaluated via Relative Change in Success (RCS). △ Less

Submitted 29 November, 2022; originally announced November 2022.

Comments: 8 pages, Accepted at LangRob Workshop at Conference on Robot Learning (CoRL), 2022

arXiv:2211.16190 [pdf, other]

Physics Informed Neural Network for Dynamic Stress Prediction

Authors: Hamed Bolandi, Gautam Sreekumar, Xuyang Li, Nizar Lajnef, Vishnu Naresh Boddeti

Abstract: Structural failures are often caused by catastrophic events such as earthquakes and winds. As a result, it is crucial to predict dynamic stress distributions during highly disruptive events in real time. Currently available high-fidelity methods, such as Finite Element Models (FEMs), suffer from their inherent high complexity. Therefore, to reduce computational cost while maintaining accuracy, a P… ▽ More Structural failures are often caused by catastrophic events such as earthquakes and winds. As a result, it is crucial to predict dynamic stress distributions during highly disruptive events in real time. Currently available high-fidelity methods, such as Finite Element Models (FEMs), suffer from their inherent high complexity. Therefore, to reduce computational cost while maintaining accuracy, a Physics Informed Neural Network (PINN), PINN-Stress model, is proposed to predict the entire sequence of stress distribution based on Finite Element simulations using a partial differential equation (PDE) solver. Using automatic differentiation, we embed a PDE into a deep neural network's loss function to incorporate information from measurements and PDEs. The PINN-Stress model can predict the sequence of stress distribution in almost real-time and can generalize better than the model without PINN. △ Less

Submitted 28 November, 2022; originally announced November 2022.

Comments: 14 pages, 13 figures

arXiv:2211.16172 [pdf, other]

Learnings from Technological Interventions in a Low Resource Language: Enhancing Information Access in Gondi

Authors: Devansh Mehta, Harshita Diddee, Ananya Saxena, Anurag Shukla, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Vishnu Prasad, Venkanna U, Kalika Bali

Abstract: The primary obstacle to develo** technologies for low-resource languages is the lack of representative, usable data. In this paper, we report the deployment of technology-driven data collection methods for creating a corpus of more than 60,000 translations from Hindi to Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. During this pr… ▽ More The primary obstacle to develo** technologies for low-resource languages is the lack of representative, usable data. In this paper, we report the deployment of technology-driven data collection methods for creating a corpus of more than 60,000 translations from Hindi to Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. During this process, we help expand information access in Gondi across 2 different dimensions (a) The creation of linguistic resources that can be used by the community, such as a dictionary, children's stories, Gondi translations from multiple sources and an Interactive Voice Response (IVR) based mass awareness platform; (b) Enabling its use in the digital domain by develo** a Hindi-Gondi machine translation model, which is compressed by nearly 4 times to enable it's edge deployment on low-resource edge devices and in areas of little to no internet connectivity. We also present preliminary evaluations of utilizing the developed machine translation model to provide assistance to volunteers who are involved in collecting more data for the target language. Through these interventions, we not only created a refined and evaluated corpus of 26,240 Hindi-Gondi translations that was used for building the translation model but also engaged nearly 850 community members who can help take Gondi onto the internet. △ Less

Submitted 29 November, 2022; originally announced November 2022.

Comments: In Submission (Revised) to Language Resources and Evaluation Journal. arXiv admin note: text overlap with arXiv:2004.10270

arXiv:2211.09801 [pdf, other]

doi 10.4310/ATMP.2023.v27.n4.a3

Machine Learned Calabi-Yau Metrics and Curvature

Authors: Per Berglund, Giorgi Butbaia, Tristan Hübsch, Vishnu Jejjala, Damián Mayorga Peña, Challenger Mishra, Justin Tan

Abstract: Finding Ricci-flat (Calabi-Yau) metrics is a long standing problem in geometry with deep implications for string theory and phenomenology. A new attack on this problem uses neural networks to engineer approximations to the Calabi-Yau metric within a given Kähler class. In this paper we investigate numerical Ricci-flat metrics over smooth and singular K3 surfaces and Calabi-Yau threefolds. Using th… ▽ More Finding Ricci-flat (Calabi-Yau) metrics is a long standing problem in geometry with deep implications for string theory and phenomenology. A new attack on this problem uses neural networks to engineer approximations to the Calabi-Yau metric within a given Kähler class. In this paper we investigate numerical Ricci-flat metrics over smooth and singular K3 surfaces and Calabi-Yau threefolds. Using these Ricci-flat metric approximations for the Cefalú family of quartic twofolds and the Dwork family of quintic threefolds, we study characteristic forms on these geometries. We observe that the numerical stability of the numerically computed topological characteristic is heavily influenced by the choice of the neural network model, in particular, we briefly discuss a different neural network model, namely Spectral networks, which correctly approximate the topological characteristic of a Calabi-Yau. Using persistent homology, we show that high curvature regions of the manifolds form clusters near the singular points. For our neural network approximations, we observe a Bogomolov--Yau type inequality $3c_2 \geq c_1^2$ and observe an identity when our geometries have isolated $A_1$ type singularities. We sketch a proof that $χ(X~\smallsetminus~\mathrm{Sing}\,{X}) + 2~|\mathrm{Sing}\,{X}| = 24$ also holds for our numerical approximations. △ Less

Submitted 6 June, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

Comments: Version accepted for publication: 48 pages, 32 figures, 8 tables, 3 appendices

Journal ref: ATMP v.27 no.4 (2023) 1107-1158

arXiv:2211.08749 [pdf, ps, other]

A few more Lonely Runners

Authors: Avinash Bhardwaj, Vishnu Narayanan, Hrishikesh Venkataraman

Abstract: Lonely Runner Conjecture, proposed by Jörg M. Wills and so nomenclatured by Luis Goddyn, has been an object of interest since it was first conceived in 1967 : Given positive integers $k$ and $n_1,n_2,\ldots,n_k$ there exists a positive real number $t$ such that the distance of $t\cdot n_j$ to the nearest integer is at least $\frac{1}{k+1}$, $\forall~~1\leq j\leq k$. In a recent article Beck, Hoste… ▽ More Lonely Runner Conjecture, proposed by Jörg M. Wills and so nomenclatured by Luis Goddyn, has been an object of interest since it was first conceived in 1967 : Given positive integers $k$ and $n_1,n_2,\ldots,n_k$ there exists a positive real number $t$ such that the distance of $t\cdot n_j$ to the nearest integer is at least $\frac{1}{k+1}$, $\forall~~1\leq j\leq k$. In a recent article Beck, Hosten and Schymura described the Lonely Runner polyhedron and provided a polyhedral approach to identifying families of lonely runner instances. We revisit the Lonely Runner polyhedron and highlight some new families of instances satisfying the conjecture. In addition, we relax the sufficiency of existence of an integer point in the Lonely Runner polyhedron to prove the conjecture. Specifically, we propose that it suffices to show the existence of a lattice point of certain superlattices of the integer lattice in the Lonely Runner polyhedron. △ Less

Submitted 28 July, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

MSC Class: 51M20; 52C07

arXiv:2211.04987 [pdf, other]

Interpretable Deep Reinforcement Learning for Green Security Games with Real-Time Information

Authors: Vishnu Dutt Sharma, John P. Dickerson, Pratap Tokekar

Abstract: Green Security Games with real-time information (GSG-I) add the real-time information about the agents' movement to the typical GSG formulation. Prior works on GSG-I have used deep reinforcement learning (DRL) to learn the best policy for the agent in such an environment without any need to store the huge number of state representations for GSG-I. However, the decision-making process of DRL method… ▽ More Green Security Games with real-time information (GSG-I) add the real-time information about the agents' movement to the typical GSG formulation. Prior works on GSG-I have used deep reinforcement learning (DRL) to learn the best policy for the agent in such an environment without any need to store the huge number of state representations for GSG-I. However, the decision-making process of DRL methods is largely opaque, which results in a lack of trust in their predictions. To tackle this issue, we present an interpretable DRL method for GSG-I that generates visualization to explain the decisions taken by the DRL algorithm. We also show that this approach performs better and works well with a simpler training regimen compared to the existing method. △ Less

Submitted 9 November, 2022; originally announced November 2022.

Showing 151–200 of 740 results for author: Vishnu