Search | arXiv e-print repository

Real-time Deformation Correction in Additively Printed Flexible Antenna Arrays

Authors: Sreeni Poolakkal, Abdullah Islam, Shrestha Bansal, Arpit Rao, Ted Dabrowski, Kalsi Kwan, Amit Mishra, Quiyan Xu, Erfan Ghaderi, Pradeep Lall, Sudip Shekhar, Julio Navarro, Shenqiang Ren, John Williams, Subhanshu Gupta

Abstract: Conformal phased arrays provide multiple degrees of freedom to the scan angle, which is typically limited by antenna aperture in rigid arrays. Silicon-based RF signal processing offers reliable, reconfigurable, multi-functional, and compact control for conformal phased arrays that can be used for on-the-move communication. While the light weight, compactness, and shape-changing properties of conformal phased arrays are attractive, these features result in dynamic deformation of the array during motion, leading to significant dynamic beam-pointing errors. We propose a silicon-based, compact, reconfigurable solution to self-correct these dynamic deformation-induced beam-pointing errors. Furthermore, additive printing is leveraged to enhance the flexibility of the conformal phased arrays, as the printed conductive ink is more flexible than bulk copper and can be easily deposited on flexible sheets using different printing tools, providing an environmentally friendly solution for large-scale production. Conventional silver inks are expensive, and copper-based printable inks suffer from spontaneous metal oxidation that alters trace impedance and degrades beamforming performance. This work uses a low-cost molecular copper decomposition ink with reliable RF properties at different temperatures and strains to print the proposed intelligent conformal phased array operating at 2.1 GHz. A proof-of-concept $2\times2$ prototype array self-corrects the deformation-induced beam-pointing error with an error $<1.25^\circ$. The silicon-based array processing part occupies only 2.58 mm$^2$ of area and consumes 83 mW of power per tile.

Submitted 21 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.
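
The abstract above does not describe the correction circuit itself, so the following is only a textbook sketch of why deformation shifts the beam and what a phase correction has to compensate. It assumes a 2D geometry, known element positions, and a hypothetical helper `steering_phases`; none of these come from the paper.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def steering_phases(positions_m, freq_hz, theta_deg):
    """Phase (radians) to apply at each element so that a plane wave
    adds coherently in direction theta (measured from broadside).

    positions_m: list of (x, z) element coordinates in metres
                 (hypothetical layout, not taken from the paper).
    """
    k = 2.0 * math.pi * freq_hz / C            # free-space wavenumber
    theta = math.radians(theta_deg)
    ux, uz = math.sin(theta), math.cos(theta)  # unit vector of the beam
    return [-k * (x * ux + z * uz) for x, z in positions_m]


FREQ_HZ = 2.1e9                      # operating frequency quoted in the abstract
wavelength = C / FREQ_HZ

# Two elements half a wavelength apart: flat, then with one element
# displaced 1 cm out of plane (an illustrative deformation).
flat = [(0.0, 0.0), (wavelength / 2, 0.0)]
bent = [(0.0, 0.0), (wavelength / 2, 0.01)]

flat_phases = steering_phases(flat, FREQ_HZ, 30.0)
bent_phases = steering_phases(bent, FREQ_HZ, 30.0)

# Extra phase each element needs once the array has deformed.
correction = [b - f for b, f in zip(bent_phases, flat_phases)]
```

If the flat-array phases are left unchanged after the array bends, the missing `correction` term shows up as a beam-pointing error. This is the kind of error the abstract reports correcting.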

arXiv:2404.10965 [pdf, other]

IMIL: Interactive Medical Image Learning Framework

Authors: Adrit Rao, Andrea Fisher, Ken Chang, John Christopher Panagides, Katherine McNamara, Joon-Young Lee, Oliver Aalami

Abstract: Data augmentations are widely used in training medical image deep learning models to increase the diversity and size of sparse datasets. However, commonly used augmentation techniques can result in loss of clinically relevant information from medical images, leading to incorrect predictions at inference time. We propose the Interactive Medical Image Learning (IMIL) framework, a novel approach for improving the training of medical image analysis algorithms that enables clinician-guided intermediate training data augmentations on misprediction outliers, focusing the algorithm on relevant visual information. To prevent the model from using irrelevant features during training, IMIL will 'blackout' clinician-designated irrelevant regions and replace the original images with the augmented samples. This ensures that for originally mispredicted samples, the algorithm subsequently attends only to relevant regions and correctly correlates them with the respective diagnosis. We validate the efficacy of IMIL using radiology residents and compare its performance to state-of-the-art data augmentations. A 4.2% improvement in accuracy over ResNet-50 was observed when using IMIL on only 4% of the training set. Our study demonstrates the utility of clinician-guided interactive training to achieve meaningful data augmentations for medical image analysis algorithms.

Submitted 16 April, 2024; originally announced April 2024.

Comments: CVPR 2024 Workshop on Domain adaptation, Explainability and Fairness in AI for Medical Image Analysis (DEF-AI-MIA)
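
The 'blackout' step mentioned in the abstract above can be pictured as masking rectangles in an image. The sketch below is a minimal illustration, not the IMIL implementation. The function name `blackout`, the rectangular region format, and the nested-list image are all assumptions made for the example.

```python
def blackout(image, regions, fill=0):
    """Return a copy of `image` with each rectangular region set to `fill`.

    image:   list of rows, each a list of pixel values (hypothetical format).
    regions: iterable of (top, left, bottom, right) with half-open bounds,
             standing in for clinician-designated irrelevant areas.
    """
    out = [row[:] for row in image]          # leave the original untouched
    height = len(out)
    width = len(out[0]) if out else 0
    for top, left, bottom, right in regions:
        # Clamp the rectangle to the image bounds before filling.
        for r in range(max(0, top), min(bottom, height)):
            for c in range(max(0, left), min(right, width)):
                out[r][c] = fill
    return out


image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]

# Mask the top-right 2x2 block; the masked copy would replace the
# original sample in the training set.
masked = blackout(image, [(0, 1, 2, 3)])
```

Returning a copy rather than editing in place keeps the original image available, which is convenient if the marked region needs to be revised later.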

arXiv:2401.14498 [pdf, other]

Predictive Analysis for Optimizing Port Operations

Authors: Aniruddha Rajendra Rao, Haiyan Wang, Chetan Gupta

Abstract: Maritime transport is a pivotal logistics mode for the long-distance and bulk transportation of goods. However, the intricate planning involved in this mode is often hindered by uncertainties, including weather conditions, cargo diversity, and port dynamics, leading to increased costs. Consequently, accurately estimating vessel total (stay) time at port and potential delays becomes imperative for effective planning and scheduling in port operations. This study aims to develop a port operation solution with competitive prediction and classification capabilities for estimating vessel Total and Delay times. This research addresses a significant gap in port analysis models for vessel Stay and Delay times, offering a valuable contribution to the field of maritime logistics. The proposed solution is designed to assist decision-making in port environments and predict service delays. This is demonstrated through a case study on Brazilian ports. Additionally, feature analysis is used to understand the key factors impacting maritime logistics, enhancing the overall understanding of the complexities involved in port operations.

Submitted 25 January, 2024; originally announced January 2024.

Comments: 13 pages, 9 figures, 4 Tables. Submitted at IEEE IJCNN 2024

arXiv:2311.08085 [pdf, other]

Optimizing Electric Vehicle Efficiency with Real-Time Telemetry using Machine Learning

Authors: Aryaman Rao, Harshit Gupta, Parth Singh, Shivam Mittal, Utkrash Singh, Dinesh Kumar Vishwakarma

Abstract: In a contemporary world of degrading natural resources, energy efficiency has become imperative for conservation and environmental protection. It is therefore crucial to look for advanced technology that minimizes energy consumption. This research focuses on the optimization of battery-electric city-style vehicles through the use of a real-time in-car telemetry system that communicates between components through the robust Controller Area Network (CAN) protocol. By harnessing real-time data from various sensors embedded within vehicles, our driving assistance system provides the driver with actionable visual and haptic feedback that guides the driver toward the optimum driving style to minimize the power consumed by the vehicle. To develop the pace feedback mechanism for the driver, real-time data is collected through a Shell Eco Marathon Urban Concept vehicle platform and, after pre-processing, analyzed using the novel machine learning algorithm TEMSL, which outperforms the existing baseline approaches across various performance metrics. After extensive experimentation, this method has proven effective in enhancing energy efficiency, guiding the driver along the track, and reducing human errors. The driving-assistance system offers a range of utilities, from cost savings and extended vehicle lifespan to significant contributions to environmental conservation and sustainable driving practices.

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2311.07569 [pdf, other]

Optimal Load Shedding for Public Safety Power Shutoffs

Authors: Aniruddha Rajendra Rao, Chandrasekar Venkatraman, Robert Ellis, Chetan Gupta

Abstract: Public utilities are faced with situations where high winds can bring trees and debris into contact with energized power lines and other equipment, which could ignite wildfires. As a result, they need to turn off power during severe weather to help prevent wildfires. This is called a Public Safety Power Shutoff (PSPS). We present a method for load reduction using a multi-step genetic algorithm for Public Safety Power Shutoff events. The proposed method optimizes load shedding using partial load shedding based on load importance (critical loads like hospitals, fire stations, etc.). The multi-step genetic algorithm optimizes load shedding while minimizing the impact on important loads and preserving grid stability. The effectiveness of the method is demonstrated through network examples. The results show that the proposed method achieves minimal load shedding while maintaining the critical loads at acceptable levels. This approach will help utilities to effectively manage PSPS events and reduce the risk of wildfires caused by power lines.

Submitted 13 November, 2023; originally announced November 2023.

Comments: 10 pages, 5 figures, 3 Tables. Accepted at IEEE ETFG 2023

arXiv:2309.02338 [pdf]

Sustainability assessment of Low Earth Orbit (LEO) satellite broadband megaconstellations

Authors: Ogutu B. Osoro, Edward J. Oughton, Andrew R. Wilson, Akhil Rao

Abstract: The growth of megaconstellations is rapidly increasing the number of rocket launches. While Low Earth Orbit (LEO) broadband satellites help to connect unconnected communities and achieve the Sustainable Development Goals (SDGs), there are also significant environmental emissions impacts from burning rocket fuels. We present sustainability analytics for phase 1 of the three main LEO constellations including Amazon Kuiper (3,236 satellites), Eutelsat Group's OneWeb (648 satellites), and SpaceX Starlink (4,425 satellites). We find that LEO megaconstellations provide substantially improved broadband speeds for rural and remote communities, but are roughly 6-8 times more emissions intensive (250 kg CO2eq/subscriber/year) than comparative terrestrial mobile broadband. In the worst-case emissions scenario, this rises to 12-14 times more (469 kg CO2eq/subscriber/year). Policy makers must carefully consider the trade-off between connecting unconnected communities to further the SDGs and mitigating the growing space sector environmental footprint, particularly regarding phase 2 plans to launch an order-of-magnitude more satellites.

Submitted 7 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

arXiv:2308.11902 [pdf, other]

Studying the Impact of Augmentations on Medical Confidence Calibration

Authors: Adrit Rao, Joon-Young Lee, Oliver Aalami

Abstract: The clinical explainability of convolutional neural networks (CNN) heavily relies on the joint interpretation of a model's predicted diagnostic label and associated confidence. A highly certain or uncertain model can significantly impact clinical decision-making. Thus, ensuring that confidence estimates reflect the true correctness likelihood for a prediction is essential. CNNs are often poorly calibrated and prone to overconfidence leading to improper measures of uncertainty. This creates the need for confidence calibration. However, accuracy and performance-based evaluations of CNNs are commonly used as the sole benchmark for medical tasks. Taking into consideration the risks associated with miscalibration is of high importance. In recent years, modern augmentation techniques, which cut, mix, and combine images, have been introduced. Such augmentations have benefited CNNs through regularization, robustness to adversarial samples, and calibration. Standard augmentations based on image scaling, rotating, and zooming, are widely leveraged in the medical domain to combat the scarcity of data. In this paper, we evaluate the effects of three modern augmentation techniques, CutMix, MixUp, and CutOut on the calibration and performance of CNNs for medical tasks. CutMix improved calibration the most while CutOut often lowered the level of calibration.

Submitted 23 August, 2023; originally announced August 2023.

Comments: ICCV CVAMD 2023
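
Of the three augmentations named in the abstract above, MixUp is the simplest to write down: it blends two samples and their labels with a single weight. The sketch below shows the general technique on flat lists and is not the paper's experimental setup. The helper names and the default `alpha=0.2` are assumptions made for the example.

```python
import random


def mixup(x1, y1, x2, y2, lam):
    """Blend two samples and their one-hot labels with weight `lam`.

    x1, x2: equal-length lists of pixel values (flattened images).
    y1, y2: equal-length one-hot (or soft) label vectors.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x1, x2)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y1, y2)]
    return x, y


def sample_lambda(alpha=0.2, rng=random):
    """Draw the mixing weight from Beta(alpha, alpha).

    The value alpha=0.2 is an illustrative default, not a setting
    reported in the abstract.
    """
    return rng.betavariate(alpha, alpha)


# Two tiny "images" with one-hot labels for a two-class problem.
mixed_x, mixed_y = mixup([0.0, 1.0], [1.0, 0.0],
                         [1.0, 1.0], [0.0, 1.0], lam=0.25)
```

Because the labels are blended along with the inputs, the network is trained on soft targets. This is one commonly cited reason why mixing-based augmentations can affect confidence calibration.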

arXiv:2305.05532 [pdf, other]

doi 10.1109/ICPHM57936.2023.10194112

An ensemble of convolution-based methods for fault detection using vibration signals

Authors: Xian Yeow Lee, Aman Kumar, Lasitha Vidyaratne, Aniruddha Rajendra Rao, Ahmed Farahat, Chetan Gupta

Abstract: This paper focuses on solving a fault detection problem using multivariate time series of vibration signals collected from planetary gearboxes in a test rig. Various traditional machine learning and deep learning methods have been proposed for multivariate time-series classification, including distance-based, functional data-oriented, feature-driven, and convolution kernel-based methods. Recent studies have shown that convolution kernel-based methods like ROCKET, and 1D convolutional neural networks with ResNet and FCN, have robust performance for multivariate time-series data classification. We propose an ensemble of three convolution kernel-based methods and show its efficacy on this fault detection problem by outperforming other approaches and achieving an accuracy of more than 98.8%.

Submitted 4 May, 2023; originally announced May 2023.

Comments: 12 Pages, 9 Figures, 2 Tables. Accepted at ICPHM 2023

Journal ref: 2023 IEEE International Conference on Prognostics and Health Management (ICPHM)

arXiv:2301.12688 [pdf, other]

Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production

Authors: Anyi Rao, Xuekun Jiang, Yuwei Guo, Linning Xu, Lei Yang, Libiao Jin, Dahua Lin, Bo Dai

Abstract: Amateurs working on mini-films and short-form videos usually spend lots of time and effort on the complicated multi-round process of setting and adjusting scenes, plots, and cameras to deliver satisfying video shots. We present Virtual Dynamic Storyboard (VDS) to allow users to storyboard shots in virtual environments, where the filming staff can easily test the settings of shots before the actual filming. VDS runs in a "propose-simulate-discriminate" mode: given a formatted story script and a camera script as input, it generates several character animation and camera movement proposals following predefined story and cinematic rules to allow an off-the-shelf simulation engine to render videos. To pick the top-quality dynamic storyboard from the candidates, we equip it with a shot ranking discriminator based on shot quality criteria learned from professional manually created data. VDS is comprehensively validated via extensive experiments and user studies, demonstrating its efficiency, effectiveness, and great potential in assisting amateur video production.

Submitted 21 July, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

Comments: Project page: https://virtualfilmstudio.github.io/

arXiv:2204.10836 [pdf, other]

doi 10.1038/s41467-022-33407-5

Federated Learning Enables Big Data for Rare Cancer Boundary Detection

Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada, Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer, et al. (254 additional authors not shown)

Abstract: Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train accurate and generalizable ML models, by only sharing numerical model updates. Here we present findings from the largest FL study to date, involving data from 71 healthcare institutions across 6 continents, to generate an automatic tumor boundary detector for the rare disease of glioblastoma, utilizing the largest dataset of such patients ever used in the literature (25,256 MRI scans from 6,314 patients). We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent. We anticipate our study to: 1) enable more studies in healthcare informed by large and diverse data, ensuring meaningful results for rare diseases and underrepresented populations, 2) facilitate further quantitative analyses for glioblastoma via performance optimization of our consensus model for eventual public release, and 3) demonstrate the effectiveness of FL at such scale and task complexity as a paradigm shift for multi-site collaborations, alleviating the need for data sharing.

Submitted 25 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

Comments: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS

arXiv:2201.05184 [pdf, ps, other]

Achieving AI-enabled Robust End-to-End Quality of Experience over Radio Access Networks

Authors: Dibbendu Roy, Aravinda S. Rao, Tansu Alpcan, Goutam Das, Marimuthu Palaniswami

Abstract: Emerging applications such as Augmented Reality, the Internet of Vehicles and Remote Surgery require both computing and networking functions working in harmony. The End-to-end (E2E) quality of experience (QoE) for these applications depends on the synchronous allocation of networking and computing resources. However, the relationship between the resources and the E2E QoE outcomes is typically stochastic and non-linear. In order to make efficient resource allocation decisions, it is essential to model these relationships. This article presents a novel machine-learning based approach to learn these relationships and concurrently orchestrate both resources for this purpose. The machine learning models further help make robust allocation decisions regarding stochastic variations and simplify robust optimization to a conventional constrained optimization. When resources are insufficient to accommodate all application requirements, our framework supports executing some of the applications with minimal degradation (graceful degradation) of E2E QoE. We also show how we can implement the learning and optimization methods in a distributed fashion by the Software-Defined Network (SDN) and Kubernetes technologies. Our results show that deep learning-based modelling achieves E2E QoE with approximately 99.8% accuracy, and our robust joint-optimization technique allocates resources efficiently when compared to existing differential services alternatives.

Submitted 13 January, 2022; originally announced January 2022.

arXiv:2109.02186 [pdf, other]

Achieving QoS for Real-Time Bursty Applications over Passive Optical Networks

Authors: Dibbendu Roy, Aravinda S. Rao, Tansu Alpcan, Goutam Das, Marimuthu Palaniswami

Abstract: Emerging real-time applications such as those classified under ultra-reliable low latency (uRLLC) generate bursty traffic and have strict Quality of Service (QoS) requirements. Passive Optical Network (PON) is a popular access network technology, which is envisioned to handle such applications at the access segment of the network. However, the existing standards cannot handle strict QoS constraints. The available solutions rely on instantaneous heuristic decisions and maintain QoS constraints (mostly bandwidth) in an average sense. Existing works with optimal strategies are computationally complex and are not suitable for uRLLC applications. This paper presents a novel computationally efficient, far-sighted bandwidth allocation policy design for facilitating bursty traffic in a PON framework while satisfying the strict QoS (age of information/delay and bandwidth) requirements of modern applications. To this end, we first design a delay-tracking mechanism which allows us to model the resource allocation problem from a control-theoretic viewpoint as a Model Predictive Control (MPC) problem. MPC helps in taking far-sighted decisions regarding resource allocations and captures the time-varying dynamics of the network. We provide computationally efficient polynomial-time solutions and show their implementation in the PON framework. Compared to existing approaches, MPC reduces delay violations by approximately 15% for a delay-constrained application with a 1 ms target. Our approach is also robust to varying traffic arrivals.

Submitted 5 September, 2021; originally announced September 2021.

arXiv:2109.01486 [pdf, other]

Studying the Effects of Self-Attention for Medical Image Analysis

Authors: Adrit Rao, Jongchan Park, Sanghyun Woo, Joon-Young Lee, Oliver Aalami

Abstract: When the trained physician interprets medical images, they understand the clinical importance of visual features. By applying cognitive attention, they apply greater focus onto clinically relevant regions while disregarding unnecessary features. The use of computer vision to automate the classification of medical images is widely studied. However, the standard convolutional neural network (CNN) does not necessarily employ subconscious feature relevancy evaluation techniques similar to the trained medical specialist and evaluates features more generally. Self-attention mechanisms enable CNNs to focus more on semantically important regions or aggregated relevant context with long-range dependencies. By using attention, medical image analysis systems can potentially become more robust by focusing on more important clinical feature regions. In this paper, we provide a comprehensive comparison of various state-of-the-art self-attention mechanisms across multiple medical image analysis tasks. Through both quantitative and qualitative evaluations along with a clinical user-centric survey study, we aim to provide a deeper understanding of the effects of self-attention in medical computer vision tasks.

Submitted 2 September, 2021; originally announced September 2021.

Comments: ICCV 2021 CVAMD

arXiv:2107.07137 [pdf]

Direct-drive ocean wave-powered batch reverse osmosis

Authors: Katie M. Brodersen, Emily A. Bywater, Alec M. Lanter, Hayden H. Schennum, Kumansh N. Furia, Maulee K. Sheth, Nathaniel S. Kiefer, Brittany K. Cafferty, Akshay K. Rao, Jose M. Garcia, David M. Warsinger

Abstract: Ocean waves provide a consistent, reliable source of clean energy making them a viable energy source for desalination. Ocean wave energy is useful to coastal communities, especially island nations. However, large capital costs render current wave-powered desalination technologies economically infeasible. This work presents a high efficiency configuration for ocean wave energy powering batch reverse osmosis. The proposed system uses seawater as the working fluid in a hydro-mechanical wave energy converter and replaces the reverse osmosis high-pressure pump with a hydraulic converter for direct-drive coupling. This allows for minimal intermediary power conversions, fewer components, and higher efficiencies. The concept was analyzed with MATLAB to model the transient energy dynamics of the wave energy converter, power take-off system, and desalination load. The fully hydro-mechanical coupling, incorporating energy recovery, could achieve an SEC and LCOW as low as 2.30 kWh/m3 and $1.96, respectively, for different sea states. The results were validated at the sub-system level against existing literature on wave energy models and previous work completed on batch reverse osmosis models, as this system was the first to combine these two technologies. SEC and LCOW values were validated by comparing to known and predicted values for various types of RO systems.

Submitted 15 July, 2021; originally announced July 2021.

Comments: 23 pages, 10 figures

arXiv:2104.09748 [pdf, other]

Waveform Phasicity Prediction from Arterial Sounds through Spectrogram Analysis using Convolutional Neural Networks for Limb Perfusion Assessment

Authors: Adrit Rao, Kevin Battenfield, Oliver Aalami

Abstract: Peripheral Arterial Disease (PAD) is a common form of arterial occlusive disease that is challenging to evaluate at the point of care. Hand-held dopplers are the most ubiquitous devices used to evaluate circulation and allow providers to audibly "listen" to the blood flow. Providers use the audible feedback to subjectively assess whether the sound characteristics are consistent with Monophasic, Biphasic, or Triphasic waveforms. Subjective assessment of doppler sounds raises suspicion of PAD and leads to further testing, often delaying definitive treatment. Misdiagnoses are also possible with subjective interpretation of doppler waveforms. This paper presents a Deep Learning system that can predict waveform phasicity through analysis of hand-held doppler sounds. We collected 268 four-second recordings on an iPhone taken during a formal vascular lab study in patients with cardiovascular disease. Our end-to-end system works by converting input sound into a spectrogram, which visually represents frequency changes in temporal patterns. This conversion enables visual differentiation between the phasicity classes. With these changes present, a custom-trained Convolutional Neural Network (CNN) is used for prediction through learned feature extraction. The performance of the model was evaluated via calculation of the F1 score and accuracy metrics. The system achieved an F1 score of 90.57% and an accuracy of 96.23%. Our Deep Learning system is not computationally expensive and can be integrated into several applications. When used in a clinic, this system has the capability of preventing misdiagnosis and gives practitioners a second opinion that can be useful in the evaluation of PAD.

Submitted 15 June, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

Comments: 5 pages, 8 figures
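
The abstract above reports an F1 score and an accuracy for a three-class problem but does not say how F1 was averaged across classes. The sketch below assumes macro-averaging purely for illustration. The labels and predictions are made up and do not reproduce the reported 90.57% or 96.23%.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the reference labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (macro-averaging is an
    assumption here; the abstract does not state the averaging used)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Illustrative labels only: one triphasic recording is misread as biphasic.
y_true = ["mono", "bi", "tri", "tri"]
y_pred = ["mono", "bi", "tri", "bi"]

acc = accuracy(y_true, y_pred)
f1 = macro_f1(y_true, y_pred)
```

The two metrics can diverge noticeably when classes are imbalanced, which may be why the abstract reports both.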

arXiv:2103.10533 [pdf, other]

Resilient Cooperative Adaptive Cruise Control for Autonomous Vehicles Using Machine Learning

Authors: Srivalli Boddupalli, Akash Someshwar Rao, Sandip Ray

Abstract: Cooperative Adaptive Cruise Control (CACC) is a fundamental connected vehicle application that extends Adaptive Cruise Control by exploiting vehicle-to-vehicle (V2V) communication. CACC is a crucial ingredient for numerous autonomous vehicle functionalities including platooning, distributed route management, etc. Unfortunately, malicious V2V communications can subvert CACC, leading to string instability and road accidents. In this paper, we develop a novel resiliency infrastructure, RACCON, for detecting and mitigating V2V attacks on CACC. RACCON uses machine learning to develop an on-board prediction model that captures anomalous vehicular responses and performs mitigation in real time. RACCON-enabled vehicles can exploit the high efficiency of CACC without compromising safety, even under potentially adversarial scenarios. We present extensive experimental evaluation to demonstrate the efficacy of RACCON.

Submitted 18 March, 2021; originally announced March 2021.

arXiv:2008.03548 [pdf, other]

A Unified Framework for Shot Type Classification Based on Subject Centric Lens

Authors: Anyi Rao, Jiaze Wang, Linning Xu, Xuekun Jiang, Qingqiu Huang, Bolei Zhou, Dahua Lin

Abstract: Shots are key narrative elements of various videos, e.g. movies, TV series, and user-generated videos that are thriving over the Internet. The types of shots greatly influence how the underlying ideas, emotions, and messages are expressed. The technique to analyze shot types is important to the understanding of videos, which has seen increasing demand in real-world applications in this era. Classifying shot type is challenging due to the additional information required beyond the video content, such as the spatial composition of a frame and camera movement. To address these issues, we propose a learning framework Subject Guidance Network (SGNet) for shot type recognition. SGNet separates the subject and background of a shot into two streams, serving as separate guidance maps for scale and movement type classification respectively. To facilitate shot type analysis and model evaluations, we build a large-scale dataset MovieShots, which contains 46K shots from 7K movie trailers with annotations of their scale and movement types. Experiments show that our framework is able to recognize these two attributes of shot accurately, outperforming all the previous methods.

Submitted 8 August, 2020; originally announced August 2020.

Comments: ECCV2020. Project page: https://anyirao.com/projects/ShotType.html

arXiv:2008.03546 [pdf, other]

Online Multi-modal Person Search in Videos

Authors: Jiangyue Xia, Anyi Rao, Qingqiu Huang, Linning Xu, Jiangtao Wen, Dahua Lin

Abstract: The task of searching certain people in videos has seen increasing potential in real-world applications, such as video organization and editing. Most existing approaches are devised to work in an offline manner, where identities can only be inferred after an entire video is examined. This working manner precludes such methods from being applied to online services or those applications that require real-time responses. In this paper, we propose an online person search framework, which can recognize people in a video on the fly. This framework maintains a multimodal memory bank at its heart as the basis for person recognition, and updates it dynamically with a policy obtained by reinforcement learning. Our experiments on a large movie dataset show that the proposed method is effective, not only achieving remarkable improvements over online schemes but also outperforming offline methods.

Submitted 8 August, 2020; originally announced August 2020.

Comments: ECCV2020. Project page: http://movienet.site/projects/eccv20onlineperson.html

arXiv:2006.15460 [pdf]

Fast automatic segmentation of thalamic nuclei from MP2RAGE acquisition at 7 Tesla

Authors: Ritobrato Datta, Micky K. Bacchus, Dushyant Kumar, Mark A. Elliott, Aditya Rao, Sudipto Dolui, Ravinder Reddy, Brenda L Banwell, Manojkumar Saranathan

Abstract: Purpose: Thalamic nuclei are largely invisible in conventional MRI due to poor contrast. Thalamus Optimized Multi-Atlas Segmentation (THOMAS) provides automatic segmentation of 12 thalamic nuclei using white-matter-nulled (WMn) MPRAGE sequence at 7T. Application of THOMAS to Magnetization Prepared 2 Rapid Gradient Echo (MP2RAGE) sequence acquired at 7T has been investigated in this study. Methods: 8 healthy volunteers and 5 pediatric-onset multiple sclerosis patients were recruited at the Children's Hospital of Philadelphia and scanned at Siemens 7T with WMn-MPRAGE and multi-echo MP2RAGE (ME-MP2RAGE) sequences. White-matter-nulled contrast was synthesized (MP2-SYN) from T1 maps from ME-MP2RAGE sequence. Thalamic nuclei were segmented using THOMAS joint label fusion algorithm from WMn-MPRAGE and MP2-SYN datasets. THOMAS pipeline was modified to use majority voting to segment the bias corrected MP2-UNI images. Thalamic nuclei from MP2-SYN and MP2-UNI images were evaluated against corresponding nuclei obtained from WMn-MPRAGE images using dice coefficients, volume similarity indices (VSI) and distance between centroids. Results: For MP2-SYN, dice > 0.85 and VSI > 0.95 were achieved for the 5 larger nuclei and dice > 0.6 and VSI > 0.7 were achieved for the 7 smaller nuclei. The dice and VSI were slightly higher whilst the distances between centroids were smaller for MP2-SYN compared to MP2-UNI, indicating improved performance using the synthesized WMn image. Discussion: THOMAS algorithm can successfully segment thalamic nuclei in routinely acquired bias-free MP2RAGE images with essentially equivalent quality when evaluated against WMn-MPRAGE, hence has wider applicability in studies focused on thalamic involvement in aging and disease.

Submitted 27 June, 2020; originally announced June 2020.

Comments: 13 pages, 4 figures

arXiv:2002.03454 [pdf]

Intelligent Receivers for Electronic Warfare Applications

Authors: Anantha K. Karthik, Jameer Ali M. S, A. Bhagavathi Rao

Abstract: In this paper, we propose an algorithm to perform modulation classification on a 5-class problem consisting of AM, 2-PSK, 4-PSK, 8-PSK and 16-QAM modulation schemes using a combination of features based on the first order cyclostationarity, second- and higher-order moments, and then extend the idea of classification to an intelligent receiver which classifies and demodulates the signal without prior information regarding the transmitted signal.

Submitted 9 February, 2020; originally announced February 2020.

Comments: Classification, Higher-order statistics, moments, fading, adaptive receivers. Paper published at EWCI-2014

arXiv:2002.03451 [pdf]

A Novel Method for Spectrum Sensing of Linear Modulation Schemes

Authors: Anantha K. Karthik, Jameer Ali M. S, Mohammed Zafar Ali Khan, A. Bhagavathi Rao

Abstract: In this paper, we propose and evaluate a novel algorithm for performing spectrum sensing on linear modulations based on second-order cyclic features of the received signals. The proposed approach has similar computational complexity to that of energy detection and outperforms energy detection and other sensing schemes such as Eigenvalue based sensing in the presence of noise uncertainties for a given value of the probability of false alarm.

Submitted 9 February, 2020; originally announced February 2020.

Comments: Cyclostationary based detection, eigen-value base sensing, electronic warfare, spectrum sensing. Published and presented at EWCI-2014

arXiv:2001.00977 [pdf, other]

RF Fingerprinting and Deep Learning Assisted UE Positioning in 5G

Authors: M Majid Butt, Anil Rao, Daejung Yoon

Abstract: In this work, we investigate user equipment (UE) positioning assisted by deep learning (DL) in 5G and beyond networks. As compared to state of the art positioning algorithms used in today's networks, radio signal fingerprinting and machine learning (ML) assisted positioning requires smaller additional feedback overhead; and the positioning estimates are made directly inside the radio access network (RAN), thereby assisting in radio resource management. The conventional positioning algorithms will be used as back-up for the environments with high variability in conditions; but ML-assisted positioning serves as a more efficient and simpler technique to provide better or similar positioning accuracy. In this regard, we study ML-assisted positioning methods and evaluate their performance using system level simulations for an outdoor scenario in Lincoln Park, Chicago. The study is based on the use of raytracing tools, a 3GPP 5G NR compliant system level simulator and DL framework to estimate positioning accuracy of the UE. The use of raytracing tool and system level simulator helps avoid expensive drive test measurements in practical scenarios. Our proposed mechanism is a first step towards more proactive mobility management in future networks. We evaluate and compare performance of various DL models and show mean positioning error in the range of 1-1.5m for the best DL configuration with appropriate system feature-modeling.

Submitted 3 January, 2020; originally announced January 2020.

Comments: submitted to ICC 2020

arXiv:1905.10841 [pdf]

Utilizing Automated Breast Cancer Detection to Identify Spatial Distributions of Tumor Infiltrating Lymphocytes in Invasive Breast Cancer

Authors: Han Le, Rajarsi Gupta, Le Hou, Shahira Abousamra, Danielle Fassler, Tahsin Kurc, Dimitris Samaras, Rebecca Batiste, Tianhao Zhao, Arvind Rao, Alison L. Van Dyke, Ashish Sharma, Erich Bremer, Jonas S. Almeida, Joel Saltz

Abstract: Quantitative assessment of Tumor-TIL spatial relationships is increasingly important in both basic science and clinical aspects of breast cancer research. We have developed and evaluated convolutional neural network (CNN) analysis pipelines to generate combined maps of cancer regions and tumor infiltrating lymphocytes (TILs) in routine diagnostic breast cancer whole slide tissue images (WSIs). We produce interactive whole slide maps that 1) provide insight about the structural patterns and spatial distribution of lymphocytic infiltrates and 2) facilitate improved quantification of TILs. We evaluated both tumor and TIL analyses using three CNN networks - Resnet-34, VGG16 and Inception v4, and demonstrated that the results compared favorably to those obtained by what we believe are the best published methods. We have produced open-source tools and generated a public dataset consisting of tumor/TIL maps for 1,015 TCGA breast cancer images. We also present a customized web-based interface that enables easy visualization and interactive exploration of high-resolution combined Tumor-TIL maps for 1,015 TCGA invasive breast cancer cases that can be downloaded for further downstream analyses.

Submitted 13 January, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

Comments: The American Journal of Pathology

arXiv:1803.09033 [pdf, other]

Automatic Music Accompanist

Authors: Anyi Rao, Francis Lau

Abstract: Automatic musical accompaniment is where a human musician is accompanied by a computer musician. The computer musician is able to produce musical accompaniment that relates musically to the human performance. The accompaniment should follow the performance using observations of the notes they are playing. This paper describes a complete and detailed construction of a score following and accompanying system using Hidden Markov Models (HMMs). It details how to train a score HMM, how to deal with polyphonic input, how this HMM works when following a score, and how to build up a musical accompanist. It proposes a new parallel hidden Markov model for score following and a fast decoding algorithm to deal with performance errors.

Submitted 23 March, 2018; originally announced March 2018.

arXiv:1712.08227 [pdf, other]

Analysis-synthesis model learning with shared features: a new framework for histopathological image classification

Authors: Xuelu Li, Vishal Monga, U. K. Arvind Rao

Abstract: Automated histopathological image analysis offers exciting opportunities for the early diagnosis of several medical conditions including cancer. There are however stiff practical challenges: 1.) discriminative features from such images for separating diseased vs. healthy classes are not readily apparent, and 2.) distinct classes, e.g. healthy vs. stages of disease continue to share several geometric features. We propose a novel Analysis-synthesis model Learning with Shared Features algorithm (ALSF) for classifying such images more effectively. In ALSF, a joint analysis and synthesis learning model is introduced to learn the classifier and the feature extractor at the same time. In this way, the computation load in patch-level based image classification can be much reduced. Crucially, we integrate into this framework the learning of a low rank shared dictionary and a shared analysis operator, which more accurately represents both similarities and differences in histopathological images from distinct classes. ALSF is evaluated on two challenging databases: (1) kidney tissue images provided by the Animal Diagnosis Lab (ADL) at the Pennsylvania State University and (2) brain tumor images from The Cancer Genome Atlas (TCGA) database. Experimental results confirm that ALSF can offer benefits over state of the art alternatives.

Submitted 21 December, 2017; originally announced December 2017.

Comments: 2018 ISBI conference accepted paper

Showing 1–25 of 25 results for author: Rao, A