Enhancing Track Management Systems with Vehicle-To-Vehicle Enabled Sensor Fusion

Authors: Thomas Billington, Ansh Gwash, Aadi Kothari, Lucas Izquierdo, Timothy Talty

Abstract: In the rapidly advancing landscape of connected and automated vehicles (CAV), the integration of Vehicle-to-Everything (V2X) communication in traditional fusion systems presents a promising avenue for enhancing vehicle perception. Addressing current limitations with vehicle sensing, this paper proposes a novel Vehicle-to-Vehicle (V2V) enabled track management system that leverages the synergy between V2V signals and detections from radar and camera sensors. The core innovation lies in the creation of independent priority track lists, consisting of fused detections validated through V2V communication. This approach enables more flexible and resilient thresholds for track management, particularly in scenarios with numerous occlusions where the tracked objects move outside the field of view of the perception sensors. The proposed system considers the implications of falsification of V2X signals, which is combated through an initial vehicle identification process using detections from perception sensors. Presented are the fusion algorithm, simulated environments, and validation mechanisms. Experimental results demonstrate the improved accuracy and robustness of the proposed system in common driving scenarios, highlighting its potential to advance the reliability and efficiency of autonomous vehicles.

Submitted 26 April, 2024; originally announced April 2024.

Comments: 6 pages, 5 figures

arXiv:2404.15549 [pdf, other]

PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models

Authors: Shashi Kant Gupta, Aditya Basu, Mauro Nievas, Jerrin Thomas, Nathan Wolfrath, Adhitya Ramamurthi, Bradley Taylor, Anai N. Kothari, Regina Schwind, Therica M. Miller, Sorena Nadaf-Rahrov, Yanshan Wang, Hrituraj Singh

Abstract: Clinical trial matching is the task of identifying trials for which patients may be potentially eligible. Typically, this task is labor-intensive and requires detailed verification of patient electronic health records (EHRs) against the stringent inclusion and exclusion criteria of clinical trials. This process is manual, time-intensive, and challenging to scale up, resulting in many patients missing out on potential therapeutic options. Recent advancements in Large Language Models (LLMs) have made automating patient-trial matching possible, as shown in multiple concurrent research studies. However, the current approaches are confined to constrained, often synthetic datasets that do not adequately mirror the complexities encountered in real-world medical data. In this study, we present the first end-to-end, large-scale empirical evaluation of clinical trial matching using real-world EHRs. Our study showcases the capability of LLMs to accurately match patients with appropriate clinical trials. We perform experiments with proprietary LLMs, including GPT-4 and GPT-3.5, as well as our custom fine-tuned model called OncoLLM, and show that OncoLLM, despite its significantly smaller size, not only outperforms GPT-3.5 but also matches the performance of qualified medical doctors. All experiments were carried out on real-world EHRs that include clinical notes and available clinical trials from a single cancer center in the United States.

Submitted 26 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

Comments: 30 Pages, 8 Figures, Supplementary Work Attached

arXiv:2404.06680 [pdf, other]

Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology

Authors: Shashi Kant Gupta, Aditya Basu, Bradley Taylor, Anai Kothari, Hrituraj Singh

Abstract: Retrieving information from EHR systems is essential for answering specific questions about patient journeys and improving the delivery of clinical care. Despite this fact, most EHR systems still rely on keyword-based searches. With the advent of generative large language models (LLMs), retrieving information can lead to better search and summarization capabilities. Such retrievers can also feed Retrieval-augmented generation (RAG) pipelines to answer any query. However, the task of retrieving information from real-world clinical data contained within EHR systems in order to solve several downstream use cases is challenging due to the difficulty in creating query-document support pairs. We provide a blueprint for creating such datasets in an affordable manner using large language models. Our method results in a retriever that is 30-50 F-1 points better than proprietary counterparts such as Ada and Mistral for oncology data elements. We further compare our model, called Onco-Retriever, against a fine-tuned PubMedBERT model as well. We conduct an extensive manual evaluation on real-world EHR data along with latency analysis of the different models and provide a path forward for healthcare organizations to build domain-specific retrievers.

Submitted 9 April, 2024; originally announced April 2024.

Comments: 18 pages

arXiv:2310.03314 [pdf, other]

Enhanced Human-Robot Collaboration using Constrained Probabilistic Human-Motion Prediction

Authors: Aadi Kothari, Tony Tohme, Xiaotong Zhang, Kamal Youcef-Toumi

Abstract: Human motion prediction is an essential step for efficient and safe human-robot collaboration. Current methods either purely rely on representing the human joints in some form of neural network-based architecture or use regression models offline to fit hyper-parameters in the hope of capturing a model encompassing human motion. While these methods provide good initial results, they are missing out on leveraging well-studied human body kinematic models as well as body and scene constraints which can help boost the efficacy of these prediction frameworks while also explicitly avoiding implausible human joint configurations. We propose a novel human motion prediction framework that incorporates human joint constraints and scene constraints in a Gaussian Process Regression (GPR) model to predict human motion over a set time horizon. This formulation is combined with an online context-aware constraints model to leverage task-dependent motions. It is tested on a human arm kinematic model and implemented on a human-robot collaborative setup with a UR5 robot arm to demonstrate the real-time capability of our approach. Simulations were also performed on datasets like HA4M and ANDY. The simulation and experimental results demonstrate considerable improvements in a Gaussian Process framework when these constraints are explicitly considered.

Submitted 5 October, 2023; originally announced October 2023.

Comments: 7 pages, 5 figures. Associated video demonstration can be found at https://www.youtube.com/@MITMechatronics

arXiv:2308.12820 [pdf, other]

Prediction without Preclusion: Recourse Verification with Reachable Sets

Authors: Avni Kothari, Bogdan Kulynych, Tsui-Wei Weng, Berk Ustun

Abstract: Machine learning models are often used to decide who receives a loan, a job interview, or a public benefit. Models in such settings use features without considering their actionability. As a result, they can assign predictions that are fixed $-$ meaning that individuals who are denied loans and interviews are, in fact, precluded from access to credit and employment. In this work, we introduce a procedure called recourse verification to test if a model assigns fixed predictions to its decision subjects. We propose a model-agnostic approach for recourse verification with reachable sets $-$ i.e., the set of all points that a person can reach through their actions in feature space. We develop methods to construct reachable sets for discrete feature spaces, which can certify the responsiveness of any model by simply querying its predictions. We conduct a comprehensive empirical study on the infeasibility of recourse on datasets from consumer finance. Our results highlight how models can inadvertently preclude access by assigning fixed predictions and underscore the need to account for actionability in model development.

Submitted 1 May, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

Comments: ICLR 2024 Spotlight. The first two authors contributed equally

arXiv:2304.14807 [pdf, other]

Deep Learning assisted microwave-plasma interaction based technique for plasma density estimation

Authors: Pratik Ghosh, Bhaskar Chaudhury, Shishir Purohit, Vishv Joshi, Ashray Kothari, Devdeep Shetranjiwala

Abstract: The electron density is a key parameter to characterize any plasma. Most of the plasma applications and research in the area of low-temperature plasmas (LTPs) are based on the accurate estimations of plasma density and plasma temperature. The conventional methods for electron density measurements offer axial and radial profiles for any given linear LTP device. These methods have major disadvantages: a limited operational range, cumbersome instrumentation, and complicated data analysis procedures. The article proposes a Deep Learning (DL) assisted microwave-plasma interaction-based non-invasive strategy, which can be used as a new alternative approach to address some of the challenges associated with existing plasma density measurement techniques. The electric field pattern due to microwave scattering from plasma is utilized to estimate the density profile. The proof of concept is tested for a simulated training data set comprising a low-temperature, unmagnetized, collisional plasma. Different types of symmetric (Gaussian-shaped) and asymmetrical density profiles, in the range $10^{16}-10^{19}$ m$^{-3}$, addressing a range of experimental configurations have been considered in our study. Real-life experimental issues such as the presence of noise and the amount of measured data (dense vs sparse) have been taken into consideration while preparing the synthetic training data sets. The DL-based technique has the capability to determine the electron density profile within the plasma. The performance of the proposed deep learning-based approach has been evaluated using three metrics: SSIM, RMSLE, and MAPE. The obtained results show promising performance in estimating the 2D radial profile of the density for the given linear plasma device and affirm the potential of the proposed ML-based approach in plasma diagnostics.

Submitted 28 June, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

arXiv:2304.10200 [pdf, other]

Inkwell: Design and Validation of a Low-Cost Open Electricity-Free 3D Printed Device for Automated Thin Smearing of Whole Blood

Authors: Jerome Nowak, Anesta Kothari, Hongquan Li, Jaspreet Pannu, Dani Algazi, Manu Prakash

Abstract: Microscopy plays a crucial role in hematology and diagnosis of infectious diseases worldwide. For malaria alone, more than 200 million slides are read by manual microscopists every year. High quality thin blood smears are essential for subsequent microscopy examinations including malaria microscopy, but are hard to make in field settings. Existing devices for assisting in making thin smears are available but are limited by cost or complexity for wider use. Here we present Inkwell, a portable mechanical device capable of making high quality thin blood smears in field settings. Inkwell is simple, low-cost, does not use electricity, and requires minimal training prior to use. By utilizing passive dissipative dynamics of a spiral spring coupled to an air dashpot with a tunable valve, we demonstrate a highly tunable mechanism for constant velocity smears at a prescribed angle. Inkwell is capable of producing high quality blood smears of tunable cell density with more than 12 million individually distinguishable red blood cells on a single slide. The current design, which exploits precision manufacturing of a 17-cent plastic syringe and a spring, can be printed on a standard 3D printer with overall unit cost of less than a few dollars in large quantities. We further present usability tests to confirm performance over 10,000 unit cycle operations with no degradation in quality of the smear and demonstrate ease of use with minimal training. Inkwell enhances the broader toolbox of open innovations in diagnostics for providing high quality medical care in low and medium resource settings. Combined with the rise of 3D printing, Inkwell presents an alternative to traditional centralized manufacturing and opens up distributed manufacturing of medical diagnostics in a global context.

Submitted 20 April, 2023; originally announced April 2023.

arXiv:2201.01483 [pdf, other]

Risk Bounded Nonlinear Robot Motion Planning With Integrated Perception & Control

Authors: Venkatraman Renganathan, Sleiman Safaoui, Aadi Kothari, Benjamin Gravell, Iman Shames, Tyler Summers

Abstract: Robust autonomy stacks require tight integration of perception, motion planning, and control layers, but these layers often inadequately incorporate inherent perception and prediction uncertainties, either ignoring them altogether or making questionable assumptions of Gaussianity. Robots with nonlinear dynamics and complex sensing modalities operating in an uncertain environment demand more careful consideration of how uncertainties propagate across stack layers. We propose a framework to integrate perception, motion planning, and control by explicitly incorporating perception and prediction uncertainties into planning so that risks of constraint violation can be mitigated. Specifically, we use a nonlinear model predictive control based steering law coupled with a decorrelation scheme based Unscented Kalman Filter for state and environment estimation to propagate the robot state and environment uncertainties. Subsequently, we use distributionally robust risk constraints to limit the risk in the presence of these uncertainties. Finally, we present a layered autonomy stack consisting of a nonlinear steering-based distributionally robust motion planning module and a reference trajectory tracking module. Our numerical experiments with nonlinear robot models and an urban driving simulator show the effectiveness of our proposed approaches.

Submitted 5 January, 2022; originally announced January 2022.

Comments: arXiv admin note: text overlap with arXiv:2002.02928

arXiv:2107.04767 [pdf, other]

Anomaly Detection in Residential Video Surveillance on Edge Devices in IoT Framework

Authors: Mayur R. Parate, Kishor M. Bhurchandi, Ashwin G. Kothari

Abstract: Intelligent resident surveillance is one of the most essential smart community services. The increasing demand for security requires surveillance systems to be able to detect anomalies in surveillance scenes. Employing high-capacity computational devices for intelligent surveillance in residential societies is costly and not feasible. Therefore, we propose anomaly detection for intelligent surveillance using CPU-only edge devices. A modular framework to capture object-level inferences and tracking is developed. To cope with partial occlusions, posture deformations, and complex scenes, we employed feature encoding and trajectory association governed by two metrics that complement each other. The elements of an anomaly detection framework are optimized to run on CPU-only edge devices with sufficient frames per second (FPS). The experimental results indicate the proposed method is feasible and achieves satisfactory results in real-life scenarios.

Submitted 9 August, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

Comments: 7 Pages, 7 Figures and 3 Tables

arXiv:2005.14408 [pdf, other]

doi 10.18653/v1/2021.naacl-industry.25

Noise Robust Named Entity Understanding for Voice Assistants

Authors: Deepak Muralidharan, Joel Ruben Antony Moniz, Sida Gao, Xiao Yang, Justine Kao, Stephen Pulman, Atish Kothari, Ray Shen, Yinying Pan, Vivek Kaul, Mubarak Seyed Ibrahim, Gang Xiang, Nan Dun, Yidan Zhou, Andy O, Yuan Zhang, Pooja Chitkara, Xuan Wang, Alkesh Patel, Kushal Tayal, Roger Zheng, Peter Grasch, Jason D. Williams, Lin Li

Abstract: Named Entity Recognition (NER) and Entity Linking (EL) play an essential role in voice assistant interaction, but are challenging due to the special difficulties associated with spoken user queries. In this paper, we propose a novel architecture that jointly solves the NER and EL tasks by combining them in a joint reranking module. We show that our proposed framework improves NER accuracy by up to 3.13% and EL accuracy by up to 3.6% in F1 score. The features used also lead to better accuracies in other natural language understanding tasks, such as domain classification and semantic parsing.

Submitted 10 August, 2021; v1 submitted 29 May, 2020; originally announced May 2020.

Comments: NAACL 2021 Industry Track

MSC Class: 68T50 ACM Class: I.2.7

arXiv:2004.13494 [pdf, other]

Project 1000 x 1000: Centrifugal melt spinning for distributed manufacturing of N95 filtering facepiece respirators

Authors: Anton Molina, Pranav Vyas, Nikita Khlystov, Shailabh Kumar, Anesta Kothari, Dave Deriso, Zhiru Liu, Samhita Banavar, Eliott Flaum, Manu Prakash

Abstract: The COVID-19 pandemic has caused a global shortage of personal protective equipment. While existing supply chains are struggling to meet the surge in demand, the limited supply of N95 filtering facepiece respirators (FFRs) has placed healthcare workers at risk. This paper presents a method for scalable and distributed manufacturing of FFR filter material based on centrifugal melt spinning, utilizing readily available cotton candy machines as an example. The proposed method produces nonwoven polypropylene fabric material with filtering efficiency of up to 96% for particles 0.30-0.49 μm in diameter. We additionally demonstrate a scalable means to test for filtration efficiency and pressure drop to ensure a standardized degree of quality in the output material. We perform preliminary optimization of relevant parameters for scale-up and propose that this is a viable method to rapidly produce up to one million N95 FFRs per day in a distributed manner with just six machines per site operating across 200 locations. We share this work as a starting point for others to rapidly construct, replicate and develop their own affordable modular processes aimed at producing high quality filtration material to address the current FFR shortage globally.

Submitted 26 April, 2020; originally announced April 2020.

Comments: 12 pages, 5 figures, To whom correspondence should be addressed: [email protected]

arXiv:1909.09143 [pdf, ps, other]

Leveraging User Engagement Signals For Entity Labeling in a Virtual Assistant

Authors: Deepak Muralidharan, Justine Kao, Xiao Yang, Lin Li, Lavanya Viswanathan, Mubarak Seyed Ibrahim, Kevin Luikens, Stephen Pulman, Ashish Garg, Atish Kothari, Jason Williams

Abstract: Personal assistant AI systems such as Siri, Cortana, and Alexa have become widely used as a means to accomplish tasks through natural language commands. However, components in these systems generally rely on supervised machine learning algorithms that require large amounts of hand-annotated training data, which is expensive and time-consuming to collect. The ability to incorporate unsupervised, weakly supervised, or distantly supervised data holds significant promise in overcoming this bottleneck. In this paper, we describe a framework that leverages user engagement signals (user behaviors that demonstrate a positive or negative response to content) to automatically create granular entity labels for training data augmentation. Strategies such as multi-task learning and validation using an external knowledge base are employed to incorporate the engagement-annotated data and to boost the model's accuracy on a sequence labeling task. Our results show that learning from data automatically labeled by user engagement signals achieves significant accuracy gains in a production deep learning system, when measured both on the sequence labeling task and on user-facing results produced by the system end-to-end. We believe this is the first use of user engagement signals to help generate training data for a sequence labeling task on a large scale, and can be applied in practical settings to speed up new feature deployment when little human-annotated data is available.

Submitted 18 September, 2019; originally announced September 2019.

Comments: NeurIPS 2018 Conversational AI Workshop

arXiv:1811.02881 [pdf]

Blockchain and human episodic memory

Authors: Seong Hah Cho, Cody A Cushing, Kunal Patel, Alok Kothari, Rongjian Lan, Matthias Michel, Mouslim Cherkaoui, Hakwan Lau

Abstract: We relate the concepts used in decentralized ledger technology to studies of episodic memory in the mammalian brain. Specifically, we introduce the standard concepts of linked list, hash functions, and sharding, from computer science. We argue that these concepts may be more relevant to studies of the neural mechanisms of memory than has been previously appreciated. In turn, we also highlight that certain phenomena studied in the brain, namely metacognition, reality monitoring, and how perceptual conscious experiences come about, may inspire development in blockchain technology too, specifically regarding probabilistic consensus protocols.

Submitted 16 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

Comments: 30 pages, 2 figures; Minor edits, added figures, revised and updated sections

arXiv:1510.03964 [pdf, other]

Pathway Tools version 24.0: Integrated Software for Pathway/Genome Informatics and Systems Biology

Authors: Peter D. Karp, Suzanne M. Paley, Peter E. Midford, Markus Krummenacker, Richard Billington, Anamika Kothari, Wai Kit Ong, Pallavi Subhraveti, Ingrid M. Keseler, Ron Caspi

Abstract: Pathway Tools is a bioinformatics software environment with a broad set of capabilities. The software provides genome-informatics tools such as a genome browser, sequence alignments, a genome-variant analyzer, and comparative-genomics operations. It offers metabolic-informatics tools, such as metabolic reconstruction, quantitative metabolic modeling, prediction of reaction atom mappings, and metabolic route search. Pathway Tools also provides regulatory-informatics tools, such as the ability to represent and visualize a wide range of regulatory interactions. The software creates and manages a type of organism-specific database called a Pathway/Genome Database (PGDB), which the software enables database curators to interactively edit. It supports web publishing of PGDBs and provides a large number of query, visualization, and omics-data analysis tools. Scientists around the world have created more than 9,800 PGDBs by using Pathway Tools, many of which are curated databases for important model organisms. Those PGDBs can be exchanged using a peer-to-peer database-sharing system called the PGDB Registry.

Submitted 12 November, 2020; v1 submitted 14 October, 2015; originally announced October 2015.

Comments: Reflects Pathway Tools version 24.0 in October 2020; new information since the previous version is in blue text. 98 pages, 23 figures

Showing 1–14 of 14 results for author: Kothari, A