-
Unraveling the Dynamics of SPY Trading Volumes: A Comprehensive Analysis of Daily and Intraday Liquidity Trends
Authors:
Ananya Krishnan,
Martin Pollack,
Alma Cooper
Abstract:
In this project, we investigate the accuracy of forecasting intraday and daily trading volume of the exchange-traded fund SPY. The ability to forecast volume over varying time intervals with high accuracy is a critical element to many trading strategies. After performing exploratory data analysis on intraday and daily SPY data we identify three methods for our analysis: ARIMA and ARIMAX models, with or without seasonality, as well as a Frequency Domain Process Representation. To evaluate predictive power of our models, we use mean squared error, mean absolute percentage error, and volume weighted average price (VWAP) tracking error. All models for both intraday and daily data output strong VWAP predictions in comparison to the VWAP estimates produced by naive baseline methodologies. In both cases volume is most accurately forecasted using ARIMA models with exogenous variables in the form of technical indicators, with intraday incorporating a seasonal component and daily not.
Submitted 24 June, 2024;
originally announced June 2024.
-
Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control
Authors:
Alexander Blatt,
Aravind Krishnan,
Dietrich Klakow
Abstract:
Utilizing air-traffic control (ATC) data for downstream natural-language processing tasks requires preprocessing steps. Key steps are the transcription of the data via automatic speech recognition (ASR) and speaker diarization, respectively speaker role detection (SRD) to divide the transcripts into pilot and air-traffic controller (ATCO) transcripts. While traditional approaches take on these tasks separately, we propose a transformer-based joint ASR-SRD system that solves both tasks jointly while relying on a standard ASR architecture. We compare this joint system against two cascaded approaches for ASR and SRD on multiple ATC datasets. Our study shows in which cases our joint system can outperform the two traditional approaches and in which cases the other architectures are preferable. We additionally evaluate how acoustic and lexical differences influence all architectures and show how to overcome them for our joint architecture.
Submitted 19 June, 2024;
originally announced June 2024.
-
On the Encoding of Gender in Transformer-based ASR Representations
Authors:
Aravind Krishnan,
Badr M. Abdullah,
Dietrich Klakow
Abstract:
While existing literature relies on performance differences to uncover gender biases in ASR models, a deeper analysis is essential to understand how gender is encoded and utilized during transcript generation. This work investigates the encoding and utilization of gender in the latent representations of two transformer-based ASR models, Wav2Vec2 and HuBERT. Using linear erasure, we demonstrate the feasibility of removing gender information from each layer of an ASR model and show that such an intervention has minimal impacts on the ASR performance. Additionally, our analysis reveals a concentration of gender information within the first and last frames in the final layers, explaining the ease of erasing gender in these layers. Our findings suggest the prospect of creating gender-neutral embeddings that can be integrated into ASR frameworks without compromising their efficacy.
Submitted 14 June, 2024;
originally announced June 2024.
-
Learning from Natural Language Explanations for Generalizable Entity Matching
Authors:
Somin Wadhwa,
Adit Krishnan,
Runhui Wang,
Byron C. Wallace,
Chris Kong
Abstract:
Entity matching is the task of linking records from different sources that refer to the same real-world entity. Past work has primarily treated entity linking as a standard supervised learning problem. However, supervised entity matching models often do not generalize well to new data, and collecting exhaustive labeled training data is often cost prohibitive. Further, recent efforts have adopted LLMs for this task in few/zero-shot settings, exploiting their general knowledge. But LLMs are prohibitively expensive for performing inference at scale for real-world entity matching tasks.
As an efficient alternative, we re-cast entity matching as a conditional generation task as opposed to binary classification. This enables us to "distill" LLM reasoning into smaller entity matching models via natural language explanations. This approach achieves strong performance, especially on out-of-domain generalization tests (10.85% F-1) where standalone generative methods struggle. We perform ablations that highlight the importance of explanations, both for performance and model robustness.
Submitted 13 June, 2024;
originally announced June 2024.
-
CoNO: Complex Neural Operator for Continuous Dynamical Physical Systems
Authors:
Karn Tiwari,
N M Anoop Krishnan,
A P Prathosh
Abstract:
Neural operators extend data-driven models to map between infinite-dimensional functional spaces. While these operators perform effectively in either the time or frequency domain, their performance may be limited when applied to non-stationary spatial or temporal signals whose frequency characteristics change with time. Here, we introduce Complex Neural Operator (CoNO) that parameterizes the integral kernel using Fractional Fourier Transform (FrFT), better representing non-stationary signals in a complex-valued domain. Theoretically, we prove the universal approximation capability of CoNO. We perform an extensive empirical evaluation of CoNO on seven challenging partial differential equations (PDEs), including regular grids, structured meshes, and point clouds. Empirically, CoNO consistently attains state-of-the-art performance, showcasing an average relative gain of 10.9%. Further, CoNO exhibits superior performance, outperforming all other models in additional tasks such as zero-shot super-resolution and robustness to noise. CoNO also exhibits the ability to learn from small amounts of data -- giving the same performance as the next best model with just 60% of the training data. Altogether, CoNO presents a robust and superior model for modeling continuous dynamical systems, providing a fillip to scientific machine learning.
Submitted 1 June, 2024;
originally announced June 2024.
-
TAGMol: Target-Aware Gradient-guided Molecule Generation
Authors:
Vineeth Dorna,
D. Subhalingam,
Keshav Kolluru,
Shreshth Tuli,
Mrityunjay Singh,
Saurabh Singal,
N. M. Anoop Krishnan,
Sayan Ranu
Abstract:
3D generative models have shown significant promise in structure-based drug design (SBDD), particularly in discovering ligands tailored to specific target binding sites. Existing algorithms often focus primarily on ligand-target binding, characterized by binding affinity. Moreover, models trained solely on target-ligand distribution may fall short in addressing the broader objectives of drug discovery, such as the development of novel ligands with desired properties like drug-likeness, and synthesizability, underscoring the multifaceted nature of the drug design process. To overcome these challenges, we decouple the problem into molecular generation and property prediction. The latter synergistically guides the diffusion sampling process, facilitating guided diffusion and resulting in the creation of meaningful molecules with the desired properties. We call this guided molecular generation process as TAGMol. Through experiments on benchmark datasets, TAGMol demonstrates superior performance compared to state-of-the-art baselines, achieving a 22% improvement in average Vina Score and yielding favorable outcomes in essential auxiliary properties. This establishes TAGMol as a comprehensive framework for drug generation.
Submitted 3 June, 2024;
originally announced June 2024.
-
EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records
Authors:
Adibvafa Fallahpour,
Mahshid Alinoori,
Arash Afkanpour,
Amrit Krishnan
Abstract:
Transformers have significantly advanced the modeling of Electronic Health Records (EHR), yet their deployment in real-world healthcare is limited by several key challenges. Firstly, the quadratic computational cost and insufficient context length of these models pose significant obstacles for hospitals in processing the extensive medical histories typical in EHR data. Additionally, existing models employ separate finetuning for each clinical task, complicating maintenance in healthcare environments. Moreover, these models focus exclusively on either clinical prediction or EHR forecasting, lacking the flexibility to perform well across both. To overcome these limitations, we introduce EHRMamba, a robust foundation model built on the Mamba architecture. EHRMamba can process sequences up to four times longer than previous models due to its linear computational cost. We also introduce a novel approach to Multitask Prompted Finetuning (MTF) for EHR data, which enables EHRMamba to simultaneously learn multiple clinical tasks in a single finetuning phase, significantly enhancing deployment and cross-task generalization. Furthermore, our model leverages the HL7 FHIR data standard to simplify integration into existing hospital systems. Alongside EHRMamba, we open-source Odyssey, a toolkit designed to support the development and deployment of EHR foundation models, with an emphasis on data standardization and interpretability. Our evaluations on the MIMIC-IV dataset demonstrate that EHRMamba advances state-of-the-art performance across 6 major clinical tasks and excels in EHR forecasting, marking a significant leap forward in the field.
Submitted 23 May, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search
Authors:
Sebastian Bruch,
Aditya Krishnan,
Franco Maria Nardini
Abstract:
Clustering-based nearest neighbor search is a simple yet effective method in which data points are partitioned into geometric shards to form an index, and only a few shards are searched during query processing to find an approximate set of top-$k$ vectors. Even though the search efficacy is heavily influenced by the algorithm that identifies the set of shards to probe, it has received little attention in the literature. This work attempts to bridge that gap by studying the problem of routing in clustering-based maximum inner product search (MIPS). We begin by unpacking existing routing protocols and notice the surprising contribution of optimism. We then take a page from the sequential decision making literature and formalize that insight following the principle of "optimism in the face of uncertainty." In particular, we present a new framework that incorporates the moments of the distribution of inner products within each shard to optimistically estimate the maximum inner product. We then present a simple instance of our algorithm that uses only the first two moments to reach the same accuracy as state-of-the-art routers such as ScaNN by probing up to 50% fewer points on a suite of benchmark MIPS datasets. Our algorithm is also space-efficient: we design a sketch of the second moment whose size is independent of the number of points and in practice requires storing only $O(1)$ additional vectors per shard.
Submitted 20 May, 2024;
originally announced May 2024.
-
No winners: Performance of lung cancer prediction models depends on screening-detected, incidental, and biopsied pulmonary nodule use cases
Authors:
Thomas Z. Li,
Kaiwen Xu,
Aravind Krishnan,
Riqiang Gao,
Michael N. Kammer,
Sanja Antic,
David Xiao,
Michael Knight,
Yency Martinez,
Rafael Paez,
Robert J. Lentz,
Stephen Deppen,
Eric L. Grogan,
Thomas A. Lasko,
Kim L. Sandler,
Fabien Maldonado,
Bennett A. Landman
Abstract:
Statistical models for predicting lung cancer have the potential to facilitate earlier diagnosis of malignancy and avoid invasive workup of benign disease. Many models have been published, but comparative studies of their utility in different clinical settings in which patients would arguably most benefit are scarce. This study retrospectively evaluated promising predictive models for lung cancer prediction in three clinical settings: lung cancer screening with low-dose computed tomography, incidentally detected pulmonary nodules, and nodules deemed suspicious enough to warrant a biopsy. We leveraged 9 cohorts (n=898, 896, 882, 219, 364, 117, 131, 115, 373) from multiple institutions to assess the area under the receiver operating characteristic curve (AUC) of validated models including logistic regressions on clinical variables and radiologist nodule characterizations, artificial intelligence on chest CTs, longitudinal imaging AI, and multi-modal approaches. We implemented each model from their published literature, re-training the models if necessary, and curated each cohort from primary data sources. We observed that model performance varied greatly across clinical use cases. No single predictive model emerged as a clear winner across all cohorts, but certain models excelled in specific clinical contexts. Single timepoint chest CT AI performed well in lung screening, but struggled to generalize to other clinical settings. Longitudinal imaging and multimodal models demonstrated comparatively promising performance on incidentally-detected nodules. However, when applied to nodules that underwent biopsy, all models underperformed. These results underscore the strengths and limitations of 8 validated predictive models and highlight promising directions towards personalized, noninvasive lung cancer diagnosis.
Submitted 16 May, 2024;
originally announced May 2024.
-
Comparison of On-Orbit Manual Attitude Control Methods for Non-Docking Spacecraft Through Virtual Reality Simulation
Authors:
Ajit Krishnan,
Himanshu Vishwakarma,
Maharudra Kharsade,
Pradipta Biswas
Abstract:
On-orbit manual attitude control of manned spacecraft is accomplished using external visual references and some method of three-axis attitude control. All past, present, and developmental spacecraft feature the capability to manually control attitude for deorbit. National Aeronautics and Space Administration (NASA) spacecraft permit an aircraft windshield type front view, wherein an arc of the Earth's horizon is visible to the crew in deorbit attitude. Russian and Chinese spacecraft permit the crew a bottom view wherein the entire circular Earth horizon disk is visible to the crew in deorbit attitude. Our study compared these two types of external views for efficiency in achievement of deorbit attitude. We used a Unity Virtual Reality (VR) spacecraft simulator that we built in house. The task was to accurately achieve deorbit attitude while in a 400 km circular orbit. Six military test pilots and six civilians with gaming experience flew the task using two methods of visual reference. Comparison was based on time taken, fuel consumed, cognitive workload assessment and user preference. We used ocular parameters, EEG, NASA TLX and IBM SUS to quantify our results. Our study found that the bottom view was easier to operate for the manual deorbit task. Additionally, we realized that a VR-based system can work as a training simulator for manual on-orbit flight path control tasks by pilots and non-pilots. Results from our study can be used for the design of manual on-orbit attitude control of present and future spacecraft.
Submitted 22 April, 2024;
originally announced April 2024.
-
Are large language models superhuman chemists?
Authors:
Adrian Mirza,
Nawaf Alampara,
Sreekanth Kunchapu,
Benedict Emoekabu,
Aswanth Krishnan,
Mara Wilhelmi,
Macjonathan Okereke,
Juliane Eberhardt,
Amir Mohammad Elahi,
Maximilian Greiner,
Caroline T. Holick,
Tanya Gupta,
Mehrdad Asgari,
Christina Glaubitz,
Lea C. Klepsch,
Yannik Köster,
Jakob Meyer,
Santiago Miret,
Tim Hoffmann,
Fabian Alexander Kreth,
Michael Ringleb,
Nicole Roesner,
Ulrich S. Schubert,
Leanne M. Stafast,
Dinga Wonanke
, et al. (3 additional authors not shown)
Abstract:
Large language models (LLMs) have gained widespread interest due to their ability to process human language and perform tasks on which they have not been explicitly trained. This is relevant for the chemical sciences, which face the problem of small and diverse datasets that are frequently in the form of text. LLMs have shown promise in addressing these issues and are increasingly being harnessed to predict chemical properties, optimize reactions, and even design and conduct experiments autonomously. However, we still have only a very limited systematic understanding of the chemical reasoning capabilities of LLMs, which would be required to improve models and mitigate potential harms. Here, we introduce "ChemBench," an automated framework designed to rigorously evaluate the chemical knowledge and reasoning abilities of state-of-the-art LLMs against the expertise of human chemists. We curated more than 7,000 question-answer pairs for a wide array of subfields of the chemical sciences, evaluated leading open and closed-source LLMs, and found that the best models outperformed the best human chemists in our study on average. The models, however, struggle with some chemical reasoning tasks that are easy for human experts and provide overconfident, misleading predictions, such as about chemicals' safety profiles. These findings underscore the dual reality that, although LLMs demonstrate remarkable proficiency in chemical tasks, further research is critical to enhancing their safety and utility in chemical sciences. Our findings also indicate a need for adaptations to chemistry curricula and highlight the importance of continuing to develop evaluation frameworks to improve safe and useful LLMs.
Submitted 1 April, 2024;
originally announced April 2024.
-
Data-Driven Ergonomic Risk Assessment of Complex Hand-intensive Manufacturing Processes
Authors:
Anand Krishnan,
Xingjian Yang,
Utsav Seth,
Jonathan M. Jeyachandran,
Jonathan Y. Ahn,
Richard Gardner,
Samuel F. Pedigo,
Adriana Blom-Schieber,
Ashis G. Banerjee,
Krithika Manohar
Abstract:
Hand-intensive manufacturing processes, such as composite layup and textile draping, require significant human dexterity to accommodate task complexity. These strenuous hand motions often lead to musculoskeletal disorders and rehabilitation surgeries. We develop a data-driven ergonomic risk assessment system with a special focus on hand and finger activity to better identify and address ergonomic issues related to hand-intensive manufacturing processes. The system comprises a multi-modal sensor testbed to collect and synchronize operator upper body pose, hand pose and applied forces; a Biometric Assessment of Complete Hand (BACH) formulation to measure high-fidelity hand and finger risks; and industry-standard risk scores associated with upper body posture, RULA, and hand activity, HAL. Our findings demonstrate that BACH captures injurious activity with a higher granularity in comparison to the existing metrics. Machine learning models are also used to automate RULA and HAL scoring, and generalize well to unseen participants. Our assessment system, therefore, provides ergonomic interpretability of the manufacturing processes studied, and could be used to mitigate risks through minor workplace optimization and posture corrections.
Submitted 5 March, 2024;
originally announced March 2024.
-
CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations
Authors:
Samraj Moorjani,
Adit Krishnan,
Hari Sundaram
Abstract:
As large-scale language models become the standard for text generation, there is a greater need to tailor the generations to be more or less concise, targeted, and informative, depending on the audience/application. Existing control approaches primarily adjust the semantic (e.g., emotion, topics), structural (e.g., syntax tree, parts-of-speech), and lexical (e.g., keyword/phrase inclusion) properties of text, but are insufficient to accomplish complex objectives such as pacing which control the complexity and readability of the text. In this paper, we introduce CEV-LM - a lightweight, semi-autoregressive language model that utilizes constrained edit vectors to control three complementary metrics (speed, volume, and circuitousness) that quantify the shape of text (e.g., pacing of content). We study an extensive set of state-of-the-art CTG models and find that CEV-LM provides significantly more targeted and precise control of these three metrics while preserving semantic content, using less training data, and containing fewer parameters.
Submitted 22 February, 2024;
originally announced February 2024.
-
On the phase diagram of the polymer model
Authors:
Arjun Krishnan,
Sevak Mkrtchyan,
Scott Neville
Abstract:
In dimension 1, the directed polymer model is in the celebrated KPZ universality class, and for all positive temperatures, a typical polymer path shows non-Brownian KPZ scaling behavior. In dimensions 3 or larger, it is a classical fact that the polymer has two phases: Brownian behavior at high temperature, and non-Brownian behavior at low temperature. We consider the response of the polymer to an external field or tilt, and show that at fixed temperature, the polymer has Brownian behavior for some fields and non-Brownian behavior for others. In other words, the external field can induce the phase transition in the directed polymer model.
Submitted 19 February, 2024;
originally announced February 2024.
-
Improving Model's Interpretability and Reliability using Biomarkers
Authors:
Gautam Rajendrakumar Gare,
Tom Fox,
Beam Chansangavej,
Amita Krishnan,
Ricardo Luis Rodriguez,
Bennett P deBoisblanc,
Deva Kannan Ramanan,
John Michael Galeotti
Abstract:
Accurate and interpretable diagnostic models are crucial in the safety-critical field of medicine. We investigate the interpretability of our proposed biomarker-based lung ultrasound diagnostic pipeline to enhance clinicians' diagnostic capabilities. The objective of this study is to assess whether explanations from a decision tree classifier, utilizing biomarkers, can improve users' ability to identify inaccurate model predictions compared to conventional saliency maps. Our findings demonstrate that decision tree explanations, based on clinically established biomarkers, can assist clinicians in detecting false positives, thus improving the reliability of diagnostic models in medicine.
Submitted 16 February, 2024;
originally announced February 2024.
-
Are LLMs Ready for Real-World Materials Discovery?
Authors:
Santiago Miret,
N M Anoop Krishnan
Abstract:
Large Language Models (LLMs) create exciting possibilities for powerful language processing tools to accelerate research in materials science. While LLMs have great potential to accelerate materials understanding and discovery, they currently fall short in being practical materials science tools. In this position paper, we show relevant failure cases of LLMs in materials science that reveal current limitations of LLMs related to comprehending and reasoning over complex, interconnected materials science knowledge. Given those shortcomings, we outline a framework for developing Materials Science LLMs (MatSci-LLMs) that are grounded in materials science knowledge and hypothesis generation followed by hypothesis testing. The path to attaining performant MatSci-LLMs rests in large part on building high-quality, multi-modal datasets sourced from scientific literature where various information extraction challenges persist. As such, we describe key materials science information extraction challenges which need to be overcome in order to build large-scale, multi-modal datasets that capture valuable materials science knowledge. Finally, we outline a roadmap for applying future MatSci-LLMs for real-world materials discovery via: 1. Automated Knowledge Base Generation; 2. Automated In-Silico Material Design; and 3. MatSci-LLM Integrated Self-Driving Materials Laboratories.
Submitted 7 February, 2024;
originally announced February 2024.
-
Evaluation of Mean Shift, ComBat, and CycleGAN for Harmonizing Brain Connectivity Matrices Across Sites
Authors:
Hanliang Xu,
Nancy R. Newlin,
Michael E. Kim,
Chenyu Gao,
Praitayini Kanakaraj,
Aravind R. Krishnan,
Lucas W. Remedios,
Nazirah Mohd Khairi,
Kimberly Pechman,
Derek Archer,
Timothy J. Hohman,
Angela L. Jefferson,
The BIOCARD Study Team,
Ivana Isgum,
Yuankai Huo,
Daniel Moyer,
Kurt G. Schilling,
Bennett A. Landman
Abstract:
Connectivity matrices derived from diffusion MRI (dMRI) provide an interpretable and generalizable way of understanding the human brain connectome. However, dMRI suffers from inter-site and between-scanner variation, which impedes analysis across datasets to improve robustness and reproducibility of results. To evaluate different harmonization approaches on connectivity matrices, we compared graph measures derived from these matrices before and after applying three harmonization techniques: mean shift, ComBat, and CycleGAN. The sample comprises 168 age-matched, sex-matched normal subjects from two studies: the Vanderbilt Memory and Aging Project (VMAP) and the Biomarkers of Cognitive Decline Among Normal Individuals (BIOCARD). First, we plotted the graph measures and used coefficient of variation (CoV) and the Mann-Whitney U test to evaluate different methods' effectiveness in removing site effects on the matrices and the derived graph measures. ComBat effectively eliminated site effects for global efficiency and modularity and outperformed the other two methods. However, all methods exhibited poor performance when harmonizing average betweenness centrality. Second, we tested whether our harmonization methods preserved correlations between age and graph measures. All methods except for CycleGAN in one direction improved correlations between age and global efficiency and between age and modularity from insignificant to significant with p-values less than 0.05.
Submitted 24 January, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Disorder-induced enhancement of lithium-ion transport in solid-state electrolytes
Authors:
Zhimin Chen,
Tao Du,
N. M. Anoop Krishnan,
Yuanzheng Yue,
Morten M. Smedskjaer
Abstract:
Enhancing the ion conduction in solid electrolytes is critically important for the development of high-performance all-solid-state lithium-ion batteries (LIBs). Lithium thiophosphates are among the most promising solid electrolytes, as they exhibit superionic conductivity at room temperature. However, the lack of comprehensive understanding regarding their ion conduction mechanism, especially the effect of structural disorder on ionic conductivity, is a long-standing problem that limits further innovations of all-solid-state LIBs. Here, we address this challenge by establishing and employing a deep learning potential to simulate Li3PS4 electrolyte systems with varying levels of disorder. The results show that disorder-driven diffusion dynamics significantly enhances the room-temperature conductivity. We further establish bridges between dynamical characteristics, local structural features, and atomic rearrangements by applying a machine learning-based structure fingerprint termed "softness". This metric allows the classification of the disorder-induced "soft" hopping lithium ions. Our findings offer insights into ion conduction mechanisms in complex disordered structures, thereby contributing to the development of superior solid-state electrolytes for LIBs.
Submitted 10 January, 2024;
originally announced January 2024.
-
Production of highly charged ions inside a cryogenic Penning trap by electron-impact ionisation
Authors:
Kanika,
A Krishnan,
J W Klimes,
B Reich,
K K Anjum,
P Baus,
G Birkl,
W Quint,
M Vogel
Abstract:
We have built and operated a cryogenic Penning trap arrangement that allows for the efficient production, selection, and long-term storage of highly charged atomic ions. In close similarity to an electron-beam ion trap (EBIT) it works by electron-impact ionisation of atoms inside a dedicated confinement region. The electrons are produced by field emission at liquid-helium temperature and are subsequently accelerated to the keV energy range. The electron beam is reflected through the trap multiple times to increase the ionisation efficiency. We show a characterisation of the system and measurements with argon and tungsten ions up to Ar$^{16+}$ and W$^{27+}$, respectively.
Submitted 30 December, 2023;
originally announced January 2024.
-
Modeling Risk in Reinforcement Learning: A Literature Mapping
Authors:
Leonardo Villalobos-Arias,
Derek Martin,
Abhijeet Krishnan,
Madeleine Gagné,
Colin M. Potts,
Arnav Jhala
Abstract:
Safe reinforcement learning deals with mitigating or avoiding unsafe situations by reinforcement learning (RL) agents. Safe RL approaches are based on specific risk representations for particular problems or domains. In order to analyze agent behaviors, compare safe RL approaches, and effectively transfer techniques between application domains, it is necessary to understand the types of risk specific to safe RL problems. We performed a systematic literature mapping with the objective of characterizing risk in safe RL. Based on the obtained results, we present definitions, characteristics, and types of risk that hold across multiple application domains. Our literature mapping covers literature from the last 5 years (2017-2022), from a variety of knowledge areas (AI, finance, engineering, medicine) where RL approaches emphasize risk representation and management. Our mapping covers 72 papers filtered systematically from thousands of papers on the topic. Our proposed notion of risk covers a variety of representations, disciplinary differences, common training exercises, and types of techniques. We encourage researchers to include explicit and detailed accounts of risk in future safe RL research reports, using this mapping as a starting point. With this information, researchers and practitioners could draw stronger conclusions on the effectiveness of techniques on different problems.
Submitted 8 December, 2023;
originally announced December 2023.
-
Toral symmetries of collapsed ancient solutions to the homogeneous Ricci flow
Authors:
Anusha M. Krishnan,
Francesco Pediconi,
Sammy Sbiti
Abstract:
Collapsed ancient solutions to the homogeneous Ricci flow on compact manifolds occur only on the total space of principal torus bundles. Under a geometric assumption on the torus fibers and an algebraic assumption that guarantees flowing through diagonal metrics, we prove that such solutions have additional symmetries. In particular, we show they are invariant under the right action of their collapsing torus.
Submitted 3 December, 2023;
originally announced December 2023.
-
Distributed Global Structure-from-Motion with a Deep Front-End
Authors:
Ayush Baid,
John Lambert,
Travis Driver,
Akshay Krishnan,
Hayk Stepanyan,
Frank Dellaert
Abstract:
While initial approaches to Structure-from-Motion (SfM) revolved around both global and incremental methods, most recent applications rely on incremental systems to estimate camera poses due to their superior robustness. Though there has been tremendous progress in SfM `front-ends' powered by deep models learned from data, the state-of-the-art (incremental) SfM pipelines still rely on classical SIFT features, developed in 2004. In this work, we investigate whether leveraging the developments in feature extraction and matching helps global SfM perform on par with the SOTA incremental SfM approach (COLMAP). To do so, we design a modular SfM framework that allows us to easily combine developments in different stages of the SfM pipeline. Our experiments show that while developments in deep-learning based two-view correspondence estimation do translate to improvements in point density for scenes reconstructed with global SfM, none of them outperform SIFT when comparing with incremental SfM results on a range of datasets. Our SfM system is designed from the ground up to leverage distributed computation, enabling us to parallelize computation on multiple machines and scale to large scenes.
Submitted 30 November, 2023;
originally announced November 2023.
-
$10$-dimensional positively curved manifolds with $T^3$-symmetry
Authors:
Anusha M. Krishnan,
Michael Wiemeler
Abstract:
We show that ten-dimensional closed simply connected positively curved manifolds with isometric effective actions of three-dimensional tori are homotopy spheres or homotopy complex projective spaces.
Submitted 19 October, 2023;
originally announced October 2023.
-
Reconstructing Materials Tetrahedron: Challenges in Materials Information Extraction
Authors:
Kausik Hira,
Mohd Zaki,
Dhruvil Sheth,
Mausam,
N M Anoop Krishnan
Abstract:
The discovery of new materials has a documented history of propelling human progress for centuries and more. The behaviour of a material is a function of its composition, structure, and properties, which further depend on its processing and testing conditions. Recent developments in deep learning and natural language processing have enabled information extraction at scale from published literature such as peer-reviewed publications, books, and patents. However, this information is spread in multiple formats, such as tables, text, and images, and with little or no uniformity in reporting style giving rise to several machine learning challenges. Here, we discuss, quantify, and document these challenges in automated information extraction (IE) from materials science literature towards the creation of a large materials science knowledge base. Specifically, we focus on IE from text and tables and outline several challenges with examples. We hope the present work inspires researchers to address the challenges in a coherent fashion, providing a fillip to IE towards developing a materials knowledge base.
Submitted 26 April, 2024; v1 submitted 12 October, 2023;
originally announced October 2023.
-
ClimateNLP: Analyzing Public Sentiment Towards Climate Change Using Natural Language Processing
Authors:
Ajay Krishnan,
V. S. Anoop
Abstract:
Climate change's impact on human health poses unprecedented and diverse challenges. Unless proactive measures based on solid evidence are implemented, these threats will likely escalate and continue to endanger human well-being. The escalating advancements in information and communication technologies have facilitated the widespread availability and utilization of social media platforms. Individuals utilize platforms such as Twitter and Facebook to express their opinions, thoughts, and critiques on diverse subjects, encompassing the pressing issue of climate change. The proliferation of climate change-related content on social media necessitates comprehensive analysis to glean meaningful insights. This paper employs natural language processing (NLP) techniques to analyze climate change discourse and quantify the sentiment of climate change-related tweets. We use ClimateBERT, a pretrained model fine-tuned specifically for the climate change domain. The objective is to discern the sentiment individuals express and uncover patterns in public opinion concerning climate change. Analyzing tweet sentiments allows a deeper comprehension of public perceptions, concerns, and emotions about this critical global challenge. The findings from this experiment unearth valuable insights into public sentiment and the entities associated with climate change discourse. Policymakers, researchers, and organizations can leverage such analyses to understand public perceptions, identify influential actors, and devise informed strategies to address climate change challenges.
Submitted 19 October, 2023; v1 submitted 12 October, 2023;
originally announced October 2023.
-
EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations
Authors:
Vaibhav Bihani,
Utkarsh Pratiush,
Sajid Mannan,
Tao Du,
Zhimin Chen,
Santiago Miret,
Matthieu Micoulaut,
Morten M Smedskjaer,
Sayan Ranu,
N M Anoop Krishnan
Abstract:
Equivariant graph neural network force fields (EGraFFs) have shown great promise in modelling complex interactions in atomic systems by exploiting the graphs' inherent symmetries. Recent works have led to a surge in the development of novel architectures that incorporate equivariance-based inductive biases alongside architectural innovations like graph transformers and message passing to model atomic interactions. However, a thorough evaluation of these EGraFFs on the downstream task of real-world atomistic simulations is lacking. To this end, here we perform a systematic benchmarking of 6 EGraFF algorithms (NequIP, Allegro, BOTNet, MACE, Equiformer, TorchMDNet), with the aim of understanding their capabilities and limitations for realistic atomistic simulations. In addition to our thorough evaluation and analysis on eight existing datasets based on the benchmarking literature, we release two new benchmark datasets and propose four new metrics and three challenging tasks. The new datasets and tasks evaluate the performance of EGraFFs on out-of-distribution data, in terms of different crystal structures, temperatures, and new molecules. Interestingly, evaluation of the EGraFF models based on dynamic simulations reveals that having a lower error on energy or force does not guarantee stable or reliable simulation or faithful replication of the atomic structures. Moreover, we find that no model clearly outperforms other models on all datasets and tasks. Importantly, we show that the performance of all the models on out-of-distribution datasets is unreliable, pointing to the need for the development of a foundation model for force fields that can be used in real-world simulations. In summary, this work establishes a rigorous framework for evaluating machine learning force fields in the context of atomic simulations and points to open research challenges within this domain.
Submitted 24 November, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
CoNO: Complex Neural Operator for Continuous Dynamical Systems
Authors:
Karn Tiwari,
N M Anoop Krishnan,
Prathosh A P
Abstract:
Neural operators extend data-driven models to map between infinite-dimensional functional spaces. These models have successfully solved continuous dynamical systems represented by differential equations, viz., weather forecasting, fluid flow, or solid mechanics. However, the existing operators still rely on real space, thereby losing rich representations potentially captured in the complex space by functional transforms. In this paper, we introduce a Complex Neural Operator (CoNO) that parameterizes the integral kernel in the complex fractional Fourier domain. Additionally, the model employing a complex-valued neural network along with aliasing-free activation functions preserves the complex values and complex algebraic properties, thereby enabling improved representation, robustness to noise, and generalization. We show that the model effectively captures the underlying partial differential equation with a single complex fractional Fourier transform. We perform an extensive empirical evaluation of CoNO on several datasets and additional tasks such as zero-shot super-resolution, evaluation of out-of-distribution data, data efficiency, and robustness to noise. CoNO exhibits comparable or superior performance to all the state-of-the-art models in these tasks. Altogether, CoNO presents a robust and superior model for modeling continuous dynamical systems, providing a fillip to scientific machine learning.
Submitted 4 October, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
CoDBench: A Critical Evaluation of Data-driven Models for Continuous Dynamical Systems
Authors:
Priyanshu Burark,
Karn Tiwari,
Meer Mehran Rashid,
Prathosh A P,
N M Anoop Krishnan
Abstract:
Continuous dynamical systems, characterized by differential equations, are ubiquitously used to model several important problems: plasma dynamics, flow through porous media, weather forecasting, and epidemic dynamics. Recently, a wide range of data-driven models has been used successfully to model these systems. However, in contrast to established fields like computer vision, limited studies are available analyzing the strengths and potential applications of different classes of these models that could steer decision-making in scientific machine learning. Here, we introduce CoDBench, an exhaustive benchmarking suite comprising 11 state-of-the-art data-driven models for solving differential equations. Specifically, we comprehensively evaluate 4 distinct categories of models, viz., feed forward neural networks, deep operator regression models, frequency-based neural operators, and transformer architectures against 8 widely applicable benchmark datasets encompassing challenges from fluid and solid mechanics. We conduct extensive experiments, assessing the operators' capabilities in learning, zero-shot super-resolution, data efficiency, robustness to noise, and computational efficiency. Interestingly, our findings highlight that current operators struggle with the newer mechanics datasets, motivating the need for more robust neural operators. All the datasets and codes will be shared in an easy-to-use fashion for the scientific community. We hope this resource will be an impetus for accelerated progress and exploration in modeling dynamical systems.
Submitted 2 October, 2023;
originally announced October 2023.
-
FENDA-FL: Personalized Federated Learning on Heterogeneous Clinical Datasets
Authors:
Fatemeh Tavakoli,
D. B. Emerson,
Sana Ayromlou,
John Jewell,
Amrit Krishnan,
Yuchong Zhang,
Amol Verma,
Fahad Razak
Abstract:
Federated learning (FL) is increasingly being recognized as a key approach to overcoming the data silos that so frequently obstruct the training and deployment of machine-learning models in clinical settings. This work contributes to a growing body of FL research specifically focused on clinical applications along three important directions. First, we expand the FLamby benchmark (du Terrail et al., 2022a) to include evaluation of personalized FL methods and demonstrate substantive performance improvements over the original results. Next, we advocate for a comprehensive checkpointing and evaluation framework for FL to reflect practical settings and provide multiple comparison baselines. Finally, we study an important ablation of PerFCL (Zhang et al., 2022). This ablation is a natural extension of FENDA (Kim et al., 2016) to the FL setting. Experiments conducted on the FLamby benchmarks and GEMINI datasets (Verma et al., 2017) show that the approach is robust to heterogeneous clinical data and often outperforms existing global and personalized FL techniques, including PerFCL.
Submitted 6 February, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Inter-vendor harmonization of Computed Tomography (CT) reconstruction kernels using unpaired image translation
Authors:
Aravind R. Krishnan,
Kaiwen Xu,
Thomas Li,
Chenyu Gao,
Lucas W. Remedios,
Praitayini Kanakaraj,
Ho Hin Lee,
Shunxing Bao,
Kim L. Sandler,
Fabien Maldonado,
Ivana Isgum,
Bennett A. Landman
Abstract:
The reconstruction kernel in computed tomography (CT) generation determines the texture of the image. Consistency in reconstruction kernels is important as the underlying CT texture can impact measurements during quantitative image analysis. Harmonization (i.e., kernel conversion) minimizes differences in measurements due to inconsistent reconstruction kernels. Existing methods investigate harmonization of CT scans in single or multiple manufacturers. However, these methods require paired scans of hard and soft reconstruction kernels that are spatially and anatomically aligned. Additionally, a large number of models need to be trained across different kernel pairs within manufacturers. In this study, we adopt an unpaired image translation approach to investigate harmonization between and across reconstruction kernels from different manufacturers by constructing a multipath cycle generative adversarial network (GAN). We use hard and soft reconstruction kernels from the Siemens and GE vendors from the National Lung Screening Trial dataset. We use 50 scans from each reconstruction kernel and train a multipath cycle GAN. To evaluate the effect of harmonization on the reconstruction kernels, we harmonize 50 scans each from Siemens hard kernel, GE soft kernel and GE hard kernel to a reference Siemens soft kernel (B30f) and evaluate percent emphysema. We fit a linear model by considering the age, smoking status, sex and vendor and perform an analysis of variance (ANOVA) on the emphysema scores. Our approach minimizes differences in emphysema measurement and highlights the impact of age, sex, smoking status and vendor on emphysema quantification.
Submitted 26 January, 2024; v1 submitted 22 September, 2023;
originally announced September 2023.
-
Current and future directions in network biology
Authors:
Marinka Zitnik,
Michelle M. Li,
Aydin Wells,
Kimberly Glass,
Deisy Morselli Gysi,
Arjun Krishnan,
T. M. Murali,
Predrag Radivojac,
Sushmita Roy,
Anaïs Baudot,
Serdar Bozdag,
Danny Z. Chen,
Lenore Cowen,
Kapil Devkota,
Anthony Gitter,
Sara Gosline,
Pengfei Gu,
Pietro H. Guzzi,
Heng Huang,
Meng Jiang,
Ziynet Nesibe Kesimoglu,
Mehmet Koyuturk,
Jian Ma,
Alexander R. Pico,
Nataša Pržulj
, et al. (12 additional authors not shown)
Abstract:
Network biology is an interdisciplinary field bridging computational and biological sciences that has proved pivotal in advancing the understanding of cellular functions and diseases across biological systems and scales. Although the field has been around for two decades, it remains nascent. It has witnessed rapid evolution, accompanied by emerging challenges. These challenges stem from various factors, notably the growing complexity and volume of data together with the increased diversity of data types describing different tiers of biological organization. We discuss prevailing research directions in network biology and highlight areas of inference and comparison of biological networks, multimodal data integration and heterogeneous networks, higher-order network analysis, machine learning on networks, and network-based personalized medicine. Following the overview of recent breakthroughs across these five areas, we offer a perspective on the future directions of network biology. Additionally, we offer insights into scientific communities, educational initiatives, and the importance of fostering diversity within the field. This paper establishes a roadmap for an immediate and long-term vision for network biology.
Submitted 11 June, 2024; v1 submitted 15 September, 2023;
originally announced September 2023.
-
A central limit theorem in the framework of the Thompson group $F$
Authors:
Arundhathi Krishnan
Abstract:
We discuss a central limit theorem in the framework of the group algebra of the Thompson group $F$. We consider the sequence of self-adjoint elements given by $a_n=\frac{g_n+g_n^{*}}{\sqrt{2}}$ in the noncommutative probability space $(\mathbb{C}(F),\varphi)$, where the expectation functional $\varphi$ is the trace associated to the left regular representation of $F$, and the $g_n$-s are the generators of $F$ in its standard infinite presentation. We show that the limit law of the sequence $s_n = \frac{a_0+\cdots+a_{n-1}}{\sqrt{n}}$ is the standard normal distribution.
Submitted 11 September, 2023;
originally announced September 2023.
-
Pre-trained Neural Recommenders: A Transferable Zero-Shot Framework for Recommendation Systems
Authors:
Junting Wang,
Adit Krishnan,
Hari Sundaram,
Yunzhe Li
Abstract:
Modern neural collaborative filtering techniques are critical to the success of e-commerce, social media, and content-sharing platforms. However, despite technical advances -- for every new application domain, we need to train an NCF model from scratch. In contrast, pre-trained vision and language models are routinely applied to diverse applications directly (zero-shot) or with limited fine-tuning. Inspired by the impact of pre-trained models, we explore the possibility of pre-trained recommender models that support building recommender systems in new domains, with minimal or no retraining, without the use of any auxiliary user or item information. Zero-shot recommendation without auxiliary information is challenging because we cannot form associations between users and items across datasets when there are no overlapping users or items. Our fundamental insight is that the statistical characteristics of the user-item interaction matrix are universally available across different domains and datasets. Thus, we use the statistical characteristics of the user-item interaction matrix to identify dataset-independent representations for users and items. We show how to learn universal (i.e., supporting zero-shot adaptation without user or item auxiliary information) representations for nodes and edges from the bipartite user-item interaction graph. We learn representations by exploiting the statistical properties of the interaction data, including user and item marginals, and the size and density distributions of their clusters.
Submitted 29 September, 2023; v1 submitted 3 September, 2023;
originally announced September 2023.
-
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants
Authors:
Lucas Bandarkar,
Davis Liang,
Benjamin Muller,
Mikel Artetxe,
Satya Narayan Shukla,
Donald Husa,
Naman Goyal,
Abhinandan Krishnan,
Luke Zettlemoyer,
Madian Khabsa
Abstract:
We present Belebele, a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. Significantly expanding the language coverage of natural language understanding (NLU) benchmarks, this dataset enables the evaluation of text models in high-, medium-, and low-resource languages. Each question is based on a short passage from the Flores-200 dataset and has four multiple-choice answers. The questions were carefully curated to discriminate between models with different levels of general language comprehension. The English dataset on its own proves difficult enough to challenge state-of-the-art language models. Being fully parallel, this dataset enables direct comparison of model performance across all languages. We use this dataset to evaluate the capabilities of multilingual masked language models (MLMs) and large language models (LLMs). We present extensive results and find that despite significant cross-lingual transfer in English-centric LLMs, much smaller MLMs pretrained on balanced multilingual data still understand far more languages. We also observe that larger vocabulary size and conscious vocabulary construction correlate with better performance on low-resource languages. Overall, Belebele opens up new avenues for evaluating and analyzing the multilingual capabilities of NLP systems.
Submitted 31 August, 2023;
originally announced August 2023.
-
Control of thermoacoustic instability through Lagrangian saddle point analysis
Authors:
C. P. Premchand,
Abin Krishnan,
Manikandan Raghunathan,
Midhun Raghunath,
Reeja K. V.,
R. I. Sujith,
Vineeth Nair
Abstract:
We propose a framework of Lagrangian Coherent Structures (LCS) to elucidate the shear layers and coherent structures to understand the mechanism of generation of tonal sound during thermoacoustic instability. Experiments were performed on a bluff-body stabilized turbulent combustor in the state of thermoacoustic instability. We use dynamic mode decomposition (DMD) on the flow-field to identify dynamical regions where the acoustic frequency is dominant. We find that the separating shear layer from the backward-facing step of the combustor envelops a cylindrical vortex in the outer recirculation zone (ORZ), eventually impinging on the top wall of the combustor during thermoacoustic instability. The production of tonal sound is due to the synchronous motion of this shear layer with the shear layer emerging from the leading edge of the bluff-body causing the fluid to squeeze and expand between them periodically. To de-synchronize the oscillations in an optimal manner, we track the Lagrangian saddle points in the shear layer emerging from the backward-facing step over several acoustic cycles. A passive control strategy is then developed by injecting a steady stream of secondary air targeting the optimal locations thus identified. After implementing the control action, the flow-field is also analysed using LCS to understand the key differences in flow dynamics. We find that the shear layer emerging from the dump plane is deflected in a direction almost parallel to the axis of the combustor after the control action. This deflection in turn prevents the shear layer from enveloping the vortex and impinging on the combustor walls, resulting in a drastic reduction of the amplitude of the acoustic field.
Submitted 27 August, 2023;
originally announced August 2023.
-
Exploring the Power of Topic Modeling Techniques in Analyzing Customer Reviews: A Comparative Analysis
Authors:
Anusuya Krishnan
Abstract:
The exponential growth of online social network platforms and applications has led to a staggering volume of user-generated textual content, including comments and reviews. Consequently, users often face difficulties in extracting valuable insights or relevant information from such content. To address this challenge, machine learning and natural language processing algorithms have been deployed to analyze the vast amount of textual data available online. In recent years, topic modeling techniques have gained significant popularity in this domain. In this study, we comprehensively examine and compare five frequently used topic modeling methods specifically applied to customer reviews. The methods under investigation are latent semantic analysis (LSA), latent Dirichlet allocation (LDA), non-negative matrix factorization (NMF), pachinko allocation model (PAM), Top2Vec, and BERTopic. By practically demonstrating their benefits in detecting important topics, we aim to highlight their efficacy in real-world scenarios. To evaluate the performance of these topic modeling methods, we carefully select two textual datasets. The evaluation is based on standard statistical evaluation metrics such as topic coherence score. Our findings reveal that BERTopic consistently yields more meaningful extracted topics and achieves favorable results.
Submitted 19 August, 2023;
originally announced August 2023.
-
Optimizing Multi-Class Text Classification: A Diverse Stacking Ensemble Framework Utilizing Transformers
Authors:
Anusuya Krishnan
Abstract:
Customer reviews play a crucial role in assessing customer satisfaction, gathering feedback, and driving improvements for businesses. Analyzing these reviews provides valuable insights into customer sentiments, including compliments, comments, and suggestions. Text classification techniques enable businesses to categorize customer reviews into distinct categories, facilitating a better understanding of customer feedback. However, challenges such as overfitting and bias limit the effectiveness of a single classifier in ensuring optimal prediction. This study proposes a novel approach to address these challenges by introducing a stacking ensemble-based multi-text classification method that leverages transformer models. By combining multiple single transformers, including BERT, ELECTRA, and DistilBERT, as base-level classifiers, and a meta-level classifier based on RoBERTa, an optimal predictive model is generated. The proposed stacking ensemble-based multi-text classification method aims to enhance the accuracy and robustness of customer review analysis. Experimental evaluations conducted on a real-world customer review dataset demonstrate the effectiveness and superiority of the proposed approach over traditional single classifier models. The stacking ensemble-based multi-text classification method using transformers proves to be a promising solution for businesses seeking to extract valuable insights from customer reviews and make data-driven decisions to enhance customer satisfaction and drive continuous improvement.
Submitted 19 August, 2023;
originally announced August 2023.
-
MaScQA: A Question Answering Dataset for Investigating Materials Science Knowledge of Large Language Models
Authors:
Mohd Zaki,
Jayadeva,
Mausam,
N. M. Anoop Krishnan
Abstract:
Information extraction and textual comprehension from materials literature are vital for developing an exhaustive knowledge base that enables accelerated materials discovery. Language models have demonstrated their capability to answer domain-specific questions and retrieve information from knowledge bases. However, there are no benchmark datasets in the materials domain that can evaluate the understanding of the key concepts by these language models. In this work, we curate a dataset of 650 challenging questions from the materials domain that require the knowledge and skills of a materials student who has cleared their undergraduate degree. We classify these questions based on their structure and the materials science domain-based subcategories. Further, we evaluate the performance of GPT-3.5 and GPT-4 models on solving these questions via zero-shot and chain of thought prompting. It is observed that GPT-4 gives the best performance (~62% accuracy) as compared to GPT-3.5. Interestingly, in contrast to the general observation, no significant improvement in accuracy is observed with the chain of thought prompting. To evaluate the limitations, we performed an error analysis, which revealed conceptual errors (~64%) as the major contributor compared to computational errors (~36%) towards the reduced performance of LLMs. We hope that the dataset and analysis performed in this work will promote further research in developing better materials science domain-specific LLMs and strategies for information extraction.
Submitted 17 August, 2023;
originally announced August 2023.
-
Exploring Machine Learning and Transformer-based Approaches for Deceptive Text Classification: A Comparative Analysis
Authors:
Anusuya Krishnan
Abstract:
Deceptive text classification is a critical task in natural language processing that aims to identify deceptive or fraudulent content. This study presents a comparative analysis of machine learning and transformer-based approaches for deceptive text classification. We investigate the effectiveness of traditional machine learning algorithms and state-of-the-art transformer models, such as BERT, XLNET, DistilBERT, and RoBERTa, in detecting deceptive text. A labeled dataset consisting of deceptive and non-deceptive texts is used for training and evaluation purposes. Through extensive experimentation, we compare the performance metrics, including accuracy, precision, recall, and F1 score, of the different approaches. The results of this study shed light on the strengths and limitations of machine learning and transformer-based methods for deceptive text classification, enabling researchers and practitioners to make informed decisions when dealing with deceptive content.
Submitted 10 August, 2023; v1 submitted 10 August, 2023;
originally announced August 2023.
-
Control of Vortex Dynamics using Invariants
Authors:
Kartik Krishna,
Aditya G. Nair,
Anand Krishnan,
Steven L. Brunton,
Eurika Kaiser
Abstract:
Vortex-dominated flows are ubiquitous in engineering, and the ability to efficiently manipulate the dynamics of these vortices has broad applications, from wake shaping to mixing enhancement. However, the strongly nonlinear behavior of the vortex dynamics makes this a challenging task. In this work, we investigate the control of vortex dynamics by using a change of coordinates from the Biot-Savart equations into well-known invariants, such as the Hamiltonian, linear, and angular impulses, which are Koopman eigenfunctions. We then combine the resulting model with model predictive control to generate control laws that force the vortex system using "virtual cylinders". The invariant model is beneficial as it provides a linear, global description of the vortex dynamics through a recently developed Koopman control scheme for conserved quantities and invariants. The use of this model has not been well studied in the literature in the context of control. In this paper, we seek to understand the effect of changing each invariant individually or multiple invariants simultaneously. We use the 4-vortex system as our primary test bed, as it is the simplest configuration that exhibits chaotic behavior. We show that by controlling to specific invariant quantities, we can modify the transition from chaotic to quasiperiodic states. Finally, we computationally demonstrate the effectiveness of invariant control on a toy example of tracer mixing in the 4-vortex system.
Submitted 7 November, 2023; v1 submitted 7 August, 2023;
originally announced August 2023.
-
Spatiotemporal Patterns Corresponding to Phase Synchronization and Generalized Synchronization States of Thermoacoustic Instability
Authors:
Samadhan A. Pawar,
P. R. Midhun,
K. V. Reeja,
Abin Krishnan,
Krishna Manoj,
R. I. Sujith
Abstract:
Thermoacoustic instability in turbulent combustion systems emerges from the complex interplay among the flame, flow, and acoustic subsystems. While the onset of thermoacoustic instability exhibits global order in system dynamics, the characteristics of local interactions between subsystems responsible for this order are not well understood. In this study, we utilize the framework of synchronization to elucidate the spatiotemporal interactions among heat release rate fluctuations in the flame, velocity fluctuations in the flow, and acoustic pressure fluctuations in a turbulent combustor. We examine two forms of thermoacoustic instability, characterized by phase synchronization and generalized synchronization of the acoustic pressure and global heat release rate oscillations. Despite the presence of global synchrony, we uncover a coexistence of frequency synchrony and desynchrony in the local interaction of these oscillations within the reaction field. In regions of frequency-locked oscillations, various phase-locking patterns occur, including phase synchrony and partial phase synchrony. We discover that the local development of small pockets of phase synchrony and strong amplitude correlation between these oscillations is sufficient to trigger global phase synchronization in the system. Furthermore, as the global dynamics approach generalized synchronization, these local regions of synchrony expand throughout the reaction field. Additionally, through coupled analysis of acoustic pressure and local flow velocity fluctuations, we infer that the spatial region of flow-acoustic synchrony plays a significant role in governing thermoacoustic instabilities. Our findings imply that, in turbulent combustors, an intrinsic local balance between order, partial order, and disorder within the coupled subsystems sustains the global order during thermoacoustic instability.
Submitted 29 July, 2023;
originally announced July 2023.
-
Unmasking Falsehoods in Reviews: An Exploration of NLP Techniques
Authors:
Anusuya Baby Hari Krishnan
Abstract:
In the contemporary digital landscape, online reviews have become an indispensable tool for promoting products and services across various businesses. Marketers, advertisers, and online businesses have found incentives to create deceptive positive reviews for their products and negative reviews for their competitors' offerings. As a result, the writing of deceptive reviews has become an unavoidable practice for businesses seeking to promote themselves or undermine their rivals. Detecting such deceptive reviews has become an intense and ongoing area of research. This research paper proposes a machine learning model to identify deceptive reviews, with a particular focus on restaurants. This study delves into the performance of numerous experiments conducted on a dataset of restaurant reviews known as the Deceptive Opinion Spam Corpus. To accomplish this, an n-gram model and max features are developed to effectively identify deceptive content, particularly focusing on fake reviews. A benchmark study is undertaken to explore the performance of two different feature extraction techniques, which are then coupled with five distinct machine learning classification algorithms. The experimental results reveal that the passive aggressive classifier stands out among the various algorithms, showcasing the highest accuracy not only in text classification but also in identifying fake reviews. Moreover, the research delves into data augmentation and implements various deep learning techniques to further enhance the process of detecting deceptive reviews. The findings shed light on the efficacy of the proposed machine learning approach and offer valuable insights into dealing with deceptive reviews in the realm of online businesses.
Submitted 24 July, 2023; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Revealing the Predictive Power of Neural Operators for Strain Evolution in Digital Composites
Authors:
Meer Mehran Rashid,
Souvik Chakraborty,
N. M. Anoop Krishnan
Abstract:
The demand for high-performance materials, along with advanced synthesis technologies such as additive manufacturing and 3D printing, has spurred the development of hierarchical composites with superior properties. However, computational modelling of such composites using physics-based solvers, while enabling the discovery of optimal microstructures, has a prohibitively high computational cost, hindering its practical application. To this end, we show that Neural Operators (NOs) can be used to learn and predict the strain evolution in 2D digital composites. Specifically, we consider three architectures, namely, Fourier NO (FNO), Wavelet NO (WNO), and Multi-wavelet NO (MWT). We demonstrate that by providing a few initial strain frames as input, NOs can accurately predict multiple future time steps in an extremely data-efficient fashion, especially WNO. Further, once trained, NOs forecast the strain trajectories for completely unseen boundary conditions. Among NOs, only FNO offers super-resolution capabilities for estimating strains at multiple length scales, which can provide higher material and pixel-wise resolution. We also show that NOs can generalize to arbitrary geometries with finer domain resolution without the need for additional training. Based on all the results presented, we note that the FNO exhibits the best performance among the NOs, while also giving minimum inference time that is almost three orders of magnitude lower than the conventional finite element solutions. Thus, FNOs can be used as a surrogate for accelerated simulation of the strain evolution in complex microstructures toward designing the next composite materials.
Submitted 18 July, 2023;
originally announced July 2023.
-
Discovering Symbolic Laws Directly from Trajectories with Hamiltonian Graph Neural Networks
Authors:
Suresh Bishnoi,
Ravinder Bhattoo,
Jayadeva,
Sayan Ranu,
N M Anoop Krishnan
Abstract:
The time evolution of physical systems is described by differential equations, which depend on abstract quantities like energy and force. Traditionally, these quantities are derived as functionals based on observables such as positions and velocities. Discovering these governing symbolic laws is the key to comprehending the interactions in nature. Here, we present a Hamiltonian graph neural network (HGNN), a physics-enforced GNN that learns the dynamics of systems directly from their trajectory. We demonstrate the performance of HGNN on n-springs, n-pendulums, gravitational systems, and binary Lennard-Jones systems; HGNN learns the dynamics in excellent agreement with the ground truth from small amounts of data. We also evaluate the ability of HGNN to generalize to larger system sizes, and to a hybrid spring-pendulum system that is a combination of two original systems (spring and pendulum) on which the models are trained independently. Finally, employing symbolic regression on the learned HGNN, we infer the underlying equations relating the energy functionals, even for complex systems such as the binary Lennard-Jones liquid. Our framework facilitates the interpretable discovery of interaction laws directly from physical system trajectories. Furthermore, this approach can be extended to other systems with topology-dependent dynamics, such as cells, polydisperse gels, or deformable bodies.
Submitted 11 July, 2023;
originally announced July 2023.
-
Graph Neural Stochastic Differential Equations for Learning Brownian Dynamics
Authors:
Suresh Bishnoi,
Jayadeva,
Sayan Ranu,
N. M. Anoop Krishnan
Abstract:
Neural networks (NNs) that exploit strong inductive biases based on physical laws and symmetries have shown remarkable success in learning the dynamics of physical systems directly from their trajectory. However, these works focus only on the systems that follow deterministic dynamics, for instance, Newtonian or Hamiltonian dynamics. Here, we propose a framework, namely Brownian graph neural networks (BROGNET), combining stochastic differential equations (SDEs) and GNNs to learn Brownian dynamics directly from the trajectory. We theoretically show that BROGNET conserves the linear momentum of the system, which in turn, provides superior performance on learning dynamics as revealed empirically. We demonstrate this approach on several systems, namely, linear spring, linear spring with binary particle types, and non-linear spring systems, all following Brownian dynamics at finite temperatures. We show that BROGNET significantly outperforms proposed baselines across all the benchmarked Brownian systems. In addition, we demonstrate zero-shot generalizability of BROGNET to simulate unseen system sizes that are two orders of magnitude larger and to different temperatures than those used during training. Altogether, our study contributes to advancing the understanding of the intricate dynamics of Brownian motion and demonstrates the effectiveness of graph neural networks in modeling such complex systems.
Submitted 20 June, 2023;
originally announced June 2023.
-
On the N-gram Approximation of Pre-trained Language Models
Authors:
Aravind Krishnan,
Jesujoba Alabi,
Dietrich Klakow
Abstract:
Large pre-trained language models (PLMs) have shown remarkable performance across various natural language understanding (NLU) tasks, particularly in low-resource settings. Nevertheless, their potential in Automatic Speech Recognition (ASR) remains largely unexplored. This study investigates the potential usage of PLMs for language modelling in ASR. We compare the application of large-scale text sampling and probability conversion for approximating GPT-2 into an n-gram model. Furthermore, we introduce a vocabulary-restricted decoding method for random sampling, and evaluate the effects of domain difficulty and data size on the usability of generated text. Our findings across eight domain-specific corpora support the use of sampling-based approximation and show that interpolating with a large sampled corpus improves test perplexity over a baseline trigram by 15%. Our vocabulary-restricted decoding method pushes this improvement further by 5% in domain-specific settings.
Submitted 12 June, 2023;
originally announced June 2023.
-
MLHOps: Machine Learning for Healthcare Operations
Authors:
Faiza Khan Khattak,
Vallijah Subasri,
Amrit Krishnan,
Elham Dolatabadi,
Deval Pandya,
Laleh Seyyed-Kalantari,
Frank Rudzicz
Abstract:
Machine Learning Health Operations (MLHOps) is the combination of processes for reliable, efficient, usable, and ethical deployment and maintenance of machine learning models in healthcare settings. This paper provides both a survey of work in this area and guidelines for developers and clinicians to deploy and maintain their own models in clinical practice. We cover the foundational concepts of general machine learning operations and describe the initial setup of MLHOps pipelines (including data sources, preparation, engineering, and tools). We then describe long-term monitoring and updating (including data distribution shifts and model updating) and ethical considerations (including bias, fairness, interpretability, and privacy). This work therefore provides guidance across the full pipeline of MLHOps from conception to initial and ongoing deployment.
Submitted 3 May, 2023;
originally announced May 2023.
-
Exploring shared memory architectures for end-to-end gigapixel deep learning
Authors:
Lucas W. Remedios,
Leon Y. Cai,
Samuel W. Remedios,
Karthik Ramadass,
Aravind Krishnan,
Ruining Deng,
Can Cui,
Shunxing Bao,
Lori A. Coburn,
Yuankai Huo,
Bennett A. Landman
Abstract:
Deep learning has made great strides in medical imaging, enabled by hardware advances in GPUs. One major constraint for the development of new models has been the saturation of GPU memory resources during training. This is especially true in computational pathology, where images regularly contain more than 1 billion pixels. These pathological images are traditionally divided into small patches to enable deep learning due to hardware limitations. In this work, we explore whether the shared GPU/CPU memory architecture on the M1 Ultra systems-on-a-chip (SoCs) recently released by Apple, Inc. may provide a solution. These affordable systems (less than \$5000) provide access to 128 GB of unified memory (Mac Studio with M1 Ultra SoC). As a proof of concept for gigapixel deep learning, we identified tissue from background on gigapixel areas from whole slide images (WSIs). The model was a modified U-Net (4492 parameters) leveraging large kernels and high stride. The M1 Ultra SoC was able to train the model directly on gigapixel images (16000$\times$64000 pixels, 1.024 billion pixels) with a batch size of 1 using over 100 GB of unified memory for the process at an average speed of 1 minute and 21 seconds per batch with TensorFlow 2/Keras. As expected, the model converged with a high Dice score of 0.989 $\pm$ 0.005. Training up until this point took 111 hours and 24 minutes over 4940 steps. Other high RAM GPUs like the NVIDIA A100 (largest commercially accessible at 80 GB, $\sim$\$15000) are not yet widely available (in preview for select regions on Amazon Web Services at \$40.96/hour as a group of 8). This study is a promising step towards WSI-wise end-to-end deep learning with prevalent network architectures.
Submitted 24 April, 2023;
originally announced April 2023.
-
Zero-shot CT Field-of-view Completion with Unconditional Generative Diffusion Prior
Authors:
Kaiwen Xu,
Aravind R. Krishnan,
Thomas Z. Li,
Yuankai Huo,
Kim L. Sandler,
Fabien Maldonado,
Bennett A. Landman
Abstract:
Anatomically consistent field-of-view (FOV) completion to recover truncated body sections has important applications in quantitative analyses of computed tomography (CT) with limited FOV. The existing solution based on conditional generative models relies on the fidelity of synthetic truncation patterns at the training phase, which limits the generalizability of the method to potential unknown types of truncation. In this study, we evaluate a zero-shot method based on a pretrained unconditional generative diffusion prior, where truncation patterns of arbitrary form can be specified at the inference phase. In evaluation on simulated chest CT slices with synthetic FOV truncation, the method is capable of recovering anatomically consistent body sections and correcting the subcutaneous adipose tissue measurement error caused by FOV truncation. However, the correction accuracy is inferior to the conditionally trained counterpart.
Submitted 7 April, 2023;
originally announced April 2023.
-
LANe: Lighting-Aware Neural Fields for Compositional Scene Synthesis
Authors:
Akshay Krishnan,
Amit Raj,
Xianling Zhang,
Alexandra Carlson,
Nathan Tseng,
Sandhya Sridhar,
Nikita Jaipuria,
James Hays
Abstract:
Neural fields have recently enjoyed great success in representing and rendering 3D scenes. However, most state-of-the-art implicit representations model static or dynamic scenes as a whole, with minor variations. Existing work on learning disentangled world and object neural fields does not consider the problem of composing objects into different world neural fields in a lighting-aware manner. We present Lighting-Aware Neural Field (LANe) for the compositional synthesis of driving scenes in a physically consistent manner. Specifically, we learn a scene representation that disentangles the static background and transient elements into a world-NeRF and class-specific object-NeRFs to allow compositional synthesis of multiple objects in the scene. Furthermore, we explicitly designed both the world and object models to handle lighting variation, which allows us to compose objects into scenes with spatially varying lighting. This is achieved by constructing a light field of the scene and using it in conjunction with a learned shader to modulate the appearance of the object NeRFs. We demonstrate the performance of our model on a synthetic dataset of diverse lighting conditions rendered with the CARLA simulator, as well as a novel real-world dataset of cars collected at different times of the day. Our approach outperforms state-of-the-art compositional scene synthesis on the challenging dataset setup, via composing object-NeRFs learned from one scene into an entirely different scene whilst still respecting the lighting variations in the novel scene. For more results, please visit our project website https://lane-composition.github.io/.
Submitted 6 April, 2023;
originally announced April 2023.