-
State-Based Automation for Time-Restricted Eating Adherence
Authors:
Samuel E. Armstrong,
Aaron D. Mullen,
J. Matthew Thomas,
Dorothy D. Sears,
Julie S. Pendergast,
Jeffrey Talbert,
Cody Bumgardner
Abstract:
Developing and enforcing study protocols is a foundational component of medical research. As study complexity for participant interactions increases, translating study protocols to supporting application code becomes challenging. A collaboration exists between the University of Kentucky and Arizona State University to determine the efficacy of time-restricted eating in improving metabolic risk among postmenopausal women. This study utilizes a graph-based approach to monitor and support adherence to a designated schedule, enabling the validation and step-wise audit of participants' statuses to derive dependable conclusions. A texting service, driven by a participant graph, automatically manages interactions and collects data. Participant data is then accessible to the research study team via a website, which enables viewing, management, and exportation. This paper presents a system for automatically managing participants in a time-restricted eating study that eliminates time-consuming interactions with participants.
Submitted 26 June, 2024;
originally announced June 2024.
-
High Noise Scheduling is a Must
Authors:
Mahmut S. Gokmen,
Cody Bumgardner,
Jie Zhang,
Ge Wang,
** Chen
Abstract:
Consistency models are highly capable image generators that reduce sampling to a single step. Recent advances take consistency training a step further and eliminate the limitation of distillation training. Even though the curriculum and noise scheduling proposed in improved training techniques yield better results than basic consistency models, they lack a well-balanced noise distribution and consistency across the curriculum. This study investigates the balance between high and low noise levels in the noise distribution and proposes a polynomial noise distribution to maintain stability. The proposed polynomial noise distribution is also supported by predefined Karras noise levels to prevent unique noise levels from arising with the Karras noise generation algorithm. Furthermore, eliminating already-learned noisy steps with a curriculum based on a sinusoidal function improves the model's denoising performance. To make a fair comparison with the most recently released consistency model training techniques, experiments are conducted with the same hyperparameters except for the curriculum and noise distribution. The models used in the experiments are deliberately shallow to demonstrate the robustness of the proposed technique. The results show that the polynomial noise distribution outperforms the model trained with a log-normal noise distribution, yielding an FID score of 33.54 after 100,000 training steps with constant discretization steps. Additionally, the sinusoidal-based curriculum enhances denoising performance, resulting in an FID score of 30.48.
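For readers unfamiliar with the Karras noise levels mentioned in this abstract, the following is a minimal sketch of the standard Karras et al. discretization, listed in ascending order as is common in consistency training. The function name and the default values (`sigma_min=0.002`, `sigma_max=80.0`, `rho=7.0`) are assumptions taken from common practice, not from this listing. The paper's polynomial noise distribution and sinusoidal curriculum are not specified here and are not reproduced.

```python
def karras_sigmas(n, sigma_min=0.002, sigma_max=80.0, rho=7.0):
    """Return n noise levels from sigma_min up to sigma_max.

    Uses the Karras et al. spacing: interpolate linearly in
    sigma**(1/rho) space, then raise back to the power rho.
    Defaults are assumed, commonly used values.
    """
    if n < 2:
        raise ValueError("need at least two noise levels")
    lo = sigma_min ** (1.0 / rho)
    hi = sigma_max ** (1.0 / rho)
    return [(lo + (i / (n - 1)) * (hi - lo)) ** rho for i in range(n)]


sigmas = karras_sigmas(10)
```

With `rho` greater than 1, the levels are packed more densely near `sigma_min`, which is the low-noise versus high-noise balance that the abstract sets out to adjust.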
Submitted 9 April, 2024;
originally announced April 2024.
-
Multi-Modal Machine Learning Framework for Automated Seizure Detection in Laboratory Rats
Authors:
Aaron Mullen,
Samuel E. Armstrong,
Jasmine Perdeh,
Bjorn Bauer,
Jeffrey Talbert,
V. K. Cody Bumgardner
Abstract:
A multi-modal machine learning system uses multiple unique data sources and types to improve its performance. This article proposes a system that combines results from several types of models, all of which are trained on different data signals. As an example to illustrate the efficacy of the system, an experiment is described in which multiple types of data are collected from rats suffering from seizures. This data includes electrocorticography readings, piezoelectric motion sensor data, and video recordings. Separate models are trained on each type of data, with the goal of classifying each time frame as either containing a seizure or not. After each model has generated its classification predictions, these results are combined. While each data signal works adequately on its own for prediction purposes, the significant imbalance in class labels leads to increased numbers of false positives, which can be filtered and removed by utilizing all data sources. This paper will demonstrate that, after postprocessing and combination techniques, classification accuracy is improved with this multi-modal system when compared to the performance of each individual data source.
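The combination step described in this abstract can be pictured with a small sketch. The abstract does not state the combination rule, so the majority vote below is an assumption, and the function name `fuse_predictions`, the `min_votes` parameter, and the sample per-frame labels are all hypothetical.

```python
from typing import Sequence


def fuse_predictions(per_model_preds: Sequence[Sequence[int]],
                     min_votes: int = 2) -> list[int]:
    """Combine per-frame binary predictions from several models.

    A frame is labeled 1 (seizure) only if at least `min_votes`
    models agree, which suppresses false positives raised by a
    single data source. This voting rule is an assumed example.
    """
    if not per_model_preds:
        raise ValueError("need at least one model")
    n_frames = len(per_model_preds[0])
    if any(len(p) != n_frames for p in per_model_preds):
        raise ValueError("all models must cover the same frames")
    return [1 if sum(frame) >= min_votes else 0
            for frame in zip(*per_model_preds)]


# Hypothetical per-frame outputs from the three data sources.
ecog = [0, 1, 1, 0, 1]
piezo = [0, 1, 0, 0, 1]
video = [1, 1, 0, 0, 0]
fused = fuse_predictions([ecog, piezo, video])
```

Here the isolated positives in the first and third frames are dropped because only one source reports them.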
Submitted 1 February, 2024;
originally announced February 2024.
-
Institutional Platform for Secure Self-Service Large Language Model Exploration
Authors:
V. K. Cody Bumgardner,
Mitchell A. Klusty,
W. Vaiden Logan,
Samuel E. Armstrong,
Caylin Hickey,
Jeff Talbert
Abstract:
This paper introduces a user-friendly platform developed by the University of Kentucky Center for Applied AI, designed to make large, customized language models (LLMs) more accessible. By capitalizing on recent advancements in multi-LoRA inference, the system efficiently accommodates custom adapters for a diverse range of users and projects. The paper outlines the system's architecture and key features, encompassing dataset curation, model training, secure inference, and text-based feature extraction.
We illustrate the establishment of a tenant-aware computational network using agent-based methods, securely utilizing islands of isolated resources as a unified system. The platform strives to deliver secure LLM services, emphasizing process and data isolation, end-to-end encryption, and role-based resource authentication. This contribution aligns with the overarching goal of enabling simplified access to cutting-edge AI models and technology in support of scientific discovery.
Submitted 1 February, 2024;
originally announced February 2024.
-
CLASSify: A Web-Based Tool for Machine Learning
Authors:
Aaron D. Mullen,
Samuel E. Armstrong,
Jeff Talbert,
V. K. Cody Bumgardner
Abstract:
Machine learning classification problems are widespread in bioinformatics, but the technical knowledge required to perform model training, optimization, and inference can prevent researchers from utilizing this technology. This article presents an automated tool for machine learning classification problems to simplify the process of training models and producing results while providing informative visualizations and insights into the data. This tool supports both binary and multiclass classification problems, and it provides access to a variety of models and methods. Synthetic data can be generated within the interface to fill missing values, balance class labels, or generate entirely new datasets. It also provides support for feature evaluation and generates explainability scores to indicate which features influence the output the most. We present CLASSify, an open-source tool for simplifying the user experience of solving classification problems without the need for knowledge of machine learning.
Submitted 5 October, 2023;
originally announced October 2023.
-
Local Large Language Models for Complex Structured Medical Tasks
Authors:
V. K. Cody Bumgardner,
Aaron Mullen,
Sam Armstrong,
Caylin Hickey,
Jeff Talbert
Abstract:
This paper introduces an approach that combines the language reasoning capabilities of large language models (LLMs) with the benefits of local training to tackle complex, domain-specific tasks. Specifically, the authors demonstrate their approach by extracting structured condition codes from pathology reports. The proposed approach utilizes local LLMs, which can be fine-tuned to respond to specific generative instructions and provide structured outputs. The authors collected a dataset of over 150k uncurated surgical pathology reports, containing gross descriptions, final diagnoses, and condition codes. They trained different model architectures, including LLaMA, BERT, and LongFormer, and evaluated their performance. The results show that the LLaMA-based models significantly outperform BERT-style models across all evaluated metrics, even with extremely reduced precision. The LLaMA models performed especially well with large datasets, demonstrating their ability to handle complex, multi-label tasks. Overall, this work presents an effective approach for utilizing LLMs to perform domain-specific tasks using accessible hardware, with potential applications in the medical domain, where complex data extraction and classification are required.
Submitted 3 August, 2023;
originally announced August 2023.
-
SmartState: A Protocol-driven Human Interface
Authors:
Samuel E. Armstrong,
Aaron D. Mullen,
V. K. Cody Bumgardner
Abstract:
Since the inception of human research studies, researchers have often had to interact with participants on a set schedule to collect data. Researchers manually perform many of these interactions, leading to considerable time and financial expenses. Usually, user-provided data collection consists of surveys administered via telephone or email. These methods are tedious for the survey administrators, which can cause fatigue and potentially lead to collection mistakes. This project leverages recent advancements in automatic speech recognition, speech-to-text, natural language understanding (NLU), and finite-state machines to automate research protocols. This generalized application is fully customizable and independent of any particular research study. New research protocols can be quickly created based on these parameters once envisioned. Thus, we present SmartState, a fully customizable, state-driven protocol manager combined with supporting AI components to autonomously manage user data and intelligently determine users' intentions through chat and end-device interactions.
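To make the state-driven idea in this abstract concrete, here is a minimal finite-state machine sketch. It is not SmartState's implementation: the class name, the state names, and the event names are all hypothetical and chosen only to illustrate how a protocol can be encoded as a transition table with an auditable history.

```python
class ProtocolStateMachine:
    """Tiny finite-state machine driven by a transition table."""

    def __init__(self, transitions, initial):
        # transitions maps (current_state, event) -> next_state
        self.transitions = transitions
        self.state = initial
        self.history = [initial]  # step-by-step record for auditing

    def handle(self, event):
        key = (self.state, event)
        if key not in self.transitions:
            raise ValueError(f"event {event!r} not allowed in state {self.state!r}")
        self.state = self.transitions[key]
        self.history.append(self.state)
        return self.state


# Hypothetical daily check-in protocol.
TRANSITIONS = {
    ("waiting", "reminder_due"): "prompted",
    ("prompted", "reply_received"): "recorded",
    ("prompted", "timeout"): "missed",
    ("recorded", "next_day"): "waiting",
    ("missed", "next_day"): "waiting",
}

machine = ProtocolStateMachine(TRANSITIONS, "waiting")
for event in ["reminder_due", "reply_received", "next_day"]:
    machine.handle(event)
```

Because the protocol lives in the transition table rather than in control flow, a new study could in principle be described by supplying a different table.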
Submitted 31 July, 2023; v1 submitted 7 May, 2023;
originally announced May 2023.
-
Semantic Enrichment of Streaming Healthcare Data
Authors:
Daniel Cotter,
V. K. Cody Bumgardner
Abstract:
In the past decade, the healthcare industry has made significant advances in the digitization of patient information. However, a lack of interoperability among healthcare systems still imposes a high cost to patients, hospitals, and insurers. Currently, most systems pass messages using idiosyncratic messaging standards that require specialized knowledge to interpret. This increases the cost of systems integration and often puts more advanced uses of data out of reach. In this project, we demonstrate how two open standards, FHIR and RDF, can be combined both to integrate data from disparate sources in real-time and make that data queryable and susceptible to automated inference. To validate the effectiveness of the semantic engine, we perform simulations of real-time data feeds and demonstrate how they can be combined and used by client-side applications with no knowledge of the underlying sources.
Submitted 1 December, 2019;
originally announced December 2019.
-
Toward Edge-enabled Cyber-Physical Systems Testbeds
Authors:
V. K. Cody Bumgardner,
Nima Seyedtalebi,
Caylin Hickey
Abstract:
The use of edge computing can be extremely valuable in support of CPS efforts. However, few, if any, testbeds provide the type of resource control and provisioning required to support edge-enabled CPS experimentation. Likewise, commercial offerings provide operational capabilities but lack the distributed infrastructure and transparency provided by research testbeds. In this paper, we propose methods to develop new testbeds and augment existing ones to better support the challenges of edge computing and CPS research. The proposed network is specifically designed to address the challenges associated with edge-based provisioning, data collection, analysis, monitoring, and measurement across islands of edge and data center resources.
We present the purpose of our work, the basic architecture, initial results, the relationship to the existing software, and the potential of an existing edge-focused framework to support the foundations of edge-focused CPS testbeds.
Submitted 2 October, 2019;
originally announced October 2019.