-
Cost-efficient Active Illumination Camera For Hyper-spectral Reconstruction
Authors:
Yuxuan Zhang,
T. M. Sazzad,
Yangyang Song,
Spencer J. Chang,
Ritesh Chowdhry,
Tomas Mejia,
Anna Hampton,
Shelby Kucharski,
Stefan Gerber,
Barry Tillman,
Marcio F. R. Resende,
William M. Hammond,
Chris H. Wilson,
Alina Zare,
Sanjeev J. Koppal
Abstract:
Hyper-spectral imaging has recently gained increasing attention for use in different applications, including agricultural investigation, ground tracking, remote sensing and many other. However, the high cost, large physical size and complicated operation process stop hyperspectral cameras from being employed for various applications and research fields. In this paper, we introduce a cost-efficient…
▽ More
Hyper-spectral imaging has recently gained increasing attention for use in different applications, including agricultural investigation, ground tracking, remote sensing and many other. However, the high cost, large physical size and complicated operation process stop hyperspectral cameras from being employed for various applications and research fields. In this paper, we introduce a cost-efficient, compact and easy to use active illumination camera that may benefit many applications. We developed a fully functional prototype of such camera. With the hope of hel** with agricultural research, we tested our camera for plant root imaging. In addition, a U-Net model for spectral reconstruction was trained by using a reference hyperspectral camera's data as ground truth and our camera's data as input. We demonstrated our camera's ability to obtain additional information over a typical RGB camera. In addition, the ability to reconstruct hyperspectral data from multi-spectral input makes our device compatible to models and algorithms developed for hyperspectral applications with no modifications required.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Jump Starting Bandits with LLM-Generated Prior Knowledge
Authors:
Parand A. Alamdari,
Yanshuai Cao,
Kevin H. Wilson
Abstract:
We present substantial evidence demonstrating the benefits of integrating Large Language Models (LLMs) with a Contextual Multi-Armed Bandit framework. Contextual bandits have been widely used in recommendation systems to generate personalized suggestions based on user-specific contexts. We show that LLMs, pre-trained on extensive corpora rich in human knowledge and preferences, can simulate human…
▽ More
We present substantial evidence demonstrating the benefits of integrating Large Language Models (LLMs) with a Contextual Multi-Armed Bandit framework. Contextual bandits have been widely used in recommendation systems to generate personalized suggestions based on user-specific contexts. We show that LLMs, pre-trained on extensive corpora rich in human knowledge and preferences, can simulate human behaviours well enough to jump-start contextual multi-armed bandits to reduce online learning regret. We propose an initialization algorithm for contextual bandits by prompting LLMs to produce a pre-training dataset of approximate human preferences for the bandit. This significantly reduces online learning regret and data-gathering costs for training such models. Our approach is validated empirically through two sets of experiments with different bandit setups: one which utilizes LLMs to serve as an oracle and a real-world experiment utilizing data from a conjoint survey experiment.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Counterfactual Explanations for Multivariate Time-Series without Training Datasets
Authors:
Xiangyu Sun,
Raquel Aoki,
Kevin H. Wilson
Abstract:
Machine learning (ML) methods have experienced significant growth in the past decade, yet their practical application in high-impact real-world domains has been hindered by their opacity. When ML methods are responsible for making critical decisions, stakeholders often require insights into how to alter these decisions. Counterfactual explanations (CFEs) have emerged as a solution, offering interp…
▽ More
Machine learning (ML) methods have experienced significant growth in the past decade, yet their practical application in high-impact real-world domains has been hindered by their opacity. When ML methods are responsible for making critical decisions, stakeholders often require insights into how to alter these decisions. Counterfactual explanations (CFEs) have emerged as a solution, offering interpretations of opaque ML models and providing a pathway to transition from one decision to another. However, most existing CFE methods require access to the model's training dataset, few methods can handle multivariate time-series, and none can handle multivariate time-series without training datasets. These limitations can be formidable in many scenarios. In this paper, we present CFWoT, a novel reinforcement-learning-based CFE method that generates CFEs when training datasets are unavailable. CFWoT is model-agnostic and suitable for both static and multivariate time-series datasets with continuous and discrete features. Users have the flexibility to specify non-actionable, immutable, and preferred features, as well as causal constraints which CFWoT guarantees will be respected. We demonstrate the performance of CFWoT against four baselines on several datasets and find that, despite not having access to a training dataset, CFWoT finds CFEs that make significantly fewer and significantly smaller changes to the input time-series. These properties make CFEs more actionable, as the magnitude of change required to alter an outcome is vastly reduced.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Plug-and-Play Stability for Intracortical Brain-Computer Interfaces: A One-Year Demonstration of Seamless Brain-to-Text Communication
Authors:
Chaofei Fan,
Nick Hahn,
Foram Kamdar,
Donald Avansino,
Guy H. Wilson,
Leigh Hochberg,
Krishna V. Shenoy,
Jaimie M. Henderson,
Francis R. Willett
Abstract:
Intracortical brain-computer interfaces (iBCIs) have shown promise for restoring rapid communication to people with neurological disorders such as amyotrophic lateral sclerosis (ALS). However, to maintain high performance over time, iBCIs typically need frequent recalibration to combat changes in the neural recordings that accrue over days. This requires iBCI users to stop using the iBCI and engag…
▽ More
Intracortical brain-computer interfaces (iBCIs) have shown promise for restoring rapid communication to people with neurological disorders such as amyotrophic lateral sclerosis (ALS). However, to maintain high performance over time, iBCIs typically need frequent recalibration to combat changes in the neural recordings that accrue over days. This requires iBCI users to stop using the iBCI and engage in supervised data collection, making the iBCI system hard to use. In this paper, we propose a method that enables self-recalibration of communication iBCIs without interrupting the user. Our method leverages large language models (LMs) to automatically correct errors in iBCI outputs. The self-recalibration process uses these corrected outputs ("pseudo-labels") to continually update the iBCI decoder online. Over a period of more than one year (403 days), we evaluated our Continual Online Recalibration with Pseudo-labels (CORP) framework with one clinical trial participant. CORP achieved a stable decoding accuracy of 93.84% in an online handwriting iBCI task, significantly outperforming other baseline methods. Notably, this is the longest-running iBCI stability demonstration involving a human participant. Our results provide the first evidence for long-term stabilization of a plug-and-play, high-performance communication iBCI, addressing a major barrier for the clinical translation of iBCIs.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Performance of data-driven inner speech decoding with same-task EEG-fMRI data fusion and bimodal models
Authors:
Holly Wilson,
Scott Wellington,
Foteini Simistira Liwicki,
Vibha Gupta,
Rajkumar Saini,
Kanjar De,
Nosheen Abid,
Sumit Rakesh,
Johan Eriksson,
Oliver Watts,
Xi Chen,
Mohammad Golbabaee,
Michael J. Proulx,
Marcus Liwicki,
Eamonn O'Neill,
Benjamin Metcalfe
Abstract:
Decoding inner speech from the brain signal via hybridisation of fMRI and EEG data is explored to investigate the performance benefits over unimodal models. Two different bimodal fusion approaches are examined: concatenation of probability vectors output from unimodal fMRI and EEG machine learning models, and data fusion with feature engineering. Same task inner speech data are recorded from four…
▽ More
Decoding inner speech from the brain signal via hybridisation of fMRI and EEG data is explored to investigate the performance benefits over unimodal models. Two different bimodal fusion approaches are examined: concatenation of probability vectors output from unimodal fMRI and EEG machine learning models, and data fusion with feature engineering. Same task inner speech data are recorded from four participants, and different processing strategies are compared and contrasted to previously-employed hybridisation methods. Data across participants are discovered to encode different underlying structures, which results in varying decoding performances between subject-dependent fusion models. Decoding performance is demonstrated as improved when pursuing bimodal fMRI-EEG fusion strategies, if the data show underlying structure.
△ Less
Submitted 19 June, 2023;
originally announced June 2023.
-
A Closer Look at Some Recent Proof Compression-Related Claims
Authors:
Michael C. Chavrimootoo,
Ethan Ferland,
Erin Gibson,
Ashley H. Wilson
Abstract:
Gordeev and Haeusler [GH19] claim that each tautology $ρ$ of minimal propositional logic can be proved with a natural deduction of size polynomial in $|ρ|$. This builds on work from Hudelmaier [Hud93] that found a similar result for intuitionistic propositional logic, but for which only the height of the proof was polynomially bounded, not the overall size. They arrive at this result by transformi…
▽ More
Gordeev and Haeusler [GH19] claim that each tautology $ρ$ of minimal propositional logic can be proved with a natural deduction of size polynomial in $|ρ|$. This builds on work from Hudelmaier [Hud93] that found a similar result for intuitionistic propositional logic, but for which only the height of the proof was polynomially bounded, not the overall size. They arrive at this result by transforming a proof in Hudelmaier's sequent calculus into an equivalent tree-like proof in Prawitz's system of natural deduction, and then compressing the tree-like proof into an equivalent DAG-like proof in such a way that a polynomial bound on the height and foundation implies a polynomial bound on the overall size. Our paper, however, observes that this construction was performed only on minimal implicational logic, which we show to be weaker than the minimal propositional logic for which they claim the result (see Section 4.2). Simply extending the logic systems used to cover minimal propositional logic would not be sufficient to recover the results of the paper, as it would entirely disrupt proofs of a number of the theorems that are critical to proving the main result. Relying heavily on their aforementioned work, Gordeev and Haeusler [GH20] claim to establish NP=PSPACE. The argument centrally depends on the polynomial bound on proof size in minimal propositional logic. Since we show that that bound has not been correctly established by them, their purported proof does not correctly establish NP=PSPACE.
△ Less
Submitted 23 December, 2022;
originally announced December 2022.
-
Multi-scale metrics and self-organizing maps: a computational approach to the structure of sensory maps
Authors:
William H. Wilson
Abstract:
This paper introduces the concept of a bi-scale metric for use in the cooperative phase of the self-organizing map (SOM) algorithm. Use of a bi-scale metric allows segmentation of the map into a number of regions, corresponding to anticipated cluster structure in the data. Such a situation occurs, for example, in the somatotopic maps which inspired the SOM algo- rithm, where clusters of data may c…
▽ More
This paper introduces the concept of a bi-scale metric for use in the cooperative phase of the self-organizing map (SOM) algorithm. Use of a bi-scale metric allows segmentation of the map into a number of regions, corresponding to anticipated cluster structure in the data. Such a situation occurs, for example, in the somatotopic maps which inspired the SOM algo- rithm, where clusters of data may correspond to body surface regions whose general structure is known. When a bi-scale metric is appropriately applied, issues with map neurons that are not activated by any point in the training data are reduced or eliminated. The paper also presents results of simulation studies on the plasticity of bi-scale metric maps when they are retrained af- ter loss of groups of map neurons or after changes in training data (such as would occur in a somatotopic map when a body surface region like a finger is lost/removed). The paper further considers situations where tri-scale met- rics may be useful, and an alternative approach suggested by neurobiology, where some map regions adapt more slowly to stimuli because they have a lower learning rate parameter.
△ Less
Submitted 8 May, 2018;
originally announced May 2018.
-
Rayleigh Quotient Iteration with a Multigrid in Energy Preconditioner for Massively Parallel Neutron Transport
Authors:
R. N. Slaybaugh,
T. M. Evans,
G. G. Davidson,
P. P. H. Wilson
Abstract:
Three complementary methods have been implemented in the code Denovo that accelerate neutral particle transport calculations with methods that use leadership-class computers fully and effectively: a multigroup block (MG) Krylov solver, a Rayleigh quotient iteration (RQI) eigenvalue solver, and a multigrid in energy preconditioner. The multigroup Krylov solver converges more quickly than Gauss Seid…
▽ More
Three complementary methods have been implemented in the code Denovo that accelerate neutral particle transport calculations with methods that use leadership-class computers fully and effectively: a multigroup block (MG) Krylov solver, a Rayleigh quotient iteration (RQI) eigenvalue solver, and a multigrid in energy preconditioner. The multigroup Krylov solver converges more quickly than Gauss Seidel and enables energy decomposition such that Denovo can scale to hundreds of thousands of cores. The new multigrid in energy preconditioner reduces iteration count for many problem types and takes advantage of the new energy decomposition such that it can scale efficiently. These two tools are useful on their own, but together they enable the RQI eigenvalue solver to work. Each individual method has been described before, but this is the first time they have been demonstrated to work together effectively.
RQI should converge in fewer iterations than power iteration (PI) for large and challenging problems. RQI creates shifted systems that would not be tractable without the MG Krylov solver. It also creates ill-conditioned matrices that cannot converge without the multigrid in energy preconditioner. Using these methods together, RQI converged in fewer iterations and in less time than all PI calculations for a full pressurized water reactor core. It also scaled reasonably well out to 275,968 cores.
△ Less
Submitted 7 February, 2017;
originally announced February 2017.
-
Challenging Fuel Cycle Modeling Assumptions: Facility and Time Step Discretization Effects
Authors:
Robert W. Carlsen,
Paul P. H. Wilson
Abstract:
Due to the diversity of fuel cycle simulator modeling assumptions, direct comparison and benchmarking can be difficult. In 2012 the Organisation for Economic Co-operation and Development completed a benchmark study that is perhaps the most complete published comparison performed. Despite this, various results from the simulators were often significantly different because of inconsistencies in mode…
▽ More
Due to the diversity of fuel cycle simulator modeling assumptions, direct comparison and benchmarking can be difficult. In 2012 the Organisation for Economic Co-operation and Development completed a benchmark study that is perhaps the most complete published comparison performed. Despite this, various results from the simulators were often significantly different because of inconsistencies in modeling decisions involving reprocessing strategies, refueling behavior, reactor end-of-life handling, etc. This work identifies and quantifies the effects of selected modeling choices that may sometimes be taken for granted in the fuel cycle simulation domain. Four scenarios are compared using combinations of fleet-based or individually modeled reactors with monthly or quarterly (3-month) time steps. The scenarios approximate a transition from the current U.S. once-through light water reactor fleet to a full sodium fast reactor fuel cycle. The Cyclus fuel cycle simulator's plug-in capability along with its market-like dynamic material routing allow it to be used as a level playing field for comparing the scenarios. When under supply-constraint pressure, the four cases exhibit noticeably different behavior. Fleet-based modeling is more efficient in supply-constrained environments at the expense of losing insight on issues such as realistically suboptimal fuel distribution and challenges in reactor refueling cycle staggering. Finer-grained time steps enable more efficient material use in supply-constrained environments resulting in lower standing inventories of separated Pu. Large simulations with fleet-based reactors run much more quickly than their individual reactor counterparts. Gaining a better understanding of how these and other modeling choices affect fuel cycle dynamics will enable making more deliberate decisions with respect to trade-offs such as computational investment vs. realism.
△ Less
Submitted 4 May, 2016;
originally announced May 2016.
-
Back to the Basics: Bayesian extensions of IRT outperform neural networks for proficiency estimation
Authors:
Kevin H. Wilson,
Yan Karklin,
Bojian Han,
Chaitanya Ekanadham
Abstract:
Estimating student proficiency is an important task for computer based learning systems. We compare a family of IRT-based proficiency estimation methods to Deep Knowledge Tracing (DKT), a recently proposed recurrent neural network model with promising initial results. We evaluate how well each model predicts a student's future response given previous responses using two publicly available and one…
▽ More
Estimating student proficiency is an important task for computer based learning systems. We compare a family of IRT-based proficiency estimation methods to Deep Knowledge Tracing (DKT), a recently proposed recurrent neural network model with promising initial results. We evaluate how well each model predicts a student's future response given previous responses using two publicly available and one proprietary data set. We find that IRT-based methods consistently matched or outperformed DKT across all data sets at the finest level of content granularity that was tractable for them to be trained on. A hierarchical extension of IRT that captured item grou** structure performed best overall. When data sets included non-trivial autocorrelations in student response patterns, a temporal extension of IRT improved performance over standard IRT while the RNN-based method did not. We conclude that IRT-based models provide a simpler, better-performing alternative to existing RNN-based models of student interaction data while also affording more interpretability and guarantees due to their formulation as Bayesian probabilistic models.
△ Less
Submitted 21 May, 2016; v1 submitted 8 April, 2016;
originally announced April 2016.
-
Cyclus Archetypes
Authors:
Anthony M. Scopatz,
Matthew J. Gidden,
Robert W. Carlsen,
Robert R. Flanagan,
Kathryn D. Huff,
Meghan B. McGarry,
Arrielle C. Opotowsky,
Olzhas Rakhimov,
Zach Welch,
Paul P. H. Wilson
Abstract:
The current state of nuclear fuel cycle simulation exists in highly customized form. Satisfying a wide range of users requires model modularity within such a tool. Cyclus is a fuel cycle simulator specifically designed to combat the lack of adaptability of previous generations of simulators. This is accomplished through an agent-based infrastructure and treating time discretely. The Cyclus kernel…
▽ More
The current state of nuclear fuel cycle simulation exists in highly customized form. Satisfying a wide range of users requires model modularity within such a tool. Cyclus is a fuel cycle simulator specifically designed to combat the lack of adaptability of previous generations of simulators. This is accomplished through an agent-based infrastructure and treating time discretely. The Cyclus kernel was developed to allow for models, called archetypes, of differing fidelity and function depending on need of the users. To take advantage of this flexibility, a user must write an archetype for their desired simulation if it does not yet exist within the Cyclus ecosystem. At this stage, a user graduates to the title of archetype developer.
Without automation, archetype development is difficult for the uninitiated. This paper presents the framework developed for simplifying the writing of archetypes: the Cyclus preprocessor, or cycpp. cycpp addresses the computer science and software development aspects of archetype development that can be addressed algorithmically, allowing the developer to focus on modeling the physics, social policies, and economics. cycpp passes through the code three times to perform the following tasks: normalizing the code via the C preprocessor, accumulation of notations, and code generation. Not only does this reduce the amount of code a developer must write by approximately an order of magnitude, but the archetypes are automatically validated.
△ Less
Submitted 17 November, 2015;
originally announced November 2015.
-
Fundamental concepts in the Cyclus nuclear fuel cycle simulation framework
Authors:
Kathryn D. Huff,
Matthew J. Gidden,
Robert W. Carlsen,
Robert R. Flanagan,
Meghan B. McGarry,
Arrielle C. Opotowsky,
Erich A. Schneider,
Anthony M. Scopatz,
Paul P. H. Wilson
Abstract:
As nuclear power expands, technical, economic, political, and environmental analyses of nuclear fuel cycles by simulators increase in importance. To date, however, current tools are often fleet-based rather than discrete and restrictively licensed rather than open source. Each of these choices presents a challenge to modeling fidelity, generality, efficiency, robustness, and scientific transparenc…
▽ More
As nuclear power expands, technical, economic, political, and environmental analyses of nuclear fuel cycles by simulators increase in importance. To date, however, current tools are often fleet-based rather than discrete and restrictively licensed rather than open source. Each of these choices presents a challenge to modeling fidelity, generality, efficiency, robustness, and scientific transparency. The Cyclus nuclear fuel cycle simulator framework and its modeling ecosystem incorporate modern insights from simulation science and software architecture to solve these problems so that challenges in nuclear fuel cycle analysis can be better addressed. A summary of the Cyclus fuel cycle simulator framework and its modeling ecosystem are presented. Additionally, the implementation of each is discussed in the context of motivating challenges in nuclear fuel cycle simulation. Finally, the current capabilities of Cyclus are demonstrated for both open and closed fuel cycles.
△ Less
Submitted 11 March, 2016; v1 submitted 11 September, 2015;
originally announced September 2015.