-
Health Index Estimation Through Integration of General Knowledge with Unsupervised Learning
Authors:
Kristupas Bajarunas,
Marcia L. Baptista,
Kai Goebel,
Manuel A. Chao
Abstract:
Accurately estimating a Health Index (HI) from condition monitoring (CM) data is essential for reliable and interpretable prognostics and health management (PHM) in complex systems. In most scenarios, complex systems operate under varying operating conditions and can exhibit different fault modes, making unsupervised inference of an HI from CM data a significant challenge. Hybrid models combining prior knowledge about degradation with deep learning models have been proposed to overcome this challenge. However, previously suggested hybrid models for HI estimation usually rely heavily on system-specific information, limiting their transferability to other systems. In this work, we propose an unsupervised hybrid method for HI estimation that integrates general knowledge about degradation into the convolutional autoencoder's model architecture and learning algorithm, enhancing its applicability across various systems. The effectiveness of the proposed method is demonstrated in two case studies from different domains: turbofan engines and lithium batteries. The results show that the proposed method outperforms other competitive alternatives, including residual-based methods, in terms of HI quality and the utility of the HIs for Remaining Useful Life (RUL) predictions. The case studies also highlight that the performance of our proposed method is comparable to that of a supervised model trained with HI labels.
Submitted 8 May, 2024;
originally announced May 2024.
-
From Local to Global: A Graph RAG Approach to Query-Focused Summarization
Authors:
Darren Edge,
Ha Trinh,
Newman Cheng,
Joshua Bradley,
Alex Chao,
Apurva Mody,
Steven Truitt,
Jonathan Larson
Abstract:
The use of retrieval-augmented generation (RAG) to retrieve relevant information from an external knowledge source enables large language models (LLMs) to answer questions over private and/or previously unseen document collections. However, RAG fails on global questions directed at an entire text corpus, such as "What are the main themes in the dataset?", since this is inherently a query-focused summarization (QFS) task, rather than an explicit retrieval task. Prior QFS methods, meanwhile, fail to scale to the quantities of text indexed by typical RAG systems. To combine the strengths of these contrasting methods, we propose a Graph RAG approach to question answering over private text corpora that scales with both the generality of user questions and the quantity of source text to be indexed. Our approach uses an LLM to build a graph-based text index in two stages: first to derive an entity knowledge graph from the source documents, then to pregenerate community summaries for all groups of closely-related entities. Given a question, each community summary is used to generate a partial response, before all partial responses are again summarized in a final response to the user. For a class of global sensemaking questions over datasets in the 1 million token range, we show that Graph RAG leads to substantial improvements over a naïve RAG baseline for both the comprehensiveness and diversity of generated answers. An open-source, Python-based implementation of both global and local Graph RAG approaches is forthcoming at https://aka.ms/graphrag.
Submitted 24 April, 2024;
originally announced April 2024.
-
The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Authors:
Tula Masterman,
Sandi Besen,
Mason Sawtell,
Alex Chao
Abstract:
This survey paper examines the recent advancements in AI agent implementations, with a focus on their ability to achieve complex goals that require enhanced reasoning, planning, and tool execution capabilities. The primary objectives of this work are to a) communicate the current capabilities and limitations of existing AI agent implementations, b) share insights gained from our observations of these systems in action, and c) suggest important considerations for future developments in AI agent design. We achieve this by providing overviews of single-agent and multi-agent architectures, identifying key patterns and divergences in design choices, and evaluating their overall impact on accomplishing a provided goal. Our contribution outlines key themes when selecting an agentic architecture, the impact of leadership on agent systems, agent communication styles, and key phases for planning, execution, and reflection that enable robust AI agent systems.
Submitted 17 April, 2024;
originally announced April 2024.
-
MOSAIC: A Modular System for Assistive and Interactive Cooking
Authors:
Huaxiaoyue Wang,
Kushal Kedia,
Juntao Ren,
Rahma Abdullah,
Atiksh Bhardwaj,
Angela Chao,
Kelly Y Chen,
Nathaniel Chin,
Prithwish Dan,
Xinyi Fan,
Gonzalo Gonzalez-Pumariega,
Aditya Kompella,
Maximus Adrian Pace,
Yash Sharma,
Xiangwan Sun,
Neha Sunkara,
Sanjiban Choudhury
Abstract:
We present MOSAIC, a modular architecture for home robots to perform complex collaborative tasks, such as cooking with everyday users. MOSAIC tightly collaborates with humans, interacts with users using natural language, coordinates multiple robots, and manages an open vocabulary of everyday objects. At its core, MOSAIC employs modularity: it leverages multiple large-scale pre-trained models for general tasks like language and image recognition, while using streamlined modules designed for task-specific control. We extensively evaluate MOSAIC on 60 end-to-end trials where two robots collaborate with a human user to cook a combination of 6 recipes. We also extensively test individual modules with 180 episodes of visuomotor picking, 60 episodes of human motion forecasting, and 46 online user evaluations of the task planner. We show that MOSAIC is able to efficiently collaborate with humans by running the overall system end-to-end with a real human user, completing 68.3% (41/60) collaborative cooking trials of 6 different recipes with a subtask completion rate of 91.6%. Finally, we discuss the limitations of the current system and exciting open challenges in this domain. The project's website is at https://portal-cornell.github.io/MOSAIC/
Submitted 28 February, 2024;
originally announced February 2024.
-
Estimation and inference for causal spillover effects in egocentric-network randomized trials in the presence of network membership misclassification
Authors:
Ariel Chao,
Donna Spiegelman,
Ashley Buchanan,
Laura Forastiere
Abstract:
To leverage peer influence and increase population behavioral changes, behavioral interventions often rely on peer-based strategies. A common study design that assesses such strategies is the egocentric-network randomized trial (ENRT), in which those receiving the intervention are encouraged to disseminate information to their peers. The Average Spillover Effect (ASpE) measures the impact of the intervention on participants who do not receive it, but whose outcomes may be affected by others who do. The assessment of the ASpE relies on assumptions about, and correct measurement of, interference sets within which individuals may influence one another's outcomes. It can be challenging to properly specify interference sets, such as networks in ENRTs, and when mismeasured, intervention effects estimated by existing methods will be biased. In HIV prevention studies where social networks play an important role in disease transmission, correcting ASpE estimates for bias due to network misclassification is critical for accurately evaluating the full impact of interventions. We combined measurement error and causal inference methods to bias-correct the ASpE estimate for network misclassification in ENRTs, when surrogate networks are recorded in place of true ones, and validation data that relate the misclassified to the true networks are available. We investigated finite sample properties of our methods in an extensive simulation study, and illustrated our methods in the HIV Prevention Trials Network (HPTN) 037 study.
Submitted 3 October, 2023;
originally announced October 2023.
-
A Generative Approach for Image Registration of Visible-Thermal (VT) Cancer Faces
Authors:
Catherine Ordun,
Alexandra Cha,
Edward Raff,
Sanjay Purushotham,
Karen Kwok,
Mason Rule,
James Gulley
Abstract:
Since thermal imagery offers a unique modality to investigate pain, the U.S. National Institutes of Health (NIH) has collected a large and diverse set of cancer patient facial thermograms for AI-based pain research. However, differing camera capture angles between thermal and visible sensors have led to misalignment between Visible-Thermal (VT) images. We modernize the classic computer vision task of image registration by applying and modifying a generative alignment algorithm to register VT cancer faces, without the need for a reference or alignment parameters. By registering VT faces, we demonstrate that the quality of thermal images produced in the generative AI downstream task of Visible-to-Thermal (V2T) image translation improves significantly, by up to 52.5%, compared with no registration. Images in this paper have been approved by the NIH NCI for public dissemination.
Submitted 23 August, 2023;
originally announced August 2023.
-
Current-induced deterministic switching of van der Waals ferromagnet at room temperature
Authors:
Shivam N. Kajale,
Thanh Nguyen,
Corson A. Chao,
David C. Bono,
Artittaya Boonkird,
Mingda Li,
Deblina Sarkar
Abstract:
Recent discovery of emergent magnetism in van der Waals magnetic materials (vdWMM) has broadened the material space for developing spintronic devices for energy-efficient computation. While there has been appreciable progress in vdWMM discovery, with strong perpendicular magnetic anisotropy (PMA) and Curie temperatures exceeding room temperature, a solution for non-volatile, deterministic switching of vdWMMs at room temperature has been missing, limiting the prospects of their adoption into commercial spintronic devices. Here, we report the first demonstration of current-controlled non-volatile, deterministic magnetization switching in a vdW magnetic material at room temperature. We have achieved spin-orbit torque (SOT) switching of the PMA vdW magnet Fe3GaTe2 using a Pt spin-Hall layer up to 320 K, with a threshold switching current density as low as $J_{sw} = 1.69\times10^6 A/cm^2$ at room temperature. We have also quantitatively estimated the anti-damping-like SOT efficiency of our Fe3GaTe2/Pt bilayer system to be $ξ_{DL}$ = 0.093, using second harmonic Hall voltage measurement technique. These results mark a crucial step in making vdW magnetic materials a viable choice for the development of scalable, future spintronic devices.
Submitted 25 June, 2023;
originally announced June 2023.
-
A Benchmark on Uncertainty Quantification for Deep Learning Prognostics
Authors:
Luis Basora,
Arthur Viens,
Manuel Arias Chao,
Xavier Olive
Abstract:
Reliable uncertainty quantification on RUL prediction is crucial for informative decision-making in predictive maintenance. In this context, we assess some of the latest developments in the field of uncertainty quantification for prognostics deep learning. This includes the state-of-the-art variational inference algorithms for Bayesian neural networks (BNN) as well as popular alternatives such as Monte Carlo Dropout (MCD), deep ensembles (DE) and heteroscedastic neural networks (HNN). All the inference techniques share the same inception deep learning architecture as a functional model. We performed hyperparameter search to optimize the main variational and learning parameters of the algorithms. The performance of the methods is evaluated on a subset of the large NASA NCMAPSS dataset for aircraft engines. The assessment includes RUL prediction accuracy, the quality of predictive uncertainty, and the possibility to break down the total predictive uncertainty into its aleatoric and epistemic parts. The results show no method clearly outperforms the others in all the situations. Although all methods are close in terms of accuracy, we find differences in the way they estimate uncertainty. Thus, DE and MCD generally provide more conservative predictive uncertainty than BNN. Surprisingly, HNN can achieve strong results without the added training complexity and extra parameters of the BNN. For tasks like active learning where a separation of epistemic and aleatoric uncertainty is required, radial BNN and MCD seem the best options.
Submitted 9 February, 2023;
originally announced February 2023.
-
SynthA1c: Towards Clinically Interpretable Patient Representations for Diabetes Risk Stratification
Authors:
Michael S. Yao,
Allison Chae,
Matthew T. MacLean,
Anurag Verma,
Jeffrey Duda,
James Gee,
Drew A. Torigian,
Daniel Rader,
Charles Kahn,
Walter R. Witschey,
Hersh Sagreiya
Abstract:
Early diagnosis of Type 2 Diabetes Mellitus (T2DM) is crucial to enable timely therapeutic interventions and lifestyle modifications. As the time available for clinical office visits shortens and medical imaging data become more widely available, patient image data could be used to opportunistically identify patients for additional T2DM diagnostic workup by physicians. We investigated whether image-derived phenotypic data could be leveraged in tabular learning classifier models to predict T2DM risk in an automated fashion to flag high-risk patients without the need for additional blood laboratory measurements. In contrast to traditional binary classifiers, we leverage neural networks and decision tree models to represent patient data as 'SynthA1c' latent variables, which mimic blood hemoglobin A1c empirical lab measurements, that achieve sensitivities as high as 87.6%. To evaluate how SynthA1c models may generalize to other patient populations, we introduce a novel generalizable metric that uses vanilla data augmentation techniques to predict model performance on input out-of-domain covariates. We show that image-derived phenotypes and physical examination data together can accurately predict diabetes risk as a means of opportunistic risk stratification enabled by artificial intelligence and medical imaging. Our code is available at https://github.com/allisonjchae/DMT2RiskAssessment.
Submitted 27 July, 2023; v1 submitted 20 September, 2022;
originally announced September 2022.
-
Intelligent Sight and Sound: A Chronic Cancer Pain Dataset
Authors:
Catherine Ordun,
Alexandra N. Cha,
Edward Raff,
Byron Gaskin,
Alex Hanson,
Mason Rule,
Sanjay Purushotham,
James L. Gulley
Abstract:
Cancer patients experience high rates of chronic pain throughout the treatment process. Assessing pain for this patient population is a vital component of psychological and functional well-being, as it can cause a rapid deterioration of quality of life. Existing work in facial pain detection often has deficiencies in labeling or methodology that prevent it from being clinically relevant. This paper introduces the first chronic cancer pain dataset, collected as part of the Intelligent Sight and Sound (ISS) clinical trial, guided by clinicians to help ensure that model findings yield clinically relevant results. The data collected to date consists of 29 patients, 509 smartphone videos, 189,999 frames, and self-reported affective and activity pain scores adopted from the Brief Pain Inventory (BPI). Using static images and multi-modal data to predict self-reported pain levels, early models show significant gaps between current methods available to predict pain today, with room for improvement. Due to the especially sensitive nature of the inherent Personally Identifiable Information (PII) of facial images, the dataset will be released under the guidance and control of the National Institutes of Health (NIH).
Submitted 7 April, 2022;
originally announced April 2022.
-
Learning GraphQL Query Costs (Extended Version)
Authors:
Georgios Mavroudeas,
Guillaume Baudart,
Alan Cha,
Martin Hirzel,
Jim A. Laredo,
Malik Magdon-Ismail,
Louis Mandel,
Erik Wittern
Abstract:
GraphQL is a query language for APIs and a runtime for executing those queries, fetching the requested data from existing microservices, REST APIs, databases, or other sources. Its expressiveness and its flexibility have made it an attractive candidate for API providers in many industries, especially through the web. A major drawback to blindly servicing a client's query in GraphQL is that the cost of a query can be unexpectedly large, creating computation and resource overload for the provider, and API rate-limit overages and infrastructure overload for the client. To mitigate these drawbacks, it is necessary to efficiently estimate the cost of a query before executing it. Estimating query cost is challenging, because GraphQL queries have a nested structure, GraphQL APIs follow different design conventions, and the underlying data sources are hidden. Estimates based on worst-case static query analysis have had limited success because they tend to grossly overestimate cost. We propose a machine-learning approach to efficiently and accurately estimate the query cost. We also demonstrate the power of this approach by testing it on query-response data from publicly available commercial APIs. Our framework is efficient and predicts query costs with high accuracy, consistently outperforming the static analysis by a large margin.
Submitted 26 August, 2021; v1 submitted 25 August, 2021;
originally announced August 2021.
-
Uncertainty-aware Remaining Useful Life predictor
Authors:
Luca Biggio,
Alexander Wieland,
Manuel Arias Chao,
Iason Kastanis,
Olga Fink
Abstract:
Remaining Useful Life (RUL) estimation is the problem of inferring how long a certain industrial asset can be expected to operate within its defined specifications. Deploying successful RUL prediction methods in real-life applications is a prerequisite for the design of intelligent maintenance strategies with the potential of drastically reducing maintenance costs and machine downtimes. In light of their superior performance in a wide range of engineering fields, Machine Learning (ML) algorithms are natural candidates to tackle the challenges involved in the design of intelligent maintenance systems. In particular, given the potentially catastrophic consequences or substantial costs associated with maintenance decisions that are either too late or too early, it is desirable that ML algorithms provide uncertainty estimates alongside their predictions. However, standard data-driven methods used for uncertainty estimation in RUL problems do not scale well to large datasets or are not sufficiently expressive to model the high-dimensional mapping from raw sensor data to RUL estimates. In this work, we consider Deep Gaussian Processes (DGPs) as possible solutions to the aforementioned limitations. We perform a thorough evaluation and comparison of several variants of DGPs applied to RUL predictions. The performance of the algorithms is evaluated on the N-CMAPSS (New Commercial Modular Aero-Propulsion System Simulation) dataset from NASA for aircraft engines. The results show that the proposed methods are able to provide very accurate RUL predictions along with sensible uncertainty estimates, providing more reliable solutions for (safety-critical) real-life industrial applications.
Submitted 8 April, 2021;
originally announced April 2021.
-
TapNet: The Design, Training, Implementation, and Applications of a Multi-Task Learning CNN for Off-Screen Mobile Input
Authors:
Michael Xuelin Huang,
Yang Li,
Nazneen Nazneen,
Alexander Chao,
Shumin Zhai
Abstract:
To make off-screen interaction without specialized hardware practical, we investigate using deep learning methods to process the common built-in IMU sensor (accelerometers and gyroscopes) on mobile phones into a useful set of one-handed interaction events. We present the design, training, implementation and applications of TapNet, a multi-task network that detects tapping on the smartphone. With phone form factor as auxiliary information, TapNet can jointly learn from data across devices and simultaneously recognize multiple tap properties, including tap direction and tap location. We developed two datasets consisting of over 135K training samples, 38K testing samples, and 32 participants in total. Experimental evaluation demonstrated the effectiveness of the TapNet design and its significant improvement over the state of the art. Along with the datasets (https://sites.google.com/site/michaelxlhuang/datasets/tapnet-dataset) and extensive experiments, TapNet establishes a new technical foundation for off-screen mobile input.
Submitted 17 February, 2021;
originally announced February 2021.
-
Generative Interventions for Causal Learning
Authors:
Chengzhi Mao,
Augustine Cha,
Amogh Gupta,
Hao Wang,
Junfeng Yang,
Carl Vondrick
Abstract:
We introduce a framework for learning robust visual representations that generalize to new viewpoints, backgrounds, and scene contexts. Discriminative models often learn naturally occurring spurious correlations, which cause them to fail on images outside of the training distribution. In this paper, we show that we can steer generative models to manufacture interventions on features caused by confounding factors. Experiments, visualizations, and theoretical results show this method learns robust representations more consistent with the underlying causal relationships. Our approach improves performance on multiple datasets demanding out-of-distribution generalization, and we demonstrate state-of-the-art performance generalizing from ImageNet to ObjectNet dataset.
Submitted 27 March, 2021; v1 submitted 22 December, 2020;
originally announced December 2020.
-
A Principled Approach to GraphQL Query Cost Analysis
Authors:
Alan Cha,
Erik Wittern,
Guillaume Baudart,
James C. Davis,
Louis Mandel,
Jim A. Laredo
Abstract:
The landscape of web APIs is evolving to meet new client requirements and to facilitate how providers fulfill them. A recent web API model is GraphQL, which is both a query language and a runtime. Using GraphQL, client queries express the data they want to retrieve or mutate, and servers respond with exactly those data or changes. GraphQL's expressiveness is risky for service providers because clients can succinctly request stupendous amounts of data, and responding to overly complex queries can be costly or disrupt service availability. Recent empirical work has shown that many service providers are at risk. Using traditional API management methods is not sufficient, and practitioners lack principled means of estimating and measuring the cost of the GraphQL queries they receive. In this work, we present a linear-time GraphQL query analysis that can measure the cost of a query without executing it. Our approach can be applied in a separate API management layer and used with arbitrary GraphQL backends. In contrast to existing static approaches, our analysis supports common GraphQL conventions that affect query cost, and our analysis is provably correct based on our formal specification of GraphQL semantics. We demonstrate the potential of our approach using a novel GraphQL query-response corpus for two commercial GraphQL APIs. Our query analysis consistently obtains upper cost bounds, tight enough relative to the true response sizes to be actionable for service providers. In contrast, existing static GraphQL query analyses exhibit over-estimates and under-estimates because they fail to support GraphQL conventions.
Submitted 11 September, 2020;
originally announced September 2020.
-
Dynamical Systems, Representation of Particle Beams
Authors:
Alex Chao
Abstract:
An overview of dynamical systems in accelerator physics is presented with a suggestion of a few issues to be addressed. Also mentioned are a few possible developments in the future. Technical details supporting the views are not presented.
Submitted 25 June, 2020;
originally announced June 2020.
-
Real-Time Model Calibration with Deep Reinforcement Learning
Authors:
Yuan Tian,
Manuel Arias Chao,
Chetan Kulkarni,
Kai Goebel,
Olga Fink
Abstract:
The dynamic, real-time, and accurate inference of model parameters from empirical data is of great importance in many scientific and engineering disciplines that use computational models (such as a digital twin) for the analysis and prediction of complex physical processes. However, fast and accurate inference for processes with large and high dimensional datasets cannot easily be achieved with state-of-the-art methods under noisy real-world conditions. The primary reason is that the inference of model parameters with traditional techniques based on optimisation or sampling often suffers from computational and statistical challenges, resulting in a trade-off between accuracy and deployment time. In this paper, we propose a novel framework for inference of model parameters based on reinforcement learning. The contribution of the paper is twofold: 1) We reformulate the inference problem as a tracking problem with the objective of learning a policy that forces the response of the physics-based model to follow the observations; 2) We propose the constrained Lyapunov-based actor-critic (CLAC) algorithm to enable the robust and accurate inference of physics-based model parameters in real time under noisy real-world conditions. The proposed methodology is demonstrated and evaluated on two model-based diagnostics test cases utilizing two different physics-based models of turbofan engines. The performance of the methodology is compared to that of two alternative approaches: a state update method (unscented Kalman filter) and a supervised end-to-end mapping with deep neural networks. The experimental results demonstrate that the proposed methodology outperforms all other tested methods in terms of speed and robustness, with high inference accuracy.
Submitted 9 June, 2020; v1 submitted 6 June, 2020;
originally announced June 2020.
-
Fusing Physics-based and Deep Learning Models for Prognostics
Authors:
Manuel Arias Chao,
Chetan Kulkarni,
Kai Goebel,
Olga Fink
Abstract:
Physics-based and data-driven models for remaining useful lifetime (RUL) prediction typically suffer from two major challenges that limit their applicability to complex real-world domains: (1) incompleteness of physics-based models and (2) limited representativeness of the training dataset for data-driven models. Combining the advantages of these two directions while overcoming some of their limitations, we propose a novel hybrid framework for fusing the information from physics-based performance models with deep learning algorithms for prognostics of complex safety-critical systems under real-world scenarios. In the proposed framework, we use physics-based performance models to infer unobservable model parameters related to the health of a system's components by solving a calibration problem. These parameters are subsequently combined with sensor readings and used as input to a deep neural network to generate a data-driven prognostics model with physics-augmented features. The performance of the hybrid framework is evaluated on an extensive case study comprising run-to-failure degradation trajectories from a fleet of nine turbofan engines under real flight conditions. The experimental results show that the hybrid framework outperforms purely data-driven approaches by extending the prediction horizon by nearly 127%. Furthermore, it requires less training data and is less sensitive to the limited representativeness of the dataset compared to purely data-driven approaches.
Submitted 27 October, 2020; v1 submitted 2 March, 2020;
originally announced March 2020.
-
Patient Specific Biomechanics Are Clinically Significant In Accurate Computer Aided Surgical Image Guidance
Authors:
Michael Barrow,
Alice Chao,
Qizhi He,
Sonia Ramamoorthy,
Claude Sirlin,
Ryan Kastner
Abstract:
Augmented Reality is used in Image Guided surgery (AR IG) to fuse surgical landmarks from preoperative images into a video overlay. Physical simulation is essential to maintaining accurate position of the landmarks as surgery progresses and ensuring patient safety by avoiding accidental damage to vessels etc. In liver procedures, AR IG simulation accuracy is hampered by an inability to model stiffness variations unique to the patient's disease. We introduce a novel method to account for patient-specific stiffness variation based on Magnetic Resonance Elastography (MRE) data. To the best of our knowledge, we are the first to demonstrate the use of in-vivo biomechanical data for AR IG landmark placement. In this early work, a comparative evaluation of our MRE data-driven simulation and the traditional method shows clinically significant differences in accuracy during landmark placement and motivates further animal model trials.
Submitted 29 January, 2020;
originally announced January 2020.
-
Implicit supervision for fault detection and segmentation of emerging fault types with Deep Variational Autoencoders
Authors:
Manuel Arias Chao,
Bryan T. Adey,
Olga Fink
Abstract:
Data-driven fault diagnostics of safety-critical systems often faces the challenge of a complete lack of labeled data associated with faulty system conditions (i.e., fault types) at training time. Since an unknown number and nature of fault types can arise during deployment, data-driven fault diagnostics in this scenario is an open-set learning problem. Most of the algorithms for open-set diagnostics are one-class classification and unsupervised algorithms that do not leverage all the available labeled and unlabeled data in the learning algorithm. As a result, their fault detection and segmentation performance (i.e., identifying and separating faults of different types) are sub-optimal. With this work, we propose training a variational autoencoder (VAE) with labeled and unlabeled samples while inducing implicit supervision on the latent representation of the healthy conditions. This, together with a modified sampling process of VAE, creates a compact and informative latent representation that allows good detection and segmentation of unseen fault types using existing one-class and clustering algorithms. We refer to the proposed methodology as "knowledge induced variational autoencoder with adaptive sampling" (KIL-AdaVAE). The fault detection and segmentation capabilities of the proposed methodology are demonstrated in a new simulated case study using the Advanced Geared Turbofan 30000 (AGTF30) dynamical model under real flight conditions. In an extensive comparison, we demonstrate that the proposed method outperforms other learning strategies (supervised learning, supervised learning with embedding and semi-supervised learning) and deep learning algorithms, yielding significant performance improvements on fault detection and fault segmentation.
Submitted 29 September, 2020; v1 submitted 28 December, 2019;
originally announced December 2019.
-
Hybrid deep fault detection and isolation: Combining deep neural networks and system performance models
Authors:
Manuel Arias Chao,
Chetan Kulkarni,
Kai Goebel,
Olga Fink
Abstract:
With the increased availability of condition monitoring data and the increased complexity of explicit system physics-based models, the application of data-driven approaches for fault detection and isolation has recently grown. While detection accuracy of such approaches is generally good, their performance on fault isolation often suffers from the fact that fault conditions affect a large portion of the measured signals thereby masking the fault source. To overcome this limitation and enable a more accurate fault detection, we propose a hybrid approach combining physical performance models with deep learning algorithms. Unobserved process variables are inferred with a physics-based performance model to enhance the input space of a data-driven diagnostics model. To validate the effectiveness of the proposed method, we generate a condition monitoring dataset of an advanced gas turbine during flight conditions under healthy and four faulty operative conditions based on the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) dynamical model. We evaluate the performance of the proposed method in combination with two different deep learning algorithms: feed forward neural networks and Variational Autoencoders, both of which demonstrate a significant improvement when applied within the hybrid fault detection and diagnostics framework. The proposed method is able to outperform pure data-driven solutions, particularly for systems with a high variability of operating conditions. It provides superior results both for fault detection as well as for fault isolation. For fault isolation, it overcomes the smearing effect that is observed in pure data-driven approaches and enables a precise isolation of the affected signal. We also demonstrate that deep learning algorithms provide a better performance on fault detection compared to the traditional machine learning algorithms.
Submitted 28 December, 2019; v1 submitted 5 August, 2019;
originally announced August 2019.
-
An Empirical Study of GraphQL Schemas
Authors:
Erik Wittern,
Alan Cha,
James C. Davis,
Guillaume Baudart,
Louis Mandel
Abstract:
GraphQL is a query language for APIs and a runtime to execute queries. Using GraphQL queries, clients define precisely what data they wish to retrieve or mutate on a server, leading to fewer round trips and reduced response sizes. Although interest in GraphQL is on the rise, with increasing adoption at major organizations, little is known about what GraphQL interfaces look like in practice. This lack of knowledge makes it hard for providers to understand what practices promote idiomatic, easy-to-use APIs, and what pitfalls to avoid. To address this gap, we study the design of GraphQL interfaces in practice by analyzing their schemas - the descriptions of their exposed data types and the possible operations on the underlying data. We base our study on two novel corpuses of GraphQL schemas, one of 16 commercial GraphQL schemas and the other of 8,399 GraphQL schemas mined from GitHub projects. We make both corpuses available to other researchers. Using these corpuses, we characterize the size of schemas and their use of GraphQL features and assess the use of both prescribed and organic naming conventions. We also report that a majority of APIs are susceptible to denial of service through complex queries, posing real security risks previously discussed only in theory. We also assess ways in which GraphQL APIs attempt to address these concerns.
Submitted 30 July, 2019;
originally announced July 2019.
-
Generating GraphQL-Wrappers for REST(-like) APIs
Authors:
Erik Wittern,
Alan Cha,
Jim A. Laredo
Abstract:
GraphQL is a query language and thereupon-based paradigm for implementing web Application Programming Interfaces (APIs) for client-server interactions. Using GraphQL, clients define precise, nested data-requirements in typed queries, which are resolved by servers against (possibly multiple) backend systems, like databases, object storages, or other APIs. Clients receive only the data they care about, in a single request. However, providers of existing REST(-like) APIs need to implement additional GraphQL interfaces to enable these advantages. We here assess the feasibility of automatically generating GraphQL wrappers for existing REST(-like) APIs. A wrapper, upon receiving GraphQL queries, translates them to requests against the target API. We discuss the challenges for creating such wrappers, including dealing with data sanitation, authentication, or handling nested queries. We furthermore present a prototypical implementation of OASGraph. OASGraph takes as input an OpenAPI Specification (OAS) describing an existing REST(-like) web API and generates a GraphQL wrapper for it. We evaluate OASGraph by running it, as well as an existing open source alternative, against 959 publicly available OAS. This experiment shows that OASGraph outperforms the existing alternative and is able to create a GraphQL wrapper for 89.5% of the APIs -- however, with limitations in many cases. A subsequent analysis of errors and warnings produced by OASGraph shows that missing or ambiguous information in the assessed OAS hinders creating complete wrappers. Finally, we present a use case of the IBM Watson Language Translator API that shows that small changes to an OAS allow OASGraph to generate more idiomatic and more expressive GraphQL wrappers.
Submitted 21 September, 2018;
originally announced September 2018.
-
The feasibility of automated identification of six algae types using neural networks and fluorescence-based spectral-morphological features
Authors:
Jason L. Deglint,
Chao Jin,
Angela Chao,
Alexander Wong
Abstract:
Harmful algae blooms (HABs), which produce lethal toxins, are a growing global concern since they negatively affect the quality of drinking water and have a major negative impact on wildlife, the fishing industry, as well as tourism and recreational water use. In this study, we investigate the feasibility of leveraging machine learning and fluorescence-based spectral-morphological features to enable the identification of six different algae types in an automated fashion. More specifically, a custom multi-band fluorescence imaging microscope is used to capture fluorescence imaging data of a water sample at six different excitation wavelengths ranging from 405 nm to 530 nm. A number of morphological and spectral fluorescence features are then extracted from the isolated micro-organism imaging data, and used to train neural network classification models designed for the purpose of identification of the six algae types given an isolated micro-organism. Experimental results using three different neural network classification models showed that the use of either fluorescence-based spectral features or fluorescence-based spectral-morphological features to train neural network classification models led to statistically significant improvements in identification accuracy when compared to the use of morphological features (with average identification accuracies of 95.7% ± 3.5% and 96.1% ± 1.5%, respectively). These preliminary results are quite promising, given that the identification accuracy of human taxonomists is typically between 67% and 83%, and thus illustrate the feasibility of leveraging machine learning and fluorescence-based spectral-morphological features as a viable method for automated identification of different algae types.
Submitted 2 May, 2018;
originally announced May 2018.
-
Accelerator Based Fusion Reactor
Authors:
Keh-Fei Liu,
Alexander Wu Chao
Abstract:
A feasibility study of fusion reactors based on accelerators is carried out. We consider a novel scheme where a beam from the accelerator hits the target plasma on the resonance of the fusion reaction and establish characteristic criteria for a workable reactor. We consider the reactions $d + t \rightarrow n + α$, $d + {}^3{\rm He} \rightarrow p + α$, and $p + {}^{11}{\rm B} \rightarrow 3α$ in this study. The critical temperature of the plasma is determined from overcoming the stopping power of the beam with the fusion energy gain. The needed plasma lifetime is determined from the width of the resonance, the beam velocity and the plasma density. We estimate the critical beam flux by balancing the energy of fusion production against the plasma thermo-energy and the loss due to stopping power for the case of an inert plasma. The product of critical flux and plasma lifetime is independent of plasma density and has a weak dependence on temperature. Even though the critical temperatures for these reactions are lower than those for the thermonuclear reactors, the critical flux is in the range of $10^{22} - 10^{24}/\rm{cm^2/s}$ for the plasma density $ρ_t = 10^{15}/{\rm cm^3}$ in the case of an inert plasma. Several approaches to control the growth of the two-stream instability are discussed. We have also considered several scenarios for practical implementation which will require further studies. Finally, we consider the case where the injected beam at the resonance energy maintains the plasma temperature and prolongs its lifetime to reach a steady state. The equations for power balance and particle number conservation are given for this case.
Submitted 10 July, 2017;
originally announced July 2017.
-
A Feasibility Study of an e+e- Ring Collider for Higgs Factory
Authors:
Yunhai Cai,
Alex Chao,
Yuri Nosochkov,
Uli Wienands,
Frank Zimmermann
Abstract:
A ring-based Higgs factory with a center-of-mass energy of 240 GeV residing in the existing LEP tunnel is studied at the level of a concrete lattice. We found that a low-emittance lattice is essential to mitigate the effect of beamstrahlung on the beam lifetime. To achieve a luminosity of $1.0\times10^{34}{\rm cm}^{-2}{\rm s}^{-1}$, we simplified the final focusing system and improved its momentum aperture by a factor of 5 to 2.5%.
Submitted 10 September, 2013;
originally announced September 2013.
-
Report of the ICFA Beam Dynamics Workshop 'Accelerators for a Higgs Factory: Linear vs. Circular' (HF2012)
Authors:
Alain Blondel,
Alex Chao,
Weiren Chou,
Jie Gao,
Daniel Schulte,
Kaoru Yokoya
Abstract:
This paper is a summary report of the ICFA Beam Dynamics Workshop 'Accelerators for a Higgs Factory: Linear vs. Circular' (HF2012). It discusses four types of accelerators as possible candidates for a Higgs factory: linear e+e- colliders, circular e+e- colliders, muon collider and photon colliders. The comparison includes: physics reach, performance (energy and luminosity), upgrade potential, technology maturity and readiness, and technical challenges requiring further R&D.
Submitted 15 February, 2013; v1 submitted 14 February, 2013;
originally announced February 2013.
-
Status of the Super-B factory Design
Authors:
W. Wittmer,
K. Bertsche,
A. Chao,
A. Novokhatski,
Y. Nosochkov,
J. Seeman,
M. K. Sullivan,
U. Wienands,
S. Weathersby,
A. V. Bogomyagkov,
E. Levichev,
S. Nikitin,
P. Piminov,
D. Shatilov,
S. Sinyatkin,
P. Vobly,
I. N. Okunev,
B. Bolzon,
L. Brunetti,
A. Jeremie,
M. E. Biagini,
R. Boni,
M. Boscolo,
T. Demma,
A. Drago
, et al. (20 additional authors not shown)
Abstract:
The SuperB international team continues to optimize the design of an electron-positron collider, which will allow the enhanced study of the origins of flavor physics. The project combines the best features of a linear collider (high single-collision luminosity) and a storage-ring collider (high repetition rate), bringing together all accelerator physics aspects to make a very high luminosity of 10$^{36}$ cm$^{-2}$ sec$^{-1}$. This asymmetric-energy collider with a polarized electron beam will produce hundreds of millions of B-mesons at the $Υ$(4S) resonance. The present design is based on extremely low emittance beams colliding at a large Piwinski angle to allow very low $β_y^\star$ without the need for ultra short bunches. Use of crab-waist sextupoles will enhance the luminosity, suppressing dangerous resonances and allowing for a higher beam-beam parameter. The project has flexible beam parameters, improved dynamic aperture, and spin-rotators in the Low Energy Ring for longitudinal polarization of the electron beam at the Interaction Point. Optimized for best colliding-beam performance, the facility may also provide high-brightness photon beams for synchrotron radiation applications.
Submitted 9 October, 2011;
originally announced October 2011.
-
Wide spin resonance with an rf-bunched proton beam
Authors:
V. S. Morozov,
A. W. Chao,
A. D. Krisch,
M. A. Leonova,
J. Liu,
R. S. Raymond,
D. W. Sivers,
V. K. Wong,
A. M. Kondratenko
Abstract:
We recently used an rf solenoid to study the widths of rf spin resonances with both unbunched and bunched beams of 2.1 GeV/c polarized protons stored in the COSY synchrotron. A map, with unbunched beam at different fixed rf-solenoid frequencies, showed a very shallow possible depolarization dip at the resonance. Next we made frequency sweeps of 400 Hz, centered at similar frequencies, which greatly enhanced the dip. But, with a bunched proton beam, both the fixed-frequency and frequency-sweep techniques produced similar maps, and both bunched maps showed full beam depolarization over a wide region. Moreover, both were more than twice as wide as the unbunched dip. This widening of the proton resonance due to bunching is exactly opposite to the recently observed narrowing of deuteron resonances due to bunching.
Submitted 9 January, 2010;
originally announced January 2010.
-
Collective deceleration: toward a compact beam dump
Authors:
H. -C. Wu,
T. Tajima,
D. Habs,
A. W. Chao,
J. Meyer-ter-Vehn
Abstract:
With the increasing development of laser accelerators, the electron energy is already beyond GeV and will be even higher in the near future. A conventional beam dump based on ionization or radiation loss mechanisms is cumbersome and costly, and also poses radiological hazards. We revisit the stopping power of high-energy charged particles in matter and discuss the associated problem of beam dump from the point of view of collective deceleration. The collective stopping length in an ionized gas can be several orders of magnitude shorter than the Bethe-Bloch and multiple electromagnetic cascades' stopping length in a solid. At the same time, the tenuous density of the gas makes the radioactivation negligible. Such a compact and non-radioactivating beam dump works well for short and dense bunches, which are typically generated by laser wakefield accelerators.
Submitted 10 December, 2009; v1 submitted 8 September, 2009;
originally announced September 2009.
-
Diaphragm as an anatomic surrogate for lung tumor motion
Authors:
Laura I. Cervino,
Alvin K. Y. Chao,
Ajay Sandhu,
Steve B. Jiang
Abstract:
Lung tumor motion due to respiration poses a challenge in the application of modern three-dimensional conformal radiotherapy. Direct tracking of the lung tumor during radiation therapy is very difficult without implanted fiducial markers. Indirect tracking relies on the correlation of the tumor's motion and the surrogate's motion. This paper presents an analysis of the correlation between the tumor motion and the diaphragm motion in order to evaluate the potential use of the diaphragm as a surrogate for tumor motion. We have analyzed the correlation between diaphragm motion and superior-inferior lung tumor motion in 32 fluoroscopic image sequences from 10 lung cancer patients. A simple linear model and a more complex linear model that accounts for phase delays between the two motions have been used. Results show that the diaphragm is a good surrogate for tumor motion prediction for most patients, resulting in an average correlation factor of 0.94 and 0.98 with each model respectively. The model that accounts for delays leads to an average localization prediction error of 0.8 mm and an error at the 95% confidence level of 2.1 mm. However, for one patient studied, the correlation is much weaker compared to other patients. This indicates that, before using the diaphragm for lung tumor prediction, the correlation should be examined on a patient-by-patient basis.
Submitted 20 April, 2009;
originally announced April 2009.
-
Spin resonance strengths due to rf solenoids and dipoles for stored deuteron beams
Authors:
M. A. Leonova,
A. W. Chao,
E. D. Courant,
A. D. Krisch,
V. S. Morozov,
R. S. Raymond,
D. W. Sivers,
J. M. Williams,
V. K. Wong,
A. Garishvili,
R. Gebel,
A. Lehrach,
B. Lorentz,
R. Maier,
D. Prasuhn,
H. Stockhorst,
D. Welsch,
F. Hinterberger,
K. Ulbrich,
Ya. S. Derbenev,
A. M. Kondratenko,
Y. F. Orlov,
E. J. Stephenson,
N. P. M. Brantjes,
C. J. G. Onderwater
, et al. (1 additional authors not shown)
Abstract:
This submission was withdrawn because of an unresolved dispute between the authors [arXiv admin 2009-4-13].
Submitted 13 April, 2009; v1 submitted 16 January, 2009;
originally announced January 2009.
-
Opportunities for TeV Laser Acceleration
Authors:
M. Kando,
H. Kiriyama,
J. K. Koga,
S. Bulanov,
A. W. Chao,
T. Esirkepov,
R. Hajima,
T. Tajima
Abstract:
A set of ballpark parameters for laser, plasma, and accelerator technologies that define for electron energies reaching as high as TeV are identified. These ballpark parameters are carved out from the fundamental scaling laws that govern laser acceleration, theoretically suggested and experimentally explored over a wide range in recent years. In the density regime on the order of 10^{16} cm^{-3}, the appropriate laser technology, we find, matches well with that of a highly efficient high fluence LD driven Yb ceramic laser. Further, the collective acceleration technique applies to compactify the beam stoppage stage by adopting the beam-plasma wave deceleration, which contributes to significantly enhancing the stopping power and energy recovery capability of the beam. Thus we find the confluence of the needed laser acceleration parameters dictated by these scaling laws and the emerging laser technology. This may herald a new technology in the ultrahigh energy frontier.
Submitted 29 April, 2008;
originally announced April 2008.
-
On the production of flat electron bunches for laser wake field acceleration
Authors:
M. Kando,
Y. Fukuda,
H. Kotaki,
J. Koga,
S. V. Bulanov,
T. Tajima,
A. Chao,
R. Pitthan,
K. -P. Schuler,
A. G. Zhidkov,
K. Nemoto
Abstract:
We suggest a novel method for injection of electrons into the acceleration phase of particle accelerators, producing low emittance beams appropriate even for the demanding high energy Linear Collider specifications. In this paper we work out the injection into the acceleration phase of the wake field in a plasma behind a high intensity laser pulse, taking advantage of the laser polarization and focusing. With the aid of catastrophe theory we categorize the injection dynamics. The scheme uses the structurally stable regime of transverse wake wave breaking, when electron trajectory self-intersection leads to the formation of a flat electron bunch. As shown in three-dimensional particle-in-cell simulations of the interaction of a laser pulse in a line-focus with an underdense plasma, the electrons, injected via the transverse wake wave breaking and accelerated by the wake wave, perform betatron oscillations with different amplitudes and frequencies along the two transverse coordinates. The polarization and focusing geometry lead to a way to produce relativistic electron bunches with asymmetric emittance (flat beam). An approach for generating flat laser accelerated ion beams is briefly discussed.
Submitted 7 June, 2006;
originally announced June 2006.
-
Spin-Hall effect on edge magnetization and electric conductance of a 2D semiconductor strip
Authors:
A. G. Mal'shukov,
L. Y. Wang,
C. S. Chu,
K. A. Chao
Abstract:
The intrinsic spin-Hall effect on spin accumulation and electric conductance in a diffusive regime of a 2D electron gas has been studied for a 2D strip of a finite width. It is shown that the spin polarization near the flanks of the strip and the electric current in the longitudinal direction exhibit damped oscillations as a function of the width and strength of the Dresselhaus spin-orbit interaction. Cubic terms of this interaction are crucial for spin accumulation near the edges. As expected, no effect on the spin accumulation and electric conductance has been found in the case of Rashba spin-orbit interaction.
Submitted 7 December, 2005; v1 submitted 28 June, 2005;
originally announced June 2005.
-
Strain-Induced Coupling of Spin Current to Nanomechanical Oscillations
Authors:
A. G. Mal'shukov,
C. S. Tang,
C. S. Chu,
K. A. Chao
Abstract:
We propose a setup that couples the electron spin degree of freedom to the mechanical motions of a nanomechanical system without involving any ferromagnetic components. The proposed method employs the strain-induced spin-orbit interaction of electrons in narrow gap semiconductors. We have shown how this method can be used for detection and manipulation of the spin flow through a suspended rod in a nanomechanical device.
Submitted 8 September, 2005; v1 submitted 29 April, 2005;
originally announced April 2005.
-
Generation of spin current and polarization under dynamic gate control of spin-orbit interaction in low-dimensional semiconductor systems
Authors:
C. S. Tang,
A. G. Mal'shukov,
K. A. Chao
Abstract:
Based on the Keldysh formalism, the Boltzmann kinetic equation and the drift diffusion equation have been derived for studying spin polarization flow and spin accumulation under the effect of the time dependent Rashba spin-orbit interaction in a semiconductor quantum well. The time dependent Rashba interaction is provided by time dependent electric gates of appropriate shapes. Several examples of spin manipulation by gates have been considered. Mechanisms and conditions for obtaining the stationary spin density and the induced rectified DC spin current are studied.
Submitted 22 March, 2005; v1 submitted 8 December, 2004;
originally announced December 2004.
-
Spin-Hall conductivity of a disordered 2D electron gas with Dresselhaus spin-orbit interaction
Authors:
A. G. Mal'shukov,
K. A. Chao
Abstract:
The spin-Hall conductivity of a disordered 2D electron gas has been calculated for an arbitrary spin-orbit interaction. We have found that in the diffusive regime of electron transport, in accordance with previous calculations, the dc spin-Hall conductivity of a homogeneous system turns to zero due to impurity scattering when the spin-orbit coupling is represented only by the Rashba interaction. However, when the Dresselhaus interaction is taken into account, the spin-Hall current is not zero. We also considered the spin-Hall currents induced by an inhomogeneous electric field. It is shown that a time dependent electric charge induces a vortex of spin-Hall currents.
Submitted 6 April, 2005; v1 submitted 24 October, 2004;
originally announced October 2004.
-
Synchro-Betatron Stop-Bands due to a Single Crab Cavity
Authors:
Georg H. Hoffstaetter,
Alexander W. Chao
Abstract:
We analyze the stop-band due to crab cavities for horizontal tunes that are either close to integers or close to half integers. The latter case is relevant for today's electron/positron colliders. We compare this stop-band to that created by dispersion in an accelerating cavity and show that a single typical crab cavity creates larger stop-bands than a typical dispersion at an accelerating cavity.
We furthermore analyze whether it is beneficial to place the crab cavity at a position where the dispersion and its slope vanish. We find that this choice is worthwhile if the horizontal tune is close to a half integer, but not if it is close to an integer. Furthermore we find that stop-bands can be avoided when the horizontal tune is located at a favorable side of the integer or the half integer.
While we are here concerned with the installation of a single crab cavity in a storage ring, we show that the stop-bands can be weakened significantly, although not eliminated, when two crab cavities per ring are chosen suitably.
Submitted 20 May, 2004;
originally announced May 2004.
-
Spin relaxation dynamics of quasiclassical electrons in ballistic quantum dots with strong spin-orbit coupling
Authors:
Cheng-Hung Chang,
A. G. Mal'shukov,
K. A. Chao
Abstract:
We performed path integral simulations of spin evolution controlled by the Rashba spin-orbit interaction in the semiclassical regime for chaotic and regular quantum dots. The spin polarization dynamics have been found to be strikingly different from the D'yakonov-Perel' (DP) spin relaxation in bulk systems. An important distinction has also been found between long time spin evolutions in classically chaotic and regular systems. In the former case the spin polarization relaxes to zero within a relaxation time much larger than the DP relaxation time, while in the latter case it evolves to a time independent residual value. The quantum mechanical analysis of the spin evolution based on the exact solution of the Schroedinger equation with Rashba SOI has confirmed the results of the classical simulations for the circular dot, a result expected to be valid in general regular systems. In contrast, the spin relaxation down to zero in chaotic dots contradicts what is expected from quantum mechanics. This signals the importance at long times of the mesoscopic echo effect, which is missed in the semiclassical simulations.
Submitted 11 May, 2004;
originally announced May 2004.
-
Spin Current Generation and Detection in the Presence of AC Gate
Authors:
A. G. Mal'shukov,
C. S. Tang,
C. S. Chu,
K. A. Chao
Abstract:
We predict that in a narrow gap III-V semiconductor quantum well or a wire an observable spin current can be generated with a time dependent gate to modify the Rashba spin-orbit coupling constant. Methods to rectify the so generated AC current are discussed. An all-electric method of spin current detection is suggested, which measures the voltage on the gate in the vicinity of a 2D electron gas carrying a time dependent spin current. Neither the generation nor the detection involves any optical or magnetic mediators.
Submitted 6 November, 2003; v1 submitted 25 November, 2002;
originally announced November 2002.
-
Quantum oscillations of spin current through a III-V semiconductor loop
Authors:
A. G. Mal'shukov,
V. Shlyapin,
K. A. Chao
Abstract:
We have investigated the transport of spin polarization through a classically chaotic semiconductor loop with a strong Rashba spin-orbit interaction. We found that if the escape time of a particle is long enough, the configuration averaged spin conductance oscillates strongly with the geometric spin phase. We predict a sizable rotation of spin polarization along its flowing path across the loop from the injector to the collector. We have also discovered a quantized universal spin relaxation in a 2D reservoir connected to such a semiconductor loop.
Submitted 13 December, 2001;
originally announced December 2001.
-
Optoelectric spin injection in semiconductor heterostructures without ferromagnet
Authors:
A. G. Mal'shukov,
K. A. Chao
Abstract:
We have shown that electron spin density can be generated by a dc current flowing across a $pn$ junction with an embedded asymmetric quantum well. Spin polarization is created in the quantum well by radiative electron-hole recombination when the conduction electron momentum distribution is shifted with respect to the momentum distribution of holes in the spin split valence subbands. Spin current appears when the spin polarization is injected from the quantum well into the $n$-doped region of the $pn$ junction. The accompanying emission of circularly polarized light from the quantum well can serve as a spin polarization detector.
Submitted 21 August, 2001;
originally announced August 2001.
-
Nonlinear $δf$ Method for Beam-Beam Simulation
Authors:
Yunhai Cai,
Alexander W. Chao,
Stephan I. Tzenov,
Toshi Tajima
Abstract:
We have developed an efficacious algorithm for simulation of the beam-beam interaction in synchrotron colliders based on the nonlinear $δf$ method, where $δf$ is the much smaller deviation of the beam distribution from the slowly evolving main distribution $f_0$. In the presence of damping and quantum fluctuations of synchrotron radiation it has been shown that the slowly evolving part of the distribution function satisfies a Fokker-Planck equation. Its solution has been obtained in terms of a beam envelope function and an amplitude of the distribution, which satisfy a coupled system of ordinary differential equations. A numerical algorithm suited for direct code implementation of the evolving distributions for both $δf$ and $f_0$ has been developed. Explicit expressions for the dynamical weights of macro-particles for $δf$ as well as an expression for the slowly changing $f_0$ have been obtained.
Submitted 23 October, 2000;
originally announced October 2000.
-
Simulation of the Beam-Beam Effects in $e^+e^-$ Storage Rings with a Method of Reducing the Region of Mesh
Authors:
Yunhai Cai,
Alex W. Chao,
Stephan I. Tzenov,
Toshi Tajima
Abstract:
A highly accurate self-consistent particle code to simulate the beam-beam collision in $e^+e^-$ storage rings has been developed. It adopts a method of solving the Poisson equation with an open boundary. The method consists of two steps: assigning the potential on a finite boundary using the Green's function, and then solving for the potential inside the boundary with a fast Poisson solver. Since the solution of the Poisson equation is unique, our solution is exactly the same as the one obtained by simply using the Green's function. The method allows us to select a much smaller mesh region and therefore increase the resolution of the solver. The better resolution makes the calculation of the dynamics in the core of the beams more accurate. The luminosity simulated with this method agrees quantitatively with the measurement for the PEP-II B-factory ring in the linear and nonlinear beam current regimes, demonstrating its predictive capability in detail.
Submitted 30 August, 2000;
originally announced August 2000.
-
Optical transitions in broken gap heterostructures
Authors:
E. Halvorsen,
Y. Galperin,
K. A. Chao
Abstract:
We have used an eight band model to investigate the electronic structures and to calculate the optical matrix elements of InAs-GaSb broken gap semiconductor heterostructures. The unusual hybridization of the conduction band states in InAs layers with the valence band states in GaSb layers has been analyzed in detail. We have studied the dependence of optical matrix elements on the degree of conduction-valence hybridization, the tuning of hybridization by varying the width of the GaSb layers and/or InAs layers, and the sensitivity of quantized levels to this tuning. Large spin-orbit splitting in energy bands has been demonstrated. Our calculation can serve as a theoretical model for infrared lasers based on broken gap quantum well heterostructures.
Submitted 14 March, 2000;
originally announced March 2000.
-
The Local Interstellar Medium in Puppis-Vela
Authors:
A. Cha,
M. Sahu,
H. W. Moos,
A. Blaauw
Abstract:
The first study of the local interstellar medium (LISM) toward Puppis-Vela (l = 245\degr to 275\degr, b = -15\degr to +5\degr, d < 200 pc) is presented in this paper. A study of the locations, sizes, and physical characteristics of local interstellar gas, i.e. ``astronephography,'' is included, and relies upon the improved distance measurements provided by Hipparcos parallax measurements. All spectra of more distant sight lines contain absorption features due to intervening local gas, and more distant structures can only be studied accurately if components due to the LISM have been isolated. Towards this end, high resolution (R ~ 95,000), high signal-to-noise (S/N ~ 110 to 250) Na I spectra of 11 nearby stars in the direction of Puppis-Vela have been obtained. Toward Puppis-Vela, absorption due to the LIC was not observed, but components at three distinct velocities were found, and the extent of the local gas producing the features was estimated. The conclusions regarding the UV spectrum of gamma^2 Vel presented by Fitzpatrick & Spitzer (1994) were re-examined in light of this new LISM data, and the ambiguity in their conclusions about several absorption components is resolved. The stars in Puppis-Vela flank the region of the apparent extension of the Local Bubble (or Cavity) known as the beta CMa tunnel, and measurements of the Na I column density towards the sample stars have been used to modify existing estimates of the extent of the tunnel. A compilation of all existing Na I observations of < 200 pc sight lines around the tunnel reveals that low column densities have been exclusively detected within l ~ 210\degr to 250\degr, and b ~ -21\degr to -9\degr. Near the Galactic plane, at latitudes -10\degr < b < 0\degr and d ~< 150 pc, the tunnel is confined to l < 270\degr, a lower longitude than was previously reported.
Submitted 28 January, 2000;
originally announced January 2000.
-
Waveguide diffusion modes and slowdown of D'yakonov-Perel' spin relaxation in narrow 2-D semiconductor channels
Authors:
A. G. Mal'shukov,
K. A. Chao
Abstract:
We have shown that in narrow 2D semiconductor channels the D'yakonov-Perel' spin relaxation rate is strongly reduced. This relaxation slowdown appears in special waveguide diffusion modes which determine the propagation of spin density in long channels. Experiments are suggested to detect the theoretically predicted effects. A possible application is a field effect transistor operated with injected spin current.
Submitted 18 October, 1999;
originally announced October 1999.
-
Spectroscopy and Time Variability of Absorption Lines in the Direction of the Vela Supernova Remnant
Authors:
Alexandra N. Cha,
Kenneth R. Sembach
Abstract:
We present high resolution (R~75,000), high signal-to-noise (S/N~100) Ca II $λ$3933.663 and Na I $λλ$5889.951, 5895.924 spectra of 68 stars in the direction of the Vela supernova remnant. The spectra comprise the most complete high resolution, high S/N, optical survey of early type stars in this region of the sky. A subset of the sight lines has been observed at multiple epochs, 1993/1994 and 1996. Of the thirteen stars observed twice, seven have spectra revealing changes in the equivalent width and/or velocity structure of lines, most of which arise from remnant gas. Such time variability has been reported previously for the sight lines towards HD 72089 and HD 72997 by Danks & Sembach (1995) and for HD 72127 by Hobbs et al. (1991). We have confirmed the ongoing time variability of these spectra and present new evidence of variability in the spectra of HD 73658, HD 74455, HD 75309 and HD 75821. We have tabulated Na I and Ca II absorption line information for the sight lines in our sample to serve as a benchmark for further investigations of the dynamics and evolution of the Vela SNR.
Submitted 7 September, 1999;
originally announced September 1999.
-
Circuit Effect On The Current-Voltage Characteristics Of Ultrasmall Tunnel Junctions
Authors:
X. H. Wang,
K. A. Chao
Abstract:
We have used the method of generating functional in imaginary time to derive the current-voltage characteristics of a tunnel junction with arbitrary tunneling conductance, connected in series with an external impedance and a voltage source. We have shown that via the renormalized charging energy and the renormalized environment conductance, our nonperturbative expressions of the total action can be mapped onto the corresponding perturbative formulas. This provides a straightforward way to go beyond the perturbation theory. For the impedance being a pure resistance, we have calculated the conductance for various voltages and temperatures, and the results agree very well with experiments.
Submitted 15 March, 1999;
originally announced March 1999.