-
Elicitron: An LLM Agent-Based Simulation Framework for Design Requirements Elicitation
Authors:
Mohammadmehdi Ataei,
Hyunmin Cheong,
Daniele Grandi,
Ye Wang,
Nigel Morris,
Alexander Tessier
Abstract:
Requirements elicitation, a critical, yet time-consuming and challenging step in product development, often fails to capture the full spectrum of user needs. This may lead to products that fall short of expectations. This paper introduces a novel framework that leverages Large Language Models (LLMs) to automate and enhance the requirements elicitation process. LLMs are used to generate a vast arra…
▽ More
Requirements elicitation, a critical, yet time-consuming and challenging step in product development, often fails to capture the full spectrum of user needs. This may lead to products that fall short of expectations. This paper introduces a novel framework that leverages Large Language Models (LLMs) to automate and enhance the requirements elicitation process. LLMs are used to generate a vast array of simulated users (LLM agents), enabling the exploration of a much broader range of user needs and unforeseen use cases. These agents engage in product experience scenarios, through explaining their actions, observations, and challenges. Subsequent agent interviews and analysis uncover valuable user needs, including latent ones. We validate our framework with three experiments. First, we explore different methodologies for diverse agent generation, discussing their advantages and shortcomings. We measure the diversity of identified user needs and demonstrate that context-aware agent generation leads to greater diversity. Second, we show how our framework effectively mimics empathic lead user interviews, identifying a greater number of latent needs than conventional human interviews. Third, we showcase that LLMs can be used to analyze interviews, capture needs, and classify them as latent or not. Our work highlights the potential of using LLM agents to accelerate early-stage product development, reduce costs, and increase innovation.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
The Equitable AI Research Roundtable (EARR): Towards Community-Based Decision Making in Responsible AI Development
Authors:
Jamila Smith-Loud,
Andrew Smart,
Darlene Neal,
Amber Ebinama,
Eric Corbett,
Paul Nicholas,
Qazi Rashid,
Anne Peckham,
Sarah Murphy-Gray,
Nicole Morris,
Elisha Smith Arrillaga,
Nicole-Marie Cotton,
Emnet Almedom,
Olivia Araiza,
Eliza McCullough,
Abbie Langston,
Christopher Nellum
Abstract:
This paper reports on our initial evaluation of The Equitable AI Research Roundtable -- a coalition of experts in law, education, community engagement, social justice, and technology. EARR was created in collaboration among a large tech firm, nonprofits, NGO research institutions, and universities to provide critical research based perspectives and feedback on technology's emergent ethical and soc…
▽ More
This paper reports on our initial evaluation of The Equitable AI Research Roundtable -- a coalition of experts in law, education, community engagement, social justice, and technology. EARR was created in collaboration among a large tech firm, nonprofits, NGO research institutions, and universities to provide critical research based perspectives and feedback on technology's emergent ethical and social harms. Through semi-structured workshops and discussions within the large tech firm, EARR has provided critical perspectives and feedback on how to conceptualize equity and vulnerability as they relate to AI technology. We outline three principles in practice of how EARR has operated thus far that are especially relevant to the concerns of the FAccT community: how EARR expands the scope of expertise in AI development, how it fosters opportunities for epistemic curiosity and responsibility, and that it creates a space for mutual learning. This paper serves as both an analysis and translation of lessons learned through this engagement approach, and the possibilities for future research.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Balanced and Robust Randomized Treatment Assignments: The Finite Selection Model for the Health Insurance Experiment and Beyond
Authors:
Ambarish Chattopadhyay,
Carl N. Morris,
Jose R. Zubizarreta
Abstract:
The Finite Selection Model (FSM) was developed by Carl Morris in the 1970s for the design of the RAND Health Insurance Experiment (HIE) (Morris 1979, Newhouse et al. 1993), one of the largest and most comprehensive social science experiments conducted in the U.S. The idea behind the FSM is that each treatment group takes its turns selecting units in a fair and random order to optimize a common ass…
▽ More
The Finite Selection Model (FSM) was developed by Carl Morris in the 1970s for the design of the RAND Health Insurance Experiment (HIE) (Morris 1979, Newhouse et al. 1993), one of the largest and most comprehensive social science experiments conducted in the U.S. The idea behind the FSM is that each treatment group takes its turns selecting units in a fair and random order to optimize a common assignment criterion. At each of its turns, a treatment group selects the available unit that maximally improves the combined quality of its resulting group of units in terms of the criterion. In the HIE and beyond, we revisit, formalize, and extend the FSM as a general tool for experimental design.
Leveraging the idea of D-optimality, we propose and analyze a new selection criterion in the FSM. The FSM using the D-optimal selection function has no tuning parameters, is affine invariant, and when appropriate, retrieves several classical designs such as randomized block and matched-pair designs. For multi-arm experiments, we propose algorithms to generate a fair and random selection order of treatments. We demonstrate FSM's performance in a case study based on the HIE and in ten randomized studies from the health and social sciences. On average, the FSM achieves 68% better covariate balance than complete randomization and 56% better covariate balance than rerandomization in a typical study. We recommend the FSM be considered in experimental design for its conceptual simplicity, efficiency, and robustness.
△ Less
Submitted 4 July, 2023; v1 submitted 19 May, 2022;
originally announced May 2022.
-
SolidGen: An Autoregressive Model for Direct B-rep Synthesis
Authors:
Pradeep Kumar Jayaraman,
Joseph G. Lambourne,
Nishkrit Desai,
Karl D. D. Willis,
Aditya Sanghi,
Nigel J. W. Morris
Abstract:
The Boundary representation (B-rep) format is the de-facto shape representation in computer-aided design (CAD) to model solid and sheet objects. Recent approaches to generating CAD models have focused on learning sketch-and-extrude modeling sequences that are executed by a solid modeling kernel in postprocess to recover a B-rep. In this paper we present a new approach that enables learning from an…
▽ More
The Boundary representation (B-rep) format is the de-facto shape representation in computer-aided design (CAD) to model solid and sheet objects. Recent approaches to generating CAD models have focused on learning sketch-and-extrude modeling sequences that are executed by a solid modeling kernel in postprocess to recover a B-rep. In this paper we present a new approach that enables learning from and synthesizing B-reps without the need for supervision through CAD modeling sequence data. Our method SolidGen, is an autoregressive neural network that models the B-rep directly by predicting the vertices, edges, and faces using Transformer-based and pointer neural networks. Key to achieving this is our Indexed Boundary Representation that references B-rep vertices, edges and faces in a well-defined hierarchy to capture the geometric and topological relations suitable for use with machine learning. SolidGen can be easily conditioned on contexts e.g., class labels, images, and voxels thanks to its probabilistic modeling of the B-rep distribution. We demonstrate qualitatively, quantitatively, and through perceptual evaluation by human subjects that SolidGen can produce high quality, realistic CAD models.
△ Less
Submitted 20 February, 2023; v1 submitted 25 March, 2022;
originally announced March 2022.
-
DeepTrack: Lightweight Deep Learning for Vehicle Path Prediction in Highways
Authors:
Vinit Katariya,
Mohammadreza Baharani,
Nichole Morris,
Omidreza Shoghli,
Hamed Tabkhi
Abstract:
Vehicle trajectory prediction is essential for enabling safety-critical intelligent transportation systems (ITS) applications used in management and operations. While there have been some promising advances in the field, there is a need for modern deep learning algorithms that allow real-time trajectory prediction on embedded IoT devices. This article presents DeepTrack, a novel deep learning algo…
▽ More
Vehicle trajectory prediction is essential for enabling safety-critical intelligent transportation systems (ITS) applications used in management and operations. While there have been some promising advances in the field, there is a need for modern deep learning algorithms that allow real-time trajectory prediction on embedded IoT devices. This article presents DeepTrack, a novel deep learning algorithm customized for real-time vehicle trajectory prediction and monitoring applications in arterial management, freeway management, traffic incident management, and work zone management for high-speed incoming traffic. In contrast to previous methods, the vehicle dynamics are encoded using Temporal Convolutional Networks (TCNs) to provide more robust time prediction with less computation. DeepTrack also uses depthwise convolution, which reduces the complexity of models compared to existing approaches in terms of model size and operations. Overall, our experimental results demonstrate that DeepTrack achieves comparable accuracy to state-of-the-art trajectory prediction models but with smaller model sizes and lower computational complexity, making it more suitable for real-world deployment.
△ Less
Submitted 25 May, 2022; v1 submitted 1 August, 2021;
originally announced August 2021.
-
Randomized and Balanced Allocation of Units into Treatment Groups Using the Finite Selection Model for R
Authors:
Ambarish Chattopadhyay,
Carl N. Morris,
Jose R. Zubizarreta
Abstract:
The original Finite Selection Model (FSM) was developed in the 1970s to enhance the design of the RAND Health Insurance Experiment (HIE; Newhouse et al. 1993). At the time of its development by Carl Morris (Morris 1979), there were fundamental computational limitations to make the method widely available for practitioners. Today, as randomized experiments increasingly become more common, there is…
▽ More
The original Finite Selection Model (FSM) was developed in the 1970s to enhance the design of the RAND Health Insurance Experiment (HIE; Newhouse et al. 1993). At the time of its development by Carl Morris (Morris 1979), there were fundamental computational limitations to make the method widely available for practitioners. Today, as randomized experiments increasingly become more common, there is a need for implementing experimental designs that are randomized, balanced, robust, and easily applicable to several treatment groups. To help address this problem, we revisit the original FSM under the potential outcome framework for causal inference and provide its first readily available software implementation. In this paper, we provide an introduction to the FSM and a step-by-step guide for its use in R.
△ Less
Submitted 5 May, 2021;
originally announced May 2021.
-
Longitudinal Mediation Analysis with Latent Growth Curves
Authors:
Adam J. Sullivan,
Douglas D. Gunzler,
Nathan Morris,
Tyler J. VanderWeele
Abstract:
The paper considers mediation analysis with longitudinal data under latent growth curve models within a counterfactual framework. Estimators and their standard errors are derived for natural direct and indirect effects when the mediator, the outcome, and possibly also the exposure can be modeled by an underlying latent variable giving rise to a growth curve. Settings are also considered in which t…
▽ More
The paper considers mediation analysis with longitudinal data under latent growth curve models within a counterfactual framework. Estimators and their standard errors are derived for natural direct and indirect effects when the mediator, the outcome, and possibly also the exposure can be modeled by an underlying latent variable giving rise to a growth curve. Settings are also considered in which the exposure is instead fixed at a single point in time.
△ Less
Submitted 9 March, 2021;
originally announced March 2021.
-
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep Learning Clusters
Authors:
Isabelly Rocha,
Nathaniel Morris,
Lydia Y. Chen,
Pascal Felber,
Robert Birke,
Valerio Schiavoni
Abstract:
DNN learning jobs are common in today's clusters due to the advances in AI driven services such as machine translation and image recognition. The most critical phase of these jobs for model performance and learning cost is the tuning of hyperparameters. Existing approaches make use of techniques such as early stop** criteria to reduce the tuning impact on learning cost. However, these strategies…
▽ More
DNN learning jobs are common in today's clusters due to the advances in AI driven services such as machine translation and image recognition. The most critical phase of these jobs for model performance and learning cost is the tuning of hyperparameters. Existing approaches make use of techniques such as early stop** criteria to reduce the tuning impact on learning cost. However, these strategies do not consider the impact that certain hyperparameters and systems parameters have on training time. This paper presents PipeTune, a framework for DNN learning jobs that addresses the trade-offs between these two types of parameters. PipeTune takes advantage of the high parallelism and recurring characteristics of such jobs to minimize the learning cost via a pipelined simultaneous tuning of both hyper and system parameters. Our experimental evaluation using three different types of workloads indicates that PipeTune achieves up to 22.6% reduction and 1.7x speed up on tuning and training time, respectively. PipeTune not only improves performance but also lowers energy consumption up to 29%.
△ Less
Submitted 2 October, 2020; v1 submitted 1 October, 2020;
originally announced October 2020.
-
Using Online Implicit Association Tests in Opinion Polling
Authors:
Alan Smeaton,
Hyowon Lee,
Niamh Morris,
David Hanley
Abstract:
Opinion polls have now become a very important component of society because they are now a defacto component of our daily news cycle and because their results influence governments and business in ways which are not always obvious to us. However, polling is not always accurate and there have been some really inaccurate polling results which have had major influences on the world going back to the…
▽ More
Opinion polls have now become a very important component of society because they are now a defacto component of our daily news cycle and because their results influence governments and business in ways which are not always obvious to us. However, polling is not always accurate and there have been some really inaccurate polling results which have had major influences on the world going back to the 1930s but also as recently as just the last 3 or 4 years. In this paper we analyse the phenomenon of socially desirable responding (shy voters) which has emerged as one of the reasons for modern day inaccurate polling. We describe how it can be exposed through implicit association tests (IATs) and we demonstrate the shy voter effect in a small survey on opinions in Ireland towards the United Kingdom. We argue for inclusion of IATs in traditional polling and point to the fact that these can be conducted accurately online, which also allows polling to reach a larger and more diverse sample of respondents in the days of Covid-19 restrictions which restricts the opportunities for poll sampling from the general public.
△ Less
Submitted 8 July, 2020;
originally announced July 2020.
-
UV-Net: Learning from Boundary Representations
Authors:
Pradeep Kumar Jayaraman,
Aditya Sanghi,
Joseph G. Lambourne,
Karl D. D. Willis,
Thomas Davies,
Hooman Shayani,
Nigel Morris
Abstract:
We introduce UV-Net, a novel neural network architecture and representation designed to operate directly on Boundary representation (B-rep) data from 3D CAD models. The B-rep format is widely used in the design, simulation and manufacturing industries to enable sophisticated and precise CAD modeling operations. However, B-rep data presents some unique challenges when used with modern machine learn…
▽ More
We introduce UV-Net, a novel neural network architecture and representation designed to operate directly on Boundary representation (B-rep) data from 3D CAD models. The B-rep format is widely used in the design, simulation and manufacturing industries to enable sophisticated and precise CAD modeling operations. However, B-rep data presents some unique challenges when used with modern machine learning due to the complexity of the data structure and its support for both continuous non-Euclidean geometric entities and discrete topological entities. In this paper, we propose a unified representation for B-rep data that exploits the U and V parameter domain of curves and surfaces to model geometry, and an adjacency graph to explicitly model topology. This leads to a unique and efficient network architecture, UV-Net, that couples image and graph convolutional neural networks in a compute and memory-efficient manner. To aid in future research we present a synthetic labelled B-rep dataset, SolidLetters, derived from human designed fonts with variations in both geometry and topology. Finally we demonstrate that UV-Net can generalize to supervised and unsupervised tasks on five datasets, while outperforming alternate 3D shape representations such as point clouds, voxels, and meshes.
△ Less
Submitted 25 April, 2021; v1 submitted 17 June, 2020;
originally announced June 2020.
-
A subtractive manufacturing constraint for level set topology optimization
Authors:
Nigel Morris,
Adrian Butscher,
Francesco Iorio
Abstract:
We present a method for enforcing manufacturability constraints in generated parts such that they will be automatically ready for fabrication using a subtractive approach. We primarily target multi-axis CNC milling approaches but the method should generalize to other subtractive methods as well. To this end, we take as user input: the radius of curvature of the tool bit, a coarse model of the tool…
▽ More
We present a method for enforcing manufacturability constraints in generated parts such that they will be automatically ready for fabrication using a subtractive approach. We primarily target multi-axis CNC milling approaches but the method should generalize to other subtractive methods as well. To this end, we take as user input: the radius of curvature of the tool bit, a coarse model of the tool head and optionally a set of milling directions. This allows us to enforce the following manufacturability conditions: 1) surface smoothness such that the radius of curvature of the part does not exceed the milling bit radius, 2) orientation such that every part of the surface to be milled is visible from at least one milling direction, 3) accessibility such that every surface patch can be reached by the tool bit without interference with the tool or head mount. We will show how to efficiently enforce the constraint during level set-based topology optimization modifying the advection velocity such that at each iteration the topology optimization maintains a descent optimization direction and does not violate any of the manufacturability conditions. This approach models the actual subtractive process by carving away material accessible to the machine at each iteration until a local optimum is achieved.
△ Less
Submitted 19 February, 2020;
originally announced February 2020.
-
The Solar Orbiter SPICE instrument -- An extreme UV imaging spectrometer
Authors:
The SPICE Consortium,
:,
M. Anderson,
T. Appourchaux,
F. Auchère,
R. Aznar Cuadrado,
J. Barbay,
F. Baudin,
S. Beardsley,
K. Bocchialini,
B. Borgo,
D. Bruzzi,
E. Buchlin,
G. Burton,
V. Blüchel,
M. Caldwell,
S. Caminade,
M. Carlsson,
W. Curdt,
J. Davenne,
J. Davila,
C. E. DeForest,
G. Del Zanna,
D. Drummond,
J. Dubau
, et al. (66 additional authors not shown)
Abstract:
The Spectral Imaging of the Coronal Environment (SPICE) instrument is a high-resolution imaging spectrometer operating at extreme ultraviolet (EUV) wavelengths. In this paper, we present the concept, design, and pre-launch performance of this facility instrument on the ESA/NASA Solar Orbiter mission. The goal of this paper is to give prospective users a better understanding of the possible types o…
▽ More
The Spectral Imaging of the Coronal Environment (SPICE) instrument is a high-resolution imaging spectrometer operating at extreme ultraviolet (EUV) wavelengths. In this paper, we present the concept, design, and pre-launch performance of this facility instrument on the ESA/NASA Solar Orbiter mission. The goal of this paper is to give prospective users a better understanding of the possible types of observations, the data acquisition, and the sources that contribute to the instrument's signal. The paper discusses the science objectives, with a focus on the SPICE-specific aspects, before presenting the instrument's design, including optical, mechanical, thermal, and electronics aspects. This is followed by a characterisation and calibration of the instrument's performance. The paper concludes with descriptions of the operations concept and data processing. The performance measurements of the various instrument parameters meet the requirements derived from the mission's science objectives. The SPICE instrument is ready to perform measurements that will provide vital contributions to the scientific success of the Solar Orbiter mission.
△ Less
Submitted 3 September, 2019;
originally announced September 2019.
-
Zodiacal Exoplanets in Time (ZEIT) VI: a three-planet system in the Hyades cluster including an Earth-sized planet
Authors:
Andrew W. Mann,
Andrew Vanderburg,
Aaron C. Rizzuto,
Adam L. Kraus,
Perry Berlind,
Allyson Bieryla,
Michael L. Calkins,
Gilbert A. Esquerdo,
David W. Latham,
Gregory N. Mace,
Nathan R. Morris,
Samuel N. Quinn,
Kimberly R. Sokal,
Robert P. Stefanik
Abstract:
Planets in young clusters are powerful probes of the evolution of planetary systems. Here we report the discovery of three planets transiting K2-136 (EPIC 247589423), a late K dwarf in the Hyades (~800 Myr) cluster, and robust detection limits for additional planets in the system. The planets were identified from their K2 light curves, as part of our survey of young clusters and star forming regio…
▽ More
Planets in young clusters are powerful probes of the evolution of planetary systems. Here we report the discovery of three planets transiting K2-136 (EPIC 247589423), a late K dwarf in the Hyades (~800 Myr) cluster, and robust detection limits for additional planets in the system. The planets were identified from their K2 light curves, as part of our survey of young clusters and star forming regions. The smallest planet has a radius comparable to Earth (0.99 +/- 0.05 Earth radii), making it one of the few Earth-sized planets with a known, young age. The two larger planets are likely a mini-Neptune and a super-Earth, with radii of 2.91+/-0.11 and 1.45+/-0.10 Earth radii, respectively. The predicted radial velocity signals from these planets are between 0.4 and 2 m/s, achievable with modern precision RV spectrographs. Because the target star is bright (V=11.2) and has relatively low-amplitude stellar variability for a young star (2-6 mmag), EPIC 247589423 hosts the best planets known in a young open cluster for precise radial velocity follow-up, enabling a robust test of earlier claims that young planets are less dense than their older counterparts.
△ Less
Submitted 6 November, 2017; v1 submitted 29 September, 2017;
originally announced September 2017.
-
Estimating the risk associated with transportation technology using multifidelity simulation
Authors:
Erik J. Schlicht,
Nichole L. Morris
Abstract:
This paper provides a quantitative method for estimating the risk associated with candidate transportation technology, before it is developed and deployed. The proposed solution extends previous methods that rely exclusively on low-fidelity human-in-the-loop experimental data, or high-fidelity traffic data, by adopting a multifidelity approach that leverages data from both low- and high-fidelity s…
▽ More
This paper provides a quantitative method for estimating the risk associated with candidate transportation technology, before it is developed and deployed. The proposed solution extends previous methods that rely exclusively on low-fidelity human-in-the-loop experimental data, or high-fidelity traffic data, by adopting a multifidelity approach that leverages data from both low- and high-fidelity sources. The multifidelity method overcomes limitations inherent to existing approaches by allowing a model to be trained inexpensively, while still assuring that its predictions generalize to the real-world. This allows for candidate technologies to be evaluated at the stage of conception, and enables a mechanism for only the safest and most effective technology to be developed and released.
△ Less
Submitted 31 January, 2017; v1 submitted 30 January, 2017;
originally announced January 2017.
-
Rgbp: An R Package for Gaussian, Poisson, and Binomial Random Effects Models with Frequency Coverage Evaluations
Authors:
Hyungsuk Tak,
Joseph Kelly,
Carl N. Morris
Abstract:
Rgbp is an R package that provides estimates and verifiable confidence intervals for random effects in two-level conjugate hierarchical models for overdispersed Gaussian, Poisson, and Binomial data. Rgbp models aggregate data from k independent groups summarized by observed sufficient statistics for each random effect, such as sample means, possibly with covariates. Rgbp uses approximate Bayesian…
▽ More
Rgbp is an R package that provides estimates and verifiable confidence intervals for random effects in two-level conjugate hierarchical models for overdispersed Gaussian, Poisson, and Binomial data. Rgbp models aggregate data from k independent groups summarized by observed sufficient statistics for each random effect, such as sample means, possibly with covariates. Rgbp uses approximate Bayesian machinery with unique improper priors for the hyper-parameters, which leads to good repeated sampling coverage properties for random effects. A special feature of Rgbp is an option that generates synthetic data sets to check whether the interval estimates for random effects actually meet the nominal confidence levels. Additionally, Rgbp provides inference statistics for the hyper-parameters, e.g., regression coefficients.
△ Less
Submitted 5 December, 2016;
originally announced December 2016.
-
Data-dependent Posterior Propriety of Bayesian Beta-Binomial-Logit Model
Authors:
Hyungsuk Tak,
Carl N. Morris
Abstract:
A Beta-Binomial-Logit model is a Beta-Binomial model with covariate information incorporated via a logistic regression. Posterior propriety of a Bayesian Beta-Binomial-Logit model can be data-dependent for improper hyper-prior distributions. Various researchers in the literature have unknowingly used improper posterior distributions or have given incorrect statements about posterior propriety beca…
▽ More
A Beta-Binomial-Logit model is a Beta-Binomial model with covariate information incorporated via a logistic regression. Posterior propriety of a Bayesian Beta-Binomial-Logit model can be data-dependent for improper hyper-prior distributions. Various researchers in the literature have unknowingly used improper posterior distributions or have given incorrect statements about posterior propriety because checking posterior propriety can be challenging due to the complicated functional form of a Beta-Binomial-Logit model. We derive data-dependent necessary and sufficient conditions for posterior propriety within a class of hyper-prior distributions that encompass those used in previous studies.
△ Less
Submitted 22 December, 2015;
originally announced December 2015.
-
Detailed analysis of the predictions of loop quantum cosmology for the primordial power spectra
Authors:
Ivan Agullo,
Noah A. Morris
Abstract:
We provide an exhaustive numerical exploration of the predictions of loop quantum cosmology (LQC) with a post-bounce phase of inflation for the primordial power spectrum of scalar and tensor perturbations. We extend previous analysis by characterizing the phenomenologically relevant parameter space and by constraining it using observations. Furthermore, we characterize the shape of LQC-corrections…
▽ More
We provide an exhaustive numerical exploration of the predictions of loop quantum cosmology (LQC) with a post-bounce phase of inflation for the primordial power spectrum of scalar and tensor perturbations. We extend previous analysis by characterizing the phenomenologically relevant parameter space and by constraining it using observations. Furthermore, we characterize the shape of LQC-corrections to observable quantities across this parameter space. Our analysis provides a framework to contrast more accurately the theory with forthcoming polarization data, and it also paves the road for the computation of other observables beyond the power spectra, such as non-Gaussianity.
△ Less
Submitted 18 September, 2015;
originally announced September 2015.
-
The Mid-Infrared Instrument for JWST, II: Design and Build
Authors:
G. S. Wright,
David Wright,
G. B. Goodson,
G. H. Rieke,
Gabby Aitink-Kroes,
J. Amiaux,
Ana Aricha-Yanguas,
Ruyman Azzolini,
Kimberly Banks,
D. Barrado-Navascues,
T. Belenguer-Davila,
J. A. D. L. Bloemmart,
Patrice Bouchet,
B. R. Brandl,
L. Colina,
Ors Detre,
Eva Diaz-Catala,
Paul Eccleston,
Scott D. Friedman,
Macarena Garcia-Marin,
Manuel Guedel,
Alistair Glasse,
Adrian M. Glauser,
T. P. Greene,
Uli Groezinger
, et al. (48 additional authors not shown)
Abstract:
The Mid-InfraRed Instrument (MIRI) on the James Webb Space Telescope (JWST) provides measurements over the wavelength range 5 to 28.5 microns. MIRI has, within a single 'package', four key scientific functions: photometric imaging, coronagraphy, single-source low-spectral resolving power (R ~ 100) spectroscopy, and medium-resolving power (R ~ 1500 to 3500) integral field spectroscopy. An associate…
▽ More
The Mid-InfraRed Instrument (MIRI) on the James Webb Space Telescope (JWST) provides measurements over the wavelength range 5 to 28.5 microns. MIRI has, within a single 'package', four key scientific functions: photometric imaging, coronagraphy, single-source low-spectral resolving power (R ~ 100) spectroscopy, and medium-resolving power (R ~ 1500 to 3500) integral field spectroscopy. An associated cooler system maintains MIRI at its operating temperature of < 6.7 K. This paper describes the driving principles behind the design of MIRI, the primary design parameters, and their realization in terms of the 'as-built' instrument. It also describes the test program that led to delivery of the tested and calibrated Flight Model to NASA in 2012, and the confirmation after delivery of the key interface requirements.
△ Less
Submitted 10 August, 2015;
originally announced August 2015.
-
Measuring and Managing Answer Quality for Online Data-Intensive Services
Authors:
Jaimie Kelley,
Christopher Stewart,
Nathaniel Morris,
Devesh Tiwari,
Yuxiong He,
Sameh Elnikety
Abstract:
Online data-intensive services parallelize query execution across distributed software components. Interactive response time is a priority, so online query executions return answers without waiting for slow running components to finish. However, data from these slow components could lead to better answers. We propose Ubora, an approach to measure the effect of slow running components on the qualit…
▽ More
Online data-intensive services parallelize query execution across distributed software components. Interactive response time is a priority, so online query executions return answers without waiting for slow running components to finish. However, data from these slow components could lead to better answers. We propose Ubora, an approach to measure the effect of slow running components on the quality of answers. Ubora randomly samples online queries and executes them twice. The first execution elides data from slow components and provides fast online answers; the second execution waits for all components to complete. Ubora uses memoization to speed up mature executions by replaying network messages exchanged between components. Our systems-level implementation works for a wide range of platforms, including Hadoop/Yarn, Apache Lucene, the EasyRec Recommendation Engine, and the OpenEphyra question answering system. Ubora computes answer quality much faster than competing approaches that do not use memoization. With Ubora, we show that answer quality can and should be used to guide online admission control. Our adaptive controller processed 37% more queries than a competing controller guided by the rate of timeouts.
△ Less
Submitted 16 June, 2015;
originally announced June 2015.
-
Shrinkage Estimation in Multilevel Normal Models
Authors:
Carl N. Morris,
Martin Lysy
Abstract:
This review traces the evolution of theory that started when Charles Stein in 1955 [In Proc. 3rd Berkeley Sympos. Math. Statist. Probab. I (1956) 197--206, Univ. California Press] showed that using each separate sample mean from $k\ge3$ Normal populations to estimate its own population mean $μ_i$ can be improved upon uniformly for every possible $μ=(μ_1,...,μ_k)'$. The dominating estimators, refer…
▽ More
This review traces the evolution of theory that started when Charles Stein in 1955 [In Proc. 3rd Berkeley Sympos. Math. Statist. Probab. I (1956) 197--206, Univ. California Press] showed that using each separate sample mean from $k\ge3$ Normal populations to estimate its own population mean $μ_i$ can be improved upon uniformly for every possible $μ=(μ_1,...,μ_k)'$. The dominating estimators, referred to here as being "Model-I minimax," can be found by shrinking the sample means toward any constant vector. Admissible minimax shrinkage estimators were derived by Stein and others as posterior means based on a random effects model, "Model-II" here, wherein the $μ_i$ values have their own distributions. Section 2 centers on Figure 2, which organizes a wide class of priors on the unknown Level-II hyperparameters that have been proved to yield admissible Model-I minimax shrinkage estimators in the "equal variance case." Putting a flat prior on the Level-II variance is unique in this class for its scale-invariance and for its conjugacy, and it induces Stein's harmonic prior (SHP) on $μ_i$.
△ Less
Submitted 26 March, 2012;
originally announced March 2012.
-
Guidelines for reporting the use of gel electrophoresis in proteomics
Authors:
Frank Gibson,
Leigh Anderson,
Gyorgy Babnigg,
Mark Baker,
Matthias Berth,
Pierre-Alain Binz,
Andy Borthwick,
Phil Cash,
Billy W Day,
David B Friedman,
Donita Garland,
Howard B Gutstein,
Christine Hoogland,
Neil A Jones,
Alamgir Khan,
Joachim Klose,
Angus I Lamond,
Peter F Lemkin,
Kathryn S Lilley,
Jonathan Minden,
Nicholas J Morris,
Norman W Paton,
Michael R Pisano,
John E Prime,
Thierry Rabilloud
, et al. (5 additional authors not shown)
Abstract:
the MIAPE Gel Electrophoresis (MIAPE-GE) guidelines specify the minimum information that should be provided when reporting the use of n-dimensional gel electrophoresis in a proteomics experiment. Developed through a joint effort between the gel-based analysis working group of the Human Proteome Organisation's Proteomics Standards Initiative (HUPO-PSI; http://www.psidev.info/) and the wider prote…
▽ More
the MIAPE Gel Electrophoresis (MIAPE-GE) guidelines specify the minimum information that should be provided when reporting the use of n-dimensional gel electrophoresis in a proteomics experiment. Developed through a joint effort between the gel-based analysis working group of the Human Proteome Organisation's Proteomics Standards Initiative (HUPO-PSI; http://www.psidev.info/) and the wider proteomics community, they constitute one part of the overall Minimum Information about a Proteomics Experiment (MIAPE) documentation system published last August in Nature Biotechnology
△ Less
Submitted 4 April, 2009;
originally announced April 2009.
-
Comment: Microarrays, Empirical Bayes and the Two-Groups Model
Authors:
Carl N. Morris
Abstract:
Brad Efron's paper [arXiv:0808.0572] has inspired a return to the ideas behind Bayes, frequency and empirical Bayes. The latter preferably would not be limited to exchangeable models for the data and hyperparameters. Parallels are revealed between microarray analyses and profiling of hospitals, with advances suggesting more decision modeling for gene identification also. Then good multilevel and…
▽ More
Brad Efron's paper [arXiv:0808.0572] has inspired a return to the ideas behind Bayes, frequency and empirical Bayes. The latter preferably would not be limited to exchangeable models for the data and hyperparameters. Parallels are revealed between microarray analyses and profiling of hospitals, with advances suggesting more decision modeling for gene identification also. Then good multilevel and empirical Bayes models for random effects should be sought when regression toward the mean is anticipated.
△ Less
Submitted 5 August, 2008;
originally announced August 2008.