-
Unlocking the Potential of Renewable Energy Through Curtailment Prediction
Authors:
Bilge Acun,
Brent Morgan,
Henry Richardson,
Nat Steinsultz,
Carole-Jean Wu
Abstract:
A significant fraction (5-15%) of renewable energy generated goes into waste in the grids around the world today due to oversupply issues and transmission constraints. Being able to predict when and where renewable curtailment occurs would improve renewable utilization. The core of this work is to enable the machine learning community to help decarbonize electricity grids by unlocking the potentia…
▽ More
A significant fraction (5-15%) of renewable energy generated goes into waste in the grids around the world today due to oversupply issues and transmission constraints. Being able to predict when and where renewable curtailment occurs would improve renewable utilization. The core of this work is to enable the machine learning community to help decarbonize electricity grids by unlocking the potential of renewable energy through curtailment prediction.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Challenges for Responsible AI Design and Workflow Integration in Healthcare: A Case Study of Automatic Feeding Tube Qualification in Radiology
Authors:
Anja Thieme,
Abhijith Rajamohan,
Benjamin Cooper,
Heather Groombridge,
Robert Simister,
Barney Wong,
Nicholas Woznitza,
Mark Ames Pinnock,
Maria Teodora Wetscherek,
Cecily Morrison,
Hannah Richardson,
Fernando Pérez-García,
Stephanie L. Hyland,
Shruthi Bannur,
Daniel C. Castro,
Kenza Bouzid,
Anton Schwaighofer,
Mercy Ranjit,
Harshita Sharma,
Matthew P. Lungren,
Ozan Oktay,
Javier Alvarez-Valle,
Aditya Nori,
Stephen Harris,
Joseph Jacob
Abstract:
Nasogastric tubes (NGTs) are feeding tubes that are inserted through the nose into the stomach to deliver nutrition or medication. If not placed correctly, they can cause serious harm, even death to patients. Recent AI developments demonstrate the feasibility of robustly detecting NGT placement from Chest X-ray images to reduce risks of sub-optimally or critically placed NGTs being missed or delay…
▽ More
Nasogastric tubes (NGTs) are feeding tubes that are inserted through the nose into the stomach to deliver nutrition or medication. If not placed correctly, they can cause serious harm, even death to patients. Recent AI developments demonstrate the feasibility of robustly detecting NGT placement from Chest X-ray images to reduce risks of sub-optimally or critically placed NGTs being missed or delayed in their detection, but gaps remain in clinical practice integration. In this study, we present a human-centered approach to the problem and describe insights derived following contextual inquiry and in-depth interviews with 15 clinical stakeholders. The interviews helped understand challenges in existing workflows, and how best to align technical capabilities with user needs and expectations. We discovered the trade-offs and complexities that need consideration when choosing suitable workflow stages, target users, and design configurations for different AI proposals. We explored how to balance AI benefits and risks for healthcare staff and patients within broader organizational and medical-legal constraints. We also identified data issues related to edge cases and data biases that affect model training and evaluation; how data documentation practices influence data preparation and labelling; and how to measure relevant AI outcomes reliably in future evaluations. We discuss how our work informs design and development of AI applications that are clinically useful, ethical, and acceptable in real-world healthcare services.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1092 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…
▽ More
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
△ Less
Submitted 14 June, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology
Authors:
Nur Yildirim,
Hannah Richardson,
Maria T. Wetscherek,
Junaid Bajwa,
Joseph Jacob,
Mark A. Pinnock,
Stephen Harris,
Daniel Coelho de Castro,
Shruthi Bannur,
Stephanie L. Hyland,
Pratik Ghosh,
Mercy Ranjit,
Kenza Bouzid,
Anton Schwaighofer,
Fernando Pérez-García,
Harshita Sharma,
Ozan Oktay,
Matthew Lungren,
Javier Alvarez-Valle,
Aditya Nori,
Anja Thieme
Abstract:
Recent advances in AI combine large language models (LLMs) with vision encoders that bring forward unprecedented technical capabilities to leverage for a wide range of healthcare applications. Focusing on the domain of radiology, vision-language models (VLMs) achieve good performance results for tasks such as generating radiology findings based on a patient's medical image, or answering visual que…
▽ More
Recent advances in AI combine large language models (LLMs) with vision encoders that bring forward unprecedented technical capabilities to leverage for a wide range of healthcare applications. Focusing on the domain of radiology, vision-language models (VLMs) achieve good performance results for tasks such as generating radiology findings based on a patient's medical image, or answering visual questions (e.g., 'Where are the nodules in this chest X-ray?'). However, the clinical utility of potential applications of these capabilities is currently underexplored. We engaged in an iterative, multidisciplinary design process to envision clinically relevant VLM interactions, and co-designed four VLM use concepts: Draft Report Generation, Augmented Report Review, Visual Search and Querying, and Patient Imaging History Highlights. We studied these concepts with 13 radiologists and clinicians who assessed the VLM concepts as valuable, yet articulated many design considerations. Reflecting on our findings, we discuss implications for integrating VLM capabilities in radiology, and for healthcare AI more generally.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
RadEdit: stress-testing biomedical vision models via diffusion image editing
Authors:
Fernando Pérez-García,
Sam Bond-Taylor,
Pedro P. Sanchez,
Boris van Breugel,
Daniel C. Castro,
Harshita Sharma,
Valentina Salvatelli,
Maria T. A. Wetscherek,
Hannah Richardson,
Matthew P. Lungren,
Aditya Nori,
Javier Alvarez-Valle,
Ozan Oktay,
Maximilian Ilse
Abstract:
Biomedical imaging datasets are often small and biased, meaning that real-world performance of predictive models can be substantially lower than expected from internal testing. This work proposes using generative image editing to simulate dataset shifts and diagnose failure modes of biomedical vision models; this can be used in advance of deployment to assess readiness, potentially reducing cost a…
▽ More
Biomedical imaging datasets are often small and biased, meaning that real-world performance of predictive models can be substantially lower than expected from internal testing. This work proposes using generative image editing to simulate dataset shifts and diagnose failure modes of biomedical vision models; this can be used in advance of deployment to assess readiness, potentially reducing cost and patient harm. Existing editing methods can produce undesirable changes, with spurious correlations learned due to the co-occurrence of disease and treatment interventions, limiting practical applicability. To address this, we train a text-to-image diffusion model on multiple chest X-ray datasets and introduce a new editing method RadEdit that uses multiple masks, if present, to constrain changes and ensure consistency in the edited images. We consider three types of dataset shifts: acquisition shift, manifestation shift, and population shift, and demonstrate that our approach can diagnose failures and quantify model robustness without additional data collection, complementing more qualitative tools for explainable AI.
△ Less
Submitted 3 April, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Co-audit: tools to help humans double-check AI-generated content
Authors:
Andrew D. Gordon,
Carina Negreanu,
José Cambronero,
Rasika Chakravarthy,
Ian Drosos,
Hao Fang,
Bhaskar Mitra,
Hannah Richardson,
Advait Sarkar,
Stephanie Simmons,
Jack Williams,
Ben Zorn
Abstract:
Users are increasingly being warned to check AI-generated content for correctness. Still, as LLMs (and other generative models) generate more complex output, such as summaries, tables, or code, it becomes harder for the user to audit or evaluate the output for quality or correctness. Hence, we are seeing the emergence of tool-assisted experiences to help the user double-check a piece of AI-generat…
▽ More
Users are increasingly being warned to check AI-generated content for correctness. Still, as LLMs (and other generative models) generate more complex output, such as summaries, tables, or code, it becomes harder for the user to audit or evaluate the output for quality or correctness. Hence, we are seeing the emergence of tool-assisted experiences to help the user double-check a piece of AI-generated content. We refer to these as co-audit tools. Co-audit tools complement prompt engineering techniques: one helps the user construct the input prompt, while the other helps them check the output response. As a specific example, this paper describes recent research on co-audit tools for spreadsheet computations powered by generative models. We explain why co-audit experiences are essential for any application of generative AI where quality is important and errors are consequential (as is common in spreadsheet computations). We propose a preliminary list of principles for co-audit, and outline research challenges.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Analysis of parallel I/O use on the UK national supercomputing service, ARCHER using Cray LASSi and EPCC SAFE
Authors:
Andrew Turner,
Dominic Sloan-Murphy,
Karthee Sivalingam,
Harvey Richardson,
Julian Kunkel
Abstract:
In this paper, we describe how we have used a combination of the LASSi tool (developed by Cray) and the SAFE software (developed by EPCC) to collect and analyse Lustre I/O performance data for all jobs running on the UK national supercomputing service, ARCHER; and to provide reports on I/O usage for users in our standard reporting framework. We also present results from analysis of parallel I/O us…
▽ More
In this paper, we describe how we have used a combination of the LASSi tool (developed by Cray) and the SAFE software (developed by EPCC) to collect and analyse Lustre I/O performance data for all jobs running on the UK national supercomputing service, ARCHER; and to provide reports on I/O usage for users in our standard reporting framework. We also present results from analysis of parallel I/O use on ARCHER and analysis on the potential impact of different applications on file system performance using metrics we have derived from the LASSi data. We show that the performance data from LASSi reveals how the same application can stress different components of the file system depending on how it is run, and how the LASSi risk metrics allow us to identify use cases that could potentially cause issues for global I/O performance and work with users to improve their I/O use. We use the IO-500 benchmark to help us understand how LASSi risk metrics correspond to observed performance on the ARCHER file systems. We also use LASSi data imported into SAFE to identify I/O use patterns associated with different research areas, understand how the research workflow gives rise to the observed patterns and project how this will affect I/O requirements in the future. Finally, we provide an overview of likely future directions for the continuation of this work.
△ Less
Submitted 10 June, 2019;
originally announced June 2019.
-
LASSi: Metric based I/O analytics for HPC
Authors:
Karthee Sivalingam,
Harvey Richardson,
Adrian Tate,
Martin Lafferty
Abstract:
LASSi is a tool aimed at analyzing application usage and contention caused by use of shared resources (filesystem or network) in a HPC system. LASSi was initially developed to support the ARCHER system where there are large variations in application requirements and occasional user complaints regarding filesystem performance manifested by variation in job runtimes or poor interactive response. LAS…
▽ More
LASSi is a tool aimed at analyzing application usage and contention caused by use of shared resources (filesystem or network) in a HPC system. LASSi was initially developed to support the ARCHER system where there are large variations in application requirements and occasional user complaints regarding filesystem performance manifested by variation in job runtimes or poor interactive response. LASSi takes an approach of defining derivative risk and ops metrics that relate to unusually high application I/O behaviour. The metrics are shown to correlate to applications that can experience variable performance or that may impact the performance of other applications. LASSi uses I/O statistics over time to provide application I/O profiles and has been automated to generate daily reports for ARCHER. We demonstrate how LASSi provides holistic I/O analysis by monitoring filesystem I/O, generating coarse profiles of filesystems and application runs and automating analysis of application slowdown using metrics.
△ Less
Submitted 10 June, 2019;
originally announced June 2019.
-
The PRL Stabilized High Resolution Echelle Fiber-fed Spectrograph: Instrument Description & First Radial Velocity Results
Authors:
Abhijit Chakraborty,
Suvrath Mahadevan,
Arpita Roy,
Vaibhav Dixit,
Eric Harvey Richardson,
Varun Dongre,
F. M. Pathan,
Priyanka Chaturvedi,
Vishal Shah,
Girish P. Ubale,
B. G. Anandarao
Abstract:
We present spectrograph design details and initial radial velocity results from the PRL optical fiber-fed high-resolution cross-dispersed echelle spectrograph (PARAS), which has recently been commissioned at the Mt Abu 1.2 m telescope, in India. Data obtained as part of the post-commissioning tests with PARAS show velocity precision better than 2m/s over a period of several months on bright RV sta…
▽ More
We present spectrograph design details and initial radial velocity results from the PRL optical fiber-fed high-resolution cross-dispersed echelle spectrograph (PARAS), which has recently been commissioned at the Mt Abu 1.2 m telescope, in India. Data obtained as part of the post-commissioning tests with PARAS show velocity precision better than 2m/s over a period of several months on bright RV standard stars. For observations of sigma-Dra we report 1.7m/s precision for a period of seven months and 2.1m/s for HD 9407 over a period of 2 months. PARAS is capable of a single-shot spectral coverage of 3800A - 9500A at a resolution of about 67,000. The RV results were obtained between 3800A and 6900A using simultaneous wavelength calibration with a Thorium-Argon (ThAr) hollow cathode lamp. The spectrograph is maintained under stable conditions of temperature with a precision of 0.01 - 0.02C (rms) at 25.55C, and enclosed in a vacuum vessel at pressure of 0.1 +/-0.03 mbar. The blaze peak efficiency of the spectrograph between 5000A and 6500A, including the detector, is 30%; and about 25% with the fiber transmission. The total efficiency, including spectrograph, fiber transmission, focal ratio degradation (FRD), and telescope (with 81% reflectivity) is about 7% in the same wavelength region on a clear night with good seeing conditions.
△ Less
Submitted 19 December, 2013;
originally announced December 2013.
-
First Light results from PARAS: The PRL Echelle Spectrograph
Authors:
Abhijit Chakraborty,
Suvrath Mahadevan,
Arpita Roy,
Fazalahmed M. Pathan,
Vishal Shah,
Eric H. Richardson,
Girish Ubale,
Rajesh Shah
Abstract:
We present the first light commissioning results from the Physical Research Laboratory (PRL) optical fiber-fed high resolution cross-dispersed Echelle Spectrograph. It is capable of a single- shot spectral coverage of 3700A to 8600A at R ~ 63,000 and is under very stable conditions of temperature (0.04°C at 23°C). In the very near future pressure control will also be achieved by enclosing the enti…
▽ More
We present the first light commissioning results from the Physical Research Laboratory (PRL) optical fiber-fed high resolution cross-dispersed Echelle Spectrograph. It is capable of a single- shot spectral coverage of 3700A to 8600A at R ~ 63,000 and is under very stable conditions of temperature (0.04°C at 23°C). In the very near future pressure control will also be achieved by enclosing the entire spectrograph in a low-pressure vacuum chamber (~0.01mbar). It is attached to a 1.2m telescope using two 50micron core optical fibers (one for the star and another for simultaneous Th-Ar spectral calibration). The 1.2m telescope is located at Mt. Abu, India, and we are guaranteed about 80 to 100 nights a year for observations with the spectrograph. The instrument will be ultimately used for radial-velocity searches of exoplanets around 1000 dwarf stars, brighter than 10th magnitude, for the next 5 years with a precision of 3 to 5m/s using the simultaneous Th-Ar spectral lamp reference technique. The spectrograph has already achieved a stability of 3.7m/s in short-term time scale and in the near future we expect the stability to be at 1m/s once we install the spectrograph inside the vacuum chamber.
△ Less
Submitted 24 July, 2010;
originally announced July 2010.
-
A Quantum Description of Brownian Motion and the RC Circuit
Authors:
W. H. Richardson
Abstract:
An equation for the reduced density matrix which describes a free particle, that is interacting with a linearly dissipative medium, is derived using the total Hamiltonian, and without resorting to any artificial model.
A Master equation is also obtained for the dissipative medium. The theory erases the notion of the reservoir. It is shown that the dynamical interaction with the medium is not c…
▽ More
An equation for the reduced density matrix which describes a free particle, that is interacting with a linearly dissipative medium, is derived using the total Hamiltonian, and without resorting to any artificial model.
A Master equation is also obtained for the dissipative medium. The theory erases the notion of the reservoir. It is shown that the dynamical interaction with the medium is not completely determined by the friction constant (or the corresponding fluctuation-dissipation relation). Unexpectedly, through the density-density correlation, the longitudinal dielectric function is shown to play a critical role.
△ Less
Submitted 4 April, 2001;
originally announced April 2001.
-
Focussing Image Slicers: Refractive and Reflective
Authors:
E. H. Richardson,
A. Moore,
T. Tilleman,
David Crampton
Abstract:
A number of design options for image slicers for NGST and Gemini are being investigated. These image slicers are all of the focussing type and both refractive and reflective solutions are being explored. One such device, an image slicer that focuses 10 slices on a spectrograph slit is now in operation at the McMath Solar telescope. It consists of three lenslet arrays, and additionally acts as a…
▽ More
A number of design options for image slicers for NGST and Gemini are being investigated. These image slicers are all of the focussing type and both refractive and reflective solutions are being explored. One such device, an image slicer that focuses 10 slices on a spectrograph slit is now in operation at the McMath Solar telescope. It consists of three lenslet arrays, and additionally acts as a focal reducer and provides correction for astigmatism of the telescope. A combined refractive and reflective slicer designed for use on NGST delivers near-diffraction limited images for up to 40 slices.
△ Less
Submitted 10 November, 1999;
originally announced November 1999.
-
A Curvature-Compensated Corrector for Drift-Scan Observations
Authors:
Paul Hickson,
E. Harvey Richardson
Abstract:
Images obtained by drift-scanning with a stationary telescope are affected by the declination-dependent curvature of star trails. The image displacement to curvature and drift rate variation increases with the angular field of view and can lead to significant loss of resolution with modern large-format CCD arrays. We show that these effects can be essentially eliminated by means of an optical co…
▽ More
Images obtained by drift-scanning with a stationary telescope are affected by the declination-dependent curvature of star trails. The image displacement to curvature and drift rate variation increases with the angular field of view and can lead to significant loss of resolution with modern large-format CCD arrays. We show that these effects can be essentially eliminated by means of an optical corrector design in which individual lenses are tilted and decentered. A specific example is presented, of a four-element corrector designed for the Large-Zenith Telescope. The design reduces curvature errors to less than 0.074 arcsec over a 10 arcmin x 20 arcmin field of view centered at 49 degrees declination. By changing the positions and tilts of the lenses, the same design can also be used for any field centers between 0 degrees and 49 degrees.
△ Less
Submitted 22 June, 1998;
originally announced June 1998.