-
ZynqMP-based board-management mezzanines for Serenity ATCA-blades
Authors:
T. Mehner,
L. E. Ardila-Perez,
M. N. Balzer,
O. Sander,
D. Tcherniakhovski,
M. Schleicher,
M. Fuchs,
G. Fedi,
G. Gimas,
G. M. Iles,
M. Pesaresi,
A. W. Rose,
T. Schuh
Abstract:
In the context of the CMS Phase-2 tracker back-end processing system, two mezzanines based on the Zynq Ultrascale+ Multi-Processor System-on-Chip (MPSoC) device have been developed to serve as a centralized slow-control and board-management solution for the Serenity-family Advanced Telecommunications Computing Architecture (ATCA) blades.
This paper presents the development of the MPSoC mezzanines to execute the Intelligent Platform Management Controller (IPMC) software on the real-time-capable processors of the MPSoC. In coordination with the Shelf Manager, once full power is enabled, a CentOS-based Linux distribution is executed on the application processors of the MPSoC, on which EMPButler and the Serenity Management Shell (SMASH) run.
Submitted 4 April, 2024;
originally announced April 2024.
-
Biomedical knowledge graph-optimized prompt generation for large language models
Authors:
Karthik Soman,
Peter W Rose,
John H Morris,
Rabia E Akbas,
Brett Smith,
Braian Peetoom,
Catalina Villouta-Reyes,
Gabriel Cerono,
Yongmei Shi,
Angela Rizk-Jackson,
Sharat Israni,
Charlotte A Nelson,
Sui Huang,
Sergio E Baranzini
Abstract:
Large Language Models (LLMs) are being adopted at an unprecedented rate, yet still face challenges in knowledge-intensive domains like biomedicine. Solutions such as pre-training and domain-specific fine-tuning add substantial computational overhead, requiring further domain expertise. Here, we introduce a token-optimized and robust Knowledge Graph-based Retrieval Augmented Generation (KG-RAG) framework by leveraging a massive biomedical KG (SPOKE) with LLMs such as Llama-2-13b, GPT-3.5-Turbo and GPT-4, to generate meaningful biomedical text rooted in established knowledge. Compared to the existing RAG technique for Knowledge Graphs, the proposed method utilizes minimal graph schema for context extraction and uses embedding methods for context pruning. This optimization in context extraction results in more than 50% reduction in token consumption without compromising the accuracy, making a cost-effective and robust RAG implementation on proprietary LLMs. KG-RAG consistently enhanced the performance of LLMs across diverse biomedical prompts by generating responses rooted in established knowledge, accompanied by accurate provenance and statistical evidence (if available) to substantiate the claims. Further benchmarking on human-curated datasets, such as biomedical true/false and multiple-choice questions (MCQ), showed a remarkable 71% boost in the performance of the Llama-2 model on the challenging MCQ dataset, demonstrating the framework's capacity to empower open-source models with fewer parameters for domain-specific questions. Furthermore, KG-RAG enhanced the performance of proprietary GPT models, such as GPT-3.5 and GPT-4. In summary, the proposed framework combines explicit and implicit knowledge of KG and LLM in a token-optimized fashion, thus enhancing the adaptability of general-purpose LLMs to tackle domain-specific questions in a cost-effective fashion.
Submitted 13 May, 2024; v1 submitted 28 November, 2023;
originally announced November 2023.
-
NatCS: Eliciting Natural Customer Support Dialogues
Authors:
James Gung,
Emily Moeng,
Wesley Rose,
Arshit Gupta,
Yi Zhang,
Saab Mansour
Abstract:
Despite growing interest in applications based on natural customer support conversations, there exist remarkably few publicly available datasets that reflect the expected characteristics of conversations in these settings. Existing task-oriented dialogue datasets, which were collected to benchmark dialogue systems mainly in written human-to-bot settings, are not representative of real customer support conversations and do not provide realistic benchmarks for systems that are applied to natural data. To address this gap, we introduce NatCS, a multi-domain collection of spoken customer service conversations. We describe our process for collecting synthetic conversations between customers and agents based on natural language phenomena observed in real conversations. Compared to previous dialogue datasets, the conversations collected with our approach are more representative of real human-to-human conversations along multiple metrics. Finally, we demonstrate potential uses of NatCS, including dialogue act classification and intent induction from conversations, showing that dialogue act annotations in NatCS provide more effective training data for modeling real conversations compared to existing synthetic written datasets. We publicly release NatCS to facilitate research in natural dialog systems.
Submitted 4 May, 2023;
originally announced May 2023.
-
Intent Induction from Conversations for Task-Oriented Dialogue Track at DSTC 11
Authors:
James Gung,
Raphael Shu,
Emily Moeng,
Wesley Rose,
Salvatore Romeo,
Yassine Benajiba,
Arshit Gupta,
Saab Mansour,
Yi Zhang
Abstract:
With increasing demand for and adoption of virtual assistants, recent work has investigated ways to accelerate bot schema design through the automatic induction of intents or the induction of slots and dialogue states. However, a lack of dedicated benchmarks and standardized evaluation has made progress difficult to track and comparisons between systems difficult to make. This challenge track, held as part of the Eleventh Dialog Systems Technology Challenge, introduces a benchmark that aims to evaluate methods for the automatic induction of customer intents in a realistic setting of customer service interactions between human agents and customers. We propose two subtasks for progressively tackling the automatic induction of intents and corresponding evaluation methodologies. We then present three datasets suitable for evaluating the tasks and propose simple baselines. Finally, we summarize the submissions and results of the challenge track, for which we received submissions from 34 teams.
Submitted 25 April, 2023;
originally announced April 2023.
-
Nanometer-Scale Nuclear Magnetic Resonance Diffraction with Sub-Ångstrom Precision
Authors:
Holger Haas,
Sahand Tabatabaei,
William Rose,
Pardis Sahafi,
Michèle Piscitelli,
Andrew Jordan,
Pritam Priyadarsi,
Namanish Singh,
Ben Yager,
Philip J. Poole,
Dan Dalacu,
Raffi Budakian
Abstract:
Achieving atomic resolution is the ultimate limit of magnetic resonance imaging (MRI), and attaining this capability offers enormous technological and scientific opportunities, from drug development to understanding the dynamics in interacting quantum systems. In this work, we present a new approach to nanoMRI utilizing nuclear magnetic resonance diffraction (NMRd) -- a method that extends NMR imaging to probe the structure of periodic spin systems. The realization of NMRd on the atomic scale would create a powerful new methodology for materials characterization utilizing the spectroscopic capabilities of NMR. We describe two experiments that realize NMRd measurement of $^{31}$P spins in an indium-phosphide (InP) nanowire with sub-Ångstrom precision. In the first experiment, we encode a nanometer-scale spatial modulation of the $z$-axis magnetization by periodically inverting the $^{31}$P spins, and detect the period and position of the modulation with a precision of $<0.8$ Å. In the second experiment, we demonstrate an interferometric technique, utilizing NMRd, for detecting an Ångstrom-scale displacement of the InP sample with a precision of 0.07 Å. The diffraction-based techniques developed in this work represent new measurement modalities in NMR for probing the structure and dynamics of spins on sub-Ångstrom length scales, and demonstrate the feasibility of crystallographic MRI measurements.
Submitted 1 April, 2022;
originally announced April 2022.
-
Numerical Engineering of Robust Adiabatic Operations
Authors:
Sahand Tabatabaei,
Holger Haas,
William Rose,
Ben Yager,
Michèle Piscitelli,
Pardis Sahafi,
Andrew Jordan,
Philip J. Poole,
Dan Dalacu,
Raffi Budakian
Abstract:
Adiabatic operations are powerful tools for robust quantum control in numerous fields of physics, chemistry and quantum information science. The inherent robustness due to adiabaticity can, however, be impaired in applications requiring short evolution times. We present a single versatile gradient-based optimization protocol that combines adiabatic control with effective Hamiltonian engineering in order to design adiabatic operations tailored to the specific imperfections and resources of an experimental setup. The practicality of the protocol is demonstrated by engineering a fast, 2.3 Rabi cycle-long adiabatic inversion pulse for magnetic resonance with built-in robustness to Rabi field inhomogeneities and resonance offsets. The performance and robustness of the pulse are validated in a nanoscale force-detected magnetic resonance experiment on a solid-state sample, indicating an ensemble-averaged inversion accuracy of $\sim 99.997\%$. We further showcase the utility of our protocol by providing examples of adiabatic pulses robust to spin-spin interactions, parameter-selective operations and operations connecting arbitrary states, each motivated by experiments.
Submitted 30 April, 2021; v1 submitted 7 September, 2020;
originally announced September 2020.
-
Ultra-low dissipation patterned silicon nanowire arrays for scanning probe microscopy
Authors:
Pardis Sahafi,
William Rose,
Andrew Jordan,
Ben Yager,
Michèle Piscitelli,
Raffi Budakian
Abstract:
In recent years, self-assembled semiconductor nanowires have been successfully used as ultra-sensitive cantilevers in a number of unique scanning probe microscopy (SPM) settings. We describe the fabrication of ultra-low dissipation patterned silicon nanowire (SiNW) arrays optimized for scanning probe applications. Our fabrication process produces, with high yield, ultra-high aspect ratio vertical SiNWs that exhibit exceptional force sensitivity. The highest sensitivity SiNWs have thermomechanical-noise limited force sensitivity of $9.7\pm0.4~\text{aN}/\sqrt{\text{Hz}}$ at room temperature and $500\pm20~\text{zN}/\sqrt{\text{Hz}}$ at 4 K. To facilitate their use in SPM, the SiNWs are patterned within $7~\mu\text{m}$ from the edge of the substrate, allowing convenient optical access for displacement detection.
Submitted 6 September, 2019;
originally announced September 2019.
-
Ten Simple Rules for Reproducible Research in Jupyter Notebooks
Authors:
Adam Rule,
Amanda Birmingham,
Cristal Zuniga,
Ilkay Altintas,
Shih-Cheng Huang,
Rob Knight,
Niema Moshiri,
Mai H. Nguyen,
Sara Brin Rosenthal,
Fernando Pérez,
Peter W. Rose
Abstract:
Reproducibility of computational studies is a hallmark of scientific methodology. It enables researchers to build with confidence on the methods and findings of others, reuse and extend computational pipelines, and thereby drive scientific progress. Since many experimental studies rely on computational analyses, biologists need guidance on how to set up and document reproducible data analyses or simulations.
In this paper, we address several questions about reproducibility. For example, what are the technical and non-technical barriers to reproducible computational studies? What opportunities and challenges do computational notebooks offer to overcome some of these barriers? What tools are available and how can they be used effectively?
We have developed a set of rules to serve as a guide to scientists with a specific focus on computational notebook systems, such as Jupyter Notebooks, which have become a tool of choice for many applications. Notebooks combine detailed workflows with narrative text and visualization of results. Combined with software repositories and open source licensing, notebooks are powerful tools for transparent, collaborative, reproducible, and reusable data analyses.
Submitted 13 October, 2018;
originally announced October 2018.
-
High-Resolution Nanoscale Solid-State Nuclear Magnetic Resonance Spectroscopy
Authors:
William Rose,
Holger Haas,
Angela Q. Chen,
Nari Jeon,
Lincoln J. Lauhon,
David G. Cory,
Raffi Budakian
Abstract:
We present a new method for high-resolution nanoscale magnetic resonance imaging (nano-MRI) that combines the high spin sensitivity of nanowire-based magnetic resonance detection with high spectral resolution nuclear magnetic resonance (NMR) spectroscopy. By applying NMR pulses designed using optimal control theory, we demonstrate a factor of $500$ reduction of the proton spin resonance linewidth in a $(50\text{-nm})^{\text{3}}$ volume of polystyrene and image proton spins in one dimension with a spatial resolution below $2~\text{nm}$.
Submitted 4 July, 2017;
originally announced July 2017.
-
The Q_weak Experimental Apparatus
Authors:
Qweak Collaboration,
T. Allison,
M. Anderson,
D. Androic,
D. S. Armstrong,
A. Asaturyan,
T. D. Averett,
R. Averill,
J. Balewski,
J. Beaufait,
R. S. Beminiwattha,
J. Benesch,
F. Benmokhtar,
J. Bessuille,
J. Birchall,
E. Bonnell,
J. Bowman,
P. Brindza,
D. B. Brown,
R. D. Carlini,
G. D. Cates,
B. Cavness,
G. Clark,
J. C. Cornejo,
S. Covrig Dusa,
et al. (104 additional authors not shown)
Abstract:
The Jefferson Lab Q_weak experiment determined the weak charge of the proton by measuring the parity-violating elastic scattering asymmetry of longitudinally polarized electrons from an unpolarized liquid hydrogen target at small momentum transfer. A custom apparatus was designed for this experiment to meet the technical challenges presented by the smallest and most precise ${\vec{e}}$p asymmetry ever measured. Technical milestones were achieved at Jefferson Lab in target power, beam current, beam helicity reversal rate, polarimetry, detected rates, and control of helicity-correlated beam properties. The experiment employed 180 μA of 89% longitudinally polarized electrons whose helicity was reversed 960 times per second. The electrons were accelerated to 1.16 GeV and directed to a beamline with extensive instrumentation to measure helicity-correlated beam properties that can induce false asymmetries. Møller and Compton polarimetry were used to measure the electron beam polarization to better than 1%. The electron beam was incident on a 34.4 cm liquid hydrogen target. After passing through a triple collimator system, scattered electrons between 5.8 degrees and 11.6 degrees were bent in the toroidal magnetic field of a resistive copper-coil magnet. The electrons inside this acceptance were focused onto eight fused silica Cerenkov detectors arrayed symmetrically around the beam axis. A total scattered electron rate of about 7 GHz was incident on the detector array. The detectors were read out in integrating mode by custom-built low-noise pre-amplifiers and 18-bit sampling ADC modules. The momentum transfer Q^2 = 0.025 GeV^2 was determined using dedicated low-current (~100 pA) measurements with a set of drift chambers before (and a set of drift chambers and trigger scintillation counters after) the toroidal magnet.
Submitted 6 January, 2015; v1 submitted 24 September, 2014;
originally announced September 2014.