-
Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting
Authors:
Nicholas Harris,
Anand Butani,
Syed Hashmy
Abstract:
Embedding models are crucial for various natural language processing tasks but can be constrained by factors such as limited vocabulary, lack of context, and grammatical errors. This paper proposes a novel approach to improve embedding performance by leveraging large language models (LLMs) to enrich and rewrite input text before the embedding process. By utilizing ChatGPT 3.5 to provide additional context, correct inaccuracies, and incorporate metadata, the proposed method aims to enhance the utility and accuracy of embedding models. The effectiveness of this approach is evaluated on three datasets: Banking77Classification, TwitterSemEval 2015, and Amazon Counter-factual Classification. Results demonstrate significant improvements over the baseline model on the TwitterSemEval 2015 dataset, with the best-performing prompt achieving a score of 85.34 compared to the previous best of 81.52 on the Massive Text Embedding Benchmark (MTEB) Leaderboard. However, performance on the other two datasets was less impressive, highlighting the importance of considering domain-specific characteristics. The findings suggest that LLM-based text enrichment is a promising way to improve embedding performance, particularly in certain domains. Hence, numerous limitations in the process of embedding can be avoided.
Submitted 18 April, 2024;
originally announced April 2024.
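The enrich-then-embed idea in the abstract above can be sketched roughly as follows. This is a minimal illustration only: the prompt wording, the function name `enrich_then_embed`, and the `rewrite` and `embed` callables are all hypothetical stand-ins (for an LLM call and an embedding model respectively), since the abstract does not give the paper's actual prompts or code.

```python
from typing import Callable, List

# Hypothetical prompt template; the paper's actual prompts are not in the abstract.
ENRICH_PROMPT = (
    "Rewrite the following text: fix grammar and spelling, expand abbreviations, "
    "and add brief clarifying context. Return only the rewritten text.\n\n"
    "Text: {text}"
)


def enrich_then_embed(
    texts: List[str],
    rewrite: Callable[[str], str],
    embed: Callable[[str], List[float]],
) -> List[List[float]]:
    """Rewrite each text with an LLM, then embed the rewritten version.

    `rewrite` maps a prompt to the LLM's reply; `embed` maps text to a vector.
    Both are supplied by the caller (assumed interfaces, not a real API).
    """
    vectors = []
    for text in texts:
        prompt = ENRICH_PROMPT.format(text=text)
        enriched = rewrite(prompt).strip()
        # Fall back to the original text if the LLM returns nothing usable.
        vectors.append(embed(enriched or text))
    return vectors
```

Passing the two models in as callables keeps the sketch independent of any particular LLM or embedding provider, so the baseline (embedding the raw text) and the enriched variant can be compared with the same `embed` function.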
-
Direct Experimental Constraints on the Spatial Extent of a Neutrino Wavepacket
Authors:
Joseph Smolsky,
Kyle G Leach,
Ryan Abells,
Pedro Amaro,
Adrien Andoche,
Keith Borbridge,
Connor Bray,
Robin Cantor,
David Diercks,
Spencer Fretwell,
Stephan Friedrich,
Abigail Gillespie,
Mauro Guerra,
Ad Hall,
Cameron N Harris,
Jackson T Harris,
Calvin Hinkle,
Amii Lamm,
Leendert M Hayen,
Paul-Antoine Hervieux,
Geon-Bo Kim,
Inwook Kim,
Annika Lennarz,
Vincenzo Lordi,
Jorge Machado
, et al. (13 additional authors not shown)
Abstract:
Despite their high relative abundance in our Universe, neutrinos are the least understood fundamental particles of nature. They also provide a unique system to study quantum coherence and the wavelike nature of particles in fundamental systems due to their extremely weak interaction probabilities. In fact, the quantum properties of neutrinos emitted in experimentally relevant sources are virtually unknown and the spatial extent of the neutrino wavepacket is only loosely constrained by reactor neutrino oscillation data with a spread of 13 orders of magnitude. Here, we present the first direct limits of this quantity through a new experimental concept to extract the energy width, $σ_{\textrm{N},E}$, of the recoil daughter nucleus emitted in the nuclear electron capture (EC) decay of $^7$Be. The final state in the EC decay process contains a recoiling $^7$Li nucleus and an electron neutrino ($ν_e$) which are entangled at their creation. The $^7$Li energy spectrum is measured to high precision by directly embedding $^7$Be radioisotopes into a high resolution superconducting tunnel junction that is operated as a cryogenic sensor. The lower limit on the spatial uncertainty of the recoil daughter was found to be $σ_{\textrm{N}, x} \geq 6.2$\,pm, which implies the final-state system is localized at a scale more than a thousand times larger than the nucleus itself. From this measurement, the first direct lower limits on the spatial extent of the neutrino wavepacket were extracted using two different theoretical methods. These results have wide-reaching implications in several areas including the nature of spatial localization at sub-atomic scales, interpretation of neutrino physics data, and the potential reach of future large-scale experiments.
Submitted 30 April, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
The Artificial Intelligence Ontology: LLM-assisted construction of AI concept hierarchies
Authors:
Marcin P. Joachimiak,
Mark A. Miller,
J. Harry Caufield,
Ryan Ly,
Nomi L. Harris,
Andrew Tritt,
Christopher J. Mungall,
Kristofer E. Bouchard
Abstract:
The Artificial Intelligence Ontology (AIO) is a systematization of artificial intelligence (AI) concepts, methodologies, and their interrelations. Developed via manual curation, with the additional assistance of large language models (LLMs), AIO aims to address the rapidly evolving landscape of AI by providing a comprehensive framework that encompasses both technical and ethical aspects of AI technologies. The primary audience for AIO includes AI researchers, developers, and educators seeking standardized terminology and concepts within the AI domain. The ontology is structured around six top-level branches: Networks, Layers, Functions, LLMs, Preprocessing, and Bias, each designed to support the modular composition of AI methods and facilitate a deeper understanding of deep learning architectures and ethical considerations in AI.
AIO's development utilized the Ontology Development Kit (ODK) for its creation and maintenance, with its content being dynamically updated through AI-driven curation support. This approach not only ensures the ontology's relevance amidst the fast-paced advancements in AI but also significantly enhances its utility for researchers, developers, and educators by simplifying the integration of new AI concepts and methodologies.
The ontology's utility is demonstrated through the annotation of AI methods data in a catalog of AI research publications and the integration into the BioPortal ontology resource, highlighting its potential for cross-disciplinary research. The AIO ontology is open source and is available on GitHub (https://github.com/berkeleybop/artificial-intelligence-ontology) and BioPortal (https://bioportal.bioontology.org/ontologies/AIO).
Submitted 3 April, 2024;
originally announced April 2024.
-
Using Enriched Category Theory to Construct the Nearest Neighbour Classification Algorithm
Authors:
Matthew Pugh,
Jo Grundy,
Corina Cirstea,
Nick Harris
Abstract:
This paper explores whether Enriched Category Theory could provide the foundation of an alternative approach to Machine Learning. It is the first to construct and motivate a Machine Learning algorithm solely with Enriched Category Theory. To add to the evidence that Category Theory can be used to motivate robust and explainable algorithms, it is shown that a series of reasonable assumptions about a dataset leads to the construction of the Nearest Neighbours Algorithm, in particular as an extension of the original dataset using profunctors in the category of Lawvere metric spaces. This leads to a definition of an Enriched Nearest Neighbours Algorithm, which consequently also produces an enriched form of the Voronoi diagram. This paper is intended to be accessible without any knowledge of Category Theory.
Submitted 27 December, 2023;
originally announced December 2023.
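For orientation, the classical nearest neighbour rule that the abstract above arrives at can be sketched as below. This is the ordinary algorithm, not the paper's categorical construction. The `distance` argument is deliberately not required to be symmetric, since a Lawvere metric space drops the symmetry requirement; measuring from the training point to the query is an assumption made here for illustration.

```python
import math
from typing import Callable, Hashable, List, Sequence, Tuple

Point = Sequence[float]


def euclidean(a: Point, b: Point) -> float:
    """Ordinary (symmetric) Euclidean distance."""
    return math.dist(a, b)


def nearest_neighbour(
    train: List[Tuple[Point, Hashable]],
    query: Point,
    distance: Callable[[Point, Point], float] = euclidean,
) -> Hashable:
    """Return the label of the training point closest to `query`.

    `distance(point, query)` may be asymmetric; only non-negativity is assumed.
    Ties are resolved in favour of the earliest training point.
    """
    if not train:
        raise ValueError("training set must not be empty")
    best_label, best_d = None, math.inf
    for point, label in train:
        d = distance(point, query)
        if d < best_d:
            best_label, best_d = label, d
    return best_label
```

The set of queries assigned to a given training point under this rule is that point's Voronoi cell, which is the object the abstract says is generalised in the enriched setting.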
-
Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI)
Authors:
Sabrina Toro,
Anna V Anagnostopoulos,
Sue Bello,
Kai Blumberg,
Rhiannon Cameron,
Leigh Carmody,
Alexander D Diehl,
Damion Dooley,
William Duncan,
Petra Fey,
Pascale Gaudet,
Nomi L Harris,
Marcin Joachimiak,
Leila Kiani,
Tiago Lubiana,
Monica C Munoz-Torres,
Shawn O'Neil,
David Osumi-Sutherland,
Aleix Puig,
Justin P Reese,
Leonore Reiser,
Sofia Robb,
Troy Ruemping,
James Seager,
Eric Sid
, et al. (5 additional authors not shown)
Abstract:
Background: Ontologies are fundamental components of informatics infrastructure in domains such as biomedical, environmental, and food sciences, representing consensus knowledge in an accurate and computable form. However, their construction and maintenance demand substantial resources and necessitate substantial collaboration between domain experts, curators, and ontology experts. We present Dynamic Retrieval Augmented Generation of Ontologies using AI (DRAGON-AI), an ontology generation method employing Large Language Models (LLMs) and Retrieval Augmented Generation (RAG). DRAGON-AI can generate textual and logical ontology components, drawing from existing knowledge in multiple ontologies and unstructured text sources.
Results: We assessed performance of DRAGON-AI on de novo term construction across ten diverse ontologies, making use of extensive manual evaluation of results. Our method has high precision for relationship generation, but slightly lower precision than logic-based reasoning. Our method is also able to generate definitions deemed acceptable by expert evaluators, but these scored worse than human-authored definitions. Notably, evaluators with the highest level of confidence in a domain were better able to discern flaws in AI-generated definitions. We also demonstrated the ability of DRAGON-AI to incorporate natural language instructions in the form of GitHub issues.
Conclusions: These findings suggest DRAGON-AI's potential to substantially aid the manual ontology construction process. However, our results also underscore the importance of having expert curators and ontology editors drive the ontology generation process.
Submitted 12 June, 2024; v1 submitted 17 December, 2023;
originally announced December 2023.
-
MapperGPT: Large Language Models for Linking and Mapping Entities
Authors:
Nicolas Matentzoglu,
J. Harry Caufield,
Harshad B. Hegde,
Justin T. Reese,
Sierra Moxon,
Hyeongsik Kim,
Nomi L. Harris,
Melissa A Haendel,
Christopher J. Mungall
Abstract:
Aligning terminological resources, including ontologies, controlled vocabularies, taxonomies, and value sets, is a critical part of data integration in many domains such as healthcare, chemistry, and biomedical research. Entity mapping is the process of determining correspondences between entities across these resources, such as gene identifiers, disease concepts, or chemical entity identifiers. Many tools have been developed to compute such mappings based on common structural features and lexical information such as labels and synonyms. Lexical approaches in particular often provide very high recall, but low precision, due to lexical ambiguity. As a consequence, mapping efforts often resort to labor-intensive manual refinement of mappings by a human curator.
Large Language Models (LLMs), such as the ones employed by ChatGPT, have generalizable abilities to perform a wide range of tasks, including question-answering and information extraction. Here we present MapperGPT, an approach that uses LLMs to review and refine mapping relationships as a post-processing step, in concert with existing high-recall methods that are based on lexical and structural heuristics.
We evaluated MapperGPT on a series of alignment tasks from different domains, including anatomy, developmental biology, and renal diseases. We devised a collection of tasks that are designed to be particularly challenging for lexical methods. We show that when used in combination with high-recall methods, MapperGPT can provide a substantial improvement in accuracy, beating state-of-the-art (SOTA) methods such as LogMap.
Submitted 5 October, 2023;
originally announced October 2023.
-
Gene Set Summarization using Large Language Models
Authors:
Marcin P. Joachimiak,
J. Harry Caufield,
Nomi L. Harris,
Hyeongsik Kim,
Christopher J. Mungall
Abstract:
Molecular biologists frequently interpret gene lists derived from high-throughput experiments and computational analysis. This is typically done as a statistical enrichment analysis that measures the over- or under-representation of biological function terms associated with genes or their properties, based on curated assertions from a knowledge base (KB) such as the Gene Ontology (GO). Interpreting gene lists can also be framed as a textual summarization task, enabling the use of Large Language Models (LLMs), potentially utilizing scientific texts directly and avoiding reliance on a KB.
We developed SPINDOCTOR (Structured Prompt Interpolation of Natural Language Descriptions of Controlled Terms for Ontology Reporting), a method that uses GPT models to perform gene set function summarization as a complement to standard enrichment analysis. This method can use different sources of gene functional information: (1) structured text derived from curated ontological KB annotations, (2) ontology-free narrative gene summaries, or (3) direct model retrieval.
We demonstrate that these methods are able to generate plausible and biologically valid summary GO term lists for gene sets. However, GPT-based approaches are unable to deliver reliable scores or p-values and often return terms that are not statistically significant. Crucially, these methods were rarely able to recapitulate the most precise and informative term from standard enrichment, likely due to an inability to generalize and reason using an ontology. Results are highly nondeterministic, with minor variations in prompt resulting in radically different term lists. Our results show that at this point, LLM-based methods are unsuitable as a replacement for standard term enrichment analysis and that manual curation of ontological assertions remains necessary.
Submitted 3 July, 2024; v1 submitted 20 May, 2023;
originally announced May 2023.
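The "standard enrichment analysis" that the abstract above uses as its point of comparison is commonly an over-representation test based on the hypergeometric distribution. A minimal sketch of that baseline follows; the function name and parameter names are illustrative, and the paper's own tooling and any multiple-testing correction are not shown in the abstract and are not reproduced here.

```python
from math import comb


def enrichment_p_value(population: int, annotated: int, sample: int, overlap: int) -> float:
    """One-sided hypergeometric p-value for over-representation of a term.

    population: number of genes in the background set
    annotated:  background genes annotated with the term
    sample:     size of the gene list being interpreted
    overlap:    genes in the list that carry the term

    Returns P(X >= overlap) when `sample` genes are drawn without replacement.
    """
    total = comb(population, sample)
    upper = min(annotated, sample)
    tail = sum(
        comb(annotated, i) * comb(population - annotated, sample - i)
        for i in range(overlap, upper + 1)
    )
    return tail / total
```

A small p-value means the term appears in the gene list more often than chance would predict. This is the kind of score that, according to the abstract, the GPT-based approaches could not deliver reliably.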
-
Structured prompt interrogation and recursive extraction of semantics (SPIRES): A method for populating knowledge bases using zero-shot learning
Authors:
J. Harry Caufield,
Harshad Hegde,
Vincent Emonet,
Nomi L. Harris,
Marcin P. Joachimiak,
Nicolas Matentzoglu,
HyeongSik Kim,
Sierra A. T. Moxon,
Justin T. Reese,
Melissa A. Haendel,
Peter N. Robinson,
Christopher J. Mungall
Abstract:
Creating knowledge bases and ontologies is a time-consuming task that relies on manual curation. AI/NLP approaches can assist expert curators in populating these knowledge bases, but current approaches rely on extensive training data and are not able to populate arbitrarily complex nested knowledge schemas.
Here we present Structured Prompt Interrogation and Recursive Extraction of Semantics (SPIRES), a Knowledge Extraction approach that relies on the ability of Large Language Models (LLMs) to perform zero-shot learning (ZSL) and general-purpose query answering from flexible prompts and return information conforming to a specified schema. Given a detailed, user-defined knowledge schema and an input text, SPIRES recursively performs prompt interrogation against GPT-3+ to obtain a set of responses matching the provided schema. SPIRES uses existing ontologies and vocabularies to provide identifiers for all matched elements.
We present examples of use of SPIRES in different domains, including extraction of food recipes, multi-species cellular signaling pathways, disease treatments, multi-step drug mechanisms, and chemical to disease causation graphs. Current SPIRES accuracy is comparable to the mid-range of existing Relation Extraction (RE) methods, but has the advantage of easy customization, flexibility, and, crucially, the ability to perform new tasks in the absence of any training data. This method supports a general strategy of leveraging the language interpreting capabilities of LLMs to assemble knowledge bases, assisting manual knowledge curation and acquisition while supporting validation with publicly-available databases and ontologies external to the LLM.
SPIRES is available as part of the open source OntoGPT package: https://github.com/monarch-initiative/ontogpt.
Submitted 22 December, 2023; v1 submitted 5 April, 2023;
originally announced April 2023.
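The recursive, schema-driven prompting described in the abstract above might look roughly like the sketch below. It is not the OntoGPT implementation: the nested-dict schema format, the prompt wording, the function name `extract`, and the `ask` callable (standing in for an LLM call) are all assumptions made for illustration, and the grounding of extracted values to ontology identifiers is omitted.

```python
from typing import Callable, Dict, Union

# A schema maps each field name either to a plain-text description (a leaf)
# or to a nested schema (a sub-object). This format is hypothetical.
Schema = Dict[str, Union[str, "Schema"]]


def extract(text: str, schema: Schema, ask: Callable[[str], str]) -> dict:
    """Fill `schema` from `text` by prompting once per field.

    For a nested field, the LLM's answer is treated as a narrower text span
    and the function recurses into the sub-schema on that span.
    """
    result = {}
    for field, spec in schema.items():
        description = spec if isinstance(spec, str) else f"everything about the {field}"
        prompt = (
            f"From the text below, extract {field} ({description}).\n\n"
            f"Text: {text}"
        )
        answer = ask(prompt).strip()
        if isinstance(spec, dict):
            result[field] = extract(answer, spec, ask)
        else:
            result[field] = answer
    return result
```

Because the schema alone drives the prompts, a new extraction task only needs a new schema and no training data, which is the zero-shot property the abstract emphasizes.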
-
KG-Hub -- Building and Exchanging Biological Knowledge Graphs
Authors:
J Harry Caufield,
Tim Putman,
Kevin Schaper,
Deepak R Unni,
Harshad Hegde,
Tiffany J Callahan,
Luca Cappelletti,
Sierra AT Moxon,
Vida Ravanmehr,
Seth Carbon,
Lauren E Chan,
Katherina Cortes,
Kent A Shefchek,
Glass Elsarboukh,
James P Balhoff,
Tommaso Fontana,
Nicolas Matentzoglu,
Richard M Bruskiewich,
Anne E Thessen,
Nomi L Harris,
Monica C Munoz-Torres,
Melissa A Haendel,
Peter N Robinson,
Marcin P Joachimiak,
Christopher J Mungall
, et al. (1 additional authors not shown)
Abstract:
Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of knowledge graphs is lacking. Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of knowledge graphs. Features include a simple, modular extract-transform-load (ETL) pattern for producing graphs compliant with Biolink Model (a high-level data model for standardizing biological data), easy integration of any OBO (Open Biological and Biomedical Ontologies) ontology, cached downloads of upstream data sources, versioned and automatically updated builds with stable URLs, web-browsable storage of KG artifacts on cloud infrastructure, and easy reuse of transformed subgraphs across projects. Current KG-Hub projects span use cases including COVID-19 research, drug repurposing, microbial-environmental interactions, and rare disease research. KG-Hub is equipped with tooling to easily analyze and manipulate knowledge graphs. KG-Hub is also tightly integrated with graph machine learning (ML) tools which allow automated graph machine learning, including node embeddings and training of models for link prediction and node classification.
Submitted 31 January, 2023;
originally announced February 2023.
-
Neural Cell Video Synthesis via Optical-Flow Diffusion
Authors:
Manuel Serna-Aguilera,
Khoa Luu,
Nathaniel Harris,
Min Zou
Abstract:
The biomedical imaging world is notorious for working with small amounts of data, frustrating state-of-the-art efforts in the computer vision and deep learning worlds. With large datasets, it is easier to make progress, as we have seen with natural images. The same holds for microscopy videos of neuron cells moving in a culture. Collecting such videos presents several challenges, as it can be difficult to grow and maintain the culture for days, and it is expensive to acquire the materials and equipment. In this work, we explore how to alleviate this data scarcity problem by synthesizing the videos. We therefore use a recent video diffusion model to synthesize videos of cells from our training dataset. We then analyze the model's strengths and consistent shortcomings to guide us on improving video generation to be as high-quality as possible. To improve on such a task, we propose modifying the denoising function and adding motion information (dense optical flow) so that the model has more context regarding how video frames transition over time and how each pixel changes over time.
Submitted 6 December, 2022;
originally announced December 2022.
-
Single chip photonic deep neural network with accelerated training
Authors:
Saumil Bandyopadhyay,
Alexander Sludds,
Stefan Krastanov,
Ryan Hamerly,
Nicholas Harris,
Darius Bunandar,
Matthew Streshinsky,
Michael Hochberg,
Dirk Englund
Abstract:
As deep neural networks (DNNs) revolutionize machine learning, energy consumption and throughput are emerging as fundamental limitations of CMOS electronics. This has motivated a search for new hardware architectures optimized for artificial intelligence, such as electronic systolic arrays, memristor crossbar arrays, and optical accelerators. Optical systems can perform linear matrix operations at exceptionally high rate and efficiency, motivating recent demonstrations of low latency linear algebra and optical energy consumption below a photon per multiply-accumulate operation. However, demonstrating systems that co-integrate both linear and nonlinear processing units in a single chip remains a central challenge. Here we introduce such a system in a scalable photonic integrated circuit (PIC), enabled by several key advances: (i) high-bandwidth and low-power programmable nonlinear optical function units (NOFUs); (ii) coherent matrix multiplication units (CMXUs); and (iii) in situ training with optical acceleration. We experimentally demonstrate this fully-integrated coherent optical neural network (FICONN) architecture for a 3-layer DNN comprising 12 NOFUs and three CMXUs operating in the telecom C-band. Using in situ training on a vowel classification task, the FICONN achieves 92.7% accuracy on a test set, which is identical to the accuracy obtained on a digital computer with the same number of weights. This work lends experimental evidence to theoretical proposals for in situ training, unlocking orders of magnitude improvements in the throughput of training data. Moreover, the FICONN opens the path to inference at nanosecond latency and femtojoule per operation energy efficiency.
Submitted 2 August, 2022;
originally announced August 2022.
-
Boosting the interpretability of clinical risk scores with intervention predictions
Authors:
Eric Loreaux,
Ke Yu,
Jonas Kemp,
Martin Seneviratne,
Christina Chen,
Subhrajit Roy,
Ivan Protsyuk,
Natalie Harris,
Alexander D'Amour,
Steve Yadlowsky,
Ming-Jun Chen
Abstract:
Machine learning systems show significant promise for forecasting patient adverse events via risk scores. However, these risk scores implicitly encode assumptions about future interventions that the patient is likely to receive, based on the intervention policy present in the training data. Without this important context, predictions from such systems are less interpretable for clinicians. We propose a joint model of intervention policy and adverse event risk as a means to explicitly communicate the model's assumptions about future interventions. We develop such an intervention policy model on MIMIC-III, a real world de-identified ICU dataset, and discuss some use cases that highlight the utility of this approach. We show how combining typical risk scores, such as the likelihood of mortality, with future intervention probability scores leads to more interpretable clinical predictions.
Submitted 6 July, 2022;
originally announced July 2022.
-
Ontology Development Kit: a toolkit for building, maintaining, and standardising biomedical ontologies
Authors:
Nicolas Matentzoglu,
Damien Goutte-Gattat,
Shawn Zheng Kai Tan,
James P. Balhoff,
Seth Carbon,
Anita R. Caron,
William D. Duncan,
Joe E. Flack,
Melissa Haendel,
Nomi L. Harris,
William R Hogan,
Charles Tapley Hoyt,
Rebecca C. Jackson,
HyeongSik Kim,
Huseyin Kir,
Martin Larralde,
Julie A. McMurry,
James A. Overton,
Bjoern Peters,
Clare Pilgrim,
Ray Stefancsik,
Sofia MC Robb,
Sabrina Toro,
Nicole A Vasilevsky,
Ramona Walls
, et al. (2 additional authors not shown)
Abstract:
Similar to managing software packages, managing the ontology life cycle involves multiple complex workflows such as preparing releases, continuous quality control checking, and dependency management. To manage these processes, a diverse set of tools is required, from command line utilities to powerful ontology engineering environments such as ROBOT. Particularly in the biomedical domain, which has developed a set of highly diverse yet inter-dependent ontologies, standardising release practices and metadata, and establishing shared quality standards, are crucial to enable interoperability. The Ontology Development Kit (ODK) provides a set of standardised, customisable, and automatically executable workflows, and packages all required tooling in a single Docker image. In this paper, we provide an overview of how the ODK works, show how it is used in practice, and describe how we envision it driving standardisation efforts in our community.
Submitted 5 July, 2022;
originally announced July 2022.
-
Adaptive Block Floating-Point for Analog Deep Learning Hardware
Authors:
Ayon Basumallik,
Darius Bunandar,
Nicholas Dronen,
Nicholas Harris,
Ludmila Levkova,
Calvin McCarter,
Lakshmi Nair,
David Walter,
David Widemann
Abstract:
Analog mixed-signal (AMS) devices promise faster, more energy-efficient deep neural network (DNN) inference than their digital counterparts. However, recent studies show that DNNs on AMS devices with fixed-point numbers can incur an accuracy penalty because of precision loss. To mitigate this penalty, we present a novel AMS-compatible adaptive block floating-point (ABFP) number representation. We also introduce amplification (or gain) as a method for increasing the accuracy of the number representation without increasing the bit precision of the output. We evaluate the effectiveness of ABFP on the DNNs in the MLPerf datacenter inference benchmark -- realizing less than $1\%$ loss in accuracy compared to FLOAT32. We also propose a novel method of finetuning for AMS devices, Differential Noise Finetuning (DNF), which samples device noise to speed up finetuning compared to conventional Quantization-Aware Training.
Submitted 12 May, 2022;
originally announced May 2022.
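As background for the abstract above, generic block floating-point quantization can be sketched as follows: values are grouped into blocks, each block shares one power-of-two scale chosen from its largest magnitude, and every value keeps only a short signed mantissa. This is plain block floating-point under assumed parameters (`block_size`, `mantissa_bits`), not the paper's ABFP format; its gain mechanism and Differential Noise Finetuning are not reproduced.

```python
import math
from typing import List


def bfp_quantize(values: List[float], block_size: int = 4, mantissa_bits: int = 8) -> List[float]:
    """Quantize `values` block by block and return the dequantized result.

    Each block shares one power-of-two exponent derived from its peak
    magnitude; each value is rounded to a signed integer mantissa.
    """
    out: List[float] = []
    levels = 2 ** (mantissa_bits - 1) - 1  # largest signed mantissa magnitude
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        peak = max(abs(v) for v in block)
        if peak == 0.0:
            out.extend(0.0 for _ in block)
            continue
        # Shared exponent: smallest power of two that is >= the block's peak.
        exponent = math.ceil(math.log2(peak))
        scale = 2.0 ** exponent / levels
        out.extend(round(v / scale) * scale for v in block)
    return out
```

Because the scale follows each block's own peak, a block of small values keeps fine resolution even when another block contains large values. The rounding error per value is at most half the block's scale.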
-
Disability prediction in multiple sclerosis using performance outcome measures and demographic data
Authors:
Subhrajit Roy,
Diana Mincu,
Lev Proleev,
Negar Rostamzadeh,
Chintan Ghate,
Natalie Harris,
Christina Chen,
Jessica Schrouff,
Nenad Tomasev,
Fletcher Lee Hartsell,
Katherine Heller
Abstract:
Literature on machine learning for multiple sclerosis has primarily focused on the use of neuroimaging data such as magnetic resonance imaging and clinical laboratory tests for disease identification. However, studies have shown that these modalities are not consistent with disease activity such as symptoms or disease progression. Furthermore, the cost of collecting data from these modalities is high, leading to scarce evaluations. In this work, we used multi-dimensional, affordable, physical and smartphone-based performance outcome measures (POM) in conjunction with demographic data to predict multiple sclerosis disease progression. We performed a rigorous benchmarking exercise on two datasets and present results across 13 clinically actionable prediction endpoints and 6 machine learning models. To the best of our knowledge, our results are the first to show that it is possible to predict disease progression using POMs and demographic data in the context of both clinical trials and smartphone-based studies by using two datasets. Moreover, we investigate our models to understand the impact of different POMs and demographics on model performance through feature ablation studies. We also show that model performance is similar across different demographic subgroups (based on age and sex). To enable this work, we developed an end-to-end reusable pre-processing and machine learning framework which allows quicker experimentation over disparate MS datasets.
△ Less
Submitted 8 April, 2022;
originally announced April 2022.
-
Biolink Model: A Universal Schema for Knowledge Graphs in Clinical, Biomedical, and Translational Science
Authors:
Deepak R. Unni,
Sierra A. T. Moxon,
Michael Bada,
Matthew Brush,
Richard Bruskiewich,
Paul Clemons,
Vlado Dancik,
Michel Dumontier,
Karamarie Fecho,
Gustavo Glusman,
Jennifer J. Hadlock,
Nomi L. Harris,
Arpita Joshi,
Tim Putman,
Guangrong Qin,
Stephen A. Ramsey,
Kent A. Shefchek,
Harold Solbrig,
Karthik Soman,
Anne T. Thessen,
Melissa A. Haendel,
Chris Bizon,
Christopher J. Mungall,
the Biomedical Data Translator Consortium
Abstract:
Within clinical, biomedical, and translational science, an increasing number of projects are adopting graphs for knowledge representation. Graph-based data models elucidate the interconnectedness between core biomedical concepts, enable data structures to be easily updated, and support intuitive queries, visualizations, and inference algorithms. However, knowledge discovery across these "knowledge graphs" (KGs) has remained difficult. Data set heterogeneity and complexity; the proliferation of ad hoc data formats; poor compliance with guidelines on findability, accessibility, interoperability, and reusability; and, in particular, the lack of a universally-accepted, open-access model for standardization across biomedical KGs has left the task of reconciling data sources to downstream consumers. Biolink Model is an open source data model that can be used to formalize the relationships between data structures in translational science. It incorporates object-oriented classification and graph-oriented features. The core of the model is a set of hierarchical, interconnected classes (or categories) and relationships between them (or predicates), representing biomedical entities such as gene, disease, chemical, anatomical structure, and phenotype. The model provides class and edge attributes and associations that guide how entities should relate to one another. Here, we highlight the need for a standardized data model for KGs, describe Biolink Model, and compare it with other models. We demonstrate the utility of Biolink Model in various initiatives, including the Biomedical Data Translator Consortium and the Monarch Initiative, and show how it has supported easier integration and interoperability of biomedical KGs, bringing together knowledge from multiple sources and helping to realize the goals of translational science.
Submitted 25 March, 2022;
originally announced March 2022.
-
Diagnosing failures of fairness transfer across distribution shift in real-world medical settings
Authors:
Jessica Schrouff,
Natalie Harris,
Oluwasanmi Koyejo,
Ibrahim Alabdulmohsin,
Eva Schnider,
Krista Opsahl-Ong,
Alex Brown,
Subhrajit Roy,
Diana Mincu,
Christina Chen,
Awa Dieng,
Yuan Liu,
Vivek Natarajan,
Alan Karthikesalingam,
Katherine Heller,
Silvia Chiappa,
Alexander D'Amour
Abstract:
Diagnosing and mitigating changes in model fairness under distribution shift is an important component of the safe deployment of machine learning in healthcare settings. Importantly, the success of any mitigation strategy strongly depends on the structure of the shift. Despite this, there has been little discussion of how to empirically assess the structure of a distribution shift that one is encountering in practice. In this work, we adopt a causal framing to motivate conditional independence tests as a key tool for characterizing distribution shifts. Using our approach in two medical applications, we show that this knowledge can help diagnose failures of fairness transfer, including cases where real-world shifts are more complex than is often assumed in the literature. Based on these results, we discuss potential remedies at each step of the machine learning pipeline.
Submitted 10 February, 2023; v1 submitted 2 February, 2022;
originally announced February 2022.
-
A Simple Standard for Sharing Ontological Mappings (SSSOM)
Authors:
Nicolas Matentzoglu,
James P. Balhoff,
Susan M. Bello,
Chris Bizon,
Matthew Brush,
Tiffany J. Callahan,
Christopher G Chute,
William D. Duncan,
Chris T. Evelo,
Davera Gabriel,
John Graybeal,
Alasdair Gray,
Benjamin M. Gyori,
Melissa Haendel,
Henriette Harmse,
Nomi L. Harris,
Ian Harrow,
Harshad Hegde,
Amelia L. Hoyt,
Charles T. Hoyt,
Dazhi Jiao,
Ernesto Jiménez-Ruiz,
Simon Jupp,
Hyeongsik Kim,
Sebastian Koehler, et al. (19 additional authors not shown)
Abstract:
Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Are they associated in some other way? Such relationships between the mapped terms are often not documented, leading to incorrect assumptions and making them hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Also, the lack of descriptions of how mappings were done makes it hard to combine and reconcile mappings, particularly curated and automated ones.
The Simple Standard for Sharing Ontological Mappings (SSSOM) addresses these problems by: 1. Introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit. 2. Defining an easy-to-use table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data standards. 3. Implementing open and community-driven collaborative workflows designed to evolve the standard continuously to address changing requirements and mapping practices. 4. Providing reference tools and software libraries for working with the standard.
In this paper, we present the SSSOM standard, describe several use cases, and survey some existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable, and Reusable (FAIR). The SSSOM specification is at http://w3id.org/sssom/spec.
Submitted 13 December, 2021;
originally announced December 2021.
-
An Electro-Photonic System for Accelerating Deep Neural Networks
Authors:
Cansu Demirkiran,
Furkan Eris,
Gongyu Wang,
Jonathan Elmhurst,
Nick Moore,
Nicholas C. Harris,
Ayon Basumallik,
Vijay Janapa Reddi,
Ajay Joshi,
Darius Bunandar
Abstract:
The number of parameters in deep neural networks (DNNs) is scaling at about 5$\times$ the rate of Moore's Law. To sustain this growth, photonic computing is a promising avenue, as it enables higher throughput in dominant general matrix-matrix multiplication (GEMM) operations in DNNs than their electrical counterpart. However, purely photonic systems face several challenges including lack of photonic memory and accumulation of noise. In this paper, we present an electro-photonic accelerator, ADEPT, which leverages a photonic computing unit for performing GEMM operations, a vectorized digital electronic ASIC for performing non-GEMM operations, and SRAM arrays for storing DNN parameters and activations. In contrast to prior works in photonic DNN accelerators, we adopt a system-level perspective and show that the gains, while large, are tempered relative to prior expectations. Our goal is to encourage architects to explore photonic technology in a more pragmatic way, considering the system as a whole to understand its general applicability in accelerating today's DNNs. Our evaluation shows that ADEPT can provide, on average, 5.73$\times$ higher throughput per Watt compared to the traditional systolic arrays (SAs) in a full-system, and at least 6.8$\times$ and 2.5$\times$ better throughput per Watt, compared to state-of-the-art electronic and photonic accelerators, respectively.
Submitted 16 December, 2022; v1 submitted 2 September, 2021;
originally announced September 2021.
-
Experimental quantum speed-up in reinforcement learning agents
Authors:
Valeria Saggio,
Beate E. Asenbeck,
Arne Hamann,
Teodor Strömberg,
Peter Schiansky,
Vedran Dunjko,
Nicolai Friis,
Nicholas C. Harris,
Michael Hochberg,
Dirk Englund,
Sabine Wölk,
Hans J. Briegel,
Philip Walther
Abstract:
Increasing demand for algorithms that can learn quickly and efficiently has led to a surge of development within the field of artificial intelligence (AI). An important paradigm within AI is reinforcement learning (RL), where agents interact with environments by exchanging signals via a communication channel. Agents can learn by updating their behaviour based on obtained feedback. The crucial question for practical applications is how fast agents can learn to respond correctly. An essential figure of merit is therefore the learning time. While various works have made use of quantum mechanics to speed up the agent's decision-making process, a reduction in learning time has not been demonstrated yet. Here we present an RL experiment where the learning of an agent is boosted by utilizing a quantum communication channel with the environment. We further show that the combination with classical communication enables the evaluation of such an improvement, and additionally allows for optimal control of the learning progress. This novel scenario is therefore demonstrated by considering hybrid agents that alternate between rounds of quantum and classical communication. We implement this learning protocol on a compact and fully tunable integrated nanophotonic processor. The device interfaces with telecom-wavelength photons and features a fast active feedback mechanism, allowing us to demonstrate the agent's systematic quantum advantage in a setup that could be readily integrated within future large-scale quantum communication networks.
Submitted 10 March, 2021;
originally announced March 2021.
-
A Recurrent Ising Machine in a Photonic Integrated Circuit
Authors:
Mihika Prabhu,
Charles Roques-Carmes,
Yichen Shen,
Nicholas Harris,
Li Jing,
Jacques Carolan,
Ryan Hamerly,
Tom Baehr-Jones,
Michael Hochberg,
Vladimir Čeperić,
John D. Joannopoulos,
Dirk R. Englund,
Marin Soljačić
Abstract:
Conventional computing architectures have no known efficient algorithms for combinatorial optimization tasks, which are encountered in fundamental areas and real-world practical problems including logistics, social networks, and cryptography. Physical machines have recently been proposed and implemented as an alternative to conventional exact and heuristic solvers for the Ising problem, one such optimization task that requires finding the ground state spin configuration of an arbitrary Ising graph. However, these physical approaches usually suffer from decreased ground state convergence probability or universality for high edge-density graphs or arbitrary graph weights, respectively. We experimentally demonstrate a proof-of-principle integrated nanophotonic recurrent Ising sampler (INPRIS) capable of converging to the ground state of various 4-spin graphs with high probability. The INPRIS exploits experimental physical noise as a resource to speed up the ground state search. By injecting additional extrinsic noise during the algorithm iterations, the INPRIS explores larger regions of the phase space, thus allowing one to probe noise-dependent physical observables. Since the recurrent photonic transformation that our machine imparts is a fixed function of the graph problem, and could thus be implemented with optoelectronic architectures that enable GHz clock rates (such as passive or non-volatile photonic circuits that do not require reprogramming at each iteration), our work paves a way for orders-of-magnitude speedups in exploring the solution space of combinatorially hard problems.
Submitted 30 September, 2019;
originally announced September 2019.
-
Variational Quantum Unsampling on a Quantum Photonic Processor
Authors:
Jacques Carolan,
Masoud Mohseni,
Jonathan P. Olson,
Mihika Prabhu,
Changchen Chen,
Darius Bunandar,
Nicholas C. Harris,
Franco N. C. Wong,
Michael Hochberg,
Seth Lloyd,
Dirk Englund
Abstract:
Quantum algorithms for Noisy Intermediate-Scale Quantum (NISQ) machines have recently emerged as new promising routes towards demonstrating near-term quantum advantage (or supremacy) over classical systems. In these systems, samples are typically drawn from probability distributions which, under plausible complexity-theoretic conjectures, cannot be efficiently generated classically. Rather than first define a physical system and then determine computational features of the output state, we ask the converse question: given direct access to the quantum state, what features of the generating system can we efficiently learn? In this work we introduce the Variational Quantum Unsampling (VQU) protocol, a nonlinear quantum neural network approach for verification and inference of near-term quantum circuit outputs. In our approach one can variationally train a quantum operation to unravel the action of an unknown unitary on a known input state, essentially learning the inverse of the black-box quantum dynamics. While the principle of our approach is platform independent, its implementation will depend on the unique architecture of a specific quantum processor. Here, we experimentally demonstrate the VQU protocol on a quantum photonic processor. Alongside quantum verification, our protocol has broad applications, including optimal quantum measurement and tomography, quantum sensing and imaging, and ansatz validation.
Submitted 13 May, 2019; v1 submitted 23 April, 2019;
originally announced April 2019.
-
Scalable feedback control of single photon sources for photonic quantum technologies
Authors:
Jacques Carolan,
Uttara Chakraborty,
Nicholas C. Harris,
Mihir Pant,
Tom Baehr-Jones,
Michael Hochberg,
Dirk Englund
Abstract:
Large-scale quantum technologies require exquisite control over many individual quantum systems. Typically, such systems are very sensitive to environmental fluctuations, and diagnosing errors via measurements causes unavoidable perturbations. In this work we present an in situ frequency locking technique that monitors and corrects frequency variations in single photon sources based on microring resonators. By using the same classical laser fields required for photon generation as a probe to diagnose variations in the resonator frequency, our protocol applies feedback control to correct photon frequency errors in parallel to the optical quantum computation without disturbing the physical qubit. We implement our technique on a silicon photonic device and demonstrate sub 1 pm frequency stabilization in the presence of applied environmental noise, corresponding to a fractional frequency drift of <1 % of a photon linewidth. Using these methods we demonstrate feedback controlled quantum state engineering. By distributing a single local oscillator across a single chip or network of chips, our approach enables frequency locking of many single photon sources for large-scale photonic quantum technologies.
Submitted 15 November, 2018;
originally announced November 2018.
-
Genuine Counterfactual Communication with a Nanophotonic Processor
Authors:
I. Alonso Calafell,
T. Strömberg,
D. R. M. Arvidsson-Shukur,
L. A. Rozema,
V. Saggio,
C. Greganti,
N. C. Harris,
M. Prabhu,
J. Carolan,
M. Hochberg,
T. Baehr-Jones,
D. Englund,
C. H. W. Barnes,
P. Walther
Abstract:
In standard communication information is carried by particles or waves. Counterintuitively, in counterfactual communication particles and information can travel in opposite directions. The quantum Zeno effect allows Bob to transmit a message to Alice by encoding information in particles he never interacts with. The first suggested protocol not only required thousands of ideal optical components, but also resulted in a so-called "weak trace" of the particles having travelled from Bob to Alice, calling the scalability and counterfactuality of previous proposals and experiments into question. Here we overcome these challenges, implementing a new protocol in a programmable nanophotonic processor, based on reconfigurable silicon-on-insulator waveguides that operate at telecom wavelengths. This, together with our telecom single-photon source and highly-efficient superconducting nanowire single-photon detectors, provides a versatile and stable platform for a high-fidelity implementation of genuinely trace-free counterfactual communication, allowing us to actively tune the number of steps in the Zeno measurement, and achieve a bit error probability below 1%, with neither post-selection nor a weak trace. Our demonstration shows how our programmable nanophotonic processor could be applied to more complex counterfactual tasks and quantum information protocols.
Submitted 14 August, 2018;
originally announced August 2018.
-
Nonlinear characterisation of a silicon integrated Bragg waveguide filter
Authors:
Micol Previde Massara,
Matteo Menotti,
Nicola Bergamasco,
Nicholas C. Harris,
Tom Baehr-Jones,
Michael Hochberg,
Christophe Galland,
Marco Liscidini,
Matteo Galli,
Daniele Bajoni
Abstract:
Bragg waveguides are promising optical filters for pump suppression in spontaneous Four-Wave Mixing (FWM) photon sources. In this work, we investigate the generation of unwanted photon pairs in the filter itself. We do this by taking advantage of the relation between spontaneous and classical FWM, which allows for the precise characterisation of the nonlinear response of the device. The pair generation rate estimated from the classical measurement is compared with the theoretical value calculated by means of a full quantum model of the filter, which also allows us to investigate the spectral properties of the generated pairs. We find good agreement between theory and experiment, confirming that stimulated FWM is a valuable approach to characterise the nonlinear response of an integrated filter, and that the pairs generated in a Bragg waveguide are not a serious issue for the operation of a fully integrated nonclassical source.
Submitted 15 February, 2018;
originally announced February 2018.
-
Deep Learning with Coherent Nanophotonic Circuits
Authors:
Yichen Shen,
Nicholas C. Harris,
Scott Skirlo,
Mihika Prabhu,
Tom Baehr-Jones,
Michael Hochberg,
Xin Sun,
Shijie Zhao,
Hugo Larochelle,
Dirk Englund,
Marin Soljacic
Abstract:
Artificial Neural Networks are computational network models inspired by signal processing in the brain. These models have dramatically improved the performance of many learning tasks, including speech and object recognition. However, today's computing hardware is inefficient at implementing neural networks, in large part because much of it was designed for von Neumann computing schemes. Significant effort has been made to develop electronic architectures tuned to implement artificial neural networks that improve upon both computational speed and energy efficiency. Here, we propose a new architecture for a fully-optical neural network that, using unique advantages of optics, promises a computational speed enhancement of at least two orders of magnitude over the state-of-the-art and three orders of magnitude in power efficiency for conventional learning tasks. We experimentally demonstrate essential parts of our architecture using a programmable nanophotonic processor.
Submitted 7 October, 2016;
originally announced October 2016.
-
Energy correlations of photon pairs generated by a silicon microring resonator probed by Stimulated Four Wave Mixing
Authors:
Davide Grassani,
Angelica Simbula,
Stefano Pirotta,
Matteo Galli,
Matteo Menotti,
Nicholas C. Harris,
Tom Baehr-Jones,
Michael Hochberg,
Christophe Galland,
Marco Liscidini,
Daniele Bajoni
Abstract:
Compact silicon integrated devices, such as micro-ring resonators, have recently been demonstrated as efficient sources of quantum correlated photon pairs. The mass production of integrated devices demands the implementation of fast and reliable techniques to monitor device performance. In the case of time-energy correlations, this is particularly challenging, as it requires high spectral resolution that is not currently achievable in coincidence measurements. Here we reconstruct the joint spectral density of photon pairs generated by spontaneous four-wave mixing in a silicon ring resonator by studying the corresponding stimulated process, namely stimulated four-wave mixing. We show that this approach, featuring high spectral resolution and short measurement times, allows one to discriminate between nearly-uncorrelated and highly-correlated photon pairs.
Submitted 16 February, 2016;
originally announced February 2016.
-
Nuclear magnetic resonance spectroscopy of single subnanoliter ova
Authors:
Marco Grisi,
Beatrice Volpe,
Roberto Guidetti,
Nicola Harris,
Giovanni Boero
Abstract:
Nuclear magnetic resonance (NMR) spectroscopy is, in principle, a promising candidate to study the intracellular chemistry of single microscopic living entities. However, due to sensitivity limitations, NMR experiments were reported only on very few and relatively large single cells down to a minimum volume of 10 nl. Here we show NMR spectroscopy of single ova at volume scales (0.1 and 0.5 nl) where life development begins for a broad variety of animals, humans included. We demonstrate that the sensitivity achieved by miniaturized inductive NMR probes (few pmol of 1H nuclei in some hours at 7 T) is sufficient to observe chemical heterogeneities among subnanoliter ova of tardigrades. Such sensitivities should allow non-invasive monitoring of variations of concentrated intracellular compounds, such as glutathione, in single mammalian zygotes.
Submitted 20 November, 2015;
originally announced November 2015.
-
Quantum transport simulations in a programmable nanophotonic processor
Authors:
Nicholas C. Harris,
Gregory R. Steinbrecher,
Jacob Mower,
Yoav Lahini,
Mihika Prabhu,
Darius Bunandar,
Changchen Chen,
Franco N. C. Wong,
Tom Baehr-Jones,
Michael Hochberg,
Seth Lloyd,
Dirk Englund
Abstract:
Environmental noise and disorder play critical roles in quantum particle and wave transport in complex media, including solid-state and biological systems. Recent work has predicted that coupling between noisy environments and disordered systems, in which coherent transport has been arrested due to localization effects, could actually enhance transport. Photonic integrated circuits are promising platforms for studying such effects, with a central goal being the development of large systems providing low-loss, high-fidelity control over all parameters of the transport problem. Here, we fully map the role of disorder in quantum transport using a nanophotonic processor consisting of a mesh of 88 generalized beamsplitters programmable on microsecond timescales. Over 64,400 transport experiments, we observe several distinct transport regimes, including environment-assisted quantum transport and the "quantum Goldilocks" regime in strong, statically disordered discrete-time systems. Low loss and high-fidelity programmable transformations make this nanophotonic processor a promising platform for many-boson quantum simulation experiments.
Submitted 28 January, 2020; v1 submitted 13 July, 2015;
originally announced July 2015.
-
The Accessible Lasso Models
Authors:
Amir Sepehri,
Naftali Harris
Abstract:
A new line of research on the lasso exploits the beautiful geometric fact that the lasso fit is the residual from projecting the response vector $y$ onto a certain convex polytope. This geometric picture also allows an exact geometric description of the set of accessible lasso models for a given design matrix, that is, which configurations of the signs of the coefficients it is possible to realize with some choice of $y$. In particular, the accessible lasso models are those that correspond to a face of the convex hull of all the feature vectors together with their negations. This convex hull representation then permits the enumeration and bounding of the number of accessible lasso models, which in turn provides a direct proof of model selection inconsistency when the size of the true model is greater than half the number of observations.
Submitted 9 June, 2016; v1 submitted 12 January, 2015;
originally announced January 2015.
-
Peer assessment enhances student learning
Authors:
Dennis L. Sun,
Naftali Harris,
Guenther Walther,
Michael Baiocchi
Abstract:
Feedback has a powerful influence on learning, but it is also expensive to provide. In large classes, it may even be impossible for instructors to provide individualized feedback. Peer assessment has received attention lately as a way of providing personalized feedback that scales to large classes. Besides these obvious benefits, some researchers have also conjectured that students learn by peer assessing, although no studies have ever conclusively demonstrated this effect. By conducting a randomized controlled trial in an introductory statistics class, we provide evidence that peer assessment causes significant gains in student achievement. The strength of our conclusions depends critically on the careful design of the experiment, which was made possible by a web-based platform that we developed. Hence, our study is also a proof of concept of the high-quality experiments that are possible with online tools.
Submitted 14 October, 2014;
originally announced October 2014.
-
Efficient, Compact and Low Loss Thermo-Optic Phase Shifter in Silicon
Authors:
Nicholas C. Harris,
Yangjin Ma,
Jacob Mower,
Tom Baehr-Jones,
Dirk Englund,
Michael Hochberg,
Christophe Galland
Abstract:
We design a resistive heater optimized for efficient and low-loss optical phase modulation in a silicon-on-insulator (SOI) waveguide and characterize the fabricated devices. Modulation is achieved by flowing current perpendicular to a new ridge waveguide geometry. The resistance profile is engineered using different dopant concentrations to obtain localized heat generation and maximize the overlap between the optical mode and the high temperature regions, while simultaneously minimizing optical loss due to free-carrier absorption. A 61.6 micrometer-long phase shifter was fabricated in a CMOS process with oxide cladding and two metal layers. The device features a phase-shifting efficiency of 24.77 +/- 0.43 mW/pi and a -3 dB modulation bandwidth of 130.0 +/- 5.59 kHz. The insertion loss measured for 21 devices across an 8-inch wafer was only 0.23 +/- 0.13 dB. Considering the prospect of densely integrated photonic circuits, we also quantify the separation necessary to isolate thermo-optic devices in the standard 220 nm SOI platform.
Submitted 14 October, 2014;
originally announced October 2014.
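As a rough illustration of what the reported efficiency of 24.77 mW/pi means in practice, the sketch below converts a target phase shift into heater power. It assumes the phase shift scales linearly with dissipated power, which is a simplifying assumption and not something the abstract states; the function name `heater_power_mw` is hypothetical.

```python
import math

# Reported phase-shifting efficiency from the abstract: heater power per pi of phase shift.
P_PI_MW = 24.77


def heater_power_mw(phase_rad, p_pi_mw=P_PI_MW):
    """Heater power (mW) for a target phase shift, assuming phase is linear in power."""
    if phase_rad < 0:
        raise ValueError("phase shift must be non-negative")
    return p_pi_mw * phase_rad / math.pi


if __name__ == "__main__":
    # Print the power needed for a few common phase targets.
    for label, phase in [("pi/2", math.pi / 2), ("pi", math.pi), ("2*pi", 2 * math.pi)]:
        print(f"{label}: {heater_power_mw(phase):.2f} mW")
```

Under this linear assumption, a full 2*pi of tuning range costs roughly twice the quoted figure, about 49.5 mW per shifter.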
-
An integrated source of spectrally filtered correlated photons for large scale quantum photonic systems
Authors:
Nicholas C. Harris,
Davide Grassani,
Angelica Simbula,
Mihir Pant,
Matteo Galli,
Tom Baehr-Jones,
Michael Hochberg,
Dirk Englund,
Daniele Bajoni,
Christophe Galland
Abstract:
We demonstrate the generation of quantum-correlated photon-pairs combined with the spectral filtering of the pump field by more than 95dB using Bragg reflectors and electrically tunable ring resonators. Moreover, we perform demultiplexing and routing of signal and idler photons after transferring them via a fiber to a second identical chip. Non-classical two-photon temporal correlations with a coincidence-to-accidental ratio of 50 are measured without further off-chip filtering. Our system, fabricated with high yield and reproducibility in a CMOS process, paves the way toward truly large-scale quantum photonic circuits by allowing sources and detectors of single photons to be integrated on the same chip.
Submitted 29 September, 2014;
originally announced September 2014.
-
Towards High-Fidelity Quantum Computation and Simulation on a Programmable Photonic Integrated Circuit
Authors:
Jacob Mower,
Nicholas C. Harris,
Gregory R. Steinbrecher,
Yoav Lahini,
Dirk Englund
Abstract:
We propose and analyze the design of a programmable photonic integrated circuit for high-fidelity quantum computation and simulation. We demonstrate that the reconfigurability of our design allows us to overcome two major impediments to quantum optics on a chip: it removes the need for a full fabrication cycle for each experiment and allows for compensation of fabrication errors using numerical optimization techniques. Under a pessimistic fabrication model for the silicon-on-insulator process, we demonstrate a dramatic fidelity improvement for the linear optics CNOT and CPHASE gates and, showing the scalability of this approach, the iterative phase estimation algorithm built from individually optimized gates. We also propose and simulate a novel experiment that the programmability of our system would enable: a statistically robust study of the evolution of entangled photons in disordered quantum walks. Overall, our results suggest that existing fabrication processes are sufficient to build a quantum photonic processor capable of high fidelity operation.
Submitted 16 December, 2014; v1 submitted 12 June, 2014;
originally announced June 2014.
-
On-Chip Detection of Entangled Photons by Scalable Integration of Single-Photon Detectors
Authors:
Faraz Najafi,
Jacob Mower,
Nicholas Harris,
Francesco Bellei,
Andrew Dane,
Catherine Lee,
Prashanta Kharel,
Francesco Marsili,
Solomon Assefa,
Karl K. Berggren,
Dirk Englund
Abstract:
Photonic integrated circuits (PICs) have emerged as a scalable platform for complex quantum technologies using photonic and atomic systems. A central goal has been to integrate photon-resolving detectors to reduce optical losses, latency, and wiring complexity associated with off-chip detectors. Superconducting nanowire single-photon detectors (SNSPDs) are particularly attractive because of high detection efficiency, sub-50-ps timing jitter, nanosecond-scale reset time, and sensitivity from the visible to the mid-infrared spectrum. However, while single SNSPDs have been incorporated into individual waveguides, the system efficiency of multiple SNSPDs in one photonic circuit has been limited below 0.2% due to low device yield. Here we introduce a micrometer-scale flip-chip process that enables scalable integration of SNSPDs on a range of PICs. Ten low-jitter detectors were integrated on one PIC with 100% device yield. With an average system efficiency beyond 10% for multiple SNSPDs on one PIC, we demonstrate high-fidelity on-chip photon correlation measurements of non-classical light.
Submitted 16 May, 2014;
originally announced May 2014.
-
Broadband on-chip optical non-reciprocity using phase modulators
Authors:
Christophe Galland,
Ran Ding,
Nicholas C Harris,
Tom Baehr-Jones,
Michael Hochberg
Abstract:
Breaking the reciprocity of light propagation in photonic integrated circuits (PIC) - especially in the CMOS-compatible silicon-on-insulator platform - is a topic of intense research. However, a practical solution for monolithic integration of optical isolators and circulators remains elusive. Here, we propose and analyze a new non-reciprocal photonic architecture operating with standard single-mode waveguides (or optical fibers). Our design exploits cascaded phase modulators separated by optical delay lines and suitably driven by time-shifted waveforms. Because it is based on fully balanced interferometers and does not involve resonant structures, our scheme is also intrinsically broadband. Using realistic parameters we calculate an extinction ratio superior to 20 dB and insertion loss below -3 dB.
Submitted 6 May, 2014;
originally announced May 2014.
-
PC algorithm for Gaussian copula graphical models
Authors:
Naftali Harris,
Mathias Drton
Abstract:
The PC algorithm uses conditional independence tests for model selection in graphical modeling with acyclic directed graphs. In Gaussian models, tests of conditional independence are typically based on Pearson correlations, and high-dimensional consistency results have been obtained for the PC algorithm in this setting. We prove that high-dimensional consistency carries over to the broader class of Gaussian copula or nonparanormal models when using rank-based measures of correlation. For graphs with bounded degree, our result is as strong as prior Gaussian results. In simulations, the 'Rank PC' algorithm works as well as the 'Pearson PC' algorithm for normal data and considerably better for non-normal Gaussian copula data, all the while incurring a negligible increase of computation time. Simulations with contaminated data show that rank correlations can also perform better than other robust estimates considered in previous work when the underlying distribution does not belong to the nonparanormal family.
Submitted 1 July, 2012;
originally announced July 2012.
-
A 25 Gb/s Silicon Photonics Platform
Authors:
Tom Baehr-Jones,
Ran Ding,
Ali Ayazi,
Thierry Pinguet,
Matt Streshinsky,
Nick Harris,
Jing Li,
Li He,
Mike Gould,
Yi Zhang,
Andy Eu-Jin Lim,
Tsung-Yang Liow,
Selin Hwee-Gee Teo,
Guo-Qiang Lo,
Michael Hochberg
Abstract:
Silicon has attracted attention as an inexpensive and scalable material system for photonic-electronic, system-on-chip development. For this, a platform with both photodetectors and modulators working at high speeds, with excellent cross-wafer uniformity, is needed. We demonstrate an optical-lithography, wafer-scale photonics platform with 25 Gb/s operation. We also demonstrate modulation with an ultra-low drive voltage of 1 Vpp at 25 Gb/s. We demonstrate attractive cross-wafer uniformity, and provide detailed information about the device geometry. Our platform is available to the community as part of a photonics shuttle service.
Submitted 4 March, 2012;
originally announced March 2012.
-
The Refined Gross-Prasad Conjecture for Unitary Groups
Authors:
R. Neal Harris
Abstract:
Let F be a number field, A_F its ring of adeles, and let π_n and π_{n+1} be irreducible, cuspidal, automorphic representations of SO_n(A_F) and SO_{n+1}(A_F), respectively. In 1991, Benedict Gross and Dipendra Prasad conjectured that the non-vanishing of a certain period integral attached to π_n and π_{n+1} is equivalent to the non-vanishing of L(1/2, π_n × π_{n+1}). More recently, Atsushi Ichino and Tamotsu Ikeda gave a refinement of this conjecture as well as a proof of the first few cases (n = 2, 3). Their conjecture gives an explicit relationship between the aforementioned L-value and period integral. We make a similar conjecture for unitary groups, and prove the first few cases. The first case of the conjecture will be proved using a theorem of Waldspurger, while the second case will use the machinery of the Θ-correspondence.
Submitted 8 September, 2012; v1 submitted 2 January, 2012;
originally announced January 2012.
-
Defects Can Increase the Melting Temperature of DNA-Nanoparticle Assemblies
Authors:
Nolan C. Harris,
Ching-Hwa Kiang
Abstract:
DNA-gold nanoparticle assemblies have shown promise as an alternative technology to DNA microarrays for DNA detection and RNA profiling. Understanding the effect of DNA sequences on the melting temperature of the system is central to developing reliable detection technology. We studied the effects of DNA base-pairing defects, such as mismatches and deletions, on the melting temperature of DNA-nanoparticle assemblies. We found that, contrary to the general assumption that defects lower the melting temperature of DNA, some defects increase the melting temperature of DNA-linked nanoparticle assemblies. The effects of mismatches and deletions were found to depend on the specific base pair, the sequence, and the location of the defects. Our results demonstrate that surface-bound DNA exhibits hybridization behavior different from that of free DNA. Such findings indicate that a detailed understanding of DNA-nanoparticle assembly phase behavior is required for quantitative interpretation of DNA-nanoparticle aggregation.
Submitted 5 July, 2007;
originally announced July 2007.
-
Experimental Free Energy Surface Reconstruction From Single-Molecule Force Spectroscopy Using Jarzynski's Equality
Authors:
Nolan C. Harris,
Yang Song,
Ching-Hwa Kiang
Abstract:
We used the atomic force microscope to manipulate and unfold individual molecules of the titin I27 domain and reconstructed its free energy surface using Jarzynski's equality. The free energy surface for both stretching and unfolding was reconstructed using an exact formula that relates the nonequilibrium work fluctuations to the molecular free energy. In addition, the unfolding free energy barrier, i.e. the activation energy, was directly obtained from experimental data for the first time. This work demonstrates that Jarzynski's equality can be used to analyze nonequilibrium single-molecule experiments, and to obtain the free energy surfaces for molecular systems, including interactions for which only nonequilibrium work can be measured.
Submitted 3 July, 2007;
originally announced July 2007.
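The "exact formula" referred to in the abstract is Jarzynski's equality, exp(-ΔF / kT) = <exp(-W / kT)>, where the average runs over repeated nonequilibrium pulling trajectories. The sketch below shows the basic estimator for a single free energy difference from a list of measured work values. It is a minimal illustration, not the authors' analysis pipeline (which reconstructs a full free energy surface); the function name `jarzynski_free_energy` and the units are assumptions.

```python
import math


def jarzynski_free_energy(work_values, kT=1.0):
    """Estimate dF = -kT * ln(mean(exp(-W / kT))) from nonequilibrium work samples.

    work_values and kT must share the same energy units.
    """
    if not work_values:
        raise ValueError("need at least one work value")
    # Factor out the smallest work value so the exponentials cannot overflow.
    w_min = min(work_values)
    total = sum(math.exp(-(w - w_min) / kT) for w in work_values)
    return w_min - kT * math.log(total / len(work_values))
```

The exponential average is dominated by the rare low-work trajectories, so the estimate never exceeds the mean work and converges slowly when the work distribution is broad compared with kT.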
-
Disorder in DNA-Linked Gold Nanoparticle Assemblies
Authors:
Nolan C. Harris,
Ching-Hwa Kiang
Abstract:
We report experimental observations on the effect of disorder on the phase behavior of DNA-linked nanoparticle assemblies. Variation in DNA linker lengths results in different melting temperatures of the DNA-linked nanoparticle assemblies. We observed an unusual trend of a non-monotonic "zigzag" pattern in the melting temperature as a function of DNA linker length. Linker DNA resulting in unequal DNA duplex lengths introduces disorder and lowers the melting temperature of the nanoparticle system. Comparison with free DNA thermodynamics shows that such an anomalous zigzag pattern does not exist for free DNA duplex melting, which suggests that the disorder introduced by unequal DNA duplex lengths results in this unusual collective behavior of DNA-linked nanoparticle assemblies.
Submitted 2 July, 2005;
originally announced July 2005.
-
The Reversible Phase Transition of DNA-Linked Colloidal Gold Assemblies
Authors:
Young Sun,
Nolan C. Harris,
Ching-Hwa Kiang
Abstract:
We present direct evidence for a reversible phase transition of DNA-linked colloidal gold assemblies. Transmission electron microscopy and optical absorption spectroscopy are used to monitor the colloidal gold phase transition, whose behavior is dominated by DNA interactions. We use single-stranded DNA-capped colloidal gold that is linked by complementary linker DNA to form the assemblies. We found that, compared to free DNA, a sharp melting transition is observed for the DNA-linked colloidal gold assemblies. The structure of the assemblies is non-crystalline, much like a gel phase, consistent with theoretical predictions. Optical spectra and melting curves provide additional evidence of gelation of the colloidal system. The phase transition and separation are examples of percolation in a dilute solvent.
Submitted 21 April, 2005; v1 submitted 8 April, 2005;
originally announced April 2005.
-
Melting Transition of Directly-Linked Gold Nanoparticle DNA Assembly
Authors:
Y. Sun,
N. C. Harris,
C. -H. Kiang
Abstract:
DNA melting and hybridization is a fundamental biological process as well as a crucial step in many modern biotechnology applications. DNA confined on surfaces exhibits different behavior from that in free solutions. The system of DNA-capped gold nanoparticles exhibits unique phase transitions and represents a new class of complex fluids. Depending on the sequence of the DNA, particles can be linked to each other through direct complementary DNA sequences or via a "linker" DNA whose sequence is complementary to the sequence attached to the gold nanoparticles. We observed different melting transitions for these two distinct systems.
Submitted 10 March, 2005;
originally announced March 2005.
-
The chemical history of $^{14}{\rm C}$ in deep oilfields
Authors:
G. Bonvicini,
N. Harris,
V. Paolone
Abstract:
14C is an overwhelming background in low-background underground experiments, to the point where the all-important (pp) neutrinos from the Sun cannot be observed in carbon-containing experiments. This paper shows that 14C purity can be improved by four orders of magnitude by a careful selection of the gas field. Two large reduction factors are at work: the low chemical affinity of methane to single carbon, and the migration of natural gas away from nitrogen-bearing kerogen as the oilfield matures.
Submitted 8 August, 2003; v1 submitted 8 August, 2003;
originally announced August 2003.