Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.

Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

arXiv:2312.11805 [pdf, other]

Gemini: A Family of Highly Capable Multimodal Models

Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.

Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2207.04901 [pdf, other]

Exploring Length Generalization in Large Language Models

Authors: Cem Anil, Yuhuai Wu, Anders Andreassen, Aitor Lewkowycz, Vedant Misra, Vinay Ramasesh, Ambrose Slone, Guy Gur-Ari, Ethan Dyer, Behnam Neyshabur

Abstract: The ability to extrapolate from short problem instances to longer ones is an important form of out-of-distribution generalization in reasoning tasks, and is crucial when learning from datasets where longer problem instances are rare. These include theorem proving, solving quantitative mathematics problems, and reading/summarizing novels. In this paper, we run careful empirical studies exploring the length generalization capabilities of transformer-based language models. We first establish that naively finetuning transformers on length generalization tasks shows significant generalization deficiencies independent of model scale. We then show that combining pretrained large language models' in-context learning abilities with scratchpad prompting (asking the model to output solution steps before producing an answer) results in a dramatic improvement in length generalization. We run careful failure analyses on each of the learning modalities and identify common sources of mistakes that highlight opportunities in equipping language models with the ability to generalize to longer problems.

Submitted 14 November, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

arXiv:2206.14858 [pdf, other]

Solving Quantitative Reasoning Problems with Language Models

Authors: Aitor Lewkowycz, Anders Andreassen, David Dohan, Ethan Dyer, Henryk Michalewski, Vinay Ramasesh, Ambrose Slone, Cem Anil, Imanol Schlag, Theo Gutman-Solo, Yuhuai Wu, Behnam Neyshabur, Guy Gur-Ari, Vedant Misra

Abstract: Language models have achieved remarkable performance on a wide range of tasks that require natural language understanding. Nevertheless, state-of-the-art models have generally struggled with tasks that require quantitative reasoning, such as solving mathematics, science, and engineering problems at the college level. To help close this gap, we introduce Minerva, a large language model pretrained on general natural language data and further trained on technical content. The model achieves state-of-the-art performance on technical benchmarks without the use of external tools. We also evaluate our model on over two hundred undergraduate-level problems in physics, biology, chemistry, economics, and other sciences that require quantitative reasoning, and find that the model can correctly answer nearly a third of them.

Submitted 30 June, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

Comments: 12 pages, 5 figures + references and appendices

arXiv:2206.04615 [pdf, other]

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 450 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting.

Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

arXiv:2110.15253 [pdf, other]

Understanding How Encoder-Decoder Architectures Attend

Authors: Kyle Aitken, Vinay V Ramasesh, Yuan Cao, Niru Maheswaranathan

Abstract: Encoder-decoder networks with attention have proven to be a powerful way to solve many sequence-to-sequence tasks. In these networks, attention aligns encoder and decoder states and is often used for visualizing network behavior. However, the mechanisms used by networks to generate appropriate attention matrices are still mysterious. Moreover, how these mechanisms vary depending on the particular architecture used for the encoder and decoder (recurrent, feed-forward, etc.) is also not well understood. In this work, we investigate how encoder-decoder networks solve different sequence-to-sequence tasks. We introduce a way of decomposing hidden states over a sequence into temporal (independent of input) and input-driven (independent of sequence position) components. This reveals how attention matrices are formed: depending on the task requirements, networks rely more heavily on either the temporal or input-driven components. These findings hold across both recurrent and feed-forward architectures despite their differences in forming the temporal components. Overall, our results provide new insight into the inner workings of attention-based encoder-decoder networks.

Submitted 28 October, 2021; originally announced October 2021.

Comments: 10+14 pages, 16 figures. NeurIPS 2021

arXiv:2010.15114 [pdf, other]

The geometry of integration in text classification RNNs

Authors: Kyle Aitken, Vinay V. Ramasesh, Ankush Garg, Yuan Cao, David Sussillo, Niru Maheswaranathan

Abstract: Despite the widespread application of recurrent neural networks (RNNs) across a variety of tasks, a unified understanding of how RNNs solve these tasks remains elusive. In particular, it is unclear what dynamical patterns arise in trained RNNs, and how those patterns depend on the training dataset or task. This work addresses these questions in the context of a specific natural language processing task: text classification. Using tools from dynamical systems analysis, we study recurrent networks trained on a battery of both natural and synthetic text classification tasks. We find the dynamics of these trained RNNs to be both interpretable and low-dimensional. Specifically, across architectures and datasets, RNNs accumulate evidence for each class as they process the text, using a low-dimensional attractor manifold as the underlying mechanism. Moreover, the dimensionality and geometry of the attractor manifold are determined by the structure of the training dataset; in particular, we describe how simple word-count statistics computed on the training dataset can be used to predict these properties. Our observations span multiple architectures and datasets, reflecting a common mechanism RNNs employ to perform text classification. To the degree that integration of evidence towards a decision is a common computational primitive, this work lays the foundation for using dynamical systems techniques to study the inner workings of RNNs.

Submitted 3 June, 2022; v1 submitted 28 October, 2020; originally announced October 2020.

Comments: 9+19 pages, 30 figures; v2: smaller file size

arXiv:2008.09134 [pdf, other]

doi 10.1103/PhysRevLett.126.210504

Qutrit randomized benchmarking

Authors: A. Morvan, V. V. Ramasesh, M. S. Blok, J. M. Kreikebaum, K. O'Brien, L. Chen, B. K. Mitchell, R. K. Naik, D. I. Santiago, I. Siddiqi

Abstract: Ternary quantum processors offer significant computational advantages over conventional qubit technologies, leveraging the encoding and processing of quantum information in qutrits (three-level systems). To evaluate and compare the performance of such emerging quantum hardware it is essential to have robust benchmarking methods suitable for a higher-dimensional Hilbert space. We demonstrate extensions of industry standard Randomized Benchmarking (RB) protocols, developed and used extensively for qubits, suitable for ternary quantum logic. Using a superconducting five-qutrit processor, we find a single-qutrit gate infidelity as low as $2.38 \times 10^{-3}$. Through interleaved RB, we find that this qutrit gate error is largely limited by the native (qubit-like) gate fidelity, and employ simultaneous RB to fully characterize cross-talk errors. Finally, we apply cycle benchmarking to a two-qutrit CSUM gate and obtain a two-qutrit process fidelity of $0.82$. Our results demonstrate an RB-based tool to characterize the overall performance of a qutrit processor, and a general approach to diagnose control errors in future qudit hardware.

Submitted 20 August, 2020; originally announced August 2020.

Comments: 6 pages (+ 2 pages supplement), 5 figures

Journal ref: Phys. Rev. Lett. 126, 210504 (2021)

arXiv:2007.07400 [pdf, other]

Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics

Authors: Vinay V. Ramasesh, Ethan Dyer, Maithra Raghu

Abstract: A central challenge in developing versatile machine learning systems is catastrophic forgetting: a model trained on tasks in sequence will suffer significant performance drops on earlier tasks. Despite the ubiquity of catastrophic forgetting, there is limited understanding of the underlying process and its causes. In this paper, we address this important knowledge gap, investigating how forgetting affects representations in neural network models. Through representational analysis techniques, we find that deeper layers are disproportionately the source of forgetting. Supporting this, a study of methods to mitigate forgetting illustrates that they act to stabilize deeper layers. These insights enable the development of an analytic argument and empirical picture relating the degree of forgetting to representational similarity between tasks. Consistent with this picture, we observe maximal forgetting occurs for task sequences with intermediate similarity. We perform empirical studies on the standard split CIFAR-10 setup and also introduce a novel CIFAR-100 based task approximating realistic input distribution shift.

Submitted 14 July, 2020; originally announced July 2020.

arXiv:2003.03307 [pdf, other]

doi 10.1103/PhysRevX.11.021010

Quantum Information Scrambling in a Superconducting Qutrit Processor

Authors: M. S. Blok, V. V. Ramasesh, T. Schuster, K. O'Brien, J. M. Kreikebaum, D. Dahlen, A. Morvan, B. Yoshida, N. Y. Yao, I. Siddiqi

Abstract: The theory of quantum information provides a common language which links disciplines ranging from cosmology to condensed-matter physics. For example, the delocalization of quantum information in strongly-interacting many-body systems, known as quantum information scrambling, has recently begun to unite our understanding of black hole dynamics, transport in exotic non-Fermi liquids, and many-body analogs of quantum chaos. To date, verified experimental implementations of scrambling have dealt only with systems comprised of two-level qubits. Higher-dimensional quantum systems, however, may exhibit different scrambling modalities and are predicted to saturate conjectured speed limits on the rate of quantum information scrambling. We take the first steps toward accessing such phenomena, by realizing a quantum processor based on superconducting qutrits (three-level quantum systems). We implement two-qutrit scrambling operations and embed them in a five-qutrit teleportation algorithm to directly measure the associated out-of-time-ordered correlation functions. Measured teleportation fidelities, $F_{\mathrm{avg}} = 0.568 \pm 0.001$, confirm the occurrence of scrambling even in the presence of experimental imperfections. Our teleportation algorithm, which connects to recent proposals for studying traversable wormholes in the laboratory, demonstrates how quantum information processing technology based on higher dimensional systems can exploit a larger and more connected state space to achieve the resource efficient encoding of complex quantum circuits.

Submitted 10 February, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

Journal ref: Phys. Rev. X 11, 021010 (2021)

arXiv:1710.02875 [pdf, other]

doi 10.22331/q-2018-05-28-69

Scattering into one-dimensional waveguides from a coherently-driven quantum-optical system

Authors: Kevin A. Fischer, Rahul Trivedi, Vinay Ramasesh, Irfan Siddiqi, Jelena Vučković

Abstract: We develop a new computational tool and framework for characterizing the scattering of photons by energy-nonconserving Hamiltonians into unidirectional (chiral) waveguides, for example, with coherent pulsed excitation. The temporal waveguide modes are a natural basis for characterizing scattering in quantum optics, and afford a powerful technique based on a coarse discretization of time. This overcomes limitations imposed by singularities in the waveguide-system coupling. Moreover, the integrated discretized equations can be faithfully converted to a continuous-time result by taking the appropriate limit. This approach provides a complete solution to the scattered photon field in the waveguide, and can also be used to track system-waveguide entanglement during evolution. We further develop a direct connection between quantum measurement theory and evolution of the scattered field, demonstrating the correspondence between quantum trajectories and the scattered photon state. Our method is most applicable when the number of photons scattered is known to be small, i.e. for a single-photon or photon-pair source. We illustrate two examples: analytical solutions for short laser pulses scattering off a two-level system and numerically exact solutions for short laser pulses scattering off a spontaneous parametric downconversion (SPDC) or spontaneous four-wave mixing (SFWM) source. Finally, we note that our technique can easily be extended to systems with multiple ground states and generalized scattering problems with both finite photon number input and coherent state drive, potentially enhancing the understanding of, e.g., light-matter entanglement and photon phase gates.

Submitted 19 May, 2018; v1 submitted 8 October, 2017; originally announced October 2017.

Comments: Numerical package in collaboration with Ben Bartlett (Stanford University), implemented in QuTiP: The Quantum Toolbox in Python, Quantum 2018

Journal ref: Quantum 2, 69 (2018)

arXiv:1707.06408 [pdf, other]

doi 10.1103/PhysRevX.8.011021

Robust determination of molecular spectra on a quantum processor

Authors: James I. Colless, Vinay V. Ramasesh, Dar Dahlen, Machiel S. Blok, Jarrod R. McClean, Jonathan Carter, Wibe A. de Jong, Irfan Siddiqi

Abstract: Harnessing the full power of nascent quantum processors requires the efficient management of a limited number of quantum bits with finite lifetime. Hybrid algorithms leveraging classical resources have demonstrated promising initial results in the efficient calculation of Hamiltonian ground states--an important eigenvalue problem in the physical sciences that is often classically intractable. In these protocols, a Hamiltonian is parsed and evaluated term-wise with a shallow quantum circuit, and the resulting energy minimized using classical resources. This reduces the number of consecutive logical operations that must be performed on the quantum hardware before the onset of decoherence. We demonstrate a complete implementation of the Variational Quantum Eigensolver (VQE), augmented with a novel Quantum Subspace Expansion, to calculate the complete energy spectrum of the H2 molecule with near chemical accuracy. The QSE also enables the mitigation of incoherent errors, potentially allowing the implementation of larger-scale algorithms without complex quantum error correction techniques.

Submitted 20 July, 2017; originally announced July 2017.

Journal ref: Phys. Rev. X 8, 011021 (2018)

arXiv:1610.03069 [pdf, other]

doi 10.1103/PhysRevX.7.031023

Observing Topological Invariants Using Quantum Walk in Superconducting Circuits

Authors: Emmanuel Flurin, Vinay V. Ramasesh, Shay Hacohen-Gourgy, Leigh S. Martin, Norman Y. Yao, Irfan Siddiqi

Abstract: The direct measurement of topological invariants in both engineered and naturally occurring quantum materials is a key step in classifying quantum phases of matter. Here we motivate a toolbox based on time-dependent quantum walks as a method to digitally simulate single-particle topological band structures. Using a superconducting qubit dispersively coupled to a microwave cavity, we implement two classes of split-step quantum walks and directly measure the topological invariant (winding number) associated with each. The measurement relies upon interference between two components of a cavity Schrödinger cat state and highlights a novel refocusing technique which allows for the direct implementation of a digital version of Bloch oscillations. Our scheme can readily be extended to higher dimensions, whereby quantum walk-based simulations can probe topological phases ranging from the quantum spin Hall effect to the Hopf insulator.

Submitted 10 October, 2016; originally announced October 2016.

Comments: 5 pages, 4 figures

Journal ref: Phys. Rev. X 7, 031023 (2017)

arXiv:1609.09504 [pdf, other]

doi 10.1103/PhysRevLett.118.130501

Direct Probe of Topological Invariants Using Bloch Oscillating Quantum Walks

Authors: Vinay V. Ramasesh, Emmanuel Flurin, Mark S. Rudner, Irfan Siddiqi, Norman Y. Yao

Abstract: The topology of a single-particle band structure plays a fundamental role in understanding a multitude of physical phenomena. Motivated by the connection between quantum walks and such topological band structures, we demonstrate that a simple time-dependent, Bloch-oscillating quantum walk enables the direct measurement of topological invariants. We consider two classes of one-dimensional quantum walks and connect the global phase imprinted on the walker with its refocusing behavior. By disentangling the dynamical and geometric contributions to this phase we describe a general strategy to measure the topological invariant in these quantum walks. As an example, we propose an experimental protocol in a circuit QED architecture where a superconducting transmon qubit plays the role of the coin, while the quantum walk takes place in the phase space of a cavity.

Submitted 29 September, 2016; originally announced September 2016.

Comments: Main text: 6 pages, 4 figures; Supplement: 4 pages, 0 figures

Journal ref: Phys. Rev. Lett. 118, 130501 (2017)

arXiv:1608.06652 [pdf, other]

doi 10.1038/nature19762

Dynamics of simultaneously measured non-commuting observables

Authors: Shay Hacohen-Gourgy, Leigh S. Martin, Emmanuel Flurin, Vinay V. Ramasesh, K. Birgitta Whaley, Irfan Siddiqi

Abstract: In quantum mechanics, measurements cause wavefunction collapse that yields precise outcomes, whereas for non-commuting observables such as position and momentum, Heisenberg's uncertainty principle limits the intrinsic precision of a state. Although theoretical work has demonstrated the possibility to perform simultaneous non-commuting measurements and has revealed the limits on measurement outcomes, only recently has the dynamics of the quantum state been discussed. To realize this unexplored regime, we simultaneously apply two continuous quantum non-demolition probes of non-commuting observables to a superconducting qubit. We implement multiple readout channels by coupling the qubit to multiple modes of a cavity. To control the measurement observables, we implement a 'single quadrature' measurement by driving the qubit and applying cavity sidebands with a relative phase that sets the observable. Here, we show that the uncertainty principle governs the dynamics of the wavefunction by enforcing a lower bound on the measurement-induced disturbance. Consequently, as we transition from measuring identical to measuring non-commuting observables, the dynamics make a smooth transition from standard wavefunction collapse to persistent diffusion. Although the evolution of the state differs from that of a conventional measurement, information about both observables is extracted by keeping track of the time ordering of the measurement record, enabling quantum state tomography without alternating measurements. Our work creates novel capabilities for quantum control, including rapid state purification, adaptive measurement, measurement-based state steering and continuous quantum error correction. As physical systems often interact continuously with their environment via non-commuting degrees of freedom, our work offers a way to study how notions of contemporary quantum foundations arise in such settings.

Submitted 25 October, 2016; v1 submitted 23 August, 2016; originally announced August 2016.

Journal ref: Nature (2016)

arXiv:1506.05837 [pdf, other]

doi 10.1103/PhysRevLett.115.240501

Cooling and Autonomous Feedback in a Bose-Hubbard chain with Attractive Interactions

Authors: Shay Hacohen-Gourgy, Vinay V. Ramasesh, Claudia De Grandi, Irfan Siddiqi, Steve M. Girvin

Abstract: We engineer a quantum bath that enables entropy and energy exchange with a one-dimensional Bose-Hubbard lattice with attractive on-site interactions. We implement this in an array of three superconducting transmon qubits coupled to a single cavity mode; the transmons represent lattice sites and their excitation quanta embody bosonic particles. Our cooling protocol preserves particle number--realizing a canonical ensemble--and also affords the efficient preparation of dark states which, due to symmetry, cannot be prepared via coherent drives on the cavity. Furthermore, by applying continuous microwave radiation, we also realize autonomous feedback to indefinitely stabilize particular eigenstates of the array.

Submitted 15 December, 2015; v1 submitted 18 June, 2015; originally announced June 2015.

Comments: 5 pages paper, 21 pages supplementary

Journal ref: Phys. Rev. Lett. 115, 240501 (2015)

arXiv:1503.02648 [pdf, other]

doi 10.1103/PhysRevLett.114.193001

A Quantum Gas Microscope for Fermionic Atoms

Authors: Lawrence W. Cheuk, Matthew A. Nichols, Melih Okan, Thomas Gersdorf, Vinay V. Ramasesh, Waseem S. Bakr, Thomas Lompe, Martin W. Zwierlein

Abstract: Strongly interacting fermions define the properties of complex matter at all densities, from atomic nuclei to modern solid state materials and neutron stars. Ultracold atomic Fermi gases have emerged as a pristine platform for the study of many-fermion systems. Here we realize a quantum gas microscope for fermionic $^{40}$K atoms trapped in an optical lattice, which allows one to probe strongly correlated fermions at the single atom level. We combine 3D Raman sideband cooling with high-resolution optics to simultaneously cool and image individual atoms with single lattice site resolution at a detection fidelity above $95\%$. The imaging process leaves each atom predominantly in the 3D ground state of its lattice site, inviting the implementation of a Maxwell's demon to assemble low-entropy many-body states. Single site resolved imaging of fermions enables the direct observation of magnetic order, time resolved measurements of the spread of particle correlations, and the detection of many-fermion entanglement.

Submitted 10 March, 2015; v1 submitted 9 March, 2015; originally announced March 2015.

Journal ref: Phys. Rev. Lett. 114, 193001 (2015)

Showing 1–17 of 17 results for author: Ramasesh, V