Search | arXiv e-print repository

CodeRAG-Bench: Can Retrieval Augment Code Generation?

Authors: Zora Zhiruo Wang, Akari Asai, Xinyan Velocity Yu, Frank F. Xu, Yiqing Xie, Graham Neubig, Daniel Fried

Abstract: While language models (LMs) have proven remarkably adept at generating code, many programs are challenging for LMs to generate using their parametric knowledge alone. Providing external contexts such as library documentation can facilitate generating accurate and functional code. Despite the success of retrieval-augmented generation (RAG) in various text-oriented tasks, its potential for improving code generation remains under-explored. In this work, we conduct a systematic, large-scale analysis by asking: in what scenarios can retrieval benefit code generation models? and what challenges remain? We first curate a comprehensive evaluation benchmark, CodeRAG-Bench, encompassing three categories of code generation tasks, including basic programming, open-domain, and repository-level problems. We aggregate documents from five sources for models to retrieve contexts: competition solutions, online tutorials, library documentation, StackOverflow posts, and GitHub repositories. We examine top-performing models on CodeRAG-Bench by providing contexts retrieved from one or multiple sources. While notable gains are made in final code generation by retrieving high-quality contexts across various settings, our analysis reveals room for improvement -- current retrievers still struggle to fetch useful contexts especially with limited lexical overlap, and generators fail to improve with limited context lengths or abilities to integrate additional contexts. We hope CodeRAG-Bench serves as an effective testbed to encourage further development of advanced code-oriented RAG methods.

Submitted 20 June, 2024; originally announced June 2024.

arXiv:2312.17710 [pdf, other]

Principled Gradient-based Markov Chain Monte Carlo for Text Generation

Authors: Li Du, Afra Amini, Lucas Torroba Hennigen, Xinyan Velocity Yu, Jason Eisner, Holden Lee, Ryan Cotterell

Abstract: Recent papers have demonstrated the possibility of energy-based text generation by adapting gradient-based sampling algorithms, a paradigm of MCMC algorithms that promises fast convergence. However, as we show in this paper, previous attempts on this approach to text generation all fail to sample correctly from the target language model distributions. To address this limitation, we consider the problem of designing text samplers that are faithful, meaning that they have the target text distribution as its limiting distribution. We propose several faithful gradient-based sampling algorithms to sample from the target energy-based text distribution correctly, and study their theoretical properties. Through experiments on various forms of text generation, we demonstrate that faithful samplers are able to generate more fluent text while adhering to the control objectives better.

Submitted 29 December, 2023; originally announced December 2023.

Comments: Preprint

arXiv:2312.09733 [pdf, other]

Quantum-centric Supercomputing for Materials Science: A Perspective on Challenges and Future Directions

Authors: Yuri Alexeev, Maximilian Amsler, Paul Baity, Marco Antonio Barroca, Sanzio Bassini, Torey Battelle, Daan Camps, David Casanova, Young jai Choi, Frederic T. Chong, Charles Chung, Chris Codella, Antonio D. Corcoles, James Cruise, Alberto Di Meglio, Jonathan Dubois, Ivan Duran, Thomas Eckl, Sophia Economou, Stephan Eidenbenz, Bruce Elmegreen, Clyde Fare, Ismael Faro, Cristina Sanz Fernández, Rodrigo Neumann Barros Ferreira, et al. (102 additional authors not shown)

Abstract: Computational models are an essential tool for the design, characterization, and discovery of novel materials. Hard computational tasks in materials science stretch the limits of existing high-performance supercomputing centers, consuming much of their simulation, analysis, and data resources. Quantum computing, on the other hand, is an emerging technology with the potential to accelerate many of the computational tasks needed for materials science. In order to do that, the quantum technology must interact with conventional high-performance computing in several ways: approximate results validation, identification of hard problems, and synergies in quantum-centric supercomputing. In this paper, we provide a perspective on how quantum-centric supercomputing can help address critical computational problems in materials science, the challenges to face in order to solve representative use cases, and new suggested directions.

Submitted 14 December, 2023; originally announced December 2023.

Comments: 60 pages, 14 figures; comments welcome

arXiv:2311.09615 [pdf, other]

On Retrieval Augmentation and the Limitations of Language Model Training

Authors: Ting-Rui Chiang, Xinyan Velocity Yu, Joshua Robinson, Ollie Liu, Isabelle Lee, Dani Yogatama

Abstract: Augmenting a language model (LM) with $k$-nearest neighbors ($k$NN) retrieval on its training data alone can decrease its perplexity, though the underlying reasons for this remain elusive. In this work, we rule out one previously posited possibility -- the "softmax bottleneck." We then create a new dataset to evaluate LM generalization ability in the setting where training data contains additional information that is not causally relevant. This task is challenging even for GPT-3.5 Turbo. We show that, for both GPT-2 and Mistral 7B, $k$NN retrieval augmentation consistently improves performance in this setting. Finally, to make $k$NN retrieval more accessible, we propose using a multi-layer perceptron model that maps datastore keys to values as a drop-in replacement for traditional retrieval. This reduces storage costs by over 25x.

Submitted 2 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

Comments: Accepted to NAACL 2024

arXiv:2309.03513 [pdf, other]

doi 10.1021/acs.jctc.3c00986

Excited state properties of point defects in semiconductors and insulators investigated with time-dependent density functional theory

Authors: Yu **, Victor Wen-zhe Yu, Marco Govoni, Andrew C Xu, Giulia Galli

Abstract: We present a formulation of spin-conserving and spin-flip, hybrid time-dependent density functional theory (TDDFT), including the calculation of analytical forces, which allows for efficient calculations of excited state properties of solid-state systems with hundreds to thousands of atoms. We discuss an implementation on both GPU and CPU based architectures, along with several acceleration techniques. We then apply our formulation to the study of several point defects in semiconductors and insulators, specifically the negatively charged nitrogen-vacancy and neutral silicon-vacancy centers in diamond, the neutral divacancy center in 4H silicon carbide, and the neutral oxygen-vacancy center in magnesium oxide. Our results highlight the importance of taking into account structural relaxations in excited states, in order to interpret and predict optical absorption and emission mechanisms in spin-defects.

Submitted 3 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

arXiv:2305.14857 [pdf, other]

BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer

Authors: Akari Asai, Sneha Kudugunta, Xinyan Velocity Yu, Terra Blevins, Hila Gonen, Machel Reid, Yulia Tsvetkov, Sebastian Ruder, Hannaneh Hajishirzi

Abstract: Despite remarkable advancements in few-shot generalization in natural language processing, most models are developed and evaluated primarily in English. To facilitate research on few-shot cross-lingual transfer, we introduce a new benchmark, called BUFFET, which unifies 15 diverse tasks across 54 languages in a sequence-to-sequence format and provides a fixed set of few-shot examples and instructions. BUFFET is designed to establish a rigorous and equitable evaluation framework for few-shot cross-lingual transfer across a broad range of tasks and languages. Using BUFFET, we perform thorough evaluations of state-of-the-art multilingual large language models with different transfer methods, namely in-context learning and fine-tuning. Our findings reveal significant room for improvement in few-shot in-context cross-lingual transfer. In particular, ChatGPT with in-context learning often performs worse than much smaller mT5-base models fine-tuned on English task data and few-shot in-language examples. Our analysis suggests various avenues for future research in few-shot cross-lingual transfer, such as improved pretraining, understanding, and future evaluations.

Submitted 24 May, 2023; originally announced May 2023.

Comments: The data and code is available at https://buffetfs.github.io/

arXiv:2212.08607 [pdf, other]

MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation

Authors: Swarnadeep Saha, Xinyan Velocity Yu, Mohit Bansal, Ramakanth Pasunuru, Asli Celikyilmaz

Abstract: Prompting large language models has enabled significant recent progress in multi-step reasoning over text. However, when applied to text generation from semi-structured data (e.g., graphs or tables), these methods typically suffer from low semantic coverage, hallucination, and logical inconsistency. We propose MURMUR, a neuro-symbolic modular approach to text generation from semi-structured data with multi-step reasoning. MURMUR is a best-first search method that generates reasoning paths using: (1) neural and symbolic modules with specific linguistic and logical skills, (2) a grammar whose production rules define valid compositions of modules, and (3) value functions that assess the quality of each reasoning step. We conduct experiments on two diverse data-to-text generation tasks like WebNLG and LogicNLG. These tasks differ in their data representations (graphs and tables) and span multiple linguistic and logical skills. MURMUR obtains significant improvements over recent few-shot baselines like direct prompting and chain-of-thought prompting, while also achieving comparable performance to fine-tuned GPT-2 on out-of-domain data. Moreover, human evaluation shows that MURMUR generates highly faithful and correct reasoning paths that lead to 26% more logically consistent summaries on LogicNLG, compared to direct prompting.

Submitted 16 December, 2022; originally announced December 2022.

Comments: 22 pages (9 figures, 18 tables)

arXiv:2211.17257 [pdf, other]

CREPE: Open-Domain Question Answering with False Presuppositions

Authors: Xinyan Velocity Yu, Sewon Min, Luke Zettlemoyer, Hannaneh Hajishirzi

Abstract: Information seeking users often pose questions with false presuppositions, especially when asking about unfamiliar topics. Most existing question answering (QA) datasets, in contrast, assume all questions have well defined answers. We introduce CREPE, a QA dataset containing a natural distribution of presupposition failures from online information-seeking forums. We find that 25% of questions contain false presuppositions, and provide annotations for these presuppositions and their corrections. Through extensive baseline experiments, we show that adaptations of existing open-domain QA models can find presuppositions moderately well, but struggle when predicting whether a presupposition is factually correct. This is in large part due to difficulty in retrieving relevant evidence passages from a large text corpus. CREPE provides a benchmark to study question answering in the wild, and our analyses provide avenues for future work in better modeling and further studying the task.

Submitted 30 November, 2022; originally announced November 2022.

arXiv:2211.15649 [pdf, other]

Beyond Counting Datasets: A Survey of Multilingual Dataset Construction and Necessary Resources

Authors: Xinyan Velocity Yu, Akari Asai, Trina Chatterjee, Junjie Hu, Eunsol Choi

Abstract: While the NLP community is generally aware of resource disparities among languages, we lack research that quantifies the extent and types of such disparity. Prior surveys estimating the availability of resources based on the number of datasets can be misleading as dataset quality varies: many datasets are automatically induced or translated from English data. To provide a more comprehensive picture of language resources, we examine the characteristics of 156 publicly available NLP datasets. We manually annotate how they are created, including input text and label sources and tools used to build them, and what they study, tasks they address and motivations for their creation. After quantifying the qualitative NLP resource gap across languages, we discuss how to improve data collection in low-resource languages. We survey language-proficient NLP researchers and crowd workers per language, finding that their estimated availability correlates with dataset availability. Through crowdsourcing experiments, we identify strategies for collecting high-quality multilingual data on the Mechanical Turk platform. We conclude by making macro and micro-level suggestions to the NLP community and individual researchers for future multilingual data development.

Submitted 28 November, 2022; originally announced November 2022.

Comments: Accepted to Findings of EMNLP 2022. You can view our annotations, contribute to our survey, and view the analysis visualizations on our website at https://multilingual-dataset-survey.github.io

arXiv:2209.12747 [pdf]

doi 10.1088/1361-651X/acdf06

Roadmap on Electronic Structure Codes in the Exascale Era

Authors: Vikram Gavini, Stefano Baroni, Volker Blum, David R. Bowler, Alexander Buccheri, James R. Chelikowsky, Sambit Das, William Dawson, Pietro Delugas, Mehmet Dogan, Claudia Draxl, Giulia Galli, Luigi Genovese, Paolo Giannozzi, Matteo Giantomassi, Xavier Gonze, Marco Govoni, Andris Gulans, François Gygi, John M. Herbert, Sebastian Kokott, Thomas D. Kühne, Kai-Hsin Liou, Tsuyoshi Miyazaki, Phani Motamarri, et al. (16 additional authors not shown)

Abstract: Electronic structure calculations have been instrumental in providing many important insights into a range of physical and chemical properties of various molecular and solid-state systems. Their importance to various fields, including materials science, chemical sciences, computational chemistry and device physics, is underscored by the large fraction of available public supercomputing resources devoted to these calculations. As we enter the exascale era, exciting new opportunities to increase simulation numbers, sizes, and accuracies present themselves. In order to realize these promises, the community of electronic structure software developers will however first have to tackle a number of challenges pertaining to the efficient use of new architectures that will rely heavily on massive parallelism and hardware accelerators. This roadmap provides a broad overview of the state-of-the-art in electronic structure calculations and of the various new directions being pursued by the community. It covers 14 electronic structure codes, presenting their current status, their development priorities over the next five years, and their plans towards tackling the challenges and leveraging the opportunities presented by the advent of exascale computing.

Submitted 26 September, 2022; originally announced September 2022.

Comments: Submitted as a roadmap article to Modelling and Simulation in Materials Science and Engineering; Address any correspondence to Vikram Gavini ([email protected]) and Danny Perez ([email protected])

arXiv:2204.07313 [pdf]

Rapid 3D Multiparametric Mapping of Brain Metastases with Deep Learning-Based Phase-Sensitive MR Fingerprinting

Authors: Victoria Y. Yu, Kathryn R. Tringale, Ricardo Otazo, Ouri Cohen

Abstract: In MR fingerprinting (MRF) reconstruction, measured data is pattern-matched to simulated signals to extract quantitative tissue parameters. A critical drawback to this approach is the exponentially increasing compute time for mapping of multiple parameters. Previously, a deep learning (DL) reconstruction method called DRONE was shown to overcome this constraint by mapping the magnitude time-series signal to the underlying tissue parameters. However, relaxometry from magnitude images is susceptible to errors arising from ambiguities in the zero crossing of the signal or the non-zero noise mean. The aim of this study is to develop rapid acquisition and quantification methods to enable accurate multiparametric tissue mapping from complex data. An optimized EPI based MRF sequence is developed along with a novel phase-sensitive DL quantification allowing the use of real-valued neural networks to reconstruct complex measured data and providing an additional quantitative map of the phase. Phantom experiments demonstrate the accuracy of the proposed approach. A comparison to previous DRONE methods in a healthy subject shows improved fidelity to known T1 and T2 values for the phase-sensitive approach. By processing the estimated phase map with conventional quantitative susceptibility mapping algorithms, we demonstrate the feasibility of simultaneous quantification of proton density, T1, T2, transmitter B1+ field and the quantitative susceptibility maps. In vivo experiments in a healthy volunteer and a subject with metastatic brain cancer are used to illustrate potential applications of this technology for treatment response assessment and tumor characterization.

Submitted 14 April, 2022; originally announced April 2022.

Comments: 9 pages, 9 figures

arXiv:2203.05623 [pdf, other]

doi 10.1021/acs.jctc.2c00241

GPU Acceleration of Large-Scale Full-Frequency GW Calculations

Authors: Victor Wen-zhe Yu, Marco Govoni

Abstract: Many-body perturbation theory is a powerful method to simulate electronic excitations in molecules and materials starting from the output of density functional theory calculations. By implementing the theory efficiently so as to run at scale on the latest leadership high-performance computing systems it is possible to extend the scope of GW calculations. We present a GPU acceleration study of the full-frequency GW method as implemented in the WEST code. Excellent performance is achieved through the use of (i) optimized GPU libraries, e.g., cuFFT and cuBLAS, (ii) a hierarchical parallelization strategy that minimizes CPU-CPU, CPU-GPU, and GPU-GPU data transfer operations, (iii) nonblocking MPI communications that overlap with GPU computations, and (iv) mixed-precision in selected portions of the code. A series of performance benchmarks have been carried out on leadership high-performance computing systems, showing a substantial speedup of the GPU-accelerated version of WEST with respect to its CPU version. Good strong and weak scaling is demonstrated using up to 25920 GPUs. Finally, we showcase the capability of the GPU version of WEST for large-scale, full-frequency GW calculations of realistic systems, e.g., a nanostructure, an interface, and a defect, comprising up to 10368 valence electrons.

Submitted 9 August, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

Journal ref: Journal of Chemical Theory and Computation 18 (2022): 4690-4707

arXiv:2203.00985 [pdf, other]

doi 10.1103/PhysRevMaterials.6.064002

Boron nitride on SiC(0001)

Authors: You-Ron Lin, Markus Franke, Shayan Parhizkar, Miriam Raths, Victor Wen-zhe Yu, Tien-Lin Lee, Serguei Soubatch, Volker Blum, F. Stefan Tautz, Christian Kumpf, François C. Bocquet

Abstract: In the field of van der Waals heterostructures, the twist angle between stacked two-dimensional (2D) layers has been identified to be of utmost importance for the properties of the heterostructures. In this context, we previously reported the growth of a single layer of unconventionally oriented epitaxial graphene that forms in a surfactant atmosphere [F. C. Bocquet, et al., Phys. Rev. Lett. 125, 106102 (2020)]. The resulting G-R0$^\circ$ layer is aligned with the SiC lattice, and hence represents an important milestone towards high quality twisted bilayer graphene (tBLG), a frequently investigated model system in this field. Here, we focus on the surface structures obtained in the same surfactant atmosphere, but at lower preparation temperatures at which a boron nitride template layer forms on SiC(0001). In a comprehensive study based on complementary experimental and theoretical techniques, we find -- in contrast to the literature -- that this template layer is a hexagonal B$_x$N$_y$ layer, but not high-quality hBN. It is aligned with the SiC lattice and gradually replaced by low-quality graphene in the 0$^\circ$ orientation of the B$_x$N$_y$ template layer upon annealing.

Submitted 14 April, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

Journal ref: Phys. Rev. Materials, 6, 064002 (2022)

arXiv:2108.08333 [pdf]

CEST MR fingerprinting (CEST-MRF) for Brain Tumor Quantification Using EPI Readout and Deep Learning Reconstruction

Authors: Ouri Cohen, Victoria Y. Yu, Kathryn R. Tringale, Robert J. Young, Or Perlman, Christian T. Farrar, Ricardo Otazo

Abstract: $\textbf{Purpose}$: To develop a clinical CEST MR fingerprinting (CEST-MRF) method for brain tumor quantification using EPI acquisition and deep learning reconstruction. $\textbf{Methods}$: A CEST-MRF pulse sequence originally designed for animal imaging was modified to conform to hardware limits on clinical scanners while keeping scan time $\leq$ 2 minutes. Quantitative MRF reconstruction was performed using a deep reconstruction network (DRONE) to yield the water relaxation and chemical exchange parameters. The feasibility of the 6 parameter DRONE reconstruction was tested in simulations in a digital brain phantom. A healthy subject was scanned with the CEST-MRF sequence, conventional MRF and CEST sequences for comparison. Reproducibility was assessed via test-retest experiments and the concordance correlation coefficient (CCC) calculated for white matter (WM) and grey matter (GM). The clinical utility of CEST-MRF was demonstrated in 4 patients with brain metastases in comparison to standard clinical imaging sequences. Tumors were segmented into edema, solid core and necrotic core regions and the CEST-MRF values compared to the contra-lateral side. $\textbf{Results}$: The DRONE reconstruction of the digital phantom yielded a normalized RMS error of $\leq$ 7% for all parameters. The CEST-MRF parameters were in good agreement with those from conventional MRF and CEST sequences and previous studies. The mean CCC for all 6 parameters was 0.98$\pm$0.01 in WM and 0.98$\pm$0.02 in GM. The CEST-MRF values in nearly all tumor regions were significantly different (P=0.05) from each other and the contra-lateral side. $\textbf{Conclusion}$: Combination of EPI readout and deep learning reconstruction enabled fast, accurate and reproducible CEST-MRF in brain tumors.

Submitted 11 April, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

Comments: 9 figures, 1 table

arXiv:2106.06412 [pdf, other]

doi 10.1063/5.0050296

Accurate Frozen Core Approximation for All-Electron Density-Functional Theory

Authors: Victor Wen-zhe Yu, Jonathan Moussa, Volker Blum

Abstract: We implement and benchmark the frozen core approximation, a technique commonly adopted in electronic structure theory to reduce the computational cost by means of mathematically fixing the chemically inactive core electron states. The accuracy and efficiency of this approach are well controlled by a single parameter, the number of frozen orbitals. Explicit corrections for the frozen core orbitals and the unfrozen valence orbitals are introduced, safeguarding against seemingly minor numerical deviations from the assumed orthonormality conditions of the basis functions. A speedup of over two-fold can be achieved for the diagonalization step in all-electron density-functional theory simulations containing heavy elements, without any accuracy degradation in terms of the electron density, total energy, and atomic forces. This is demonstrated in a benchmark study covering 103 materials across the periodic table, and a large-scale simulation of CsPbBr3 with 2,560 atoms. Our study provides a rigorous benchmark of the precision of the frozen core approximation (sub-meV per atom for frozen core orbitals below -200 eV) for a wide range of test cases and for chemical elements ranging from Li to Po. The algorithms discussed here are implemented in the open-source Electronic Structure Infrastructure software package.

Submitted 11 June, 2021; originally announced June 2021.

Journal ref: The Journal of Chemical Physics 154 (2021) 224107

arXiv:2012.12263 [pdf, other]

Challenges of Equitable Vaccine Distribution in the COVID-19 Pandemic

Authors: Joseph Bae, Darshan Gandhi, Jil Kothari, Sheshank Shankar, Jonah Bae, Parth Patwa, Rohan Sukumaran, Aviral Chharia, Sanjay Adhikesaven, Shloak Rathod, Irene Nandutu, Sethuraman TV, Vanessa Yu, Krutika Misra, Srinidhi Murali, Aishwarya Saxena, Kasia Jakimowicz, Vivek Sharma, Rohan Iyer, Ashley Mehra, Alex Radunsky, Priyanshi Katiyar, Ananthu James, Jyoti Dalal, Sunaina Anand, et al. (3 additional authors not shown)

Abstract: The COVID-19 pandemic has led to a need for widespread and rapid vaccine development. As several vaccines have recently been approved for human use or are in different stages of development, governments across the world are preparing comprehensive guidelines for vaccine distribution and monitoring. In this early article, we identify challenges in logistics, health outcomes, user-centric matters, and communication associated with disease-related, individual, societal, economic, and privacy consequences. Primary challenges include difficulty in equitable distribution, vaccine efficacy, duration of immunity, multi-dose adherence, and privacy-focused record-keeping to be HIPAA compliant. While many of these challenges have been previously identified and addressed, some have not been acknowledged from a comprehensive view accounting for unprecedented interactions between challenges and specific populations. The logistics of equitable widespread vaccine distribution in disparate populations and countries of various economic, racial, and cultural constitutions must be thoroughly examined and accounted for. We also describe unique challenges regarding the efficacy of vaccines in specialized populations including children, the elderly, and immunocompromised individuals. Furthermore, we report the potential for understudied drug-vaccine interactions as well as the possibility that certain vaccine platforms may increase susceptibility to HIV. Given these complicated issues, the importance of privacy-focused, user-centric systems for vaccine education and incentivization along with clear communication from governments, organizations, and academic institutions is imperative. These challenges are by no means insurmountable, but require careful attention to avoid consequences spanning a range of disease-related, individual, societal, economic, and security domains.

Submitted 27 April, 2022; v1 submitted 24 November, 2020; originally announced December 2020.

Comments: 18 pages, 3 figures

arXiv:2006.01270 [pdf, other]

doi 10.1063/5.0005077

SIESTA: recent developments and applications

Authors: Alberto García, Nick Papior, Arsalan Akhtar, Emilio Artacho, Volker Blum, Emanuele Bosoni, Pedro Brandimarte, Mads Brandbyge, J. I. Cerdá, Fabiano Corsetti, Ramón Cuadrado, Vladimir Dikan, Jaime Ferrer, Julian Gale, Pablo García-Fernández, V. M. García-Suárez, Sandra García, Georg Huhs, Sergio Illera, Richard Korytár, Peter Koval, Irina Lebedeva, Lin Lin, Pablo López-Tarifa, Sara G. Mayo, et al. (11 additional authors not shown)

Abstract: A review of the present status, recent enhancements, and applicability of the SIESTA program is presented. Since its debut in the mid-nineties, SIESTA's flexibility, efficiency and free distribution has given advanced materials simulation capabilities to many groups worldwide. The core methodological scheme of SIESTA combines finite-support pseudo-atomic orbitals as basis sets, norm-conserving pseudopotentials, and a real-space grid for the representation of charge density and potentials and the computation of their associated matrix elements. Here we describe the more recent implementations on top of that core scheme, which include: full spin-orbit interaction, non-repeated and multiple-contact ballistic electron transport, DFT+U and hybrid functionals, time-dependent DFT, novel reduced-scaling solvers, density-functional perturbation theory, efficient Van der Waals non-local density functionals, and enhanced molecular-dynamics options. In addition, a substantial effort has been made in enhancing interoperability and interfacing with other codes and utilities, such as Wannier90 and the second-principles modelling it can be used for, an AiiDA plugin for workflow automatization, interface to Lua for steering SIESTA runs, and various postprocessing utilities.
SIESTA has also been engaged in the Electronic Structure Library effort from its inception, which has allowed the sharing of various low level libraries, as well as data standards and support for them, in particular the PSML definition and library for transferable pseudopotentials, and the interface to the ELSI library of solvers. Code sharing is made easier by the new open-source licensing model of the program. This review also presents examples of application of the capabilities of the code, as well as a view of on-going and future developments.

Submitted 1 June, 2020; originally announced June 2020.

Comments: 29 pages, 23 figures

Journal ref: J. Chem. Phys. 152, 204108 (2020)

arXiv:2005.05756 [pdf, other]

doi 10.1063/5.0012901

The CECAM Electronic Structure Library and the modular software development paradigm

Authors: Micael J. T. Oliveira, Nick Papior, Yann Pouillon, Volker Blum, Emilio Artacho, Damien Caliste, Fabiano Corsetti, Stefano de Gironcoli, Alin M. Elena, Alberto Garcia, Victor M. Garcia-Suarez, Luigi Genovese, William P. Huhn, Georg Huhs, Sebastian Kokott, Emine Kucukbenli, Ask H. Larsen, Alfio Lazzaro, Irina V. Lebedeva, Yingzhou Li, David Lopez-Duran, Pablo Lopez-Tarifa, Martin Luders, Miguel A. L. Marques, Jan Minar, et al. (12 additional authors not shown)

Abstract: First-principles electronic structure calculations are very widely used thanks to the many successful software packages available. Their traditional coding paradigm is monolithic, i.e., regardless of how modular its internal structure may be, the code is built independently from others, from the compiler up, with the exception of linear-algebra and message-passing libraries. This model has been quite successful for decades. The rapid progress in methodology, however, has resulted in an ever increasing complexity of those programs, which implies a growing amount of replication in coding and in the recurrent re-engineering needed to adapt to evolving hardware architecture. The Electronic Structure Library (ESL) was initiated by CECAM (European Centre for Atomic and Molecular Calculations) to catalyze a paradigm shift away from the monolithic model and promote modularization, with the ambition to extract common tasks from electronic structure programs and redesign them as free, open-source libraries. They include "heavy-duty" ones with a high degree of parallelisation, and potential for adaptation to novel hardware within them, thereby separating the sophisticated computer science aspects of performance optimization and re-engineering from the computational science done by scientists when implementing new ideas. It is a community effort, undertaken by developers of various successful codes, now facing the challenges arising in the new model.
This modular paradigm will improve overall coding efficiency and enable specialists (computer scientists or computational scientists) to use their skills more effectively. It will lead to a more sustainable and dynamic evolution of software as well as lower barriers to entry for new developers.

Submitted 24 June, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

Comments: Revised version as finally accepted by J. Chem. Phys. to appear within the Special Topic in Electronic Structure Software (version prior to JCP's typesetting and proofs)

arXiv:2002.10991 [pdf, other]

doi 10.1016/j.cpc.2020.107808

GPU-Acceleration of the ELPA2 Distributed Eigensolver for Dense Symmetric and Hermitian Eigenproblems

Authors: Victor Wen-zhe Yu, Jonathan Moussa, Pavel Kůs, Andreas Marek, Peter Messmer, Mina Yoon, Hermann Lederer, Volker Blum

Abstract: The solution of eigenproblems is often a key computational bottleneck that limits the tractable system size of numerical algorithms, among them electronic structure theory in chemistry and in condensed matter physics. Large eigenproblems can easily exceed the capacity of a single compute node, thus must be solved on distributed-memory parallel computers. We here present GPU-oriented optimizations of the ELPA two-stage tridiagonalization eigensolver (ELPA2). On top of cuBLAS-based GPU offloading, we add a CUDA kernel to speed up the back-transformation of eigenvectors, which can be the computationally most expensive part of the two-stage tridiagonalization algorithm. We benchmark the performance of this GPU-accelerated eigensolver on two hybrid CPU-GPU architectures, namely a compute cluster based on Intel Xeon Gold CPUs and NVIDIA Volta GPUs, and the Summit supercomputer based on IBM POWER9 CPUs and NVIDIA Volta GPUs. Consistent with previous benchmarks on CPU-only architectures, the GPU-accelerated two-stage solver exhibits a parallel performance superior to the one-stage counterpart. Finally, we demonstrate the performance of the GPU-accelerated eigensolver developed in this work for routine semi-local KS-DFT calculations comprising thousands of atoms.

Submitted 14 January, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

Journal ref: Computer Physics Communications 262 (2021) 107808

arXiv:1912.13403 [pdf, other]

doi 10.1016/j.cpc.2020.107459

ELSI -- An Open Infrastructure for Electronic Structure Solvers

Authors: Victor Wen-zhe Yu, Carmen Campos, William Dawson, Alberto García, Ville Havu, Ben Hourahine, William P Huhn, Mathias Jacquelin, Weile Jia, Murat Keçeli, Raul Laasner, Yingzhou Li, Lin Lin, Jianfeng Lu, Jonathan Moussa, Jose E Roman, Álvaro Vázquez-Mayagoitia, Chao Yang, Volker Blum

Abstract: Routine applications of electronic structure theory to molecules and periodic systems need to compute the electron density from given Hamiltonian and, in case of non-orthogonal basis sets, overlap matrices. System sizes can range from few to thousands or, in some examples, millions of atoms. Different discretization schemes (basis sets) and different system geometries (finite non-periodic vs. infinite periodic boundary conditions) yield matrices with different structures. The ELectronic Structure Infrastructure (ELSI) project provides an open-source software interface to facilitate the implementation and optimal use of high-performance solver libraries covering cubic scaling eigensolvers, linear scaling density-matrix-based algorithms, and other reduced scaling methods in between. In this paper, we present recent improvements and developments inside ELSI, mainly covering (1) new solvers connected to the interface, (2) matrix layout and communication adapted for parallel calculations of periodic and/or spin-polarized systems, (3) routines for density matrix extrapolation in geometry optimization and molecular dynamics calculations, and (4) general utilities such as parallel matrix I/O and JSON output. The ELSI interface has been integrated into four electronic structure code projects (DFTB+, DGDFT, FHI-aims, SIESTA), allowing us to rigorously benchmark the performance of the solvers on an equal footing.
Based on results of a systematic set of large-scale benchmarks performed with Kohn-Sham density-functional theory and density-functional tight-binding theory, we identify factors that strongly affect the efficiency of the solvers, and propose a decision layer that assists with the solver selection process. Finally, we describe a reverse communication interface encoding matrix-free iterative solver strategies that are amenable, e.g., for use with planewave basis sets.

Submitted 4 July, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

Journal ref: Computer Physics Communications 256 (2020) 107459

arXiv:1912.06636 [pdf, other]

doi 10.1016/j.cpc.2020.107314

GPGPU Acceleration of All-Electron Electronic Structure Theory Using Localized Numeric Atom-Centered Basis Functions

Authors: William Huhn, Björn Lange, Victor Wen-zhe Yu, Mina Yoon, Volker Blum

Abstract: We present an implementation of all-electron density-functional theory for massively parallel GPGPU-based platforms, using localized atom-centered basis functions and real-space integration grids. Special attention is paid to domain decomposition of the problem on non-uniform grids, which enables compute- and memory-parallel execution across thousands of nodes for real-space operations, e.g. the update of the electron density, the integration of the real-space Hamiltonian matrix, and calculation of Pulay forces. To assess the performance of our GPGPU implementation, we performed benchmarks on three different architectures using a 103-material test set. We find that operations which rely on dense serial linear algebra show dramatic speedups from GPGPU acceleration: in particular, SCF iterations including force and stress calculations exhibit speedups ranging from 4.5 to 6.6. For the architectures and problem types investigated here, this translates to an expected overall speedup between 3-4 for the entire calculation (including non-GPU accelerated parts), for problems featuring several tens to hundreds of atoms. Additional calculations for a 375-atom Bi$_2$Se$_3$ bilayer show that the present GPGPU strategy scales for large-scale distributed-parallel simulations.

Submitted 13 December, 2019; originally announced December 2019.

Comments: 49 pages, 9 figures

Journal ref: Computer Physics Communications 254, 107314 (2020)

arXiv:1908.04809 [pdf]

doi 10.1088/2057-1976/ab6e1f

Generation of abdominal synthetic CTs from 0.35T MR images using generative adversarial networks for MR-only liver radiotherapy

Authors: Jie Fu, Kamal Singhrao, Minsong Cao, Victoria Yu, Anand P. Santhanam, Yingli Yang, Minghao Guo, Ann C. Raldow, Dan Ruan, John H. Lewis

Abstract: Electron density maps must be accurately estimated to achieve valid dose calculation in MR-only radiotherapy. The goal of this study is to assess whether two deep learning models, the conditional generative adversarial network (cGAN) and the cycle-consistent generative adversarial network (cycleGAN), can generate accurate abdominal synthetic CT (sCT) images from 0.35T MR images for MR-only liver radiotherapy. A retrospective study was performed using CT images and 0.35T MR images of 12 patients with liver (n=8) and non-liver abdominal (n=4) cancer. CT images were deformably registered to the corresponding MR images to generate deformed CT (dCT) images for treatment planning. Both cGAN and cycleGAN were trained using MR and dCT transverse slices. Four-fold cross-validation testing was conducted to generate sCT images for all patients. The HU prediction accuracy was evaluated by voxel-wise similarity metric between each dCT and sCT image for all 12 patients. dCT-based and sCT-based dose distributions were compared using gamma and dose-volume histogram (DVH) metric analysis for 8 liver patients. sCTcycleGAN achieved the average mean absolute error (MAE) of 94.1 HU, while sCTcGAN achieved 89.8 HU. In both models, the average gamma passing rates within all volumes of interest were higher than 95% using a 2%, 2 mm criterion, and 99% using a 3%, 3 mm criterion. The average differences in the mean dose and DVH metrics were within +/-0.6% for the planning target volume and within +/-0.15% for evaluated organs in both models.
Results demonstrated that abdominal sCT images generated by both cGAN and cycleGAN achieved accurate dose calculation for 8 liver radiotherapy plans. sCTcGAN images had smaller average MAE and achieved better dose calculation accuracy than sCTcycleGAN images. More abdominal patients will be enrolled in the future to further evaluate two models.

Submitted 13 August, 2019; originally announced August 2019.

Comments: Review in progress

Journal ref: 2020 Biomed. Phys. Eng. Express

arXiv:1805.12225 [pdf, other]

Molecular NMR shieldings, J-couplings, and magnetizabilities from numeric atom-centered orbital based density-functional calculations

Authors: Raul Laasner, William Huhn, Johannes Colell, Thomas Theis, Victor Yu, Warren Warren, Volker Blum

Abstract: We describe an accurate and scalable implementation for the computation of molecular nuclear magnetic resonance shieldings, J-couplings, and magnetizabilities within nonrelativistic semilocal density functional theory, based on numeric atom-centered orbital (NAO) basis sets. We compare the convergence to the basis set limit for two established types of NAO basis sets, called NAO-VCC-nZ and FHI-aims-09, to several established Gaussian-type basis sets. The basis set limit is reached faster for the NAO basis sets than for standard correlation consistent Gaussian-type basis sets (cc-pVnZ, aug-cc-pVnZ, cc-pCVnZ, aug-cc-pCVnZ). For shieldings, the convergence properties and accuracy of the NAO-VCC-nZ basis sets are similar to Jensen's polarization consistent (pc) basis sets optimized for shieldings (pcS-n). For J-couplings, we develop a new type of NAO basis set (NAO-J-n) by augmenting the NAO-VCC-nZ basis sets with tight s-functions from Jensen's pcJ-n basis sets, which are optimized for J-couplings. We find the convergence of the NAO-J-n to be similar to the pcJ-n basis sets. Large scale applicability of the implementation is demonstrated for shieldings and J-couplings in a system of over 1,000 atoms.

Submitted 30 May, 2018; originally announced May 2018.

arXiv:1805.09725 [pdf, other]

doi 10.1103/PhysRevB.98.165434

Phase Diagram of Quantum Hall Breakdown and Non-linear Phenomena for InGaAs/InP Quantum Wells

Authors: V. Yu, M. Hilke, P. J. Poole, S. Studenikin, D. G. Austing

Abstract: We investigate non-linear magneto-transport in a Hall bar device made from a strained InGaAs/InP quantum well: a material system with attractive spintronic properties. From extensive maps of the longitudinal differential resistance (r_xx) as a function of current and magnetic (B-) field, phase diagrams are generated for quantum Hall breakdown in the strong quantum Hall regime reaching filling factor $ν$=1. By careful illumination the electron sheet density (n) is incremented in small steps and this provides insight into how the transport characteristics evolve with n. We explore in depth the energetics of integer quantum Hall breakdown and provide a simple picture for the principal features in the r_xx maps. A simple tunneling model that captures a number of the characteristic features is introduced. Parameters such as critical Hall electric fields and the exchange-enhanced g-factors for odd-filling factors including $ν$=1 are extracted. A detailed examination is made of the B-field dependence of the critical current as determined by two different methods and compiled for different values of n. A simple rescaling procedure that allows the critical current data points obtained from r_xx maxima for even-filling to collapse on to a single curve is demonstrated. Exchange-enhanced g-factors for odd-filling are extracted from the compiled data and are compared to those determined by conventional thermal activation measurements. The exchange-enhanced g-factor is found to increase with decreasing n.

Submitted 24 May, 2018; originally announced May 2018.

Comments: 16 pages

Journal ref: Phys. Rev. B 98, 165434 (2018)

arXiv:1710.05480 [pdf, other]

doi 10.1088/1361-6560/aaa94f

Fraction-variant beam orientation optimization for non-coplanar IMRT

Authors: Daniel O'Connor, Dan Nguyen, Dan Ruan, Victoria Yu, Ke Sheng

Abstract: Conventional beam orientation optimization (BOO) algorithms for IMRT assume that the same set of beam angles is used for all treatment fractions. In this paper we present a BOO formulation based on group sparsity that simultaneously optimizes non-coplanar beam angles for all fractions, yielding a fraction-variant (FV) treatment plan. Beam angles are selected by solving a multi-fraction FMO problem involving 500-700 candidate beams per fraction, with an additional group sparsity term that encourages most candidate beams to be inactive. The optimization problem is solved using the Fast Iterative Shrinkage-Thresholding Algorithm. Our FV BOO algorithm is used to create non-coplanar, five-fraction treatment plans for prostate and lung cases, as well as a non-coplanar 30-fraction plan for a head and neck case. A homogeneous PTV dose coverage is maintained in all fractions. The treatment plans are compared with fraction-invariant plans that use a fixed set of beam angles for all fractions. The FV plans reduced mean and max OAR dose on average by 3.3% and 3.7% of the prescription dose, respectively. Notably, mean OAR dose was reduced by 14.3% of prescription dose (rectum), 11.6% (penile bulb), 10.7% (seminal vesicle), 5.5% (right femur), 3.5% (bladder), 4.0% (normal left lung), 15.5% (cochleas), and 5.2% (chiasm). Max OAR dose was reduced by 14.9% of prescription dose (right femur), 8.2% (penile bulb), 12.7% (prox. bronchus), 4.1% (normal left lung), 15.2% (cochleas), 10.1% (orbits), 9.1% (chiasm), 8.7% (brainstem), and 7.1% (parotids).
Meanwhile, PTV homogeneity defined as D95/D5 improved from .95 to .98 (prostate case) and from .94 to .97 (lung case), and remained constant for the head and neck case. Moreover, the FV plans are dosimetrically similar to conventional plans that use twice as many beams per fraction. Thus, FV BOO offers the potential to reduce delivery time for non-coplanar IMRT.

Submitted 15 October, 2017; originally announced October 2017.

arXiv:1705.11191 [pdf, other]

doi 10.1016/j.cpc.2017.09.007

ELSI: A Unified Software Interface for Kohn-Sham Electronic Structure Solvers

Authors: Victor Wen-zhe Yu, Fabiano Corsetti, Alberto García, William P. Huhn, Mathias Jacquelin, Weile Jia, Björn Lange, Lin Lin, Jianfeng Lu, Wenhui Mi, Ali Seifitokaldani, Álvaro Vázquez-Mayagoitia, Chao Yang, Haizhao Yang, Volker Blum

Abstract: Solving the electronic structure from a generalized or standard eigenproblem is often the bottleneck in large scale calculations based on Kohn-Sham density-functional theory. This problem must be addressed by essentially all current electronic structure codes, based on similar matrix expressions, and by high-performance computation. We here present a unified software interface, ELSI, to access different strategies that address the Kohn-Sham eigenvalue problem. Currently supported algorithms include the dense generalized eigensolver library ELPA, the orbital minimization method implemented in libOMM, and the pole expansion and selected inversion (PEXSI) approach with lower computational complexity for semilocal density functionals. The ELSI interface aims to simplify the implementation and optimal use of the different strategies, by offering (a) a unified software framework designed for the electronic structure solvers in Kohn-Sham density-functional theory; (b) reasonable default parameters for a chosen solver; (c) automatic conversion between input and internal working matrix formats, and in the future (d) recommendation of the optimal solver depending on the specific problem. Comparative benchmarks are shown for system sizes up to 11,520 atoms (172,800 basis functions) on distributed memory supercomputing architectures.

Submitted 31 May, 2017; originally announced May 2017.

Comments: 55 pages, 14 figures, 2 tables

Journal ref: Computer Physics Communications 222 (2018) 267-285

arXiv:1509.01579 [pdf, other]

Time Evolution of the Growth of Single Graphene Crystals and High Resolution Isotope Labeling

Authors: Eric Whiteway, Wayne Yang, Victor Yu, Michael Hilke

Abstract: We developed a method of precise isotope labeling to visualize the continuous growth of graphene by chemical vapor deposition (CVD). This method allows us to see in real time the growth of graphene monocrystals at a resolution of a few seconds. This technique is used to extract the anisotropic growth rates, the formation of dendrites, and the dependence on adsorption area of methane on copper. We obtain a physical picture of the growth dynamics of graphene and its dependence on various parameters. Finally, our method is relevant to other CVD grown materials.

Submitted 4 September, 2015; originally announced September 2015.

arXiv:1503.08041 [pdf]

Experimental and ab initio studies of the novel piperidine-containing acetylene glycols

Authors: Amina Mirsakiyeva, Darya Botkina, Karim Elgammal, Assel Ten, Håkan W. Hugosson, Anna Delin, Valentina K. Yu

Abstract: Synthesis routes of novel piperidine-containing diacetylene are presented. The new molecules are expected to exhibit plant growth stimulation properties. In particular, the yield in a situation of drought is expected to increase. The synthesis makes use of the Favorskii reaction between cycloketones/piperidone and triple-bond containing glycols. The geometries of the obtained molecules were determined using nuclear magnetic resonance (NMR). The electronic structure and geometries of the molecules were studied theoretically using first-principles calculations based on density functional theory. The calculated geometries agree very well with the experimentally measured ones, and also allow us to determine bond lengths, angles and charge distributions inside the molecules. The stability of the OH-radicals located close to the triple bond and the piperidine/cyclohexane rings was proven by both experimental and theoretical analyses. The HOMO/LUMO analysis was done in order to characterize the electron density of the molecule. The calculations show that triple bond does not participate in intermolecular reactions which excludes the instability of novel materials as a reason for low production rate.

Submitted 27 March, 2015; originally announced March 2015.

Comments: 10 pages, 9 figures, 3 tables, method of synthesis, NMR analysis, DFT calculations

arXiv:1301.7033 [pdf, other]

Quantum Hall Effect in Fractal Graphene: Growth and Properties of Graphlocons

Authors: Mathieu Massicotte, Victor Yu, Eric Whiteway, Dan Vatnik, Michael Hilke

Abstract: Highly dendritic graphene crystals up to 0.25 mm in diameter are synthesized by low pressure chemical vapor deposition inside a copper enclosure. With their sixfold symmetry and fractal-like shape, the crystals resemble snowflakes. The evolution of the dendritic growth features is investigated for different growth conditions and surface diffusion is found to be the growth-limiting step responsible for the formation of dendrites. The electronic properties of the dendritic crystals are examined down to sub-Kelvin temperatures, showing a mobility of up to 6300 cm$^2$V$^{-1}$s$^{-1}$ and quantum Hall oscillations are observed above 4T. These results demonstrate the high quality of the transport properties despite their rough dendritic edges.

Submitted 29 January, 2013; originally announced January 2013.

arXiv:1212.5337 [pdf, other]

doi 10.1088/1742-6596/456/1/012016

Weak Localisation in Clean and Highly Disordered Graphene

Authors: Michael Hilke, Mathieu Massicotte, Eric Whiteway, Victor Yu

Abstract: We look at the magnetic field induced weak localisation peak of graphene samples with different mobilities. At very low temperatures, low mobility samples exhibit a very broad peak as a function of the magnetic field, in contrast to higher mobility samples, where the weak localisation peak is very sharp. We analyze the experimental data in the context of the localisation length, which allows us to extract both the localisation length and the phase coherence length of the samples, regardless of their mobilities. This analysis is made possible by the observation that the localisation length undergoes a generic weak localisation dependence with striking universal properties.

Submitted 20 December, 2012; originally announced December 2012.

Comments: 6 pages, HMF20 Proceedings

arXiv:1212.5334 [pdf, other]

Weak Localization in Graphene: Theory, Simulations and Experiments

Authors: M. Hilke, M. Massicotte, E. Whiteway, V. Yu

Abstract: We provide a comprehensive picture of magnetotransport in graphene monolayers in the limit of non-quantizing magnetic fields. We discuss the effects of two carrier transport, weak localization, weak anti-localization, and strong localization for graphene devices of various mobilities, through theory, experiments and numerical simulations. In particular, we observe the weak localization of the localization length, which allows us to make the connection between weak and strong localization. It provides a unified framework for both localizations, which explains the observed experimental features. We compare these results to numerical simulation and find a remarkable agreement between theory, experiment and numerics. Various graphene devices were used in this study, including graphene on different substrates, such as glass and silicon, as well as low and high mobility devices.

Submitted 20 December, 2012; originally announced December 2012.

Comments: 8 pages

arXiv:1211.2234 [pdf, ps, other]

doi 10.1088/0004-637X/763/2/83

Herschel/PACS Spectroscopic Survey of Protostars in Orion: The Origin of Far-Infrared CO Emission

Authors: P. Manoj, D. M. Watson, D. A. Neufeld, S. T. Megeath, R. Vavrek, Vincent Yu, R. Visser, E. A. Bergin, W. J. Fischer, J. J. Tobin, A. M. Stutz, B. Ali, T. L. Wilson, J. Di Francesco, M. Osorio, S. Maret, C. A. Poteet

Abstract: We present far-IR (57-196 mu) spectra of 21 protostars in the Orion molecular clouds, obtained with the Photodetector Array Camera and Spectrometer (PACS) onboard the Herschel Space Observatory, as part of the Herschel Orion Protostar Survey (HOPS) program. We analyzed the CO emission lines (J_up = 14-46) in the PACS spectra, extracted within a projected distance of <= 2000 AU centered on the protostar. The total luminosity of the CO lines observed with PACS (L(CO)) is found to increase with increasing L_bol. The CO rotational temperature implied by the line ratios increases with J, and at least 3-4 rotational temperature components are required to fit the observed rotational diagram. The rotational temperature components are remarkably invariant between protostars and show no dependence on L_bol, T_bol or envelope density, implying that if the emitting gas is in LTE, the CO emission must arise in multiple temperature components that remain independent of L_bol over two orders of magnitude. The observed CO emission can also be modeled as arising from a single temperature gas component or from a medium with a power-law temperature distribution; both of these require sub-thermally excited molecular gas at low densities (n(H_2) <= 10^6 cm^-3) and high temperatures (T >= 2000 K). Our results suggest that the contribution from PDRs along the envelope cavity walls is unlikely to be the dominant component of the CO emission observed with PACS. Instead, the "universality" of the rotational temperatures and the observed correlation between L(CO) and L_bol can most easily be explained if the observed CO emission originates in shock-heated, hot (T >= 2000 K), sub-thermally excited (n(H_2) <= 10^6 cm^-3) molecular gas. Post-shock gas at these densities is more likely to be found within the outflow cavities along the molecular outflow or along the cavity walls at radii >= several 100-1000 AU.

Submitted 9 November, 2012; originally announced November 2012.

Comments: 30 pages, 20 figures, 4 tables, accepted for publication in ApJ

arXiv:1111.1643 [pdf, ps, other]

doi 10.1103/PhysRevB.86.085409

Experimental Phonon Band Structure of Graphene using C$^{12}$ and C$^{13}$ Isotopes

Authors: Simon Bernard, Eric Whiteway, Victor Yu, D. G. Austing, Michael Hilke

Abstract: Using very uniform large scale chemical vapor deposition grown graphene transferred onto silicon, we were able to identify 15 distinct Raman lines associated with graphene monolayers. This was possible thanks to a combination of different carbon isotopes and different Raman laser energies and extensive averaging without increasing the laser power. This allowed us to obtain a detailed experimental phonon dispersion relation for many points in the Brillouin zone. We further identified a D+D' peak corresponding to a double phonon process involving both an inter- and intra-valley phonon.

Submitted 7 November, 2011; originally announced November 2011.

Comments: 5 pages, 4 figures, 1 table

Journal ref: Phys. Rev. B 86, 085409 (2012)

arXiv:1110.6557 [pdf, other]

Experimental review of graphene

Authors: Daniel R. Cooper, Benjamin D'Anjou, Nageswara Ghattamaneni, Benjamin Harack, Michael Hilke, Alexandre Horth, Norberto Majlis, Mathieu Massicotte, Leron Vandsburger, Eric Whiteway, Victor Yu

Abstract: This review examines the properties of graphene from an experimental perspective. The intent is to review the most important experimental results at a level of detail appropriate for new graduate students who are interested in a general overview of the fascinating properties of graphene. While some introductory theoretical concepts are provided, including a discussion of the electronic band structure and phonon dispersion, the main emphasis is on describing relevant experiments and important results as well as some of the novel applications of graphene. In particular, this review covers graphene synthesis and characterization, field-effect behavior, electronic transport properties, magneto-transport, integer and fractional quantum Hall effects, mechanical properties, transistors, optoelectronics, graphene-based sensors, and biosensors. This approach attempts to highlight both the means by which the current understanding of graphene has come about and some tools for future contributions.

Submitted 29 October, 2011; originally announced October 2011.

Comments: Equal contributions from all authors

arXiv:1101.1884 [pdf, ps, other]

doi 10.1103/PhysRevB.84.205407

Straining Graphene by Chemical Vapour Deposition Growth on Copper

Authors: Victor Yu, Eric Whiteway, Jesse Maassen, Michael Hilke

Abstract: Strain can be used as an alternate way to tune the electronic properties of graphene. Here we demonstrate that it is possible to tune the uniform strain of graphene simply by changing the chemical vapor deposition growth temperature of graphene on copper. Due to the cooling of the graphene on copper system, we can induce a uniform compressive strain on graphene. The strain is analyzed by Raman spectroscopy, where a shift in the 2D peak is observed and compared to our ab initio calculations of the graphene on copper system as a function of strain.

Submitted 6 January, 2011; originally announced January 2011.

Comments: 5 pages, 5 figures

Journal ref: Phys. Rev. B 84, 205407 (2011)

arXiv:1011.5712 [pdf, ps, other]

Magneto-transport of large CVD-grown graphene

Authors: Eric Whiteway, Victor Yu, Josianne Lefebvre, Robert Gagnon, Michael Hilke

Abstract: We present magnetoresistance measurements on large scale monolayer graphene grown by chemical vapor deposition (CVD) on copper. The graphene layer was transferred onto SiO2/Si via PMMA and thermal release tape for transport measurements. The resulting centimeter-sized graphene samples were measured at temperatures down to 30 mK in a magnetic field. We observe a very sharp peak in resistance at zero field, which is well fitted by weak localization theory. The samples exhibit conductance fluctuations symmetric in field, which are attributed to ensemble averaged conductance fluctuations due to large scale inhomogeneities consistent with the grain boundaries of copper during the CVD growth.

Submitted 17 October, 2011; v1 submitted 26 November, 2010; originally announced November 2010.

Comments: 4 figures, 1 table

arXiv:0906.4056 [pdf, ps, other]

doi 10.1063/1.3247967

Large contrast enhancement of graphene monolayers by angle detection

Authors: Victor Yu, Michael Hilke

Abstract: Exfoliated graphene monolayers are identified by optical inspection. In order to improve the monolayer detection, we investigate the angle dependence of the optical contrast of graphene on a 90 nm SiO$_2$/Si substrate. We observe a significant enhancement of the visibility of graphene by changing the polarization and the angle of optical incidence. This method can be used to detect graphene on new substrate designs such as GaAs/AlAs based materials, which have a much cleaner surface.

Submitted 22 June, 2009; originally announced June 2009.

Comments: 3 pages, 6 figures

Journal ref: Applied Physics Letters 95, 151904 (2009)

arXiv:0708.3474 [pdf, ps, other]

Nonlinear control of chaotic walking of atoms in an optical lattice

Authors: Argonov V. Yu, S. V. Prants

Abstract: Centre-of-mass atomic motion in an optical lattice near the resonance is shown to be a chaotic walking due to the interplay between coherent internal atomic dynamics and spontaneous emission. Statistical properties of chaotic atomic motion can be controlled by a single parameter, the detuning between the atomic transition frequency and the laser frequency. We derive a Fokker-Planck equation in the energetic space to describe the atomic transport near the resonance and demonstrate numerically how to manipulate the atomic motion by varying the detuning.

Submitted 16 December, 2007; v1 submitted 26 August, 2007; originally announced August 2007.

Comments: 6 pages, 4 figures

Journal ref: Argonov V. Yu, Prants S. V. Nonlinear control of chaotic walking of atoms in an optical lattice. Europhysics Letters. 2008. Vol. 81. Art. no. 24003
