-
Visualizing the microscopic origins of topology in twisted molybdenum ditelluride
Authors:
Ellis Thompson,
Keng Tou Chu,
Florie Mesple,
Xiao-Wei Zhang,
Chaowei Hu,
Yuzhou Zhao,
Heonjoon Park,
Jiaqi Cai,
Eric Anderson,
Kenji Watanabe,
Takashi Taniguchi,
Jihui Yang,
Jiun-Haw Chu,
Xiaodong Xu,
Ting Cao,
Di Xiao,
Matthew Yankowitz
Abstract:
In moiré materials with flat electronic bands and suitable quantum geometry, strong correlations can give rise to novel topological states of matter. The nontrivial band topology of twisted molybdenum ditelluride (tMoTe$_2$) -- responsible for its fractional quantum anomalous Hall (FQAH) states -- is predicted to arise from a layer-pseudospin skyrmion lattice. Tracing the layer polarization of wav…
▽ More
In moiré materials with flat electronic bands and suitable quantum geometry, strong correlations can give rise to novel topological states of matter. The nontrivial band topology of twisted molybdenum ditelluride (tMoTe$_2$) -- responsible for its fractional quantum anomalous Hall (FQAH) states -- is predicted to arise from a layer-pseudospin skyrmion lattice. Tracing the layer polarization of wavefunctions within the moiré unit cell can thus offer crucial insights into the band topology. Here, we use scanning tunneling microscopy and spectroscopy (STM/S) to probe the layer-pseudospin skyrmion textures of tMoTe$_2$. We do this by simultaneously visualizing the moiré lattice structure and the spatial localization of its electronic states. We find that the wavefunctions associated with the topological flat bands exhibit a spatially-dependent layer polarization within the moiré unit cell. This is in excellent agreement with our theoretical modeling, thereby revealing a direct microscopic connection between the structural properties of tMoTe$_2$ and its band topology. Our work enables new pathways for engineering FQAH states with strain, as well as future STM studies of the intertwined correlated and topological states arising in gate-tunable devices.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
numaPTE: Managing Page-Tables and TLBs on NUMA Systems
Authors:
Bin Gao,
Qingxuan Kang,
Hao-Wei Tee,
Kyle Timothy Ng Chu,
Alireza Sanaee,
Djordje Jevdjic
Abstract:
Memory management operations that modify page-tables, typically performed during memory allocation/deallocation, are infamous for their poor performance in highly threaded applications, largely due to process-wide TLB shootdowns that the OS must issue due to the lack of hardware support for TLB coherence. We study these operations in NUMA settings, where we observe up to 40x overhead for basic ope…
▽ More
Memory management operations that modify page-tables, typically performed during memory allocation/deallocation, are infamous for their poor performance in highly threaded applications, largely due to process-wide TLB shootdowns that the OS must issue due to the lack of hardware support for TLB coherence. We study these operations in NUMA settings, where we observe up to 40x overhead for basic operations such as munmap or mprotect. The overhead further increases if page-table replication is used, where complete coherent copies of the page-tables are maintained across all NUMA nodes. While eager system-wide replication is extremely effective at localizing page-table reads during address translation, we find that it creates additional penalties upon any page-table changes due to the need to maintain all replicas coherent.
In this paper, we propose a novel page-table management mechanism, called numaPTE, to enable transparent, on-demand, and partial page-table replication across NUMA nodes in order to perform address translation locally, while avoiding the overheads and scalability issues of system-wide full page-table replication. We then show that numaPTE's precise knowledge of page-table sharers can be leveraged to significantly reduce the number of TLB shootdowns issued upon any memory-management operation. As a result, numaPTE not only avoids replication-related slowdowns, but also provides significant speedup over the baseline on memory allocation/deallocation and access control operations. We implement numaPTEin Linux on x86_64, evaluate it on 4- and 8-socket systems, and show that numaPTE achieves the full benefits of eager page-table replication on a wide range of applications, while also achieving a 12% and 36% runtime improvement on Webserver and Memcached respectively due to a significant reduction in TLB shootdowns.
△ Less
Submitted 27 January, 2024;
originally announced January 2024.
-
You Only Spike Once: Improving Energy-Efficient Neuromorphic Inference to ANN-Level Accuracy
Authors:
Srivatsa P,
Kyle Timothy Ng Chu,
Burin Amornpaisannon,
Yaswanth Tavva,
Venkata Pavan Kumar Miriyala,
Jibin Wu,
Malu Zhang,
Haizhou Li,
Trevor E. Carlson
Abstract:
In the past decade, advances in Artificial Neural Networks (ANNs) have allowed them to perform extremely well for a wide range of tasks. In fact, they have reached human parity when performing image recognition, for example. Unfortunately, the accuracy of these ANNs comes at the expense of a large number of cache and/or memory accesses and compute operations. Spiking Neural Networks (SNNs), a type…
▽ More
In the past decade, advances in Artificial Neural Networks (ANNs) have allowed them to perform extremely well for a wide range of tasks. In fact, they have reached human parity when performing image recognition, for example. Unfortunately, the accuracy of these ANNs comes at the expense of a large number of cache and/or memory accesses and compute operations. Spiking Neural Networks (SNNs), a type of neuromorphic, or brain-inspired network, have recently gained significant interest as power-efficient alternatives to ANNs, because they are sparse, accessing very few weights, and typically only use addition operations instead of the more power-intensive multiply-and-accumulate (MAC) operations. The vast majority of neuromorphic hardware designs support rate-encoded SNNs, where the information is encoded in spike rates. Rate-encoded SNNs could be seen as inefficient as an encoding scheme because it involves the transmission of a large number of spikes. A more efficient encoding scheme, Time-To-First-Spike (TTFS) encoding, encodes information in the relative time of arrival of spikes. While TTFS-encoded SNNs are more efficient than rate-encoded SNNs, they have, up to now, performed poorly in terms of accuracy compared to previous methods. Hence, in this work, we aim to overcome the limitations of TTFS-encoded neuromorphic systems. To accomplish this, we propose: (1) a novel optimization algorithm for TTFS-encoded SNNs converted from ANNs and (2) a novel hardware accelerator for TTFS-encoded SNNs, with a scalable and low-power design. Overall, our work in TTFS encoding and training improves the accuracy of SNNs to achieve state-of-the-art results on MNIST MLPs, while reducing power consumption by 1.46$\times$ over the state-of-the-art neuromorphic hardware.
△ Less
Submitted 8 November, 2020; v1 submitted 3 June, 2020;
originally announced June 2020.
-
Boosting the Accuracy of Finite Difference Schemes via Optimal Time Step Selection and Non-Iterative Defect Correction
Authors:
Kevin T. Chu
Abstract:
In this article, we present a simple technique for boosting the order of accuracy of finite difference schemes for time dependent partial differential equations by optimally selecting the time step used to advance the numerical solution and adding defect correction terms in a non-iterative manner. The power of the technique is its ability to extract as much accuracy as possible from existing fin…
▽ More
In this article, we present a simple technique for boosting the order of accuracy of finite difference schemes for time dependent partial differential equations by optimally selecting the time step used to advance the numerical solution and adding defect correction terms in a non-iterative manner. The power of the technique is its ability to extract as much accuracy as possible from existing finite difference schemes with minimal additional effort. Through straightforward numerical analysis arguments, we explain the origin of the boost in accuracy and estimate the computational cost of the resulting numerical method. We demonstrate the utility of optimal time step (OTS) selection combined with non-iterative defect correction (NIDC) on several different types of finite difference schemes for a wide array of classical linear and semilinear PDEs in one and more space dimensions on both regular and irregular domains.
△ Less
Submitted 25 May, 2009; v1 submitted 18 November, 2008;
originally announced November 2008.
-
Surface Conservation Laws at Microscopically Diffuse Interfaces
Authors:
Kevin T. Chu,
Martin Z. Bazant
Abstract:
In studies of interfaces with dynamic chemical composition, bulk and interfacial quantities coupled via surface conservation laws of excess surface quantities. While this approach is for microscopically sharp interfaces, its applicability in the context of microscopically diffuse is less theoretically well-established. Furthermore, surface conservation laws (and interfacial in general) are often…
▽ More
In studies of interfaces with dynamic chemical composition, bulk and interfacial quantities coupled via surface conservation laws of excess surface quantities. While this approach is for microscopically sharp interfaces, its applicability in the context of microscopically diffuse is less theoretically well-established. Furthermore, surface conservation laws (and interfacial in general) are often derived phenomenologically rather than systematically. In this article, provide a mathematically rigorous justification for surface conservation laws at diffuse interfaces on an asymptotic analysis of transport processes in the boundary layer and derive general the surface and normal fluxes that appear in surface conservation laws. Next, we use non-thermodynamics to formulate surface conservation laws in terms of chemical potentials a method for systematically deriving the structure of the interfacial layer. Finally, we conservation laws for a few examples from diffusive and electrochemical transport.
△ Less
Submitted 5 February, 2007;
originally announced February 2007.
-
A Direct Matrix Method for Computing Analytical Jacobians of Discretized Nonlinear Integro-differential Equations
Authors:
Kevin T. Chu
Abstract:
In this pedagogical article, we present a simple direct matrix method for analytically computing the Jacobian of nonlinear algebraic equations that arise from the discretization of nonlinear integro-differential equations. The method is based on a formulation of the discretized equations in vector form using only matrix-vector products and component-wise operations. By applying simple matrix-bas…
▽ More
In this pedagogical article, we present a simple direct matrix method for analytically computing the Jacobian of nonlinear algebraic equations that arise from the discretization of nonlinear integro-differential equations. The method is based on a formulation of the discretized equations in vector form using only matrix-vector products and component-wise operations. By applying simple matrix-based differentiation rules, the matrix form of the analytical Jacobian can be calculated with little more difficulty than that required when computing derivatives in single-variable calculus. After describing the direct matrix method, we present numerical experiments demonstrating the computational performance of the method, discuss its connection to the Newton-Kantorovich method, and apply it to illustrative 1D and 2D example problems. MATLAB code is provided to demonstrate the low code complexity required by the method.
△ Less
Submitted 10 December, 2008; v1 submitted 5 February, 2007;
originally announced February 2007.
-
A Variational Level Set Approach for Surface Area Minimization of Triply Periodic Surfaces
Authors:
Youngjean Jung,
Kevin T. Chu,
Salvatore Torquato
Abstract:
In this paper, we study triply periodic surfaces with minimal surface area under a constraint in the volume fraction of the regions (phases) that the surface separates. Using a variational level set method formulation, we present a theoretical characterization of and a numerical algorithm for computing these surfaces. We use our theoretical and computational formulation to study the optimality o…
▽ More
In this paper, we study triply periodic surfaces with minimal surface area under a constraint in the volume fraction of the regions (phases) that the surface separates. Using a variational level set method formulation, we present a theoretical characterization of and a numerical algorithm for computing these surfaces. We use our theoretical and computational formulation to study the optimality of the Schwartz P, Schwartz D, and Schoen G surfaces when the volume fractions of the two phases are equal and explore the properties of optimal structures when the volume fractions of the two phases not equal. Due to the computational cost of the fully, three-dimensional shape optimization problem, we implement our numerical simulations using a parallel level set method software package.
△ Less
Submitted 14 June, 2006;
originally announced June 2006.
-
Nonlinear electrochemical relaxation around conductors
Authors:
Kevin T. Chu,
Martin Z. Bazant
Abstract:
We analyze the simplest problem of electrochemical relaxation in more than one dimension - the response of an uncharged, ideally polarizable metallic sphere (or cylinder) in a symmetric, binary electrolyte to a uniform electric field. In order to go beyond the circuit approximation for thin double layers, our analysis is based on the Poisson-Nernst-Planck (PNP) equations of dilute solution theor…
▽ More
We analyze the simplest problem of electrochemical relaxation in more than one dimension - the response of an uncharged, ideally polarizable metallic sphere (or cylinder) in a symmetric, binary electrolyte to a uniform electric field. In order to go beyond the circuit approximation for thin double layers, our analysis is based on the Poisson-Nernst-Planck (PNP) equations of dilute solution theory. Unlike most previous studies, however, we focus on the nonlinear regime, where the applied voltage across the conductor is larger than the thermal voltage. In such strong electric fields, the classical model predicts that the double layer adsorbs enough ions to produce bulk concentration gradients and surface conduction. Our analysis begins with a general derivation of surface conservation laws in the thin double-layer limit, which provide effective boundary conditions on the quasi-neutral bulk. We solve the resulting nonlinear partial differential equations numerically for strong fields and also perform a time-dependent asymptotic analysis for weaker fields, where bulk diffusion and surface conduction arise as first-order corrections. We also derive various dimensionless parameters comparing surface to bulk transport processes, which generalize the Bikerman-Dukhin number. Our results have basic relevance for double-layer charging dynamics and nonlinear electrokinetics in the ubiquitous PNP approximation.
△ Less
Submitted 24 May, 2006; v1 submitted 8 March, 2006;
originally announced March 2006.
-
Electrochemical Thin Films At and Above the Classical Limiting Current
Authors:
Kevin T. Chu,
Martin Z. Bazant
Abstract:
We study a model electrochemical thin film at dc currents exceeding the classical diffusion-limited value. The mathematical problem involves the steady Poisson-Nernst-Planck equations for a binary electrolyte with nonlinear boundary conditions for reaction kinetics and Stern-layer capacitance, as well as an integral constraint on the number of anions. At the limiting current, we find a nested bo…
▽ More
We study a model electrochemical thin film at dc currents exceeding the classical diffusion-limited value. The mathematical problem involves the steady Poisson-Nernst-Planck equations for a binary electrolyte with nonlinear boundary conditions for reaction kinetics and Stern-layer capacitance, as well as an integral constraint on the number of anions. At the limiting current, we find a nested boundary layer structure at the cathode, which is required by the reaction boundary condition. Above the limiting current, a depletion of anions generally characterizes the cathode side of the cell. In this regime, we derive leading-order asymptotic approximations for the (i) classical bulk space-charge layer and (ii) another, nested highly charged boundary layer at the cathode. The former involves an exact solution to the Nernst-Planck equations for a single, unscreened ionic species, which may apply more generally to Faradaic conduction through very thin insulating films. By matching expansions, we derive current-voltage relations well into the space-charge regime. Throughout our analysis, we emphasize the strong influence of the Stern-layer capacitance on cell behavior.
△ Less
Submitted 16 June, 2004;
originally announced June 2004.
-
Current-Voltage Relations for Electrochemical Thin Films
Authors:
Martin Z. Bazant,
Kevin T. Chu,
B. J. Bayly
Abstract:
The dc response of an electrochemical thin film, such as the separator in a micro-battery, is analyzed by solving the Poisson-Nernst-Planck equations, subject to boundary conditions appropriate for an electrolytic/galvanic cell. The model system consists of a binary electrolyte between parallel-plate electrodes, each possessing a compact Stern layer, which mediates Faradaic reactions with nonlin…
▽ More
The dc response of an electrochemical thin film, such as the separator in a micro-battery, is analyzed by solving the Poisson-Nernst-Planck equations, subject to boundary conditions appropriate for an electrolytic/galvanic cell. The model system consists of a binary electrolyte between parallel-plate electrodes, each possessing a compact Stern layer, which mediates Faradaic reactions with nonlinear Butler-Volmer kinetics. Analytical results are obtained by matched asymptotic expansions in the limit of thin double layers and compared with full numerical solutions. The analysis shows that (i) decreasing the system size relative to the Debye screening length decreases the voltage of the cell and allows currents higher than the classical diffusion-limited current; (ii) finite reaction rates lead to the important possibility of a reaction-limited current; (iii) the Stern-layer capacitance is critical for allowing the cell to achieve currents above the reaction-limited current; and (iv) all polarographic (current-voltage) curves tend to the same limit as reaction kinetics become fast. Dimensional analysis, however, shows that ``fast'' reactions tend to become ``slow'' with decreasing system size, so the nonlinear effects of surface polarization may dominate the dc response of thin films.
△ Less
Submitted 16 June, 2004;
originally announced June 2004.