Search | arXiv e-print repository

LLM2FEA: Discover Novel Designs with Generative Evolutionary Multitasking

Authors: Melvin Wong, Jiao Liu, Thiago Rios, Stefan Menzel, Yew Soon Ong

Abstract: The rapid research and development of generative artificial intelligence has enabled the generation of high-quality images, text, and 3D models from text prompts. This advancement impels an inquiry into whether these models can be leveraged to create digital artifacts for both creative and engineering applications. Drawing on innovative designs from other domains may be one answer to this question… ▽ More The rapid research and development of generative artificial intelligence has enabled the generation of high-quality images, text, and 3D models from text prompts. This advancement impels an inquiry into whether these models can be leveraged to create digital artifacts for both creative and engineering applications. Drawing on innovative designs from other domains may be one answer to this question, much like the historical practice of ``bionics", where humans have sought inspiration from nature's exemplary designs. This raises the intriguing possibility of using generative models to simultaneously tackle design tasks across multiple domains, facilitating cross-domain learning and resulting in a series of innovative design solutions. In this paper, we propose LLM2FEA as the first attempt to discover novel designs in generative models by transferring knowledge across multiple domains. By utilizing a multi-factorial evolutionary algorithm (MFEA) to drive a large language model, LLM2FEA integrates knowledge from various fields to generate prompts that guide the generative model in discovering novel and practical objects. Experimental results in the context of 3D aerodynamic design verify the discovery capabilities of the proposed LLM2FEA. The designs generated by LLM2FEA not only satisfy practicality requirements to a certain degree but also feature novel and aesthetically pleasing shapes, demonstrating the potential applications of LLM2FEA in discovery tasks. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2406.09143 [pdf, other]

Generative AI-based Prompt Evolution Engineering Design Optimization With Vision-Language Model

Authors: Melvin Wong, Thiago Rios, Stefan Menzel, Yew Soon Ong

Abstract: Engineering design optimization requires an efficient combination of a 3D shape representation, an optimization algorithm, and a design performance evaluation method, which is often computationally expensive. We present a prompt evolution design optimization (PEDO) framework contextualized in a vehicle design scenario that leverages a vision-language model for penalizing impractical car designs sy… ▽ More Engineering design optimization requires an efficient combination of a 3D shape representation, an optimization algorithm, and a design performance evaluation method, which is often computationally expensive. We present a prompt evolution design optimization (PEDO) framework contextualized in a vehicle design scenario that leverages a vision-language model for penalizing impractical car designs synthesized by a generative model. The backbone of our framework is an evolutionary strategy coupled with an optimization objective function that comprises a physics-based solver and a vision-language model for practical or functional guidance in the generated car designs. In the prompt evolutionary search, the optimizer iteratively generates a population of text prompts, which embed user specifications on the aerodynamic performance and visual preferences of the 3D car designs. Then, in addition to the computational fluid dynamics simulations, the pre-trained vision-language model is used to penalize impractical designs and, thus, foster the evolutionary algorithm to seek more viable designs. Our investigations on a car design optimization problem show a wide spread of potential car designs generated at the early phase of the search, which indicates a good diversity of designs in the initial populations, and an increase of over 20\% in the probability of generating practical designs compared to a baseline framework without using a vision-language model. Visual inspection of the designs against the performance results demonstrates prompt evolution as a very promising paradigm for finding novel designs with good optimization performance while providing ease of use in specifying design specifications and preferences via a natural language interface. △ Less

Submitted 14 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

Comments: Accepted and to be published in IEEE Congress on Evolutionary Computation (CEC) 2024. Copyright 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses

arXiv:2406.00812 [pdf, other]

Covariance-Adaptive Sequential Black-box Optimization for Diffusion Targeted Generation

Authors: Yueming Lyu, Kim Yong Tan, Yew Soon Ong, Ivor W. Tsang

Abstract: Diffusion models have demonstrated great potential in generating high-quality content for images, natural language, protein domains, etc. However, how to perform user-preferred targeted generation via diffusion models with only black-box target scores of users remains challenging. To address this issue, we first formulate the fine-tuning of the targeted reserve-time stochastic differential equatio… ▽ More Diffusion models have demonstrated great potential in generating high-quality content for images, natural language, protein domains, etc. However, how to perform user-preferred targeted generation via diffusion models with only black-box target scores of users remains challenging. To address this issue, we first formulate the fine-tuning of the targeted reserve-time stochastic differential equation (SDE) associated with a pre-trained diffusion model as a sequential black-box optimization problem. Furthermore, we propose a novel covariance-adaptive sequential optimization algorithm to optimize cumulative black-box scores under unknown transition dynamics. Theoretically, we prove a $O(\frac{d^2}{\sqrt{T}})$ convergence rate for cumulative convex functions without smooth and strongly convex assumptions. Empirically, experiments on both numerical test problems and target-guided 3D-molecule generation tasks show the superior performance of our method in achieving better target scores. △ Less

Submitted 8 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

arXiv:2405.16835 [pdf]

Superionic surface Li-ion transport in carbonaceous materials

Authors: Jianbin Zhou, Shen Wang, Chaoshan Wu, Ji Qi, Hongli Wan, Shen Lai, Shijie Feng, Tsz Wai Ko, Zhaohui Liang, Ke Zhou, Nimrod Harpak, Nick Solan, Mengchen Liu, Zeyu Hui, Paulina J. Ai, Kent Griffith, Chunsheng Wang, Shyue ** Ong, Yan Yao, ** Liu

Abstract: Unlike Li-ion transport in the bulk of carbonaceous materials, little is known about Li-ion diffusion on their surface. In this study, we have discovered an ultra-fast Li-ion transport phenomenon on the surface of carbonaceous materials, particularly when they have limited Li insertion capacity along with a high surface area. This is exemplified by a carbon black, Ketjen Black (KB). An ionic condu… ▽ More Unlike Li-ion transport in the bulk of carbonaceous materials, little is known about Li-ion diffusion on their surface. In this study, we have discovered an ultra-fast Li-ion transport phenomenon on the surface of carbonaceous materials, particularly when they have limited Li insertion capacity along with a high surface area. This is exemplified by a carbon black, Ketjen Black (KB). An ionic conductivity of 18.1 mS cm-1 at room temperature is observed, far exceeding most solid-state ion conductors. Theoretical calculations reveal a low diffusion barrier for the surface Li species. The species is also identified as Li*, which features a partial positive charge. As a result, lithiated KB functions effectively as an interlayer between Li and solid-state electrolytes (SSE) to mitigate dendrite growth and cell shorting. This function is found to be electrolyte agnostic, effective for both sulfide and halide SSEs. Further, lithiated KB can act as a high-performance mixed ion/electron conductor that is thermodynamically stable at potentials near Li metal. A graphite anode mixed with KB instead of a solid electrolyte demonstrates full utilization with a capacity retention of ~85% over 300 cycles. The discovery of this surface-mediated ultra-fast Li-ion transport mechanism provides new directions for the design of solid-state ion conductors and solid-state batteries. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: 21 pages, 6 figures

arXiv:2405.13048 [pdf]

Human-Generative AI Collaborative Problem Solving Who Leads and How Students Perceive the Interactions

Authors: Gaoxia Zhu, Vidya Sudarshan, Jason Fok Kow, Yew Soon Ong

Abstract: This research investigates distinct human-generative AI collaboration types and students' interaction experiences when collaborating with generative AI (i.e., ChatGPT) for problem-solving tasks and how these factors relate to students' sense of agency and perceived collaborative problem solving. By analyzing the surveys and reflections of 79 undergraduate students, we identified three human-genera… ▽ More This research investigates distinct human-generative AI collaboration types and students' interaction experiences when collaborating with generative AI (i.e., ChatGPT) for problem-solving tasks and how these factors relate to students' sense of agency and perceived collaborative problem solving. By analyzing the surveys and reflections of 79 undergraduate students, we identified three human-generative AI collaboration types: even contribution, human leads, and AI leads. Notably, our study shows that 77.21% of students perceived they led or had even contributed to collaborative problem-solving when collaborating with ChatGPT. On the other hand, 15.19% of the human participants indicated that the collaborations were led by ChatGPT, indicating a potential tendency for students to rely on ChatGPT. Furthermore, 67.09% of students perceived their interaction experiences with ChatGPT to be positive or mixed. We also found a positive correlation between positive interaction experience and a sense of positive agency. The results of this study contribute to our understanding of the collaboration between students and generative AI and highlight the need to study further why some students let ChatGPT lead collaborative problem-solving and how to enhance their interaction experience through curriculum and technology design. △ Less

Submitted 18 May, 2024; originally announced May 2024.

Comments: This paper appears at the IEEE Conference on Artificial Intelligence (CAI) 2024

arXiv:2405.07464 [pdf]

Atomic-scale tunable phonon transport at tailored grain boundaries

Authors: Xiaowang Wang, Chaitanya A. Gadre, Runqing Yang, Wanjuan Zou, Xing Bin, Christopher Addiego, Toshihiro Aoki, Yujie Quan, Wei-Tao Peng, Yifeng Huang, Chaojie Du, Mingjie Xu, Xingxu Yan, Ruqian Wu, Shyue ** Ong, Bolin Liao, Penghui Cao, Xiaoqing Pan

Abstract: Manipulating thermal properties in materials has been of fundamental importance for advancing innovative technologies. Heat carriers such as phonons are impeded by breaking crystal symmetry or periodicity. Notable methods of impeding the phonon propagation include varying the density of defects, interfaces, and nanostructures, as well as changing composition. However, a robust link between the ind… ▽ More Manipulating thermal properties in materials has been of fundamental importance for advancing innovative technologies. Heat carriers such as phonons are impeded by breaking crystal symmetry or periodicity. Notable methods of impeding the phonon propagation include varying the density of defects, interfaces, and nanostructures, as well as changing composition. However, a robust link between the individual nanoscale defect structures, phonon states, and macroscopic thermal conductivity is lacking. Here we reveal from nanoscale structure-phonon mechanisms on how the grain boundary (GB) tilt and twist angles fundamentally drive the changes in atom rearrangements, exotic vibrational states, and finally macroscopic heat transport at different bicrystal strontium titanate GBs using emerging atomic resolution vibrational spectroscopy. The 10 deg and 22 deg tilt GBs exhibit reduced phonon populations by 54% and 16% compared to the bulk value, respectively, consistent with measured thermal conductivities. A tiny twist angle further introduces a fine and local tunning of thermal conductivity by introducing twist induced defects periodically embedded with the tilt induced GB defects. Our results demonstrate that varying the tilt angle coarsely modifies the phonon population along entire GB while varying the twist angle incurs a finer adjustment at periodic locations on the GB. Our study offers a systematic approach to understanding and manipulating cross GB thermal transport of arbitrary GBs predictably and precisely. △ Less

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2403.16645 [pdf]

Virtual Co-Pilot: Multimodal Large Language Model-enabled Quick-access Procedures for Single Pilot Operations

Authors: Fan Li, Shanshan Feng, Yuqi Yan, Ching-Hung Lee, Yew Soon Ong

Abstract: Advancements in technology, pilot shortages, and cost pressures are driving a trend towards single-pilot and even remote operations in aviation. Considering the extensive workload and huge risks associated with single-pilot operations, the development of a Virtual Co-Pilot (V-CoP) is expected to be a potential way to ensure aviation safety. This study proposes a V-CoP concept and explores how huma… ▽ More Advancements in technology, pilot shortages, and cost pressures are driving a trend towards single-pilot and even remote operations in aviation. Considering the extensive workload and huge risks associated with single-pilot operations, the development of a Virtual Co-Pilot (V-CoP) is expected to be a potential way to ensure aviation safety. This study proposes a V-CoP concept and explores how humans and virtual assistants can effectively collaborate. A preliminary case study is conducted to explore a critical role of V-CoP, namely automated quick procedures searching, using the multimodal large language model (LLM). The LLM-enabled V-CoP integrates the pilot instruction and real-time cockpit instrumental data to prompt applicable aviation manuals and operation procedures. The results showed that the LLM-enabled V-CoP achieved high accuracy in situational analysis and effective retrieval of procedure information. The results showed that the LLM-enabled V-CoP achieved high accuracy in situational analysis (90.5%) and effective retrieval of procedure information (86.5%). The proposed V-CoP is expected to provide a foundation for future virtual intelligent assistant development, improve the performance of single pilots, and reduce the risk of human errors in aviation. △ Less

Submitted 25 March, 2024; originally announced March 2024.

Comments: 10 pages,7 figures

arXiv:2403.08255 [pdf, other]

Make Me Happier: Evoking Emotions Through Image Diffusion Models

Authors: Qing Lin, **gfeng Zhang, Yew Soon Ong, Mengmi Zhang

Abstract: Despite the rapid progress in image generation, emotional image editing remains under-explored. The semantics, context, and structure of an image can evoke emotional responses, making emotional image editing techniques valuable for various real-world applications, including treatment of psychological disorders, commercialization of products, and artistic design. For the first time, we present a no… ▽ More Despite the rapid progress in image generation, emotional image editing remains under-explored. The semantics, context, and structure of an image can evoke emotional responses, making emotional image editing techniques valuable for various real-world applications, including treatment of psychological disorders, commercialization of products, and artistic design. For the first time, we present a novel challenge of emotion-evoked image generation, aiming to synthesize images that evoke target emotions while retaining the semantics and structures of the original scenes. To address this challenge, we propose a diffusion model capable of effectively understanding and editing source images to convey desired emotions and sentiments. Moreover, due to the lack of emotion editing datasets, we provide a unique dataset consisting of 340,000 pairs of images and their emotion annotations. Furthermore, we conduct human psychophysics experiments and introduce four new evaluation metrics to systematically benchmark all the methods. Experimental results demonstrate that our method surpasses all competitive baselines. Our diffusion model is capable of identifying emotional cues from original images, editing images that elicit desired emotions, and meanwhile, preserving the semantic structure of the original images. All code, model, and dataset will be made public. △ Less

Submitted 27 May, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

arXiv:2403.07331 [pdf, other]

LIST: Learning to Index Spatio-Textual Data for Embedding based Spatial Keyword Queries

Authors: Ziqi Yin, Shanshan Feng, Shang Liu, Gao Cong, Yew Soon Ong, Bin Cui

Abstract: With the proliferation of spatio-textual data, Top-k KNN spatial keyword queries (TkQs), which return a list of objects based on a ranking function that evaluates both spatial and textual relevance, have found many real-life applications. Existing geo-textual indexes for TkQs use traditional retrieval models like BM25 to compute text relevance and usually exploit a simple linear function to comput… ▽ More With the proliferation of spatio-textual data, Top-k KNN spatial keyword queries (TkQs), which return a list of objects based on a ranking function that evaluates both spatial and textual relevance, have found many real-life applications. Existing geo-textual indexes for TkQs use traditional retrieval models like BM25 to compute text relevance and usually exploit a simple linear function to compute spatial relevance, but its effectiveness is limited. To improve effectiveness, several deep learning models have recently been proposed, but they suffer severe efficiency issues. To the best of our knowledge, there are no efficient indexes specifically designed to accelerate the top-k search process for these deep learning models. To tackle these issues, we propose a novel technique, which Learns to Index the Spatio-Textual data for answering embedding based spatial keyword queries (called LIST). LIST is featured with two novel components. Firstly, we propose a lightweight and effective relevance model that is capable of learning both textual and spatial relevance. Secondly, we introduce a novel machine learning based Approximate Nearest Neighbor Search (ANNS) index, which utilizes a new learning-to-cluster technique to group relevant queries and objects together while separating irrelevant queries and objects. Two key challenges in building an effective and efficient index are the absence of high-quality labels and unbalanced clustering results. We develop a novel pseudo-label generation technique to address the two challenges. Experimental results show that LIST significantly outperforms state-of-the-art methods on effectiveness, with improvements up to 19.21% and 12.79% in terms of NDCG@1 and Recall@10, and is three orders of magnitude faster than the most effective baseline. △ Less

Submitted 18 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

arXiv:2403.05448 [pdf, other]

On Practicality of Using ARM TrustZone Trusted Execution Environment for Securing Programmable Logic Controllers

Authors: Zhiang Li, Daisuke Mashima, Wen Shei Ong, Ertem Esiner, Zbigniew Kalbarczyk, Ee-Chien Chang

Abstract: Programmable logic controllers (PLCs) are crucial devices for implementing automated control in various industrial control systems (ICS), such as smart power grids, water treatment systems, manufacturing, and transportation systems. Owing to their importance, PLCs are often the target of cyber attackers that are aiming at disrupting the operation of ICS, including the nation's critical infrastruct… ▽ More Programmable logic controllers (PLCs) are crucial devices for implementing automated control in various industrial control systems (ICS), such as smart power grids, water treatment systems, manufacturing, and transportation systems. Owing to their importance, PLCs are often the target of cyber attackers that are aiming at disrupting the operation of ICS, including the nation's critical infrastructure, by compromising the integrity of control logic execution. While a wide range of cybersecurity solutions for ICS have been proposed, they cannot counter strong adversaries with a foothold on the PLC devices, which could manipulate memory, I/O interface, or PLC logic itself. These days, many ICS devices in the market, including PLCs, run on ARM-based processors, and there is a promising security technology called ARM TrustZone, to offer a Trusted Execution Environment (TEE) on embedded devices. Envisioning that such a hardware-assisted security feature becomes available for ICS devices in the near future, this paper investigates the application of the ARM TrustZone TEE technology for enhancing the security of PLC. Our aim is to evaluate the feasibility and practicality of the TEE-based PLCs through the proof-of-concept design and implementation using open-source software such as OP-TEE and OpenPLC. Our evaluation assesses the performance and resource consumption in real-world ICS configurations, and based on the results, we discuss bottlenecks in the OP-TEE secure OS towards a large-scale ICS and desired changes for its application on ICS devices. Our implementation is made available to public for further study and research. △ Less

Submitted 8 March, 2024; originally announced March 2024.

Comments: To appear at ACM AsiaCCS 2024

arXiv:2402.09608 [pdf, other]

Exact, Fast and Expressive Poisson Point Processes via Squared Neural Families

Authors: Russell Tsuchida, Cheng Soon Ong, Dino Sejdinovic

Abstract: We introduce squared neural Poisson point processes (SNEPPPs) by parameterising the intensity function by the squared norm of a two layer neural network. When the hidden layer is fixed and the second layer has a single neuron, our approach resembles previous uses of squared Gaussian process or kernel methods, but allowing the hidden layer to be learnt allows for additional flexibility. In many cas… ▽ More We introduce squared neural Poisson point processes (SNEPPPs) by parameterising the intensity function by the squared norm of a two layer neural network. When the hidden layer is fixed and the second layer has a single neuron, our approach resembles previous uses of squared Gaussian process or kernel methods, but allowing the hidden layer to be learnt allows for additional flexibility. In many cases of interest, the integrated intensity function admits a closed form and can be computed in quadratic time in the number of hidden neurons. We enumerate a far more extensive number of such cases than has previously been discussed. Our approach is more memory and time efficient than naive implementations of squared or exponentiated kernel methods or Gaussian processes. Maximum likelihood and maximum a posteriori estimates in a reparameterisation of the final layer of the intensity function can be obtained by solving a (strongly) convex optimisation problem using projected gradient descent. We demonstrate SNEPPPs on real, and synthetic benchmarks, and provide a software implementation. https://github.com/RussellTsuchida/snefy △ Less

Submitted 14 February, 2024; originally announced February 2024.

Comments: AAAI 2024 camera ready submission

arXiv:2402.00572 [pdf, other]

doi 10.1039/D4DD00039K

Developments and applications of the OPTIMADE API for materials discovery, design, and data exchange

Authors: Matthew L. Evans, Johan Bergsma, Andrius Merkys, Casper W. Andersen, Oskar B. Andersson, Daniel Beltrán, Evgeny Blokhin, Tara M. Boland, Rubén Castañeda Balderas, Kamal Choudhary, Alberto Díaz Díaz, Rodrigo Domínguez García, Hagen Eckert, Kristjan Eimre, María Elena Fuentes Montero, Adam M. Krajewski, Jens Jørgen Mortensen, José Manuel Nápoles Duarte, Jacob Pietryga, Ji Qi, Felipe de Jesús Trejo Carrillo, Antanas Vaitkus, Jusong Yu, Adam Zettel, Pedro Baptista de Castro , et al. (34 additional authors not shown)

Abstract: The Open Databases Integration for Materials Design (OPTIMADE) application programming interface (API) empowers users with holistic access to a growing federation of databases, enhancing the accessibility and discoverability of materials and chemical data. Since the first release of the OPTIMADE specification (v1.0), the API has undergone significant development, leading to the upcoming v1.2 relea… ▽ More The Open Databases Integration for Materials Design (OPTIMADE) application programming interface (API) empowers users with holistic access to a growing federation of databases, enhancing the accessibility and discoverability of materials and chemical data. Since the first release of the OPTIMADE specification (v1.0), the API has undergone significant development, leading to the upcoming v1.2 release, and has underpinned multiple scientific studies. In this work, we highlight the latest features of the API format, accompanying software tools, and provide an update on the implementation of OPTIMADE in contributing materials databases. We end by providing several use cases that demonstrate the utility of the OPTIMADE API in materials research that continue to drive its ongoing development. △ Less

Submitted 5 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

arXiv:2311.07581 [pdf]

Crown ether decorated silicon photonics for safeguarding against lead poisoning

Authors: Luigi Ranno, Yong Zen Tan, Chi Siang Ong, Xin Guo, Khong Nee Koo, Xiang Li, Wanjun Wang, Samuel Serna, Chongyang Liu, Rusli, Callum G. Littlejohns, Graham T. Reed, Juejun Hu, Hong Wang, Jia Xu Brian Sia

Abstract: Lead (Pb2+) toxification in society is one of the most concerning public health crisis that remains unaddressed. The exposure to Pb2+ poisoning leads to a multitude of enduring health issues, even at the part-per-billion scale (ppb). Yet, public action dwarfs its impact. Pb2+ poisoning is estimated to account for 1 million deaths per year globally, which is in addition to its chronic impact on chi… ▽ More Lead (Pb2+) toxification in society is one of the most concerning public health crisis that remains unaddressed. The exposure to Pb2+ poisoning leads to a multitude of enduring health issues, even at the part-per-billion scale (ppb). Yet, public action dwarfs its impact. Pb2+ poisoning is estimated to account for 1 million deaths per year globally, which is in addition to its chronic impact on children. With their ring-shaped cavities, crown ethers are uniquely capable of selectively binding to specific ions. In this work, for the first time, the synergistic integration of highly-scalable silicon photonics, with crown ether amine conjugation via Fischer esterification in an environmentally-friendly fashion is demonstrated. This realises a photonic platform that enables the in-situ, highly-selective and quantitative detection of various ions. The development dispels the existing notion that Fischer esterification is restricted to organic compounds, laying the ground for subsequent amine conjugation for various crown ethers. In this work, the platform is engineered for Pb2+ detection, demonstrating a large dynamic detection range of 1 - 262000 ppb with high selectivity against a wide range of relevant ions. These results indicate the potential for the pervasive implementation of the technology to safeguard against ubiquitous lead poisoning in our society. △ Less

Submitted 31 October, 2023; originally announced November 2023.

arXiv:2310.08103 [pdf, other]

Radio Galaxy Zoo: tagging radio subjects using text

Authors: Dawei Chen, Vinay Kerai, Matthew J. Alger, O. Ivy Wong, Cheng Soon Ong

Abstract: RadioTalk is a communication platform that enabled members of the Radio Galaxy Zoo (RGZ) citizen science project to engage in discussion threads and provide further descriptions of the radio subjects they were observing in the form of tags and comments. It contains a wealth of auxiliary information which is useful for the morphology identification of complex and extended radio sources. In this pap… ▽ More RadioTalk is a communication platform that enabled members of the Radio Galaxy Zoo (RGZ) citizen science project to engage in discussion threads and provide further descriptions of the radio subjects they were observing in the form of tags and comments. It contains a wealth of auxiliary information which is useful for the morphology identification of complex and extended radio sources. In this paper, we present this new dataset, and for the first time in radio astronomy, we combine text and images to automatically classify radio galaxies using a multi-modal learning approach. We found incorporating text features improved classification performance which demonstrates that text annotations are rare but valuable sources of information for classifying astronomical sources, and suggests the importance of exploiting multi-modal information in future citizen science projects. We also discovered over 10,000 new radio sources beyond the RGZ-DR1 catalogue in this dataset. △ Less

Submitted 12 October, 2023; originally announced October 2023.

Comments: 14 pages, 9 figures, accepted for publication in PASA

arXiv:2310.07049 [pdf, other]

Robust Machine Learning Inference from X-ray Absorption Near Edge Spectra through Featurization

Authors: Yiming Chen, Chi Chen, Inhui Hwang, Michael J. Davis, Wanli Yang, Chengjun Sun, Shyue ** Ong, Maria K. Y. Chan

Abstract: X-ray absorption spectroscopy (XAS) is a commonly-employed technique for characterizing functional materials. In particular, x-ray absorption near edge spectra (XANES) encodes local coordination and electronic information and machine learning approaches to extract this information is of significant interest. To date, most ML approaches for XANES have primarily focused on using the raw spectral int… ▽ More X-ray absorption spectroscopy (XAS) is a commonly-employed technique for characterizing functional materials. In particular, x-ray absorption near edge spectra (XANES) encodes local coordination and electronic information and machine learning approaches to extract this information is of significant interest. To date, most ML approaches for XANES have primarily focused on using the raw spectral intensities as input, overlooking the potential benefits of incorporating spectral transformations and dimensionality reduction techniques into ML predictions. In this work, we focused on systematically comparing the impact of different featurization methods on the performance of ML models for XAS analysis. We evaluated the classification and regression capabilities of these models on computed datasets and validated their performance on previously unseen experimental datasets. Our analysis revealed an intriguing discovery: the cumulative distribution function (CDF) feature achieves both high prediction accuracy and exceptional transferability. This remarkably robust performance can be attributed to its tolerance to horizontal shifts in spectra, which is crucial when validating models using experimental data. While this work exclusively focuses on XANES analysis, we anticipate that the methodology presented here will hold promise as a versatile asset to the broader spectroscopy community. △ Less

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2309.13042 [pdf, other]

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

Authors: Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy

Abstract: We present MosaicFusion, a simple yet effective diffusion-based data augmentation approach for large vocabulary instance segmentation. Our method is training-free and does not rely on any label supervision. Two key designs enable us to employ an off-the-shelf text-to-image diffusion model as a useful dataset generator for object instances and mask annotations. First, we divide an image canvas into… ▽ More We present MosaicFusion, a simple yet effective diffusion-based data augmentation approach for large vocabulary instance segmentation. Our method is training-free and does not rely on any label supervision. Two key designs enable us to employ an off-the-shelf text-to-image diffusion model as a useful dataset generator for object instances and mask annotations. First, we divide an image canvas into several regions and perform a single round of diffusion process to generate multiple instances simultaneously, conditioning on different text prompts. Second, we obtain corresponding instance masks by aggregating cross-attention maps associated with object prompts across layers and diffusion time steps, followed by simple thresholding and edge-aware refinement processing. Without bells and whistles, our MosaicFusion can produce a significant amount of synthetic labeled data for both rare and novel categories. Experimental results on the challenging LVIS long-tailed and open-vocabulary benchmarks demonstrate that MosaicFusion can significantly improve the performance of existing instance segmentation models, especially for rare and novel categories. Code will be released at https://github.com/Jiahao000/MosaicFusion. △ Less

Submitted 22 September, 2023; originally announced September 2023.

Comments: GitHub: https://github.com/Jiahao000/MosaicFusion

arXiv:2307.13710 [pdf, other]

Robust Training of Machine Learning Interatomic Potentials with Dimensionality Reduction and Stratified Sampling

Authors: Ji Qi, Tsz Wai Ko, Brandon C. Wood, Tuan Anh Pham, Shyue ** Ong

Abstract: Machine learning interatomic potentials (MLIPs) enable the accurate simulation of materials at larger sizes and time scales, and play increasingly important roles in the computational understanding and design of materials. However, MLIPs are only as accurate and robust as the data they are trained on. In this work, we present DImensionality-Reduced Encoded Clusters with sTratified (DIRECT) samplin… ▽ More Machine learning interatomic potentials (MLIPs) enable the accurate simulation of materials at larger sizes and time scales, and play increasingly important roles in the computational understanding and design of materials. However, MLIPs are only as accurate and robust as the data they are trained on. In this work, we present DImensionality-Reduced Encoded Clusters with sTratified (DIRECT) sampling as an approach to select a robust training set of structures from a large and complex configuration space. By applying DIRECT sampling on the Materials Project relaxation trajectories dataset with over one million structures and 89 elements, we develop an improved materials 3-body graph network (M3GNet) universal potential that extrapolate more reliably to unseen structures. We further show that molecular dynamics (MD) simulations with universal potentials such as M3GNet can be used in place of expensive \textit{ab initio} MD to rapidly create a large configuration space for target materials systems. Combined with DIRECT sampling, we develop a highly reliable moment tensor potential for Ti-H system without the need for iterative optimization. This work paves the way towards robust high throughput development of MLIPs across any compositional complexity. △ Less

Submitted 24 July, 2023; originally announced July 2023.

arXiv:2307.04993 [pdf, other]

doi 10.1093/mnras/stad2080

Uncertainty Quantification of the Virial Black Hole Mass with Conformal Prediction

Authors: Suk Yee Yong, Cheng Soon Ong

Abstract: Precise measurements of the black hole mass are essential to gain insight on the black hole and host galaxy co-evolution. A direct measure of the black hole mass is often restricted to nearest galaxies and instead, an indirect method using the single-epoch virial black hole mass estimation is used for objects at high redshifts. However, this method is subjected to biases and uncertainties as it is… ▽ More Precise measurements of the black hole mass are essential to gain insight on the black hole and host galaxy co-evolution. A direct measure of the black hole mass is often restricted to nearest galaxies and instead, an indirect method using the single-epoch virial black hole mass estimation is used for objects at high redshifts. However, this method is subjected to biases and uncertainties as it is reliant on the scaling relation from a small sample of local active galactic nuclei. In this study, we propose the application of conformalised quantile regression (CQR) to quantify the uncertainties of the black hole predictions in a machine learning setting. We compare CQR with various prediction interval techniques and demonstrated that CQR can provide a more useful prediction interval indicator. In contrast to baseline approaches for prediction interval estimation, we show that the CQR method provides prediction intervals that adjust to the black hole mass and its related properties. That is it yields a tighter constraint on the prediction interval (hence more certain) for a larger black hole mass, and accordingly, bright and broad spectral line width source. Using a combination of neural network model and CQR framework, the recovered virial black hole mass predictions and uncertainties are comparable to those measured from the Sloan Digital Sky Survey. The code is publicly available at https://github.com/yongsukyee/uncertain_blackholemass. △ Less

Submitted 10 July, 2023; originally announced July 2023.

Comments: Accepted for publication in MNRAS. 15 pages, 11 figures, 2 tables

arXiv:2306.09626 [pdf, other]

PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition

Authors: Jia Le Ngwe, Kian Ming Lim, Chin Poo Lee, Thian Song Ong

Abstract: Facial Expression Recognition (FER) is a machine learning problem that deals with recognizing human facial expressions. While existing work has achieved performance improvements in recent years, FER in the wild and under challenging conditions remains a challenge. In this paper, a lightweight patch and attention network based on MobileNetV1, referred to as PAtt-Lite, is proposed to improve FER per… ▽ More Facial Expression Recognition (FER) is a machine learning problem that deals with recognizing human facial expressions. While existing work has achieved performance improvements in recent years, FER in the wild and under challenging conditions remains a challenge. In this paper, a lightweight patch and attention network based on MobileNetV1, referred to as PAtt-Lite, is proposed to improve FER performance under challenging conditions. A truncated ImageNet-pre-trained MobileNetV1 is utilized as the backbone feature extractor of the proposed method. In place of the truncated layers is a patch extraction block that is proposed for extracting significant local facial features to enhance the representation from MobileNetV1, especially under challenging conditions. An attention classifier is also proposed to improve the learning of these patched feature maps from the extremely lightweight feature extractor. The experimental results on public benchmark databases proved the effectiveness of the proposed method. PAtt-Lite achieved state-of-the-art results on CK+, RAF-DB, FER2013, FERPlus, and the challenging conditions subsets for RAF-DB and FERPlus. The source code for the proposed method will be available at https://github.com/JLREx/PAtt-Lite. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2306.08219 [pdf, other]

doi 10.1145/3539618.3591876

Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects

Authors: Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma

Abstract: Conversational recommender systems (CRSs) have become crucial emerging research topics in the field of RSs, thanks to their natural advantages of explicitly acquiring user preferences via interactive conversations and revealing the reasons behind recommendations. However, the majority of current CRSs are text-based, which is less user-friendly and may pose challenges for certain users, such as tho… ▽ More Conversational recommender systems (CRSs) have become crucial emerging research topics in the field of RSs, thanks to their natural advantages of explicitly acquiring user preferences via interactive conversations and revealing the reasons behind recommendations. However, the majority of current CRSs are text-based, which is less user-friendly and may pose challenges for certain users, such as those with visual impairments or limited writing and reading abilities. Therefore, for the first time, this paper investigates the potential of voice-based CRS (VCRSs) to revolutionize the way users interact with RSs in a natural, intuitive, convenient, and accessible fashion. To support such studies, we create two VCRSs benchmark datasets in the e-commerce and movie domains, after realizing the lack of such datasets through an exhaustive literature review. Specifically, we first empirically verify the benefits and necessity of creating such datasets. Thereafter, we convert the user-item interactions to text-based conversations through the ChatGPT-driven prompts for generating diverse and natural templates, and then synthesize the corresponding audios via the text-to-speech model. Meanwhile, a number of strategies are delicately designed to ensure the naturalness and high quality of voice conversations. On this basis, we further explore the potential solutions and point out possible directions to build end-to-end VCRSs by seamlessly extracting and integrating voice-based inputs, thus delivering performance-enhanced, self-explainable, and user-friendly VCRSs. Our study aims to establish the foundation and motivate further pioneering research in the emerging field of VCRSs. This aligns with the principles of explainable AI and AI for social good, viz., utilizing technology's potential to create a fair, sustainable, and just world. △ Less

Submitted 13 June, 2023; originally announced June 2023.

Comments: Accepted by SIGIR 2023 Resource Track

arXiv:2305.13552 [pdf, other]

Squared Neural Families: A New Class of Tractable Density Models

Authors: Russell Tsuchida, Cheng Soon Ong, Dino Sejdinovic

Abstract: Flexible models for probability distributions are an essential ingredient in many machine learning tasks. We develop and investigate a new class of probability distributions, which we call a Squared Neural Family (SNEFY), formed by squaring the 2-norm of a neural network and normalising it with respect to a base measure. Following the reasoning similar to the well established connections between i… ▽ More Flexible models for probability distributions are an essential ingredient in many machine learning tasks. We develop and investigate a new class of probability distributions, which we call a Squared Neural Family (SNEFY), formed by squaring the 2-norm of a neural network and normalising it with respect to a base measure. Following the reasoning similar to the well established connections between infinitely wide neural networks and Gaussian processes, we show that SNEFYs admit closed form normalising constants in many cases of interest, thereby resulting in flexible yet fully tractable density models. SNEFYs strictly generalise classical exponential families, are closed under conditioning, and have tractable marginal distributions. Their utility is illustrated on a variety of density estimation, conditional density estimation, and density estimation with missing data tasks. △ Less

Submitted 25 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

Comments: Spotlight award at NeurIPS 2023

arXiv:2305.11825 [pdf, other]

Machine Learning Moment Tensor Potential for Modelling Dislocation and Fracture in L1$_0$-TiAl and D0$_{19}$-Ti$_3$Al Alloys

Authors: Ji Qi, Z. H. Aitken, Qingxiang Pei, Anne Marie Z. Tan, Yunxing Zuo, M. H. Jhon, S. S. Quek, T. Wen, Zhaoxuan Wu, Shyue ** Ong

Abstract: Dual-phase $γ$-TiAl and $α_2$-Ti$_{3}$Al alloys exhibit high strength and creep resistance at high temperatures. However, they suffer from low tensile ductility and fracture toughness at room temperature. Experimental studies show unusual plastic behaviour associated with ordinary and superdislocations, making it necessary to gain a detailed understanding on their core properties in individual pha… ▽ More Dual-phase $γ$-TiAl and $α_2$-Ti$_{3}$Al alloys exhibit high strength and creep resistance at high temperatures. However, they suffer from low tensile ductility and fracture toughness at room temperature. Experimental studies show unusual plastic behaviour associated with ordinary and superdislocations, making it necessary to gain a detailed understanding on their core properties in individual phases and at the two-phase interfaces. Unfortunately, extended superdislocation cores are widely dissociated beyond the length scales practical for routine first-principles density-functional theory (DFT) calculations, while extant interatomic potentials are not quantitatively accurate to reveal mechanistic origins of the unusual core-related behaviour in either phases. Here, we develop a highly-accurate moment tensor potential (MTP) for the binary Ti-Al alloy system using a DFT dataset covering a broad range of intermetallic and solid solution structures. The optimized MTP is rigorously benchmarked against both previous and new DFT calculations, and unlike existing potentials, is shown to possess outstanding accuracy in nearly all tested mechanical properties, including lattice parameters, elastic constants, surface energies, and generalized stacking fault energies (GSFE) in both phases. The utility of the MTP is further demonstrated by producing dislocation core structures largely consistent with expectations from DFT-GSFE and experimental observations. The new MTP opens the path to realistic modelling and simulations of bulk lattice and defect properties relevant to the plastic deformation and fracture processes in $γ$-TiAl and $α_2$-Ti$_{3}$Al dual-phase alloys. △ Less

Submitted 22 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

arXiv:2304.02350 [pdf, ps, other]

Unfolded Self-Reconstruction LSH: Towards Machine Unlearning in Approximate Nearest Neighbour Search

Authors: Kim Yong Tan, Yueming Lyu, Yew Soon Ong, Ivor W. Tsang

Abstract: Approximate nearest neighbour (ANN) search is an essential component of search engines, recommendation systems, etc. Many recent works focus on learning-based data-distribution-dependent hashing and achieve good retrieval performance. However, due to increasing demand for users' privacy and security, we often need to remove users' data information from Machine Learning (ML) models to satisfy speci… ▽ More Approximate nearest neighbour (ANN) search is an essential component of search engines, recommendation systems, etc. Many recent works focus on learning-based data-distribution-dependent hashing and achieve good retrieval performance. However, due to increasing demand for users' privacy and security, we often need to remove users' data information from Machine Learning (ML) models to satisfy specific privacy and security requirements. This need requires the ANN search algorithm to support fast online data deletion and insertion. Current learning-based hashing methods need retraining the hash function, which is prohibitable due to the vast time-cost of large-scale data. To address this problem, we propose a novel data-dependent hashing method named unfolded self-reconstruction locality-sensitive hashing (USR-LSH). Our USR-LSH unfolded the optimization update for instance-wise data reconstruction, which is better for preserving data information than data-independent LSH. Moreover, our USR-LSH supports fast online data deletion and insertion without retraining. To the best of our knowledge, we are the first to address the machine unlearning of retrieval problems. Empirically, we demonstrate that USR-LSH outperforms the state-of-the-art data-distribution-independent LSH in ANN tasks in terms of precision and recall. We also show that USR-LSH has significantly faster data deletion and insertion time than learning-based data-dependent hashing. △ Less

Submitted 6 April, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

Comments: correct author's name typo

arXiv:2303.01773 [pdf, other]

Comparative study of magnetocaloric properties for Gd$^{3+}$ compounds with different frustrated lattice geometries

Authors: EliseAnne C. Koskelo, Paromita Mukherjee, Cheng Liu, Alice C. Sackville Hamilton, Harapan S. Ong, M. E. Zhitomirsky, Claudio Castelnovo, Siân E. Dutton

Abstract: As materials with suppressed ordering temperatures and enhanced ground state entropies, frustrated magnetic oxides are ideal candidates for cryogenic magnetocaloric refrigeration. While previous materials design has focused on tuning the magnetic moments, their interactions, and density of moments on the lattice, there has been relatively little attention to frustrated lattices. Prior theoretical… ▽ More As materials with suppressed ordering temperatures and enhanced ground state entropies, frustrated magnetic oxides are ideal candidates for cryogenic magnetocaloric refrigeration. While previous materials design has focused on tuning the magnetic moments, their interactions, and density of moments on the lattice, there has been relatively little attention to frustrated lattices. Prior theoretical work has shown that the magnetocaloric cooling rate at the saturation field is proportional to a macroscopic number of soft mode excitations that arise due to the classical ground state degeneracy. The number of these modes is directly determined by the geometry of the frustrating lattice. For corner-sharing geometries, the pyrochlore has 50\% more modes than the garnet and kagome lattices, whereas the edge-sharing \emph{fcc} has only a subextensive number of soft modes. Here, we study the role of soft modes in the magnetocaloric effect of four large-spin Gd$^{3+}$ ($L=0$, $J=S=7/2$) Heisenberg antiferromagnets on a kagome, garnet, pyrochlore, and \emph{fcc} lattice. By comparing measurements of the magnetic entropy change $ΔS_m$ of these materials at fields up to $9$~T with predictions using mean-field theory and Monte Carlo simulations, we are able to understand the relative importance of spin correlations and quantization effects. We observe that tuning the value of the nearest neighbor coupling has a more dominant contribution to the magnetocaloric entropy change in the liquid-He cooling regime ($2$-$20$~K), rather than tuning the number of soft mode excitations. Our results inform future materials design in terms of dimensionality, degree of magnetic frustration, and lattice geometry. △ Less

Submitted 3 March, 2023; originally announced March 2023.

Comments: 15 pages, 14 figures

arXiv:2212.13451 [pdf]

Compositionally Complex Perovskite Oxides as a New Class of Li-Ion Solid Electrolytes

Authors: Shu-Ting Ko, Tom Lee, Ji Qi, Dawei Zhang, Wei-Tao Peng, Xin Wang, Wei-Che Tsai, Shikai Sun, Zhaokun Wang, William J. Bowman, Shyue ** Ong, Xiaoqing Pan, Jian Luo

Abstract: Compositionally complex ceramics (CCCs), including high-entropy ceramics (HECs) as a subclass, offer new opportunities of materials discovery beyond the traditional methodology of searching new stoichiometric compounds. Herein, we establish new strategies of tailoring CCCs via a seamless combination of (1) non-equimolar compositional designs and (2) controlling microstructures and interfaces. Usin… ▽ More Compositionally complex ceramics (CCCs), including high-entropy ceramics (HECs) as a subclass, offer new opportunities of materials discovery beyond the traditional methodology of searching new stoichiometric compounds. Herein, we establish new strategies of tailoring CCCs via a seamless combination of (1) non-equimolar compositional designs and (2) controlling microstructures and interfaces. Using oxide solid electrolytes for all-solid-state batteries as an exemplar, we validate these new strategies via discovering a new class of compositionally complex perovskite oxides (CCPOs) to show the possibility of improving ionic conductivities beyond the limit of conventional do**. As an example (amongst the 28 CCPOs examined), we demonstrate that the ionic conductivity can be improved by >60% in (Li0.375Sr0.4375)(Ta0.375Nb0.375Zr0.125Hf0.125)O3-δ, in comparison with the state-of-art (Li0.375Sr0.4375)(Ta0.75Zr0.25)O3-δ (LSTZ) baseline, via maintaining comparable electrochemical stability. Furthermore, the ionic conductivity can be improved by another >70% via grain boundary (GB) engineering, achieving >270% of the LSTZ baseline. This work suggests transformative new strategies for designing and tailoring HECs and CCCs, thereby opening a new window for discovering materials for energy storage and many other applications. △ Less

Submitted 27 December, 2022; originally announced December 2022.

arXiv:2211.05943 [pdf, other]

Deep equilibrium models as estimators for continuous latent variables

Authors: Russell Tsuchida, Cheng Soon Ong

Abstract: Principal Component Analysis (PCA) and its exponential family extensions have three components: observations, latents and parameters of a linear transformation. We consider a generalised setting where the canonical parameters of the exponential family are a nonlinear transformation of the latents. We show explicit relationships between particular neural network architectures and the corresponding… ▽ More Principal Component Analysis (PCA) and its exponential family extensions have three components: observations, latents and parameters of a linear transformation. We consider a generalised setting where the canonical parameters of the exponential family are a nonlinear transformation of the latents. We show explicit relationships between particular neural network architectures and the corresponding statistical models. We find that deep equilibrium models -- a recently introduced class of implicit neural networks -- solve maximum a-posteriori (MAP) estimates for the latents and parameters of the transformation. Our analysis provides a systematic way to relate activation functions, dropout, and layer structure, to statistical assumptions about the observations, thus providing foundational principles for unsupervised DEQs. For hierarchical latents, individual neurons can be interpreted as nodes in a deep graphical model. Our DEQ feature maps are end-to-end differentiable, enabling fine-tuning for downstream tasks. △ Less

Submitted 10 November, 2022; originally announced November 2022.

Comments: 25 pages

arXiv:2211.03334 [pdf]

doi 10.1073/pnas.2307611120

Optically controlled single-valley exciton doublet states with tunable internal spin structures and spin magnetization generation

Authors: Jiawei Ruan, Zhenglu Li, Chin Shen Ong, Steven G. Louie

Abstract: Manipulating quantum states through light-matter interactions has been actively pursued in two-dimensional (2D) materials research. Significant progress has been made towards the optical control of the valley degrees of freedom in semiconducting monolayer transition-metal dichalcogenides (TMD), based on doubly degenerate excitons from their two distinct valleys in reciprocal space. Here, we introd… ▽ More Manipulating quantum states through light-matter interactions has been actively pursued in two-dimensional (2D) materials research. Significant progress has been made towards the optical control of the valley degrees of freedom in semiconducting monolayer transition-metal dichalcogenides (TMD), based on doubly degenerate excitons from their two distinct valleys in reciprocal space. Here, we introduce a novel kind of optically controllable doubly degenerate exciton states that come from a single valley, dubbed as single-valley exciton doublet (SVXD) states. They are unique in that their constituent holes originate from the same valence band, making possible the direct optical control of the spin structure of the excited constituent electrons. Combining ab initio GW plus Bethe-Salpeter equation (GW-BSE) calculations and a newly developed theoretical analysis method, we demonstrate such novel SVXD in substrate-supported monolayer bismuthene -- which has been successfully grown using molecular beam epitaxy. In each of the two distinct valleys in the Brillouin zone, strong spin-orbit coupling and $C_{3v}$ symmetry lead to a pair of degenerate 1s exciton states (the SVXD states) with opposite spin configurations. Any coherent linear combinations of the SVXD in a single valley can be excited by light with a specific polarization, enabling full manipulation of their internal spin configurations. In particular, a controllable net spin magnetization can be generated through light excitation. Our findings open new routes to control quantum degrees of freedom, paving the way for applications in spintronics and quantum information science. △ Less

Submitted 14 October, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

Comments: 15 pages, 3 figures

Journal ref: Proc. Natl. Acad. Sci. 120, e2307611120 (2023)

arXiv:2210.00294 [pdf, other]

Gait-based Age Group Classification with Adaptive Graph Neural Network

Authors: Timilehin B. Aderinola, Tee Connie, Thian Song Ong, Andrew Beng ** Teoh, Michael Kah Ong Goh

Abstract: Deep learning techniques have recently been utilized for model-free age-associated gait feature extraction. However, acquiring model-free gait demands accurate pre-processing such as background subtraction, which is non-trivial in unconstrained environments. On the other hand, model-based gait can be obtained without background subtraction and is less affected by covariates. For model-based gait-b… ▽ More Deep learning techniques have recently been utilized for model-free age-associated gait feature extraction. However, acquiring model-free gait demands accurate pre-processing such as background subtraction, which is non-trivial in unconstrained environments. On the other hand, model-based gait can be obtained without background subtraction and is less affected by covariates. For model-based gait-based age group classification problems, present works rely solely on handcrafted features, where feature extraction is tedious and requires domain expertise. This paper proposes a deep learning approach to extract age-associated features from model-based gait for age group classification. Specifically, we first develop an unconstrained gait dataset called Multimedia University Gait Age and Gender dataset (MMU GAG). Next, the body joint coordinates are determined via pose estimation algorithms and represented as compact gait graphs via a novel part aggregation scheme. Then, a Part-AdaptIve Residual Graph Convolutional Neural Network (PairGCN) is designed for age-associated feature learning. Experiments suggest that PairGCN features are far more informative than handcrafted features, yielding up to 99% accuracy for classifying subjects as a child, adult, or senior in the MMU GAG dataset. △ Less

Submitted 1 October, 2022; originally announced October 2022.

arXiv:2208.14420 [pdf, other]

The Intercalation Chemistry of the Disordered RockSalt Li3V2O5 Anode from Cluster Expansions and Machine Learning Interatomic Potentials

Authors: Xingyu Guo, Chi Chen, Shyue ** Ong

Abstract: Disordered rocksalt (DRX) Li3V2O5 is a promising candidate for anode in rechargeable lithium-ion batteries because of its ideal low voltage, high rate capability, and superior cycling stability. Herein, we presents a comprehensive study of intercalation chemistry of the DRX-Li3V2O5 anode using density functional theory calculations combined with machine learning cluster expansions and interatomic… ▽ More Disordered rocksalt (DRX) Li3V2O5 is a promising candidate for anode in rechargeable lithium-ion batteries because of its ideal low voltage, high rate capability, and superior cycling stability. Herein, we presents a comprehensive study of intercalation chemistry of the DRX-Li3V2O5 anode using density functional theory calculations combined with machine learning cluster expansions and interatomic potentials. The predicted voltage profile of the disordered Li3V2O5 anode at room temperature based on Monte Carlo simulations with a fitted cluster expansion model is in excellent agreement with experiments. In contrast to previous DFT results, we find that Li ions predominately intercalate into tetrahedral sites during charging, while the majority of Li and V ions at octahedral sites remain stable. In addition, MD simulations with a fitted moment tensor potential attribute the fast-charging capability of DRX-Li3V2O5 to the facile diffusivity of Li+ via tetrahedral - octahedral - tetrahedral pathway. We further suggest tuning the Li:V ratio as a means to trade off increased lithiation capacity and decreased anode voltage in this system. This work provides in-depth insights into the high-performance DRX-Li3V2O5 anode, and paves the way to the discovery of other disordered anode materials. △ Less

Submitted 30 August, 2022; originally announced August 2022.

arXiv:2208.07823 [pdf, other]

Synthetic control of structure and conduction properties in Na-Y-Zr-Cl solid electrolytes

Authors: Elias Sebti, Ji Qi, Peter M. Richardson, Phillip Ridley, Erik A. Wu, Swastika Banerjee, Raynald Giovine, Ashley Cronk, So-Yeon Ham, Ying Shirley Meng, Shyue ** Ong, Raphaële J. Clément

Abstract: In the development of low cost, sustainable, and energy-dense batteries, chloride-based compounds are promising catholyte materials for solid-state batteries owing to their high Na-ion conductivities and oxidative stabilities. The ability to further improve Na-ion conduction, however, requires an understanding of the impact of long-range and local structural features on transport in these systems.… ▽ More In the development of low cost, sustainable, and energy-dense batteries, chloride-based compounds are promising catholyte materials for solid-state batteries owing to their high Na-ion conductivities and oxidative stabilities. The ability to further improve Na-ion conduction, however, requires an understanding of the impact of long-range and local structural features on transport in these systems. In this study, we leverage different synthesis methods to control polymorphism and cation disorder in Na-Y-Zr-Cl solid electrolytes and interrogate the impact on Na-ion conduction. We demonstrate the existence of a more conductive P2$_1$/n polymorph of Na$_2$ZrCl$_6$ formed upon ball milling. In Na$_3$YCl$_6$, the R$\bar{3}$ polymorph is shown to be more conductive than its P2$_1$/n counterpart owing to the presence of intrinsic vacancies and disorder on the Y sublattice. Transition metal ordering in the Na$_{2.25}$Y$_{0.25}$Zr$_{0.75}$Cl$_6$ composition strongly impacts Na-ion transport, where a greater mixing of Y$^{3+}$ and Zr$^{4+}$ on the transition metal sublattice facilitates ion migration through partial activation of Cl rotations at relevant temperatures. Overall, Na-ion transport sensitively depends on the phases and transition metal distributions stabilized during synthesis. These results are likely generalizable to other halide compositions and indicate that achieving control over the synthetic protocol and resultant structure is key in the pursuit of improved catholytes for high voltage solid-state sodium-ion batteries. △ Less

Submitted 16 August, 2022; originally announced August 2022.

arXiv:2208.07080 [pdf, other]

sp$^{2}$/sp$^{3}$ bonding controlling mechanism at the $α$-Al$_{2}$O$_{3}|$graphene interface

Authors: Renan P. Maciel, Chin Shen Ong, Daria Belotcerkovtceva, Yaroslav O. Kvashnin, Danny Thonig, M. Venkata Kamalakar, Olle Eriksson

Abstract: First-principles calculations reported here illuminate the effects of the interfacial properties of $α$-Al$_{2}$O$_{3}$ and graphene, with emphasis on the structural and electronic properties. Various contact interfaces and different $α$-Al$_{2}$O$_{3}$ surface terminations are considered with on and slightly-off stoichiometric aluminium oxide. We show that depending on whether aluminium or oxygen… ▽ More First-principles calculations reported here illuminate the effects of the interfacial properties of $α$-Al$_{2}$O$_{3}$ and graphene, with emphasis on the structural and electronic properties. Various contact interfaces and different $α$-Al$_{2}$O$_{3}$ surface terminations are considered with on and slightly-off stoichiometric aluminium oxide. We show that depending on whether aluminium or oxygen is in contact with graphene, an $sp^{3}$ structural deformation and spontaneous spin-polarization may occur next to the interface contact. Interestingly, some cases cause a $p$-type do** in the graphene band structure, depending on the initial $α$-Al$_{2}$O$_{3}$ geometry placed on graphene. The importance of leaving the surface dangling bonds of alumina saturated or not is also highlighted, and we show that it might be a control mechanism for opening a gap in graphene by the influence of the $sp^{3}$ bond between oxygen and carbon atoms at the interface. We discuss the potential of utilizing this sensitivity for practical applications. △ Less

Submitted 15 August, 2022; originally announced August 2022.

Comments: 10 pages, 6 figures, In submission process (peer review) at physical review research (PRR)

arXiv:2207.14443 [pdf, other]

A Survey of Learning on Small Data: Generalization, Optimization, and Challenge

Authors: Xiaofeng Cao, Weixin Bu, Shengjun Huang, Minling Zhang, Ivor W. Tsang, Yew Soon Ong, James T. Kwok

Abstract: Learning on big data brings success for artificial intelligence (AI), but the annotation and training costs are expensive. In future, learning on small data that approximates the generalization ability of big data is one of the ultimate purposes of AI, which requires machines to recognize objectives and scenarios relying on small data as humans. A series of learning topics is going on this way suc… ▽ More Learning on big data brings success for artificial intelligence (AI), but the annotation and training costs are expensive. In future, learning on small data that approximates the generalization ability of big data is one of the ultimate purposes of AI, which requires machines to recognize objectives and scenarios relying on small data as humans. A series of learning topics is going on this way such as active learning and few-shot learning. However, there are few theoretical guarantees for their generalization performance. Moreover, most of their settings are passive, that is, the label distribution is explicitly controlled by finite training resources from known distributions. This survey follows the agnostic active sampling theory under a PAC (Probably Approximately Correct) framework to analyze the generalization error and label complexity of learning on small data in model-agnostic supervised and unsupervised fashion. Considering multiple learning communities could produce small data representation and related topics have been well surveyed, we thus subjoin novel geometric representation perspectives for small data: the Euclidean and non-Euclidean (hyperbolic) mean, where the optimization solutions including the Euclidean gradients, non-Euclidean gradients, and Stein gradient are presented and discussed. Later, multiple learning communities that may be improved by learning on small data are summarized, which yield data-efficient representations, such as transfer learning, contrastive learning, graph representation learning. Meanwhile, we find that the meta-learning may provide effective parameter update policies for learning on small data. Then, we explore multiple challenging scenarios for small data, such as the weak supervision and multi-label. Finally, multiple data applications that may benefit from efficient small data representation are surveyed. △ Less

Submitted 6 June, 2023; v1 submitted 28 July, 2022; originally announced July 2022.

arXiv:2206.14648 [pdf, other]

Two-Stage Neural Contextual Bandits for Personalised News Recommendation

Authors: Mengyan Zhang, Thanh Nguyen-Tang, Fangzhao Wu, Zhenyu He, Xing Xie, Cheng Soon Ong

Abstract: We consider the problem of personalised news recommendation where each user consumes news in a sequential fashion. Existing personalised news recommendation methods focus on exploiting user interests and ignores exploration in recommendation, which leads to biased feedback loops and hurt recommendation quality in the long term. We build on contextual bandits recommendation strategies which natural… ▽ More We consider the problem of personalised news recommendation where each user consumes news in a sequential fashion. Existing personalised news recommendation methods focus on exploiting user interests and ignores exploration in recommendation, which leads to biased feedback loops and hurt recommendation quality in the long term. We build on contextual bandits recommendation strategies which naturally address the exploitation-exploration trade-off. The main challenges are the computational efficiency for exploring the large-scale item space and utilising the deep representations with uncertainty. We propose a two-stage hierarchical topic-news deep contextual bandits framework to efficiently learn user preferences when there are many news items. We use deep learning representations for users and news, and generalise the neural upper confidence bound (UCB) policies to generalised additive UCB and bilinear UCB. Empirical results on a large-scale news recommendation dataset show that our proposed policies are efficient and outperform the baseline bandit policies. △ Less

Submitted 26 June, 2022; originally announced June 2022.

arXiv:2206.08030 [pdf]

doi 10.1038/s41563-022-01285-3

Unconventional Excitonic States with Phonon Sidebands in Layered Silicon Diphosphide

Authors: Ling Zhou, Junwei Huang, Lukas Windgaetter, Chin Shen Ong, Xiaoxu Zhao, Caorong Zhang, Ming Tang, Zeya Li, Caiyu Qiu, Simone Latini, Yangfan Lu, Di Wu, Huiyang Gou, Andrew T. S. Wee, Hideo Hosono, Steven G. Louie, Peizhe Tang, Angel Rubio, Hongtao Yuan

Abstract: Many-body interactions between quasiparticles (electrons, excitons, and phonons) have led to the emergence of new complex correlated states and are at the core of condensed matter physics and material science. In low-dimensional materials, unique electronic properties for these correlated states could significantly affect their optical properties. Herein, combining photoluminescence, optical refle… ▽ More Many-body interactions between quasiparticles (electrons, excitons, and phonons) have led to the emergence of new complex correlated states and are at the core of condensed matter physics and material science. In low-dimensional materials, unique electronic properties for these correlated states could significantly affect their optical properties. Herein, combining photoluminescence, optical reflection measurements and theoretical calculations, we demonstrate an unconventional excitonic state and its bound phonon sideband in layered silicon diphosphide (SiP$_2$), in which the bound electron-hole pair is composed of electrons confined within one-dimensional phosphorus$-$phosphorus chains and holes extended in two-dimensional SiP$_2$ layers. The excitonic state and the emergent phonon sideband show linear dichroism and large energy redshifts with increasing temperature. Within the $GW$ plus Bethe$-$Salpeter equation calculations and solving the generalized Holstein model non-perturbatively, we confirm that the observed sideband feature results from the correlated interaction between excitons and optical phonons. Such a layered material provides a new platform to study excitonic physics and many-particle effects. △ Less

Submitted 16 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

Journal ref: Nature Materials (2022)

arXiv:2206.07706 [pdf, other]

Masked Frequency Modeling for Self-Supervised Visual Pre-Training

Authors: Jiahao Xie, Wei Li, Xiaohang Zhan, Ziwei Liu, Yew Soon Ong, Chen Change Loy

Abstract: We present Masked Frequency Modeling (MFM), a unified frequency-domain-based approach for self-supervised pre-training of visual models. Instead of randomly inserting mask tokens to the input embeddings in the spatial domain, in this paper, we shift the perspective to the frequency domain. Specifically, MFM first masks out a portion of frequency components of the input image and then predicts the… ▽ More We present Masked Frequency Modeling (MFM), a unified frequency-domain-based approach for self-supervised pre-training of visual models. Instead of randomly inserting mask tokens to the input embeddings in the spatial domain, in this paper, we shift the perspective to the frequency domain. Specifically, MFM first masks out a portion of frequency components of the input image and then predicts the missing frequencies on the frequency spectrum. Our key insight is that predicting masked components in the frequency domain is more ideal to reveal underlying image patterns rather than predicting masked patches in the spatial domain, due to the heavy spatial redundancy. Our findings suggest that with the right configuration of mask-and-predict strategy, both the structural information within high-frequency components and the low-level statistics among low-frequency counterparts are useful in learning good representations. For the first time, MFM demonstrates that, for both ViT and CNN, a simple non-Siamese framework can learn meaningful representations even using none of the following: (i) extra data, (ii) extra model, (iii) mask token. Experimental results on image classification and semantic segmentation, as well as several robustness benchmarks show the competitive performance and advanced robustness of MFM compared with recent masked image modeling approaches. Furthermore, we also comprehensively investigate the effectiveness of classical image restoration tasks for representation learning from a unified frequency perspective and reveal their intriguing relations with our MFM approach. △ Less

Submitted 25 April, 2023; v1 submitted 15 June, 2022; originally announced June 2022.

Comments: ICLR 2023. Project page: https://www.mmlab-ntu.com/project/mfm/index.html Code: https://github.com/Jiahao000/MFM

arXiv:2205.10595 [pdf, ps, other]

doi 10.1007/978-3-642-23626-6_53

Myocardial Segmentation of Late Gadolinium Enhanced MR Images by Propagation of Contours from Cine MR Images

Authors: Dong Wei, Ying Sun, ** Chai, Adrian Low, Sim Heng Ong

Abstract: Automatic segmentation of myocardium in Late Gadolinium Enhanced (LGE) Cardiac MR (CMR) images is often difficult due to the intensity heterogeneity resulting from accumulation of contrast agent in infarcted areas. In this paper, we propose an automatic segmentation framework that fully utilizes shared information between corresponding cine and LGE images of a same patient. Given myocardial contou… ▽ More Automatic segmentation of myocardium in Late Gadolinium Enhanced (LGE) Cardiac MR (CMR) images is often difficult due to the intensity heterogeneity resulting from accumulation of contrast agent in infarcted areas. In this paper, we propose an automatic segmentation framework that fully utilizes shared information between corresponding cine and LGE images of a same patient. Given myocardial contours in cine CMR images, the proposed framework achieves accurate segmentation of LGE CMR images in a coarse-to-fine manner. Affine registration is first performed between the corresponding cine and LGE image pair, followed by nonrigid registration, and finally local deformation of myocardial contours driven by forces derived from local features of the LGE image. Experimental results on real patient data with expert outlined ground truth show that the proposed framework can generate accurate and reliable results for myocardial segmentation of LGE CMR images. △ Less

Submitted 21 May, 2022; originally announced May 2022.

Comments: MICCAI 2011

arXiv:2205.10572 [pdf, other]

doi 10.1109/TBME.2013.2237907

A Comprehensive 3-D Framework for Automatic Quantification of Late Gadolinium Enhanced Cardiac Magnetic Resonance Images

Authors: Dong Wei, Ying Sun, Sim-Heng Ong, ** Chai, Lynette L Teo, Adrian F Low

Abstract: Late gadolinium enhanced (LGE) cardiac magnetic resonance (CMR) can directly visualize nonviable myocardium with hyperenhanced intensities with respect to normal myocardium. For heart attack patients, it is crucial to facilitate the decision of appropriate therapy by analyzing and quantifying their LGE CMR images. To achieve accurate quantification, LGE CMR images need to be processed in two steps… ▽ More Late gadolinium enhanced (LGE) cardiac magnetic resonance (CMR) can directly visualize nonviable myocardium with hyperenhanced intensities with respect to normal myocardium. For heart attack patients, it is crucial to facilitate the decision of appropriate therapy by analyzing and quantifying their LGE CMR images. To achieve accurate quantification, LGE CMR images need to be processed in two steps: segmentation of the myocardium followed by classification of infarcts within the segmented myocardium. However, automatic segmentation is difficult usually due to the intensity heterogeneity of the myocardium and intensity similarity between the infarcts and blood pool. Besides, the slices of an LGE CMR dataset often suffer from spatial and intensity distortions, causing further difficulties in segmentation and classification. In this paper, we present a comprehensive 3-D framework for automatic quantification of LGE CMR images. In this framework, myocardium is segmented with a novel method that deforms coupled endocardial and epicardial meshes and combines information in both short- and long-axis slices, while infarcts are classified with a graph-cut algorithm incorporating intensity and spatial information. Moreover, both spatial and intensity distortions are effectively corrected with specially designed countermeasures. Experiments with 20 sets of real patient data show visually good segmentation and classification results that are quantitatively in strong agreement with those manually obtained by experts. △ Less

Submitted 21 May, 2022; originally announced May 2022.

Comments: IEEE Transactions on Biomedical Engineering ( Volume: 60, Issue: 6, June 2013)

arXiv:2205.10548 [pdf, ps, other]

doi 10.1016/j.media.2013.03.001

Three-Dimensional Segmentation of the Left Ventricle in Late Gadolinium Enhanced MR Images of Chronic Infarction Combining Long- and Short-Axis Information

Authors: Dong Wei, Ying Sun, Sim-Heng Ong, ** Chai, Lynette L. Teo, Adrian F. Low

Abstract: Automatic segmentation of the left ventricle (LV) in late gadolinium enhanced (LGE) cardiac MR (CMR) images is difficult due to the intensity heterogeneity arising from accumulation of contrast agent in infarcted myocardium. In this paper, we present a comprehensive framework for automatic 3D segmentation of the LV in LGE CMR images. Given myocardial contours in cine images as a priori knowledge,… ▽ More Automatic segmentation of the left ventricle (LV) in late gadolinium enhanced (LGE) cardiac MR (CMR) images is difficult due to the intensity heterogeneity arising from accumulation of contrast agent in infarcted myocardium. In this paper, we present a comprehensive framework for automatic 3D segmentation of the LV in LGE CMR images. Given myocardial contours in cine images as a priori knowledge, the framework initially propagates the a priori segmentation from cine to LGE images via 2D translational registration. Two meshes representing respectively endocardial and epicardial surfaces are then constructed with the propagated contours. After construction, the two meshes are deformed towards the myocardial edge points detected in both short-axis and long-axis LGE images in a unified 3D coordinate system. Taking into account the intensity characteristics of the LV in LGE images, we propose a novel parametric model of the LV for consistent myocardial edge points detection regardless of pathological status of the myocardium (infarcted or healthy) and of the type of the LGE images (short-axis or long-axis). We have evaluated the proposed framework with 21 sets of real patient and 4 sets of simulated phantom data. Both distance- and region-based performance metrics confirm the observation that the framework can generate accurate and reliable results for myocardial segmentation of LGE images. We have also tested the robustness of the framework with respect to varied a priori segmentation in both practical and simulated settings. Experimental results show that the proposed framework can greatly compensate variations in the given a priori knowledge and consistently produce accurate segmentations. △ Less

Submitted 21 May, 2022; originally announced May 2022.

Comments: Medical Image Analysis, Volume 17, Issue 6, August 2013, Pages 685-697

arXiv:2204.01832 [pdf, other]

doi 10.1063/5.0094205

Quantum materials for energy-efficient neuromorphic computing

Authors: Axel Hoffmann, Shriram Ramanathan, Julie Grollier, Andrew D. Kent, Marcelo Rozenberg, Ivan K. Schuller, Oleg Shpyrko, Robert Dynes, Yeshaiahu Fainman, Alex Frano, Eric E. Fullerton, Giulia Galli, Vitaliy Lomakin, Shyue ** Ong, Amanda K. Petford-Long, Jonathan A. Schuller, Mark D. Stiles, Yayoi Takamura, Yimei Zhu

Abstract: Neuromorphic computing approaches become increasingly important as we address future needs for efficiently processing massive amounts of data. The unique attributes of quantum materials can help address these needs by enabling new energy-efficient device concepts that implement neuromorphic ideas at the hardware level. In particular, strong correlations give rise to highly non-linear responses, su… ▽ More Neuromorphic computing approaches become increasingly important as we address future needs for efficiently processing massive amounts of data. The unique attributes of quantum materials can help address these needs by enabling new energy-efficient device concepts that implement neuromorphic ideas at the hardware level. In particular, strong correlations give rise to highly non-linear responses, such as conductive phase transitions that can be harnessed for short and long-term plasticity. Similarly, magnetization dynamics are strongly non-linear and can be utilized for data classification. This paper discusses select examples of these approaches, and provides a perspective for the current opportunities and challenges for assembling quantum-material-based devices for neuromorphic functionalities into larger emergent complex network systems. △ Less

Submitted 4 April, 2022; originally announced April 2022.

Journal ref: APL Materials 10, 070904 (2022)

arXiv:2204.00091 [pdf]

doi 10.1038/s41467-023-37115-6

Atomic-scale origin of the low grain-boundary resistance in perovskite solid electrolytes

Authors: Tom Lee, Ji Qi, Chaitanya A. Gadre, Huaixun Huyan, Shu-Ting Ko, Yunxing Zuo, Chaojie Du, Jie Li, Toshihiro Aoki, Caden John Stippich, Ruqian Wu, Jian Luo, Shyue ** Ong, Xiaoqing Pan

Abstract: Oxide solid electrolytes (OSEs) have the potential to achieve improved safety and energy density for lithium-ion batteries, but their high grain-boundary (GB) resistance is a general bottleneck. In the most well studied perovskite OSE, Li3xLa2/3-xTiO3 (LLTO), the ionic conductivity of GBs is about three orders of magnitude lower than that of the bulk. In contrast, the related Li0.375Sr0.4375Ta0.75… ▽ More Oxide solid electrolytes (OSEs) have the potential to achieve improved safety and energy density for lithium-ion batteries, but their high grain-boundary (GB) resistance is a general bottleneck. In the most well studied perovskite OSE, Li3xLa2/3-xTiO3 (LLTO), the ionic conductivity of GBs is about three orders of magnitude lower than that of the bulk. In contrast, the related Li0.375Sr0.4375Ta0.75Zr0.25O3 (LSTZ0.75) perovskite exhibits low GB resistance for reasons yet unknown. Here, we used aberration-corrected scanning transmission electron microscopy and spectroscopy, along with an active learning moment tensor potential, to reveal the atomic scale structure and composition of LSTZ0.75 GBs. Vibrational electron energy loss spectroscopy is applied for the first time to characterize the otherwise unmeasurable Li distribution in GBs of LSTZ0.75. We found that Li depletion, which is a major reason for the low GB ionic conductivity of LLTO, is absent for the GBs of LSTZ0.75. Instead, the low GB resistivity of LSTZ0.75 is attributed to the formation of a unique defective cubic perovskite interfacial structure that contained abundant vacancies. Our study provides insights into the atomic scale mechanisms of low GB resistivity and sheds light on possible paths for designing OSEs with high total ionic conductivity. △ Less

Submitted 31 March, 2022; originally announced April 2022.

arXiv:2203.13509 [pdf, other]

Photo-induced Hidden Phase of 1T-TaS2 with Tunable Lifetime

Authors: Pierre-Adrien Mante, Chin Shen Ong, Daniel Finkelstein Shapiro, Arkady Yartsev, Oscar Grånäs, Olle Eriksson

Abstract: Phase transitions are ubiquitous, appearing at every length scale from atoms to galaxies. In condensed matter, ultrafast laser pulses drive materials to highly non-equilibrium conditions allowing transitions to new phases of matter not attainable under thermal excitation. Despite the intense scrutiny these hidden phases have received, the details of the dynamics of transition and reestablishment o… ▽ More Phase transitions are ubiquitous, appearing at every length scale from atoms to galaxies. In condensed matter, ultrafast laser pulses drive materials to highly non-equilibrium conditions allowing transitions to new phases of matter not attainable under thermal excitation. Despite the intense scrutiny these hidden phases have received, the details of the dynamics of transition and reestablishment of the ground state remain largely unexplored. Here, we show the transition to a hidden phase of 1T-TaS2 driven by the screening of Coulombic repulsive interaction by photoexcited electrons. The temporal evolution of the coherent lattice dynamics highlights the existence of a novel phase with a laser fluence-dependent lifetime. The modeling of the dynamics reveals that the transition is caused by photo-excited carriers and it disappears at the rate of electron-phonon scattering. Our results demonstrate how femtosecond laser absorption leads to a decoupling of the electronic and lattice sub-systems, opening the way to novel states of matter, which can be controlled with light. We expect our investigation to be a starting point towards the development of novel ultrafast photonics devices, such as switches and modulators, taking advantage of fast and tunable phase transitions. △ Less

Submitted 25 March, 2022; originally announced March 2022.

arXiv:2203.08605 [pdf, other]

Compositional dependence of direct transition energies in Si$_x$Ge$_{1-x-y}$Sn$_y$ alloys lattice-matched to Ge/GaAs

Authors: Phoebe M. Pearce, Sheau Wei Ong, Andrew D. Johnson, Eng Soon Tok, Nicholas J. Ekins-Daukes

Abstract: Si$_x$Ge$_{1-x-y}$Sn$_y$ ternary alloys are a candidate material system for use in solar cells and other optoelectronic devices. We report on the direct transition energies and structural properties of Ge-rich Si$_x$Ge$_{1-x-y}$Sn$_y$ alloys with six different compositions up to 10 % Si and 3 % Sn, lattice-matched to Ge or GaAs substrates. The direct transitions occurring between 0.9 and 5.0 eV we… ▽ More Si$_x$Ge$_{1-x-y}$Sn$_y$ ternary alloys are a candidate material system for use in solar cells and other optoelectronic devices. We report on the direct transition energies and structural properties of Ge-rich Si$_x$Ge$_{1-x-y}$Sn$_y$ alloys with six different compositions up to 10 % Si and 3 % Sn, lattice-matched to Ge or GaAs substrates. The direct transitions occurring between 0.9 and 5.0 eV were investigated using spectroscopic ellipsometry (SE), and the resulting data was used to obtain the dielectric functions of the Si$_x$Ge$_{1-x-y}$Sn$_y$n layer by fitting a multi-layer model. Values for the $E_0$, $E_1$, $Δ_1$, $E_0'$ and $E_2$ transition energies were then found by differentiating these dielectric functions to extract the locations of critical points. Structurally, the composition of the samples was measured using energy-dispersive X-ray measurements (EDX). The lattice constants predicted from these compositions are in good agreement with reciprocal space maps obtained through X-ray diffraction (XRD). The results confirm that a 1 eV direct absorption edge can be achieved using relatively low Si and Sn fractions ($<$ 10 % and $<$ 3 % respectively), while the higher-energy critical points show smaller shifts relative to Ge and match results previously observed or predicted in the literature. △ Less

Submitted 10 March, 2022; originally announced March 2022.

Comments: 11 pages, 9 figures

arXiv:2203.03767 [pdf, other]

doi 10.1038/s41524-023-01046-z

Multi-scale Investigation of Chemical Short-Range Order and Dislocation Glide in the MoNbTi and TaNbTi Refractory Multi-Principal Element Alloys

Authors: Hui Zheng, Lauren T. W. Fey, Xiang-Guo Li, Yong-Jie Hu, Liang Qi, Chi Chen, Shuozhi Xu, Irene J. Beyerlein, Shyue ** Ong

Abstract: Refractory multi-principal element alloys (RMPEAs) are promising materials for high-temperature structural applications. Here, we investigate the role of chemical short-range ordering (CSRO) on dislocation glide in two model RMPEAs - TaNbTi and MoNbTi - using a multi-scale modeling approach. A highly accurate machine learning interatomic potential was developed for the Mo-Ta-Nb-Ti system and used… ▽ More Refractory multi-principal element alloys (RMPEAs) are promising materials for high-temperature structural applications. Here, we investigate the role of chemical short-range ordering (CSRO) on dislocation glide in two model RMPEAs - TaNbTi and MoNbTi - using a multi-scale modeling approach. A highly accurate machine learning interatomic potential was developed for the Mo-Ta-Nb-Ti system and used to demonstrate that MoNbTi exhibits a much greater degree of SRO than TaNbTi and the local composition has a direct effect on the unstable stacking fault energies (USFE). From mesoscale phase-field dislocation dynamics simulations, we find that increasing SRO leads to higher mean USFEs, thereby increasing the stress required for dislocation glide. The gliding dislocations experience significant hardening due to pinning and depinning caused by random compositional fluctuations, with higher SRO decreasing the degree of USFE dispersion and hence, amount of hardening. Finally, we show how the morphology of an expanding dislocation loop is affected by the applied stress, with higher SRO requiring higher applied stresses to achieve smooth screw dislocation glide. △ Less

Submitted 7 March, 2022; originally announced March 2022.

arXiv:2202.02450 [pdf, other]

doi 10.1038/s43588-022-00349-3

A Universal Graph Deep Learning Interatomic Potential for the Periodic Table

Authors: Chi Chen, Shyue ** Ong

Abstract: Interatomic potentials (IAPs), which describe the potential energy surface of atoms, are a fundamental input for atomistic simulations. However, existing IAPs are either fitted to narrow chemistries or too inaccurate for general applications. Here, we report a universal IAP for materials based on graph neural networks with three-body interactions (M3GNet). The M3GNet IAP was trained on the massive… ▽ More Interatomic potentials (IAPs), which describe the potential energy surface of atoms, are a fundamental input for atomistic simulations. However, existing IAPs are either fitted to narrow chemistries or too inaccurate for general applications. Here, we report a universal IAP for materials based on graph neural networks with three-body interactions (M3GNet). The M3GNet IAP was trained on the massive database of structural relaxations performed by the Materials Project over the past 10 years and has broad applications in structural relaxation, dynamic simulations and property prediction of materials across diverse chemical spaces. About 1.8 million materials were identified from a screening of 31 million hypothetical crystal structures to be potentially stable against existing Materials Project crystals based on M3GNet energies. Of the top 2000 materials with the lowest energies above hull, 1578 were verified to be stable using DFT calculations. These results demonstrate a machine learning-accelerated pathway to the discovery of synthesizable materials with exceptional properties. △ Less

Submitted 14 August, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

arXiv:2201.11991 [pdf, other]

A Universal Machine Learning Model for Elemental Grain Boundary Energies

Authors: Weike Ye, Hui Zheng, Chi Chen, Shyue ** Ong

Abstract: The grain boundary (GB) energy has a profound influence on the grain growth and properties of polycrystalline metals. Here, we show that the energy of a GB, normalized by the bulk cohesive energy, can be described purely by four geometric features. By machine learning on a large computed database of 361 small $Σ$ ($Σ< 10$) GBs of more than 50 metals, we develop a model that can predict the grain b… ▽ More The grain boundary (GB) energy has a profound influence on the grain growth and properties of polycrystalline metals. Here, we show that the energy of a GB, normalized by the bulk cohesive energy, can be described purely by four geometric features. By machine learning on a large computed database of 361 small $Σ$ ($Σ< 10$) GBs of more than 50 metals, we develop a model that can predict the grain boundary energies to within a mean absolute error of 0.13 J m$^{-2}$. More importantly, this universal GB energy model can be extrapolated to the energies of high $Σ$ GBs without loss in accuracy. These results highlight the importance of capturing fundamental scaling physics and domain knowledge in the design of interpretable, extrapolatable machine learning models for materials science. △ Less

Submitted 28 January, 2022; originally announced January 2022.

Comments: 12 pages, 4 figures

arXiv:2201.02562 [pdf]

Nature of novel moiré exciton states in WSe$_2$/WS$_2$ heterobilayers

Authors: Mit H. Naik, Emma C. Regan, Zuocheng Zhang, Yang-hao Chan, Zhenglu Li, Danqing Wang, Yoseob Yoon, Chin Shen Ong, Wenyu Zhao, Sihan Zhao, M. Iqbal Bakti Utama, Beini Gao, Xin Wei, Mohammed Sayyad, Kentaro Yumigeta, Kenji Watanabe, Takashi Taniguchi, Sefaattin Tongay, Felipe H. da Jornada, Feng Wang, Steven G. Louie

Abstract: Moiré patterns of transition metal dichalcogenide (TMD) heterobilayers have proven to be an ideal platform to host unusual correlated electronic phases, emerging magnetism, and correlated exciton physics. While the existence of novel moiré excitonic states is established through optical measurements, the microscopic nature of these states is still poorly understood, often relying on empirically fi… ▽ More Moiré patterns of transition metal dichalcogenide (TMD) heterobilayers have proven to be an ideal platform to host unusual correlated electronic phases, emerging magnetism, and correlated exciton physics. While the existence of novel moiré excitonic states is established through optical measurements, the microscopic nature of these states is still poorly understood, often relying on empirically fit models. Here, combining large-scale first-principles GW-BSE calculations and micro-reflection spectroscopy, we identify the nature of the exciton resonances in WSe$_2$/WS$_2$ moiré superlattices, discovering a surprisingly rich set of moiré excitons that cannot be even qualitatively captured by prevailing continuum models. Our calculations reveal moiré excitons with distinct characters, including modulated Wannier excitons and previously unindentified intralayer charge-transfer excitons. Signatures of these distinct excitonic characters are confirmed experimentally via the unique carrier-density and magnetic-field dependences of different moiré exciton resonances. Our study highlights the highly non-trivial exciton states that can emerge in TMD moiré superlattices, and suggests novel ways of tuning many-body physics in moiré systems by engineering excited-states with specific spatial characters. △ Less

Submitted 7 January, 2022; originally announced January 2022.

arXiv:2112.13029 [pdf, other]

Gaussian Process Bandits with Aggregated Feedback

Authors: Mengyan Zhang, Russell Tsuchida, Cheng Soon Ong

Abstract: We consider the continuum-armed bandits problem, under a novel setting of recommending the best arms within a fixed budget under aggregated feedback. This is motivated by applications where the precise rewards are impossible or expensive to obtain, while an aggregated reward or feedback, such as the average over a subset, is available. We constrain the set of reward functions by assuming that they… ▽ More We consider the continuum-armed bandits problem, under a novel setting of recommending the best arms within a fixed budget under aggregated feedback. This is motivated by applications where the precise rewards are impossible or expensive to obtain, while an aggregated reward or feedback, such as the average over a subset, is available. We constrain the set of reward functions by assuming that they are from a Gaussian Process and propose the Gaussian Process Optimistic Optimisation (GPOO) algorithm. We adaptively construct a tree with nodes as subsets of the arm space, where the feedback is the aggregated reward of representatives of a node. We propose a new simple regret notion with respect to aggregated feedback on the recommended arms. We provide theoretical analysis for the proposed algorithm, and recover single point feedback as a special case. We illustrate GPOO and compare it with related algorithms on simulated data. △ Less

Submitted 24 December, 2021; originally announced December 2021.

Comments: to be published in 36th AAAI Conference on Artificial Intelligence (2022)

arXiv:2112.08714 [pdf]

Degradation Mechanism of Perovskite under High Charge Carrier Density Condition

Authors: Guohui Li, Huihui Pi, Yanfu Wei, Bolin Zhou, Ya Gao, Rong Wen, Yuying Hao, Han Zhang, Beng S. Ong, Yanxia Cui

Abstract: Extensive studies have focused on degradation of perovskite at low charge carrier density (<10^16 cm^-3), but few have surveyed the degradation mechanism at high charge carrier density (~10^18 cm^-3). Here, we investigate the degradation mechanisms of perovskite under high charge carrier conditions. Unlike the observations in previous works, we find that MAPbI3 degradation starts at surface defect… ▽ More Extensive studies have focused on degradation of perovskite at low charge carrier density (<10^16 cm^-3), but few have surveyed the degradation mechanism at high charge carrier density (~10^18 cm^-3). Here, we investigate the degradation mechanisms of perovskite under high charge carrier conditions. Unlike the observations in previous works, we find that MAPbI3 degradation starts at surface defects and progressing from the surface defects towards neighboring regions under high charge carrier density condition. By using PbI2 passivation, the defect-initiated degradation is significantly suppressed and the nanoplatelet degrades in a layer-by-layer way, enabling the MAPbI3 laser sustain for 4500 s (2.7*10^7 pulses), which is almost 3 times longer than that of the nanoplatelet laser without passivation. Meanwhile, the PbI2 passivated MAPbI3 nanoplatelet laser with the nanoplatelet cavity displaying a maximum quality factor up to ~7800, the highest reported for all MAPbI3 nanoplatelet cavities. Furthermore, a high stability MAPbI3 nanoplatelet laser that can last for 8500 s (5.1*10^7 pulses) is demonstrated based on a dual passivation strategy, by retarding the defect-initiated degradation and surface-initiated degradation, simultaneously. This work provides in-depth insights for understanding the degradation of perovskite at high charge carrier density. △ Less

Submitted 16 December, 2021; originally announced December 2021.

Comments: 37 pages,19 figures

arXiv:2111.13802 [pdf, other]

Factorized Fourier Neural Operators

Authors: Alasdair Tran, Alexander Mathews, Lexing Xie, Cheng Soon Ong

Abstract: We propose the Factorized Fourier Neural Operator (F-FNO), a learning-based approach for simulating partial differential equations (PDEs). Starting from a recently proposed Fourier representation of flow fields, the F-FNO bridges the performance gap between pure machine learning approaches to that of the best numerical or hybrid solvers. This is achieved with new representations - separable spectr… ▽ More We propose the Factorized Fourier Neural Operator (F-FNO), a learning-based approach for simulating partial differential equations (PDEs). Starting from a recently proposed Fourier representation of flow fields, the F-FNO bridges the performance gap between pure machine learning approaches to that of the best numerical or hybrid solvers. This is achieved with new representations - separable spectral layers and improved residual connections - and a combination of training strategies such as the Markov assumption, Gaussian noise, and cosine learning rate decay. On several challenging benchmark PDEs on regular grids, structured meshes, and point clouds, the F-FNO can scale to deeper networks and outperform both the FNO and the geo-FNO, reducing the error by 83% on the Navier-Stokes problem, 31% on the elasticity problem, 57% on the airfoil flow problem, and 60% on the plastic forging problem. Compared to the state-of-the-art pseudo-spectral method, the F-FNO can take a step size that is an order of magnitude larger in time and achieve an order of magnitude speedup to produce the same solution quality. △ Less

Submitted 2 March, 2023; v1 submitted 26 November, 2021; originally announced November 2021.

Comments: Published in The Eleventh International Conference on Learning Representations (2023). Code is available at https://github.com/alasdairtran/fourierflow

arXiv:2110.14820 [pdf, other]

doi 10.1038/s41524-022-00734-6

Recent Advances and Applications of Deep Learning Methods in Materials Science

Authors: Kamal Choudhary, Brian DeCost, Chi Chen, Anubhav Jain, Francesca Tavazza, Ryan Cohn, Cheol WooPark, Alok Choudhary, Ankit Agrawal, Simon J. L. Billinge, Elizabeth Holm, Shyue ** Ong, Chris Wolverton

Abstract: Deep learning (DL) is one of the fastest growing topics in materials data science, with rapidly emerging applications spanning atomistic, image-based, spectral, and textual data modalities. DL allows analysis of unstructured data and automated identification of features. Recent development of large materials databases has fueled the application of DL methods in atomistic prediction in particular.… ▽ More Deep learning (DL) is one of the fastest growing topics in materials data science, with rapidly emerging applications spanning atomistic, image-based, spectral, and textual data modalities. DL allows analysis of unstructured data and automated identification of features. Recent development of large materials databases has fueled the application of DL methods in atomistic prediction in particular. In contrast, advances in image and spectral data have largely leveraged synthetic data enabled by high quality forward models as well as by generative unsupervised DL methods. In this article, we present a high-level overview of deep-learning methods followed by a detailed discussion of recent developments of deep learning in atomistic simulation, materials imaging, spectral analysis, and natural language processing. For each modality we discuss applications involving both theoretical and experimental data, typical modeling approaches with their strengths and limitations, and relevant publicly available software and datasets. We conclude the review with a discussion of recent cross-cutting work related to uncertainty quantification in this field and a brief perspective on limitations, challenges, and potential growth areas for DL methods in materials science. The application of DL methods in materials science presents an exciting avenue for future materials discovery and design. △ Less

Submitted 27 October, 2021; originally announced October 2021.

Showing 1–50 of 158 results for author: Ong, S