Search | arXiv e-print repository

Towards Neural Scaling Laws for Foundation Models on Temporal Graphs

Authors: Razieh Shirzadkhani, Tran Gia Bao Ngo, Kiarash Shamsi, Shenyang Huang, Farimah Poursafaei, Poupak Azad, Reihaneh Rabbany, Baris Coskunuzer, Guillaume Rabusseau, Cuneyt Gurcan Akcora

Abstract: The field of temporal graph learning aims to learn from evolving network data to forecast future interactions. Given a collection of observed temporal graphs, is it possible to predict the evolution of an unseen network from the same domain? To answer this question, we first present the Temporal Graph Scaling (TGS) dataset, a large collection of temporal graphs consisting of eighty-four ERC20 toke… ▽ More The field of temporal graph learning aims to learn from evolving network data to forecast future interactions. Given a collection of observed temporal graphs, is it possible to predict the evolution of an unseen network from the same domain? To answer this question, we first present the Temporal Graph Scaling (TGS) dataset, a large collection of temporal graphs consisting of eighty-four ERC20 token transaction networks collected from 2017 to 2023. Next, we evaluate the transferability of Temporal Graph Neural Networks (TGNNs) for the temporal graph property prediction task by pre-training on a collection of up to sixty-four token transaction networks and then evaluating the downstream performance on twenty unseen token networks. We find that the neural scaling law observed in NLP and Computer Vision also applies in temporal graph learning, where pre-training on greater number of networks leads to improved downstream performance. To the best of our knowledge, this is the first empirical demonstration of the transferability of temporal graphs learning. On downstream token networks, the largest pre-trained model outperforms single model TGNNs on thirteen unseen test networks. Therefore, we believe that this is a promising first step towards building foundation models for temporal graphs. △ Less

Submitted 26 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

Comments: 17 pages, 15 figures, preprint version

arXiv:2406.06239 [pdf, other]

I-MPN: Inductive Message Passing Network for Effective and Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data

Authors: Hoang H. Le, Duy M. H. Nguyen, Omair Shahzad Bhatti, Laszlo Kopacsi, Thinh P. Ngo, Binh T. Nguyen, Michael Barz, Daniel Sonntag

Abstract: Understanding human visual processing in dynamic environments is essential for psychology and human-centered interaction design. Mobile eye-tracking systems, combining egocentric video and gaze signals, offer valuable insights. However, manual analysis of these recordings is time-intensive. In this work, we present a novel human-centered learning algorithm designed for automated object recognition… ▽ More Understanding human visual processing in dynamic environments is essential for psychology and human-centered interaction design. Mobile eye-tracking systems, combining egocentric video and gaze signals, offer valuable insights. However, manual analysis of these recordings is time-intensive. In this work, we present a novel human-centered learning algorithm designed for automated object recognition within mobile eye-tracking settings. Our approach seamlessly integrates an object detector with an inductive message-passing network technique (I-MPN), harnessing node features such as node profile information and positions. This integration enables our algorithm to learn embedding functions capable of generalizing to new object angle views, thereby facilitating rapid adaptation and efficient reasoning in dynamic contexts as users navigate through their environment. Through experiments conducted on three distinct video sequences, our \textit{interactive-based method} showcases significant performance improvements over fixed training/testing algorithms, even when trained on considerably smaller annotated samples collected through user feedback. Furthermore, we showcase exceptional efficiency in data annotation processes, surpassing approaches that use complete object detectors, combine detectors with convolutional networks, or employ interactive video segmentation. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: First version

arXiv:2406.05349 [pdf, other]

Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid

Authors: Thanh-Huy Nguyen, Thi Kim Ngan Ngo, Mai Anh Vu, Ting-Yuan Tu

Abstract: The ability of three-dimensional (3D) spheroid modeling to study the invasive behavior of breast cancer cells has drawn increased attention. The deep learning-based image processing framework is very effective at speeding up the cell morphological analysis process. Out-of-focus photos taken while capturing 3D cells under several z-slices, however, could negatively impact the deep learning model. I… ▽ More The ability of three-dimensional (3D) spheroid modeling to study the invasive behavior of breast cancer cells has drawn increased attention. The deep learning-based image processing framework is very effective at speeding up the cell morphological analysis process. Out-of-focus photos taken while capturing 3D cells under several z-slices, however, could negatively impact the deep learning model. In this work, we created a new algorithm to handle blurry images while preserving the stacked image quality. Furthermore, we proposed a unique training architecture that leverages consistency training to help reduce the bias of the model when dense-slice stacking is applied. Additionally, the model's stability is increased under the sparse-slice stacking effect by utilizing the self-training approach. The new blurring stacking technique and training flow are combined with the suggested architecture and self-training mechanism to provide an innovative yet easy-to-use framework. Our methods produced noteworthy experimental outcomes in terms of both quantitative and qualitative aspects. △ Less

Submitted 8 June, 2024; originally announced June 2024.

arXiv:2405.19821 [pdf]

Polarized sub-meV Photoluminescence in 2D PbS Nanoplatelets at Cryogenic Temperatures

Authors: Pengji Li, Leon Biesterfeld, Lars Klepzig, **gzhong Yang, Huu Thoai Ngo, Ahmed Addad, Tom N. Rakow, Ruolin Guan, Eddy P. Rugeramigabo, Louis Biadala, Jannika Lauth, Michael Zopf

Abstract: Colloidal semiconductor nanocrystals are promising materials for classical and quantum light sources due to their versatile chemistry and efficient photoluminescence (PL) properties. While visible emitters are well-established, the pursuit of excellent (near-)infrared sources continues. One notable candidate in this regard are photoluminescent two-dimensional (2D) PbS nanoplatelets (NPLs) exhibiti… ▽ More Colloidal semiconductor nanocrystals are promising materials for classical and quantum light sources due to their versatile chemistry and efficient photoluminescence (PL) properties. While visible emitters are well-established, the pursuit of excellent (near-)infrared sources continues. One notable candidate in this regard are photoluminescent two-dimensional (2D) PbS nanoplatelets (NPLs) exhibiting excitonic emission at 720 nm (1.7 eV) directly tying to the typical emission range limit of CdSe NPLs. Here, we present the first comprehensive analysis of low-temperature PL from this material class. Ultrathin 2D PbS NPLs exhibit high crystallinity confirmed by scanning transmission electron microscopy, and revealing Moire patterns in overlap** structures. At 4K, we observe unique PL features in single PbS NPLs, including narrow zero-phonon lines with line widths down to 0.6 meV and a linear degree of polarization up to 90%. Time-resolved measurements identify trions as the dominant emission source with a 2.3 ns decay time. Sub-meV spectral diffusion and no immanent blinking over minutes is observed, as well as discrete spectral jumps without memory effects. These findings advance the understanding and underpin the potential of colloidal PbS NPLs for optical and quantum technologies. △ Less

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.15843 [pdf, other]

SpotNet: An Image Centric, Lidar Anchored Approach To Long Range Perception

Authors: Louis Foucard, Samar Khanna, Yi Shi, Chi-Kuei Liu, Quinn Z Shen, Thuyen Ngo, Zi-Xiang Xia

Abstract: In this paper, we propose SpotNet: a fast, single stage, image-centric but LiDAR anchored approach for long range 3D object detection. We demonstrate that our approach to LiDAR/image sensor fusion, combined with the joint learning of 2D and 3D detection tasks, can lead to accurate 3D object detection with very sparse LiDAR support. Unlike more recent bird's-eye-view (BEV) sensor-fusion methods whi… ▽ More In this paper, we propose SpotNet: a fast, single stage, image-centric but LiDAR anchored approach for long range 3D object detection. We demonstrate that our approach to LiDAR/image sensor fusion, combined with the joint learning of 2D and 3D detection tasks, can lead to accurate 3D object detection with very sparse LiDAR support. Unlike more recent bird's-eye-view (BEV) sensor-fusion methods which scale with range $r$ as $O(r^2)$, SpotNet scales as $O(1)$ with range. We argue that such an architecture is ideally suited to leverage each sensor's strength, i.e. semantic understanding from images and accurate range finding from LiDAR data. Finally we show that anchoring detections on LiDAR points removes the need to regress distances, and so the architecture is able to transfer from 2MP to 8MP resolution images without re-training. △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.08843 [pdf, other]

FLEXIBLE: Forecasting Cellular Traffic by Leveraging Explicit Inductive Graph-Based Learning

Authors: Duc Thinh Ngo, Kandaraj Piamrat, Ons Aouedi, Thomas Hassan, Philippe Raipin-Parvédy

Abstract: From a telecommunication standpoint, the surge in users and services challenges next-generation networks with escalating traffic demands and limited resources. Accurate traffic prediction can offer network operators valuable insights into network conditions and suggest optimal allocation policies. Recently, spatio-temporal forecasting, employing Graph Neural Networks (GNNs), has emerged as a promi… ▽ More From a telecommunication standpoint, the surge in users and services challenges next-generation networks with escalating traffic demands and limited resources. Accurate traffic prediction can offer network operators valuable insights into network conditions and suggest optimal allocation policies. Recently, spatio-temporal forecasting, employing Graph Neural Networks (GNNs), has emerged as a promising method for cellular traffic prediction. However, existing studies, inspired by road traffic forecasting formulations, overlook the dynamic deployment and removal of base stations, requiring the GNN-based forecaster to handle an evolving graph. This work introduces a novel inductive learning scheme and a generalizable GNN-based forecasting model that can process diverse graphs of cellular traffic with one-time training. We also demonstrate that this model can be easily leveraged by transfer learning with minimal effort, making it applicable to different areas. Experimental results show up to 9.8% performance improvement compared to the state-of-the-art, especially in rare-data settings with training data reduced to below 20%. △ Less

Submitted 14 May, 2024; originally announced May 2024.

arXiv:2403.03466 [pdf, other]

Systematic Improvement of Quantum Monte Carlo Calculations in Transition Metal Oxides: sCI-Driven Wavefunction Optimization for Reliable Band Gap prediction

Authors: Hyeondeok Shin, Kevin Gasperich, Tomas Rojas, Anh T. Ngo, Jaron T. Krogel, Anouar Benali

Abstract: Accurate determination of electronic properties of correlated oxides remains a significant challenge for computational theory. Traditional Hubbard-corrected density functional theory (DFT+U) frequently encounters limitations in precisely capturing electron correlation, particularly when predicting band gaps. We introduce a systematic methodology to enhance the accuracy of diffusion Monte Carlo (DM… ▽ More Accurate determination of electronic properties of correlated oxides remains a significant challenge for computational theory. Traditional Hubbard-corrected density functional theory (DFT+U) frequently encounters limitations in precisely capturing electron correlation, particularly when predicting band gaps. We introduce a systematic methodology to enhance the accuracy of diffusion Monte Carlo (DMC) simulations for both ground and excited states, focusing on LiCoO$_2$ as a case study. By employing a selected CI (sCI) approach, we demonstrate the capability to optimize wavefunctions beyond the constraints of single-reference DFT+U trial wavefunctions. We show that the sCI framework enables accurate prediction of band gaps in LiCoO$_2$, closely aligning with experimental values and substantially improving upon traditional computational methods. The study uncovers a nuanced mixed state of $t_{2g}$ a $e_g$ orbitals at the band edges that is not captured by conventional single-reference methods, further elucidating the limitations of PBE+U in describing $d$-$d$ excitations. Our findings advocate for the adoption of beyond-DFT methodologies, such as sCI, to capture the essential physics of excited state wavefunctions in strongly correlated materials. The improved accuracy in band gap predictions and the ability to generate more reliable trial wavefunctions for DMC calculations underscore the potential of this approach for broader applications in the study of correlated oxides. This work not only provides a pathway for more accurate simulations of electronic structures in complex materials but also suggests a framework for future investigations into the excited states of other challenging systems. △ Less

Submitted 15 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

arXiv:2403.01077 [pdf, other]

doi 10.1149/1945-7111/ad4ac9

Exploring Li-ion Transport Properties of Li$_3$TiCl$_6$: A Machine Learning Molecular Dynamics Study

Authors: Selva Chandrasekaran Selvaraj, Volodymyr Koverga, Anh T. Ngo

Abstract: We performed large-scale molecular dynamics simulations based on a machine-learning force field (MLFF) to investigate the Li-ion transport mechanism in cation-disordered Li$_3$TiCl$_6$ cathode at six different temperatures, ranging from 25$^\mathrm{o}$C to 100$^\mathrm{o}$C. In this work, deep neural network method and data generated by $ab-initio$ molecular dynamics (AIMD) simulations were deploy… ▽ More We performed large-scale molecular dynamics simulations based on a machine-learning force field (MLFF) to investigate the Li-ion transport mechanism in cation-disordered Li$_3$TiCl$_6$ cathode at six different temperatures, ranging from 25$^\mathrm{o}$C to 100$^\mathrm{o}$C. In this work, deep neural network method and data generated by $ab-initio$ molecular dynamics (AIMD) simulations were deployed to build a high-fidelity MLFF. Radial distribution functions, Li-ion mean square displacements (MSD), diffusion coefficients, ionic conductivity, activation energy, and crystallographic direction-dependent migration barriers were calculated and compared with corresponding AIMD and experimental data to benchmark the accuracy of the MLFF. From MSD analysis, we captured both the self and distinct parts of Li-ion dynamics. The latter reveals that the Li-ions are involved in anti-correlation motion that was rarely reported for solid-state materials. Similarly, the self and distinct parts of Li-ion dynamics were used to determine Haven's ratio to describe the Li-ion transport mechanism in Li$_3$TiCl$_6$. Obtained trajectory from molecular dynamics infers that the Li-ion transportation is mainly through interstitial hop** which was confirmed by intra- and inter-layer Li-ion displacement with respect to simulation time. Ionic conductivity (1.06 mS/cm) and activation energy (0.29eV) calculated by our simulation are highly comparable with that of experimental values. Overall, the combination of machine-learning methods and AIMD simulations explains the intricate electrochemical properties of the Li$_3$TiCl$_6$ cathode with remarkably reduced computational time. Thus, our work strongly suggests that the deep neural network-based MLFF could be a promising method for large-scale complex materials. △ Less

Submitted 17 June, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

Comments: 8 pages with 6 figures

MSC Class: 81-XX Quantum theory 81-XX Quantum theory 81-xx ACM Class: J.2

Journal ref: Journal of the Electrochemical Society 171 (2024) 050544

arXiv:2402.13543 [pdf]

High-temperature stability of ambient-cured one-part alkali-activated materials incorporating graphene for thermal energy storage

Authors: Nghia Tran, Tuan Nguyen, Jay Black, Tuan Ngo

Abstract: In this research, the ambient cured one part alkali activated material (AAM) containing graphene nanoplatelets (GNPs), fly ash, slag and silica fume has been investigated after high temperature exposure to 200 to 800oC. Their compressive strength, thermal properties, microstructure, pore structure were characterised through visual observation, isothermal calorimetry, TGA, XRD, SEM-EDS and X-ray CT… ▽ More In this research, the ambient cured one part alkali activated material (AAM) containing graphene nanoplatelets (GNPs), fly ash, slag and silica fume has been investigated after high temperature exposure to 200 to 800oC. Their compressive strength, thermal properties, microstructure, pore structure were characterised through visual observation, isothermal calorimetry, TGA, XRD, SEM-EDS and X-ray CT. The research findings indicated high strength characteristics of the developed AAM (80 MPa) at ambient condition, which could further reach to approx. 100 MPa after being heated up to 400oC. GNPs provided nucleation effects for promoting geopolymerisation and crystallisation. As observed from X-ray CT, a high extent of severe cracks initiated from the core and propagated towards the surface. From SEM-EDS analysis, high Na-Al and Na-Si ratios or low Si-Al and Ca-Si ratios highly correlated to thermal stability. Overall, the research outcomes implied the promising use of the nano-engineered AAMs for thermal energy storage (TES) at 400 to 600oC. △ Less

Submitted 21 February, 2024; originally announced February 2024.

arXiv:2402.02655 [pdf, other]

VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension

Authors: Thinh Phuoc Ngo, Khoa Tran Anh Dang, Son T. Luu, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Abstract: This paper presents the development process of a Vietnamese spoken language corpus for machine reading comprehension (MRC) tasks and provides insights into the challenges and opportunities associated with using real-world data for machine reading comprehension tasks. The existing MRC corpora in Vietnamese mainly focus on formal written documents such as Wikipedia articles, online newspapers, or te… ▽ More This paper presents the development process of a Vietnamese spoken language corpus for machine reading comprehension (MRC) tasks and provides insights into the challenges and opportunities associated with using real-world data for machine reading comprehension tasks. The existing MRC corpora in Vietnamese mainly focus on formal written documents such as Wikipedia articles, online newspapers, or textbooks. In contrast, the VlogQA consists of 10,076 question-answer pairs based on 1,230 transcript documents sourced from YouTube -- an extensive source of user-uploaded content, covering the topics of food and travel. By capturing the spoken language of native Vietnamese speakers in natural settings, an obscure corner overlooked in Vietnamese research, the corpus provides a valuable resource for future research in reading comprehension tasks for the Vietnamese language. Regarding performance evaluation, our deep-learning models achieved the highest F1 score of 75.34% on the test set, indicating significant progress in machine reading comprehension for Vietnamese spoken language data. In terms of EM, the highest score we accomplished is 53.97%, which reflects the challenge in processing spoken-based content and highlights the need for further improvement. △ Less

Submitted 6 April, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

Comments: To appear as the main conference paper at EACL 2024

arXiv:2312.17738 [pdf, other]

Physics-informed Graphical Neural Network for Power System State Estimation

Authors: Quang-Ha Ngo, Bang L. H. Nguyen, Tuyen V. Vu, Jianhua Zhang, Tuan Ngo

Abstract: State estimation is highly critical for accurately observing the dynamic behavior of the power grids and minimizing risks from cyber threats. However, existing state estimation methods encounter challenges in accurately capturing power system dynamics, primarily because of limitations in encoding the grid topology and sparse measurements. This paper proposes a physics-informed graphical learning s… ▽ More State estimation is highly critical for accurately observing the dynamic behavior of the power grids and minimizing risks from cyber threats. However, existing state estimation methods encounter challenges in accurately capturing power system dynamics, primarily because of limitations in encoding the grid topology and sparse measurements. This paper proposes a physics-informed graphical learning state estimation method to address these limitations by leveraging both domain physical knowledge and a graph neural network (GNN). We employ a GNN architecture that can handle the graph-structured data of power systems more effectively than traditional data-driven methods. The physics-based knowledge is constructed from the branch current formulation, making the approach adaptable to both transmission and distribution systems. The validation results of three IEEE test systems show that the proposed method can achieve lower mean square error more than 20% than the conventional methods. △ Less

Submitted 29 December, 2023; originally announced December 2023.

Comments: 11 pages, 17 figures, journal accepted

arXiv:2312.10671 [pdf, other]

Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance

Authors: Phuc D. A. Nguyen, Tuan Duc Ngo, Evangelos Kalogerakis, Chuang Gan, Anh Tran, Cuong Pham, Khoi Nguyen

Abstract: We introduce Open3DIS, a novel solution designed to tackle the problem of Open-Vocabulary Instance Segmentation within 3D scenes. Objects within 3D environments exhibit diverse shapes, scales, and colors, making precise instance-level identification a challenging task. Recent advancements in Open-Vocabulary scene understanding have made significant strides in this area by employing class-agnostic… ▽ More We introduce Open3DIS, a novel solution designed to tackle the problem of Open-Vocabulary Instance Segmentation within 3D scenes. Objects within 3D environments exhibit diverse shapes, scales, and colors, making precise instance-level identification a challenging task. Recent advancements in Open-Vocabulary scene understanding have made significant strides in this area by employing class-agnostic 3D instance proposal networks for object localization and learning queryable features for each 3D mask. While these methods produce high-quality instance proposals, they struggle with identifying small-scale and geometrically ambiguous objects. The key idea of our method is a new module that aggregates 2D instance masks across frames and maps them to geometrically coherent point cloud regions as high-quality object proposals addressing the above limitations. These are then combined with 3D class-agnostic instance proposals to include a wide range of objects in the real world. To validate our approach, we conducted experiments on three prominent datasets, including ScanNet200, S3DIS, and Replica, demonstrating significant performance gains in segmenting objects with diverse categories over the state-of-the-art approaches. △ Less

Submitted 5 April, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

Comments: CVPR 2024. Project page: https://open3dis.github.io/

arXiv:2312.09871 [pdf, other]

ChemTime: Rapid and Early Classification for Multivariate Time Series Classification of Chemical Sensors

Authors: Alexander M. Moore, Randy C. Paffenroth, Kenneth T. Ngo, Joshua R. Uzarski

Abstract: Multivariate time series data are ubiquitous in the application of machine learning to problems in the physical sciences. Chemiresistive sensor arrays are highly promising in chemical detection tasks relevant to industrial, safety, and military applications. Sensor arrays are an inherently multivariate time series data collection tool which demand rapid and accurate classification of arbitrary che… ▽ More Multivariate time series data are ubiquitous in the application of machine learning to problems in the physical sciences. Chemiresistive sensor arrays are highly promising in chemical detection tasks relevant to industrial, safety, and military applications. Sensor arrays are an inherently multivariate time series data collection tool which demand rapid and accurate classification of arbitrary chemical analytes. Previous research has benchmarked data-agnostic multivariate time series classifiers across diverse multivariate time series supervised tasks in order to find general-purpose classification algorithms. To our knowledge, there has yet to be an effort to survey machine learning and time series classification approaches to chemiresistive hardware sensor arrays for the detection of chemical analytes. In addition to benchmarking existing approaches to multivariate time series classifiers, we incorporate findings from a model survey to propose the novel \textit{ChemTime} approach to sensor array classification for chemical sensing. We design experiments addressing the unique challenges of hardware sensor arrays classification including the rapid classification ability of classifiers and minimization of inference time while maintaining performance for deployed lightweight hardware sensing devices. We find that \textit{ChemTime} is uniquely positioned for the chemical sensing task by combining rapid and early classification of time series with beneficial inference and high accuracy. △ Less

Submitted 15 December, 2023; originally announced December 2023.

Comments: 14 pages, 12 figures

arXiv:2312.09462 [pdf, other]

Applying Machine Learning Models on Metrology Data for Predicting Device Electrical Performance

Authors: Bappaditya Dey, Anh Tuan Ngo, Sara Sacchi, Victor Blanco, Philippe Leray, Sandip Halder

Abstract: Moore Law states that transistor density will double every two years, which is sustained until today due to continuous multi-directional innovations, such as extreme ultraviolet lithography, novel patterning techniques etc., leading the semiconductor industry towards 3nm node and beyond. For any patterning scheme, the most important metric to evaluate the quality of printed patterns is EPE, with o… ▽ More Moore Law states that transistor density will double every two years, which is sustained until today due to continuous multi-directional innovations, such as extreme ultraviolet lithography, novel patterning techniques etc., leading the semiconductor industry towards 3nm node and beyond. For any patterning scheme, the most important metric to evaluate the quality of printed patterns is EPE, with overlay being its largest contribution. Overlay errors can lead to fatal failures of IC devices such as short circuits or broken connections in terms of P2P electrical contacts. Therefore, it is essential to develop effective overlay analysis and control techniques to ensure good functionality of fabricated semiconductor devices. In this work we have used an imec N14 BEOL process flow using LELE patterning technique to print metal layers with minimum pitch of 48nm with 193i lithography. FF structures are decomposed into two mask layers (M1A and M1B) and then the LELE flow is carried out to make the final patterns. Since a single M1 layer is decomposed into two masks, control of overlay between the two masks is critical. The goal of this work is of two-fold as, (a) to quantify the impact of overlay on capacitance and (b) to see if we can predict the final capacitance measurements with selected machine learning models at an early stage. To do so, scatterometry spectra are collected on these electrical test structures at (a)post litho, (b)post TiN hardmask etch, and (c)post Cu plating and CMP. Critical Dimension and overlay measurements for line-space pattern are done with SEM post litho, post etch and post Cu CMP. Various machine learning models are applied to do the capacitance prediction with multiple metrology inputs at different steps of wafer processing. Finally, we demonstrate that by using appropriate machine learning models we are able to do better prediction of electrical results. △ Less

Submitted 20 November, 2023; originally announced December 2023.

arXiv:2309.09400 [pdf, other]

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Authors: Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

Abstract: The driving factors behind the development of large language models (LLMs) with impressive learning capabilities are their colossal model sizes and extensive training datasets. Along with the progress in natural language processing, LLMs have been frequently made accessible to the public to foster deeper investigation and applications. However, when it comes to training datasets for these LLMs, es… ▽ More The driving factors behind the development of large language models (LLMs) with impressive learning capabilities are their colossal model sizes and extensive training datasets. Along with the progress in natural language processing, LLMs have been frequently made accessible to the public to foster deeper investigation and applications. However, when it comes to training datasets for these LLMs, especially the recent state-of-the-art models, they are often not fully disclosed. Creating training data for high-performing LLMs involves extensive cleaning and deduplication to ensure the necessary level of quality. The lack of transparency for training data has thus hampered research on attributing and addressing hallucination and bias issues in LLMs, hindering replication efforts and further advancements in the community. These challenges become even more pronounced in multilingual learning scenarios, where the available multilingual text datasets are often inadequately collected and cleaned. Consequently, there is a lack of open-source and readily usable dataset to effectively train LLMs in multiple languages. To overcome this issue, we present CulturaX, a substantial multilingual dataset with 6.3 trillion tokens in 167 languages, tailored for LLM development. Our dataset undergoes meticulous cleaning and deduplication through a rigorous pipeline of multiple stages to accomplish the best quality for model training, including language identification, URL-based filtering, metric-based cleaning, document refinement, and data deduplication. CulturaX is fully released to the public in HuggingFace to facilitate research and advancements in multilingual LLMs: https://huggingface.co/datasets/uonlp/CulturaX. △ Less

Submitted 17 September, 2023; originally announced September 2023.

Comments: Ongoing Work

arXiv:2309.03995 [pdf, other]

First-principle Study of Multiple Metastable Charge Ordering States in La$_{1/3}$Sr$_{2/3}$FeO$_{3}$

Authors: Nam Nguyen, Alex Taekyung Lee, Vijay Singh, Anh T. Ngo, Hyowon Park

Abstract: La doped SrFeO$_{3}$, La$_{1/3}$Sr$_{2/3}$FeO$_{3}$, exhibits a metal-to-insulator transition accompanied by both antiferromagnetic and charge ordering states along with the Fe-O bond disproportionation below a critical temperature near 200K. Unconventionally slow charge dynamics measured in this material near the critical temperature shows that its excited charge ordering states can exhibit novel… ▽ More La doped SrFeO$_{3}$, La$_{1/3}$Sr$_{2/3}$FeO$_{3}$, exhibits a metal-to-insulator transition accompanied by both antiferromagnetic and charge ordering states along with the Fe-O bond disproportionation below a critical temperature near 200K. Unconventionally slow charge dynamics measured in this material near the critical temperature shows that its excited charge ordering states can exhibit novel electronic structures with nontrivial energy profiles. Here, we reveal possible metastable states of charge ordering structures in La$_{1/3}$Sr$_{2/3}$FeO$_{3}$ using the first-principle and climbing image nudged elastic band methods. In the strong correlation regime, La$_{1/3}$Sr$_{2/3}$FeO$_{3}$ is an antiferromagnetic insulator with a charge ordering state of the big-small-big pattern, consistent with the experimental measurement of this material at the low temperature. As the correlation effect becomes weak, we find at least two possible metastable charge ordering states with the distinct Fe-O bond disproportionation. Remarkably, a ferroelectric metallic state emerges with the small energy barrier of $\sim$7 meV, driven by a metastable CO state of the small-medium-big pattern. The electronic structures of these metastable charge ordering states are noticeably different from those of the ground-state. Our results can provide an insightful explanation to multiple metastable charge ordering states and the slow charge dynamics of this and related oxide materials. △ Less

Submitted 7 September, 2023; originally announced September 2023.

Comments: The paper has 8 pages and 6 figures

arXiv:2308.04043 [pdf, other]

doi 10.1103/PhysRevB.108.205122

Delocalized polaron and Burstein-Moss shift induced by Li in $α$-$\textrm{V}_{2}\textrm{O}_{5}$: DFT+DMFT study

Authors: Huu T. Do, Alex Taekyung Lee, Hyowon Park, Anh T. Ngo

Abstract: We performed density functional theory (DFT)+$U$ and dynamical mean field theory (DMFT) calculations with continuous time quantum Monte Carlo impurity solver to investigate the electronic properties of V$_2$O$_5$ and Li$_x$V$_2$O$_5$ ($x$ = 0.125 and 0.25). Pristine V$_2$O$_5$ is a charge-transfer insulator with strong O $p$-V $d$ hybridization, and exhibits a large band gap ($E_{\textrm{gap}}$) a… ▽ More We performed density functional theory (DFT)+$U$ and dynamical mean field theory (DMFT) calculations with continuous time quantum Monte Carlo impurity solver to investigate the electronic properties of V$_2$O$_5$ and Li$_x$V$_2$O$_5$ ($x$ = 0.125 and 0.25). Pristine V$_2$O$_5$ is a charge-transfer insulator with strong O $p$-V $d$ hybridization, and exhibits a large band gap ($E_{\textrm{gap}}$) as well as non-zero conduction band (CB) gap. We show that the band gap, the number of $d$ electrons of vanadium, $N_d$, and conduction band (CB) gap for V$_2$O$_5$ obtained from our DMFT calculations are in excellent agreement with the experimental values. While the DFT+$U$ approach replicates the experimental band gap, it overestimates the value of $N_d$ and underestimates the CB gap. In the presence of low Li do**, the electronic properties of V$_2$O$_5$ are mainly driven by a polaronic mechanism, the electron spin resonance and electron nuclear double resonance spectroscopies observed the coexistence of free and bound polarons. Notably, our DMFT results identify both polaron types, with the bound polaron being energetically preferred, while DFT+$U$ method predicts only the free polaron. Our DMFT analysis also reveals that increased Li do** leads to electron filling in the conduction band, shifting the Fermi level, this result consistent with the observed Burstein-Moss shift upon enhanced Li do** and we thus demonstrate that the DFT+DMFT approach can be used for accurate and realistic description of strongly correlated materials. △ Less

Submitted 27 November, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

Comments: 12 pages, 13 figures

Journal ref: Physical Review B 108, 205122 (2023)

arXiv:2307.16039 [pdf, other]

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Authors: Viet Dac Lai, Chien Van Nguyen, Nghia Trung Ngo, Thuat Nguyen, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

Abstract: A key technology for the development of large language models (LLMs) involves instruction tuning that helps align the models' responses with human expectations to realize impressive learning abilities. Two major approaches for instruction tuning characterize supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), which are currently applied to produce the best commercia… ▽ More A key technology for the development of large language models (LLMs) involves instruction tuning that helps align the models' responses with human expectations to realize impressive learning abilities. Two major approaches for instruction tuning characterize supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), which are currently applied to produce the best commercial LLMs (e.g., ChatGPT). To improve the accessibility of LLMs for research and development efforts, various instruction-tuned open-source LLMs have also been introduced recently, e.g., Alpaca, Vicuna, to name a few. However, existing open-source LLMs have only been instruction-tuned for English and a few popular languages, thus hindering their impacts and accessibility to many other languages in the world. Among a few very recent work to explore instruction tuning for LLMs in multiple languages, SFT has been used as the only approach to instruction-tune LLMs for multiple languages. This has left a significant gap for fine-tuned LLMs based on RLHF in diverse languages and raised important questions on how RLHF can boost the performance of multilingual instruction tuning. To overcome this issue, we present Okapi, the first system with instruction-tuned LLMs based on RLHF for multiple languages. Okapi introduces instruction and response-ranked data in 26 diverse languages to facilitate the experiments and development of future multilingual LLM research. We also present benchmark datasets to enable the evaluation of generative LLMs in multiple languages. Our experiments demonstrate the advantages of RLHF for multilingual instruction over SFT for different base models and datasets. Our framework and resources are released at https://github.com/nlp-uoregon/Okapi. △ Less

Submitted 1 August, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

arXiv:2307.13251 [pdf, other]

GaPro: Box-Supervised 3D Point Cloud Instance Segmentation Using Gaussian Processes as Pseudo Labelers

Authors: Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen

Abstract: Instance segmentation on 3D point clouds (3DIS) is a longstanding challenge in computer vision, where state-of-the-art methods are mainly based on full supervision. As annotating ground truth dense instance masks is tedious and expensive, solving 3DIS with weak supervision has become more practical. In this paper, we propose GaPro, a new instance segmentation for 3D point clouds using axis-aligned… ▽ More Instance segmentation on 3D point clouds (3DIS) is a longstanding challenge in computer vision, where state-of-the-art methods are mainly based on full supervision. As annotating ground truth dense instance masks is tedious and expensive, solving 3DIS with weak supervision has become more practical. In this paper, we propose GaPro, a new instance segmentation for 3D point clouds using axis-aligned 3D bounding box supervision. Our two-step approach involves generating pseudo labels from box annotations and training a 3DIS network with the resulting labels. Additionally, we employ the self-training strategy to improve the performance of our method further. We devise an effective Gaussian Process to generate pseudo instance masks from the bounding boxes and resolve ambiguities when they overlap, resulting in pseudo instance masks with their uncertainty values. Our experiments show that GaPro outperforms previous weakly supervised 3D instance segmentation methods and has competitive performance compared to state-of-the-art fully supervised ones. Furthermore, we demonstrate the robustness of our approach, where we can adapt various state-of-the-art fully supervised methods to the weak supervision task by using our pseudo labels for training. The source code and trained models are available at https://github.com/VinAIResearch/GaPro. △ Less

Submitted 25 July, 2023; originally announced July 2023.

Comments: Accepted to ICCV 2023

arXiv:2305.13730 [pdf, ps, other]

A matrix variant of the Erdős-Falconer distance problems over finite field

Authors: Hieu T. Ngo

Abstract: We study a matrix analog of the Erdős-Falconer distance problems in vector spaces over finite fields. There arises an interesting analysis of certain quadratic matrix Gauss sums. We study a matrix analog of the Erdős-Falconer distance problems in vector spaces over finite fields. There arises an interesting analysis of certain quadratic matrix Gauss sums. △ Less

Submitted 23 May, 2023; originally announced May 2023.

Comments: 17 pages

MSC Class: 11T24; 52C10

arXiv:2305.13473 [pdf]

Impact of Electron-Withdrawing Groups on Ion Transport and Structure in Lithium Borate Ionic Liquids

Authors: Volodymyr Koverga, Selvaraj S. Chandrasekaran, Anh T. Ngo

Abstract: Among the distinctive structural features of lithium ionic liquids (LILs), a novel class of single-component electrolytes, the variation of the electron-withdrawing group stands out as a key factor in determining their dynamics. To understand this phenomenon, we conducted molecular dynamics (MD) simulations for LILs based on hexafluoro-2-propanoxy (LIL2), hexafluoro-2-methyl-2-propanoxy (LIL4), an… ▽ More Among the distinctive structural features of lithium ionic liquids (LILs), a novel class of single-component electrolytes, the variation of the electron-withdrawing group stands out as a key factor in determining their dynamics. To understand this phenomenon, we conducted molecular dynamics (MD) simulations for LILs based on hexafluoro-2-propanoxy (LIL2), hexafluoro-2-methyl-2-propanoxy (LIL4), and trifluoro-2-propanoxy (LIL6) derivatives. Results revealed that correlated ion dynamics govern the general transport characteristics in LILs, while the electron-withdrawing group regulates the Li transport mechanism. Upon saturation by fluorine atoms, LILs exhibit higher inhomogeneity in their transport and structure properties. Strong coordination along the ethoxide group promotes jumps of Li across positive domains, while in fluorine-poor LILs, stronger coordination in proximity to boron atoms carries the anion along Li transport. Understanding the results of MD simulation will aid the further design and widespread use of this class of electrolytes in production of the energy storage and conversion devices △ Less

Submitted 26 February, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

arXiv:2305.08336 [pdf, other]

Inverse Rendering of Translucent Objects using Physical and Neural Renderers

Authors: Chenhao Li, Trung Thanh Ngo, Hajime Nagahara

Abstract: In this work, we propose an inverse rendering model that estimates 3D shape, spatially-varying reflectance, homogeneous subsurface scattering parameters, and an environment illumination jointly from only a pair of captured images of a translucent object. In order to solve the ambiguity problem of inverse rendering, we use a physically-based renderer and a neural renderer for scene reconstruction a… ▽ More In this work, we propose an inverse rendering model that estimates 3D shape, spatially-varying reflectance, homogeneous subsurface scattering parameters, and an environment illumination jointly from only a pair of captured images of a translucent object. In order to solve the ambiguity problem of inverse rendering, we use a physically-based renderer and a neural renderer for scene reconstruction and material editing. Because two renderers are differentiable, we can compute a reconstruction loss to assist parameter estimation. To enhance the supervision of the proposed neural renderer, we also propose an augmented loss. In addition, we use a flash and no-flash image pair as the input. To supervise the training, we constructed a large-scale synthetic dataset of translucent objects, which consists of 117K scenes. Qualitative and quantitative results on both synthetic and real-world datasets demonstrated the effectiveness of the proposed model. △ Less

Submitted 15 May, 2023; originally announced May 2023.

Comments: Accepted to CVPR2023

arXiv:2304.07459 [pdf, other]

doi 10.1109/TIP.2023.3267621

Instance-level Few-shot Learning with Class Hierarchy Mining

Authors: Anh-Khoa Nguyen Vu, Thanh-Toan Do, Nhat-Duy Nguyen, Vinh-Tiep Nguyen, Thanh Duc Ngo, Tam V. Nguyen

Abstract: Few-shot learning is proposed to tackle the problem of scarce training data in novel classes. However, prior works in instance-level few-shot learning have paid less attention to effectively utilizing the relationship between categories. In this paper, we exploit the hierarchical information to leverage discriminative and relevant features of base classes to effectively classify novel objects. The… ▽ More Few-shot learning is proposed to tackle the problem of scarce training data in novel classes. However, prior works in instance-level few-shot learning have paid less attention to effectively utilizing the relationship between categories. In this paper, we exploit the hierarchical information to leverage discriminative and relevant features of base classes to effectively classify novel objects. These features are extracted from abundant data of base classes, which could be utilized to reasonably describe classes with scarce data. Specifically, we propose a novel superclass approach that automatically creates a hierarchy considering base and novel classes as fine-grained classes for few-shot instance segmentation (FSIS). Based on the hierarchical information, we design a novel framework called Soft Multiple Superclass (SMS) to extract relevant features or characteristics of classes in the same superclass. A new class assigned to the superclass is easier to classify by leveraging these relevant features. Besides, in order to effectively train the hierarchy-based-detector in FSIS, we apply the label refinement to further describe the associations between fine-grained classes. The extensive experiments demonstrate the effectiveness of our method on FSIS benchmarks. Code is available online. △ Less

Submitted 14 April, 2023; originally announced April 2023.

Comments: accepted by IEEE Transactions on Image Processing

arXiv:2304.07444 [pdf, other]

The Art of Camouflage: Few-shot Learning for Animal Detection and Segmentation

Authors: Thanh-Danh Nguyen, Anh-Khoa Nguyen Vu, Nhat-Duy Nguyen, Vinh-Tiep Nguyen, Thanh Duc Ngo, Thanh-Toan Do, Minh-Triet Tran, Tam V. Nguyen

Abstract: Camouflaged object detection and segmentation is a new and challenging research topic in computer vision. There is a serious issue of lacking data of camouflaged objects such as camouflaged animals in natural scenes. In this paper, we address the problem of few-shot learning for camouflaged object detection and segmentation. To this end, we first collect a new dataset, CAMO-FS, for the benchmark.… ▽ More Camouflaged object detection and segmentation is a new and challenging research topic in computer vision. There is a serious issue of lacking data of camouflaged objects such as camouflaged animals in natural scenes. In this paper, we address the problem of few-shot learning for camouflaged object detection and segmentation. To this end, we first collect a new dataset, CAMO-FS, for the benchmark. We then propose a novel method to efficiently detect and segment the camouflaged objects in the images. In particular, we introduce the instance triplet loss and the instance memory storage. The extensive experiments demonstrated that our proposed method achieves state-of-the-art performance on the newly collected dataset. △ Less

Submitted 21 January, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

Comments: Under-review Journal

arXiv:2304.05613 [pdf, other]

ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning

Authors: Viet Dac Lai, Nghia Trung Ngo, Amir Pouran Ben Veyseh, Hieu Man, Franck Dernoncourt, Trung Bui, Thien Huu Nguyen

Abstract: Over the last few years, large language models (LLMs) have emerged as the most important breakthroughs in natural language processing (NLP) that fundamentally transform research and developments in the field. ChatGPT represents one of the most exciting LLM systems developed recently to showcase impressive skills for language generation and highly attract public attention. Among various exciting ap… ▽ More Over the last few years, large language models (LLMs) have emerged as the most important breakthroughs in natural language processing (NLP) that fundamentally transform research and developments in the field. ChatGPT represents one of the most exciting LLM systems developed recently to showcase impressive skills for language generation and highly attract public attention. Among various exciting applications discovered for ChatGPT in English, the model can process and generate texts for multiple languages due to its multilingual training data. Given the broad adoption of ChatGPT for English in different problems and areas, a natural question is whether ChatGPT can also be applied effectively for other languages or it is necessary to develop more language-specific technologies. The answer to this question requires a thorough evaluation of ChatGPT over multiple tasks with diverse languages and large datasets (i.e., beyond reported anecdotes), which is still missing or limited in current research. Our work aims to fill this gap for the evaluation of ChatGPT and similar LLMs to provide more comprehensive information for multilingual NLP applications. While this work will be an ongoing effort to include additional experiments in the future, our current paper evaluates ChatGPT on 7 different tasks, covering 37 diverse languages with high, medium, low, and extremely low resources. We also focus on the zero-shot learning setting for ChatGPT to improve reproducibility and better simulate the interactions of general users. Compared to the performance of previous models, our extensive experimental results demonstrate a worse performance of ChatGPT for different NLP tasks and languages, calling for further research to develop better models and understanding for multilingual learning. △ Less

Submitted 12 April, 2023; originally announced April 2023.

arXiv:2304.01934 [pdf, other]

doi 10.1103/PhysRevB.108.205146

Effect of Off-Diagonal Elements in Wannier Hamiltonian on DFT+DMFT for low-symmetry material: Study of Li$_2$MnO$_3$

Authors: Alex Taekyung Lee, Hyowon Park, Anh T. Ngo

Abstract: We study the effect of the off-diagonal elements of the Wannier Hamiltonian on the electronic structure of low-symmetry material Li$_2$MnO$_3$ ($C2/m$), using dynamical mean field theory calculations with continuous-time Quantum Monte Carlo impurity solver. Presence of significant off-diagonal elements leads to a pronounced suppression of the energy gap. The off-diagonal elements are largest when… ▽ More We study the effect of the off-diagonal elements of the Wannier Hamiltonian on the electronic structure of low-symmetry material Li$_2$MnO$_3$ ($C2/m$), using dynamical mean field theory calculations with continuous-time Quantum Monte Carlo impurity solver. Presence of significant off-diagonal elements leads to a pronounced suppression of the energy gap. The off-diagonal elements are largest when the Wannier projection is used based on the global coordinate, and they remain substantial even with the projection using the local coordinate close to the direction of Mn-O bonds. We show that the energy gap is enhanced by the diagonalization of the Mn $d$ block in the full $p$-$d$ Hamiltonian, with applying unitary rotation matrix. Additionally, the inclusion of a small double counting energy is crucial for achieving the experimental gap by reducing $p$-$d$ hybridization. Furthermore, we establish the efficiency of a low-energy ($d$-only basis) model for studying the electronic structure of Li$_2$MnO$3$, as the Wannier basis represents a hybridized state of Mn $d$ and O $p$ orbitals. These findings suggest an appropriate new approach for investigating low-symmetry materials using the DFT+DMFT method. To the best of our knowledge, no systematic study of the effect of off-diagonal terms has been conducted thus far. We also find that the antiferromagnetic ground state $Γ_{2u}$ is stable with $U \leq 2$ eV within density functional theory+$U$ calculations, which is much smaller than widely used $U$=5 eV. △ Less

Submitted 27 November, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

Comments: 13 pages, 10 figures

Journal ref: Physical Review B 108, 205146 (2023)

arXiv:2304.00969 [pdf, other]

doi 10.1145/3503252.3531304

Is More Always Better? The Effects of Personal Characteristics and Level of Detail on the Perception of Explanations in a Recommender System

Authors: Mohamed Amine Chatti, Mouadh Guesmi, Laura Vorgerd, Thao Ngo, Shoeb Joarder, Qurat Ul Ain, Arham Muslim

Abstract: Despite the acknowledgment that the perception of explanations may vary considerably between end-users, explainable recommender systems (RS) have traditionally followed a one-size-fits-all model, whereby the same explanation level of detail is provided to each user, without taking into consideration individual user's context, i.e., goals and personal characteristics. To fill this research gap, we… ▽ More Despite the acknowledgment that the perception of explanations may vary considerably between end-users, explainable recommender systems (RS) have traditionally followed a one-size-fits-all model, whereby the same explanation level of detail is provided to each user, without taking into consideration individual user's context, i.e., goals and personal characteristics. To fill this research gap, we aim in this paper at a shift from a one-size-fits-all to a personalized approach to explainable recommendation by giving users agency in deciding which explanation they would like to see. We developed a transparent Recommendation and Interest Modeling Application (RIMA) that provides on-demand personalized explanations of the recommendations, with three levels of detail (basic, intermediate, advanced) to meet the demands of different types of end-users. We conducted a within-subject study (N=31) to investigate the relationship between user's personal characteristics and the explanation level of detail, and the effects of these two variables on the perception of the explainable RS with regard to different explanation goals. Our results show that the perception of explainable RS with different levels of detail is affected to different degrees by the explanation goal and user type. Consequently, we suggested some theoretical and design guidelines to support the systematic design of explanatory interfaces in RS tailored to the user's context. △ Less

Submitted 3 April, 2023; originally announced April 2023.

Comments: Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization (UMAP'22)

arXiv:2303.00246 [pdf, other]

ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution

Authors: Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen

Abstract: Existing 3D instance segmentation methods are predominated by the bottom-up design -- manually fine-tuned algorithm to group points into clusters followed by a refinement network. However, by relying on the quality of the clusters, these methods generate susceptible results when (1) nearby objects with the same semantic class are packed together, or (2) large objects with loosely connected regions… ▽ More Existing 3D instance segmentation methods are predominated by the bottom-up design -- manually fine-tuned algorithm to group points into clusters followed by a refinement network. However, by relying on the quality of the clusters, these methods generate susceptible results when (1) nearby objects with the same semantic class are packed together, or (2) large objects with loosely connected regions. To address these limitations, we introduce ISBNet, a novel cluster-free method that represents instances as kernels and decodes instance masks via dynamic convolution. To efficiently generate high-recall and discriminative kernels, we propose a simple strategy named Instance-aware Farthest Point Sampling to sample candidates and leverage the local aggregation layer inspired by PointNet++ to encode candidate features. Moreover, we show that predicting and leveraging the 3D axis-aligned bounding boxes in the dynamic convolution further boosts performance. Our method set new state-of-the-art results on ScanNetV2 (55.9), S3DIS (60.8), and STPLS3D (49.2) in terms of AP and retains fast inference time (237ms per scene on ScanNetV2). The source code and trained models are available at https://github.com/VinAIResearch/ISBNet. △ Less

Submitted 26 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

Comments: Accepted to CVPR 2023

arXiv:2302.04917 [pdf, other]

ChemVise: Maximizing Out-of-Distribution Chemical Detection with the Novel Application of Zero-Shot Learning

Authors: Alexander M. Moore, Randy C. Paffenroth, Ken T. Ngo, Joshua R. Uzarski

Abstract: Accurate chemical sensors are vital in medical, military, and home safety applications. Training machine learning models to be accurate on real world chemical sensor data requires performing many diverse, costly experiments in controlled laboratory settings to create a data set. In practice even expensive, large data sets may be insufficient for generalization of a trained model to a real-world te… ▽ More Accurate chemical sensors are vital in medical, military, and home safety applications. Training machine learning models to be accurate on real world chemical sensor data requires performing many diverse, costly experiments in controlled laboratory settings to create a data set. In practice even expensive, large data sets may be insufficient for generalization of a trained model to a real-world testing distribution. Rather than perform greater numbers of experiments requiring exhaustive mixtures of chemical analytes, this research proposes learning approximations of complex exposures from training sets of simple ones by using single-analyte exposure signals as building blocks of a multiple-analyte space. We demonstrate this approach to synthetic sensor responses surprisingly improves the detection of out-of-distribution obscured chemical analytes. Further, we pair these synthetic signals to targets in an information-dense representation space utilizing a large corpus of chemistry knowledge. Through utilization of a semantically meaningful analyte representation spaces along with synthetic targets we achieve rapid analyte classification in the presence of obscurants without corresponding obscured-analyte training data. Transfer learning for supervised learning with molecular representations makes assumptions about the input data. Instead, we borrow from the natural language and natural image processing literature for a novel approach to chemical sensor signal classification using molecular semantics for arbitrary chemical sensor hardware designs. △ Less

Submitted 9 February, 2023; originally announced February 2023.

Comments: 12 pages, 14 figures

arXiv:2302.02255 [pdf, other]

Human-Imperceptible Identification with Learnable Lensless Imaging

Authors: Thuong Nguyen Canh, Trung Thanh Ngo, Hajime Nagahara

Abstract: Lensless imaging protects visual privacy by capturing heavily blurred images that are imperceptible for humans to recognize the subject but contain enough information for machines to infer information. Unfortunately, protecting visual privacy comes with a reduction in recognition accuracy and vice versa. We propose a learnable lensless imaging framework that protects visual privacy while maintaini… ▽ More Lensless imaging protects visual privacy by capturing heavily blurred images that are imperceptible for humans to recognize the subject but contain enough information for machines to infer information. Unfortunately, protecting visual privacy comes with a reduction in recognition accuracy and vice versa. We propose a learnable lensless imaging framework that protects visual privacy while maintaining recognition accuracy. To make captured images imperceptible to humans, we designed several loss functions based on total variation, invertibility, and the restricted isometry property. We studied the effect of privacy protection with blurriness on the identification of personal identity via a quantitative method based on a subjective evaluation. Moreover, we validate our simulation by implementing a hardware realization of lensless imaging with photo-lithographically printed masks. △ Less

Submitted 4 February, 2023; originally announced February 2023.

arXiv:2212.12960 [pdf, other]

Quantum enhanced probing of multilayered-samples

Authors: Mayte Y. Li-Gomez, Pablo D. Yepiz-Graciano, Taras Hrushevskyi, Omar Calderon-Losada, Erhan Saglamyurek, Dorilian Lopez-Mago, Vahid Salari, Trong Ngo, Alfred B. U'Ren, Shabir Barzanjeh

Abstract: Quantum sensing exploits quantum phenomena to enhance the detection and estimation of classical parameters of physical systems and biological entities, particularly so as to overcome the inefficiencies of its classical counterparts. A particularly promising approach within quantum sensing is Quantum Optical Coherence Tomography which relies on non-classical light sources to reconstruct the interna… ▽ More Quantum sensing exploits quantum phenomena to enhance the detection and estimation of classical parameters of physical systems and biological entities, particularly so as to overcome the inefficiencies of its classical counterparts. A particularly promising approach within quantum sensing is Quantum Optical Coherence Tomography which relies on non-classical light sources to reconstruct the internal structure of multilayered materials. Compared to traditional classical probing, Quantum Optical Coherence Tomography provides enhanced-resolution images and is unaffected by even-order dispersion. One of the main limitations of this technique lies in the appearance of artifacts and echoes, i.e. fake structures that appear in the coincidence interferogram, which hinder the retrieval of information required for tomography scans. Here, by utilizing a full theoretical model, in combination with a fast genetic algorithm to post-process the data, we successfully extract the morphology of complex multilayered samples and thoroughly distinguish real interfaces, artifacts, and echoes. We test the effectiveness of the model and algorithm by comparing its predictions to experimentally-generated interferograms through the controlled variation of the pump wavelength. Our results could potentially lead to the development of practical high-resolution probing of complex structures and non-invasive scanning of photo-degradable materials for biomedical imaging/sensing, clinical applications, and materials science. △ Less

Submitted 12 May, 2023; v1 submitted 25 December, 2022; originally announced December 2022.

arXiv:2212.08772 [pdf, other]

doi 10.1103/PhysRevB.108.085407

Spin and electronic excitations in $4f$ atomic chains on Au(111) substrates

Authors: David W. Facemyer, Naveen K. Dandu, Alex Taekyung Lee, Vijay R. Singh, Anh T. Ngo, Sergio E. Ulloa

Abstract: High spin systems, like those that incorporate rare-earth $4f$ elements (REEs), are increasingly relevant in many fields. Although research in such systems is sparse, the large Hilbert spaces they occupy are promising for many applications. In this work, we examine a one-dimensional linear array of europium (Eu) atoms on a Au(111) surface and study their electronic and magnetic excitations. Ab ini… ▽ More High spin systems, like those that incorporate rare-earth $4f$ elements (REEs), are increasingly relevant in many fields. Although research in such systems is sparse, the large Hilbert spaces they occupy are promising for many applications. In this work, we examine a one-dimensional linear array of europium (Eu) atoms on a Au(111) surface and study their electronic and magnetic excitations. Ab initio calculations using VASP with PBE+U are employed to study the structure. We find Eu atoms to have a net charge when on gold, consistent with a net magnetic momemt of $\simeq 3.5 μ_B$. Examining various spin-projection configurations, we can evaluate first and second neighbor exchange energies in an isotropic Heisenberg model between spin-$\frac{7}{2}$ moments to obtain $J_1 \approx -1.2 \, \mathrm{K}$ and $J_2 \approx 0.2 \, \mathrm{K}$ for the relaxed-chain atomic separation of $a \approx 5$ $\mathrm{\dot{A}}$. These parameters are used to obtain the full spin excitation spectrum of a physically realizable four-atom chain. The large $|J_1|/J_2$ ratio results in a highly degenerate ferromagnetic ground state that is split by a significant easy plane single ion anisotropy of $0.6$ K. Spin-flip excitations are calculated to extract differential conductance profiles as those obtained by scanning tunneling microscopy techniques. We uncover interesting behavior of local spin excitations, especially as we track their dispersion with applied magnetic fields. △ Less

Submitted 19 July, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

arXiv:2211.07833 [pdf]

doi 10.1016/j.apenergy.2023.120817

Optimal sizing of renewable energy storage: A comparative study of hydrogen and battery system considering degradation and seasonal storage

Authors: Son Tay Le, Tuan Ngoc Nguyen, Dac-Khuong Bui, Tuan Duc Ngo

Abstract: Renewable energy storage (RES) is essential to address the intermittence issues of renewable energy systems, thereby enhancing the system stability and reliability. This study presents an optimisation study of sizing and operational strategy parameters of a grid-connected photovoltaic (PV)-hydrogen/battery systems using a Multi-Objective Modified Firefly Algorithm (MOMFA). An operational strategy… ▽ More Renewable energy storage (RES) is essential to address the intermittence issues of renewable energy systems, thereby enhancing the system stability and reliability. This study presents an optimisation study of sizing and operational strategy parameters of a grid-connected photovoltaic (PV)-hydrogen/battery systems using a Multi-Objective Modified Firefly Algorithm (MOMFA). An operational strategy that utilises the ability of hydrogen to store energy over a long time was also investigated. The proposed method was applied to a real-world distributed energy project located in the tropical climate zone. To further demonstrate the robustness and versatility of the method, another synthetic test case was examined for a location in the subtropical weather zone, which has a high seasonal mismatch. The performance of the proposed MOMFA method is compared with the NSGA-II method, which has been widely used to design renewable energy storage systems in the literature. The result shows that MOMFA is more accurate and robust than NSGA-II owing to the complex and dynamic nature of energy storage system. The optimisation results show that battery storage systems, as a mature technology, yield better economic performance than current hydrogen storage systems. However, it is proven that hydrogen storage systems provide better techno-economic performance and can be a viable long-term storage solution when high penetration of renewable energy is required. The study also proves that the proposed long-term operational strategy can lower component degradation, enhance efficiency, and increase the total economic performance of hydrogen storage systems. The findings of this study can support the implementation of energy storage systems for renewable energy. △ Less

Submitted 14 November, 2022; originally announced November 2022.

arXiv:2209.13875 [pdf, other]

A General Scattering Phase Function for Inverse Rendering

Authors: Thanh-Trung Ngo, Hajime Nagahara

Abstract: We tackle the problem of modeling light scattering in homogeneous translucent material and estimating its scattering parameters. A scattering phase function is one of such parameters which affects the distribution of scattered radiation. It is the most complex and challenging parameter to be modeled in practice, and empirical phase functions are usually used. Empirical phase functions (such as Hen… ▽ More We tackle the problem of modeling light scattering in homogeneous translucent material and estimating its scattering parameters. A scattering phase function is one of such parameters which affects the distribution of scattered radiation. It is the most complex and challenging parameter to be modeled in practice, and empirical phase functions are usually used. Empirical phase functions (such as Henyey-Greenstein (HG) phase function or its modified ones) are usually presented and limited to a specific range of scattering materials. This limitation raises concern for an inverse rendering problem where the target material is generally unknown. In such a situation, a more general phase function is preferred. Although there exists such a general phase function in the polynomial form using a basis such as Legendre polynomials \cite{Fowler1983}, inverse rendering with this phase function is not straightforward. This is because the base polynomials may be negative somewhere, while a phase function cannot. This research presents a novel general phase function that can avoid this issue and an inverse rendering application using this phase function. The proposed phase function was positively evaluated with a wide range of materials modeled with Mie scattering theory. The scattering parameters estimation with the proposed phase function was evaluated with simulation and real-world experiments. △ Less

Submitted 28 September, 2022; originally announced September 2022.

arXiv:2209.03672 [pdf]

Observation of strange metal in hole-doped valley-spin insulator

Authors: Tuan Dung Nguyen, Baithi Mallesh, Seon Je Kim, Houcine Bouzid, Byeongwook Cho, Xuan Phu Le, Tien Dat Ngo, Won Jong Yoo, Young-Min Kim, Dinh Loc Duong, Young Hee Lee

Abstract: Temperature-linear resistance at low temperatures in strange metals is an exotic characteristic of strong correlation systems, as observed in high-TC superconducting cuprates, heavy fermions, Fe-based superconductors, ruthenates, and twisted bilayer graphene. Here, we introduce a hole-doped valley-spin insulator, V-doped WSe2, with hole pockets in the valence band. The strange metal characteristic… ▽ More Temperature-linear resistance at low temperatures in strange metals is an exotic characteristic of strong correlation systems, as observed in high-TC superconducting cuprates, heavy fermions, Fe-based superconductors, ruthenates, and twisted bilayer graphene. Here, we introduce a hole-doped valley-spin insulator, V-doped WSe2, with hole pockets in the valence band. The strange metal characteristic was observed in VxW1-xSe2 at a critical carrier concentration of 9.5 x 10^20 cm-3 from 150 K to 1.8 K. The unsaturated magnetoresistance is almost linearly proportional to the magnetic field. Using the ansatz R(H,T) - R(0,0) ~ [(alpha.k.T)^2+(gamma.mu.B)^2]^1/2, the gamma/alpha ratio is estimated approximately to 4, distinct from that for the quasiparticles of LSCO, BaFe2(As1-xPx)2 (gamma/alpha=1) and bosons of YBCO (gamma/alpha=2). Our observation opens up the possible routes that induce strong correlation and superconductivity in two-dimensional materials with strong spin-orbit coupling. △ Less

Submitted 8 September, 2022; originally announced September 2022.

Comments: 8 pages, 4 figures + Supplemental Material

arXiv:2208.03403 [pdf, other]

Slice-level Detection of Intracranial Hemorrhage on CT Using Deep Descriptors of Adjacent Slices

Authors: Dat T. Ngo, Thao T. B. Nguyen, Hieu T. Nguyen, Dung B. Nguyen, Ha Q. Nguyen, Hieu H. Pham

Abstract: The rapid development in representation learning techniques such as deep neural networks and the availability of large-scale, well-annotated medical imaging datasets have to a rapid increase in the use of supervised machine learning in the 3D medical image analysis and diagnosis. In particular, deep convolutional neural networks (D-CNNs) have been key players and were adopted by the medical imagin… ▽ More The rapid development in representation learning techniques such as deep neural networks and the availability of large-scale, well-annotated medical imaging datasets have to a rapid increase in the use of supervised machine learning in the 3D medical image analysis and diagnosis. In particular, deep convolutional neural networks (D-CNNs) have been key players and were adopted by the medical imaging community to assist clinicians and medical experts in disease diagnosis and treatment. However, training and inferencing deep neural networks such as D-CNN on high-resolution 3D volumes of Computed Tomography (CT) scans for diagnostic tasks pose formidable computational challenges. This challenge raises the need of develo** deep learning-based approaches that are robust in learning representations in 2D images, instead 3D scans. In this work, we propose for the first time a new strategy to train \emph{slice-level} classifiers on CT scans based on the descriptors of the adjacent slices along the axis. In particular, each of which is extracted through a convolutional neural network (CNN). This method is applicable to CT datasets with per-slice labels such as the RSNA Intracranial Hemorrhage (ICH) dataset, which aims to predict the presence of ICH and classify it into 5 different sub-types. We obtain a single model in the top 4% best-performing solutions of the RSNA ICH challenge, where model ensembles are allowed. Experiments also show that the proposed method significantly outperforms the baseline model on CQ500. The proposed method is general and can be applied to other 3D medical diagnosis tasks such as MRI imaging. To encourage new advances in the field, we will make our codes and pre-trained model available upon acceptance of the paper. △ Less

Submitted 17 April, 2023; v1 submitted 5 August, 2022; originally announced August 2022.

Comments: Accepted for presentation at the 22nd IEEE Statistical Signal Processing (SSP) workshop

arXiv:2207.10859 [pdf, other]

Geodesic-Former: a Geodesic-Guided Few-shot 3D Point Cloud Instance Segmenter

Authors: Tuan Ngo, Khoi Nguyen

Abstract: This paper introduces a new problem in 3D point cloud: few-shot instance segmentation. Given a few annotated point clouds exemplified a target class, our goal is to segment all instances of this target class in a query point cloud. This problem has a wide range of practical applications where point-wise instance segmentation annotation is prohibitively expensive to collect. To address this problem… ▽ More This paper introduces a new problem in 3D point cloud: few-shot instance segmentation. Given a few annotated point clouds exemplified a target class, our goal is to segment all instances of this target class in a query point cloud. This problem has a wide range of practical applications where point-wise instance segmentation annotation is prohibitively expensive to collect. To address this problem, we present Geodesic-Former -- the first geodesic-guided transformer for 3D point cloud instance segmentation. The key idea is to leverage the geodesic distance to tackle the density imbalance of LiDAR 3D point clouds. The LiDAR 3D point clouds are dense near the object surface and sparse or empty elsewhere making the Euclidean distance less effective to distinguish different objects. The geodesic distance, on the other hand, is more suitable since it encodes the scene's geometry which can be used as a guiding signal for the attention mechanism in a transformer decoder to generate kernels representing distinct features of instances. These kernels are then used in a dynamic convolution to obtain the final instance masks. To evaluate Geodesic-Former on the new task, we propose new splits of the two common 3D point cloud instance segmentation datasets: ScannetV2 and S3DIS. Geodesic-Former consistently outperforms strong baselines adapted from state-of-the-art 3D point cloud instance segmentation approaches with a significant margin. Code is available at https://github.com/VinAIResearch/GeoFormer. △ Less

Submitted 6 August, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

Comments: Accepted to ECCV 2022

arXiv:2207.08221 [pdf, ps, other]

Expanders on matrices over a finite chain ring, I

Authors: Dung M. Ha, Hieu T. Ngo

Abstract: In this work and its sequel, we study the expanding phenomenon of matrices over a finite chain ring of large residue field. A sum-product estimate is proved. It is showed that $x+yz$ is a moderate expander on $n\times n$ matrices with exponent $\frac{n+1}{6}$. These results generalise the main theorems in a recent work of Xie and Ge. The proofs use spectral graph theory and elementary divisor theo… ▽ More In this work and its sequel, we study the expanding phenomenon of matrices over a finite chain ring of large residue field. A sum-product estimate is proved. It is showed that $x+yz$ is a moderate expander on $n\times n$ matrices with exponent $\frac{n+1}{6}$. These results generalise the main theorems in a recent work of Xie and Ge. The proofs use spectral graph theory and elementary divisor theory. △ Less

Submitted 17 July, 2022; originally announced July 2022.

Comments: 20 pages

MSC Class: 11T30; 05C50

arXiv:2206.02992 [pdf, other]

SMT-Based Model Checking of Industrial Simulink Models

Authors: Daisuke Ishii, Takashi Tomita, Toshiaki Aoki, The Quyen Ngo, Thi Bich Ngoc Do, Hideaki Takai

Abstract: The development of embedded systems requires formal analysis of models such as those described with MATLAB/Simulink. However, the increasing complexity of industrial models makes analysis difficult. This paper proposes a model checking method for Simulink models using SMT solvers. The proposed method aims at (1) automated, efficient and comprehensible verification of complex models, (2) numericall… ▽ More The development of embedded systems requires formal analysis of models such as those described with MATLAB/Simulink. However, the increasing complexity of industrial models makes analysis difficult. This paper proposes a model checking method for Simulink models using SMT solvers. The proposed method aims at (1) automated, efficient and comprehensible verification of complex models, (2) numerically accurate analysis of models, and (3) demonstrating the analysis of Simulink models using an SMT solver (we use Z3). It first encodes a target model into a predicate logic formula in the domain of mathematical arithmetic and bit vectors. We explore how to encode various Simulink blocks exactly. Then, the method verifies a given invariance property using the k-induction-based algorithm that extracts a subsystem involving the target block and unrolls the execution paths incrementally. In the experiment, we applied the proposed method and other tools to a set of models and properties. Our method successfully verified most of the properties including those unverified with other tools. △ Less

Submitted 6 June, 2022; originally announced June 2022.

Comments: 16 pages, 5 figures, 1 table, submitted to ICFEM 2022

arXiv:2206.01964 [pdf, ps, other]

Indecomposable characters on direct limit of symmetric groups with diagonal embeddings

Authors: N. Nessonov, N. T. S. Ngo

Abstract: In this paper we obtain the complete description of all indecomposable characters (central positive-definite functions) of inductive limits of the symmetric groups under block diagonal embedding. As a corollary we obtain the full classification of the isomorphism classes of these inductive limits. In this paper we obtain the complete description of all indecomposable characters (central positive-definite functions) of inductive limits of the symmetric groups under block diagonal embedding. As a corollary we obtain the full classification of the isomorphism classes of these inductive limits. △ Less

Submitted 4 June, 2022; originally announced June 2022.

Comments: 34 pages

MSC Class: 20C32 Representations of infinite symmetric groups

arXiv:2203.05281 [pdf, other]

Multi-Agent Task Assignment in Vehicular Edge Computing: A Regret-Matching Learning-Based Approach

Authors: Bach Long Nguyen, Duong D. Nguyen, Hung X. Nguyen, Duy T. Ngo, Markus Wagner

Abstract: Vehicular edge computing has recently been proposed to support computation-intensive applications in Intelligent Transportation Systems (ITS) such as self-driving cars and augmented reality. Despite progress in this area, significant challenges remain to efficiently allocate limited computation resources to a range of time-critical ITS tasks. To this end, the current paper develops a new task assi… ▽ More Vehicular edge computing has recently been proposed to support computation-intensive applications in Intelligent Transportation Systems (ITS) such as self-driving cars and augmented reality. Despite progress in this area, significant challenges remain to efficiently allocate limited computation resources to a range of time-critical ITS tasks. To this end, the current paper develops a new task assignment scheme for vehicles in a highway. Because of the high speed of vehicles and the limited communication range of road side units (RSUs), the computation tasks of participating vehicles are to be dynamically migrated across multiple servers. We formulate a binary nonlinear programming (BNLP) problem of assigning computation tasks from vehicles to RSUs and a macrocell base station. To deal with the potentially large size of the formulated optimization problem, we develop a distributed multi-agent regret-matching learning algorithm. Based on the regret minimization principle, the proposed algorithm employs a forgetting method that allows the learning process to quickly adapt to and effectively handle the high mobility feature of vehicle networks. We theoretically prove that it converges to the correlated equilibrium solutions of the considered BNLP problem. Simulation results with practical parameter settings show that the proposed algorithm offers the lowest total delay and cost of processing tasks, as well as utility fairness among agents. Importantly, our algorithm converges much faster than existing methods as the problem size grows, demonstrating its clear advantage in large-scale vehicular networks. △ Less

Submitted 16 December, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

Comments: 10 pages, 12 figures, and 1 table

arXiv:2203.05074 [pdf, other]

The Transitive Information Theory and its Application to Deep Generative Models

Authors: Trung Ngo, Najwa Laabid, Ville Hautamäki, Merja Heinäniemi

Abstract: Paradoxically, a Variational Autoencoder (VAE) could be pushed in two opposite directions, utilizing powerful decoder model for generating realistic images but collapsing the learned representation, or increasing regularization coefficient for disentangling representation but ultimately generating blurry examples. Existing methods narrow the issues to the rate-distortion trade-off between compress… ▽ More Paradoxically, a Variational Autoencoder (VAE) could be pushed in two opposite directions, utilizing powerful decoder model for generating realistic images but collapsing the learned representation, or increasing regularization coefficient for disentangling representation but ultimately generating blurry examples. Existing methods narrow the issues to the rate-distortion trade-off between compression and reconstruction. We argue that a good reconstruction model does learn high capacity latents that encode more details, however, its use is hindered by two major issues: the prior is random noise which is completely detached from the posterior and allow no controllability in the generation; mean-field variational inference doesn't enforce hierarchy structure which makes the task of recombining those units into plausible novel output infeasible. As a result, we develop a system that learns a hierarchy of disentangled representation together with a mechanism for recombining the learned representation for generalization. This is achieved by introducing a minimal amount of inductive bias to learn controllable prior for the VAE. The idea is supported by here developed transitive information theory, that is, the mutual information between two target variables could alternately be maximized through the mutual information to the third variable, thus bypassing the rate-distortion bottleneck in VAE design. In particular, we show that our model, named SemafoVAE (inspired by the similar concept in computer science), could generate high-quality examples in a controllable manner, perform smooth traversals of the disentangled factors and intervention at a different level of representation hierarchy. △ Less

Submitted 28 March, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

arXiv:2202.08316 [pdf, other]

FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction

Authors: Minh Van Nguyen, Nghia Trung Ngo, Bonan Min, Thien Huu Nguyen

Abstract: This paper presents FAMIE, a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction. FAMIE is designed to address a fundamental problem in existing AL frameworks where annotators need to wait for a long time between annotation batches due to the time-consuming nature of model training and data selection at each AL iteration. This hinders the engagement, pr… ▽ More This paper presents FAMIE, a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction. FAMIE is designed to address a fundamental problem in existing AL frameworks where annotators need to wait for a long time between annotation batches due to the time-consuming nature of model training and data selection at each AL iteration. This hinders the engagement, productivity, and efficiency of annotators. Based on the idea of using a small proxy network for fast data selection, we introduce a novel knowledge distillation mechanism to synchronize the proxy network with the main large model (i.e., BERT-based) to ensure the appropriateness of the selected annotation examples for the main model. Our AL framework can support multiple languages. The experiments demonstrate the advantages of FAMIE in terms of competitive performance and time efficiency for sequence labeling with AL. We publicly release our code (\url{https://github.com/nlp-uoregon/famie}) and demo website (\url{http://nlp.uoregon.edu:9000/}). A demo video for FAMIE is provided at: \url{https://youtu.be/I2i8n_jAyrY}. △ Less

Submitted 4 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

Comments: Accepted to NAACL 2022 (System Demonstrations)

arXiv:2112.11723 [pdf, other]

Energy-Efficient Massive MIMO for Federated Learning: Transmission Designs and Resource Allocations

Authors: Tung T. Vu, Hien Q. Ngo, Minh N. Dao, Duy T. Ngo, Erik G. Larsson, Tho Le-Ngoc

Abstract: This work proposes novel synchronous, asynchronous, and session-based designs for energy-efficient massive multiple-input multiple-output networks to support federated learning (FL). The synchronous design relies on strict synchronization among users when executing each FL communication round, while the asynchronous design allows more flexibility for users to save energy by using lower computing f… ▽ More This work proposes novel synchronous, asynchronous, and session-based designs for energy-efficient massive multiple-input multiple-output networks to support federated learning (FL). The synchronous design relies on strict synchronization among users when executing each FL communication round, while the asynchronous design allows more flexibility for users to save energy by using lower computing frequencies. The session-based design splits the downlink and uplink phases in each FL communication round into separate sessions. In this design, we assign users such that one of the participating users in each session finishes its transmission and does not join the next session. As such, more power and degrees of freedom will be allocated to unfinished users, leading to higher rates, lower transmission times, and hence, a higher energy efficiency. In all three designs, we use zero-forcing processing for both uplink and downlink, and develop algorithms that optimize user assignment, time allocation, power, and computing frequencies to minimize the energy consumption at the base station and users, while guaranteeing a predefined maximum execution time of one FL communication round. △ Less

Submitted 15 November, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

Comments: accepted to appear

arXiv:2111.08754 [pdf, ps, other]

$\mathrm{GL}_n$-structure and principal $\mathfrak{sl}_2$-triple on the cohomology ring of complex Grassmannian

Authors: Nhok Tkhai Shon Ngo

Abstract: In this note we describe the cohomology ring of the Grassmannian of $k$-planes in $n$-dimensional complex vector space as an $\mathrm{GL}_n$-module. We give explicit formulas for the operators of its principal $\mathfrak{sl}_2$-triple. It is proved that one of these operators corresponds to the shifted cohomology degree operator and the second operator coincides with the Lefschetz map on cohomolog… ▽ More In this note we describe the cohomology ring of the Grassmannian of $k$-planes in $n$-dimensional complex vector space as an $\mathrm{GL}_n$-module. We give explicit formulas for the operators of its principal $\mathfrak{sl}_2$-triple. It is proved that one of these operators corresponds to the shifted cohomology degree operator and the second operator coincides with the Lefschetz map on cohomology (as in the hard Lefschetz theorem). We check that the cohomology ring of the complex Grassmannian as a $\mathrm{GL}_n$-representation is isomorphic to the $k$-th exterior power of the standard $n$-dimensional representation. △ Less

Submitted 16 November, 2021; originally announced November 2021.

arXiv:2110.12133 [pdf]

Distributed Dynamic State Estimation for Microgrids

Authors: Bang L. H. Nguyen, Tuyen V. Vu, Thomas H. Ortmeyer, Tuan Ngo

Abstract: Conventionally, the dynamic state estimation of variables in power networks is performed based on the forecasting-aided model of bus voltages. This approach is effective in the stiff grids at the transmission level, where the bus voltages are less sensitive to variations of the load. However, in microgrids, bus voltages can fluctuate significantly under load changes, the forecasting-aided model ma… ▽ More Conventionally, the dynamic state estimation of variables in power networks is performed based on the forecasting-aided model of bus voltages. This approach is effective in the stiff grids at the transmission level, where the bus voltages are less sensitive to variations of the load. However, in microgrids, bus voltages can fluctuate significantly under load changes, the forecasting-aided model may not sufficiently accurate. To resolve this problem, this paper proposes a dynamic state estimation scheme for microgrids using the state-space model derived from differential equations of power networks. In the proposed scheme, the branch currents are the state variables, whereas the bus voltages become the inputs which can vary freely with loads. As a result, the entire microgrids system can be partitioned into local areas, where neighbor areas share the common inputs. The proposed estimation scheme then can be implemented in a distributed manner. A novel Kalman-based filtering method is derived to estimate both states and inputs simultaneously. Only information of common inputs is exchanged between neighboring estimators. Simulation results of the 13-bus Potsdam microgrid (New York State) are provided to prove the feasibility and performances of the proposed scheme. △ Less

Submitted 23 October, 2021; originally announced October 2021.

Comments: 5 pages, 9 figures

Journal ref: PESGM 2020

arXiv:2108.13512 [pdf, ps, other]

Energy-Efficient Massive MIMO for Serving Multiple Federated Learning Groups

Authors: Tung T. Vu, Hien Quoc Ngo, Duy T. Ngo, Minh N Dao, Erik G. Larsson

Abstract: With its privacy preservation and communication efficiency, federated learning (FL) has emerged as a learning framework that suits beyond 5G and towards 6G systems. This work looks into a future scenario in which there are multiple groups with different learning purposes and participating in different FL processes. We give energy-efficient solutions to demonstrate that this scenario can be realist… ▽ More With its privacy preservation and communication efficiency, federated learning (FL) has emerged as a learning framework that suits beyond 5G and towards 6G systems. This work looks into a future scenario in which there are multiple groups with different learning purposes and participating in different FL processes. We give energy-efficient solutions to demonstrate that this scenario can be realistic. First, to ensure a stable operation of multiple FL processes over wireless channels, we propose to use a massive multiple-input multiple-output network to support the local and global FL training updates, and let the iterations of these FL processes be executed within the same large-scale coherence time. Then, we develop asynchronous and synchronous transmission protocols where these iterations are asynchronously and synchronously executed, respectively, using the downlink unicasting and conventional uplink transmission schemes. Zero-forcing processing is utilized for both uplink and downlink transmissions. Finally, we propose an algorithm that optimally allocates power and computation resources to save energy at both base station and user sides, while guaranteeing a given maximum execution time threshold of each FL iteration. Compared to the baseline schemes, the proposed algorithm significantly reduces the energy consumption, especially when the number of base station antennas is large. △ Less

Submitted 17 October, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

Comments: Accepted to appear in Proc. IEEE Global Communications Conference (GLOBECOM), Madrid, Spain, Dec. 2021. (v2). arXiv admin note: text overlap with arXiv:2107.09577

arXiv:2108.03408 [pdf, ps, other]

doi 10.1103/PhysRevA.105.012606

Enhanced nonlinear quantum metrology with weakly coupled solitons and particle losses

Authors: Alexander Alodjants, Dmitriy Tsarev, The Vinh Ngo, Ray-Kuang Lee

Abstract: The estimation of physical parameters with Heisenberg sensitivity and beyond is one of the crucial problems for current quantum metrology. However, unavoidable lossy effect is commonly believed to be the main obstacle when applying fragile quantum states. To utilize the lossy quantum metrology, we offer an interferometric procedure for phase parameters estimation at the Heisenberg (up to 1/N) and… ▽ More The estimation of physical parameters with Heisenberg sensitivity and beyond is one of the crucial problems for current quantum metrology. However, unavoidable lossy effect is commonly believed to be the main obstacle when applying fragile quantum states. To utilize the lossy quantum metrology, we offer an interferometric procedure for phase parameters estimation at the Heisenberg (up to 1/N) and super-Heisenberg (up to 1/N^3) scaling levels in the framework of the linear and nonlinear metrology approaches, respectively. The heart of our setup is the novel soliton Josephson Junction (SJJ) system providing the formation of the quantum probe, i.e, the entangled Fock (N00N-like) state, beyond the superfluid-Mott insulator quantum phase transition point. We illustrate that such states are close to the optimal ones even with moderate losses. The enhancement of phase estimation accuracy remains feasible both for the linear and nonlinear metrologies with the SJJs, and allows further improvement for the current experiments performed with atomic condensate solitons with a mesoscopic number of particles. △ Less

Submitted 7 August, 2021; originally announced August 2021.

arXiv:2108.01306 [pdf]

Distributed Dynamic State-Input Estimation for Power Networks of Microgrids and Active Distribution Systems with Unknown Inputs

Authors: Bang L. H. Nguyen, Tuyen V. Vu, Joseph M. Guerrero, Mischael Steurer, Karl Schoder, Tuan Ngo

Abstract: This paper proposes a joint input and state dynamic estimation scheme for power networks in microgrids and active distribution systems with unknown inputs. The conventional dynamic state estimation of power networks in the transmission system relies on the forecasting methods to obtain the state-transition model of state variables. However, under highly dynamic conditions in the operation of micro… ▽ More This paper proposes a joint input and state dynamic estimation scheme for power networks in microgrids and active distribution systems with unknown inputs. The conventional dynamic state estimation of power networks in the transmission system relies on the forecasting methods to obtain the state-transition model of state variables. However, under highly dynamic conditions in the operation of microgrids and active distribution networks, this approach may become ineffective as the forecasting accuracy is not guaranteed. To overcome such drawbacks, this paper employs the power networks model derived from the physical equations of branch currents. Specifically, the power network model is a linear state-space model, in which the state vector consists of branch currents, and the input vector consists of bus voltages. To estimate both state and input variables, we propose linear Kalman-based dynamic filtering algorithms in batch-mode regression form, considering the cross-correlation between states and inputs. For the scalability of the proposed scheme, the distributed implementation is also presented. Complementarily, the predicted state and input vectors are leveraged for bad data detection. Results carried out on a 13-bus microgrid system in real-time Opal-RT platform demonstrate the effectiveness of the proposed method in comparison with the traditional weighted least square and tracking state estimation methods. △ Less

Submitted 3 August, 2021; originally announced August 2021.

arXiv:2107.13301 [pdf, ps, other]

On roots of quadratic congruences

Authors: Hieu T. Ngo

Abstract: The equidistribution of roots of quadratic congruences with prime moduli depends crucially upon effective bounds for a special Weyl linear form. Duke, Friedlander and Iwaniec discovered a strong estimate for this Weyl linear form when the quadratic polynomial has negative discriminant. Tóth established an analogous but weaker bound when the quadratic polynomial has positive discriminant. We obtain… ▽ More The equidistribution of roots of quadratic congruences with prime moduli depends crucially upon effective bounds for a special Weyl linear form. Duke, Friedlander and Iwaniec discovered a strong estimate for this Weyl linear form when the quadratic polynomial has negative discriminant. Tóth established an analogous but weaker bound when the quadratic polynomial has positive discriminant. We obtain a stronger estimate for the Weyl linear form for quadratics of positive discriminants. △ Less

Submitted 28 July, 2021; originally announced July 2021.

Showing 1–50 of 141 results for author: Ngo, T