-
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Authors:
Zuxin Liu,
Thai Hoang,
Jianguo Zhang,
Ming Zhu,
Tian Lan,
Shirley Kokane,
Juntao Tan,
Weiran Yao,
Zhiwei Liu,
Yihao Feng,
Rithesh Murthy,
Liangwei Yang,
Silvio Savarese,
Juan Carlos Niebles,
Huan Wang,
Shelby Heinecke,
Caiming Xiong
Abstract:
The advancement of function-calling agent models requires diverse, reliable, and high-quality datasets. This paper presents APIGen, an automated data generation pipeline designed to synthesize verifiable high-quality datasets for function-calling applications. We leverage APIGen and collect 3,673 executable APIs across 21 different categories to generate diverse function-calling datasets in a scalable and structured manner. Each data in our dataset is verified through three hierarchical stages: format checking, actual function executions, and semantic verification, ensuring its reliability and correctness. We demonstrate that models trained with our curated datasets, even with only 7B parameters, can achieve state-of-the-art performance on the Berkeley Function-Calling Benchmark, outperforming multiple GPT-4 models. Moreover, our 1B model achieves exceptional performance, surpassing GPT-3.5-Turbo and Claude-3 Haiku. We release a dataset containing 60,000 high-quality entries, aiming to advance the field of function-calling agent domains. The dataset is available on Huggingface: https://huggingface.co/datasets/Salesforce/xlam-function-calling-60k and the project homepage: https://apigen-pipeline.github.io/
Submitted 26 June, 2024;
originally announced June 2024.
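The three hierarchical verification stages named in the APIGen abstract (format checking, actual function execution, semantic verification) can be sketched as a chain of filters. This is a minimal illustration, not the authors' implementation: `API_REGISTRY`, the JSON call layout with `name` and `arguments` keys, and the `judge` callback are all hypothetical stand-ins introduced here.

```python
import json
from typing import Any, Callable, Dict, Optional, Tuple

# Hypothetical registry of executable APIs (the abstract does not list any).
API_REGISTRY: Dict[str, Callable[..., Any]] = {
    "add": lambda a, b: a + b,
}


def format_check(raw: str) -> Optional[dict]:
    """Stage 1: accept only valid JSON naming a known API with dict arguments."""
    try:
        call = json.loads(raw)
    except json.JSONDecodeError:
        return None
    if not isinstance(call, dict):
        return None
    name = call.get("name")
    if not isinstance(name, str) or name not in API_REGISTRY:
        return None
    if not isinstance(call.get("arguments"), dict):
        return None
    return call


def execution_check(call: dict) -> Tuple[bool, Any]:
    """Stage 2: actually run the call; any exception rejects the sample."""
    try:
        return True, API_REGISTRY[call["name"]](**call["arguments"])
    except Exception:
        return False, None


def verify(query: str, raw: str, judge: Callable[[str, Any], bool]) -> str:
    """Run the stages in order and report where a sample is rejected."""
    call = format_check(raw)
    if call is None:
        return "rejected:format"
    ok, result = execution_check(call)
    if not ok:
        return "rejected:execution"
    # Stage 3: semantic check. `judge` is an assumed callback that decides
    # whether the execution result actually answers the query.
    if not judge(query, result):
        return "rejected:semantic"
    return "accepted"
```

Ordering the stages from cheapest to most expensive means malformed samples are discarded before any function is executed or any semantic judgment is requested.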
-
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Authors:
Terry Yue Zhuo,
Minh Chien Vu,
Jenny Chim,
Han Hu,
Wenhao Yu,
Ratnadira Widyasari,
Imam Nur Bani Yusuf,
Haolan Zhan,
Junda He,
Indraneil Paul,
Simon Brunner,
Chen Gong,
Thong Hoang,
Armel Randy Zebaze,
Xiaoheng Hong,
Wen-Ding Li,
Jean Kaddour,
Ming Xu,
Zhihan Zhang,
Prateek Yadav,
Naman Jain,
Alex Gu,
Zhoujun Cheng,
Jiawei Liu,
Qian Liu
, et al. (8 additional authors not shown)
Abstract:
Automated software engineering has been greatly empowered by the recent advances in Large Language Models (LLMs) for programming. While current benchmarks have shown that LLMs can perform various software engineering tasks like human developers, the majority of their evaluations are limited to short and self-contained algorithmic tasks. Solving challenging and practical programming tasks requires the capability of utilizing diverse function calls as tools to efficiently implement functionalities like data analysis and web development. In addition, using multiple tools to solve a task needs compositional reasoning by accurately understanding complex instructions. Fulfilling both of these characteristics can pose a great challenge for LLMs. To assess how well LLMs can solve challenging and practical programming tasks, we introduce BigCodeBench, a benchmark that challenges LLMs to invoke multiple function calls as tools from 139 libraries and 7 domains for 1,140 fine-grained programming tasks. To evaluate LLMs rigorously, each programming task encompasses 5.6 test cases with an average branch coverage of 99%. In addition, we propose a natural-language-oriented variant of BigCodeBench, BigCodeBench-Instruct, that automatically transforms the original docstrings into short instructions only with essential information. Our extensive evaluation of 60 LLMs shows that LLMs are not yet capable of following complex instructions to use function calls precisely, with scores up to 60%, significantly lower than the human performance of 97%. The results underscore the need for further advancements in this area.
Submitted 26 June, 2024; v1 submitted 22 June, 2024;
originally announced June 2024.
-
Data Augmentation for Multivariate Time Series Classification: An Experimental Study
Authors:
Romain Ilbert,
Thai V. Hoang,
Zonghua Zhang
Abstract:
Our study investigates the impact of data augmentation on the performance of multivariate time series models, focusing on datasets from the UCR archive. Despite the limited size of these datasets, we achieved classification accuracy improvements in 10 out of 13 datasets using the Rocket and InceptionTime models. This highlights the essential role of sufficient data in training effective models, paralleling the advancements seen in computer vision. Our work delves into adapting and applying existing methods in innovative ways to the domain of multivariate time series classification. Our comprehensive exploration of these techniques sets a new standard for addressing data scarcity in time series analysis, emphasizing that diverse augmentation strategies are crucial for unlocking the potential of both traditional and deep learning models. Moreover, by meticulously analyzing and applying a variety of augmentation techniques, we demonstrate that strategic data enrichment can enhance model accuracy. This not only establishes a benchmark for future research in time series analysis but also underscores the importance of adopting varied augmentation approaches to improve model performance in the face of limited data availability.
Submitted 10 June, 2024;
originally announced June 2024.
-
The LiteBIRD mission to explore cosmic inflation
Authors:
T. Ghigna,
A. Adler,
K. Aizawa,
H. Akamatsu,
R. Akizawa,
E. Allys,
A. Anand,
J. Aumont,
J. Austermann,
S. Azzoni,
C. Baccigalupi,
M. Ballardini,
A. J. Banday,
R. B. Barreiro,
N. Bartolo,
S. Basak,
A. Basyrov,
S. Beckman,
M. Bersanelli,
M. Bortolami,
F. Bouchet,
T. Brinckmann,
P. Campeti,
E. Carinos,
A. Carones
, et al. (134 additional authors not shown)
Abstract:
LiteBIRD, the next-generation cosmic microwave background (CMB) experiment, aims for a launch in Japan's fiscal year 2032, marking a major advancement in the exploration of primordial cosmology and fundamental physics. Orbiting the Sun-Earth Lagrangian point L2, this JAXA-led strategic L-class mission will conduct a comprehensive mapping of the CMB polarization across the entire sky. During its 3-year mission, LiteBIRD will employ three telescopes within 15 unique frequency bands (ranging from 34 through 448 GHz), targeting a sensitivity of 2.2\,$μ$K-arcmin and a resolution of 0.5$^\circ$ at 100\,GHz. Its primary goal is to measure the tensor-to-scalar ratio $r$ with an uncertainty $δr = 0.001$, including systematic errors and margin. If $r \geq 0.01$, LiteBIRD expects to achieve a $>5σ$ detection in the $\ell=$2-10 and $\ell=$11-200 ranges separately, providing crucial insight into the early Universe. We describe LiteBIRD's scientific objectives, the application of systems engineering to mission requirements, the anticipated scientific impact, and the operations and scanning strategies vital to minimizing systematic effects. We will also highlight LiteBIRD's synergies with concurrent CMB projects.
Submitted 4 June, 2024;
originally announced June 2024.
-
Learning to Discretize Denoising Diffusion ODEs
Authors:
Vinh Tong,
Anji Liu,
Trung-Dung Hoang,
Guy Van den Broeck,
Mathias Niepert
Abstract:
Diffusion Probabilistic Models (DPMs) are powerful generative models showing competitive performance in various domains, including image synthesis and 3D point cloud generation. However, sampling from pre-trained DPMs involves multiple neural function evaluations (NFE) to transform Gaussian noise samples into images, resulting in higher computational costs compared to single-step generative models such as GANs or VAEs. Therefore, a crucial problem is to reduce NFE while preserving generation quality. To this end, we propose LD3, a lightweight framework for learning time discretization while sampling from the diffusion ODE encapsulated by DPMs. LD3 can be combined with various diffusion ODE solvers and consistently improves performance without retraining resource-intensive neural networks. We demonstrate analytically and empirically that LD3 enhances sampling efficiency compared to distillation-based methods, without the extensive computational overhead. We evaluate our method with extensive experiments on 5 datasets, covering unconditional and conditional sampling in both pixel-space and latent-space DPMs. For example, in about 5 minutes of training on a single GPU, our method reduces the FID score from 6.63 to 2.68 on CIFAR10 (7 NFE), and in around 20 minutes, decreases the FID from 8.51 to 5.03 on class-conditional ImageNet-256 (5 NFE). LD3 complements distillation methods, offering a more efficient approach to sampling from pre-trained diffusion models.
Submitted 24 May, 2024;
originally announced May 2024.
-
Sparse Attention-driven Quality Prediction for Production Process Optimization in Digital Twins
Authors:
Yanlei Yin,
Lihua Wang,
Wenbo Wang,
Dinh Thai Hoang
Abstract:
In the process industry, optimizing production lines for long-term efficiency requires real-time monitoring and analysis of operation states to fine-tune production line parameters. However, the complexity in operational logic and the intricate coupling of production process parameters make it difficult to develop an accurate mathematical model for the entire process, thus hindering the deployment of efficient optimization mechanisms. In view of these difficulties, we propose to deploy a digital twin of the production line by digitally abstracting its physical layout and operational logic. By iteratively mapping the real-world data reflecting equipment operation status and product quality inspection in the digital twin, we adopt a quality prediction model for production process based on self-attention-enabled temporal convolutional neural networks. This model enables the data-driven state evolution of the digital twin. The digital twin takes a role of aggregating the information of actual operating conditions and the results of quality-sensitive analysis, which facilitates the optimization of process production quality with virtual-reality evolution under multi-dimensional constraints. Leveraging the digital twin model as an information-flow carrier, we extract temporal features from key process indicators and establish a production process quality prediction model based on the proposed composite neural network. Our operation experiments on a specific tobacco shredding line demonstrate that the proposed digital twin-based production process optimization method fosters seamless integration between virtual and real production lines. This integration achieves an average operating status prediction accuracy of over 98% and near-optimal production process control.
Submitted 20 May, 2024;
originally announced May 2024.
-
Effect of helium bubbles on the mobility of edge dislocations in copper
Authors:
Minh Tam Hoang,
Nithin Mathew,
Daniel N. Blaschke,
Saryu Fensin
Abstract:
Helium bubbles can form in materials upon exposure to irradiation. It is well known that the presence of helium bubbles can cause changes in the mechanical behavior of materials. To improve the lifetime of nuclear components, it is important to understand deformation mechanisms in helium-containing materials. In this work, we investigate the interactions between edge dislocations and helium bubbles in copper using molecular dynamics (MD) simulations. We focus on the effect of helium bubble pressure (equivalently, the helium-to-vacancy ratio) on the obstacle strength of helium bubbles and their interaction with dislocations. Our simulations predict significant differences in the interaction mechanisms as a function of helium bubble pressure. Specifically, bubbles with high internal pressure are found to exhibit weaker obstacle strength as compared to low-pressure bubbles of the same size due to the formation of super-jogs in the dislocation. Activation energies and rate constants extracted from the MD data confirm this transition in mechanism and enable upscaling of these phenomena to high length-scale models.
Submitted 14 May, 2024;
originally announced May 2024.
-
Offline Model-Based Optimization via Policy-Guided Gradient Search
Authors:
Yassine Chemingui,
Aryan Deshwal,
Trong Nghia Hoang,
Janardhan Rao Doppa
Abstract:
Offline optimization is an emerging problem in many experimental engineering domains including protein, drug or aircraft design, where online experimentation to collect evaluation data is too expensive or dangerous. To avoid that, one has to optimize an unknown function given only its offline evaluation at a fixed set of inputs. A naive solution to this problem is to learn a surrogate model of the unknown function and optimize this surrogate instead. However, such a naive optimizer is prone to erroneous overestimation of the surrogate (possibly due to over-fitting on a biased sample of function evaluation) on inputs outside the offline dataset. Prior approaches addressing this challenge have primarily focused on learning robust surrogate models. However, their search strategies are derived from the surrogate model rather than the actual offline data. To fill this important gap, we introduce a new learning-to-search perspective for offline optimization by reformulating it as an offline reinforcement learning problem. Our proposed policy-guided gradient search approach explicitly learns the best policy for a given surrogate model created from the offline data. Our empirical results on multiple benchmarks demonstrate that the learned optimization policy can be combined with existing offline surrogates to significantly improve the optimization performance.
Submitted 8 May, 2024;
originally announced May 2024.
-
Collective nature of high-Q resonances in finite-size photonic metastructures
Authors:
Thanh Xuan Hoang,
Daniel Leykam,
Hong-Son Chu,
Ching Eng Png,
Francisco J. García-Vidal,
Yuri S. Kivshar
Abstract:
We study high quality-factor (high Q) resonances supported by periodic arrays of Mie resonators from the perspectives of both Bloch wave theory and multiple scattering theory. We reveal that, unlike a common belief, the bound states in the continuum (BICs) derived by the Bloch-wave theory do not directly determine the resonance with the highest Q value in large but finite arrays. Higher Q factors appear to be associated with collective resonances formed by nominally guided modes below the light line associated with strong effect of both electric and magnetic multipoles. Our findings offer valuable insights into accessing the modes with higher Q resonances via bonding modes within finite metastructures. Our results underpin the pivotal significance of magnetic and electric multipoles in the design of resonant metadevices and nonlocal flat-band optics. Moreover, our demonstrations reveal that coupled arrays of high-Q microcavities do not inherently result in a stronger light-matter interaction when compared to coupled low-Q nanoresonators. This result emphasizes the critical importance of the study of multiple light-scattering effects in cavity-based systems.
Submitted 2 May, 2024;
originally announced May 2024.
-
Rethinking Attention Gated with Hybrid Dual Pyramid Transformer-CNN for Generalized Segmentation in Medical Imaging
Authors:
Fares Bougourzi,
Fadi Dornaika,
Abdelmalik Taleb-Ahmed,
Vinh Truong Hoang
Abstract:
Inspired by their success in computer vision, Transformers have been widely investigated for medical image segmentation. However, most existing approaches use recent Transformer architectures either as the encoder or as a parallel encoder alongside a CNN encoder. In this paper, we introduce a novel hybrid CNN-Transformer segmentation architecture (PAG-TransYnet) designed for efficiently building a strong CNN-Transformer encoder. Our approach exploits attention gates within a Dual Pyramid hybrid encoder. The contributions of this methodology can be summarized in three key aspects: (i) the utilization of Pyramid input for highlighting the prominent features at different scales, (ii) the incorporation of a PVT transformer to capture long-range dependencies across various resolutions, and (iii) the implementation of a Dual-Attention Gate mechanism for effectively fusing prominent features from both CNN and Transformer branches. In a comprehensive evaluation across different segmentation tasks, including abdominal multi-organ segmentation, infection segmentation (Covid-19 and bone metastasis), and microscopic tissue segmentation (gland and nucleus), the proposed approach demonstrates state-of-the-art performance and exhibits remarkable generalization capabilities. This research represents a significant advancement towards addressing the pressing need for efficient and adaptable segmentation solutions in medical imaging applications.
Submitted 28 April, 2024;
originally announced April 2024.
-
Adapted Lie splitting method for convection-diffusion problems with singular convective term
Authors:
Thi Tam Dang,
Trung Hau Hoang,
Giandomenico Orlandi
Abstract:
Splitting methods are a widely used numerical scheme for solving convection-diffusion problems. However, they may lose stability in some situations, particularly when applied to convection-diffusion problems in the presence of an unbounded convective term. In this paper, we propose a new splitting method, called the "Adapted Lie splitting method", which successfully overcomes the observed instability in certain cases. Assuming that the unbounded coefficient belongs to a suitable Lorentz space, we show that the adapted Lie splitting converges to first-order under the analytic semigroup framework. Furthermore, we provide numerical experiments to illustrate our newly proposed splitting approach.
Submitted 24 April, 2024;
originally announced April 2024.
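For orientation, the classical Lie (Lie-Trotter) splitting that the abstract above takes as its starting point advances the convection and diffusion parts one after the other within each time step. The sketch below is the standard scheme on a simple model problem, not the paper's adapted method: it assumes a 1D periodic grid, a constant and bounded velocity `c > 0`, first-order upwind differences for convection, and explicit central differences for diffusion. All function and parameter names are illustrative.

```python
from typing import List


def convection_step(u: List[float], c: float, dx: float, dt: float) -> List[float]:
    """Sub-step for u_t + c u_x = 0 (first-order upwind, c > 0, periodic)."""
    # u[i - 1] wraps around to the last cell when i == 0 (periodic boundary).
    return [u[i] - c * dt / dx * (u[i] - u[i - 1]) for i in range(len(u))]


def diffusion_step(u: List[float], nu: float, dx: float, dt: float) -> List[float]:
    """Sub-step for u_t = nu u_xx (explicit central differences, periodic)."""
    n = len(u)
    r = nu * dt / dx ** 2
    return [u[i] + r * (u[(i + 1) % n] - 2.0 * u[i] + u[i - 1]) for i in range(n)]


def lie_splitting_step(u: List[float], c: float, nu: float,
                       dx: float, dt: float) -> List[float]:
    """One Lie splitting step: full convection step, then full diffusion step."""
    return diffusion_step(convection_step(u, c, dx, dt), nu, dx, dt)


def solve(u0: List[float], c: float, nu: float, dx: float,
          dt: float, steps: int) -> List[float]:
    """Apply `steps` Lie splitting steps to the initial profile u0."""
    u = list(u0)
    for _ in range(steps):
        u = lie_splitting_step(u, c, nu, dx, dt)
    return u
```

With these explicit sub-steps the time step has to satisfy `c * dt / dx <= 1` and `nu * dt / dx**2 <= 0.5`. The instability discussed in the abstract concerns unbounded convective coefficients, which this constant-coefficient sketch does not model.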
-
Interplay between magnetic and lattice excitations and emergent multiple phase transitions in MnPSe3-xSx
Authors:
Deepu Kumar,
Nguyen The Hoang,
Yumin Sim,
Youngsu Choi,
Kalaivanan Raju,
Rajesh Kumar Ulaganathan,
Raman Sankar,
Maeng-Je Seong,
Kwang-Yong Choi
Abstract:
The intricate interplay between spin and lattice degrees of freedom in two-dimensional magnetic materials plays a pivotal role in modifying their magnetic characteristics, engendering hybrid quasiparticles, and implementing functional devices. Herein, we present our comprehensive and in-depth investigations on magnetic and lattice excitations of MnPSe3-xSx (x = 0, 0.5, and 1.5) alloys, utilizing temperature- and polarization-dependent Raman scattering. Our experimental results reveal the occurrence of multiple phase transitions, evidenced by notable changes in phonon self-energy and the appearance or splitting of phonon modes. These emergent phases are tied to the development of long and short-range spin-spin correlations, as well as to spin reorientations or magnetic instabilities. Our analysis of two-magnon excitations as a function of temperature and composition showcases their hybridization with phonons whose degree weakens with increasing x. Moreover, the suppression of spin-dependent phonon intensity in chemically most-disordered MnPSe3-xSx (x = 1.5) suggests that chalcogen substitution offers a control knob of tuning spin and phonon dynamics by modulating concurrently superexchange pathways and a degree of trigonal distortions.
Submitted 17 April, 2024;
originally announced April 2024.
-
Complementarity-constrained predictive control for efficient gas-balanced hybrid power systems
Authors:
Kiet Tuan Hoang,
Brage Rugstad Knudsen,
Lars Struen Imsland
Abstract:
Controlling gas turbines (GTs) efficiently is vital as GTs are used to balance power in onshore/offshore hybrid power systems with variable renewable energy and energy storage. However, predictive control of GTs is non-trivial when formulated as a dynamic optimisation problem due to the semi-continuous operating regions of GTs, which must be included to ensure complete combustion and high fuel efficiency. This paper studies two approaches for handling the semi-continuous operating regions of GTs in hybrid power systems through predictive control, dynamic optimisation, and complementarity constraints. The proposed solutions are qualitatively investigated and compared with baseline controllers in a case study involving GTs, offshore wind, and batteries. While one of the baseline controllers considers fuel efficiency, it employs a continuous formulation, which results in lower efficiency than the two proposed approaches as it does not account for the semi-continuous operating regions of each GT.
Submitted 10 April, 2024;
originally announced April 2024.
-
DDPG-E2E: A Novel Policy Gradient Approach for End-to-End Communication Systems
Authors:
Bolun Zhang,
Nguyen Van Huynh,
Dinh Thai Hoang,
Diep N. Nguyen,
Quoc-Viet Pham
Abstract:
The End-to-end (E2E) learning-based approach has great potential to reshape the existing communication systems by replacing the transceivers with deep neural networks. To this end, the E2E learning approach needs to assume the availability of prior channel information to mathematically formulate a differentiable channel layer for the backpropagation (BP) of the error gradients, thereby jointly optimizing the transmitter and the receiver. However, accurate and instantaneous channel state information is hardly obtained in practical wireless communication scenarios. Moreover, the existing E2E learning-based solutions exhibit limited performance in data transmissions with large block lengths. In this article, these practical issues are addressed by our proposed deep deterministic policy gradient-based E2E communication system. In particular, the proposed solution utilizes a reward feedback mechanism to train both the transmitter and the receiver, which alleviates the information loss of error gradients during BP. In addition, a convolutional neural network (CNN)-based architecture is developed to mitigate the curse of dimensionality problem when transmitting messages with large block lengths. Extensive simulations then demonstrate that our proposed solution can not only jointly train the transmitter and the receiver simultaneously without requiring the prior channel knowledge but also can obtain significant performance improvement on block error rate compared to state-of-the-art solutions.
Submitted 9 April, 2024;
originally announced April 2024.
-
Strong interactions between integrated microresonators and alkali atomic vapors: towards single-atom, single-photon operation
Authors:
Roy Zektzer,
Xiyuan Lu,
Khoi Tuan Hoang,
Rahul Shrestha,
Sharoon Austin,
Feng Zhou,
Ashish Chanana,
Glenn Holland,
Daron Westly,
Paul Lett,
Alexey V. Gorshkov,
Kartik Srinivasan
Abstract:
Cavity quantum electrodynamics (cQED), the interaction of a two-level system with a high quality factor (Q) cavity, is a foundational building block in different architectures for quantum computation, communication, and metrology. The strong interaction between the atom and the cavity enables single photon operation which is required for quantum gates and sources. Cold atoms, quantum dots, and color centers in crystals are amongst the systems that have shown single photon operations, but they require significant physical infrastructure. Atomic vapors, on the other hand, require limited experimental infrastructure and are hence much easier to deploy outside a laboratory, but they produce an ensemble of moving atoms that results in short interaction times involving multiple atoms, which can hamper quantum operations. A solution to this issue can be found in nanophotonic cavities, where light-matter interaction is enhanced and the volume of operation is small, so that fast single-atom, single-photon operations are enabled. In this work, we study the interaction of an atomically-clad microring resonator (ACMRR) with different-sized ensembles of Rb atoms. We demonstrate strong coupling between an ensemble of ~50 atoms interacting with a high-quality factor (Q > 4 x 10^5) ACMRR, yielding a many-atom cooperativity C ~ 5.5. We continue to observe signatures of atom-photon interaction for a few (< 3) atoms, for which we observe saturation at the level of one intracavity photon. Further development of our platform, which includes integrated thermo-optic heaters to enable cavity tuning and stabilization, should enable the observation of interactions between single photons and single atoms.
Submitted 5 April, 2024;
originally announced April 2024.
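A rough reading of the numbers quoted in the abstract above: in the idealized picture where N identical atoms couple equally to the cavity mode, the collective coupling grows as the square root of N, so the many-atom cooperativity grows linearly with N. Under that assumption (atoms in a vapor actually sample different field strengths, so this is only an order-of-magnitude sketch), the quoted C ~ 5.5 for ~50 atoms implies a per-atom cooperativity of about 0.11. The function names below are illustrative.

```python
def single_atom_cooperativity(c_ensemble: float, n_atoms: float) -> float:
    """Per-atom cooperativity, assuming C_N = N * C_1 (identical coupling)."""
    if n_atoms <= 0:
        raise ValueError("n_atoms must be positive")
    return c_ensemble / n_atoms


def atoms_for_cooperativity(c_target: float, c_single: float) -> float:
    """Atom number needed to reach c_target under the same linear scaling."""
    if c_single <= 0:
        raise ValueError("c_single must be positive")
    return c_target / c_single


# Approximate values quoted in the abstract: C ~ 5.5 with ~50 atoms.
c1 = single_atom_cooperativity(5.5, 50)
n_unity = atoms_for_cooperativity(1.0, c1)
print(f"C_1 ~ {c1:.2f}; C = 1 would need about {n_unity:.0f} atoms")
```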
-
Incentives in Private Collaborative Machine Learning
Authors:
Rachael Hwee Ling Sim,
Yehong Zhang,
Trong Nghia Hoang,
Xinyi Xu,
Bryan Kian Hsiang Low,
Patrick Jaillet
Abstract:
Collaborative machine learning involves training models on data from multiple parties but must incentivize their participation. Existing data valuation methods fairly value and reward each party based on shared data or model parameters but neglect the privacy risks involved. To address this, we introduce differential privacy (DP) as an incentive. Each party can select its required DP guarantee and perturb its sufficient statistic (SS) accordingly. The mediator values the perturbed SS by the Bayesian surprise it elicits about the model parameters. As our valuation function enforces a privacy-valuation trade-off, parties are deterred from selecting excessive DP guarantees that reduce the utility of the grand coalition's model. Finally, the mediator rewards each party with different posterior samples of the model parameters. Such rewards still satisfy existing incentives like fairness but additionally preserve DP and a high similarity to the grand coalition's posterior. We empirically demonstrate the effectiveness and practicality of our approach on synthetic and real-world datasets.
Submitted 2 April, 2024;
originally announced April 2024.
-
Existence for noncoercive nonlinear elliptic equations with two lower-order terms
Authors:
Thi Tam Dang,
Trung Hau Hoang
Abstract:
This paper considers a class of noncoercive nonlinear elliptic problems with coefficients defined in Marcinkiewicz and Lorentz spaces. We prove the existence of a solution for the corresponding Dirichlet problem and investigate the higher integrability properties of the solution.
Submitted 31 March, 2024;
originally announced April 2024.
-
Foreground Dust Properties towards the Cluster NGC 7380
Authors:
Sadhana Singh,
Jeewan C. Pandey,
Thiem Hoang,
Neelam Panwar,
Biman J. Medhi,
Vishal Joshi,
Shashikiran Ganesh
Abstract:
Using starlight polarization, we present the properties of foreground dust towards the cluster NGC 7380, embedded in the H II region Sh 2-142. Observations of starlight polarization are carried out in four filters using an imaging polarimeter mounted on the 104-cm ARIES telescope. Polarization vectors of stars are aligned along the Galactic magnetic field. Towards the east and southeast regions, the dust structure appears much denser than in other regions (inferred from extinction contours and a colour composite image), which is also reflected in the polarization distribution. We find that the polarization degree and extinction tend to increase with distance, and we find indications of a dust layer at a distance of around 1.2 kpc. We have identified eight potential candidates exhibiting intrinsic polarization by employing three distinct criteria to distinguish stars with intrinsic polarization from stars with interstellar polarization. For stars with interstellar polarization, we find that the maximum polarization degree increases with the color excess and has a strong scatter, with a mean value of 1.71$\pm$0.57%. The peak wavelength spans $0.40-0.88$ $μ$m with a mean value of 0.56$\pm$0.07 $μ$m, suggesting grain sizes in the region similar to those of the average diffuse interstellar medium. The polarization efficiency is also found to decrease with visual extinction as $P_{max}/A_{V}\propto A_{V}^{-0.61}$. Our observational results are found to be consistent with the predictions of the radiative torque alignment theory.
Submitted 29 March, 2024;
originally announced March 2024.
-
Understanding the Multi-wavelength Thermal Dust Polarisation from the Orion Molecular Cloud in Light of the Radiative Torque Paradigm
Authors:
Le Ngoc Tram,
Thiem Hoang,
Helmut Wiesemeyer,
Isabelle Ristorcelli,
Karl M. Menten,
Nguyen Bich Ngoc,
Pham Ngoc Diep
Abstract:
Dust grains are important in various astrophysical processes and serve as indicators of interstellar medium structures, density, and mass. Understanding their physical properties and chemical composition is crucial in astrophysics. Dust polarisation is a valuable tool for studying these properties. The Radiative Torque (RAT) paradigm, which includes Radiative Torque Alignment (RAT-A) and Radiative Torque Disruption (RAT-D), is essential to interpret the dust polarisation data and constrain the fundamental properties of dust grains. However, it has been used primarily to interpret observations at a single wavelength. In this study, we analyse the thermal dust polarisation spectrum obtained from observations with SOFIA/HAWC+ and JCMT/POL-2 in the OMC-1 region and compare the observational data with our numerical results using the RAT paradigm. In general, we show that the dense gas exhibits a positive spectral slope, whereas the warm regions show a negative one. We demonstrate that a one-layer dust (one-phase) model can only reproduce the observed spectra at certain locations and cannot match those with prominent V-shaped spectra (for which the degree of polarisation initially decreases with wavelength from 54 to $\sim$ 300$\,μ$m and then increases at longer wavelengths). To address this, we improve our model by incorporating two dust components (warm and cold) along the line of sight, resulting in a two-phase model. This improved model successfully reproduces the V-shaped spectra. The best model corresponds to a mixture composition of silicate and carbonaceous grains in the cold medium. Finally, by assuming the plausible model of grain alignment, we infer the inclination angle of the magnetic fields in OMC-1. This approach represents an important step towards better understanding the physics of grain alignment and constraining 3D magnetic fields using dust polarisation spectra.
Submitted 27 June, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
B-fields And dust in interstelLar fiLAments using Dust POLarization (BALLAD-POL): II. Testing the Radiative Torque Paradigm in Musca and OMC-1
Authors:
Nguyen Bich Ngoc,
Thiem Hoang,
Pham Ngoc Diep,
Le Ngoc Tram
Abstract:
Polarization of starlight and thermal dust emission caused by aligned dust grains is a valuable tool to characterize magnetic fields (B-fields) and constrain dust properties. However, the physics of grain alignment is not fully understood. To test the popular paradigm of radiative torque (RAT) theory, including RAT alignment (RAT-A) and disruption (RAT-D), we use dust polarization data observed by Planck and SOFIA/HAWC+ toward two filaments with contrasting physical conditions: Musca, a quiet filament, and OMC-1, a highly dynamic filament due to feedback. We analyze various relations of the observed polarization fraction, $P$, with gas column density, $N(\rm H_{2})$, dust temperature, $T_{\rm d}$, and polarization angle dispersion function, $\mathcal{S}$. We found that $P$ decreases with increasing $\mathcal{S}$ and increasing $N(\rm H_{2})$, as expected from RAT-A. On the other hand, the $P-T_{\rm d}$ relation is more complicated; it is a linear correlation at low $T_{\rm d}$ but turns into an anti-correlation when $T_{\rm d}$ reaches a certain high value. Next, we compute the polarization fraction on a pixel-by-pixel basis with B-fields in the plane of the sky using the DustPOL code based on RAT, incorporate the depolarization effect by B-field tangling using $\mathcal{S}$, and compare the realistic polarization model with observations of Musca and OMC-1. For Musca with well-ordered B-fields, our numerical model reproduces the decline of $P$ toward the filament spine (a.k.a. the polarization hole), which has high $N(\rm H_{2})$ and low $T_{\rm d}$, indicating the loss of grain alignment efficiency due to RAT-A. For OMC-1, with stronger B-field variations and higher $T_{\rm d}$, our model can reproduce the observed $P-T_{\rm d}$ and $P-N(\rm H_{2})$ relations only if the depolarization effect resulting from B-field tangling and the RAT-D effect are taken into account. Our results provide more robust observational evidence for the RAT paradigm, particularly the recently discovered RAT-D.
Submitted 25 March, 2024;
originally announced March 2024.
-
Who Uses Personas in Requirements Engineering: The Practitioners' Perspective
Authors:
Yi Wang,
Chetan Arora,
Xiao Liu,
Thuong Hoang,
Vasudha Malhotra,
Ben Cheng,
John Grundy
Abstract:
Personas are commonly used in software projects to gain a better understanding of end-users' needs. However, there is a limited understanding of their usage and effectiveness in practice. This paper presents the results of a two-step investigation, comprising interviews with 26 software developers, UI/UX designers, business analysts and product managers and a survey of 203 practitioners, aimed at shedding light on the current practices, methods and challenges of using personas in software development. Our findings reveal variations in the frequency and effectiveness of personas across different software projects and IT companies, the challenges practitioners face when using personas and the reasons for not using them at all. Furthermore, we investigate the coverage of human aspects in personas, often assumed to be a key feature of persona descriptions. Contrary to the general perception, our study shows that human aspects are often ignored for various reasons in personas or requirements engineering in general. Our study provides actionable insights for practitioners to overcome challenges in using personas during requirements engineering stages, and we identify areas for future research.
Submitted 23 March, 2024;
originally announced March 2024.
-
Multiple-Input Auto-Encoder Guided Feature Selection for IoT Intrusion Detection Systems
Authors:
Phai Vu Dinh,
Diep N. Nguyen,
Dinh Thai Hoang,
Quang Uy Nguyen,
Eryk Dutkiewicz,
Son Pham Bao
Abstract:
While intrusion detection systems (IDSs) benefit from the diversity and generalization of IoT data features, the data diversity (e.g., the heterogeneity and high dimensions of data) also makes it difficult to train effective machine learning models in IoT IDSs. This also leads to potentially redundant/noisy features that may decrease the accuracy of the detection engine in IDSs. This paper first introduces a novel neural network architecture called Multiple-Input Auto-Encoder (MIAE). MIAE consists of multiple sub-encoders that can process inputs from different sources with different characteristics. The MIAE model is trained in an unsupervised learning mode to transform the heterogeneous inputs into a lower-dimensional representation, which helps classifiers distinguish between normal behaviour and different types of attacks. To distil and retain more relevant features but remove less important/redundant ones during the training process, we further design and embed a feature selection layer right after the representation layer of MIAE, resulting in a new model called MIAEFS. This layer learns the importance of features in the representation vector, facilitating the selection of informative features from the representation vector. The results on three IDS datasets, i.e., NSLKDD, UNSW-NB15, and IDS2017, show the superior performance of MIAE and MIAEFS compared to other methods, e.g., conventional classifiers, dimensionality reduction models, unsupervised representation learning methods with different input dimensions, and unsupervised feature selection models. Moreover, MIAE and MIAEFS combined with the Random Forest (RF) classifier achieve an accuracy of 96.5% in detecting sophisticated attacks, e.g., Slowloris. The average running time for detecting an attack sample using RF with the representation of MIAE and MIAEFS is approximately 1.7E-6 seconds, whilst the model size is lower than 1 MB.
Submitted 21 March, 2024;
originally announced March 2024.
-
An AIC-based approach for articulating unpredictable problems in open complex environments
Authors:
Haider AL-Shareefy,
Michael Butler,
Thai Son Hoang
Abstract:
This research paper presents an approach to enhancing the predictive capability of architects in the design and assurance of systems, focusing on systems operating in dynamic and unpredictable environments. By adopting a systems approach, we aim to improve architects' predictive capabilities in designing dependable systems (for example, ML-based systems). An aerospace case study is used to illustrate the approach. Multiple factors (challenges) influencing aircraft detection are identified, demonstrating the effectiveness of our approach in a complex operational setting. Our approach primarily aimed to enhance the architect's predictive capability.
Submitted 15 March, 2024;
originally announced March 2024.
-
ARtVista: Gateway To Empower Anyone Into Artist
Authors:
Trong-Vu Hoang,
Quang-Binh Nguyen,
Duy-Nam Ly,
Khanh-Duy Le,
Tam V. Nguyen,
Minh-Triet Tran,
Trung-Nghia Le
Abstract:
Drawing is an art that enables people to express their imagination and emotions. However, individuals usually face challenges in drawing, especially when translating conceptual ideas into visually coherent representations and bridging the gap between mental visualization and practical execution. In response, we propose ARtVista - a novel system integrating AR and generative AI technologies. ARtVista not only recommends reference images aligned with users' abstract ideas and generates sketches for users to draw but also goes beyond, crafting vibrant paintings in various painting styles. ARtVista also offers users an alternative approach to create striking paintings by simulating the paint-by-number concept on reference images, empowering users to create visually stunning artwork devoid of the necessity for advanced drawing skills. We perform a pilot study and reveal positive feedback on its usability, emphasizing its effectiveness in visualizing user ideas and aiding the painting process to achieve stunning pictures without requiring advanced drawing skills. The source code will be available at https://github.com/htrvu/ARtVista.
Submitted 13 March, 2024;
originally announced March 2024.
-
Emerging Technologies for 6G Non-Terrestrial-Networks: From Academia to Industrial Applications
Authors:
Cong T. Nguyen,
Yuris Mulya Saputra,
Nguyen Van Huynh,
Tan N. Nguyen,
Dinh Thai Hoang,
Diep N Nguyen,
Van-Quan Pham,
Miroslav Voznak,
Symeon Chatzinotas,
Dinh-Hieu Tran
Abstract:
Terrestrial networks form the fundamental infrastructure of modern communication systems, serving more than 4 billion users globally. However, terrestrial networks are facing a wide range of challenges, from coverage and reliability to interference and congestion. As the demands of the 6G era are expected to be much higher, it is crucial to address these challenges to ensure a robust and efficient communication infrastructure for the future. To address these problems, Non-Terrestrial Networks (NTNs) have emerged as a promising solution. NTNs are communication networks that leverage airborne (e.g., unmanned aerial vehicles) and spaceborne vehicles (e.g., satellites) to facilitate ultra-reliable communications and connectivity with high data rates and low latency over expansive regions. This article aims to provide a comprehensive survey on the utilization of network slicing, Artificial Intelligence/Machine Learning (AI/ML), and Open Radio Access Network (ORAN) to address diverse challenges of NTNs from the perspectives of both academia and industry. In particular, we first provide an in-depth tutorial on NTNs and the key enabling technologies, including network slicing, AI/ML, and ORAN. Then, we provide a comprehensive survey on how network slicing and AI/ML have been leveraged to overcome the challenges that NTNs are facing. Moreover, we present how ORAN can be utilized for NTNs. Finally, we highlight important challenges, open issues, and future research directions of NTNs in the 6G era.
Submitted 3 July, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
Generative AI for Unmanned Vehicle Swarms: Challenges, Applications and Opportunities
Authors:
Guangyuan Liu,
Nguyen Van Huynh,
Hongyang Du,
Dinh Thai Hoang,
Dusit Niyato,
Kun Zhu,
Jiawen Kang,
Zehui Xiong,
Abbas Jamalipour,
Dong In Kim
Abstract:
With recent advances in artificial intelligence (AI) and robotics, unmanned vehicle swarms have received great attention from both academia and industry due to their potential to provide services that are difficult and dangerous for humans to perform. However, learning and coordinating movements and actions for a large number of unmanned vehicles in complex and dynamic environments introduce significant challenges to conventional AI methods. Generative AI (GAI), with its capabilities in complex data feature extraction, transformation, and enhancement, offers great potential in solving these challenges of unmanned vehicle swarms. To that end, this paper aims to provide a comprehensive survey on applications, challenges, and opportunities of GAI in unmanned vehicle swarms. Specifically, we first present an overview of unmanned vehicles and unmanned vehicle swarms as well as their use cases and existing issues. Then, an in-depth background of various GAI techniques, together with their capabilities in enhancing unmanned vehicle swarms, is provided. After that, we present a comprehensive review of the applications and challenges of GAI in unmanned vehicle swarms with various insights and discussions. Finally, we highlight open issues of GAI in unmanned vehicle swarms and discuss potential research directions.
Submitted 28 February, 2024;
originally announced February 2024.
-
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Authors:
Jianguo Zhang,
Tian Lan,
Rithesh Murthy,
Zhiwei Liu,
Weiran Yao,
Juntao Tan,
Thai Hoang,
Liangwei Yang,
Yihao Feng,
Zuxin Liu,
Tulika Awalgaonkar,
Juan Carlos Niebles,
Silvio Savarese,
Shelby Heinecke,
Huan Wang,
Caiming Xiong
Abstract:
Autonomous agents powered by large language models (LLMs) have garnered significant research attention. However, fully harnessing the potential of LLMs for agent-based tasks presents inherent challenges due to the heterogeneous nature of diverse data sources featuring multi-turn trajectories. In this paper, we introduce AgentOhana as a comprehensive solution to address these challenges. AgentOhana aggregates agent trajectories from distinct environments, spanning a wide array of scenarios. It meticulously standardizes and unifies these trajectories into a consistent format, streamlining the creation of a generic data loader optimized for agent training. Leveraging the data unification, our training pipeline maintains equilibrium across different data sources and preserves independent randomness across devices during dataset partitioning and model training. Additionally, we present xLAM-v0.1, a large action model tailored for AI agents, which demonstrates exceptional performance across various benchmarks. Begin the exploration at https://github.com/SalesforceAIResearch/xLAM.
Submitted 20 March, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
Magnetic Field at the Galactic Centre from Multi-Wavelength Dust Polarization
Authors:
M. S. Akshaya,
Thiem Hoang
Abstract:
We have mapped the magnetic field ($B$-field) for a region of about 30 pc around the centre of our Galaxy, which encompasses the circumnuclear disk (CND), the minispiral, and the 20 km s$^{-1}$ and 50 km s$^{-1}$ molecular clouds, using thermal dust polarization observations obtained from SOFIA/HAWC+ and JCMT/SCUPOL. We decompose the spectra of $^{12}$CO ($J\!=\!3\!\rightarrow\!2$) transition from this region into individual cloud components and find the polarization observed at different wavelengths might be tracing completely different layers of dust along the line of sight. We use modified Davis-Chandrasekhar-Fermi methods to measure the strength of $B$-field projected in the plane of the sky ($B_{{}_{\mathrm{POS}}}$). The mean $B_{{}_{\mathrm{POS}}}$ of the CND and the minispiral, probed at 53 $μ$m is of the order of $\sim\!2$ mG. $B_{{}_{\mathrm{POS}}}\!\!\!<\!1$ mG close to the Galactic Centre, in the region of the ionized mini-cavity within the CND, and increases outwards. However, the longer wavelength polarization at 216 $μ$m appears to come from a dust layer that is cooler and behind the CND and has a stronger $B$-field of about 7 mG. The $B$-field strength is lowest along the Eastern Arm of the minispiral, which is also the only region with Alfvén Mach number, $\mathcal{M}_{\mathrm{A}}>1$ and mass-to-flux ratio, $λ\!\gtrsim\!1$. Such an observed weak $B$-field could be a result of the low resolution of the observation, where the tangled $B$-fields due to the strong turbulence in the high density clumps of the CND are lost within the beam size of the observation.
Submitted 10 June, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
A New Hope: Contextual Privacy Policies for Mobile Applications and An Approach Toward Automated Generation
Authors:
Shidong Pan,
Zhen Tao,
Thong Hoang,
Dawen Zhang,
Tianshi Li,
Zhenchang Xing,
Sherry Xu,
Mark Staples,
Thierry Rakotoarivelo,
David Lo
Abstract:
Privacy policies have emerged as the predominant approach to conveying privacy notices to mobile application users. In an effort to enhance both readability and user engagement, the concept of contextual privacy policies (CPPs) has been proposed by researchers. The aim of CPPs is to fragment privacy policies into concise snippets, displaying them only within the corresponding contexts within the application's graphical user interfaces (GUIs). In this paper, we first formulate CPPs in the mobile application scenario, and then present a novel multimodal framework, named SeePrivacy, specifically designed to automatically generate CPPs for mobile applications. This method uniquely integrates vision-based GUI understanding with privacy policy analysis, achieving 0.88 precision and 0.90 recall in detecting contexts, as well as 0.98 precision and 0.96 recall in extracting corresponding policy segments. A human evaluation shows that 77% of the extracted privacy policy segments were perceived as well-aligned with the detected contexts. These findings suggest that SeePrivacy could serve as a significant tool for bolstering user interaction with, and understanding of, privacy policies. Furthermore, our solution has the potential to make privacy notices more accessible and inclusive, thus appealing to a broader demographic. A demonstration of our work can be accessed at https://cpp4app.github.io/SeePrivacy/
Submitted 10 March, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
Towards Efficient Pareto-optimal Utility-Fairness between Groups in Repeated Rankings
Authors:
Phuong Dinh Mai,
Duc-Trong Le,
Tuan-Anh Hoang,
Dung D. Le
Abstract:
In this paper, we tackle the problem of computing a sequence of rankings with the guarantee of the Pareto-optimal balance between (1) maximizing the utility of the consumers and (2) minimizing unfairness between producers of the items. Such a multi-objective optimization problem is typically solved using a combination of a scalarization method and linear programming on bi-stochastic matrices, representing the distribution of possible rankings of items. However, the above-mentioned approach relies on Birkhoff-von Neumann (BvN) decomposition, whose computational complexity is $\mathcal{O}(n^5)$ with $n$ being the number of items, making it impractical for large-scale systems. To address this drawback, we introduce a novel approach to the above problem by using the Expohedron - a permutahedron whose points represent all achievable exposures of items. On the Expohedron, we profile the Pareto curve, which captures the trade-off between group fairness and user utility, by identifying a finite number of Pareto optimal solutions. We further propose an efficient method by relaxing our optimization problem on the Expohedron's circumscribed $n$-sphere, which significantly improves the running time. Moreover, the approximate Pareto curve is asymptotically close to the real Pareto optimal curve as the number of substantial solutions increases. Our methods are applicable with different ranking merits that are non-decreasing functions of item relevance. The effectiveness of our methods is validated through experiments on both synthetic and real-world datasets.
Submitted 22 February, 2024;
originally announced February 2024.
-
Q-learning-based Joint Design of Adaptive Modulation and Precoding for Physical Layer Security in Visible Light Communications
Authors:
Duc M. T. Hoang,
Thanh V. Pham,
Anh T. Pham,
Chuyen T Nguyen
Abstract:
There has been an increasing interest in physical layer security (PLS), which, compared with conventional cryptography, offers a unique approach to guaranteeing information confidentiality against eavesdroppers. In this paper, we study a joint design of adaptive $M$-ary pulse amplitude modulation (PAM) and precoding, which aims to optimize wiretap visible-light channels' secrecy capacity and bit error rate (BER) performances. The proposed design is motivated by higher-order modulation, which results in better secrecy capacity at the expense of a higher BER. On the other hand, a proper precoding design, which can manipulate the received signal quality at the legitimate user and the eavesdropper, can also enhance secrecy performance and influence the BER. A reward function that considers the secrecy capacity and the BERs of the legitimate user's (Bob) and the eavesdropper's (Eve) channels is introduced and maximized. Due to the non-linearity and complexity of the reward function, it is challenging to solve the optical design using classical optimization techniques. Therefore, reinforcement learning-based designs using Q-learning and Deep Q-learning are proposed to maximize the reward function. Simulation results verify that compared with the baseline designs, the proposed joint designs achieve better reward values while maintaining the BER of Bob's channel (Eve's channel) well below (above) the pre-FEC (forward error correction) BER threshold.
Submitted 21 February, 2024;
originally announced February 2024.
-
An elaborate new proof of Cayley's formula
Authors:
Esther Banaian,
Anh Trong Nam Hoang,
Elizabeth Kelley,
Weston Miller,
Jason Stack,
Carolyn Stephen,
Nathan Williams
Abstract:
We construct a bijection between certain Deodhar components of a braid variety constructed from an affine Kac-Moody group of type $A_{n-1}$ and vertex-labeled trees on $n$ vertices. By an argument of Galashin, Lam, and Williams using Opdam's trace formula in the affine Hecke algebra and an identity due to Haglund, we obtain an elaborate new proof for the enumeration of the number of vertex-labeled trees on $n$ vertices.
Submitted 12 February, 2024;
originally announced February 2024.
-
Detection of extragalactic anomalous microwave emission in NGC 2903 using KVN single-dish observations
Authors:
Panomporn Poojon,
Aeree Chung,
Thiem Hoang,
Junhyun Baek,
Hiroyuki Nakanishi,
Tomoya Hirota,
Chao-Wei Tsai
Abstract:
We present the results of the single-dish observations using the Korean VLBI Network to search for anomalous microwave emission (AME) in nearby galaxies. The targets were selected from MApping the dense moLecular gAs in the sTrongest stAr-formiNg Galaxies (MALATANG), a legacy survey project of the James Clerk Maxwell Telescope. The MALATANG galaxies are good representatives of local galaxies with enhanced nuclear activity associated with star formation and/or AGN, providing IR-bright galaxy samples; thus, they are good candidates for AME hosts. Combining with the ancillary data, we investigated the radio-IR spectral energy distribution (SED), while searching for the AME signals in five galaxies. The AME in NGC 2903 was well detected at a significant confidence level, whereas that in NGC 2146 and M82 was marginal. NGC 1068 and Arp 299 indicated no significant hints, and we provided the upper limits for the AME. The best-fit SED exhibited local peaks of the AME components at higher frequencies and with stronger peak fluxes than those in the previous studies. This suggested the origin of AME being denser environments such as molecular clouds or photodissociation regions rather than warm neutral/ionized medium as commonly suggested by previous studies. Further, our AME-detected targets were observed to exhibit higher specific star-formation rates than the other extragalactic AME hosts. Furthermore, AME favored starburst galaxies among our sample rather than AGN hosts. Consequently, this might imply that AGNs are excessively harsh environments for tiny dust to survive.
Submitted 5 February, 2024;
originally announced February 2024.
-
Smart Textile-Driven Soft Spine Exosuit for Lifting Tasks in Industrial Applications
Authors:
Kefan Zhu,
Bibhu Sharma,
Phuoc Thien Phan,
James Davies,
Mai Thanh Thai,
Trung Thien Hoang,
Chi Cong Nguyen,
Adrienne Ji,
Emanuele Nicotra,
Nigel H. Lovell,
Thanh Nho Do
Abstract:
Work related musculoskeletal disorders (WMSDs) are often caused by repetitive lifting, making them a significant concern in occupational health. Although wearable assist devices have become the norm for mitigating the risk of back pain, most spinal assist devices still possess a partially rigid structure that impacts the user comfort and flexibility. This paper addresses this issue by presenting a smart textile actuated spine assistance robotic exosuit (SARE), which can conform to the back seamlessly without impeding the user movement and is incredibly lightweight. The SARE can assist the human erector spinae to complete any action with virtually infinite degrees of freedom. To detect the strain on the spine and to control the smart textile automatically, a soft knitting sensor which utilizes fluid pressure as sensing element is used. The new device is validated experimentally with human subjects where it reduces peak electromyography (EMG) signals of lumbar erector spinae by around 32 percent in loaded and around 22 percent in unloaded conditions. Moreover, the integrated EMG decreased by around 24.2 percent under loaded condition and around 23.6 percent under unloaded condition. In summary, the artificial muscle wearable device represents an anatomical solution to reduce the risk of muscle strain, metabolic energy cost and back pain associated with repetitive lifting tasks.
Submitted 3 February, 2024;
originally announced February 2024.
-
Changes in heat waves characteristics over Extremadura (SW Spain)
Authors:
F. J. Acero,
M. I. Fernández-Fernández,
V. M. S. Carrasco,
S. Parey,
T. T. Huong Hoang,
D. Dacunha-Castelle,
J. A. García
Abstract:
Heat wave (HW) events are becoming more frequent, and they have important consequences because of the negative effects they can have not only on the human population in health terms, but also on biodiversity and agriculture. This motivated a study of the trends in HW events over Extremadura, a region in the southwest of Spain, with much of its area in summer devoted to the production of irrigated crops such as maize and tomatoes. Heat waves were defined for the study as two consecutive days with temperatures above the 95th percentile of the summer (June-August) maximum temperature (Tmax) time series. Two datasets were used: one consisted of 13 daily temperature records uniformly distributed over the Region, and the other was the SPAIN02 gridded observational dataset, extracting just the points corresponding to Extremadura. The trends studied were in the duration, intensity, and frequency of HW events, and in other parameters such as the mean, low (25th percentile), and high (75th percentile) values. In general terms, the results showed significant positive trends in those parameters over the east, the northwest, and a small area in the south of the Region. In order to study changes in HW characteristics (duration, frequency and intensity) considering different subperiods, a stochastic model was used to generate 1000 time series equivalent to the observed ones. The results showed that there were no significant changes in HW duration in the last 10-year subperiod in comparison with the first. But the results were different for warm events (WE), defined with a lower threshold (the 75th percentile), which are also important for agriculture. For several sites, there were significant changes in WE duration, frequency, and intensity.
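The event definition in this abstract can be sketched in a few lines. The example below is a hedged illustration, not the authors' code: it assumes "two consecutive days" means runs of at least two days, takes the percentile over the series passed in, and uses hypothetical names (`percentile`, `find_events`). Setting `q=75.0` gives the warm-event (WE) variant with the lower threshold.

```python
def percentile(values, q):
    """Linear-interpolation percentile (q in [0, 100]) of a non-empty sequence."""
    xs = sorted(values)
    pos = (len(xs) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def find_events(tmax, q=95.0, min_days=2):
    """Find runs of at least min_days consecutive days with Tmax above the q-th percentile.

    Returns (threshold, events), where each event is
    (start_index, duration_in_days, mean_excess_over_threshold).
    """
    tmax = list(tmax)
    threshold = percentile(tmax, q)
    events = []
    start = None
    # A trailing sentinel below any threshold closes a run that reaches the last day.
    for i, t in enumerate(tmax + [float("-inf")]):
        if t > threshold:
            if start is None:
                start = i
        elif start is not None:
            duration = i - start
            if duration >= min_days:
                intensity = sum(tmax[start:i]) / duration - threshold
                events.append((start, duration, intensity))
            start = None
    return threshold, events


if __name__ == "__main__":
    # Synthetic summer series: mostly 30 degrees C with one two-day hot spell.
    series = [30.0] * 20 + [40.0, 41.0] + [30.0] * 18
    print(find_events(series))           # heat waves (95th percentile)
    print(find_events(series, q=75.0))   # warm events (75th percentile)
```

The number of events per summer gives the frequency, and the duration and mean excess of each event correspond to the duration and intensity whose trends the study examines.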
Submitted 1 February, 2024;
originally announced February 2024.
-
CNN-FL for Biotechnology Industry Empowered by Internet-of-BioNano Things and Digital Twins
Authors:
Mohammad Jamshidi,
Dinh Thai Hoang,
Diep N. Nguyen
Abstract:
Digital twins (DTs) are revolutionizing the biotechnology industry by enabling sophisticated digital representations of biological assets, microorganisms, drug development processes, and digital health applications. However, digital twinning at micro and nano scales, particularly in modeling complex entities like bacteria, presents significant challenges in terms of requiring advanced Internet of Things (IoT) infrastructure and computing approaches to achieve enhanced accuracy and scalability. In this work, we propose a novel framework that integrates the Internet of Bio-Nano Things (IoBNT) with advanced machine learning techniques, specifically convolutional neural networks (CNN) and federated learning (FL), to effectively tackle the identified challenges. Within our framework, IoBNT devices are deployed to gather image-based biological data across various physical environments, leveraging the strong capabilities of CNNs for robust machine vision and pattern recognition. Subsequently, FL is utilized to aggregate insights from these disparate data sources, creating a refined global model that continually enhances accuracy and predictive reliability, which is crucial for the effective deployment of DTs in biotechnology. The primary contribution is the development of a novel framework that synergistically combines CNN and FL, augmented by the capabilities of the IoBNT. This novel approach is specifically tailored to enhancing DTs in the biotechnology industry. The results showcase enhancements in the reliability and safety of microorganism DTs, while preserving their accuracy. Furthermore, the proposed framework excels in energy efficiency and security, offering a user-friendly and adaptable solution. This broadens its applicability across diverse sectors, including biotechnology and pharmaceutical industries, as well as clinical and hospital settings.
Submitted 31 January, 2024;
originally announced February 2024.
-
Effective Multi-Stage Training Model For Edge Computing Devices In Intrusion Detection
Authors:
Thua Huynh Trong,
Thanh Nguyen Hoang
Abstract:
Intrusion detection poses a significant challenge within expansive and persistently interconnected environments. As malicious code continues to advance and sophisticated attack methodologies proliferate, various advanced deep learning-based detection approaches have been proposed. Nevertheless, the complexity and accuracy of intrusion detection models still need further enhancement to render them more adaptable to diverse system categories, particularly within resource-constrained devices, such as those embedded in edge computing systems. This research introduces a three-stage training paradigm, augmented by an enhanced pruning methodology and model compression techniques. The objective is to elevate the system's effectiveness, concurrently maintaining a high level of accuracy for intrusion detection. Empirical assessments conducted on the UNSW-NB15 dataset evince that this solution notably reduces the model's dimensions, while upholding accuracy levels equivalent to similar proposals.
Submitted 30 January, 2024;
originally announced January 2024.
-
A Survey on Structure-Preserving Graph Transformers
Authors:
Van Thuy Hoang,
O-Joun Lee
Abstract:
The transformer architecture has shown remarkable success in various domains, such as natural language processing and computer vision. When it comes to graph learning, transformers are required not only to capture the interactions between pairs of nodes but also to preserve graph structures connoting the underlying relations and proximity between them, showing the expressive power to capture different graph structures. Accordingly, various structure-preserving graph transformers have been proposed and widely used for various tasks, such as graph-level tasks in bioinformatics and chemoinformatics. However, strategies related to graph structure preservation have not been well organized and systematized in the literature. In this paper, we provide a comprehensive overview of structure-preserving graph transformers and generalize these methods from the perspective of their design objective. First, we divide strategies into four main groups: node feature modulation, context node sampling, graph rewriting, and transformer architecture improvements. We then further divide the strategies according to the coverage and goals of graph structure preservation. Furthermore, we also discuss challenges and future directions for graph transformer models to preserve the graph structure and understand the nature of graphs.
Submitted 29 January, 2024;
originally announced January 2024.
-
Generative AI-enabled Blockchain Networks: Fundamentals, Applications, and Case Study
Authors:
Cong T. Nguyen,
Yinqiu Liu,
Hongyang Du,
Dinh Thai Hoang,
Dusit Niyato,
Diep N. Nguyen,
Shiwen Mao
Abstract:
Generative Artificial Intelligence (GAI) has recently emerged as a promising solution to address critical challenges of blockchain technology, including scalability, security, privacy, and interoperability. In this paper, we first introduce GAI techniques, outline their applications, and discuss existing solutions for integrating GAI into blockchains. Then, we discuss emerging solutions that demonstrate the effectiveness of GAI in addressing various challenges of blockchain, such as detecting unknown blockchain attacks and smart contract vulnerabilities, designing key secret sharing schemes, and enhancing privacy. Moreover, we present a case study to demonstrate that GAI, specifically the generative diffusion model, can be employed to optimize blockchain network performance metrics. Experimental results clearly show that, compared to a baseline traditional AI approach, the proposed generative diffusion model approach can converge faster, achieve higher rewards, and significantly improve the throughput and latency of the blockchain network. Additionally, we highlight future research directions for GAI in blockchain applications, including personalized GAI-enabled blockchains, GAI-blockchain synergy, and privacy and security considerations within blockchain ecosystems.
Submitted 28 January, 2024;
originally announced January 2024.
-
BugsInPy: A Database of Existing Bugs in Python Programs to Enable Controlled Testing and Debugging Studies
Authors:
Ratnadira Widyasari,
Sheng Qin Sim,
Camellia Lok,
Haodi Qi,
Jack Phan,
Qijin Tay,
Constance Tan,
Fiona Wee,
Jodie Ethelda Tan,
Yuheng Yieh,
Brian Goh,
Ferdian Thung,
Hong Jin Kang,
Thong Hoang,
David Lo,
Eng Lieh Ouh
Abstract:
The 2019 edition of Stack Overflow developer survey highlights that, for the first time, Python outperformed Java in terms of popularity. The gap between Python and Java further widened in the 2020 edition of the survey. Unfortunately, despite the rapid increase in Python's popularity, there are not many testing and debugging tools that are designed for Python. This is in stark contrast with the abundance of testing and debugging tools for Java. Thus, there is a need to push research on tools that can help Python developers. One factor that contributed to the rapid growth of Java testing and debugging tools is the availability of benchmarks. A popular benchmark is the Defects4J benchmark; its initial version contained 357 real bugs from 5 real-world Java programs. Each bug comes with a test suite that can expose the bug. Defects4J has been used by hundreds of testing and debugging studies and has helped to push the frontier of research in these directions. In this project, inspired by Defects4J, we create another benchmark database and tool that contain 493 real bugs from 17 real-world Python programs. We hope our benchmark can help catalyze future work on testing and debugging tools that work on Python programs.
Submitted 27 January, 2024;
originally announced January 2024.
-
A Novel Blockchain Based Information Management Framework for Web 3.0
Authors:
Md Arif Hassan,
Cong T. Nguyen,
Chi-Hieu Nguyen,
Dinh Thai Hoang,
Diep N. Nguyen,
Eryk Dutkiewicz
Abstract:
Web 3.0 is the third generation of the World Wide Web (WWW), concentrating on the critical concepts of decentralization, availability, and increasing client usability. Although Web 3.0 is undoubtedly an essential component of the future Internet, it currently faces critical challenges, including decentralized data collection and management. To overcome these challenges, blockchain has emerged as one of the core technologies for the future development of Web 3.0. In this paper, we propose a novel blockchain-based information management framework, namely Smart Blockchain-based Web (SBW), to manage information in Web 3.0 effectively, enhance the security and privacy of users' data, bring additional profits, and incentivize users to contribute information to the websites. Particularly, SBW utilizes blockchain technology and smart contracts to manage the decentralized data collection process for Web 3.0 effectively. Moreover, in this framework, we develop an effective consensus mechanism based on Proof-of-Stake to reward users' information contributions and conduct a game-theoretical analysis of users' behavior in the considered system. Additionally, we conduct simulations to assess the performance of SBW and investigate the impact of critical parameters on information contribution. The findings confirm our theoretical analysis and demonstrate that our proposed consensus mechanism can incentivize the nodes and users to contribute more information to our systems.
Submitted 23 January, 2024;
originally announced January 2024.
-
Filamentary Network and Magnetic Field Structures Revealed with BISTRO in the High-Mass Star-Forming Region NGC2264 : Global Properties and Local Magnetogravitational Configurations
Authors:
Jia-Wei Wang,
Patrick M. Koch,
Seamus D. Clarke,
Gary Fuller,
Nicolas Peretto,
Ya-Wen Tang,
Hsi-Wei Yen,
Shih-Ping Lai,
Nagayoshi Ohashi,
Doris Arzoumanian,
Doug Johnstone,
Ray Furuya,
Shu-ichiro Inutsuka,
Chang Won Lee,
Derek Ward-Thompson,
Valentin J. M. Le Gouellec,
Hong-Li Liu,
Lapo Fanciullo,
Jihye Hwang,
Kate Pattle,
Frédérick Poidevin,
Mehrnoosh Tahani,
Takashi Onaka,
Mark G. Rawlings,
Eun Jung Chung
, et al. (132 additional authors not shown)
Abstract:
We report 850 $μ$m continuum polarization observations toward the filamentary high-mass star-forming region NGC 2264, taken as part of the B-fields In STar forming Regions Observations (BISTRO) large program on the James Clerk Maxwell Telescope (JCMT). These data reveal a well-structured non-uniform magnetic field in the NGC 2264C and 2264D regions with a prevailing orientation around 30 deg from north to east. Field strengths estimates and a virial analysis for the major clumps indicate that NGC 2264C is globally dominated by gravity while in 2264D magnetic, gravitational, and kinetic energies are roughly balanced. We present an analysis scheme that utilizes the locally resolved magnetic field structures, together with the locally measured gravitational vector field and the extracted filamentary network. From this, we infer statistical trends showing that this network consists of two main groups of filaments oriented approximately perpendicular to one another. Additionally, gravity shows one dominating converging direction that is roughly perpendicular to one of the filament orientations, which is suggestive of mass accretion along this direction. Beyond these statistical trends, we identify two types of filaments. The type-I filament is perpendicular to the magnetic field with local gravity transitioning from parallel to perpendicular to the magnetic field from the outside to the filament ridge. The type-II filament is parallel to the magnetic field and local gravity. We interpret these two types of filaments as originating from the competition between radial collapsing, driven by filament self-gravity, and the longitudinal collapsing, driven by the region's global gravity.
Submitted 23 January, 2024;
originally announced January 2024.
-
Enabling Technologies for Web 3.0: A Comprehensive Survey
Authors:
Md Arif Hassan,
Mohammad Behdad Jamshidi,
Bui Duc Manh,
Nam H. Chu,
Chi-Hieu Nguyen,
Nguyen Quang Hieu,
Cong T. Nguyen,
Dinh Thai Hoang,
Diep N. Nguyen,
Nguyen Van Huynh,
Mohammad Abu Alsheikh,
Eryk Dutkiewicz
Abstract:
Web 3.0 represents the next stage of Internet evolution, aiming to empower users with increased autonomy, efficiency, quality, security, and privacy. This evolution can potentially democratize content access by utilizing the latest developments in enabling technologies. In this paper, we conduct an in-depth survey of enabling technologies in the context of Web 3.0, such as blockchain, semantic web, 3D interactive web, Metaverse, Virtual reality/Augmented reality, Internet of Things technology, and their roles in shaping Web 3.0. We commence by providing a comprehensive background of Web 3.0, including its concept, basic architecture, potential applications, and industry adoption. Subsequently, we examine recent breakthroughs in IoT, 5G, and blockchain technologies that are pivotal to Web 3.0 development. Following that, other enabling technologies, including AI, semantic web, and 3D interactive web, are discussed. Utilizing these technologies can effectively address the critical challenges in realizing Web 3.0, such as ensuring decentralized identity, platform interoperability, data transparency, reducing latency, and enhancing the system's scalability. Finally, we highlight significant challenges associated with Web 3.0 implementation, emphasizing potential solutions and providing insights into future research directions in this field.
Submitted 29 December, 2023;
originally announced January 2024.
-
Improving Graph Convolutional Networks with Transformer Layer in social-based items recommendation
Authors:
Thi Linh Hoang,
Tuan Dung Pham,
Viet Cuong Ta
Abstract:
In this work, we have proposed an approach for improving the GCN for predicting ratings in social networks. Our model is expanded from the standard model with several layers of transformer architecture. The main focus of the paper is on the encoder architecture for node embedding in the network. Using the embedding layer from the graph-based convolution layer, the attention mechanism could rearrange the feature space to get a more efficient embedding for the downstream task. The experiments showed that our proposed architecture achieves better performance than GCN on the traditional link prediction task.
Submitted 12 January, 2024;
originally announced January 2024.
-
BOD: Blindly Optimal Data Discovery
Authors:
Thomas Hoang
Abstract:
Combining discovery and augmentation is important in the era of data usage when it comes to predicting the outcome of tasks. However, having to ask the user the utility function to discover the goal to achieve the optimal small rightful dataset is not an optimal solution. The existing solutions do not make good use of this combination, hence underutilizing the data. In this paper, we introduce a novel goal-oriented framework, called BOD: Blindly Optimal Data Discovery, that involves humans in the loop and comparing utility scores every time querying in the process without knowing the utility function. This establishes the promise of using BOD: Blindly Optimal Data Discovery for modern data science solutions.
Submitted 12 January, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation
Authors:
Weimin Wang,
Jiawei Liu,
Zhijie Lin,
Jiangqiao Yan,
Shuo Chen,
Chetwin Low,
Tuyen Hoang,
Jie Wu,
Jun Hao Liew,
Hanshu Yan,
Daquan Zhou,
Jiashi Feng
Abstract:
The growing demand for high-fidelity video generation from textual descriptions has catalyzed significant research in this field. In this work, we introduce MagicVideo-V2 that integrates the text-to-image model, video motion generator, reference image embedding module and frame interpolation module into an end-to-end video generation pipeline. Benefiting from these architecture designs, MagicVideo-V2 can generate an aesthetically pleasing, high-resolution video with remarkable fidelity and smoothness. It demonstrates superior performance over leading Text-to-Video systems such as Runway, Pika 1.0, Morph, Moon Valley and Stable Video Diffusion model via user evaluation at large scale.
Submitted 9 January, 2024;
originally announced January 2024.
-
Circumstellar interaction signatures in the low luminosity type II SN 2021gmj
Authors:
Nicolas Meza-Retamal,
Yize Dong,
K. Azalee Bostroem,
Stefano Valenti,
Lluis Galbany,
Jeniveve Pearson,
Griffin Hosseinzadeh,
Jennifer E. Andrews,
David J. Sand,
Jacob E. Jencson,
Daryl Janzen,
Michael J. Lundquist,
Emily T. Hoang,
Samuel Wyatt,
Peter J. Brown,
D. Andrew Howell,
Megan Newsome,
Estefania Padilla Gonzalez,
Craig Pellegrino,
Giacomo Terreran,
Vladimir Kouprianov,
Daichi Hiramatsu,
Saurabh W. Jha,
Nathan Smith,
Joshua Haislip
, et al. (3 additional authors not shown)
Abstract:
We present comprehensive optical observations of SN~2021gmj, a Type II supernova (SN~II) discovered within a day of explosion by the Distance Less Than 40~Mpc (DLT40) survey. Follow-up observations show that SN~2021gmj is a low-luminosity SN~II (LL~SN~II), with a peak magnitude $M_V = -15.45$ and Fe~II velocity of $\sim 1800 \ \mathrm{km} \ \mathrm{s}^{-1}$ at 50 days past explosion. Using the expanding photosphere method, we derive a distance of $17.8^{+0.6}_{-0.4}$~Mpc. From the tail of the light curve we obtain a radioactive nickel mass of $0.014 \pm 0.001$ M$_{\odot}$. The presence of circumstellar material (CSM) is suggested by the early-time light curve, early spectra, and high-velocity H$α$ in absorption. Analytical shock-cooling models of the light curve cannot reproduce the fast rise, supporting the idea that the early-time emission is partially powered by the interaction of the SN ejecta and CSM. The inferred low CSM mass of 0.025 M$_{\odot}$ in our hydrodynamic-modeling light curve analysis is also consistent with our spectroscopy. We observe a broad feature near 4600 Å, which may be high-ionization lines of C, N, or/and He~II. This feature is reproduced by radiation-hydrodynamic simulations of red supergiants with extended atmospheres. Several LL~SNe~II show similar spectral features, implying that high-density material around the progenitor may be common among them.
Submitted 22 May, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Towards Efficient Communication and Secure Federated Recommendation System via Low-rank Training
Authors:
Ngoc-Hieu Nguyen,
Tuan-Anh Nguyen,
Tuan Nguyen,
Vu Tien Hoang,
Dung D. Le,
Kok-Seng Wong
Abstract:
Federated Recommendation (FedRec) systems have emerged as a solution to safeguard users' data in response to growing regulatory concerns. However, one of the major challenges in these systems lies in the communication costs that arise from the need to transmit neural network models between user devices and a central server. Prior approaches to these challenges often lead to issues such as computational overheads, model specificity constraints, and compatibility issues with secure aggregation protocols. In response, we propose a novel framework, called Correlated Low-rank Structure (CoLR), which leverages the concept of adjusting lightweight trainable parameters while keeping most parameters frozen. Our approach substantially reduces communication overheads without introducing additional computational burdens. Critically, our framework remains fully compatible with secure aggregation protocols, including the robust use of Homomorphic Encryption. The approach resulted in a reduction of up to 93.75% in payload size, with only an approximate 8% decrease in recommendation performance across datasets. Code for reproducing our experiments can be found at https://github.com/NNHieu/CoLR-FedRec.
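To make the reported payload figure concrete, here is a small arithmetic sketch. It assumes, as an illustration only, that an update to an item-embedding matrix of shape (num_items, embed_dim) is replaced by a trainable factor of shape (num_items, rank) while the other factor stays frozen and is never transmitted; the sizes 64 and 4 are hypothetical and are not taken from the abstract.

```python
def payload_reduction(num_items, embed_dim, rank):
    """Fraction of transmitted values saved by sending a (num_items x rank) factor
    instead of the full (num_items x embed_dim) update.

    Assumption: the second factor of the low-rank product is frozen and shared,
    so it adds nothing to the per-round payload.
    """
    if not 0 < rank <= embed_dim:
        raise ValueError("rank must be in 1..embed_dim")
    full_size = num_items * embed_dim
    low_rank_size = num_items * rank
    return 1.0 - low_rank_size / full_size


if __name__ == "__main__":
    # Hypothetical sizes: any rank/embed_dim ratio of 1/16 gives the same saving.
    print(f"{payload_reduction(num_items=1000, embed_dim=64, rank=4):.2%}")
```

Under this assumption the saving depends only on the ratio rank / embed_dim, so a ratio of 1/16 corresponds to the 93.75% upper figure quoted above.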
Submitted 28 February, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Evidence of grain alignment by magnetically enhanced radiative torques from multi-wavelength dust polarization modeling of HL Tau
Authors:
Thang Nguyen Tat,
Pham Ngoc Diep,
Thiem Hoang,
Le Ngoc Tram,
Nguyen Bich Ngoc,
Nguyen Thi Phuong,
Bao Truong
Abstract:
The Atacama Large Millimeter/Submillimeter Array (ALMA) has revolutionized the field of dust polarization in protoplanetary disks across multiple wavelengths. Previous observations and empirical modeling suggested multiple mechanisms of dust polarization toward HL Tau, including grain alignment and dust scattering. However, detailed modeling of dust polarization based on grain alignment physics is not yet available. Here, using our updated POLARIS code, we perform numerical modeling of dust polarization arising from both grain alignment by the Magnetically Enhanced Radiative Torque (MRAT) mechanism and self-scattering to reproduce the HL Tau polarization observed at three wavelengths: 0.87, 1.3, and 3.1$\,$mm. Our modeling results show that the observed multi-wavelength polarization could be reproduced only when large grains contain embedded iron inclusions and those with slow internal relaxation must have wrong internal alignment (i.e., the grain's major axis parallel to its angular momentum). The abundance of iron embedded inside grains in the form of clusters is constrained to be $\gtrsim 16$%, and the number of iron atoms per cluster is $N_{\rm cl} \sim 9\times10^2$. Maximum grain sizes probed at wavelengths $λ$ = 0.87, 1.3, and 3.1$\,$mm are constrained at $\sim$ 60, 80, and 90$\,μ$m, respectively.
Submitted 13 May, 2024; v1 submitted 30 December, 2023;
originally announced January 2024.
-
Mitigating Degree Biases in Message Passing Mechanism by Utilizing Community Structures
Authors:
Van Thuy Hoang,
O-Joun Lee
Abstract:
This study utilizes community structures to address node degree biases in message-passing (MP) via learnable graph augmentations and novel graph transformers. Recent augmentation-based methods showed that MP neural networks often perform poorly on low-degree nodes, leading to degree biases due to a lack of messages reaching low-degree nodes. Despite their success, most methods use heuristic or uniform random augmentations, which are non-differentiable and may not always generate valuable edges for learning representations. In this paper, we propose Community-aware Graph Transformers, namely CGT, to learn degree-unbiased representations based on learnable augmentations and graph transformers by extracting within-community structures. We first design a learnable graph augmentation to generate more within-community edges connecting low-degree nodes through edge perturbation. Second, we propose an improved self-attention to learn underlying proximity and the roles of nodes within the community. Third, we propose a self-supervised learning task that could learn the representations to preserve the global graph structure and regularize the graph augmentations. Extensive experiments on various benchmark datasets show that CGT outperforms state-of-the-art baselines and significantly reduces node degree biases. The source code is available at https://github.com/NSLab-CUK/Community-aware-Graph-Transformer.
Submitted 27 December, 2023;
originally announced December 2023.