-
Scatterer Recognition from LiDAR Point Clouds for Environment-Embedded Vehicular Channel Modeling via Synesthesia of Machines
Authors:
Ziwei Huang,
Lu Bai,
Zengrui Han,
Xiang Cheng
Abstract:
In this paper, a novel environment-embedded vehicular channel model is proposed by scatterer recognition from light detection and ranging (LiDAR) point clouds via Synesthesia of Machines (SoM). To provide a robust data foundation, a new intelligent sensing-communication integration dataset in vehicular urban scenarios is constructed. Based on the constructed dataset, the complex SoM mechanism, i.e…
▽ More
In this paper, a novel environment-embedded vehicular channel model is proposed by scatterer recognition from light detection and ranging (LiDAR) point clouds via Synesthesia of Machines (SoM). To provide a robust data foundation, a new intelligent sensing-communication integration dataset in vehicular urban scenarios is constructed. Based on the constructed dataset, the complex SoM mechanism, i.e., map** relationship between scatterers in electromagnetic space and LiDAR point clouds in physical environment, is explored via multilayer perceptron (MLP) with electromagnetic propagation mechanism. By using LiDAR point clouds to implement scatterer recognition, channel non-stationarity and consistency are modeled in an environment-embedded manner. Using ray-tracing (RT)-based results as the ground truth, the scatterer recognition accuracy exceeds 90%. The accuracy of the proposed model is further verified by the close fit between simulation results and RT results.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks
Authors:
Gyu Seon Kim,
Yeryeong Cho,
Jaehyun Chung,
Soohyun Park,
Soyi Jung,
Zhu Han,
Joongheon Kim
Abstract:
Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance unmanned aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for prov…
▽ More
Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance unmanned aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for providing cooperatively global access sustainability and energy efficiency. However, as the number of CubeSats and HALE-UAVs, increases, the scheduling dimension of each ground station (GS) increases. As a result, each GS can fall into the curse of dimensionality, and this challenge becomes one major hurdle for efficient global access. Therefore, this paper provides a quantum multi-agent reinforcement Learning (QMARL)-based method for scheduling between GSs and CubeSats/HALE-UAVs in order to improve global access availability and energy efficiency. The main reason why the QMARL-based scheduler can be beneficial is that the algorithm facilitates a logarithmic-scale reduction in scheduling action dimensions, which is one critical feature as the number of CubeSats and HALE-UAVs expands. Additionally, individual GSs have different traffic demands depending on their locations and characteristics, thus it is essential to provide differentiated access services. The superiority of the proposed scheduler is validated through data-intensive experiments in realistic CubeSat/HALE-UAV settings.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Displaced Heavy Neutral Lepton from New Higgs Doublet
Authors:
Fa-Xin Yang,
Feng-Lan Shao,
Zhi-Long Han,
Yi **,
Honglei Li
Abstract:
Heavy neutral leptons $N$ are introduced to explain the tiny neutrino masses via the seesaw mechanism. For proper small mixing parameter $V_{\ell N}$, the heavy neutral leptons $N$ become long-lived, which leads to the displaced vertex signature at colliders. In this paper, we consider the displaced heavy neutral lepton from the neutrinophilic Higgs doublet $Φ_ν$ decay. The new Higgs doublet with…
▽ More
Heavy neutral leptons $N$ are introduced to explain the tiny neutrino masses via the seesaw mechanism. For proper small mixing parameter $V_{\ell N}$, the heavy neutral leptons $N$ become long-lived, which leads to the displaced vertex signature at colliders. In this paper, we consider the displaced heavy neutral lepton from the neutrinophilic Higgs doublet $Φ_ν$ decay. The new Higgs doublet with MeV scale VEV can naturally explain the tiny neutrino masses with TeV scale $N$. Different from current experimental searches via the $W^\pm\to \ell^\pm N$ decay, the new decays as $H^\pm\to \ell^\pm N$ are not suppressed by the small mixing parameter $V_{\ell N}$. Therefore, a larger parameter space is expected to be detected at colliders. We then investigate the promising region at the 14 TeV HL-LHC and the 3 TeV CLIC. According to our simulation, the DV signature could probe $|V_{\ell N}|^2\gtrsim10^{-19}$ with $m_N<m_{H^+}$, which covers the seesaw predicted value $|V_{\ell N}|^2\sim m_ν/m_N$. We could probe $m_{H^+}\lesssim1200$ GeV at the 14 TeV HL-LHC and $m_{H^+}\lesssim1490$ GeV at the 3 TeV CLIC.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
Decoupling Many-Body Interactions in CeO2 (111) Oxygen Vacancy Structure: Insights from Machine-Learning and Cluster Expansion
Authors:
Yu**g Zhang,
Zhong-Kang Han,
Beien Zhu,
Xiaojuan Hu,
Maria Troppenz,
Santiago Riga-monti,
Hui Li,
Claudia Draxl,
M. Verónica Ganduglia-Pirovano,
Yi Gao
Abstract:
Oxygen vacancies (VO's) are of paramount importance in influencing the properties and applications of ceria (CeO2). Yet, comprehending the distribution and nature of the VO's poses a significant challenge due to the vast number of electronic configurations and intricate many-body interactions among VO's and polarons (Ce3+'s). In this study, we employed a combination of LASSO regression in machine…
▽ More
Oxygen vacancies (VO's) are of paramount importance in influencing the properties and applications of ceria (CeO2). Yet, comprehending the distribution and nature of the VO's poses a significant challenge due to the vast number of electronic configurations and intricate many-body interactions among VO's and polarons (Ce3+'s). In this study, we employed a combination of LASSO regression in machine learning, in conjunction with a cluster expansion model and first-principles calculations to decouple the interactions among the Ce3+'s and VO's, thereby circumventing the limitations associated with sampling electronic configurations. By separating these interactions, we identified specific electronic configurations characterized by the most favorable VO-Ce3+ attractions and the least Ce3+-Ce3+/VO-VO repulsions, which are crucial in determining the stability of vacancy structures. Through more than 10^8 Metropolis Monte Carlo samplings of Vo's and Ce3+ in the near-surface of CeO2(111), we explored potential configurations within an 8x8 supercell. Our findings revealed that oxygen vacancies tend to aggregate and are most abundant in the third oxygen layer, primarily due to extensive geometric relaxation-an aspect previously overlooked. This behavior is notably dependent on the concentration of Vo. This work introduces a novel theoretical framework for unraveling the complex vacancy structures in metal oxides, with potential applications in redox and catalytic chemistry.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
AI-Empowered Multiple Access for 6G: A Survey of Spectrum Sensing, Protocol Designs, and Optimizations
Authors:
Xuelin Cao,
Bo Yang,
Kaining Wang,
Xinghua Li,
Zhiwen Yu,
Chau Yuen,
Yan Zhang,
Zhu Han
Abstract:
With the rapidly increasing number of bandwidth-intensive terminals capable of intelligent computing and communication, such as smart devices equipped with shallow neural network models, the complexity of multiple access for these intelligent terminals is increasing due to the dynamic network environment and ubiquitous connectivity in 6G systems. Traditional multiple access (MA) design and optimiz…
▽ More
With the rapidly increasing number of bandwidth-intensive terminals capable of intelligent computing and communication, such as smart devices equipped with shallow neural network models, the complexity of multiple access for these intelligent terminals is increasing due to the dynamic network environment and ubiquitous connectivity in 6G systems. Traditional multiple access (MA) design and optimization methods are gradually losing ground to artificial intelligence (AI) techniques that have proven their superiority in handling complexity. AI-empowered MA and its optimization strategies aimed at achieving high Quality-of-Service (QoS) are attracting more attention, especially in the area of latency-sensitive applications in 6G systems. In this work, we aim to: 1) present the development and comparative evaluation of AI-enabled MA; 2) provide a timely survey focusing on spectrum sensing, protocol design, and optimization for AI-empowered MA; and 3) explore the potential use cases of AI-empowered MA in the typical application scenarios within 6G systems. Specifically, we first present a unified framework of AI-empowered MA for 6G systems by incorporating various promising machine learning techniques in spectrum sensing, resource allocation, MA protocol design, and optimization. We then introduce AI-empowered MA spectrum sensing related to spectrum sharing and spectrum interference management. Next, we discuss the AI-empowered MA protocol designs and implementation methods by reviewing and comparing the state-of-the-art, and we further explore the optimization algorithms related to dynamic resource management, parameter adjustment, and access scheme switching. Finally, we discuss the current challenges, point out open issues, and outline potential future research directions in this field.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Adiabatic Mass Loss In Binary Stars. IV. Low and Intermediate Mass Helium Binary Stars
Authors:
Lifu Zhang,
Hongwei Ge,
Xuefei Chen,
Zhanwen Han
Abstract:
The unstable mass transfer situation in binary systems will asymptotically cause the adiabatic expansion of the donor star and finally lead to the common envelope phase. This process could happen in helium binary systems once the helium donor star fills its Roche-lobe. We have calculated the adiabatic mass loss model of naked helium stars with a mass range of 0.35\,$M_{\odot}$ to 10\,$M_{\odot}$,…
▽ More
The unstable mass transfer situation in binary systems will asymptotically cause the adiabatic expansion of the donor star and finally lead to the common envelope phase. This process could happen in helium binary systems once the helium donor star fills its Roche-lobe. We have calculated the adiabatic mass loss model of naked helium stars with a mass range of 0.35\,$M_{\odot}$ to 10\,$M_{\odot}$, and every mass sequence evolved from the He-ZAMS to the cooling track of white dwarf or carbon ignition. In consideration of the influence of stellar wind, massive helium stars are not considered in this paper. Comparing stellar radius with the evolution of the Roche-lobe under the assumption of conservative mass transfer, we give the critical mass ratio $q_{\textrm{crit}}=M_{\textrm{He}}/M_{\textrm{accretor}}$ as the binary stability criteria of low and intermediate-mass helium binary stars. On He-MS, the result shows $1.0<q_{\textrm{crit}}<2.6$, which is more unstable than the classical result of polytropic model $q_{\textrm{crit}}=3$. After early He-HG, the $q_{\textrm{crit}}$ quickly increases even larger than 10 (more stable compared with widely used result $q_{\textrm{crit}}=4$), which is dominated by the expansion of radiative envelope. Our result could be useful for these quick mass transfer binary systems such as AM CVns, UCXBs, and helium novae, and it could guide the binary population synthesis for the formation of special objects such as SNe Ia and GW sources.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions
Authors:
Xunzhi Wang,
Zhuowei Zhang,
Qiongyu Li,
Gaonan Chen,
Mengting Hu,
Zhiyu li,
Bitong Luo,
Hang Gao,
Zhixin Han,
Haotian Wang
Abstract:
The rapid development of large language models (LLMs) has shown promising practical results. However, their low interpretability often leads to errors in unforeseen circumstances, limiting their utility. Many works have focused on creating comprehensive evaluation systems, but previous benchmarks have primarily assessed problem-solving abilities while neglecting the response's uncertainty, which m…
▽ More
The rapid development of large language models (LLMs) has shown promising practical results. However, their low interpretability often leads to errors in unforeseen circumstances, limiting their utility. Many works have focused on creating comprehensive evaluation systems, but previous benchmarks have primarily assessed problem-solving abilities while neglecting the response's uncertainty, which may result in unreliability. Recent methods for measuring LLM reliability are resource-intensive and unable to test black-box models. To address this, we propose UBENCH, a comprehensive benchmark for evaluating LLM reliability. UBENCH includes 3,978 multiple-choice questions covering knowledge, language, understanding, and reasoning abilities. Experimental results show that UBENCH has achieved state-of-the-art performance, while its single-sampling method significantly saves computational resources compared to baseline methods that require multiple samplings. Additionally, based on UBENCH, we evaluate the reliability of 15 popular LLMs, finding GLM4 to be the most outstanding, closely followed by GPT-4. We also explore the impact of Chain-of-Thought prompts, role-playing prompts, option order, and temperature on LLM reliability, analyzing the varying effects on different LLMs.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
QoE Maximization for Multiple-UAV-Assisted Multi-Access Edge Computing: An Online Joint Optimization Approach
Authors:
Long He,
Geng Sun,
Zemin Sun,
Qingqing Wu,
Jiawen Kang,
Dusit Niyato,
Zhu Han,
Victor C. M. Leung
Abstract:
In disaster scenarios, conventional terrestrial multi-access edge computing (MEC) paradigms, which rely on fixed infrastructure, may become unavailable due to infrastructure damage. With high-probability line-of-sight (LoS) communication, flexible mobility, and low cost, unmanned aerial vehicle (UAV)-assisted MEC is emerging as a new promising paradigm to provide edge computing services for ground…
▽ More
In disaster scenarios, conventional terrestrial multi-access edge computing (MEC) paradigms, which rely on fixed infrastructure, may become unavailable due to infrastructure damage. With high-probability line-of-sight (LoS) communication, flexible mobility, and low cost, unmanned aerial vehicle (UAV)-assisted MEC is emerging as a new promising paradigm to provide edge computing services for ground user devices (UDs) in disaster-stricken areas. However, the limited battery capacity, computing resources, and spectrum resources also pose serious challenges for UAV-assisted MEC, which can potentially shorten the service time of UAVs and degrade the quality of experience (QoE) of UDs without an effective control approach. To this end, in this work, we first present a hierarchical architecture of multiple-UAV-assisted MEC networks that enables the coordinated provision of edge computing services by multiple UAVs. Then, we formulate a joint task offloading, resource allocation, and UAV trajectory planning optimization problem (JTRTOP) to maximize the QoE of UDs while considering the energy consumption constraints of UAVs. Since the problem is proven to be a future-dependent and NP-hard problem, we propose a novel online joint task offloading, resource allocation, and UAV trajectory planning approach (OJTRTA) to solve the problem. Specifically, the JTRTOP is first transformed into a per-slot real-time optimization problem (PROP) using the Lyapunov optimization framework. Then, a two-stage optimization method based on game theory and convex optimization is proposed to solve the PROP. Simulation results provide empirical evidence supporting the superior system performance of the proposed OJTRTA in comparison to alternative approaches.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Faster Convergence on Heterogeneous Federated Edge Learning: An Adaptive Sidelink-Assisted Data Multicasting Approach
Authors:
Gang Hu,
Yinglei Teng,
Nan Wang,
Zhu Han
Abstract:
Federated Edge Learning (FEEL) emerges as a pioneering distributed machine learning paradigm for the 6G Hyper-Connectivity, harnessing data from the Internet of Things (IoT) devices while upholding data privacy. However, current FEEL algorithms struggle with non-independent and non-identically distributed (non-IID) data, leading to elevated communication costs and compromised model accuracy. To ad…
▽ More
Federated Edge Learning (FEEL) emerges as a pioneering distributed machine learning paradigm for the 6G Hyper-Connectivity, harnessing data from the Internet of Things (IoT) devices while upholding data privacy. However, current FEEL algorithms struggle with non-independent and non-identically distributed (non-IID) data, leading to elevated communication costs and compromised model accuracy. To address these statistical imbalances within FEEL, we introduce a clustered data sharing framework, mitigating data heterogeneity by selectively sharing partial data from cluster heads to trusted associates through sidelink-aided multicasting. The collective communication pattern is integral to FEEL training, where both cluster formation and the efficiency of communication and computation impact training latency and accuracy simultaneously. To tackle the strictly coupled data sharing and resource optimization, we decompose the overall optimization problem into the clients clustering and effective data sharing subproblems. Specifically, a distribution-based adaptive clustering algorithm (DACA) is devised basing on three deductive cluster forming conditions, which ensures the maximum sharing yield. Meanwhile, we design a stochastic optimization based joint computed frequency and shared data volume optimization (JFVO) algorithm, determining the optimal resource allocation with an uncertain objective function. The experiments show that the proposed framework facilitates FEEL on non-IID datasets with faster convergence rate and higher model accuracy in a limited communication environment.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Interference Analysis for Coexistence of UAVs and Civil Aircrafts Based on Automatic Dependent Surveillance-Broadcast
Authors:
Yiyang Liao,
Ziye Jia,
Chao Dong,
Lei Zhang,
Qihui Wu,
Huiling Hu,
Zhu Han
Abstract:
Due to the advantages of high mobility and easy deployment, unmanned aerial vehicles (UAVs) are widely applied in both military and civilian fields. In order to strengthen the flight surveillance of UAVs and guarantee the airspace safety, UAVs can be equipped with the automatic dependent surveillance-broadcast (ADS-B) system, which periodically sends flight information to other aircrafts and groun…
▽ More
Due to the advantages of high mobility and easy deployment, unmanned aerial vehicles (UAVs) are widely applied in both military and civilian fields. In order to strengthen the flight surveillance of UAVs and guarantee the airspace safety, UAVs can be equipped with the automatic dependent surveillance-broadcast (ADS-B) system, which periodically sends flight information to other aircrafts and ground stations (GSs). However, due to the limited resource of channel capacity, UAVs equipped with ADS-B results in the interference between UAVs and civil aircrafts (CAs), which further impacts the accuracy of received information at GSs. In detail, the channel capacity is mainly affected by the density of aircrafts and the transmitting power of ADS-B. Hence, based on the three-dimensional poisson point process, this work leverages the stochastic geometry theory to build a model of the coexistence of UAVs and CAs and analyze the interference performance of ADS-B monitoring system. From simulation results, we reveal the effects of transmitting power, density, threshold and pathloss on the performance of the ADS-B monitoring system. Besides, we provide the suggested transmitting power and density for the safe coexistence of UAVs and CAs.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction
Authors:
Yinhao Bai,
Yalan Xie,
Xiaoyi Liu,
Yuhua Zhao,
Zhixin Han,
Mengting Hu,
Hang Gao,
Renhong Cheng
Abstract:
Aspect sentiment quad prediction (ASQP) aims to predict four aspect-based elements, including aspect term, opinion term, aspect category, and sentiment polarity. In practice, unseen aspects, due to distinct data distribution, impose many challenges for a trained neural model. Motivated by this, this work formulates ASQP into the few-shot scenario, which aims for fast adaptation in real application…
▽ More
Aspect sentiment quad prediction (ASQP) aims to predict four aspect-based elements, including aspect term, opinion term, aspect category, and sentiment polarity. In practice, unseen aspects, due to distinct data distribution, impose many challenges for a trained neural model. Motivated by this, this work formulates ASQP into the few-shot scenario, which aims for fast adaptation in real applications. Therefore, we first construct a few-shot ASQP dataset (FSQP) that contains richer categories and is more balanced for the few-shot study. Moreover, recent methods extract quads through a generation paradigm, which involves converting the input sentence into a templated target sequence. However, they primarily focus on the utilization of a single template or the consideration of different template orders, thereby overlooking the correlations among various templates. To tackle this issue, we further propose a Broadview Soft Prompting (BvSP) method that aggregates multiple templates with a broader view by taking into account the correlation between the different templates. Specifically, BvSP uses the pre-trained language model to select the most relevant k templates with Jensen-Shannon divergence. BvSP further introduces soft prompts to guide the pre-trained language model using the selected templates. Then, we aggregate the results of multi-templates by voting mechanism. Empirical results demonstrate that BvSP significantly outperforms the stateof-the-art methods under four few-shot settings and other public datasets. Our code and dataset are available at https://github.com/byinhao/BvSP.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Determination method of binary fractions by the integrated spectrum
Authors:
F. Zhang,
L. Li,
Z. Han,
X. Wang
Abstract:
We need to resolve the individual stars for binary fraction determinations of stellar systems. Therefore, it is not possible to obtain the binary fractions for dense or distant stellar systems. % We proposed a method to determine the binary fraction of a dense or distant stellar system. The method is to first determine the binary fraction variation for any two adjacent regions and then add up thos…
▽ More
We need to resolve the individual stars for binary fraction determinations of stellar systems. Therefore, it is not possible to obtain the binary fractions for dense or distant stellar systems. % We proposed a method to determine the binary fraction of a dense or distant stellar system. The method is to first determine the binary fraction variation for any two adjacent regions and then add up those binary fraction variations along the radial direction to obtain the binary fraction for a stellar system. Binary fraction variation is derived by using ten binary fraction-sensitive spectral absorption feature indices (SAFIs) and the binary fraction variation calibrations in terms of these SAFIs. Using this method, we first presented the binary fraction variations for twenty-one Galactic globular clusters (GCs). By comparisons, we find that they agree well with the binary fractions based on the main-sequence fiducial line method by previous studies. This verifies that the above mentioned method is feasible. Next, we presented the binary fraction variations of thirteen Galactic GCs. We gave the relationships between binary fraction and various parameters, and found that binary fraction is negatively correlated with NHB and NRR, binary fraction of some studies is not strongly correlated with NBS, and the number of GCs with large binary fraction is greater at extreme blue horizontal branch population ratio. At last, if we want to obtain more accurate binary fraction, we suggest that the spectroscopic and photometric observations are conducted at an appropriate area interval for a stellar system.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Two-Stage Resource Allocation in Reconfigurable Intelligent Surface Assisted Hybrid Networks via Multi-Player Bandits
Authors:
**gwen Tong,
Hongliang Zhang,
Liqun Fu,
Amir Leshem,
Zhu Han
Abstract:
This paper considers a resource allocation problem where several Internet-of-Things (IoT) devices send data to a base station (BS) with or without the help of the reconfigurable intelligent surface (RIS) assisted cellular network. The objective is to maximize the sum rate of all IoT devices by finding the optimal RIS and spreading factor (SF) for each device. Since these IoT devices lack prior inf…
▽ More
This paper considers a resource allocation problem where several Internet-of-Things (IoT) devices send data to a base station (BS) with or without the help of the reconfigurable intelligent surface (RIS) assisted cellular network. The objective is to maximize the sum rate of all IoT devices by finding the optimal RIS and spreading factor (SF) for each device. Since these IoT devices lack prior information on the RISs or the channel state information (CSI), a distributed resource allocation framework with low complexity and learning features is required to achieve this goal. Therefore, we model this problem as a two-stage multi-player multi-armed bandit (MPMAB) framework to learn the optimal RIS and SF sequentially. Then, we put forth an exploration and exploitation boosting (E2Boost) algorithm to solve this two-stage MPMAB problem by combining the $ε$-greedy algorithm, Thompson sampling (TS) algorithm, and non-cooperation game method. We derive an upper regret bound for the proposed algorithm, i.e., $\mathcal{O}(\log^{1+δ}_2 T)$, increasing logarithmically with the time horizon $T$. Numerical results show that the E2Boost algorithm has the best performance among the existing methods and exhibits a fast convergence rate. More importantly, the proposed algorithm is not sensitive to the number of combinations of the RISs and SFs thanks to the two-stage allocation mechanism, which can benefit high-density networks.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Multi-attribute Auction-based Resource Allocation for Twins Migration in Vehicular Metaverses: A GPT-based DRL Approach
Authors:
Yongju Tong,
Junlong Chen,
Minrui Xu,
Jiawen Kang,
Zehui Xiong,
Dusit Niyato,
Chau Yuen,
Zhu Han
Abstract:
Vehicular Metaverses are developed to enhance the modern automotive industry with an immersive and safe experience among connected vehicles and roadside infrastructures, e.g., RoadSide Units (RSUs). For seamless synchronization with virtual spaces, Vehicle Twins (VTs) are constructed as digital representations of physical entities. However, resource-intensive VTs updating and high mobility of vehi…
▽ More
Vehicular Metaverses are developed to enhance the modern automotive industry with an immersive and safe experience among connected vehicles and roadside infrastructures, e.g., RoadSide Units (RSUs). For seamless synchronization with virtual spaces, Vehicle Twins (VTs) are constructed as digital representations of physical entities. However, resource-intensive VTs updating and high mobility of vehicles require intensive computation, communication, and storage resources, especially for their migration among RSUs with limited coverages. To address these issues, we propose an attribute-aware auction-based mechanism to optimize resource allocation during VTs migration by considering both price and non-monetary attributes, e.g., location and reputation. In this mechanism, we propose a two-stage matching for vehicular users and Metaverse service providers in multi-attribute resource markets. First, the resource attributes matching algorithm obtains the resource attributes perfect matching, namely, buyers and sellers can participate in a double Dutch auction (DDA). Then, we train a DDA auctioneer using a generative pre-trained transformer (GPT)-based deep reinforcement learning (DRL) algorithm to adjust the auction clocks efficiently during the auction process. We compare the performance of social welfare and auction information exchange costs with state-of-the-art baselines under different settings. Simulation results show that our proposed GPT-based DRL auction schemes have better performance than others.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation
Authors:
Loc X. Nguyen,
Kitae Kim,
Ye Lin Tun,
Sheikh Salman Hassan,
Yan Kyaw Tun,
Zhu Han,
Choong Seon Hong
Abstract:
Semantic communication, notable for ensuring quality of service by jointly optimizing source and channel coding, effectively extracts data semantics, reduces transmission length, and mitigates channel noise. However, most studies overlook multi-user scenarios and resource availability, limiting real-world application. This paper addresses this gap by focusing on downlink communication from a base…
▽ More
Semantic communication, notable for ensuring quality of service by jointly optimizing source and channel coding, effectively extracts data semantics, reduces transmission length, and mitigates channel noise. However, most studies overlook multi-user scenarios and resource availability, limiting real-world application. This paper addresses this gap by focusing on downlink communication from a base station to multiple users with varying computing capacities. Users employ variants of Swin transformer models for source decoding and a simple architecture for channel decoding. We propose a novel training regimen, incorporating transfer learning and knowledge distillation to improve low-computing users' performance. Extensive simulations validate the proposed methods.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Correlated states controlled by tunable van Hove singularity in moiré WSe2
Authors:
Patrick Knüppel,
Jiacheng Zhu,
Yiyu Xia,
Zhengchao Xia,
Zhongdong Han,
Yihang Zeng,
Kenji Watanabe,
Takashi Taniguchi,
Jie Shan,
Kin Fai Mak
Abstract:
Twisted bilayers of transition metal dichalcogenide (TMD) semiconductors have emerged as a highly tunable system for the studies of correlated and topological states of matter, such as superconductivity, ferromagnetism, correlated insulators, and topological and Chern insulators. However, the connection between these symmetry-breaking ground states and the underlying band structure singularity in…
▽ More
Twisted bilayers of transition metal dichalcogenide (TMD) semiconductors have emerged as a highly tunable system for the studies of correlated and topological states of matter, such as superconductivity, ferromagnetism, correlated insulators, and topological and Chern insulators. However, the connection between these symmetry-breaking ground states and the underlying band structure singularity in these materials remains largely unexplored. Here, by combining exciton sensing and magnetic circular dichroism (MCD) measurements, we demonstrate how the magnetic properties and the correlated insulating states are controlled by the gate tunable van Hove singularity (VHS) in the band structure of twisted bilayer WSe2 (tWSe2). In particular, we demonstrate how the location of the VHS in the tWSe2 band structure can influence 1) the stability of Stoner ferromagnetism, 2) the valley polarizability and the stability of Chern insulators, as well as 3) the layer polarizability and the associated metal-insulator transition. The results are supported by continuum model band structure calculations. Our work highlights an important ingredient for understanding the electronic phase diagram in twisted bilayer TMDs.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities
Authors:
Yinxiao Zhuo,
Tianqi Mao,
Hao** Li,
Chen Sun,
Zhaocheng Wang,
Zhu Han,
Sheng Chen
Abstract:
Integrated sensing and communication (ISAC) has been envisioned as a critical enabling technology for the next-generation wireless communication, which can realize location/motion detection of surroundings with communication devices. This additional sensing capability leads to a substantial network quality gain and expansion of the service scenarios. As the system evolves to millimeter wave (mmWav…
▽ More
Integrated sensing and communication (ISAC) has been envisioned as a critical enabling technology for the next-generation wireless communication, which can realize location/motion detection of surroundings with communication devices. This additional sensing capability leads to a substantial network quality gain and expansion of the service scenarios. As the system evolves to millimeter wave (mmWave) and above, ISAC can realize simultaneous communications and sensing of the ultra-high throughput level and radar resolution with compact design, which relies on directional beamforming against the path loss. With the multi-beam technology, the dual functions of ISAC can be seamlessly incorporated at the beamspace level by unleashing the potential of joint beamforming. To this end, this article investigates the key technologies for multi-beam ISAC system. We begin with an overview of the current state-of-the-art solutions in multi-beam ISAC. Subsequently, a detailed analysis of the advantages associated with the multi-beam ISAC is provided. Additionally, the key technologies for transmitter, channel and receiver of the multi-beam ISAC are introduced. Finally, we explore the challenges and opportunities presented by multi-beam ISAC, offering valuable insights into this emerging field.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
Authors:
Chaofan Lin,
Zhenhua Han,
Chengruidong Zhang,
Yuqing Yang,
Fan Yang,
Chen Chen,
Lili Qiu
Abstract:
The rise of large language models (LLMs) has enabled LLM-based applications (a.k.a. AI agents or co-pilots), a new software paradigm that combines the strength of LLM and conventional software. Diverse LLM applications from different tenants could design complex workflows using multiple LLM requests to accomplish one task. However, they have to use the over-simplified request-level API provided by…
▽ More
The rise of large language models (LLMs) has enabled LLM-based applications (a.k.a. AI agents or co-pilots), a new software paradigm that combines the strength of LLM and conventional software. Diverse LLM applications from different tenants could design complex workflows using multiple LLM requests to accomplish one task. However, they have to use the over-simplified request-level API provided by today's public LLM services, losing essential application-level information. Public LLM services have to blindly optimize individual LLM requests, leading to sub-optimal end-to-end performance of LLM applications.
This paper introduces Parrot, an LLM service system that focuses on the end-to-end experience of LLM-based applications. Parrot proposes Semantic Variable, a unified abstraction to expose application-level knowledge to public LLM services. A Semantic Variable annotates an input/output variable in the prompt of a request, and creates the data pipeline when connecting multiple LLM requests, providing a natural way to program LLM applications. Exposing Semantic Variables to the public LLM service allows it to perform conventional data flow analysis to uncover the correlation across multiple LLM requests. This correlation opens a brand-new optimization space for the end-to-end performance of LLM-based applications. Extensive evaluations demonstrate that Parrot can achieve up to an order-of-magnitude improvement for popular and practical use cases of LLM applications.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
The First Photometric Analysis of Two Low Mass Ratio Contact Binary Systems In TESS Survey
Authors:
Qiyuan Cheng,
Jian** XIong,
Xu Ding,
Kaifan Ji,
Jiao Li,
Chao Liu,
Jiangdan Li,
**gxiao Luo,
Xin Lyu,
Zhanwen Han,
Xuefei Chen
Abstract:
Low mass-ratio (q) contact binary systems are progenitors of stellar mergers such as blue straggles (BS) or fast-rotating FK Com stars. In this study, we present the first light curve analysis of two newly identified low mass-ratio contact binary systems, TIC 55007847 and TIC 63597006, that are identified from TESS. Both stars are classified as A-subtype contact binaries. We obtained the precise o…
▽ More
Low mass-ratio (q) contact binary systems are progenitors of stellar mergers such as blue straggles (BS) or fast-rotating FK Com stars. In this study, we present the first light curve analysis of two newly identified low mass-ratio contact binary systems, TIC 55007847 and TIC 63597006, that are identified from TESS. Both stars are classified as A-subtype contact binaries. We obtained the precise orbit periods for the two objects by using the O-C method, i.e. P=0.6117108 d for TIC 55007847 and P=0.7008995 d for TIC 63597006, respectively, and found an obvious periodic signal in the O-C curve of TIC 63597006. We suggest that the periodic signal comes from a third body. We further use the Markov Chain Monte Carlo (MCMC) method with PHOEBE to derive the photometric solutions for the two binaries. The photometric solution for this object shows that the contribution of the third body is about 6%. Our analysis revealed that TIC 55007847 has an extremely low mass ratio of q=0.08. By calculating the ratio of spin angular momentum to the orbital angular momentum Js/Jo, we found that TIC 55007847 is very close to the instability threshold with Js/Jo = 0.31, indicating that it may merge into a single, fast-rotating star in the future. For TIC 63597006, q=0.14 and Js/Jo=0.15. This object is in a relatively stable evolutionary status at present.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Optical IRS for Visible Light Communication: Modeling, Design, and Open Issues
Authors:
Shiyuan Sun,
Fang Yang,
Weidong Mei,
Jian Song,
Zhu Han,
Rui Zhang
Abstract:
Optical intelligent reflecting surface (OIRS) offers a new and effective approach to resolving the line-of-sight blockage issue in visible light communication (VLC) by enabling redirection of light to bypass obstacles, thereby dramatically enhancing indoor VLC coverage and reliability. This article provides a comprehensive overview of OIRS for VLC, including channel modeling, design techniques, an…
▽ More
Optical intelligent reflecting surface (OIRS) offers a new and effective approach to resolving the line-of-sight blockage issue in visible light communication (VLC) by enabling redirection of light to bypass obstacles, thereby dramatically enhancing indoor VLC coverage and reliability. This article provides a comprehensive overview of OIRS for VLC, including channel modeling, design techniques, and open issues. First, we present the characteristics of OIRS-reflected channels and introduce two practical models, namely, optics model and association model, which are then compared in terms of applicable conditions, configuration methods, and channel parameters. Next, under the more practically appealing association model, we discuss the main design techniques for OIRS-aided VLC systems, including beam alignment, channel estimation, and OIRS reflection optimization. Finally, open issues are identified to stimulate future research in this area.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
PermLLM: Private Inference of Large Language Models within 3 Seconds under WAN
Authors:
Fei Zheng,
Chaochao Chen,
Zhongxuan Han,
Xiaolin Zheng
Abstract:
The emergence of ChatGPT marks the arrival of the large language model (LLM) era. While LLMs demonstrate their power in a variety of fields, they also raise serious privacy concerns as the users' queries are sent to the model provider. On the other side, deploying the LLM on the user's device will also leak all the model data. Existing methods based on secure multiparty computation (MPC) managed t…
▽ More
The emergence of ChatGPT marks the arrival of the large language model (LLM) era. While LLMs demonstrate their power in a variety of fields, they also raise serious privacy concerns as the users' queries are sent to the model provider. On the other side, deploying the LLM on the user's device will also leak all the model data. Existing methods based on secure multiparty computation (MPC) managed to protect both the privacy of the model parameters and user queries. However, they require gigabytes of data transfer and several minutes to generate just one token, making them impractical for most real-world applications. To improve the efficiency of private LLM inference, we propose PermLLM, which accelerates the evaluation of non-linear functions using secure random permutation. Along with the optimized secret sharing protocols and homomorphic encryption, PermLLM achieves two-party private inference of the ChatGLM-6B model at the speed of around 3s/token, under a realistic network setting (10ms RTT and 1Gbps bandwidth), which is magnitudes faster than existing MPC solutions.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Confidence-aware multi-modality learning for eye disease screening
Authors:
Ke Zou,
Tian Lin,
Zongbo Han,
Meng Wang,
Xuedong Yuan,
Haoyu Chen,
Changqing Zhang,
Xiao**g Shen,
Huazhu Fu
Abstract:
Multi-modal ophthalmic image classification plays a key role in diagnosing eye diseases, as it integrates information from different sources to complement their respective performances. However, recent improvements have mainly focused on accuracy, often neglecting the importance of confidence and robustness in predictions for diverse modalities. In this study, we propose a novel multi-modality evi…
▽ More
Multi-modal ophthalmic image classification plays a key role in diagnosing eye diseases, as it integrates information from different sources to complement their respective performances. However, recent improvements have mainly focused on accuracy, often neglecting the importance of confidence and robustness in predictions for diverse modalities. In this study, we propose a novel multi-modality evidential fusion pipeline for eye disease screening. It provides a measure of confidence for each modality and elegantly integrates the multi-modality information using a multi-distribution fusion perspective. Specifically, our method first utilizes normal inverse gamma prior distributions over pre-trained models to learn both aleatoric and epistemic uncertainty for uni-modality. Then, the normal inverse gamma distribution is analyzed as the Student's t distribution. Furthermore, within a confidence-aware fusion framework, we propose a mixture of Student's t distributions to effectively integrate different modalities, imparting the model with heavy-tailed properties and enhancing its robustness and reliability. More importantly, the confidence-aware multi-modality ranking regularization term induces the model to more reasonably rank the noisy single-modal and fused-modal confidence, leading to improved reliability and accuracy. Experimental results on both public and internal datasets demonstrate that our model excels in robustness, particularly in challenging scenarios involving Gaussian noise and modality missing conditions. Moreover, our model exhibits strong generalization capabilities to out-of-distribution data, underscoring its potential as a promising solution for multimodal eye disease screening.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
SkinCAP: A Multi-modal Dermatology Dataset Annotated with Rich Medical Captions
Authors:
Juexiao Zhou,
Liyuan Sun,
Yan Xu,
Wenbin Liu,
Shawn Afvari,
Zhongyi Han,
Jiaoyan Song,
Yongzhi Ji,
Xiaonan He,
Xin Gao
Abstract:
With the widespread application of artificial intelligence (AI), particularly deep learning (DL) and vision-based large language models (VLLMs), in skin disease diagnosis, the need for interpretability becomes crucial. However, existing dermatology datasets are limited in their inclusion of concept-level meta-labels, and none offer rich medical descriptions in natural language. This deficiency imp…
▽ More
With the widespread application of artificial intelligence (AI), particularly deep learning (DL) and vision-based large language models (VLLMs), in skin disease diagnosis, the need for interpretability becomes crucial. However, existing dermatology datasets are limited in their inclusion of concept-level meta-labels, and none offer rich medical descriptions in natural language. This deficiency impedes the advancement of LLM-based methods in dermatological diagnosis. To address this gap and provide a meticulously annotated dermatology dataset with comprehensive natural language descriptions, we introduce SkinCAP: a multi-modal dermatology dataset annotated with rich medical captions. SkinCAP comprises 4,000 images sourced from the Fitzpatrick 17k skin disease dataset and the Diverse Dermatology Images dataset, annotated by board-certified dermatologists to provide extensive medical descriptions and captions. Notably, SkinCAP represents the world's first such dataset and is publicly available at https://huggingface.co/datasets/joshuachou/SkinCAP.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Emergent Gauge Fields in Band Insulators
Authors:
Zhaoyu Han,
Steven Kivelson
Abstract:
By explicit microscopic construction involving a map** to a quantum vertex model subject to the `ice rule,' we show that an electronically `trivial' band insulator with suitable vibrational (phonon) degrees of freedom can host a ``resonating valence-bond'' state - a quantum phase with emergent gauge fields. This novel type of band insulator is identifiable by the existence of emergent gapless `p…
▽ More
By explicit microscopic construction involving a map** to a quantum vertex model subject to the `ice rule,' we show that an electronically `trivial' band insulator with suitable vibrational (phonon) degrees of freedom can host a ``resonating valence-bond'' state - a quantum phase with emergent gauge fields. This novel type of band insulator is identifiable by the existence of emergent gapless `photon' modes and deconfined excitations, the latter of which carry non-quantized mobile charges. We suggest that such phases may exist in the quantum regimes of various nearly ferroelectric materials.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
A new parametrization of Hubble function and Hubble tension
Authors:
Tong-Yu He,
Jia-Jun Yin,
Zhen-Yu Wang,
Zhan-Wen Han,
Rong-Jia Yang
Abstract:
We present a new Hubble parameterization method and employ observational data from Hubble, Pantheon, and Baryon Acoustic Oscillations to constrain model parameters. The proposed method is thoroughly validated against these datasets, demonstrating a robust fit to the observational data. The obtained best-fit values are $H_0 = 67.5^{+1.3}_{-1.6}$ $\text{km s}^{-1} \text{Mpc}^{-1}$,…
▽ More
We present a new Hubble parameterization method and employ observational data from Hubble, Pantheon, and Baryon Acoustic Oscillations to constrain model parameters. The proposed method is thoroughly validated against these datasets, demonstrating a robust fit to the observational data. The obtained best-fit values are $H_0 = 67.5^{+1.3}_{-1.6}$ $\text{km s}^{-1} \text{Mpc}^{-1}$, $Ω_{\rm{m0}} = 0.2764\pm{0.0094}$, and $α= 0.33\pm{0.22}$, consistent with the Planck 2018 results, highlighting the existence of Hubble tension.
△ Less
Submitted 16 June, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
FUGNN: Harmonizing Fairness and Utility in Graph Neural Networks
Authors:
Renqiang Luo,
Huafei Huang,
Shuo Yu,
Zhuoyang Han,
Estrid He,
Xiuzhen Zhang,
Feng Xia
Abstract:
Fairness-aware Graph Neural Networks (GNNs) often face a challenging trade-off, where prioritizing fairness may require compromising utility. In this work, we re-examine fairness through the lens of spectral graph theory, aiming to reconcile fairness and utility within the framework of spectral graph learning. We explore the correlation between sensitive features and spectrum in GNNs, using theore…
▽ More
Fairness-aware Graph Neural Networks (GNNs) often face a challenging trade-off, where prioritizing fairness may require compromising utility. In this work, we re-examine fairness through the lens of spectral graph theory, aiming to reconcile fairness and utility within the framework of spectral graph learning. We explore the correlation between sensitive features and spectrum in GNNs, using theoretical analysis to delineate the similarity between original sensitive features and those after convolution under different spectrum. Our analysis reveals a reduction in the impact of similarity when the eigenvectors associated with the largest magnitude eigenvalue exhibit directional similarity. Based on these theoretical insights, we propose FUGNN, a novel spectral graph learning approach that harmonizes the conflict between fairness and utility. FUGNN ensures algorithmic fairness and utility by truncating the spectrum and optimizing eigenvector distribution during the encoding process. The fairness-aware eigenvector selection reduces the impact of convolution on sensitive features while concurrently minimizing the sacrifice of utility. FUGNN further optimizes the distribution of eigenvectors through a transformer architecture. By incorporating the optimized spectrum into the graph convolution network, FUGNN effectively learns node representations. Experiments on six real-world datasets demonstrate the superiority of FUGNN over baseline methods. The codes are available at https://github.com/yushuowiki/FUGNN.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Realization of 2/3-layer transition metal dichalcogenides
Authors:
Ya-Xin Zhao,
Zi-Yi Han,
Ya-Ning Ren,
Ruo-Han Zhang,
Xiao-Feng Zhou,
Yu Zhang,
Lin He
Abstract:
Layered van der Waals transition metal dichalcogenides (TMDCs), generally composed of three atomic X-M-X planes in each layer (M = transition metal, X = chalcogen), provide versatile platforms for exploring diverse quantum phenomena. In each MX2 layer, the M-X bonds are predominantly covalent in nature, as a result, the cleavage of TMDC crystals always occurring between the layers. Here we report…
▽ More
Layered van der Waals transition metal dichalcogenides (TMDCs), generally composed of three atomic X-M-X planes in each layer (M = transition metal, X = chalcogen), provide versatile platforms for exploring diverse quantum phenomena. In each MX2 layer, the M-X bonds are predominantly covalent in nature, as a result, the cleavage of TMDC crystals always occurring between the layers. Here we report the controllable realization of fractional-layer WTe2 via an in-situ scanning tunnelling microscopy (STM) tip manipulation technique. By applying STM tip pulses, hundreds of the topmost Te atoms are removed to form a nanoscale monolayer Te pit in the 1T'-WTe2, thus realizing a brand-new 2/3-layer WTe2. Such a unique configuration undergoes a spontaneous atomic reconstruction, yielding an energy-dependent unidirectional charge-density-wave state with the wavevector and geometry quite distinct from that of pristine 1T'-WTe2. Our results expand the conventional understanding of the TMDCs and are expected to stimulate the research on extraordinary structures and properties based on fractional-layer TMDCs.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Unconventional superconductivity in twisted bilayer WSe2
Authors:
Yiyu Xia,
Zhongdong Han,
Kenji Watanabe,
Takashi Taniguchi,
Jie Shan,
Kin Fai Mak
Abstract:
Moiré materials have enabled the realization of flat electron bands and quantum phases that are driven by strong correlations associated with flat bands. Superconductivity has been observed, but solely, in graphene moiré materials. The absence of robust superconductivity in moiré materials beyond graphene, such as semiconductor moiré materials, has remained a mystery and challenged our current und…
▽ More
Moiré materials have enabled the realization of flat electron bands and quantum phases that are driven by strong correlations associated with flat bands. Superconductivity has been observed, but solely, in graphene moiré materials. The absence of robust superconductivity in moiré materials beyond graphene, such as semiconductor moiré materials, has remained a mystery and challenged our current understanding of superconductivity in flat bands. Here, we report the observation of robust superconductivity in 3.65-degree twisted bilayer WSe2 which hosts a honeycomb moiré lattice. Superconductivity emerges at half-band filling and under small sublattice potential differences, where the moiré band is a flat Chern band. The optimal superconducting transition temperature is about 220 mK and constitutes 2% of the effective Fermi temperature; the latter is comparable to the value in high-temperature cuprate superconductors and suggests strong pairing. The superconductor borders on two distinct metals below and above half-band filling; it undergoes a continuous transition to a correlated insulator by tuning the sublattice potential difference. The observed superconductivity on the verge of Coulomb-induced charge localization suggests roots in strong electron correlations.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
A rapid approach to urban traffic noise map** with a generative adversarial network
Authors:
Xinhao Yang,
Zhen Han,
Xiaodong Lu,
Yuan Zhang
Abstract:
With rapid urbanisation and the accompanying increase in traffic density, traffic noise has become a major concern in urban planning. However, traditional grid noise map** methods have limitations in terms of time consumption, software costs, and a lack of parameter integration interfaces. These limitations hinder their ability to meet the need for iterative updates and rapid performance feedbac…
▽ More
With rapid urbanisation and the accompanying increase in traffic density, traffic noise has become a major concern in urban planning. However, traditional grid noise map** methods have limitations in terms of time consumption, software costs, and a lack of parameter integration interfaces. These limitations hinder their ability to meet the need for iterative updates and rapid performance feedback in the early design stages of street-scale urban planning. Herein, we developed a rapid urban traffic noise map** technique that leverages generative adversarial networks (GANs) as a surrogate model. This approach enables the rapid assessment of urban traffic noise distribution by using urban elements such as roads and buildings as the input. The mean values for the mean squared error (MSE) and structural similarity index (SSIM) are 0.0949 and 0.8528, respectively, for the validation dataset. Hence, our prediction accuracy is on par with that of conventional prediction software. Furthermore, the trained model is integrated into Grasshopper as a tool, facilitating the rapid generation of traffic noise maps. This integration allows urban designers and planners, even those without expertise in acoustics, to easily anticipate changes in acoustics impacts caused by design.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Can We Treat Noisy Labels as Accurate?
Authors:
Yuxiang Zheng,
Zhongyi Han,
Yilong Yin,
Xin Gao,
Tongliang Liu
Abstract:
Noisy labels significantly hinder the accuracy and generalization of machine learning models, particularly due to ambiguous instance features. Traditional techniques that attempt to correct noisy labels directly, such as those using transition matrices, often fail to address the inherent complexities of the problem sufficiently. In this paper, we introduce EchoAlign, a transformative paradigm shif…
▽ More
Noisy labels significantly hinder the accuracy and generalization of machine learning models, particularly due to ambiguous instance features. Traditional techniques that attempt to correct noisy labels directly, such as those using transition matrices, often fail to address the inherent complexities of the problem sufficiently. In this paper, we introduce EchoAlign, a transformative paradigm shift in learning from noisy labels. Instead of focusing on label correction, EchoAlign treats noisy labels ($\tilde{Y}$) as accurate and modifies corresponding instance features ($X$) to achieve better alignment with $\tilde{Y}$. EchoAlign's core components are (1) EchoMod: Employing controllable generative models, EchoMod precisely modifies instances while maintaining their intrinsic characteristics and ensuring alignment with the noisy labels. (2) EchoSelect: Instance modification inevitably introduces distribution shifts between training and test sets. EchoSelect maintains a significant portion of clean original instances to mitigate these shifts. It leverages the distinct feature similarity distributions between original and modified instances as a robust tool for accurate sample selection. This integrated approach yields remarkable results. In environments with 30% instance-dependent noise, even at 99% selection accuracy, EchoSelect retains nearly twice the number of samples compared to the previous best method. Notably, on three datasets, EchoAlign surpasses previous state-of-the-art techniques with a substantial improvement.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Hybrid Digital-Analog Semantic Communications
Authors:
Huiqiang Xie,
Zhi** Qin,
Zhu Han,
Khaled B. Letaief
Abstract:
Digital and analog semantic communications (SemCom) face inherent limitations such as data security concerns in analog SemCom, as well as leveling-off and cliff-edge effects in digital SemCom. In order to overcome these challenges, we propose a novel SemCom framework and a corresponding system called HDA-DeepSC, which leverages a hybrid digital-analog approach for multimedia transmission. This is…
▽ More
Digital and analog semantic communications (SemCom) face inherent limitations such as data security concerns in analog SemCom, as well as leveling-off and cliff-edge effects in digital SemCom. In order to overcome these challenges, we propose a novel SemCom framework and a corresponding system called HDA-DeepSC, which leverages a hybrid digital-analog approach for multimedia transmission. This is achieved through the introduction of digital-analog allocation and fusion modules. To strike a balance between data rate and distortion, we design new loss functions that take into account long-distance dependencies in the semantic distortion constraint, essential information recovery in the channel distortion constraint, and optimal bit stream generation in the rate constraint. Additionally, we propose denoising diffusion-based signal detection techniques, which involve carefully designed variance schedules and sampling algorithms to refine transmitted signals. Through extensive numerical experiments, we will demonstrate that HDA-DeepSC exhibits robustness to channel variations and is capable of supporting various communication scenarios. Our proposed framework outperforms existing benchmarks in terms of peak signal-to-noise ratio and multi-scale structural similarity, showcasing its superiority in semantic communication quality.
△ Less
Submitted 27 May, 2024; v1 submitted 21 May, 2024;
originally announced May 2024.
-
Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning
Authors:
Guanglin Zhou,
Zhongyi Han,
Shiming Chen,
Biwei Huang,
Liming Zhu,
Salman Khan,
Xin Gao,
Lina Yao
Abstract:
Recent studies indicate that large multimodal models (LMMs) are highly robust against natural distribution shifts, often surpassing previous baselines. Despite this, domain-specific adaptation is still necessary, particularly in specialized areas like healthcare. Due to the impracticality of fine-tuning LMMs given their vast parameter space, this work investigates in-context learning (ICL) as an e…
▽ More
Recent studies indicate that large multimodal models (LMMs) are highly robust against natural distribution shifts, often surpassing previous baselines. Despite this, domain-specific adaptation is still necessary, particularly in specialized areas like healthcare. Due to the impracticality of fine-tuning LMMs given their vast parameter space, this work investigates in-context learning (ICL) as an effective alternative for enhancing LMMs' adaptability. We find that the success of ICL heavily relies on the choice of demonstration, mirroring challenges seen in large language models but introducing unique complexities for LMMs facing distribution shifts. Our study addresses this by evaluating an unsupervised ICL method, TopKNearestPR, which selects in-context examples through a nearest example search based on feature similarity. We uncover that its effectiveness is limited by the deficiencies of pre-trained vision encoders under distribution shift scenarios. To address these challenges, we propose InvariantSelectPR, a novel method leveraging Class-conditioned Contrastive Invariance (CCI) for more robust demonstration selection. Specifically, CCI enhances pre-trained vision encoders by improving their discriminative capabilities across different classes and ensuring invariance to domain-specific variations. This enhancement allows the encoders to effectively identify and retrieve the most informative examples, which are then used to guide LMMs in adapting to new query samples under varying distributions. Our experiments show that InvariantSelectPR substantially improves the adaptability of LMMs, achieving significant performance gains on benchmark datasets, with a 34.2%$\uparrow$ accuracy increase in 7-shot on Camelyon17 and 16.9%$\uparrow$ increase in 7-shot on HAM10000 compared to the baseline zero-shot performance.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Rota-Baxter groups with weight zero and integration on topological groups
Authors:
Xing Gao,
Li Guo,
Zongjian Han
Abstract:
Rota-Baxter groups with weights $\pm 1$ have attracted quite much attention since their recent introduction, thanks to their connections with Rota-Baxter Lie algebras, factorizations of Lie groups, post- and pre-Lie algebras, braces and set-theoretic solutions of the Yang-Baxter equation. Despite their expected importance from integrals on groups to pre-groups and Yang-Baxter equations, Rota-Baxte…
▽ More
Rota-Baxter groups with weights $\pm 1$ have attracted quite much attention since their recent introduction, thanks to their connections with Rota-Baxter Lie algebras, factorizations of Lie groups, post- and pre-Lie algebras, braces and set-theoretic solutions of the Yang-Baxter equation. Despite their expected importance from integrals on groups to pre-groups and Yang-Baxter equations, Rota-Baxter groups with weight zero and other weights has been a challenge to define and their search has been the focus of several attempts.
By composing an operator with a section map as a perturbation device, we first generalize the notion of a Rota-Baxter operator on a group from the existing case of weight $\pm 1$ to the case where the weight is given by a pair of maps and then a sequence limit of such pairs. From there, two candidates of Rota-Baxter operators with weight zero are given. One of them is the Rota-Baxter operator with limit-weight zero detailed here, with the other candidate introduced in a companion work. This operator is shown to have its tangent map the Rota-Baxter operator with weight zero on Lie algebras. It also gives concrete applications in integrals of maps with values in a class of topological groups called $\RR$-groups, satisfying a multiplicative version of the integration-by-parts formula.
In parallel, differential groups in this framework is also developed and a group formulation of the First Fundamental Theorem of Calculus is obtained.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
Cooperative Cognitive Dynamic System in UAV Swarms: Reconfigurable Mechanism and Framework
Authors:
Ziye Jia,
Jiahao You,
Chao Dong,
Qihui Wu,
Fuhui Zhou,
Dusit Niyato,
Zhu Han
Abstract:
As the demands for immediate and effective responses increase in both civilian and military domains, the unmanned aerial vehicle (UAV) swarms emerge as effective solutions, in which multiple cooperative UAVs can work together to achieve specific goals. However, how to manage such complex systems to ensure real-time adaptability lack sufficient researches. Hence, in this paper, we propose the coope…
▽ More
As the demands for immediate and effective responses increase in both civilian and military domains, the unmanned aerial vehicle (UAV) swarms emerge as effective solutions, in which multiple cooperative UAVs can work together to achieve specific goals. However, how to manage such complex systems to ensure real-time adaptability lack sufficient researches. Hence, in this paper, we propose the cooperative cognitive dynamic system (CCDS), to optimize the management for UAV swarms. CCDS leverages a hierarchical and cooperative control structure that enables real-time data processing and decision. Accordingly, CCDS optimizes the UAV swarm management via dynamic reconfigurability and adaptive intelligent optimization. In addition, CCDS can be integrated with the biomimetic mechanism to efficiently allocate tasks for UAV swarms. Further, the distributed coordination of CCDS ensures reliable and resilient control, thus enhancing the adaptability and robustness. Finally, the potential challenges and future directions are analyzed, to provide insights into managing UAV swarms in dynamic heterogeneous networking.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
Generative AI for Secure and Privacy-Preserving Mobile Crowdsensing
Authors:
Yaoqi Yang,
Bangning Zhang,
Daoxing Guo,
Hongyang Du,
Zehui Xiong,
Dusit Niyato,
Zhu Han
Abstract:
Recently, generative AI has attracted much attention from both academic and industrial fields, which has shown its potential, especially in the data generation and synthesis aspects. Simultaneously, secure and privacy-preserving mobile crowdsensing (SPPMCS) has been widely applied in data collection/ acquirement due to an advantage on low deployment cost, flexible implementation, and high adaptabi…
▽ More
Recently, generative AI has attracted much attention from both academic and industrial fields, which has shown its potential, especially in the data generation and synthesis aspects. Simultaneously, secure and privacy-preserving mobile crowdsensing (SPPMCS) has been widely applied in data collection/ acquirement due to an advantage on low deployment cost, flexible implementation, and high adaptability. Since generative AI can generate new synthetic data to replace the original data to be analyzed and processed, it can lower data attacks and privacy leakage risks for the original data. Therefore, integrating generative AI into SPPMCS is feasible and significant. Moreover, this paper investigates an integration of generative AI in SPPMCS, where we present potential research focuses, solutions, and case studies. Specifically, we firstly review the preliminaries for generative AI and SPPMCS, where their integration potential is presented. Then, we discuss research issues and solutions for generative AI-enabled SPPMCS, including security defense of malicious data injection, illegal authorization, malicious spectrum manipulation at the physical layer, and privacy protection on sensing data content, sensing terminals' identification and location. Next, we propose a framework for sensing data content protection with generative AI, and simulations results have clearly demonstrated the effectiveness of the proposed framework. Finally, we present major research directions for generative AI-enabled SPPMCS.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Industrial Metaverse: Enabling Technologies, Open Problems, and Future Trends
Authors:
Shiying Zhang,
Jun Li,
Long Shi,
Ming Ding,
Dinh C. Nguyen,
Wen Chen,
Zhu Han
Abstract:
As an emerging technology that enables seamless integration between the physical and virtual worlds, the Metaverse has great potential to be deployed in the industrial production field with the development of extended reality (XR) and next-generation communication networks. This deployment, called the Industrial Metaverse, is used for product design, production operations, industrial quality inspe…
▽ More
As an emerging technology that enables seamless integration between the physical and virtual worlds, the Metaverse has great potential to be deployed in the industrial production field with the development of extended reality (XR) and next-generation communication networks. This deployment, called the Industrial Metaverse, is used for product design, production operations, industrial quality inspection, and product testing. However, there lacks of in-depth understanding of the enabling technologies associated with the Industrial Metaverse. This encompasses both the precise industrial scenarios targeted by each technology and the potential migration of technologies developed in other domains to the industrial sector. Driven by this issue, in this article, we conduct a comprehensive survey of the state-of-the-art literature on the Industrial Metaverse. Specifically, we first analyze the advantages of the Metaverse for industrial production. Then, we review a collection of key enabling technologies of the Industrial Metaverse, including blockchain (BC), digital twin (DT), 6G, XR, and artificial intelligence (AI), and analyze how these technologies can support different aspects of industrial production. Subsequently, we present numerous formidable challenges encountered within the Industrial Metaverse, including confidentiality and security concerns, resource limitations, and interoperability constraints. Furthermore, we investigate the extant solutions devised to address them. Finally, we briefly outline several open issues and future research directions of the Industrial Metaverse.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Exploring Equilibrium Strategies in Network Games with Generative AI
Authors:
Yaoqi Yang,
Hongyang Du,
Geng Sun,
Zehui Xiong,
Dusit Niyato,
Zhu Han
Abstract:
Game theory offers a powerful framework for analyzing strategic interactions among decision-makers, providing tools to model, analyze, and predict their behavior. However, implementing game theory can be challenging due to difficulties in deriving solutions, understanding interactions, and ensuring optimal performance. Traditional non-AI and discriminative AI approaches have made valuable contribu…
▽ More
Game theory offers a powerful framework for analyzing strategic interactions among decision-makers, providing tools to model, analyze, and predict their behavior. However, implementing game theory can be challenging due to difficulties in deriving solutions, understanding interactions, and ensuring optimal performance. Traditional non-AI and discriminative AI approaches have made valuable contributions but struggle with limitations in handling large-scale games and dynamic scenarios. In this context, generative AI emerges as a promising solution because of its superior data analysis and generation capabilities. This paper comprehensively summarizes the challenges, solutions, and outlooks of combining generative AI with game theory. We start with reviewing the limitations of traditional non-AI and discriminative AI approaches in employing game theory, and then highlight the necessity and advantages of integrating generative AI. Next, we explore the applications of generative AI in various stages of the game theory lifecycle, including model formulation, solution derivation, and strategy improvement. Additionally, from game theory viewpoint, we propose a generative AI-enabled framework for optimizing machine learning model performance against false data injection attacks, supported by a case study to demonstrate its effectiveness. Finally, we outline future research directions for generative AI-enabled game theory, paving the way for its further advancements and development.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Learning to Plan Maneuverable and Agile Flight Trajectory with Optimization Embedded Networks
Authors:
Zhichao Han,
Long Xu,
Fei Gao
Abstract:
In recent times, an increasing number of researchers have been devoted to utilizing deep neural networks for end-to-end flight navigation. This approach has gained traction due to its ability to bridge the gap between perception and planning that exists in traditional methods, thereby eliminating delays between modules. However, the practice of replacing original modules with neural networks in a…
▽ More
In recent times, an increasing number of researchers have been devoted to utilizing deep neural networks for end-to-end flight navigation. This approach has gained traction due to its ability to bridge the gap between perception and planning that exists in traditional methods, thereby eliminating delays between modules. However, the practice of replacing original modules with neural networks in a black-box manner diminishes the overall system's robustness and stability. It lacks principled explanations and often fails to consistently generate high-quality motion trajectories. Furthermore, such methods often struggle to rigorously account for the robot's kinematic constraints, resulting in the generation of trajectories that cannot be executed satisfactorily. In this work, we combine the advantages of traditional methods and neural networks by proposing an optimization-embedded neural network. This network can learn high-quality trajectories directly from visual inputs without the need of map**, while ensuring dynamic feasibility. Here, the deep neural network is employed to directly extract environment safety regions from depth images. Subsequently, we employ a model-based approach to represent these regions as safety constraints in trajectory optimization. Leveraging the availability of highly efficient optimization algorithms, our method robustly converges to feasible and optimal solutions that satisfy various user-defined constraints. Moreover, we differentiate the optimization process, allowing it to be trained as a layer within the neural network. This approach facilitates the direct interaction between perception and planning, enabling the network to focus more on the spatial regions where optimal solutions exist. As a result, it further enhances the quality and stability of the generated trajectories.
△ Less
Submitted 7 June, 2024; v1 submitted 13 May, 2024;
originally announced May 2024.
-
A Performance Analysis Modeling Framework for Extended Reality Applications in Edge-Assisted Wireless Networks
Authors:
Anik Mallik,
Jiang Xie,
Zhu Han
Abstract:
Extended reality (XR) is at the center of attraction in the research community due to the emergence of augmented, mixed, and virtual reality applications. The performance of such applications needs to be uptight to maintain the requirements of latency, energy consumption, and freshness of data. Therefore, a comprehensive performance analysis model is required to assess the effectiveness of an XR a…
▽ More
Extended reality (XR) is at the center of attraction in the research community due to the emergence of augmented, mixed, and virtual reality applications. The performance of such applications needs to be uptight to maintain the requirements of latency, energy consumption, and freshness of data. Therefore, a comprehensive performance analysis model is required to assess the effectiveness of an XR application but is challenging to design due to the dependence of the performance metrics on several difficult-to-model parameters, such as computing resources and hardware utilization of XR and edge devices, which are controlled by both their operating systems and the application itself. Moreover, the heterogeneity in devices and wireless access networks brings additional challenges in modeling. In this paper, we propose a novel modeling framework for performance analysis of XR applications considering edge-assisted wireless networks and validate the model with experimental data collected from testbeds designed specifically for XR applications. In addition, we present the challenges associated with performance analysis modeling and present methods to overcome them in detail. Finally, the performance evaluation shows that the proposed analytical model can analyze XR applications' performance with high accuracy compared to the state-of-the-art analytical models.
△ Less
Submitted 11 May, 2024;
originally announced May 2024.
-
Intelligent Duty Cycling Management and Wake-up for Energy Harvesting IoT Networks with Correlated Activity
Authors:
David E. Ruíz-Guirola,
Onel L. A. López,
Samuel Montejo-Sánchez,
Israel Leyva Mayorga,
Zhu Han,
Petar Popovski
Abstract:
This paper presents an approach for energy-neutral Internet of Things (IoT) scenarios where the IoT devices (IoTDs) rely entirely on their energy harvesting capabilities to sustain operation. We use a Markov chain to represent the operation and transmission states of the IoTDs, a modulated Poisson process to model their energy harvesting process, and a discrete-time Markov chain to model their bat…
▽ More
This paper presents an approach for energy-neutral Internet of Things (IoT) scenarios where the IoT devices (IoTDs) rely entirely on their energy harvesting capabilities to sustain operation. We use a Markov chain to represent the operation and transmission states of the IoTDs, a modulated Poisson process to model their energy harvesting process, and a discrete-time Markov chain to model their battery state. The aim is to efficiently manage the duty cycling of the IoTDs, so as to prolong their battery life and reduce instances of low-energy availability. We propose a duty-cycling management based on K- nearest neighbors, aiming to strike a trade-off between energy efficiency and detection accuracy. This is done by incorporating spatial and temporal correlations among IoTDs' activity, as well as their energy harvesting capabilities. We also allow the base station to wake up specific IoTDs if more information about an event is needed upon initial detection. Our proposed scheme shows significant improvements in energy savings and performance, with up to 11 times lower misdetection probability and 50\% lower energy consumption for high-density scenarios compared to a random duty cycling benchmark.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Semi-Autonomous Laparoscopic Robot Docking with Learned Hand-Eye Information Fusion
Authors:
Huanyu Tian,
Martin Huber,
Christopher E. Mower,
Zhe Han,
Changsheng Li,
Xingguang Duan,
Christos Bergeles
Abstract:
In this study, we introduce a novel shared-control system for key-hole docking operations, combining a commercial camera with occlusion-robust pose estimation and a hand-eye information fusion technique. This system is used to enhance docking precision and force-compliance safety. To train a hand-eye information fusion network model, we generated a self-supervised dataset using this docking system…
▽ More
In this study, we introduce a novel shared-control system for key-hole docking operations, combining a commercial camera with occlusion-robust pose estimation and a hand-eye information fusion technique. This system is used to enhance docking precision and force-compliance safety. To train a hand-eye information fusion network model, we generated a self-supervised dataset using this docking system. After training, our pose estimation method showed improved accuracy compared to traditional methods, including observation-only approaches, hand-eye calibration, and conventional state estimation filters. In real-world phantom experiments, our approach demonstrated its effectiveness with reduced position dispersion (1.23\pm 0.81 mm vs. 2.47 \pm 1.22 mm) and force dispersion (0.78\pm 0.57 N vs. 1.15 \pm 0.97 N) compared to the control group. These advancements in semi-autonomy co-manipulation scenarios enhance interaction and stability. The study presents an anti-interference, steady, and precision solution with potential applications extending beyond laparoscopic surgery to other minimally invasive procedures.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
DenserRadar: A 4D millimeter-wave radar point cloud detector based on dense LiDAR point clouds
Authors:
Zeyu Han,
Junkai Jiang,
Xiaokang Ding,
Qingwen Meng,
Shaobing Xu,
Lei He,
Jianqiang Wang
Abstract:
The 4D millimeter-wave (mmWave) radar, with its robustness in extreme environments, extensive detection range, and capabilities for measuring velocity and elevation, has demonstrated significant potential for enhancing the perception abilities of autonomous driving systems in corner-case scenarios. Nevertheless, the inherent sparsity and noise of 4D mmWave radar point clouds restrict its further d…
▽ More
The 4D millimeter-wave (mmWave) radar, with its robustness in extreme environments, extensive detection range, and capabilities for measuring velocity and elevation, has demonstrated significant potential for enhancing the perception abilities of autonomous driving systems in corner-case scenarios. Nevertheless, the inherent sparsity and noise of 4D mmWave radar point clouds restrict its further development and practical application. In this paper, we introduce a novel 4D mmWave radar point cloud detector, which leverages high-resolution dense LiDAR point clouds. Our approach constructs dense 3D occupancy ground truth from stitched LiDAR point clouds, and employs a specially designed network named DenserRadar. The proposed method surpasses existing probability-based and learning-based radar point cloud detectors in terms of both point cloud density and accuracy on the K-Radar dataset.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
The Catalog of early-type Runaway stars from LAMOST-DR8
Authors:
Yanjun Guo,
Luqian Wang,
Chao Liu,
You Wu,
ZhanWen Han,
XueFei Chen
Abstract:
Runaway stars are OB-type stars ejected from their birthplace with large peculiar velocities. The leading hypothesis addressed in their formation includes the supernova ejection mechanism and the dynamic ejection scenario. Identification of runaway populations is the first step to investigating their formation and evolution. Here we present our work of searching for Galactic runaway candidate star…
▽ More
Runaway stars are OB-type stars ejected from their birthplace with large peculiar velocities. The leading hypothesis addressed in their formation includes the supernova ejection mechanism and the dynamic ejection scenario. Identification of runaway populations is the first step to investigating their formation and evolution. Here we present our work of searching for Galactic runaway candidate stars from the LAMOST Medium-Resolution Survey DR8 database. After studying the kinematic properties for a collection of 4,432 early-type stars, predominantly B-type stars, using the radial velocity measurements from LAMOST DR8 and astrometric solutions made by Gaia DR3, we identified 229 runaway candidate stars. They span a wide distribution in projected rotational velocities. We investigated the Galactic spatial distribution of the runaway population and noticed that most of them likely reside within the Galactic thin disk. Based upon analyzing the Doppler shifts of the candidate stars, we found two binary runaway candidates displaying velocity variation with estimated orbital periods of 40 and 61 days.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Very Long Baseline Array Observations of Parsec-scale Radio Emission in Dual Active Galactic Nuclei
Authors:
Wancheng Xu,
Lang Cui,
Xiang Liu,
Tao An,
Hongmin Cao,
Pengfei Jiang,
Luis C. Ho,
Ning Chang,
Xiaolong Yang,
Yuling Shen,
Gui** Tan,
Zhenhua Han,
Junhui Fan,
Ming Zhang
Abstract:
It is believed that dual active galactic nuclei (dual AGN) will form during galaxies merge. Studying dual-AGN emission can provide valuable insights into galaxy merging and evolution. To investigate parsec-scale radio emission properties, we observed eight radio components of four selected dual-AGN systems using the Very Long Baseline Array (VLBA) at 5 GHz in multiple-phase-center mode. Among them…
▽ More
It is believed that dual active galactic nuclei (dual AGN) will form during galaxies merge. Studying dual-AGN emission can provide valuable insights into galaxy merging and evolution. To investigate parsec-scale radio emission properties, we observed eight radio components of four selected dual-AGN systems using the Very Long Baseline Array (VLBA) at 5 GHz in multiple-phase-center mode. Among them, two compact radio components, labeled J0051+0020B and J2300-0005A, were detected clearly on parsec scales for the first time. However, the radio emission of the other six components was resolved out in the high-resolution images. We provided the values or upper limits of the brightness temperature and radio emission power, and analyzed the emission origins in detail for each target. Based on their physical properties reported in this work and in the literature, we suggest the radio emission in J0051+0020B and J2300-0005A originates primarily from compact jets, while the other six sources show more complex emission mechanisms. In addition, our VLBA observations suggest the systematic X-ray deficit in our dual-AGN sample is likely attributed to the tidally induced effect and possible viewing angle effect.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Artificial General Intelligence (AGI)-Native Wireless Systems: A Journey Beyond 6G
Authors:
Walid Saad,
Omar Hashash,
Christo Kurisummoottil Thomas,
Christina Chaccour,
Merouane Debbah,
Narayan Mandayam,
Zhu Han
Abstract:
Building future wireless systems that support services like digital twins (DTs) is challenging to achieve through advances to conventional technologies like meta-surfaces. While artificial intelligence (AI)-native networks promise to overcome some limitations of wireless technologies, developments still rely on AI tools like neural networks. Such tools struggle to cope with the non-trivial challen…
▽ More
Building future wireless systems that support services like digital twins (DTs) is challenging to achieve through advances to conventional technologies like meta-surfaces. While artificial intelligence (AI)-native networks promise to overcome some limitations of wireless technologies, developments still rely on AI tools like neural networks. Such tools struggle to cope with the non-trivial challenges of the network environment and the growing demands of emerging use cases. In this paper, we revisit the concept of AI-native wireless systems, equip** them with the common sense necessary to transform them into artificial general intelligence (AGI)-native systems. These systems acquire common sense by exploiting different cognitive abilities such as perception, analogy, and reasoning, that enable them to generalize and deal with unforeseen scenarios. Towards develo** the components of such a system, we start by showing how the perception module can be built through abstracting real-world elements into generalizable representations. These representations are then used to create a world model, founded on principles of causality and hyper-dimensional (HD) computing, that aligns with intuitive physics and enables analogical reasoning, that define common sense. Then, we explain how methods such as integrated information theory play a role in the proposed intent-driven and objective-driven planning methods that maneuver the AGI-native network to take actions. Next, we discuss how an AGI-native network can enable use cases related to human and autonomous agents: a) analogical reasoning for next-generation DTs, b) synchronized and resilient experiences for cognitive avatars, and c) brain-level metaverse experiences like holographic teleportation. Finally, we conclude with a set of recommendations to build AGI-native systems. Ultimately, we envision this paper as a roadmap for the beyond 6G era.
△ Less
Submitted 29 April, 2024;
originally announced May 2024.
-
How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses
Authors:
Jionghao Lin,
Zifei Han,
Danielle R. Thomas,
Ashish Gurung,
Shivang Gupta,
Vincent Aleven,
Kenneth R. Koedinger
Abstract:
One-on-one tutoring is widely acknowledged as an effective instructional method, conditioned on qualified tutors. However, the high demand for qualified tutors remains a challenge, often necessitating the training of novice tutors (i.e., trainees) to ensure effective tutoring. Research suggests that providing timely explanatory feedback can facilitate the training process for trainees. However, it…
▽ More
One-on-one tutoring is widely acknowledged as an effective instructional method, conditioned on qualified tutors. However, the high demand for qualified tutors remains a challenge, often necessitating the training of novice tutors (i.e., trainees) to ensure effective tutoring. Research suggests that providing timely explanatory feedback can facilitate the training process for trainees. However, it presents challenges due to the time-consuming nature of assessing trainee performance by human experts. Inspired by the recent advancements of large language models (LLMs), our study employed the GPT-4 model to build an explanatory feedback system. This system identifies trainees' responses in binary form (i.e., correct/incorrect) and automatically provides template-based feedback with responses appropriately rephrased by the GPT-4 model. We conducted our study on 410 responses from trainees across three training lessons: Giving Effective Praise, Reacting to Errors, and Determining What Students Know. Our findings indicate that: 1) using a few-shot approach, the GPT-4 model effectively identifies correct/incorrect trainees' responses from three training lessons with an average F1 score of 0.84 and an AUC score of 0.85; and 2) using the few-shot approach, the GPT-4 model adeptly rephrases incorrect trainees' responses into desired responses, achieving performance comparable to that of human experts.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses
Authors:
Jionghao Lin,
Eason Chen,
Zeifei Han,
Ashish Gurung,
Danielle R. Thomas,
Wei Tan,
Ngoc Dang Nguyen,
Kenneth R. Koedinger
Abstract:
Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study…
▽ More
Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study leverages the capabilities of large language models, specifically Generative Pre-Trained Transformers (GPT), to explore a sequence labeling approach focused on identifying components of desired and less desired praise for providing explanatory feedback within a tutor training dataset. Our aim is to equip tutors with actionable, explanatory feedback during online training lessons. To investigate the potential of GPT models for providing the explanatory feedback, we employed two commonly-used approaches: prompting and fine-tuning. To quantify the quality of highlighted praise components identified by GPT models, we introduced a Modified Intersection over Union (M-IoU) score. Our findings demonstrate that: (1) the M-IoU score effectively correlates with human judgment in evaluating sequence quality; (2) using two-shot prompting on GPT-3.5 resulted in decent performance in recognizing effort-based (M-IoU of 0.46) and outcome-based praise (M-IoU of 0.68); and (3) our optimally fine-tuned GPT-3.5 model achieved M-IoU scores of 0.64 for effort-based praise and 0.84 for outcome-based praise, aligning with the satisfaction levels evaluated by human coders. Our results show promise for using GPT models to provide feedback that focuses on specific elements in their open-ended responses that are desirable or could use improvement.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation
Authors:
Yousef Emami,
Hao Gao,
Kai Li,
Luis Almeida,
Eduardo Tovar,
Zhu Han
Abstract:
Unmanned Aerial Vehicle (UAV) swarms play an effective role in timely data collection from ground sensors in remote and hostile areas. Optimizing the collective behavior of swarms can improve data collection performance. This paper puts forth a new mean field flight resource allocation optimization to minimize age of information (AoI) of sensory data, where balancing the trade-off between the UAVs…
▽ More
Unmanned Aerial Vehicle (UAV) swarms play an effective role in timely data collection from ground sensors in remote and hostile areas. Optimizing the collective behavior of swarms can improve data collection performance. This paper puts forth a new mean field flight resource allocation optimization to minimize age of information (AoI) of sensory data, where balancing the trade-off between the UAVs movements and AoI is formulated as a mean field game (MFG). The MFG optimization yields an expansive solution space encompassing continuous state and action, resulting in significant computational complexity. To address practical situations, we propose, a new mean field hybrid proximal policy optimization (MF-HPPO) scheme to minimize the average AoI by optimizing the UAV's trajectories and data collection scheduling of the ground sensors given mixed continuous and discrete actions. Furthermore, a long short term memory (LSTM) is leveraged in MF-HPPO to predict the time-varying network state and stabilize the training. Numerical results demonstrate that the proposed MF-HPPO reduces the average AoI by up to 45 percent and 57 percent in the considered simulation setting, as compared to multi-agent deep Q-learning (MADQN) method and non-learning random algorithm, respectively.
△ Less
Submitted 2 May, 2024; v1 submitted 24 April, 2024;
originally announced May 2024.
-
On the Schwartz estimate for Hodge Laplacians on semisimple Lie groups
Authors:
Zhicheng Han
Abstract:
In this paper, we prove Schwartz estimates for Hodge Laplacian and Dirac operators on semisimple Lie groups. Alongside, we gives a version of Kuga lemma for its Lie algebra cohomology. This is a generalization of similar results on symmetric spaces. The main purpose of such estimates is to study the heat problem not only in the scalar case, but also for sections of vector bundles on homogeneous sp…
▽ More
In this paper, we prove Schwartz estimates for Hodge Laplacian and Dirac operators on semisimple Lie groups. Alongside, we gives a version of Kuga lemma for its Lie algebra cohomology. This is a generalization of similar results on symmetric spaces. The main purpose of such estimates is to study the heat problem not only in the scalar case, but also for sections of vector bundles on homogeneous spaces using Fourier analysis.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Authors:
Qingyang Zhang,
Yake Wei,
Zongbo Han,
Huazhu Fu,
Xi Peng,
Cheng Deng,
Qinghua Hu,
Cai Xu,
Jie Wen,
Di Hu,
Changqing Zhang
Abstract:
Multimodal fusion focuses on integrating information from multiple modalities with the goal of more accurate prediction, which has achieved remarkable progress in a wide range of scenarios, including autonomous driving and medical diagnosis. However, the reliability of multimodal fusion remains largely unexplored especially under low-quality data settings. This paper surveys the common challenges…
▽ More
Multimodal fusion focuses on integrating information from multiple modalities with the goal of more accurate prediction, which has achieved remarkable progress in a wide range of scenarios, including autonomous driving and medical diagnosis. However, the reliability of multimodal fusion remains largely unexplored especially under low-quality data settings. This paper surveys the common challenges and recent advances of multimodal fusion in the wild and presents them in a comprehensive taxonomy. From a data-centric view, we identify four main challenges that are faced by multimodal fusion on low-quality data, namely (1) noisy multimodal data that are contaminated with heterogeneous noises, (2) incomplete multimodal data that some modalities are missing, (3) imbalanced multimodal data that the qualities or properties of different modalities are significantly different and (4) quality-varying multimodal data that the quality of each modality dynamically changes with respect to different samples. This new taxonomy will enable researchers to understand the state of the field and identify several potential directions. We also provide discussion for the open problems in this field together with interesting future research directions.
△ Less
Submitted 5 May, 2024; v1 submitted 27 April, 2024;
originally announced April 2024.