Search | arXiv e-print repository

What Is Missing In Homophily? Disentangling Graph Homophily For Graph Neural Networks

Authors: Yilun Zheng, Sitao Luan, Lihui Chen

Abstract: Graph homophily refers to the phenomenon that connected nodes tend to share similar characteristics. Understanding this concept and its related metrics is crucial for designing effective Graph Neural Networks (GNNs). The most widely used homophily metrics, such as edge or node homophily, quantify such "similarity" as label consistency across the graph topology. These metrics are believed to be abl… ▽ More Graph homophily refers to the phenomenon that connected nodes tend to share similar characteristics. Understanding this concept and its related metrics is crucial for designing effective Graph Neural Networks (GNNs). The most widely used homophily metrics, such as edge or node homophily, quantify such "similarity" as label consistency across the graph topology. These metrics are believed to be able to reflect the performance of GNNs, especially on node-level tasks. However, many recent studies have empirically demonstrated that the performance of GNNs does not always align with homophily metrics, and how homophily influences GNNs still remains unclear and controversial. Then, a crucial question arises: What is missing in our current understanding of homophily? To figure out the missing part, in this paper, we disentangle the graph homophily into $3$ aspects: label, structural, and feature homophily, providing a more comprehensive understanding of GNN performance. To investigate their synergy, we propose a Contextual Stochastic Block Model with $3$ types of Homophily (CSBM-3H), where the topology and feature generation are controlled by the $3$ metrics. Based on the theoretical analysis of CSBM-3H, we derive a new composite metric, named Tri-Hom, that considers all $3$ aspects and overcomes the limitations of conventional homophily metrics. The theoretical conclusions and the effectiveness of Tri-Hom have been verified through synthetic experiments on CSBM-3H. In addition, we conduct experiments on $31$ real-world benchmark datasets and calculate the correlations between homophily metrics and model performance. Tri-Hom has significantly higher correlation values than $17$ existing metrics that only focus on a single homophily aspect, demonstrating its superiority and the importance of homophily synergy. Our code is available at \url{https://github.com/zylMozart/Disentangle_GraphHom}. △ Less

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2404.14886 [pdf, other]

GCEPNet: Graph Convolution-Enhanced Expectation Propagation for Massive MIMO Detection

Authors: Qincheng Lu, Sitao Luan, Xiao-Wen Chang

Abstract: Massive MIMO (multiple-input multiple-output) detection is an important topic in wireless communication and various machine learning based methods have been developed recently for this task. Expectation propagation (EP) and its variants are widely used for MIMO detection and have achieved the best performance. However, EP-based solvers fail to capture the correlation between unknown variables, lea… ▽ More Massive MIMO (multiple-input multiple-output) detection is an important topic in wireless communication and various machine learning based methods have been developed recently for this task. Expectation propagation (EP) and its variants are widely used for MIMO detection and have achieved the best performance. However, EP-based solvers fail to capture the correlation between unknown variables, leading to loss of information, and in addition, they are computationally expensive. In this paper, we show that the real-valued system can be modeled as spectral signal convolution on graph, through which the correlation between unknown variables can be captured. Based on this analysis, we propose graph convolution-enhanced expectation propagation (GCEPNet), a graph convolution-enhanced EP detector. GCEPNet incorporates data-dependent attention scores into Chebyshev polynomial for powerful graph convolution with better generalization capacity. It enables a better estimation of the cavity distribution for EP and empirically achieves the state-of-the-art (SOTA) MIMO detection performance with much faster inference speed. To our knowledge, we are the first to shed light on the connection between the system model and graph convolution, and the first to design the data-dependent attention scores for graph convolution. △ Less

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.00904 [pdf]

A Fast Percolation-Dijkstra Routing Method for Mega-Constellation Backbone Network

Authors: Shenshen Luan, Luyuan Wang, Yepeng Liu, Ninghan Sun, Ran Zhang

Abstract: The real-time routing for satellite communication of the mega-constellations is being challenged due to the large-scale of network nodes, especially on devices with limited computation such as onboard embedded systems. In this paper, a fast routing method is proposed for mega-constellation backbone networks. Firstly, inspired by the regularity and sparse characteristics of mega-constellations, the… ▽ More The real-time routing for satellite communication of the mega-constellations is being challenged due to the large-scale of network nodes, especially on devices with limited computation such as onboard embedded systems. In this paper, a fast routing method is proposed for mega-constellation backbone networks. Firstly, inspired by the regularity and sparse characteristics of mega-constellations, the 4-degree percolation theory is proposed to describe the node search process. Then, dynamic minimum search and map** methods are used to narrow down the traversal range. The proposed method performs as well as the heap-optimized Dijkstra algorithm with less memory space and dynamic access. The experimental results show that the method proposed in this paper can significantly reduce routing computation time, especially on the onboard, edge-computing or other computation-limited devices. △ Less

Submitted 2 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

arXiv:2403.01475 [pdf, other]

Representation Learning on Heterophilic Graph with Directional Neighborhood Attention

Authors: Qincheng Lu, Jiaqi Zhu, Sitao Luan, Xiao-Wen Chang

Abstract: Graph Attention Network (GAT) is one of the most popular Graph Neural Network (GNN) architecture, which employs the attention mechanism to learn edge weights and has demonstrated promising performance in various applications. However, since it only incorporates information from immediate neighborhood, it lacks the ability to capture long-range and global graph information, leading to unsatisfactor… ▽ More Graph Attention Network (GAT) is one of the most popular Graph Neural Network (GNN) architecture, which employs the attention mechanism to learn edge weights and has demonstrated promising performance in various applications. However, since it only incorporates information from immediate neighborhood, it lacks the ability to capture long-range and global graph information, leading to unsatisfactory performance on some datasets, particularly on heterophilic graphs. To address this limitation, we propose the Directional Graph Attention Network (DGAT) in this paper. DGAT is able to combine the feature-based attention with the global directional information extracted from the graph topology. To this end, a new class of Laplacian matrices is proposed which can provably reduce the diffusion distance between nodes. Based on the new Laplacian, topology-guided neighbour pruning and edge adding mechanisms are proposed to remove the noisy and capture the helpful long-range neighborhood information. Besides, a global directional attention is designed to enable a topological-aware information propagation. The superiority of the proposed DGAT over the baseline GAT has also been verified through experiments on real-world benchmarks and synthetic data sets. It also outperforms the state-of-the-art (SOTA) models on 6 out of 7 real-world benchmark datasets. △ Less

Submitted 3 March, 2024; originally announced March 2024.

arXiv:2402.06854 [pdf, other]

Gyroscope-Assisted Motion Deblurring Network

Authors: Simin Luan, Cong Yang, Zeyd Boukhers, Xue Qin, Dongfeng Cheng, Wei Sui, Zhijun Li

Abstract: Image research has shown substantial attention in deblurring networks in recent years. Yet, their practical usage in real-world deblurring, especially motion blur, remains limited due to the lack of pixel-aligned training triplets (background, blurred image, and blur heat map) and restricted information inherent in blurred images. This paper presents a simple yet efficient framework to synthetic a… ▽ More Image research has shown substantial attention in deblurring networks in recent years. Yet, their practical usage in real-world deblurring, especially motion blur, remains limited due to the lack of pixel-aligned training triplets (background, blurred image, and blur heat map) and restricted information inherent in blurred images. This paper presents a simple yet efficient framework to synthetic and restore motion blur images using Inertial Measurement Unit (IMU) data. Notably, the framework includes a strategy for training triplet generation, and a Gyroscope-Aided Motion Deblurring (GAMD) network for blurred image restoration. The rationale is that through harnessing IMU data, we can determine the transformation of the camera pose during the image exposure phase, facilitating the deduction of the motion trajectory (aka. blur trajectory) for each point inside the three-dimensional space. Thus, the synthetic triplets using our strategy are inherently close to natural motion blur, strictly pixel-aligned, and mass-producible. Through comprehensive experiments, we demonstrate the advantages of the proposed framework: only two-pixel errors between our synthetic and real-world blur trajectories, a marked improvement (around 33.17%) of the state-of-the-art deblurring method MIMO on Peak Signal-to-Noise Ratio (PSNR). △ Less

Submitted 9 February, 2024; originally announced February 2024.

arXiv:2306.12640 [pdf, other]

On Addressing the Limitations of Graph Neural Networks

Authors: Sitao Luan

Abstract: This report gives a summary of two problems about graph convolutional networks (GCNs): over-smoothing and heterophily challenges, and outlines future directions to explore. This report gives a summary of two problems about graph convolutional networks (GCNs): over-smoothing and heterophily challenges, and outlines future directions to explore. △ Less

Submitted 2 July, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

Comments: Proposal report and a concise version of Sitao Luan's thesis. The weight initialization part is quite interesting but will not be included in Sitao's formal thesis, thus Sitao put this preprint report online. arXiv admin note: substantial text overlap with arXiv:2109.05641, arXiv:2210.07606

arXiv:2304.14851 [pdf, other]

doi 10.1063/5.0146342

Effect of non-local transport of hot electrons on the laser-target ablation

Authors: Z. H. Chen, X. H. Yang, G. B. Zhang, Y. Y. Ma, H. Xu, S. X. Luan, J. Zhang

Abstract: The non-local heat transport of hot electrons during high-intensity lasers interaction with plasmas can preheat the fuel and limit the heat flow in inertial confinement fusion. It increases the entropy of the fuel and decreases the final compression. In this paper, the non-local electron transport model that is based on the improved SNB algorithm has been embedded into the radiation hydrodynamic c… ▽ More The non-local heat transport of hot electrons during high-intensity lasers interaction with plasmas can preheat the fuel and limit the heat flow in inertial confinement fusion. It increases the entropy of the fuel and decreases the final compression. In this paper, the non-local electron transport model that is based on the improved SNB algorithm has been embedded into the radiation hydrodynamic code and is benchmarked with two classical non-local transport cases. Then we studied a 2$ω$ laser ablating a CH target by using the non-local module. It is found that the non-local effect becomes significant when the laser intensity is above $1\times 10^{14} \mathrm{W/cm^{2}} $. The mass ablation rate from the SNB model is increased compared to that of the flux-limited model due to the lower coronal plasma temperature. This non-local model has a better agreement with the experimental results compared to that of the flux-limited model. The non-local transport is strongly dependent on the laser frequency, and the thresholds that the non-local transport should be considered are obtained for lasers of different frequencies. The appropriate flux-limiters that should be employed in the flux-limited model for different lasers are also presented. The results here should have a good reference for the laser-target ablation applications. △ Less

Submitted 28 April, 2023; originally announced April 2023.

Comments: 8 pages, 9 figures

arXiv:2304.14621 [pdf, other]

MUDiff: Unified Diffusion for Complete Molecule Generation

Authors: Chenqing Hua, Sitao Luan, Minkai Xu, Rex Ying, Jie Fu, Stefano Ermon, Doina Precup

Abstract: Molecule generation is a very important practical problem, with uses in drug discovery and material design, and AI methods promise to provide useful solutions. However, existing methods for molecule generation focus either on 2D graph structure or on 3D geometric structure, which is not sufficient to represent a complete molecule as 2D graph captures mainly topology while 3D geometry captures main… ▽ More Molecule generation is a very important practical problem, with uses in drug discovery and material design, and AI methods promise to provide useful solutions. However, existing methods for molecule generation focus either on 2D graph structure or on 3D geometric structure, which is not sufficient to represent a complete molecule as 2D graph captures mainly topology while 3D geometry captures mainly spatial atom arrangements. Combining these representations is essential to better represent a molecule. In this paper, we present a new model for generating a comprehensive representation of molecules, including atom features, 2D discrete molecule structures, and 3D continuous molecule coordinates, by combining discrete and continuous diffusion processes. The use of diffusion processes allows for capturing the probabilistic nature of molecular processes and exploring the effect of different factors on molecular structures. Additionally, we propose a novel graph transformer architecture to denoise the diffusion process. The transformer adheres to 3D roto-translation equivariance constraints, allowing it to learn invariant atom and edge representations while preserving the equivariance of atom coordinates. This transformer can be used to learn molecular representations robust to geometric transformations. We evaluate the performance of our model through experiments and comparisons with existing methods, showing its ability to generate more stable and valid molecules. Our model is a promising approach for designing stable and diverse molecules and can be applied to a wide range of tasks in molecular modeling. △ Less

Submitted 5 February, 2024; v1 submitted 28 April, 2023; originally announced April 2023.

arXiv:2304.14274 [pdf, other]

When Do Graph Neural Networks Help with Node Classification? Investigating the Impact of Homophily Principle on Node Distinguishability

Authors: Sitao Luan, Chenqing Hua, Minkai Xu, Qincheng Lu, Jiaqi Zhu, Xiao-Wen Chang, Jie Fu, Jure Leskovec, Doina Precup

Abstract: Homophily principle, i.e., nodes with the same labels are more likely to be connected, has been believed to be the main reason for the performance superiority of Graph Neural Networks (GNNs) over Neural Networks on node classification tasks. Recent research suggests that, even in the absence of homophily, the advantage of GNNs still exists as long as nodes from the same class share similar neighbo… ▽ More Homophily principle, i.e., nodes with the same labels are more likely to be connected, has been believed to be the main reason for the performance superiority of Graph Neural Networks (GNNs) over Neural Networks on node classification tasks. Recent research suggests that, even in the absence of homophily, the advantage of GNNs still exists as long as nodes from the same class share similar neighborhood patterns. However, this argument only considers intra-class Node Distinguishability (ND) but neglects inter-class ND, which provides incomplete understanding of homophily on GNNs. In this paper, we first demonstrate such deficiency with examples and argue that an ideal situation for ND is to have smaller intra-class ND than inter-class ND. To formulate this idea and study ND deeply, we propose Contextual Stochastic Block Model for Homophily (CSBM-H) and define two metrics, Probabilistic Bayes Error (PBE) and negative generalized Jeffreys divergence, to quantify ND. With the metrics, we visualize and analyze how graph filters, node degree distributions and class variances influence ND, and investigate the combined effect of intra- and inter-class ND. Besides, we discovered the mid-homophily pitfall, which occurs widely in graph datasets. Furthermore, we verified that, in real-work tasks, the superiority of GNNs is indeed closely related to both intra- and inter-class ND regardless of homophily levels. Grounded in this observation, we propose a new hypothesis-testing based performance metric beyond homophily, which is non-linear, feature-based and can provide statistical threshold value for GNNs' the superiority. Experiments indicate that it is significantly more effective than the existing homophily metrics on revealing the advantage and disadvantage of graph-aware modes on both synthetic and benchmark real-world datasets. △ Less

Submitted 1 January, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: Accepted by 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

arXiv:2304.04809 [pdf, other]

doi 10.1126/sciadv.adi1453

A data-driven framework for structure-property correlation in ordered and disordered cellular metamaterials

Authors: Shengzhi Luan, Enze Chen, Joel John, Stavros Gaitanaros

Abstract: Cellular solids and micro-lattices are a class of lightweight architected materials that have been established for their unique mechanical, thermal, and acoustic properties. It has been shown that by tuning material architecture, a combination of topology and solid(s) distribution, one can design new material systems, also known as metamaterials, with superior performance compared to conventional… ▽ More Cellular solids and micro-lattices are a class of lightweight architected materials that have been established for their unique mechanical, thermal, and acoustic properties. It has been shown that by tuning material architecture, a combination of topology and solid(s) distribution, one can design new material systems, also known as metamaterials, with superior performance compared to conventional monolithic solids. Despite the continuously growing complexity of synthesized microstructures, mainly enabled by developments in additive manufacturing, correlating their morphological characteristics to the resulting material properties has not advanced equally. This work aims to develop a systematic data-driven framework that is capable of identifying all key microstructural characteristics and evaluating their effect on a target material property. The framework relies on integrating virtual structure generation and quantification algorithms with interpretable surrogate models. The effectiveness of the proposed approach is demonstrated by analyzing the effective stiffness of a broad class of two-dimensional (2D) cellular metamaterials with varying topological disorder. The results reveal the complex manner in which well-known stiffness contributors, including nodal connectivity, cooperate with often-overlooked microstructural features such as strut orientation, to determine macroscopic material behavior. We further re-examine Maxwell's criteria regarding the rigidity of frame structures, as they pertain to the effective stiffness of cellular solids and showcase microstructures that violate them. This framework can be used for structure-property correlation in different classes of metamaterials as well as the discovery of novel architectures with tailored combinations of material properties. △ Less

Submitted 10 April, 2023; originally announced April 2023.

Journal ref: Sci. Adv. 9, eadi1453 (2023)

arXiv:2301.03734 [pdf, other]

Exoshuffle-CloudSort

Authors: Frank Sifei Luan, Stephanie Wang, Samyukta Yagati, Sean Kim, Kenneth Lien, Isaac Ong, Tony Hong, SangBin Cho, Eric Liang, Ion Stoica

Abstract: We present Exoshuffle-CloudSort, a sorting application running on top of Ray using the Exoshuffle architecture. Exoshuffle-CloudSort runs on Amazon EC2, with input and output data stored on Amazon S3. Using 40 i4i.4xlarge workers, Exoshuffle-CloudSort completes the 100 TB CloudSort Benchmark (Indy category) in 5378 seconds, with an average total cost of $97. We present Exoshuffle-CloudSort, a sorting application running on top of Ray using the Exoshuffle architecture. Exoshuffle-CloudSort runs on Amazon EC2, with input and output data stored on Amazon S3. Using 40 i4i.4xlarge workers, Exoshuffle-CloudSort completes the 100 TB CloudSort Benchmark (Indy category) in 5378 seconds, with an average total cost of $97. △ Less

Submitted 9 January, 2023; originally announced January 2023.

arXiv:2212.10822 [pdf, other]

Complete the Missing Half: Augmenting Aggregation Filtering with Diversification for Graph Convolutional Neural Networks

Authors: Sitao Luan, Mingde Zhao, Chenqing Hua, Xiao-Wen Chang, Doina Precup

Abstract: The core operation of current Graph Neural Networks (GNNs) is the aggregation enabled by the graph Laplacian or message passing, which filters the neighborhood information of nodes. Though effective for various tasks, in this paper, we show that they are potentially a problematic factor underlying all GNN models for learning on certain datasets, as they force the node representations similar, maki… ▽ More The core operation of current Graph Neural Networks (GNNs) is the aggregation enabled by the graph Laplacian or message passing, which filters the neighborhood information of nodes. Though effective for various tasks, in this paper, we show that they are potentially a problematic factor underlying all GNN models for learning on certain datasets, as they force the node representations similar, making the nodes gradually lose their identity and become indistinguishable. Hence, we augment the aggregation operations with their dual, i.e. diversification operators that make the node more distinct and preserve the identity. Such augmentation replaces the aggregation with a two-channel filtering process that, in theory, is beneficial for enriching the node representations. In practice, the proposed two-channel filters can be easily patched on existing GNN methods with diverse training strategies, including spectral and spatial (message passing) methods. In the experiments, we observe desired characteristics of the models and significant performance boost upon the baselines on 9 node classification tasks. △ Less

Submitted 21 December, 2022; originally announced December 2022.

Comments: Accepted as Oral Presentation at NeurIPS 2022 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2022)

arXiv:2210.16979 [pdf, ps, other]

When Do We Need Graph Neural Networks for Node Classification?

Authors: Sitao Luan, Chenqing Hua, Qincheng Lu, Jiaqi Zhu, Xiao-Wen Chang, Doina Precup

Abstract: Graph Neural Networks (GNNs) extend basic Neural Networks (NNs) by additionally making use of graph structure based on the relational inductive bias (edge bias), rather than treating the nodes as collections of independent and identically distributed (i.i.d.) samples. Though GNNs are believed to outperform basic NNs in real-world tasks, it is found that in some cases, GNNs have little performance… ▽ More Graph Neural Networks (GNNs) extend basic Neural Networks (NNs) by additionally making use of graph structure based on the relational inductive bias (edge bias), rather than treating the nodes as collections of independent and identically distributed (i.i.d.) samples. Though GNNs are believed to outperform basic NNs in real-world tasks, it is found that in some cases, GNNs have little performance gain or even underperform graph-agnostic NNs. To identify these cases, based on graph signal processing and statistical hypothesis testing, we propose two measures which analyze the cases in which the edge bias in features and labels does not provide advantages. Based on the measures, a threshold value can be given to predict the potential performance advantages of graph-aware models over graph-agnostic models. △ Less

Submitted 3 November, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

Comments: Accepted by 12th International Conference on Complex Networks and Their Applications

arXiv:2210.07606 [pdf, other]

Revisiting Heterophily For Graph Neural Networks

Authors: Sitao Luan, Chenqing Hua, Qincheng Lu, Jiaqi Zhu, Mingde Zhao, Shuyuan Zhang, Xiao-Wen Chang, Doina Precup

Abstract: Graph Neural Networks (GNNs) extend basic Neural Networks (NNs) by using graph structures based on the relational inductive bias (homophily assumption). While GNNs have been commonly believed to outperform NNs in real-world tasks, recent work has identified a non-trivial set of datasets where their performance compared to NNs is not satisfactory. Heterophily has been considered the main cause of t… ▽ More Graph Neural Networks (GNNs) extend basic Neural Networks (NNs) by using graph structures based on the relational inductive bias (homophily assumption). While GNNs have been commonly believed to outperform NNs in real-world tasks, recent work has identified a non-trivial set of datasets where their performance compared to NNs is not satisfactory. Heterophily has been considered the main cause of this empirical observation and numerous works have been put forward to address it. In this paper, we first revisit the widely used homophily metrics and point out that their consideration of only graph-label consistency is a shortcoming. Then, we study heterophily from the perspective of post-aggregation node similarity and define new homophily metrics, which are potentially advantageous compared to existing ones. Based on this investigation, we prove that some harmful cases of heterophily can be effectively addressed by local diversification operation. Then, we propose the Adaptive Channel Mixing (ACM), a framework to adaptively exploit aggregation, diversification and identity channels node-wisely to extract richer localized information for diverse node heterophily situations. ACM is more powerful than the commonly used uni-channel framework for node classification tasks on heterophilic graphs and is easy to be implemented in baseline GNN layers. When evaluated on 10 benchmark node classification tasks, ACM-augmented baselines consistently achieve significant performance gain, exceeding state-of-the-art GNNs on most tasks without incurring significant computational burden. △ Less

Submitted 14 October, 2022; originally announced October 2022.

Comments: Published at 36th Conference on Neural Information Processing Systems (NeurIPS 2022). arXiv admin note: substantial text overlap with arXiv:2109.05641

arXiv:2206.06089 [pdf, other]

Graph Neural Networks Intersect Probabilistic Graphical Models: A Survey

Authors: Chenqing Hua, Sitao Luan, Qian Zhang, Jie Fu

Abstract: Graphs are a powerful data structure to represent relational data and are widely used to describe complex real-world data structures. Probabilistic Graphical Models (PGMs) have been well-developed in the past years to mathematically model real-world scenarios in compact graphical representations of distributions of variables. Graph Neural Networks (GNNs) are new inference methods developed in rece… ▽ More Graphs are a powerful data structure to represent relational data and are widely used to describe complex real-world data structures. Probabilistic Graphical Models (PGMs) have been well-developed in the past years to mathematically model real-world scenarios in compact graphical representations of distributions of variables. Graph Neural Networks (GNNs) are new inference methods developed in recent years and are attracting growing attention due to their effectiveness and flexibility in solving inference and learning problems over graph-structured data. These two powerful approaches have different advantages in capturing relations from observations and how they conduct message passing, and they can benefit each other in various tasks. In this survey, we broadly study the intersection of GNNs and PGMs. Specifically, we first discuss how GNNs can benefit from learning structured representations in PGMs, generate explainable predictions by PGMs, and how PGMs can infer object relationships. Then we discuss how GNNs are implemented in PGMs for more efficient inference and structure learning. In the end, we summarize the benchmark datasets used in recent studies and discuss promising future directions. △ Less

Submitted 30 January, 2023; v1 submitted 23 May, 2022; originally announced June 2022.

arXiv:2203.05072 [pdf, other]

Exoshuffle: An Extensible Shuffle Architecture

Authors: Frank Sifei Luan, Stephanie Wang, Samyukta Yagati, Sean Kim, Kenneth Lien, Isaac Ong, Tony Hong, SangBin Cho, Eric Liang, Ion Stoica

Abstract: Shuffle is one of the most expensive communication primitives in distributed data processing and is difficult to scale. Prior work addresses the scalability challenges of shuffle by building monolithic shuffle systems. These systems are costly to develop, and they are tightly integrated with batch processing frameworks that offer only high-level APIs such as SQL. New applications, such as ML train… ▽ More Shuffle is one of the most expensive communication primitives in distributed data processing and is difficult to scale. Prior work addresses the scalability challenges of shuffle by building monolithic shuffle systems. These systems are costly to develop, and they are tightly integrated with batch processing frameworks that offer only high-level APIs such as SQL. New applications, such as ML training, require more flexibility and finer-grained interoperability with shuffle. They are often unable to leverage existing shuffle optimizations. We propose an extensible shuffle architecture. We present Exoshuffle, a library for distributed shuffle that offers competitive performance and scalability as well as greater flexibility than monolithic shuffle systems. We design an architecture that decouples the shuffle control plane from the data plane without sacrificing performance. We build Exoshuffle on Ray, a distributed futures system for data and ML applications, and demonstrate that we can: (1) rewrite previous shuffle optimizations as application-level libraries with an order of magnitude less code, (2) achieve shuffle performance and scalability competitive with monolithic shuffle systems, and break the CloudSort record as the world's most cost-efficient sorting system, and (3) enable new applications such as ML training to easily leverage scalable shuffle. △ Less

Submitted 17 August, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

arXiv:2201.01441 [pdf, other]

doi 10.1145/3514221.3517885

Balsa: Learning a Query Optimizer Without Expert Demonstrations

Authors: Zongheng Yang, Wei-Lin Chiang, Sifei Luan, Gautam Mittal, Michael Luo, Ion Stoica

Abstract: Query optimizers are a performance-critical component in every database system. Due to their complexity, optimizers take experts months to write and years to refine. In this work, we demonstrate for the first time that learning to optimize queries without learning from an expert optimizer is both possible and efficient. We present Balsa, a query optimizer built by deep reinforcement learning. Bals… ▽ More Query optimizers are a performance-critical component in every database system. Due to their complexity, optimizers take experts months to write and years to refine. In this work, we demonstrate for the first time that learning to optimize queries without learning from an expert optimizer is both possible and efficient. We present Balsa, a query optimizer built by deep reinforcement learning. Balsa first learns basic knowledge from a simple, environment-agnostic simulator, followed by safe learning in real execution. On the Join Order Benchmark, Balsa matches the performance of two expert query optimizers, both open-source and commercial, with two hours of learning, and outperforms them by up to 2.8$\times$ in workload runtime after a few more hours. Balsa thus opens the possibility of automatically learning to optimize in future compute environments where expert-designed optimizers do not exist. △ Less

Submitted 3 May, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

Comments: SIGMOD 2022; code released at: https://github.com/balsa-project/balsa/

arXiv:2109.05641 [pdf, other]

Is Heterophily A Real Nightmare For Graph Neural Networks To Do Node Classification?

Authors: Sitao Luan, Chenqing Hua, Qincheng Lu, Jiaqi Zhu, Mingde Zhao, Shuyuan Zhang, Xiao-Wen Chang, Doina Precup

Abstract: Graph Neural Networks (GNNs) extend basic Neural Networks (NNs) by using the graph structures based on the relational inductive bias (homophily assumption). Though GNNs are believed to outperform NNs in real-world tasks, performance advantages of GNNs over graph-agnostic NNs seem not generally satisfactory. Heterophily has been considered as a main cause and numerous works have been put forward to… ▽ More Graph Neural Networks (GNNs) extend basic Neural Networks (NNs) by using the graph structures based on the relational inductive bias (homophily assumption). Though GNNs are believed to outperform NNs in real-world tasks, performance advantages of GNNs over graph-agnostic NNs seem not generally satisfactory. Heterophily has been considered as a main cause and numerous works have been put forward to address it. In this paper, we first show that not all cases of heterophily are harmful for GNNs with aggregation operation. Then, we propose new metrics based on a similarity matrix which considers the influence of both graph structure and input features on GNNs. The metrics demonstrate advantages over the commonly used homophily metrics by tests on synthetic graphs. From the metrics and the observations, we find some cases of harmful heterophily can be addressed by diversification operation. With this fact and knowledge of filterbanks, we propose the Adaptive Channel Mixing (ACM) framework to adaptively exploit aggregation, diversification and identity channels in each GNN layer to address harmful heterophily. We validate the ACM-augmented baselines with 10 real-world node classification tasks. They consistently achieve significant performance gain and exceed the state-of-the-art GNNs on most of the tasks without incurring significant computational burden. △ Less

Submitted 12 September, 2021; originally announced September 2021.

arXiv:2106.02097 [pdf, other]

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

Authors: Mingde Zhao, Zhen Liu, Sitao Luan, Shuyuan Zhang, Doina Precup, Yoshua Bengio

Abstract: We present an end-to-end, model-based deep reinforcement learning agent which dynamically attends to relevant parts of its state during planning. The agent uses a bottleneck mechanism over a set-based representation to force the number of entities to which the agent attends at each planning step to be small. In experiments, we investigate the bottleneck mechanism with several sets of customized en… ▽ More We present an end-to-end, model-based deep reinforcement learning agent which dynamically attends to relevant parts of its state during planning. The agent uses a bottleneck mechanism over a set-based representation to force the number of entities to which the agent attends at each planning step to be small. In experiments, we investigate the bottleneck mechanism with several sets of customized environments featuring different challenges. We consistently observe that the design allows the planning agents to generalize their learned task-solving abilities in compatible unseen environments by attending to the relevant objects, leading to better out-of-distribution generalization performance. △ Less

Submitted 4 November, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

Comments: NeurIPS camera-ready version

arXiv:2008.08844 [pdf, other]

Complete the Missing Half: Augmenting Aggregation Filtering with Diversification for Graph Convolutional Networks

Authors: Sitao Luan, Mingde Zhao, Chenqing Hua, Xiao-Wen Chang, Doina Precup

Abstract: The core operation of current Graph Neural Networks (GNNs) is the aggregation enabled by the graph Laplacian or message passing, which filters the neighborhood node information. Though effective for various tasks, in this paper, we show that they are potentially a problematic factor underlying all GNN methods for learning on certain datasets, as they force the node representations similar, making… ▽ More The core operation of current Graph Neural Networks (GNNs) is the aggregation enabled by the graph Laplacian or message passing, which filters the neighborhood node information. Though effective for various tasks, in this paper, we show that they are potentially a problematic factor underlying all GNN methods for learning on certain datasets, as they force the node representations similar, making the nodes gradually lose their identity and become indistinguishable. Hence, we augment the aggregation operations with their dual, i.e. diversification operators that make the node more distinct and preserve the identity. Such augmentation replaces the aggregation with a two-channel filtering process that, in theory, is beneficial for enriching the node representations. In practice, the proposed two-channel filters can be easily patched on existing GNN methods with diverse training strategies, including spectral and spatial (message passing) methods. In the experiments, we observe desired characteristics of the models and significant performance boost upon the baselines on 9 node classification tasks. △ Less

Submitted 2 November, 2022; v1 submitted 20 August, 2020; originally announced August 2020.

Comments: New Frontiers in Graph Learning (GLFrontiers) Workshop (Oral), NeurIPS 2022

arXiv:2008.08838 [pdf, ps, other]

Training Matters: Unlocking Potentials of Deeper Graph Convolutional Neural Networks

Authors: Sitao Luan, Mingde Zhao, Xiao-Wen Chang, Doina Precup

Abstract: The performance limit of Graph Convolutional Networks (GCNs) and the fact that we cannot stack more of them to increase the performance, which we usually do for other deep learning paradigms, are pervasively thought to be caused by the limitations of the GCN layers, including insufficient expressive power, etc. However, if so, for a fixed architecture, it would be unlikely to lower the training di… ▽ More The performance limit of Graph Convolutional Networks (GCNs) and the fact that we cannot stack more of them to increase the performance, which we usually do for other deep learning paradigms, are pervasively thought to be caused by the limitations of the GCN layers, including insufficient expressive power, etc. However, if so, for a fixed architecture, it would be unlikely to lower the training difficulty and to improve performance by changing only the training procedure, which we show in this paper not only possible but possible in several ways. This paper first identify the training difficulty of GCNs from the perspective of graph signal energy loss. More specifically, we find that the loss of energy in the backward pass during training nullifies the learning of the layers closer to the input. Then, we propose several methodologies to mitigate the training problem by slightly modifying the GCN operator, from the energy perspective. After empirical validation, we confirm that these changes of operator lead to significant decrease in the training difficulties and notable performance boost, without changing the composition of parameters. With these, we conclude that the root cause of the problem is more likely the training difficulty than the others. △ Less

Submitted 3 November, 2023; v1 submitted 20 August, 2020; originally announced August 2020.

Comments: Accepted by 12th International Conference on Complex Networks and Their Applications

arXiv:2006.08109 [pdf, other]

NeuroCard: One Cardinality Estimator for All Tables

Authors: Zongheng Yang, Amog Kamsetty, Sifei Luan, Eric Liang, Yan Duan, Xi Chen, Ion Stoica

Abstract: Query optimizers rely on accurate cardinality estimates to produce good execution plans. Despite decades of research, existing cardinality estimators are inaccurate for complex queries, due to making lossy modeling assumptions and not capturing inter-table correlations. In this work, we show that it is possible to learn the correlations across all tables in a database without any independence assu… ▽ More Query optimizers rely on accurate cardinality estimates to produce good execution plans. Despite decades of research, existing cardinality estimators are inaccurate for complex queries, due to making lossy modeling assumptions and not capturing inter-table correlations. In this work, we show that it is possible to learn the correlations across all tables in a database without any independence assumptions. We present NeuroCard, a join cardinality estimator that builds a single neural density estimator over an entire database. Leveraging join sampling and modern deep autoregressive models, NeuroCard makes no inter-table or inter-column independence assumptions in its probabilistic modeling. NeuroCard achieves orders of magnitude higher accuracy than the best prior methods (a new state-of-the-art result of 8.5$\times$ maximum error on JOB-light), scales to dozens of tables, while being compact in space (several MBs) and efficient to construct or update (seconds to minutes). △ Less

Submitted 2 November, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

Comments: VLDB 2021

arXiv:1909.09186 [pdf, ps, other]

Revisit Policy Optimization in Matrix Form

Authors: Sitao Luan, Xiao-Wen Chang, Doina Precup

Abstract: In tabular case, when the reward and environment dynamics are known, policy evaluation can be written as $\bm{V}_{\bmπ} = (I - γP_{\bmπ})^{-1} \bm{r}_{\bmπ}$, where $P_{\bmπ}$ is the state transition matrix given policy ${\bmπ}$ and $\bm{r}_{\bmπ}$ is the reward signal given ${\bmπ}$. What annoys us is that $P_{\bmπ}$ and $\bm{r}_{\bmπ}$ are both mixed with ${\bmπ}$, which means every time when we… ▽ More In tabular case, when the reward and environment dynamics are known, policy evaluation can be written as $\bm{V}_{\bmπ} = (I - γP_{\bmπ})^{-1} \bm{r}_{\bmπ}$, where $P_{\bmπ}$ is the state transition matrix given policy ${\bmπ}$ and $\bm{r}_{\bmπ}$ is the reward signal given ${\bmπ}$. What annoys us is that $P_{\bmπ}$ and $\bm{r}_{\bmπ}$ are both mixed with ${\bmπ}$, which means every time when we update ${\bmπ}$, they will change together. In this paper, we leverage the notation from \cite{wang2007dual} to disentangle ${\bmπ}$ and environment dynamics which makes optimization over policy more straightforward. We show that policy gradient theorem \cite{sutton2018reinforcement} and TRPO \cite{schulman2015trust} can be put into a more general framework and such notation has good potential to be extended to model-based reinforcement learning. △ Less

Submitted 19 September, 2019; originally announced September 2019.

arXiv:1906.02174 [pdf, other]

Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks

Authors: Sitao Luan, Mingde Zhao, Xiao-Wen Chang, Doina Precup

Abstract: Recently, neural network based approaches have achieved significant improvement for solving large, complex, graph-structured problems. However, their bottlenecks still need to be addressed, and the advantages of multi-scale information and deep architectures have not been sufficiently exploited. In this paper, we theoretically analyze how existing Graph Convolutional Networks (GCNs) have limited e… ▽ More Recently, neural network based approaches have achieved significant improvement for solving large, complex, graph-structured problems. However, their bottlenecks still need to be addressed, and the advantages of multi-scale information and deep architectures have not been sufficiently exploited. In this paper, we theoretically analyze how existing Graph Convolutional Networks (GCNs) have limited expressive power due to the constraint of the activation functions and their architectures. We generalize spectral graph convolution and deep GCN in block Krylov subspace forms and devise two architectures, both with the potential to be scaled deeper but each making use of the multi-scale information in different ways. We further show that the equivalence of these two architectures can be established under certain conditions. On several node classification tasks, with or without the help of validation, the two new architectures achieve better performance compared to many state-of-the-art methods. △ Less

Submitted 8 September, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

Comments: Accepted and to be published by NeurIPS 2019

arXiv:1904.11439 [pdf, other]

META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation

Authors: Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang, Doina Precup

Abstract: Temporal-Difference (TD) learning is a standard and very successful reinforcement learning approach, at the core of both algorithms that learn the value of a given policy, as well as algorithms which learn how to improve policies. TD-learning with eligibility traces provides a way to boost sample efficiency by temporal credit assignment, i.e. deciding which portion of a reward should be assigned t… ▽ More Temporal-Difference (TD) learning is a standard and very successful reinforcement learning approach, at the core of both algorithms that learn the value of a given policy, as well as algorithms which learn how to improve policies. TD-learning with eligibility traces provides a way to boost sample efficiency by temporal credit assignment, i.e. deciding which portion of a reward should be assigned to predecessor states that occurred at different previous times, controlled by a parameter $λ$. However, tuning this parameter can be time-consuming, and not tuning it can lead to inefficient learning. For better sample efficiency of TD-learning, we propose a meta-learning method for adjusting the eligibility trace parameter, in a state-dependent manner. The adaptation is achieved with the help of auxiliary learners that learn distributional information about the update targets online, incurring roughly the same computational complexity per step as the usual value learner. Our approach can be used both in on-policy and off-policy learning. We prove that, under some assumptions, the proposed method improves the overall quality of the update targets, by minimizing the overall target error. This method can be viewed as a plugin to assist prediction with function approximation by meta-learning feature (observation)-based $λ$ online, or even in the control case to assist policy improvement. Our empirical evaluation demonstrates significant performance improvements, as well as improved robustness of the proposed algorithm to learning rate variation. △ Less

Submitted 16 May, 2020; v1 submitted 25 April, 2019; originally announced April 2019.

Comments: Accepted by AAMAS 2020

arXiv:1901.02588 [pdf, ps, other]

Optimality conditions for optimal control of multisolution p-Laplacian elliptic equations

Authors: Hongwei Lou, Shu Luan

Abstract: In this paper, an optimal control problem governed by a class of p-Laplacian elliptic equations is studied. In particular, as no monotonicity assumption is assumed on the nonlinear term, the state equation may admit several solutions for one control. To obtain optimality conditions for an optimal pair, the multiplicity and singularity/degeneracy of the state equation need to be handled respectivel… ▽ More In this paper, an optimal control problem governed by a class of p-Laplacian elliptic equations is studied. In particular, as no monotonicity assumption is assumed on the nonlinear term, the state equation may admit several solutions for one control. To obtain optimality conditions for an optimal pair, the multiplicity and singularity/degeneracy of the state equation need to be handled respectively. For this reason, penalization problems and approximation problems are introduced. Finally the main result is proved by a series of process of taking to the limits. △ Less

Submitted 27 August, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

Comments: 21 pages

MSC Class: 49K20; 35J70

arXiv:1812.01158 [pdf, other]

doi 10.1145/3360578

Aroma: Code Recommendation via Structural Code Search

Authors: Sifei Luan, Di Yang, Celeste Barnaby, Koushik Sen, Satish Chandra

Abstract: Programmers often write code that has similarity to existing code written somewhere. A tool that could help programmers to search such similar code would be immensely useful. Such a tool could help programmers to extend partially written code snippets to completely implement necessary functionality, help to discover extensions to the partial code which are commonly included by other programmers, h… ▽ More Programmers often write code that has similarity to existing code written somewhere. A tool that could help programmers to search such similar code would be immensely useful. Such a tool could help programmers to extend partially written code snippets to completely implement necessary functionality, help to discover extensions to the partial code which are commonly included by other programmers, help to cross-check against similar code written by other programmers, or help to add extra code which would fix common mistakes and errors. We propose Aroma, a tool and technique for code recommendation via structural code search. Aroma indexes a huge code corpus including thousands of open-source projects, takes a partial code snippet as input, searches the corpus for method bodies containing the partial code snippet, and clusters and intersects the results of the search to recommend a small set of succinct code snippets which both contain the query snippet and appear as part of several methods in the corpus. We evaluated Aroma on 2000 randomly selected queries created from the corpus, as well as 64 queries derived from code snippets obtained from Stack Overflow, a popular website for discussing code. We implemented Aroma for 4 different languages, and developed an IDE plugin for Aroma. Furthermore, we conducted a study where we asked 12 programmers to complete programming tasks using Aroma, and collected their feedback. Our results indicate that Aroma is capable of retrieving and recommending relevant code snippets efficiently. △ Less

Submitted 17 October, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

Journal ref: Proc. ACM Program. Lang. 3, OOPSLA, Article 152 (October 2019), 28 pages

arXiv:1806.03853 [pdf, other]

Object detection and tracking benchmark in industry based on improved correlation filter

Authors: Shangzhen Luan, Yan Li, Xiaodi Wang, Baochang Zhang

Abstract: Real-time object detection and tracking have shown to be the basis of intelligent production for industrial 4.0 applications. It is a challenging task because of various distorted data in complex industrial setting. The correlation filter (CF) has been used to trade off the low-cost computation and high performance. However, traditional CF training strategy can not get satisfied performance for th… ▽ More Real-time object detection and tracking have shown to be the basis of intelligent production for industrial 4.0 applications. It is a challenging task because of various distorted data in complex industrial setting. The correlation filter (CF) has been used to trade off the low-cost computation and high performance. However, traditional CF training strategy can not get satisfied performance for the various industrial data; because the simple sampling(bagging) during training process will not find the exact solutions in a data space with a large diversity. In this paper, we propose Dijkstra-distance based correlation filters (DBCF), which establishes a new learning framework that embeds distribution-related constraints into the multi-channel correlation filters (MCCF). DBCF is able to handle the huge variations existing in the industrial data by improving those constraints based on the shortest path among all solutions. To evaluate DBCF, we build a new dataset as the benchmark for industrial 4.0 application. Extensive experiments demonstrate that DBCF produces high performance and exceeds the state-of-the-art methods. The dataset and source code can be found at https://github.com/bczhangbczhang △ Less

Submitted 11 June, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

arXiv:1711.04192 [pdf, other]

doi 10.1109/TIP.2017.2775060

Latent Constrained Correlation Filter

Authors: Baochang Zhang, Shangzhen Luan, Chen Chen, Jungong Han, Wei Wang, Alessandro Perina, Ling Shao

Abstract: Correlation filters are special classifiers designed for shift-invariant object recognition, which are robust to pattern distortions. The recent literature shows that combining a set of sub-filters trained based on a single or a small group of images obtains the best performance. The idea is equivalent to estimating variable distribution based on the data sampling (bagging), which can be interpret… ▽ More Correlation filters are special classifiers designed for shift-invariant object recognition, which are robust to pattern distortions. The recent literature shows that combining a set of sub-filters trained based on a single or a small group of images obtains the best performance. The idea is equivalent to estimating variable distribution based on the data sampling (bagging), which can be interpreted as finding solutions (variable distribution approximation) directly from sampled data space. However, this methodology fails to account for the variations existed in the data. In this paper, we introduce an intermediate step -- solution sampling -- after the data sampling step to form a subspace, in which an optimal solution can be estimated. More specifically, we propose a new method, named latent constrained correlation filters (LCCF), by map** the correlation filters to a given latent subspace, and develop a new learning framework in the latent subspace that embeds distribution-related constraints into the original problem. To solve the optimization problem, we introduce a subspace based alternating direction method of multipliers (SADMM), which is proven to converge at the saddle point. Our approach is successfully applied to three different tasks, including eye localization, car detection and object tracking. Extensive experiments demonstrate that LCCF outperforms the state-of-the-art methods. The source code will be publicly available. https://github.com/bczhangbczhang/. △ Less

Submitted 11 November, 2017; originally announced November 2017.

arXiv:1710.02624 [pdf, ps, other]

doi 10.1364/OPTICA.4.001086

Extreme case of Faraday effect: magnetic splitting of ultrashort laser pulses in plasmas

Authors: Suming Weng, Qian Zhao, Zhengming Sheng, Wei Yu, Shixia Luan, Min Chen, Lule Yu, Masakatsu Murakami, Warren B. Mori, Jie Zhang

Abstract: The Faraday effect, caused by a magnetic-field-induced change in the optical properties, takes place in a vast variety of systems from a single atomic layer of graphenes to huge galaxies. Currently, it plays a pivot role in many applications such as the manipulation of light and the probing of magnetic fields and material's properties. Basically, this effect causes a polarization rotation of light… ▽ More The Faraday effect, caused by a magnetic-field-induced change in the optical properties, takes place in a vast variety of systems from a single atomic layer of graphenes to huge galaxies. Currently, it plays a pivot role in many applications such as the manipulation of light and the probing of magnetic fields and material's properties. Basically, this effect causes a polarization rotation of light during its propagation along the magnetic field in a medium. Here, we report an extreme case of the Faraday effect where a linearly polarized ultrashort laser pulse splits in time into two circularly polarized pulses of opposite handedness during its propagation in a highly magnetized plasma. This offers a new degree of freedom for manipulating ultrashort and ultrahigh power laser pulses. Together with technologies of ultra-strong magnetic fields, it may pave the way for novel optical devices, such as magnetized plasma polarizers. In addition, it may offer a powerful means to measure strong magnetic fields in laser-produced plasmas. △ Less

Submitted 6 October, 2017; originally announced October 2017.

Comments: 18 pages, 5 figures

Journal ref: Optica, Vol. 4, Issue 9, pp. 1086-1091 (2017)

arXiv:1705.01450 [pdf]

doi 10.1109/TIP.2018.2835143

Gabor Convolutional Networks

Authors: Shangzhen Luan, Baochang Zhang, Chen Chen, Xianbin Cao, Jungong Han, Jianzhuang Liu

Abstract: Steerable properties dominate the design of traditional filters, e.g., Gabor filters, and endow features the capability of dealing with spatial transformations. However, such excellent properties have not been well explored in the popular deep convolutional neural networks (DCNNs). In this paper, we propose a new deep model, termed Gabor Convolutional Networks (GCNs or Gabor CNNs), which incorpora… ▽ More Steerable properties dominate the design of traditional filters, e.g., Gabor filters, and endow features the capability of dealing with spatial transformations. However, such excellent properties have not been well explored in the popular deep convolutional neural networks (DCNNs). In this paper, we propose a new deep model, termed Gabor Convolutional Networks (GCNs or Gabor CNNs), which incorporates Gabor filters into DCNNs to enhance the resistance of deep learned features to the orientation and scale changes. By only manipulating the basic element of DCNNs based on Gabor filters, i.e., the convolution operator, GCNs can be easily implemented and are compatible with any popular deep learning architecture. Experimental results demonstrate the super capability of our algorithm in recognizing objects, where the scale and rotation changes occur frequently. The proposed GCNs have much fewer learnable network parameters, and thus is easier to train with an end-to-end pipeline. △ Less

Submitted 28 March, 2023; v1 submitted 3 May, 2017; originally announced May 2017.

arXiv:1701.01763 [pdf]

SOI RF Switch for Wireless Sensor Network

Authors: Wei Cai, Cheng Li, ShiWei Luan

Abstract: The objective of this research was to design a 0-5 GHz RF SOI switch, with 0.18um power Jazz SOI technology by using Cadence software, for health care applications. This paper introduces the design of a RF switch implemented in shunt-series topology. An insertion loss of 0.906 dB and an isolation of 30.95 dB were obtained at 5 GHz. The switch also achieved a third order distortion of 53.05 dBm and… ▽ More The objective of this research was to design a 0-5 GHz RF SOI switch, with 0.18um power Jazz SOI technology by using Cadence software, for health care applications. This paper introduces the design of a RF switch implemented in shunt-series topology. An insertion loss of 0.906 dB and an isolation of 30.95 dB were obtained at 5 GHz. The switch also achieved a third order distortion of 53.05 dBm and 1 dB compression point reached 50.06dBm. The RF switch performance meets the desired specification requirements. △ Less

Submitted 6 January, 2017; originally announced January 2017.

arXiv:1606.02170

Latent Constrained Correlation Filters for Object Localization

Authors: Shangzhen Luan, Baochang Zhang, Jungong Han, Chen Chen, Ling Shao, Alessandro Perina, Linlin Shen

Abstract: There is a neglected fact in the traditional machine learning methods that the data sampling can actually lead to the solution sampling. We consider this observation to be important because having the solution sampling available makes the variable distribution estimation, which is a problem in many learning-related applications, more tractable. In this paper, we implement this idea on correlation… ▽ More There is a neglected fact in the traditional machine learning methods that the data sampling can actually lead to the solution sampling. We consider this observation to be important because having the solution sampling available makes the variable distribution estimation, which is a problem in many learning-related applications, more tractable. In this paper, we implement this idea on correlation filter, which has attracted much attention in the past few years due to its high performance with a low computational cost. More specifically, we propose a new method, named latent constrained correlation filters (LCCF) by map** the correlation filters to a given latent subspace, in which we establish a new learning framework that embeds distribution-related constraints into the original problem. We further introduce a subspace based alternating direction method of multipliers (SADMM) to efficiently solve the optimization problem, which is proved to converge at the saddle point. Our approach is successfully applied to two different tasks inclduing eye localization and car detection. Extensive experiments demonstrate that LCCF outperforms the state-of-the-art methods when samples are suffered from noise and occlusion. △ Less

Submitted 3 June, 2017; v1 submitted 7 June, 2016; originally announced June 2016.

Comments: There are small problems and theories need to be perfected

arXiv:1602.08682 [pdf, ps, other]

doi 10.1088/1361-6587/aa69a9

The controllable super-high energetic electrons by external magnetic fields at relativistic laser-solid interactions in the presence of large scale pre-plasmas

Authors: D. Wu, S. I. Krasheninnikov, S. X. Luan, W. Yu

Abstract: The two stage electron acceleration model [arXiv: 1512.02411 and arXiv: 1512.07546] is extended to the study of laser magnetized-plasmas interactions at relativistic intensities and in the presence of large-scale preformed plasmas. It is shown that the cut-off electron kinetic energy is controllable by the external magnetic field strength and directions. Further studies indicate that for a right-h… ▽ More The two stage electron acceleration model [arXiv: 1512.02411 and arXiv: 1512.07546] is extended to the study of laser magnetized-plasmas interactions at relativistic intensities and in the presence of large-scale preformed plasmas. It is shown that the cut-off electron kinetic energy is controllable by the external magnetic field strength and directions. Further studies indicate that for a right-hand circularly polarized laser (RH-CP) of intensity $10^{20}\ \text{W}/\text{cm}^2$ and pre-plasma scale length $10\ μ\text{m}$, the cut-off electron kinetic energy can be as high as $500\ \text{MeV}$, when a homogeneous external magnetic field of exceeding $10000\ \text{T}$ (or $B=ω_{c}/ω_0>1$) is loaded along the laser propagation direction, which is a significant increase compared with that $120\ \text{MeV}$ without external magnetic field. A laser front sharpening mechanism is identified at relativistic laser magnetized-plasmas interactions with $B=ω_{c}/ω_0>1$, which is responsible for these super-high energetic electrons. △ Less

Submitted 28 February, 2016; originally announced February 2016.

Comments: 4 pages, 4 figures

arXiv:1512.07546 [pdf, ps, other]

doi 10.1063/1.4972539

The generation of unexpected super-high energetic electrons at relativistic circularly polarized laser-solid interactions in the presence of large scale pre-plasmas

Authors: D. Wu, S. I. Krasheninnikov, S. X. Luan, W. Yu

Abstract: As an extension of the previous work [ar** other conditions the same. It is found that in the presence of large scale pre-formed plasmas, super-high energetic electrons can be generated at relativistic CP laser-solid interactions. For laser of intensity $10^{20}\ \text{W}/\text{cm}^2$ and pre-plasm… ▽ More As an extension of the previous work [ar** other conditions the same. It is found that in the presence of large scale pre-formed plasmas, super-high energetic electrons can be generated at relativistic CP laser-solid interactions. For laser of intensity $10^{20}\ \text{W}/\text{cm}^2$ and pre-plasma scale-length $10\ μ\text{m}$, the cut-off energy of electron by CP laser is $120\ \text{MeV}$ compared with $100\ \text{MeV}$ in the case of linearly polarized (LP) laser. The unexpected super-high energetic electron acceleration can also be explained by the two-stage acceleration model, {by taking into account the envelop modulation effects of the reflected CP laser pulse.} The underlying physics of this envelop modulation is figured out, and a modified first-stage electron acceleration scaling law in the presence of the modulated-CP laser is also obtained. △ Less

Submitted 23 February, 2016; v1 submitted 23 December, 2015; originally announced December 2015.

Comments: 5 pages, 8 figures

arXiv:1512.02411 [pdf, ps, other]

doi 10.1088/0029-5515/57/1/016007

Identifying the source of super-high energetic electrons in the presence of pre-plasma in laser-matter interaction at relativistic intensities

Authors: D. Wu, S. I. Krasheninnikov, S. X. Luan, W. Yu

Abstract: The generation of super-high energetic electrons influenced by pre-plasma at relativistic intensity laser-matter interaction is studied in a one-dimensional slab approximation with particle-in-cell simulations. Different pre-plasma scale-lengths of $1\ μ\text{m}$, $5\ μ\text{m}$, $10\ μ\text{m}$ and $15\ μ\text{m}$ are considered, showing an increase in both particle number and cut-off kinetic ene… ▽ More The generation of super-high energetic electrons influenced by pre-plasma at relativistic intensity laser-matter interaction is studied in a one-dimensional slab approximation with particle-in-cell simulations. Different pre-plasma scale-lengths of $1\ μ\text{m}$, $5\ μ\text{m}$, $10\ μ\text{m}$ and $15\ μ\text{m}$ are considered, showing an increase in both particle number and cut-off kinetic energy of electrons with the increase of pre-plasma scale-length, and the cut-off kinetic energy greatly exceeding the corresponding laser ponderomotive energy. A two-stage electron acceleration model is proposed to explain the underlying physics. The first stage is attributed to the synergetic acceleration by longitudinal electric field and laser pulse, with its efficiency depending on the pre-plasma scale-length. These electrons pre-accelerated in the first stage could build up an intense electrostatic potential barrier with its maximal value several times as large of the initial electron kinetic energy. Part of energetic electrons could be further accelerated by the reflection off the electrostatic potential barrier, with their finial kinetic energies significantly higher than the values pre-accelerated in the first stage. △ Less

Submitted 23 December, 2015; v1 submitted 8 December, 2015; originally announced December 2015.

Comments: 10 pages, 9 figures

arXiv:1009.5282 [pdf, other]

Radiation therapy calculations using an on-demand virtual cluster via cloud computing

Authors: Roy W. Keyes, Christian Romano, Dorian Arnold, Shuang Luan

Abstract: Computer hardware costs are the limiting factor in producing highly accurate radiation dose calculations on convenient time scales. Because of this, large-scale, full Monte Carlo simulations and other resource intensive algorithms are often considered infeasible for clinical settings. The emerging cloud computing paradigm promises to fundamentally alter the economics of such calculations by provid… ▽ More Computer hardware costs are the limiting factor in producing highly accurate radiation dose calculations on convenient time scales. Because of this, large-scale, full Monte Carlo simulations and other resource intensive algorithms are often considered infeasible for clinical settings. The emerging cloud computing paradigm promises to fundamentally alter the economics of such calculations by providing relatively cheap, on-demand, pay-as-you-go computing resources over the Internet. We believe that cloud computing will usher in a new era, in which very large scale calculations will be routinely performed by clinics and researchers using cloud-based resources. In this research, several proof-of-concept radiation therapy calculations were successfully performed on a cloud-based virtual Monte Carlo cluster. Performance evaluations were made of a distributed processing framework developed specifically for this project. The expected 1/n performance was observed with some caveats. The economics of cloud-based virtual computing clusters versus traditional in-house hardware is also discussed. For most situations, cloud computing can provide a substantial cost savings for distributed calculations. △ Less

Submitted 27 September, 2010; originally announced September 2010.

Comments: 12 pages, 4 figures

arXiv:0905.4930 [pdf, ps, other]

doi 10.1016/j.ipl.2010.12.011

Improved Approximation Algorithms for Segment Minimization in Intensity Modulated Radiation Therapy

Authors: Therese Biedl, Stephane Durocher, Holger H. Hoos, Shuang Luan, Jared Saia, Maxwell Young

Abstract: he segment minimization problem consists of finding the smallest set of integer matrices that sum to a given intensity matrix, such that each summand has only one non-zero value, and the non-zeroes in each row are consecutive. This has direct applications in intensity-modulated radiation therapy, an effective form of cancer treatment. We develop three approximation algorithms for matrices with a… ▽ More he segment minimization problem consists of finding the smallest set of integer matrices that sum to a given intensity matrix, such that each summand has only one non-zero value, and the non-zeroes in each row are consecutive. This has direct applications in intensity-modulated radiation therapy, an effective form of cancer treatment. We develop three approximation algorithms for matrices with arbitrarily many rows. Our first two algorithms improve the approximation factor from the previous best of $1+\log_2 h $ to (roughly) $3/2 \cdot (1+\log_3 h)$ and $11/6\cdot(1+\log_4{h})$, respectively, where $h$ is the largest entry in the intensity matrix. We illustrate the limitations of the specific approach used to obtain these two algorithms by proving a lower bound of $\frac{(2b-2)}{b}\cdot\log_b{h} + \frac{1}{b}$ on the approximation guarantee. Our third algorithm improves the approximation factor from $2 \cdot (\log D+1)$ to $24/13 \cdot (\log D+1)$, where $D$ is (roughly) the largest difference between consecutive elements of a row of the intensity matrix. Finally, experimentation with these algorithms shows that they perform well with respect to the optimum and outperform other approximation algorithms on 77% of the 122 test cases we consider, which include both real world and synthetic data. △ Less

Submitted 2 September, 2009; v1 submitted 29 May, 2009; originally announced May 2009.

Comments: 18 pages

Journal ref: "A Note on Improving the Performance of Approximation Algorithms for Radiation Therapy''. Information Processing Letters, 111(7), 326-333, 2011

Showing 1–38 of 38 results for author: Luan, S