-
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method
Authors:
Junyu Zhang,
Chengzhuo Ni,
Zheng Yu,
Csaba Szepesvari,
Mengdi Wang
Abstract:
Policy gradient (PG) gives rise to a rich class of reinforcement learning (RL) methods. Recently, there has been an emerging trend to accelerate the existing PG methods such as REINFORCE by the \emph{variance reduction} techniques. However, all existing variance-reduced PG methods heavily rely on an uncheckable importance weight assumption made for every single iteration of the algorithms. In this…
▽ More
Policy gradient (PG) gives rise to a rich class of reinforcement learning (RL) methods. Recently, there has been an emerging trend to accelerate the existing PG methods such as REINFORCE by the \emph{variance reduction} techniques. However, all existing variance-reduced PG methods heavily rely on an uncheckable importance weight assumption made for every single iteration of the algorithms. In this paper, a simple gradient truncation mechanism is proposed to address this issue. Moreover, we design a Truncated Stochastic Incremental Variance-Reduced Policy Gradient (TSIVR-PG) method, which is able to maximize not only a cumulative sum of rewards but also a general utility function over a policy's long-term visiting distribution. We show an $\tilde{\mathcal{O}}(ε^{-3})$ sample complexity for TSIVR-PG to find an $ε$-stationary policy. By assuming the overparameterizaiton of policy and exploiting the hidden convexity of the problem, we further show that TSIVR-PG converges to global $ε$-optimal policy with $\tilde{\mathcal{O}}(ε^{-2})$ samples.
△ Less
Submitted 27 May, 2021; v1 submitted 17 February, 2021;
originally announced February 2021.
-
Spin-crossover induced ferromagnetism and layer stacking-order change in pressurized 2D antiferromagnet MnPS$_3$
Authors:
Hanxing Zhang,
Cao** Ni,
Jie Zhang,
Liangjian Zou,
Zhi Zeng,
Xianlong Wang
Abstract:
High-pressure properties of MnPS$_3$ are investigated by using the hybrid functional, we report a spin-crossover pressure of 35 GPa consisting with experimental observation (30 GPa), less than half of existing report (63 GPa) using the Hubbard U correction. Interestingly, a spin-crossover induced antiferromagnetism-to-ferromagnetism transition combined with stacking-order change from monoclinic to…
▽ More
High-pressure properties of MnPS$_3$ are investigated by using the hybrid functional, we report a spin-crossover pressure of 35 GPa consisting with experimental observation (30 GPa), less than half of existing report (63 GPa) using the Hubbard U correction. Interestingly, a spin-crossover induced antiferromagnetism-to-ferromagnetism transition combined with stacking-order change from monoclinic to rhombohedral are founded, and the ferromagnetism origins from the partially occupied $t_{2g}$ orbitals. Different from previous understanding, the Mott metal-insulator transition of MnPS$_3$ does not occur simultaneously with the spin-crossover but in pressurized low-spin phase.
△ Less
Submitted 15 August, 2020;
originally announced August 2020.
-
Optimal Asset Allocation For Outperforming A Stochastic Benchmark Target
Authors:
Chendi Ni,
Yuying Li,
Peter Forsyth,
Ray Carroll
Abstract:
We propose a data-driven Neural Network (NN) optimization framework to determine the optimal multi-period dynamic asset allocation strategy for outperforming a general stochastic target. We formulate the problem as an optimal stochastic control with an asymmetric, distribution sha**, objective function. The proposed framework is illustrated with the asset allocation problem in the accumulation p…
▽ More
We propose a data-driven Neural Network (NN) optimization framework to determine the optimal multi-period dynamic asset allocation strategy for outperforming a general stochastic target. We formulate the problem as an optimal stochastic control with an asymmetric, distribution sha**, objective function. The proposed framework is illustrated with the asset allocation problem in the accumulation phase of a defined contribution pension plan, with the goal of achieving a higher terminal wealth than a stochastic benchmark. We demonstrate that the data-driven approach is capable of learning an adaptive asset allocation strategy directly from historical market returns, without assuming any parametric model of the financial market dynamics. Following the optimal adaptive strategy, investors can make allocation decisions simply depending on the current state of the portfolio. The optimal adaptive strategy outperforms the benchmark constant proportion strategy, achieving a higher terminal wealth with a 90% probability, a 46% higher median terminal wealth, and a significantly more right-skewed terminal wealth distribution. We further demonstrate the robustness of the optimal adaptive strategy by testing the performance of the strategy on bootstrap resampled market data, which has different distributions compared to the training data.
△ Less
Submitted 27 June, 2020;
originally announced June 2020.
-
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning
Authors:
Zhi** Zeng,
Van Tung Pham,
Haihua Xu,
Yerbolat Khassanov,
Eng Siong Chng,
Chongjia Ni,
Bin Ma
Abstract:
In this work, we study leveraging extra text data to improve low-resource end-to-end ASR under cross-lingual transfer learning setting. To this end, we extend our prior work [1], and propose a hybrid Transformer-LSTM based architecture. This architecture not only takes advantage of the highly effective encoding capacity of the Transformer network but also benefits from extra text data due to the L…
▽ More
In this work, we study leveraging extra text data to improve low-resource end-to-end ASR under cross-lingual transfer learning setting. To this end, we extend our prior work [1], and propose a hybrid Transformer-LSTM based architecture. This architecture not only takes advantage of the highly effective encoding capacity of the Transformer network but also benefits from extra text data due to the LSTM-based independent language model network. We conduct experiments on our in-house Malay corpus which contains limited labeled data and a large amount of extra text. Results show that the proposed architecture outperforms the previous LSTM-based architecture [1] by 24.2% relative word error rate (WER) when both are trained using limited labeled data. Starting from this, we obtain further 25.4% relative WER reduction by transfer learning from another resource-rich language. Moreover, we obtain additional 13.6% relative WER reduction by boosting the LSTM decoder of the transferred model with the extra text data. Overall, our best model outperforms the vanilla Transformer ASR by 11.9% relative WER. Last but not least, the proposed hybrid architecture offers much faster inference compared to both LSTM and Transformer architectures.
△ Less
Submitted 28 May, 2020; v1 submitted 20 May, 2020;
originally announced May 2020.
-
Layered Graph Embedding for Entity Recommendation using Wikipedia in the Yahoo! Knowledge Graph
Authors:
Chien-Chun Ni,
Kin Sum Liu,
Nicolas Torzec
Abstract:
In this paper, we describe an embedding-based entity recommendation framework for Wikipedia that organizes Wikipedia into a collection of graphs layered on top of each other, learns complementary entity representations from their topology and content, and combines them with a lightweight learning-to-rank approach to recommend related entities on Wikipedia. Through offline and online evaluations, w…
▽ More
In this paper, we describe an embedding-based entity recommendation framework for Wikipedia that organizes Wikipedia into a collection of graphs layered on top of each other, learns complementary entity representations from their topology and content, and combines them with a lightweight learning-to-rank approach to recommend related entities on Wikipedia. Through offline and online evaluations, we show that the resulting embeddings and recommendations perform well in terms of quality and user engagement. Balancing simplicity and quality, this framework provides default entity recommendations for English and other languages in the Yahoo! Knowledge Graph, which Wikipedia is a core subset of.
△ Less
Submitted 14 April, 2020;
originally announced April 2020.
-
SYMBA: An end-to-end VLBI synthetic data generation pipeline
Authors:
F. Roelofs,
M. Janssen,
I. Natarajan,
R. Deane,
J. Davelaar,
H. Olivares,
O. Porth,
S. N. Paine,
K. L. Bouman,
R. P. J. Tilanus,
I. M. van Bemmel,
H. Falcke,
K. Akiyama,
A. Alberdi,
W. Alef,
K. Asada,
R. Azulay,
A. Baczko,
D. Ball,
M. Baloković,
J. Barrett,
D. Bintley,
L. Blackburn,
W. Boland,
G. C. Bower
, et al. (183 additional authors not shown)
Abstract:
Realistic synthetic observations of theoretical source models are essential for our understanding of real observational data. In using synthetic data, one can verify the extent to which source parameters can be recovered and evaluate how various data corruption effects can be calibrated. These studies are important when proposing observations of new sources, in the characterization of the capabili…
▽ More
Realistic synthetic observations of theoretical source models are essential for our understanding of real observational data. In using synthetic data, one can verify the extent to which source parameters can be recovered and evaluate how various data corruption effects can be calibrated. These studies are important when proposing observations of new sources, in the characterization of the capabilities of new or upgraded instruments, and when verifying model-based theoretical predictions in a comparison with observational data. We present the SYnthetic Measurement creator for long Baseline Arrays (SYMBA), a novel synthetic data generation pipeline for Very Long Baseline Interferometry (VLBI) observations. SYMBA takes into account several realistic atmospheric, instrumental, and calibration effects. We used SYMBA to create synthetic observations for the Event Horizon Telescope (EHT), a mm VLBI array, which has recently captured the first image of a black hole shadow. After testing SYMBA with simple source and corruption models, we study the importance of including all corruption and calibration effects. Based on two example general relativistic magnetohydrodynamics (GRMHD) model images of M87, we performed case studies to assess the attainable image quality with the current and future EHT array for different weather conditions. The results show that the effects of atmospheric and instrumental corruptions on the measured visibilities are significant. Despite these effects, we demonstrate how the overall structure of the input models can be recovered robustly after performing calibration steps. With the planned addition of new stations to the EHT array, images could be reconstructed with higher angular resolution and dynamic range. In our case study, these improvements allowed for a distinction between a thermal and a non-thermal GRMHD model based on salient features in reconstructed images.
△ Less
Submitted 2 April, 2020;
originally announced April 2020.
-
Verification benchmarks for single-phase flow in three-dimensional fractured porous media
Authors:
Inga Berre,
Wietse M. Boon,
Bernd Flemisch,
Alessio Fumagalli,
Dennis Gläser,
Eirik Keilegavlen,
Anna Scotti,
Ivar Stefansson,
Alexandru Tatomir,
Konstantin Brenner,
Samuel Burbulla,
Philippe Devloo,
Omar Duran,
Marco Favino,
Julian Hennicker,
I-Hsien Lee,
Konstantin Lipnikov,
Roland Masson,
Klaus Mosthaf,
Maria Giuseppina Chiara Nestola,
Chuen-Fa Ni,
Kirill Nikitin,
Philipp Schädle,
Daniil Svyatskiy,
Ruslan Yanbarisov
, et al. (1 additional authors not shown)
Abstract:
Flow in fractured porous media occurs in the earth's subsurface, in biological tissues, and in man-made materials. Fractures have a dominating influence on flow processes, and the last decade has seen an extensive development of models and numerical methods that explicitly account for their presence. To support these developments, we present a portfolio of four benchmark cases for single-phase flo…
▽ More
Flow in fractured porous media occurs in the earth's subsurface, in biological tissues, and in man-made materials. Fractures have a dominating influence on flow processes, and the last decade has seen an extensive development of models and numerical methods that explicitly account for their presence. To support these developments, we present a portfolio of four benchmark cases for single-phase flow in three-dimensional fractured porous media. The cases are specifically designed to test the methods' capabilities in handling various complexities common to the geometrical structures of fracture networks. Based on an open call for participation, results obtained with 17 numerical methods were collected. This paper presents the underlying mathematical model, an overview of the features of the participating numerical methods, and their performance in solving the benchmark cases.
△ Less
Submitted 17 February, 2020;
originally announced February 2020.
-
Independent language modeling architecture for end-to-end ASR
Authors:
Van Tung Pham,
Haihua Xu,
Yerbolat Khassanov,
Zhi** Zeng,
Eng Siong Chng,
Chongjia Ni,
Bin Ma,
Haizhou Li
Abstract:
The attention-based end-to-end (E2E) automatic speech recognition (ASR) architecture allows for joint optimization of acoustic and language models within a single network. However, in a vanilla E2E ASR architecture, the decoder sub-network (subnet), which incorporates the role of the language model (LM), is conditioned on the encoder output. This means that the acoustic encoder and the language mo…
▽ More
The attention-based end-to-end (E2E) automatic speech recognition (ASR) architecture allows for joint optimization of acoustic and language models within a single network. However, in a vanilla E2E ASR architecture, the decoder sub-network (subnet), which incorporates the role of the language model (LM), is conditioned on the encoder output. This means that the acoustic encoder and the language model are entangled that doesn't allow language model to be trained separately from external text data. To address this problem, in this work, we propose a new architecture that separates the decoder subnet from the encoder output. In this way, the decoupled subnet becomes an independently trainable LM subnet, which can easily be updated using the external text data. We study two strategies for updating the new architecture. Experimental results show that, 1) the independent LM architecture benefits from external text data, achieving 9.3% and 22.8% relative character and word error rate reduction on Mandarin HKUST and English NSC datasets respectively; 2)the proposed architecture works well with external LM and can be generalized to different amount of labelled data.
△ Less
Submitted 25 November, 2019;
originally announced December 2019.
-
Revisiting Heterogeneous Defect Prediction: How Far Are We?
Authors:
Xiang Chen,
Yanzhou Mu,
Chao Ni,
Zhanqi Cui
Abstract:
Until now, researchers have proposed several novel heterogeneous defect prediction HDP methods with promising performance. To the best of our knowledge, whether HDP methods can perform significantly better than unsupervised methods has not yet been thoroughly investigated. In this article, we perform a replication study to have a holistic look in this issue. In particular, we compare state-of-the-…
▽ More
Until now, researchers have proposed several novel heterogeneous defect prediction HDP methods with promising performance. To the best of our knowledge, whether HDP methods can perform significantly better than unsupervised methods has not yet been thoroughly investigated. In this article, we perform a replication study to have a holistic look in this issue. In particular, we compare state-of-the-art five HDP methods with five unsupervised methods. Final results surprisingly show that these HDP methods do not perform significantly better than some of unsupervised methods (especially the simple unsupervised methods proposed by Zhou et al.) in terms of two non-effort-aware performance measures and four effort-aware performance measures. Then, we perform diversity analysis on defective modules via McNemar's test and find the prediction diversity is more obvious when the comparison is performed between the HDP methods and the unsupervised methods than the comparisons only between the HDP methods or between the unsupervised methods. This shows the HDP methods and the unsupervised methods are complementary to each other in identifying defective models to some extent. Finally, we investigate the feasibility of five HDP methods by considering two satisfactory criteria recommended by previous CPDP studies and find the satisfactory ratio of these HDP methods is still pessimistic. The above empirical results implicate there is still a long way for heterogeneous defect prediction to go. More effective HDP methods need to be designed and the unsupervised methods should be considered as baselines.
△ Less
Submitted 18 August, 2019;
originally announced August 2019.
-
Topology Based Scalable Graph Kernels
Authors:
Kin Sum Liu,
Chien-Chun Ni,
Yu-Yao Lin,
Jie Gao
Abstract:
We propose a new graph kernel for graph classification and comparison using Ollivier Ricci curvature. The Ricci curvature of an edge in a graph describes the connectivity in the local neighborhood. An edge in a densely connected neighborhood has positive curvature and an edge serving as a local bridge has negative curvature. We use the edge curvature distribution to form a graph kernel which is th…
▽ More
We propose a new graph kernel for graph classification and comparison using Ollivier Ricci curvature. The Ricci curvature of an edge in a graph describes the connectivity in the local neighborhood. An edge in a densely connected neighborhood has positive curvature and an edge serving as a local bridge has negative curvature. We use the edge curvature distribution to form a graph kernel which is then used to compare and cluster graphs. The curvature kernel uses purely the graph topology and thereby works for settings when node attributes are not available.
△ Less
Submitted 14 July, 2019;
originally announced July 2019.
-
Community Detection on Networks with Ricci Flow
Authors:
Chien-Chun Ni,
Yu-Yao Lin,
Feng Luo,
Jie Gao
Abstract:
Many complex networks in the real world have community structures -- groups of well-connected nodes with important functional roles. It has been well recognized that the identification of communities bears numerous practical applications. While existing approaches mainly apply statistical or graph theoretical/combinatorial methods for community detection, in this paper, we present a novel geometri…
▽ More
Many complex networks in the real world have community structures -- groups of well-connected nodes with important functional roles. It has been well recognized that the identification of communities bears numerous practical applications. While existing approaches mainly apply statistical or graph theoretical/combinatorial methods for community detection, in this paper, we present a novel geometric approach which enables us to borrow powerful classical geometric methods and properties. By considering networks as geometric objects and communities in a network as a geometric decomposition, we apply curvature and discrete Ricci flow, which have been used to decompose smooth manifolds with astonishing successes in mathematics, to break down communities in networks. We tested our method on networks with ground-truth community structures, and experimentally confirmed the effectiveness of this geometric approach.
△ Less
Submitted 9 July, 2019;
originally announced July 2019.
-
Learning to Control in Metric Space with Optimal Regret
Authors:
Lin F. Yang,
Chengzhuo Ni,
Mengdi Wang
Abstract:
We study online reinforcement learning for finite-horizon deterministic control systems with {\it arbitrary} state and action spaces. Suppose that the transition dynamics and reward function is unknown, but the state and action space is endowed with a metric that characterizes the proximity between different states and actions. We provide a surprisingly simple upper-confidence reinforcement learni…
▽ More
We study online reinforcement learning for finite-horizon deterministic control systems with {\it arbitrary} state and action spaces. Suppose that the transition dynamics and reward function is unknown, but the state and action space is endowed with a metric that characterizes the proximity between different states and actions. We provide a surprisingly simple upper-confidence reinforcement learning algorithm that uses a function approximation oracle to estimate optimistic Q functions from experiences. We show that the regret of the algorithm after $K$ episodes is $O(HL(KH)^{\frac{d-1}{d}}) $ where $L$ is a smoothness parameter, and $d$ is the doubling dimension of the state-action space with respect to the given metric. We also establish a near-matching regret lower bound. The proposed method can be adapted to work for more structured transition systems, including the finite-state case and the case where value functions are linear combinations of features, where the method also achieve the optimal regret.
△ Less
Submitted 4 May, 2019;
originally announced May 2019.
-
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data
Authors:
Yerbolat Khassanov,
Haihua Xu,
Van Tung Pham,
Zhi** Zeng,
Eng Siong Chng,
Chongjia Ni,
Bin Ma
Abstract:
The lack of code-switch training data is one of the major concerns in the development of end-to-end code-switching automatic speech recognition (ASR) models. In this work, we propose a method to train an improved end-to-end code-switching ASR using only monolingual data. Our method encourages the distributions of output token embeddings of monolingual languages to be similar, and hence, promotes t…
▽ More
The lack of code-switch training data is one of the major concerns in the development of end-to-end code-switching automatic speech recognition (ASR) models. In this work, we propose a method to train an improved end-to-end code-switching ASR using only monolingual data. Our method encourages the distributions of output token embeddings of monolingual languages to be similar, and hence, promotes the ASR model to easily code-switch between languages. Specifically, we propose to use Jensen-Shannon divergence and cosine distance based constraints. The former will enforce output embeddings of monolingual languages to possess similar distributions, while the later simply brings the centroids of two distributions to be close to each other. Experimental results demonstrate high effectiveness of the proposed method, yielding up to 4.5% absolute mixed error rate improvement on Mandarin-English code-switching ASR task.
△ Less
Submitted 31 July, 2019; v1 submitted 7 April, 2019;
originally announced April 2019.
-
On the Calibration of Multiclass Classification with Rejection
Authors:
Chenri Ni,
Nontawat Charoenphakdee,
Junya Honda,
Masashi Sugiyama
Abstract:
We investigate the problem of multiclass classification with rejection, where a classifier can choose not to make a prediction to avoid critical misclassification. First, we consider an approach based on simultaneous training of a classifier and a rejector, which achieves the state-of-the-art performance in the binary case. We analyze this approach for the multiclass case and derive a general cond…
▽ More
We investigate the problem of multiclass classification with rejection, where a classifier can choose not to make a prediction to avoid critical misclassification. First, we consider an approach based on simultaneous training of a classifier and a rejector, which achieves the state-of-the-art performance in the binary case. We analyze this approach for the multiclass case and derive a general condition for calibration to the Bayes-optimal solution, which suggests that calibration is hard to achieve by general loss functions unlike the binary case. Next, we consider another traditional approach based on confidence scores, in which the existing work focuses on a specific class of losses. We propose rejection criteria for more general losses for this approach and guarantee calibration to the Bayes-optimal solution. Finally, we conduct experiments to validate the relevance of our theoretical findings.
△ Less
Submitted 29 October, 2019; v1 submitted 29 January, 2019;
originally announced January 2019.
-
Both cellular ATP level and ATP hydrolysis free energy determine energetically the calcium oscillation in pancreatic $β$-cell
Authors:
Yunsheng Sun,
Congjian Ni,
Yingda Ge,
Hong Qian,
Qi Ouyang,
Fangting Li
Abstract:
In pancreatic $β$-cells, calcium oscillation signal is the core part of glucose-stimulated insulin secretion. Intracellular calcium concentration oscillates in response to the intake of glucose, which triggers the exocytosis of insulin secretory granules. ATP plays a crucial part in this process. ATP increases as the result of glucose intake, then ATP binds to ATP-sensitive $K^+$ channels (…
▽ More
In pancreatic $β$-cells, calcium oscillation signal is the core part of glucose-stimulated insulin secretion. Intracellular calcium concentration oscillates in response to the intake of glucose, which triggers the exocytosis of insulin secretory granules. ATP plays a crucial part in this process. ATP increases as the result of glucose intake, then ATP binds to ATP-sensitive $K^+$ channels ($K_{ATP}$), depolarizes the cell and triggers calcium oscillation, while the ion pumps on the cell membrane consumes the free energy form ATP hydrolysis. Based on Betram et. al. 2004 model, we construct a kinetic models to analyze the thermodynamic characteristics of this system, to reveal how the ATP hydrolysis free energy affects the calcium oscillation. Our results suggest that bifurcation point is sensitive to both the free energy level and cellular ATP level, and the insufficient ATP energy supply would cause dysfunction of calcium oscillation.
△ Less
Submitted 29 November, 2018; v1 submitted 27 November, 2018;
originally announced November 2018.
-
Free Energy of ATP hydrolysis manipulates the cellular calcium signals
Authors:
Yingda Ge,
Congjian Ni,
Yunsheng Sun,
Fangting Li
Abstract:
In living cells, oscillation of the concentration of cytosolic Ca2+ is an important and pervasive signal for the intercellular and intracellular information conduction. To generate the oscillation, the hydrolysis of ATP is always needed. Many recent studies show that both ATP molecules themselves and the free energy by ATP hydrolysis play significant role in the biochemical process involving ATP h…
▽ More
In living cells, oscillation of the concentration of cytosolic Ca2+ is an important and pervasive signal for the intercellular and intracellular information conduction. To generate the oscillation, the hydrolysis of ATP is always needed. Many recent studies show that both ATP molecules themselves and the free energy by ATP hydrolysis play significant role in the biochemical process involving ATP hydrolysis. To verify the prediction, we consider the role of ATP molecules and their hydrolysis in a classic one-pool model of Ca2+ oscillation. Our results show that the available Gibbs free energy of ATP hydrolysis {\DeltaG}, which measures the "distance" of a reaction to its equilibrium state, is another important regulatory factor of the oscillation system besides the concentration of ATP. Furthermore, our model suggest a rudimental prediction of how the oscillation system changes in an aging cell, such as the decrease of the amplitude and the increase of the least {\DeltaG} required for the oscillation.
△ Less
Submitted 25 November, 2018;
originally announced November 2018.
-
Network Alignment by Discrete Ollivier-Ricci Flow
Authors:
Chien-Chun Ni,
Yu-Yao Lin,
Jie Gao,
Xianfeng David Gu
Abstract:
In this paper, we consider the problem of approximately aligning/matching two graphs. Given two graphs $G_{1}=(V_{1},E_{1})$ and $G_{2}=(V_{2},E_{2})$, the objective is to map nodes $u, v \in G_1$ to nodes $u',v'\in G_2$ such that when $u, v$ have an edge in $G_1$, very likely their corresponding nodes $u', v'$ in $G_2$ are connected as well. This problem with subgraph isomorphism as a special cas…
▽ More
In this paper, we consider the problem of approximately aligning/matching two graphs. Given two graphs $G_{1}=(V_{1},E_{1})$ and $G_{2}=(V_{2},E_{2})$, the objective is to map nodes $u, v \in G_1$ to nodes $u',v'\in G_2$ such that when $u, v$ have an edge in $G_1$, very likely their corresponding nodes $u', v'$ in $G_2$ are connected as well. This problem with subgraph isomorphism as a special case has extra challenges when we consider matching complex networks exhibiting the small world phenomena. In this work, we propose to use `Ricci flow metric', to define the distance between two nodes in a network. This is then used to define similarity of a pair of nodes in two networks respectively, which is the crucial step of network alignment. %computed by discrete graph curvatures and graph Ricci flows. Specifically, the Ricci curvature of an edge describes intuitively how well the local neighborhood is connected. The graph Ricci flow uniformizes discrete Ricci curvature and induces a Ricci flow metric that is insensitive to node/edge insertions and deletions. With the new metric, we can map a node in $G_1$ to a node in $G_2$ whose distance vector to only a few preselected landmarks is the most similar. The robustness of the graph metric makes it outperform other methods when tested on various complex graph models and real world network data sets (Emails, Internet, and protein interaction networks)\footnote{The source code of computing Ricci curvature and Ricci flow metric are available: https://github.com/saibalmars/GraphRicciCurvature}.
△ Less
Submitted 7 September, 2018; v1 submitted 2 September, 2018;
originally announced September 2018.
-
Band gap and band offset of Ga$_2$O$_3$ and (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys
Authors:
Tianshi Wang,
Wei Li,
Chaoying Ni,
Anderson Janotti
Abstract:
Ga$_2$O$_3$ and (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys are promising materials for solar-blind UV photodetectors and high-power transistors. Basic key parameters in the device design, such as band gap variation with alloy composition and band offset between Ga$_2$O$_3$ and (Al$_x$Ga$_{1-x}$)$_2$O$_3$, are yet to be established. Using density functional theory with the HSE hybrid functional, we compute…
▽ More
Ga$_2$O$_3$ and (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys are promising materials for solar-blind UV photodetectors and high-power transistors. Basic key parameters in the device design, such as band gap variation with alloy composition and band offset between Ga$_2$O$_3$ and (Al$_x$Ga$_{1-x}$)$_2$O$_3$, are yet to be established. Using density functional theory with the HSE hybrid functional, we compute formation enthalpies, band gaps, and band edge positions of (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys in the monoclinic ($β$) and corundum ($α$) phases. We find the formation enthlapies of (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys are significantly lower than of (In$_x$Ga$_{1-x}$)$_2$O$_3$, and that (Al$_x$Ga$_{1-x}$)$_2$O$_3$ with $x$=0.5 can be considered as an ordered compound AlGaO$_3$ in the monoclinic phase, with Al occupying the octahedral sites and Ga occupying the tetrahedral sites. The direct band gaps of the alloys range from 4.69 to 7.03 eV for $β$-(Al$_x$Ga$_{1-x}$)$_2$O$_3$ and from 5.26 to 8.56 eV for $α$-(Al$_x$Ga$_{1-x}$)$_2$O$_3$. Most of the band offset of the (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloy arises from the discontinuity in the conduction band. Our results are used to explain the available experimental data, and consequences for designing modulation-doped field effect transistors (MODFETs) based on (Al$_x$Ga$_{1-x}$)$_2$O$_3$/Ga$_2$O$_3$ are discussed.
△ Less
Submitted 31 July, 2018; v1 submitted 8 June, 2018;
originally announced June 2018.
-
Observation of $^{6}$He+$t$ cluster states in $^{9}$Li
Authors:
W. H. Ma,
J. S. Wang,
D. Patel,
Y. Y. Yang,
J. B. Ma,
S. L. **,
P. Ma,
Q. Hu,
Z. Bai,
M. R. Huang,
X. Q. Liu,
Y. J. Zhou,
J. Chen,
Z. H. Gao,
Q. Wang,
J. Lubian,
J. X. Li,
T. F. Wang,
S. Mukherjee,
X. Y. Ju,
Y. S. Yu,
T. W. Wu,
C. Ni,
X. D. Jia,
Q. B. Liu
, et al. (3 additional authors not shown)
Abstract:
$^{6}$He+$t$ cluster states of exited $^{9}$Li have been measured by 32.7 MeV/nucleon $^{9}$Li beams bombarding on $^{208}$Pb target. Two resonant states are clearly observed with the excitation energies at 9.8 MeV and 12.6 MeV and spin-parity of 3/2$^{-}$ and 7/2$^{-}$ respectively. These two states are considered to be members of K$^π$=1/2$^{-}…
▽ More
$^{6}$He+$t$ cluster states of exited $^{9}$Li have been measured by 32.7 MeV/nucleon $^{9}$Li beams bombarding on $^{208}$Pb target. Two resonant states are clearly observed with the excitation energies at 9.8 MeV and 12.6 MeV and spin-parity of 3/2$^{-}$ and 7/2$^{-}$ respectively. These two states are considered to be members of K$^π$=1/2$^{-}$ band. The spin-parity of them are identified by the method of angular correlation analysis and verified by the continuum discretized coupled channels (CDCC) calculation, which agrees with the prediction of the generator coordinate method (GCM). A monopole matrix element about 4 fm$^{2}$ for the 3/2$^{-}$ state is extracted from the distorted wave Born approximation (DWBA) calculation. These results strongly support the feature of clustering structure of two neutron-rich clusters in the neutron-rich nucleus $^{9}$Li for the first time.
△ Less
Submitted 5 January, 2018; v1 submitted 7 September, 2017;
originally announced September 2017.
-
Decentralized Trajectory Tracking Using Homology and Hodge Decomposition in Sensor Networks
Authors:
Xiaotian Yin,
Yu-Yao Lin,
Chien-Chun Ni,
Jiaxin Ding,
Wei Han,
Dengpan Zhou,
Jie Gao,
Xianfeng Gu
Abstract:
With the recent development of localization and tracking systems for both indoor and outdoor settings, we consider the problem of sensing, representing and analyzing human movement trajectories that we expect to gather in the near future. In this paper, we propose to use the topological representation, which records how a target moves around the natural obstacles in the underlying environment. We…
▽ More
With the recent development of localization and tracking systems for both indoor and outdoor settings, we consider the problem of sensing, representing and analyzing human movement trajectories that we expect to gather in the near future. In this paper, we propose to use the topological representation, which records how a target moves around the natural obstacles in the underlying environment. We demonstrate that the topological information can be sufficiently descriptive for many applications and efficient enough for storing, comparing and classifying these natural human trajectories. We pre-process the sensor network with a purely decentralized algorithm such that certain edges are given numerical weights. Then we can perform trajectory classification by simply summing up the edge weights along the trajectory. Our method supports real-time classification of trajectories with minimum communication cost. We test the effectiveness of our approach by showing how to classify randomly generated trajectories in a multi-level arts museum layout as well as how to distinguish real world taxi trajectories in a large city.
△ Less
Submitted 30 August, 2017;
originally announced August 2017.
-
Energy-Efficient Resource Allocation for Cache-Assisted Mobile Edge Computing
Authors:
Ying Cui,
Wen He,
Chun Ni,
Chengjun Guo,
Zhi Liu
Abstract:
In this paper, we jointly consider communication, caching and computation in a multi-user cache-assisted mobile edge computing (MEC) system, consisting of one base station (BS) of caching and computing capabilities and multiple users with computation-intensive and latency-sensitive applications. We propose a joint caching and offloading mechanism which involves task uploading and executing for tas…
▽ More
In this paper, we jointly consider communication, caching and computation in a multi-user cache-assisted mobile edge computing (MEC) system, consisting of one base station (BS) of caching and computing capabilities and multiple users with computation-intensive and latency-sensitive applications. We propose a joint caching and offloading mechanism which involves task uploading and executing for tasks with uncached computation results as well as computation result downloading for all tasks at the BS, and efficiently utilizes multi-user diversity and multicasting opportunities. Then, we formulate the average total energy minimization problem subject to the caching and deadline constraints to optimally allocate the storage resource at the BS for caching computation results as well as the uploading and downloading time durations. The problem is a challenging mixed discrete-continuous optimization problem. We show that strong duality holds, and obtain an optimal solution using a dual method. To reduce the computational complexity, we further propose a low-complexity suboptimal solution. Finally, numerical results show that the proposed suboptimal solution outperforms existing comparison schemes.
△ Less
Submitted 16 August, 2017;
originally announced August 2017.
-
Robot Coverage Path Planning for General Surfaces Using Quadratic Differentials
Authors:
Yu-Yao Lin,
Chien-Chun Ni,
Na Lei,
Xianfeng David Gu,
Jie Gao
Abstract:
Robot Coverage Path planning (i.e., provide full coverage of a given domain by one or multiple robots) is a classical problem in the field of robotics and motion planning. The goal is to provide nearly full coverage while also minimize duplicately visited area. In this paper we focus on the scenario of path planning on general surfaces including planar domains with complex topology, complex terrai…
▽ More
Robot Coverage Path planning (i.e., provide full coverage of a given domain by one or multiple robots) is a classical problem in the field of robotics and motion planning. The goal is to provide nearly full coverage while also minimize duplicately visited area. In this paper we focus on the scenario of path planning on general surfaces including planar domains with complex topology, complex terrain or general surface in 3D space. The main idea is to adopt a natural, intrinsic and global parametrization of the surface for robot path planning, namely the holomorphic quadratic differentials. Except for a small number of zero points (singularities), each point on the surface is given a uv-coordinates naturally represented by a complex number. We show that natural, efficient robot paths can be obtained by using such coordinate systems. The method is based on intrinsic geometry and thus can be adapted to general surface exploration in 3D.
△ Less
Submitted 25 January, 2017;
originally announced January 2017.
-
Microstructural Characteristics of Reaction-Bonded B4C/SiC Composite
Authors:
Tianshi Wang,
Chaoying Ni,
Prashant Karandikar
Abstract:
A detailed microstructural investigation was performed to understand structural characteristics of a reaction-bonded B$_4$C/SiC ceramic composite. The state-of-the-art focused ion beam & scanning electron microscopy (FIB/SEM) and transmission electron microscopy (TEM) revealed that the as-fabricated product consisted of core-rim structures with α-SiC and $B_4$C cores surrounded by β-SiC and $B_4$C…
▽ More
A detailed microstructural investigation was performed to understand structural characteristics of a reaction-bonded B$_4$C/SiC ceramic composite. The state-of-the-art focused ion beam & scanning electron microscopy (FIB/SEM) and transmission electron microscopy (TEM) revealed that the as-fabricated product consisted of core-rim structures with α-SiC and $B_4$C cores surrounded by β-SiC and $B_4$C, respectively. In addition, plate-like β-SiC was detected within the $B_4$C rim. A phase formation mechanism was proposed and the analytical elucidation is anticipated to shed light on potential fabrication optimization and the property improvement of ceramic composites.
△ Less
Submitted 28 September, 2016;
originally announced September 2016.
-
Thermal Transport Across Metal Silicide-Silicon Interfaces: An Experimental Comparison between Epitaxial and Non-epitaxial Interfaces
Authors:
Ning Ye,
Joseph P Feser,
Sridhar Sadasivam,
Timothy S. Fisher,
Tianshi Wang,
Chaoying Ni,
Anderson Janotti
Abstract:
Silicides are used extensively in nano- and microdevices due to their low electrical resistivity, low contact resistance to silicon, and their process compatibility. In this work, the thermal interface conductance of TiSi$_2$, CoSi$_2$, NiSi and PtSi are studied using time-domain thermoreflectance. Exploiting the fact that most silicides formed on Si(111) substrates grow epitaxially, while most si…
▽ More
Silicides are used extensively in nano- and microdevices due to their low electrical resistivity, low contact resistance to silicon, and their process compatibility. In this work, the thermal interface conductance of TiSi$_2$, CoSi$_2$, NiSi and PtSi are studied using time-domain thermoreflectance. Exploiting the fact that most silicides formed on Si(111) substrates grow epitaxially, while most silicides on Si(100) do not, we study the effect of epitaxy, and show that for a wide variety of interfaces there is no difference in the thermal interface conductance of epitaxial and non-epitaxial silicide/silicon interfaces. The effect of substrate carrier concentration is also investigated over a wide range of p- and n-type do**, and is found to be independent of carrier concentration, regardless of whether the interface is epitaxial and regardless of silicide type. In the case of epitaxial CoSi$_2$, a comparison of temperature dependant experimental data is made with two detailed computational models using (1) full-dispersion diffuse mismatch modeling (DMM) including the effect of near-interfacial strain and (2) an atomistic Green' function (AGF) approach that integrates near-interface changes in the interatomic force constants obtained through density functional perturbation theory. At temperatures above 100K, the AGF approach greatly underpredicts the CoSi$_2$ data, while the DMM prediction matches the data well. The full-dispersion DMM is also found to closely predict the experimentally observed temperature-dependent interface conductance for epitaxial NiSi/Si and non-epitaxial TiSi$_2$/Si interfaces. In the case of epitaxial PtSi/Si interfaces, full dispersion DMM significantly overpredicts the experimental data.
△ Less
Submitted 26 January, 2017; v1 submitted 6 September, 2016;
originally announced September 2016.
-
Novel 16-QAM and 64-QAM Near-Complementary Sequences with Low PMEPR in OFDM Systems
Authors:
Tao Jiang,
Chunxing Ni,
Yuance Xu
Abstract:
In this paper, we firstly propose a novel construction of $16$-quadrature amplitude modulation (QAM) near-complementary sequences with low peak-to-mean envelope power ratio (PMEPR) in orthogonal frequency division multiplexing (OFDM) systems. The proposed $16$-QAM near-complementary sequences can be constructed by utilizing novel nonlinear offsets, where the length of the sequences is $n=2^m$. The…
▽ More
In this paper, we firstly propose a novel construction of $16$-quadrature amplitude modulation (QAM) near-complementary sequences with low peak-to-mean envelope power ratio (PMEPR) in orthogonal frequency division multiplexing (OFDM) systems. The proposed $16$-QAM near-complementary sequences can be constructed by utilizing novel nonlinear offsets, where the length of the sequences is $n=2^m$. The family size of the newly constructed $16$-QAM near-complementary sequences is $8\times (\frac{m!}{2})\times 4^{m+1}$, and the PMEPR of these sequences is proven to satisfy ${\textrm{PMEPR}}\leq 2.4$. Thus, the proposed construction can generate a number of $16$-QAM near-complementary sequences with low PMEPR, resulting in the improvement of the code rate in OFDM systems. Furthermore, we also propose a novel construction of $64$-QAM near-complementary sequences with low PMEPR, which is the first proven construction of $64$-QAM near-complementary sequences. The PMEPRs of two types of the proposed $64$-QAM near-complementary sequences are proven to satisfy that ${\textrm{PMEPR}}\leq 3.62$ or ${\textrm{PMEPR}}\leq 2.48$, respectively. The family size of the newly constructed $64$-QAM near-complementary sequences is $64\times (\frac{m!}{2})\times 4^{m+1}$.
△ Less
Submitted 12 July, 2016;
originally announced July 2016.
-
Capacitated Kinetic Clustering in Mobile Networks by Optimal Transportation Theory
Authors:
Chien-Chun Ni,
Zhengyu Su,
Jie Gao,
Xianfeng David Gu
Abstract:
We consider the problem of capacitated kinetic clustering in which $n$ mobile terminals and $k$ base stations with respective operating capacities are given. The task is to assign the mobile terminals to the base stations such that the total squared distance from each terminal to its assigned base station is minimized and the capacity constraints are satisfied. This paper focuses on the developmen…
▽ More
We consider the problem of capacitated kinetic clustering in which $n$ mobile terminals and $k$ base stations with respective operating capacities are given. The task is to assign the mobile terminals to the base stations such that the total squared distance from each terminal to its assigned base station is minimized and the capacity constraints are satisfied. This paper focuses on the development of \emph{distributed} and computationally efficient algorithms that adapt to the motion of both terminals and base stations. Suggested by the optimal transportation theory, we exploit the structural property of the optimal solution, which can be represented by a power diagram on the base stations such that the total usage of nodes within each power cell equals the capacity of the corresponding base station. We show by using the kinetic data structure framework the first analytical upper bound on the number of changes in the optimal solution, i.e., its stability. On the algorithm side, using the power diagram formulation we show that the solution can be represented in size proportional to the number of base stations and can be solved by an iterative, local algorithm. In particular, this algorithm can naturally exploit the continuity of motion and has orders of magnitude faster than existing solutions using min-cost matching and linear programming, and thus is able to handle large scale data under mobility.
△ Less
Submitted 25 February, 2016;
originally announced February 2016.
-
Efficient Ranking and Selection in Parallel Computing Environments
Authors:
Eric C. Ni,
Dragos F. Ciocan,
Shane G. Henderson,
Susan R. Hunter
Abstract:
The goal of ranking and selection (R&S) procedures is to identify the best stochastic system from among a finite set of competing alternatives. Such procedures require constructing estimates of each system's performance, which can be obtained simultaneously by running multiple independent replications on a parallel computing platform. However, nontrivial statistical and implementation issues arise…
▽ More
The goal of ranking and selection (R&S) procedures is to identify the best stochastic system from among a finite set of competing alternatives. Such procedures require constructing estimates of each system's performance, which can be obtained simultaneously by running multiple independent replications on a parallel computing platform. However, nontrivial statistical and implementation issues arise when designing R&S procedures for a parallel computing environment. Thus we propose several design principles for parallel R&S procedures that preserve statistical validity and maximize core utilization, especially when large numbers of alternatives or cores are involved. These principles are followed closely by our parallel Good Selection Procedure (GSP), which, under the assumption of normally distributed output, (i) guarantees to select a system in the indifference zone with high probability, (ii) runs efficiently on up to 1,024 parallel cores, and (iii) in an example uses smaller sample sizes compared to existing parallel procedures, particularly for large problems (over $10^6$ alternatives). In our computational study we discuss two methods for implementing GSP on parallel computers, namely the Message-Passing Interface (MPI) and Hadoop MapReduce and show that the latter provides good protection against core failures at the expense of a significant drop in utilization due to periodic unavoidable synchronization.
△ Less
Submitted 16 June, 2015;
originally announced June 2015.
-
Ricci Curvature of the Internet Topology
Authors:
Chien-Chun Ni,
Yu-Yao Lin,
Jie Gao,
Xianfeng David Gu,
Emil Saucan
Abstract:
Analysis of Internet topologies has shown that the Internet topology has negative curvature, measured by Gromov's "thin triangle condition", which is tightly related to core congestion and route reliability. In this work we analyze the discrete Ricci curvature of the Internet, defined by Ollivier, Lin, etc. Ricci curvature measures whether local distances diverge or converge. It is a more local me…
▽ More
Analysis of Internet topologies has shown that the Internet topology has negative curvature, measured by Gromov's "thin triangle condition", which is tightly related to core congestion and route reliability. In this work we analyze the discrete Ricci curvature of the Internet, defined by Ollivier, Lin, etc. Ricci curvature measures whether local distances diverge or converge. It is a more local measure which allows us to understand the distribution of curvatures in the network. We show by various Internet data sets that the distribution of Ricci cuvature is spread out, suggesting the network topology to be non-homogenous. We also show that the Ricci curvature has interesting connections to both local measures such as node degree and clustering coefficient, global measures such as betweenness centrality and network connectivity, as well as auxilary attributes such as geographical distances. These observations add to the richness of geometric structures in complex network theory.
△ Less
Submitted 16 January, 2015;
originally announced January 2015.
-
ALMA Nutator Design and Preliminary Performance
Authors:
Pierre Martin-Cocher,
John Ford,
Patrick M. Koch,
Chih-Wen Ni,
Wei-Long Chen,
Ming-Tang Chen,
Philippe Raffin,
Chin-Long Ong,
Paul T. P. Ho,
Arthur Symmes
Abstract:
We report the past two years of collaboration between the different actors on the ALMA nutator. Building on previous developments, the nutator has seen changes in much of the design. A high-modulus carbon fiber structure has been added on the back of the mirror in order to transfer the voice coils forces with less deformation, thus reducing delay problems due to flexing. The controller is now an o…
▽ More
We report the past two years of collaboration between the different actors on the ALMA nutator. Building on previous developments, the nutator has seen changes in much of the design. A high-modulus carbon fiber structure has been added on the back of the mirror in order to transfer the voice coils forces with less deformation, thus reducing delay problems due to flexing. The controller is now an off-the-shelf National Instrument NI-cRIO, and the amplifier a class D servo drive from Advanced Motion Controls, with high peak power able to drive the coils at 300 Volts DC. The stow mechanism has been totally redesigned to improve on the repeatability and precision of the stow position, which is also the reference for the 26 bits Heidenhain encoders. This also improves on the accuracy of the stow position with wind loading. Finally, the software, written largely with National Instrument's LabView, has been developed. We will discuss these changes and the preliminary performances achieved to date. Keywords: ALMA, nutator, class D, high-modulus carbon fiber.
△ Less
Submitted 20 July, 2013;
originally announced July 2013.
-
Demonstration of mid-infrared waveguide photonic crystal cavities
Authors:
Hongtao Lin,
Lan Li,
Fei Deng,
Chaoying Ni,
Sylvain Danto,
J. David Musgraves,
Kathleen Richardson,
Juejun Hu
Abstract:
We have demonstrated what we believe to be the first waveguide photonic crystal cavity operating in the mid-infrared. The devices were fabricated from Ge23Sb7S70 chalcogenide glass on CaF2 substrates by combing photolithographic patterning and focus ion beam milling. The waveguide-coupled cavities were characterized using a fiber end fire coupling method at 5.2 μm wavelength, and a loaded quality…
▽ More
We have demonstrated what we believe to be the first waveguide photonic crystal cavity operating in the mid-infrared. The devices were fabricated from Ge23Sb7S70 chalcogenide glass on CaF2 substrates by combing photolithographic patterning and focus ion beam milling. The waveguide-coupled cavities were characterized using a fiber end fire coupling method at 5.2 μm wavelength, and a loaded quality factor of ~ 2,000 was measured near the critical coupling regime.
△ Less
Submitted 20 May, 2013;
originally announced May 2013.