-
Guaranteeing Recoverability via Partially Constrained Transaction Logs
Authors:
H. Zhou,
J. W. Guo,
H. Q. Hu,
W. N. Qian,
X. Zhou,
A. Y. Zhou
Abstract:
Transaction logging is an essential constituent to guarantee the atomicity and durability in online transaction processing (OLTP) systems. It always has a considerable impact on performance, especially in an in-memory database system. Conventional implementations of logging rely heavily on a centralized design, which guarantees the correctness of recovery by enforcing a total order of all operations such as log sequence number (LSN) allocation, log persistence, transaction committing and recovering. This strict sequential constraint seriously limits the scalability and parallelism of transaction logging and recovery, especially in the multi-core hardware environment.
In this paper, we define recoverability for transaction logging and demonstrate its correctness for crash recovery. Based on recoverability, we propose a recoverable logging scheme named Poplar, which enables scalable and parallel log processing by easing the restrictions. Its main advantages are that (1) Poplar enables the parallel log persistence on multiple storage devices; (2) it replaces the centralized LSN allocation by calculating a partially ordered sequence number in a distributed manner, which allows log records to only track RAW and WAW dependencies among transactions; (3) it only demands transactions with RAW dependencies to be committed in serial order; (4) Poplar can concurrently restore a consistent database state based on the partially constrained logs after a crash. Experimental results show that Poplar scales well with the increase of IO devices and outperforms other logging approaches on both SSDs and emulated non-volatile memory.
Submitted 19 January, 2019;
originally announced January 2019.
-
Vacuum energy decay from a q-bubble
Authors:
F. R. Klinkhamer,
O. P. Santillan,
G. E. Volovik,
A. Zhou
Abstract:
We consider a finite-size spherical bubble with a nonequilibrium value of the $q$-field, where the bubble is immersed in an infinite vacuum with the constant equilibrium value $q_{0}$ for the $q$-field (this $q_{0}$ has already cancelled an initial cosmological constant). Numerical results are presented for the time evolution of such a $q$-bubble with gravity turned off and with gravity turned on. For small enough bubbles and a $q$-field energy scale sufficiently below the gravitational energy scale $E_\text{Planck}$, the vacuum energy of the $q$-bubble is found to disperse completely. For large enough bubbles and a finite value of $E_\text{Planck}$, the vacuum energy of the $q$-bubble disperses only partially and there occurs gravitational collapse near the bubble center.
Submitted 1 November, 2019; v1 submitted 17 January, 2019;
originally announced January 2019.
-
Resampling detection of recompressed images via dual-stream convolutional neural network
Authors:
Gang Cao,
Antao Zhou,
Xianglin Huang,
Gege Song,
Lifang Yang,
Yonggui Zhu
Abstract:
Resampling detection plays an important role in identifying image tampering, such as image splicing. Currently, resampling detection is still difficult in recompressed images, which are produced by applying resampling followed by post-JPEG compression to primary JPEG images. Except for the scenario of low-quality primary compression, it remains rather challenging due to the widespread use of middle/high-quality compression in imaging devices. In this paper, we propose a new convolutional neural network (CNN) method to learn the resampling trace features directly from the recompressed images. To this end, a noise extraction layer based on low-order high-pass filters is deployed to yield the image residual domain, which is more beneficial for extracting manipulation trace features. A dual-stream CNN is presented to capture the resampling trails along different directions, where the horizontal and vertical streams are interleaved and concatenated. Lastly, the learned features are fed into a Sigmoid/Softmax layer, which acts as a binary/multi-class classifier for achieving blind detection and parameter estimation of resampling, respectively. Extensive experimental results demonstrate that our proposed method can detect resampling effectively in recompressed images and outperforms the state-of-the-art detectors.
Submitted 10 May, 2019; v1 submitted 14 January, 2019;
originally announced January 2019.
-
Multi-Granularity Reasoning for Social Relation Recognition from Images
Authors:
Meng Zhang,
Xinchen Liu,
Wu Liu,
Anfu Zhou,
Huadong Ma,
Tao Mei
Abstract:
Discovering social relations in images can help machines better interpret the behavior of human beings. However, automatically recognizing social relations in images is a challenging task due to the significant gap between the domains of visual content and social relation. Existing studies separately process various features such as facial expressions, body appearance, and contextual objects, and thus cannot comprehensively capture the multi-granularity semantics, such as scenes, regional cues of persons, and interactions among persons and objects. To bridge the domain gap, we propose a Multi-Granularity Reasoning framework for social relation recognition from images. The global knowledge and mid-level details are learned from the whole scene and the regions of persons and objects, respectively. Most importantly, we explore the fine-granularity pose keypoints of persons to discover the interactions among persons and objects. Specifically, the pose-guided Person-Object Graph and Person-Pose Graph are proposed to model the actions from persons to objects and the interactions between paired persons, respectively. Based on the graphs, social relation reasoning is performed by graph convolutional networks. Finally, the global features and reasoned knowledge are integrated as a comprehensive representation for social relation recognition. Extensive experiments on two public datasets show the effectiveness of the proposed framework.
Submitted 10 January, 2019;
originally announced January 2019.
-
Analysis of Contraction Effort Level in EMG-Based Gesture Recognition Using Hyperdimensional Computing
Authors:
Ali Moin,
Andy Zhou,
Simone Benatti,
Abbas Rahimi,
Luca Benini,
Jan M. Rabaey
Abstract:
Varying contraction levels of muscles is a big challenge in electromyography-based gesture recognition. Some use cases require the classifier to be robust against varying force changes, while others demand to distinguish between different effort levels of performing the same gesture. We use brain-inspired hyperdimensional computing paradigm to build classification models that are both robust to these variations and able to recognize multiple contraction levels. Experimental results on 5 subjects performing 9 gestures with 3 effort levels show up to 39.17% accuracy drop when training and testing across different effort levels, with up to 30.35% recovery after applying our algorithm.
Submitted 30 August, 2019; v1 submitted 1 January, 2019;
originally announced January 2019.
-
Learning to Walk via Deep Reinforcement Learning
Authors:
Tuomas Haarnoja,
Sehoon Ha,
Aurick Zhou,
Jie Tan,
George Tucker,
Sergey Levine
Abstract:
Deep reinforcement learning (deep RL) holds the promise of automating the acquisition of complex controllers that can map sensory inputs directly to low-level actions. In the domain of robotic locomotion, deep RL could enable learning locomotion skills with minimal engineering and without an explicit model of the robot dynamics. Unfortunately, applying deep RL to real-world robotic tasks is exceptionally difficult, primarily due to poor sample complexity and sensitivity to hyperparameters. While hyperparameters can be easily tuned in simulated domains, tuning may be prohibitively expensive on physical systems, such as legged robots, that can be damaged through extensive trial-and-error learning. In this paper, we propose a sample-efficient deep RL algorithm based on maximum entropy RL that requires minimal per-task tuning and only a modest number of trials to learn neural network policies. We apply this method to learning walking gaits on a real-world Minitaur robot. Our method can acquire a stable gait from scratch directly in the real world in about two hours, without relying on any model or simulation, and the resulting policy is robust to moderate variations in the environment. We further show that our algorithm achieves state-of-the-art performance on simulated benchmarks with a single set of hyperparameters. Videos of training and the learned policy can be found on the project website.
Submitted 19 June, 2019; v1 submitted 26 December, 2018;
originally announced December 2018.
-
Soft Actor-Critic Algorithms and Applications
Authors:
Tuomas Haarnoja,
Aurick Zhou,
Kristian Hartikainen,
George Tucker,
Sehoon Ha,
Jie Tan,
Vikash Kumar,
Henry Zhu,
Abhishek Gupta,
Pieter Abbeel,
Sergey Levine
Abstract:
Model-free deep reinforcement learning (RL) algorithms have been successfully applied to a range of challenging sequential decision making and control tasks. However, these methods typically suffer from two major challenges: high sample complexity and brittleness to hyperparameters. Both of these challenges limit the applicability of such methods to real-world domains. In this paper, we describe Soft Actor-Critic (SAC), our recently introduced off-policy actor-critic algorithm based on the maximum entropy RL framework. In this framework, the actor aims to simultaneously maximize expected return and entropy. That is, to succeed at the task while acting as randomly as possible. We extend SAC to incorporate a number of modifications that accelerate training and improve stability with respect to the hyperparameters, including a constrained formulation that automatically tunes the temperature hyperparameter. We systematically evaluate SAC on a range of benchmark tasks, as well as real-world challenging tasks such as locomotion for a quadrupedal robot and robotic manipulation with a dexterous hand. With these improvements, SAC achieves state-of-the-art performance, outperforming prior on-policy and off-policy methods in sample-efficiency and asymptotic performance. Furthermore, we demonstrate that, in contrast to other off-policy algorithms, our approach is very stable, achieving similar performance across different random seeds. These results suggest that SAC is a promising candidate for learning in real-world robotics tasks.
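For reference, the maximum entropy objective that this abstract describes in words (maximize expected return and entropy together) is commonly written as below. The notation is assumed here and does not come from the listing: $\alpha$ is the temperature hyperparameter, $\rho_\pi$ the state-action distribution induced by the policy $\pi$, and $\mathcal{H}$ the entropy.

```latex
J(\pi) = \sum_{t=0}^{T} \mathbb{E}_{(s_t, a_t) \sim \rho_\pi}
  \left[ r(s_t, a_t) + \alpha \, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \right]
```

Setting $\alpha = 0$ recovers the standard expected-return objective, which is why tuning the temperature automatically matters for stability.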
Submitted 29 January, 2019; v1 submitted 12 December, 2018;
originally announced December 2018.
-
Plane Wave Methods for Quantum Eigenvalue Problems of Incommensurate Systems
Authors:
Yuzhi Zhou,
Huajie Chen,
Aihui Zhou
Abstract:
We propose a novel numerical algorithm for computing the electronic structure related eigenvalue problem of incommensurate systems. Unlike the conventional practice that approximates the system by a large commensurate supercell, our algorithm directly discretizes the eigenvalue problem under the framework of a plane wave method. The emerging ergodicity and the interpretation from higher dimensions give rise to many unique features compared to those familiar from periodic systems. The numerical results of 1D and 2D quantum eigenvalue problems are presented to show the reliability and efficiency of our scheme. Furthermore, the extension of our algorithm to full Kohn-Sham density functional theory calculations is discussed.
Submitted 11 November, 2018;
originally announced November 2018.
-
Interval Estimation of Individual-Level Causal Effects Under Unobserved Confounding
Authors:
Nathan Kallus,
Xiaojie Mao,
Angela Zhou
Abstract:
We study the problem of learning conditional average treatment effects (CATE) from observational data with unobserved confounders. The CATE function maps baseline covariates to individual causal effect predictions and is key for personalized assessments. Recent work has focused on how to learn CATE under unconfoundedness, i.e., when there are no unobserved confounders. Since CATE may not be identified when unconfoundedness is violated, we develop a functional interval estimator that predicts bounds on the individual causal effects under realistic violations of unconfoundedness. Our estimator takes the form of a weighted kernel estimator with weights that vary adversarially. We prove that our estimator is sharp in that it converges exactly to the tightest bounds possible on CATE when there may be unobserved confounders. Further, we study personalized decision rules derived from our estimator and prove that they achieve optimal minimax regret asymptotically. We assess our approach in a simulation study as well as demonstrate its application in the case of hormone replacement therapy by comparing conclusions from a real observational study and clinical trial.
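As a reading aid, the CATE function and the unconfoundedness assumption mentioned above can be stated in standard potential-outcome notation. The symbols are assumed here and are not taken from the listing: $X$ denotes baseline covariates, $T \in \{0, 1\}$ the treatment, and $Y(t)$ the potential outcome under treatment $t$.

```latex
\tau(x) = \mathbb{E}\left[\, Y(1) - Y(0) \mid X = x \,\right],
\qquad
\text{unconfoundedness:}\quad Y(t) \perp T \mid X \quad \text{for } t \in \{0, 1\}
```

When the independence condition fails, $\tau(x)$ is generally not point-identified from observational data, which is the setting where bounds on $\tau(x)$ take the place of a point estimate.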
Submitted 5 October, 2018;
originally announced October 2018.
-
Cost Functions for Robot Motion Style
Authors:
Allan Zhou,
Anca D. Dragan
Abstract:
We focus on autonomously generating robot motion for day to day physical tasks that is expressive of a certain style or emotion. Because we seek generalization across task instances and task types, we propose to capture style via cost functions that the robot can use to augment its nominal task cost and task constraints in a trajectory optimization process. We compare two approaches to representing such cost functions: a weighted linear combination of hand-designed features, and a neural network parameterization operating on raw trajectory input. For each cost type, we learn weights for each style from user feedback. We contrast these approaches to a nominal motion across different tasks and for different styles in a user study, and find that they both perform on par with each other, and significantly outperform the baseline. Each approach has its advantages: featurized costs require learning fewer parameters and can perform better on some styles, but neural network representations do not require expert knowledge to design features and could even learn more complex, nuanced costs than an expert can easily design.
Submitted 31 August, 2018;
originally announced September 2018.
-
PyDraw: a GUI drawing generator based on Tkinter and its design concept
Authors:
**wei Lin,
Aimin Zhou
Abstract:
The emergence of the GUI was a great advance in the history of computer science and software design. GUIs make human-computer interaction simpler and more interesting. Python, a popular programming language in recent years, has not been fully realized in GUI design. Tkinter has the advantage of native support for Python, but there are too few visual GUI generators supporting Tkinter. This article presents PyDraw, a GUI generator based on the Tkinter framework. The design principle of PyDraw and the design concept behind it are introduced in detail. With PyDraw's GUI design philosophy, one can easily design a visual GUI rendering generator for any GUI framework with canvas functionality or any programming language with screen display control. This article is committed to conveying PyDraw's free GUI design concept. Through experiments, we demonstrate the practicability and efficiency of PyDraw. To better convey the design concept of PyDraw and to let more enthusiasts join its update and evolution, we have released the source code of PyDraw. At the end of the article, we summarize our experience and express our vision for future GUI design. We believe that GUIs will play an important role in graphical software programming, that low-code or even no-code software design methods will become a major focus, and that free, drawing-like GUI design is worth pursuing.
Submitted 27 August, 2018;
originally announced August 2018.
-
Completely Positive Binary Tensors
Authors:
**yan Fan,
Jiawang Nie,
Anwa Zhou
Abstract:
A symmetric tensor is completely positive (CP) if it is a sum of tensor powers of nonnegative vectors. This paper characterizes completely positive binary tensors. We show that a binary tensor is completely positive if and only if it satisfies two linear matrix inequalities. This result can be used to determine whether a binary tensor is completely positive or not. When it is, we give an algorithm for computing its cp-rank and the decomposition. When the order is odd, we show that the cp-rank decomposition is unique. When the order is even, we completely characterize when the cp-rank decomposition is unique. We also discuss how to compute the nearest cp-approximation when a binary tensor is not completely positive.
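The opening definition can be written out explicitly. This is a sketch in assumed notation (the symbols $\mathcal{A}$, $m$, $n$, $r$, and $u_k$ do not appear in the listing), and it assumes that "binary" means dimension $n = 2$, as with binary forms.

```latex
\mathcal{A} = \sum_{k=1}^{r} (u_k)^{\otimes m},
\qquad u_k \in \mathbb{R}^{n}_{+}, \quad k = 1, \dots, r
```

Under this reading, the cp-rank of $\mathcal{A}$ is the smallest $r$ for which such a decomposition exists.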
Submitted 7 August, 2018;
originally announced August 2018.
-
Cooperative Adaptive Cruise Control for Connected Autonomous Vehicles by Factoring Communication-Related Constraints
Authors:
Chaojie Wang,
Siyuan Gong,
Anye Zhou,
Tao Li,
Srinivas Peeta
Abstract:
Emergent cooperative adaptive cruise control (CACC) strategies being proposed in the literature for platoon formation in the Connected Autonomous Vehicle (CAV) context mostly assume idealized fixed information flow topologies (IFTs) for the platoon, implying guaranteed vehicle-to-vehicle (V2V) communications for the IFT assumed. Since CACC strategies entail continuous information broadcasting, communication failures can occur in congested CAV traffic networks, leading to a platoon's IFT varying dynamically. To enhance the performance of CACC strategies, this study proposes the idea of dynamically optimizing the IFT for CACC, labeled the CACC-OIFT strategy. Under CACC-OIFT, the vehicles in the platoon cooperatively determine in real-time which vehicles will dynamically deactivate or activate the "send" functionality of their V2V communication devices to generate IFTs that optimize the platoon performance in terms of string stability under the ambient traffic conditions. Given the adaptive Proportional-Derivative (PD) controller with a two-predecessor-following scheme, and the ambient traffic conditions and the platoon size just before the start of a time period, the IFT optimization model determines the optimal IFT that maximizes the expected string stability. The optimal IFT is deployed for that time period, and the adaptive PD controller continuously determines the car-following behaviors of the vehicles based on the unfolding degeneration scenario for each time instant within that period. The effectiveness of the proposed CACC-OIFT is validated through numerical experiments in NS-3 based on NGSIM field data. The results indicate that the proposed CACC-OIFT can significantly enhance the string stability of platoon control in an unreliable V2V communication context, outperforming CACCs with fixed IFTs or with passive adaptive schemes for IFT dynamics.
Submitted 7 August, 2018; v1 submitted 19 July, 2018;
originally announced July 2018.
-
Cooperative Adaptive Cruise Control for a Platoon of Connected and Autonomous Vehicles Considering Dynamic Information Flow Topology
Authors:
Siyuan Gong,
Anye Zhou,
Jian Wang,
Tao Li,
Srinivas Peeta
Abstract:
Vehicle-to-vehicle communications can be unreliable as interference causes communication failures. Thereby, the information flow topology for a platoon of Connected Autonomous Vehicles (CAVs) can vary dynamically. This limits existing Cooperative Adaptive Cruise Control (CACC) strategies as most of them assume a fixed information flow topology (IFT). To address this problem, we introduce a CACC design that considers a dynamic information flow topology (CACC-DIFT) for CAV platoons. An adaptive Proportional-Derivative (PD) controller under a two-predecessor-following IFT is proposed to reduce the negative effects when communication failures occur. The PD controller parameters are determined to ensure the string stability of the platoon. Further, the designed controller also factors the performance of individual vehicles. Hence, when communication failure occurs, the system will switch to a certain type of CACC instead of degenerating to adaptive cruise control, which improves the control performance considerably. The effectiveness of the proposed CACC-DIFT is validated through numerical experiments based on NGSIM field data. Results indicate that the proposed CACC-DIFT design outperforms a CACC with a predetermined information flow topology.
Submitted 7 August, 2018; v1 submitted 5 July, 2018;
originally announced July 2018.
-
Residual Unfairness in Fair Machine Learning from Prejudiced Data
Authors:
Nathan Kallus,
Angela Zhou
Abstract:
Recent work in fairness in machine learning has proposed adjusting for fairness by equalizing accuracy metrics across groups and has also studied how datasets affected by historical prejudices may lead to unfair decision policies. We connect these lines of work and study the residual unfairness that arises when a fairness-adjusted predictor is not actually fair on the target population due to systematic censoring of training data by existing biased policies. This scenario is particularly common in the same applications where fairness is a concern. We characterize theoretically the impact of such censoring on standard fairness metrics for binary classifiers and provide criteria for when residual unfairness may or may not appear. We prove that, under certain conditions, fairness-adjusted classifiers will in fact induce residual unfairness that perpetuates the same injustices, against the same groups, that biased the data to begin with, thus showing that even state-of-the-art fair machine learning can have a "bias in, bias out" property. When certain benchmark data is available, we show how sample reweighting can estimate and adjust fairness metrics while accounting for censoring. We use this to study the case of Stop, Question, and Frisk (SQF) and demonstrate that attempting to adjust for fairness perpetuates the same injustices that the policy is infamous for.
Submitted 7 June, 2018;
originally announced June 2018.
-
Confounding-Robust Policy Improvement
Authors:
Nathan Kallus,
Angela Zhou
Abstract:
We study the problem of learning personalized decision policies from observational data while accounting for possible unobserved confounding. Previous approaches, which assume unconfoundedness, i.e., that no unobserved confounders affect both the treatment assignment as well as outcome, can lead to policies that introduce harm rather than benefit when some unobserved confounding is present, as is generally the case with observational data. Instead, since policy value and regret may not be point-identifiable, we study a method that minimizes the worst-case estimated regret of a candidate policy against a baseline policy over an uncertainty set for propensity weights that controls the extent of unobserved confounding. We prove generalization guarantees that ensure our policy will be safe when applied in practice and will in fact obtain the best-possible uniform control on the range of all possible population regrets that agree with the possible extent of confounding. We develop efficient algorithmic solutions to compute this confounding-robust policy. Finally, we assess and compare our methods on synthetic and semi-synthetic data. In particular, we consider a case study on personalizing hormone replacement therapy based on observational data, where we validate our results on a randomized experiment. We demonstrate that hidden confounding can hinder existing policy learning approaches and lead to unwarranted harm, while our robust approach guarantees safety and focuses on well-evidenced improvement, a necessity for making personalized treatment policies learned from observational data reliable in practice.
Submitted 4 November, 2019; v1 submitted 22 May, 2018;
originally announced May 2018.
-
Antidamping torque-induced switching in biaxial antiferromagnetic insulators
Authors:
X. Z. Chen,
R. Zarzuela,
J. Zhang,
C. Song,
X. F. Zhou,
G. Y. Shi,
F. Li,
H. A. Zhou,
W. J. Jiang,
F. Pan,
Y. Tserkovnyak
Abstract:
We investigate the current-induced switching of the Neel order in NiO(001)/Pt heterostructures, which is manifested electrically via the spin Hall magnetoresistance. Significant reversible changes in the longitudinal and transverse resistances are found at room temperature for a current threshold lying in the range of 10^7 A/cm^2. The order-parameter switching is ascribed to the antiferromagnetic dynamics triggered by the (current-induced) antidamping torque, which orients the Neel order towards the direction of the writing current. This is in stark contrast to the case of antiferromagnets such as Mn2Au and CuMnAs, where field-like torques induced by the Edelstein effect drive the Neel switching, therefore resulting in an orthogonal alignment between the Neel order and the writing current. Our findings can be readily generalized to other biaxial antiferromagnets, providing broad opportunities for all-electrical writing and readout in antiferromagnetic spintronics.
Submitted 15 April, 2018;
originally announced April 2018.
-
Composable Deep Reinforcement Learning for Robotic Manipulation
Authors:
Tuomas Haarnoja,
Vitchyr Pong,
Aurick Zhou,
Murtaza Dalal,
Pieter Abbeel,
Sergey Levine
Abstract:
Model-free deep reinforcement learning has been shown to exhibit good performance in domains ranging from video games to simulated robotic manipulation and locomotion. However, model-free methods are known to perform poorly when the interaction time with the environment is limited, as is the case for most real-world robotic tasks. In this paper, we study how maximum entropy policies trained using soft Q-learning can be applied to real-world robotic manipulation. The application of this method to real-world manipulation is facilitated by two important features of soft Q-learning. First, soft Q-learning can learn multimodal exploration strategies by learning policies represented by expressive energy-based models. Second, we show that policies learned with soft Q-learning can be composed to create new policies, and that the optimality of the resulting policy can be bounded in terms of the divergence between the composed policies. This compositionality provides an especially valuable tool for real-world manipulation, where constructing new policies by composing existing skills can provide a large gain in efficiency over training from scratch. Our experimental evaluation demonstrates that soft Q-learning is substantially more sample efficient than prior model-free deep reinforcement learning methods, and that compositionality can be performed for both simulated and real-world tasks.
Submitted 18 March, 2018;
originally announced March 2018.
-
Deep Neural Network Compression with Single and Multiple Level Quantization
Authors:
Yuhui Xu,
Yongzhuang Wang,
Aojun Zhou,
Weiyao Lin,
Hongkai Xiong
Abstract:
Network quantization is an effective solution to compress deep neural networks for practical usage. Existing network quantization methods cannot sufficiently exploit the depth information to generate low-bit compressed networks. In this paper, we propose two novel network quantization approaches, single-level network quantization (SLQ) for high-bit quantization and multi-level network quantization (MLQ) for extremely low-bit quantization (ternary). We are the first to consider network quantization at both the width and depth levels. At the width level, parameters are divided into two parts: one for quantization and the other for re-training to eliminate the quantization loss. SLQ leverages the distribution of the parameters to improve the width level. At the depth level, we introduce incremental layer compensation to quantize layers iteratively, which decreases the quantization loss in each iteration. The proposed approaches are validated with extensive experiments based on state-of-the-art neural networks including AlexNet, VGG-16, GoogleNet and ResNet-18. Both SLQ and MLQ achieve impressive results.
Submitted 15 December, 2018; v1 submitted 5 March, 2018;
originally announced March 2018.
-
Fast and robust misalignment correction of Fourier ptychographic microscopy
Authors:
Ao Zhou,
Wei Wang,
Ni Chen,
Edmund Y. Lam,
Byoungho Lee,
Guohai Situ
Abstract:
Fourier ptychographic microscopy (FPM) is a newly developed computational imaging technique that can provide gigapixel images with both high resolution (HR) and wide field of view (FOV). However, the positional misalignment of the LED array induces a degradation of the reconstruction, especially in the regions away from the optical axis. In this paper, we propose a robust and fast method to correct the LED misalignment of FPM, termed misalignment correction for FPM (mcFPM). Although different regions in the FOV have different sensitivity to the LED misalignment, the experimental results show that mcFPM robustly eliminates the degradation in each region. Compared with the state-of-the-art methods, mcFPM is much faster.
Submitted 19 February, 2018;
originally announced March 2018.
-
An EMG Gesture Recognition System with Flexible High-Density Sensors and Brain-Inspired High-Dimensional Classifier
Authors:
Ali Moin,
Andy Zhou,
Abbas Rahimi,
Simone Benatti,
Alisha Menon,
Senam Tamakloe,
Jonathan Ting,
Natasha Yamamoto,
Yasser Khan,
Fred Burghardt,
Luca Benini,
Ana C. Arias,
Jan M. Rabaey
Abstract:
EMG-based gesture recognition shows promise for human-machine interaction. Systems are often afflicted by signal and electrode variability which degrades performance over time. We present an end-to-end system combating this variability using a large-area, high-density sensor array and a robust classification algorithm. EMG electrodes are fabricated on a flexible substrate and interfaced to a custom wireless device for 64-channel signal acquisition and streaming. We use brain-inspired high-dimensional (HD) computing for processing EMG features in one-shot learning. The HD algorithm is tolerant to noise and electrode misplacement and can quickly learn from few gestures without gradient descent or back-propagation. We achieve an average classification accuracy of 96.64% for five gestures, with only 7% degradation when training and testing across different days. Our system maintains this accuracy when trained with only three trials of gestures; it also demonstrates comparable accuracy with the state-of-the-art when trained with one trial.
Submitted 5 April, 2018; v1 submitted 27 February, 2018;
originally announced February 2018.
-
Analysis of Fourier ptychographic microscopy with half of the captured images
Authors:
Ao Zhou,
Ni Chen,
Haichao Wang,
Guohai Situ
Abstract:
Fourier ptychography microscopy (FPM) is a new computational imaging technique that can provide gigapixel images with both high resolution and a wide field of view (FOV). However, the time-consuming data-acquisition process is a critical issue. In this paper, we analyze the FPM imaging system using half of the captured images. Based on the image analysis of the conventional FPM system, we then compare the images reconstructed from different numbers of captured images. Simulation and experimental results show that the image reconstructed from half of the captured data does not show obvious resolution degradation compared to that reconstructed from all the captured data, apart from a contrast reduction. In particular, when the object is close to phase-only or amplitude-only, the quality of the image reconstructed with half of the captured data is nearly as good as that of the one reconstructed with the full data.
Submitted 19 February, 2018;
originally announced February 2018.
-
Finding Top-k Optimal Sequenced Routes -- Full Version
Authors:
Huiping Liu,
Cheqing Jin,
Bin Yang,
Aoying Zhou
Abstract:
Motivated by many practical applications in logistics and mobility-as-a-service, we study the top-k optimal sequenced routes (KOSR) querying on large, general graphs where the edge weights may not satisfy the triangle inequality, e.g., road network graphs with travel times as edge weights. The KOSR querying strives to find the top-k optimal routes (i.e., with the top-k minimal total costs) from a given source to a given destination, which must visit a number of vertices with specific vertex categories (e.g., gas stations, restaurants, and shopping malls) in a particular order (e.g., visiting gas stations before restaurants and then shopping malls).
To efficiently find the top-k optimal sequenced routes, we propose two algorithms PruningKOSR and StarKOSR. In PruningKOSR, we define a dominance relationship between two partially-explored routes. The partially-explored routes that can be dominated by other partially-explored routes are postponed being extended, which leads to a smaller searching space and thus improves efficiency. In StarKOSR, we further improve the efficiency by extending routes in an A* manner. With the help of a judiciously designed heuristic estimation that works for general graphs, the cost of partially explored routes to the destination can be estimated such that the qualified complete routes can be found early. In addition, we demonstrate the high extensibility of the proposed algorithms by incorporating Hop Labeling, an effective label indexing technique for shortest path queries, to further improve efficiency. Extensive experiments on multiple real-world graphs demonstrate that the proposed methods significantly outperform the baseline method. Furthermore, when k=1, StarKOSR also outperforms the state-of-the-art method for the optimal sequenced route queries.
Submitted 22 February, 2018;
originally announced February 2018.
-
Policy Evaluation and Optimization with Continuous Treatments
Authors:
Nathan Kallus,
Angela Zhou
Abstract:
We study the problem of policy evaluation and learning from batched contextual bandit data when treatments are continuous, going beyond previous work on discrete treatments. Previous work for discrete treatment/action spaces focuses on inverse probability weighting (IPW) and doubly robust (DR) methods that use a rejection sampling approach for evaluation and the equivalent weighted classification problem for learning. In the continuous setting, this reduction fails as we would almost surely reject all observations. To tackle the case of continuous treatments, we extend the IPW and DR approaches to the continuous setting using a kernel function that leverages treatment proximity to attenuate discrete rejection. Our policy estimator is consistent and we characterize the optimal bandwidth. The resulting continuous policy optimizer (CPO) approach using our estimator achieves convergent regret and approaches the best-in-class policy for learnable policy classes. We demonstrate that the estimator performs well and, in particular, outperforms a discretization-based benchmark. We further study the performance of our policy optimizer in a case study on personalized dosing based on a dataset of Warfarin patients, their covariates, and final therapeutic doses. Our learned policy outperforms benchmarks and nears the oracle-best linear policy.
Submitted 16 February, 2018;
originally announced February 2018.
-
Expressive Robot Motion Timing
Authors:
Allan Zhou,
Dylan Hadfield-Menell,
Anusha Nagabandi,
Anca D. Dragan
Abstract:
Our goal is to enable robots to \emph{time} their motion in a way that is purposefully expressive of their internal states, making them more transparent to people. We start by investigating what types of states motion timing is capable of expressing, focusing on robot manipulation and keeping the path constant while systematically varying the timing. We find that users naturally pick up on certain properties of the robot (like confidence), of the motion (like naturalness), or of the task (like the weight of the object that the robot is carrying). We then conduct a hypothesis-driven experiment to tease out the directions and magnitudes of these effects, and use our findings to develop candidate mathematical models for how users make these inferences from the timing. We find a strong correlation between the models and real user data, suggesting that robots can leverage these models to autonomously optimize the timing of their motion to be expressive.
Submitted 5 February, 2018;
originally announced February 2018.
-
A Linear Solution Method of Generalized Robust Chance Constrained Real-time Dispatch
Authors:
Anping Zhou,
Ming Yang,
Zhaoyu Wang
Abstract:
In this letter, a novel solution method of generalized robust chance constrained real-time dispatch (GRCC-RTD) considering wind power uncertainty is proposed. GRCC models are advantageous in dealing with distributional uncertainty; however, they are difficult to solve because of the complex ambiguity set. By constructing traceable counterparts of the robust chance constraints and using the reformulation linearization technique, the model is equivalently transformed into a deterministic linear programming problem, which can be solved efficiently by off-the-shelf solvers. Numerical results verify the effectiveness and efficiency of the approach.
Submitted 11 January, 2018;
originally announced January 2018.
-
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Authors:
Tuomas Haarnoja,
Aurick Zhou,
Pieter Abbeel,
Sergey Levine
Abstract:
Model-free deep reinforcement learning (RL) algorithms have been demonstrated on a range of challenging decision making and control tasks. However, these methods typically suffer from two major challenges: very high sample complexity and brittle convergence properties, which necessitate meticulous hyperparameter tuning. Both of these challenges severely limit the applicability of such methods to complex, real-world domains. In this paper, we propose soft actor-critic, an off-policy actor-critic deep RL algorithm based on the maximum entropy reinforcement learning framework. In this framework, the actor aims to maximize expected reward while also maximizing entropy. That is, to succeed at the task while acting as randomly as possible. Prior deep RL methods based on this framework have been formulated as Q-learning methods. By combining off-policy updates with a stable stochastic actor-critic formulation, our method achieves state-of-the-art performance on a range of continuous control benchmark tasks, outperforming prior on-policy and off-policy methods. Furthermore, we demonstrate that, in contrast to other off-policy algorithms, our approach is very stable, achieving very similar performance across different random seeds.
Submitted 8 August, 2018; v1 submitted 4 January, 2018;
originally announced January 2018.
-
A Pulsational Time-evolution Study for the $δ$ Scuti Star AN Lyncis
Authors:
A. -Y. Zhou,
Eric G. Hintz,
Jeremy N. Schoonmaker,
Eloy Rodríguez,
Victor Costa,
M. J. Lopez-Gonzalez,
Horace A. Smith,
Nathan Sanders,
Gerold Monninger,
Lienhard Pagel
Abstract:
This paper presents a large number of observations of the $δ$ Scuti star AN Lyncis carried out in 2001--2012. The extensive observations include two tri-continent campaigns coordinated in 2002 and 2011, respectively, and several single-site contributions throughout the period. The data in total comprise more than 104100 raw CCD frames and photoelectric records, covering 165 nights (about 816 hours) over a span of 3778 days. The final reduced light curves have more than 26500 data points (including 3462 unpublished BYU data points), from which we determined 306 new timings of maximum light. A time-dependent behaviour study based on all available data indicates cyclic amplitude variability as well as period change [for the main periodicity]. Orbital sinusoid fittings to the $(O-C)$ residuals and pulsation amplitudes suggest that their variations may be caused by the light-time effect of AN Lyn in a binary system. The orbital period is about 26--30 years. Current results support the binarity of AN Lyn, first suspected by Zhou (2002). We further show the detailed time evolution structure of the pulsation of AN Lyn as a function of both time and period through wavelet analyses.
Submitted 11 October, 2017;
originally announced October 2017.
-
Low temperature electroweak phase transition in the Standard Model with hidden scale invariance
Authors:
Suntharan Arunasalam,
Archil Kobakhidze,
Cyril Lagger,
Shelley Liang,
Albert Zhou
Abstract:
We discuss a cosmological phase transition within the Standard Model which incorporates spontaneously broken scale invariance as a low-energy theory. In addition to the Standard Model fields, the minimal model involves a light dilaton, which acquires a large vacuum expectation value (VEV) through the mechanism of dimensional transmutation. Under the assumption of the cancellation of the vacuum energy, the dilaton develops a very small mass at 2-loop order. As a result, a flat direction is present in the classical dilaton-Higgs potential at zero temperature while the quantum potential admits two (almost) degenerate local minima with unbroken and broken electroweak symmetry. We found that the cosmological electroweak phase transition in this model can only be triggered by a QCD chiral symmetry breaking phase transition at low temperatures, $T\lesssim 132$ MeV. Furthermore, unlike the standard case, the universe settles into the chiral symmetry breaking vacuum via a first-order phase transition which gives rise to a stochastic gravitational background with a peak frequency $\sim 10^{-8}$ Hz as well as triggers the production of approximately solar mass primordial black holes. The observation of these signatures of cosmological phase transitions together with the detection of a light dilaton would provide a strong hint of the fundamental role of scale invariance in particle physics.
Submitted 29 September, 2017;
originally announced September 2017.
-
A Mathematical Aspect of Hohenberg-Kohn Theorem
Authors:
Aihui Zhou
Abstract:
The Hohenberg-Kohn theorem plays a fundamental role in density functional theory, which has become a basic tool for the study of electronic structure of matter. In this article, we study the Hohenberg-Kohn theorem for a class of external potentials based on a unique continuation principle.
Submitted 20 September, 2017;
originally announced September 2017.
-
Preselection via Classification: A Case Study on Evolutionary Multiobjective Optimization
Authors:
Jinyuan Zhang,
Aimin Zhou,
Ke Tang,
Guixu Zhang
Abstract:
In evolutionary algorithms, a preselection operator aims to select the promising offspring solutions from a candidate offspring set. It is usually based on the estimated or real objective values of the candidate offspring solutions. In a sense, the preselection can be treated as a classification procedure, which classifies the candidate offspring solutions into promising ones and unpromising ones. Following this idea, we propose a classification based preselection (CPS) strategy for evolutionary multiobjective optimization. When applying classification based preselection, an evolutionary algorithm maintains two external populations (training data set) that consist of some selected good and bad solutions found so far; then it trains a classifier based on the training data set in each generation. Finally it uses the classifier to filter the unpromising candidate offspring solutions and choose a promising one from the generated candidate offspring set for each parent solution. In such cases, it is not necessary to estimate or evaluate the objective values of the candidate offspring solutions. The classification based preselection is applied to three state-of-the-art multiobjective evolutionary algorithms (MOEAs) and is empirically studied on two sets of test instances. The experimental results suggest that classification based preselection can successfully improve the performance of these MOEAs.
Submitted 3 August, 2017;
originally announced August 2017.
-
WAND: A 128-channel, closed-loop, wireless artifact-free neuromodulation device
Authors:
Andy Zhou,
Samantha R. Santacruz,
Benjamin C. Johnson,
George Alexandrov,
Ali Moin,
Fred L. Burghardt,
Jan M. Rabaey,
Jose M. Carmena,
Rikky Muller
Abstract:
Closed-loop neuromodulation systems aim to treat a variety of neurological conditions by dynamically delivering and adjusting therapeutic electrical stimulation in response to a patient's neural state, recorded in real-time. Existing systems are limited by low channel counts, lack of algorithmic flexibility, and distortion of recorded signals from large, persistent stimulation artifacts. Here, we describe a device that enables new research applications requiring high-throughput data streaming, low-latency biosignal processing, and truly simultaneous sensing and stimulation. The Wireless Artifact-free Neuromodulation Device (WAND) is a miniaturized, wireless neural interface capable of recording and stimulating on 128 channels with on-board processing to fully cancel stimulation artifacts, detect neural biomarkers, and automatically adjust stimulation parameters in a closed-loop fashion. It combines custom application specific integrated circuits (ASICs), an on-board FPGA, and a low-power bidirectional radio. We validate wireless, long-term recordings of local field potentials (LFP) and real-time cancellation of stimulation artifacts in a behaving nonhuman primate (NHP). We use WAND to demonstrate a closed-loop stimulation paradigm to disrupt movement preparatory activity during a delayed-reach task in a NHP in vivo. This wireless device, leveraging custom ASICs for both neural recording and electrical stimulation modalities, makes possible a neural interface platform technology to significantly advance both neuroscientific discovery and preclinical investigations of stimulation-based therapeutic interventions.
Submitted 29 May, 2018; v1 submitted 1 August, 2017;
originally announced August 2017.
-
The Variability and Period Analysis for the BL Lac AO 0235+164
Authors:
J. H. Fan,
O. Kurtanidze,
Y. Liu,
X. Liu,
J. H. Yang,
G. M. Richter,
M. G. Nikolashvili,
S. O. Kurtanidze,
H. T. Wang,
M. Sasada,
A. Y. Zhou,
C. Lin,
Y. H. Yuan,
Y. T. Zhang,
D. Constantin
Abstract:
Variability is one of the extreme observational properties of BL Lacertae objects. AO 0235+164 is a BL Lac object that has been well studied across the whole electromagnetic spectrum. In the present work, we present optical R-band photometric observations carried out from Nov. 2006 to Dec. 2012 using the Ap6E CCD camera attached to the primary focus of the 70-cm meniscus telescope at Abastumani Observatory, Georgia. The source shows a large variation of $ΔR$ = 4.88 mag (14.19 - 19.07 mag) and a short time scale of $ΔT_v$ = 73.5 min during our monitoring period. From Dec. 2006 to Nov. 2009, we made radio observations of the source using the 25-m radio telescope at Xinjiang Astronomical Observatory. Applying a discrete correlation function (DCF) to the optical and radio observations, we find that the optical variation leads the radio variation by 23.2$\pm$12.9 days.
Submitted 18 February, 2017;
originally announced February 2017.
-
A parallel orbital-updating based plane-wave basis method for electronic structure calculations
Authors:
Yan Pan,
Xiaoying Dai,
Stefano de Gironcoli,
Xin-Gao Gong,
Gian-Marco Rignanese,
Aihui Zhou
Abstract:
Motivated by the recently proposed parallel orbital-updating approach in the real-space method, we propose a parallel orbital-updating based plane-wave basis method for solving the eigenvalue problems that arise in electronic structure calculations. In addition, we propose two new modified parallel orbital-updating methods. Compared to the traditional plane-wave methods, our methods allow for two-level parallelization, which is particularly interesting for large scale parallelization. Numerical experiments show that these new methods are more reliable and efficient for large scale calculations on modern supercomputers.
Submitted 13 February, 2017;
originally announced February 2017.
-
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Authors:
Aojun Zhou,
Anbang Yao,
Yiwen Guo,
Lin Xu,
Yurong Chen
Abstract:
This paper presents incremental network quantization (INQ), a novel method that aims to efficiently convert any pre-trained full-precision convolutional neural network (CNN) model into a low-precision version whose weights are constrained to be either powers of two or zero. Unlike existing methods, which suffer from noticeable accuracy loss, INQ has the potential to resolve this issue thanks to two innovations. On one hand, we introduce three interdependent operations, namely weight partition, group-wise quantization and re-training. A well-proven measure is employed to divide the weights in each layer of a pre-trained CNN model into two disjoint groups. The weights in the first group form a low-precision base, so they are quantized by a variable-length encoding method. The weights in the other group compensate for the accuracy loss from the quantization, so they are the ones to be re-trained. On the other hand, these three operations are repeated on the latest re-trained group in an iterative manner until all the weights are converted into low-precision ones, acting as an incremental network quantization and accuracy enhancement procedure. Extensive experiments on the ImageNet classification task using almost all known deep CNN architectures, including AlexNet, VGG-16, GoogleNet and ResNets, testify to the efficacy of the proposed method. Specifically, at 5-bit quantization, our models have better accuracy than the 32-bit floating-point references. Taking ResNet-18 as an example, we further show that our quantized models with 4-bit, 3-bit and 2-bit ternary weights have improved or very similar accuracy compared with the 32-bit floating-point baseline. Besides, impressive results with the combination of network pruning and INQ are also reported. The code is available at https://github.com/Zhouaojun/Incremental-Network-Quantization.
Submitted 25 August, 2017; v1 submitted 9 February, 2017;
originally announced February 2017.
-
Dynamic Task Allocation for Crowdsourcing Settings
Authors:
Angela Zhou,
Irineo Cabreros,
Karan Singh
Abstract:
We consider the problem of optimal budget allocation for crowdsourcing problems, allocating users to tasks to maximize our final confidence in the crowdsourced answers. Such an optimized worker assignment method allows us to boost the efficacy of any popular crowdsourcing estimation algorithm. We consider a mutual information interpretation of the crowdsourcing problem, which leads to a stochastic subset selection problem with a submodular objective function. We present experimental simulation results which demonstrate the effectiveness of our dynamic task allocation method for achieving higher accuracy, possibly requiring fewer labels, as well as improving upon a previous method which is sensitive to the proportion of users to questions.
Submitted 25 February, 2017; v1 submitted 30 January, 2017;
originally announced January 2017.
-
Complementarity via error-free measurement in a two-path interferometer
Authors:
Yanjun Liu,
Jing Lu,
and Lan Zhou
Abstract:
We study both the wave-like and particle-like behavior in a general Mach-Zehnder interferometer with an asymmetric beam splitter. An error-free measurement in the detector is used to extract the which-path information. The fringe visibility $V$ and the which-path information $I_{\rm path}$ are derived; their complementary relation $V + I_{\rm path} \leq 1$ is found, and the condition for equality is also presented.
Submitted 1 November, 2016;
originally announced November 2016.
-
Parallel Stream Processing Against Workload Skewness and Variance
Authors:
Junhua Fang,
Rong Zhang,
Tom Z. J. Fu,
Zhenjie Zhang,
Aoying Zhou,
Junhua Zhu
Abstract:
Key-based workload partitioning is a common strategy used in parallel stream processing engines, enabling effective key-value tuple distribution over worker threads in a logical operator. While randomized hashing on the keys is capable of balancing the workload for key-based partitioning when the keys generally follow a static distribution, it is likely to generate poor balancing performance when workload variance occurs on the incoming data stream. This paper presents a new key-based workload partitioning framework, with practical algorithms to support dynamic workload assignment for stateful operators. The framework combines hash-based and explicit key-based routing strategies for workload distribution: it specifies the destination worker threads for a handful of keys and assigns the other keys with the hashing function. When short-term distribution fluctuations occur in the incoming data stream, the system adaptively updates the routing table containing the chosen keys, in order to rebalance the workload with minimal migration overhead within the stateful operator. We formulate the rebalance operation as an optimization problem, with multiple objectives: minimizing state migration costs, controlling the size of the routing table and reducing workload imbalance among worker threads. Despite the NP-hardness of the optimization formulation, we carefully investigate and justify the heuristics behind key (re)routing and state migration, to facilitate fast response to workload variance with negligible cost to the normal processing in the distributed system. Empirical studies on synthetic data and real-world stream applications validate the usefulness of our proposals and demonstrate a large advantage of our approaches over state-of-the-art solutions in the literature.
Submitted 13 December, 2016; v1 submitted 17 October, 2016;
originally announced October 2016.
-
The Spectral Energy Distributions of Fermi Blazars
Authors:
J. H. Fan,
J. H. Yang,
Y. Liu,
G. Y. Luo,
C. Lin,
Y. H. Yuan,
H. B. Xiao,
A. Y. Zhou,
T. X. Hua,
Z. Y. Pei
Abstract:
(Abridged) In this paper, multi-wavelength data are compiled for a sample of 1425 Fermi blazars to calculate their spectral energy distributions (SEDs). A parabolic function, $\log(νF_ν) = P_1(\log ν - P_2)^2 + P_3$, is used for SED fitting. Synchrotron peak frequency ($\log ν_{\rm p}$), spectral curvature ($P_1$), peak flux ($ν_{\rm p}F_{\rm ν_p}$), and integrated flux ($νF_ν$) are successfully obtained for 1392 blazars (461 flat spectrum radio quasars-FSRQs, 620 BL Lacs-BLs and 311 blazars of uncertain type-BCUs; 999 sources have known redshifts). Monochromatic luminosity at radio 1.4 GHz, optical R band, X-ray at 1 keV and $γ$-ray at 1 GeV, peak luminosity, integrated luminosity and effective spectral indexes of radio to optical ($α_{\rm RO}$), and optical to X-ray ($α_{\rm OX}$) are calculated. The "Bayesian classification" is applied to $\log ν_{\rm p}$ in the rest frame for 999 blazars with available redshift, and the results show that 3 components are enough to fit the $\log ν_{\rm p}$ distribution; there is no ultra high peaked subclass. Based on the 3 components, the subclasses of blazars using the acronyms of Abdo et al. (2010a) are classified, and some mutual correlations are also studied. Conclusions are finally drawn as follows: (1) SEDs are successfully obtained for 1392 blazars. The fitted peak frequencies are compared with common sources from samples available (Sambruna et al. 1996, Nieppola et al. 2006, 2008, Abdo et al. 2010a). (2) Blazars are classified as low synchrotron peak sources (LSPs) if $\log ν_{\rm p}$(Hz) $\leq 14.0$, intermediate synchrotron peak sources (ISPs) if $14.0 < \log ν_{\rm p}$(Hz) $\leq 15.3$, and high synchrotron peak sources (HSPs) if $\log ν_{\rm p}$(Hz) $> 15.3$. (3) $γ$-ray emissions are strongly correlated with radio emissions. (...)
Submitted 13 August, 2016;
originally announced August 2016.
-
Existence of a calibrated regime switching local volatility model and new fake Brownian motions
Authors:
Benjamin Jourdain,
Alexandre Zhou
Abstract:
By Gyongy's theorem, a local and stochastic volatility (LSV) model is calibrated to the market prices of all European call options with positive maturities and strikes if its local volatility function is equal to the ratio of the Dupire local volatility function over the root conditional mean square of the stochastic volatility factor given the spot value. This leads to an SDE nonlinear in the sense of McKean. Particle methods based on a kernel approximation of the conditional expectation, as presented by Guyon and Henry-Labordère (2011), provide an efficient calibration procedure even if some calibration errors may appear when the range of the stochastic volatility factor is very large. But so far, no global existence result is available for the SDE nonlinear in the sense of McKean. In the particular case where the local volatility function is equal to the inverse of the root conditional mean square of the stochastic volatility factor multiplied by the spot value given this value and the interest rate is zero, the solution to the SDE is a fake Brownian motion. When the stochastic volatility factor is a constant (over time) random variable taking finitely many values and the range of its square is not too large, we prove existence for the associated Fokker-Planck equation. Thanks to Figalli (2008), we then deduce existence of a new class of fake Brownian motions. We then extend these results to the special case of the LSV model called regime switching local volatility, where the stochastic volatility factor is a jump process taking finitely many values and with jump intensities depending on the spot level. Under the same condition on the range of its square, we prove existence for the associated Fokker-Planck PDE. Finally, we deduce existence of the calibrated model by extending the results in Figalli (2008).
Submitted 20 January, 2017; v1 submitted 30 June, 2016;
originally announced July 2016.
-
Learning from Non-Stationary Stream Data in Multiobjective Evolutionary Algorithm
Authors:
Jianyong Sun,
Hu Zhang,
Aimin Zhou,
Qingfu Zhang
Abstract:
Evolutionary algorithms (EAs) have been well acknowledged as a promising paradigm for solving optimisation problems with multiple conflicting objectives in the sense that they are able to locate a set of diverse approximations of Pareto optimal solutions in a single run. EAs drive the search for approximated solutions through maintaining a diverse population of solutions and by recombining promising solutions selected from the population. Combining machine learning techniques has shown great potential since the intrinsic structure of the Pareto optimal solutions of a multiobjective optimisation problem can be learned and used to guide effective recombination. However, existing multiobjective EAs (MOEAs) based on structure learning spend too many computational resources on learning. To address this problem, we propose to use an online learning scheme. Based on the fact that offspring along evolution are streamy, dependent and non-stationary (which implies that the intrinsic structure, if any, is temporal and scale-variant), an online agglomerative clustering algorithm is applied to adaptively discover the intrinsic structure of the Pareto optimal solution set and to guide effective offspring recombination. Experimental results have shown significant improvement over five state-of-the-art MOEAs on a set of well-known benchmark problems with complicated Pareto sets and complex Pareto fronts.
Submitted 16 June, 2016;
originally announced June 2016.
-
A conjugate gradient method for electronic structure calculations
Authors:
Xiaoying Dai,
Zhuang Liu,
Liwei Zhang,
Aihui Zhou
Abstract:
In this paper, we study a conjugate gradient method for electronic structure calculations. We propose a Hessian based step size strategy, which together with three orthogonality approaches yields three algorithms for computing the ground state energy of atomic and molecular systems. Under some mild assumptions, we prove that our algorithms converge locally. It is shown by our numerical experiments that the conjugate gradient method is efficient.
Submitted 29 August, 2017; v1 submitted 28 January, 2016;
originally announced January 2016.
-
Tensor Eigenvalue Complementarity Problems
Authors:
Jinyan Fan,
Jiawang Nie,
Anwa Zhou
Abstract:
This paper studies tensor eigenvalue complementarity problems. Basic properties of standard and complementarity tensor eigenvalues are discussed. We formulate tensor eigenvalue complementarity problems as constrained polynomial optimization. When one tensor is strictly copositive, the complementarity eigenvalues can be computed by solving polynomial optimization with normalization by strict copositivity. When no tensor is strictly copositive, we formulate the tensor eigenvalue complementarity problem equivalently as polynomial optimization by a randomization process. The complementarity eigenvalues can be computed sequentially. The formulated polynomial optimization can be solved by Lasserre's hierarchy of semidefinite relaxations. We show that it has finite convergence for general tensors. Numerical experiments are presented to show the efficiency of proposed methods.
Submitted 29 May, 2017; v1 submitted 20 January, 2016;
originally announced January 2016.
-
A Parallel Orbital-updating Based Optimization Method for Electronic Structure Calculations
Authors:
Xiaoying Dai,
Zhuang Liu,
Xin Zhang,
Aihui Zhou
Abstract:
In this paper, we propose a parallel optimization method for electronic structure calculations based on a single orbital-updating approximation. It is shown by our numerical experiments that the method is efficient and reliable for atomic and molecular systems of large scale over supercomputers.
Submitted 19 November, 2015; v1 submitted 25 October, 2015;
originally announced October 2015.
-
Diagnostics From Three Rising Submillimeter Bursts
Authors:
Ai-Hua Zhou,
Jian-Ping Li,
Xin-Dong Wang
Abstract:
In this paper we investigate three novel rising submillimeter (THz) bursts that occurred sequentially in the super-active region NOAA 10486. The average rising rate of the flux density above 200 GHz is only 20 sfu/GHz (corresponding to a spectral index $α$ of 1.6) for the THz spectral components of the 2003 October 28 and November 4 bursts, while it can attain values of 235 sfu/GHz ($α$=4.8) for the 2003 November 2 burst. Based on our numerical simulations of burst spectra in the magnetic dipole field case, the steeply rising THz spectrum can be produced via gyrosynchrotron (GS) radiation by a population of highly relativistic electrons with a low-energy cutoff of 1 MeV, while the two slowly rising THz bursts only require a low-energy cutoff of 30 keV. The electron density variation is much larger in the THz source than in the microwave (MW) one. It is interesting that the THz source radius decreased by 20--50$\%$ during the decay phase for the three events, but the MW one increased by 28$\%$ for the 2003 November 2 event. We present a formula for the energy released by ultrarelativistic electrons that accounts for the relativistic correction for the first time. We find that the energy released by energetic electrons in the THz source exceeds that in the microwave one due to the strong GS radiation loss in the THz range, although the modeled THz source area is 3--4 orders of magnitude smaller than the modeled MW one. The total energies released by energetic electrons via the GS radiation in radio sources are estimated to be $5.2\times10^{33}$, $3.9\times10^{33}$ and $3.7\times10^{32}$ erg for the October 28, November 2 and November 4 bursts, respectively, which are 131, 76 and 4 times as large as the thermal energies of $2.9\times10^{31}$, $2.1\times10^{31}$ and $5.2\times10^{31}$ erg estimated from the soft X-ray GOES observations.
Submitted 13 September, 2015;
originally announced September 2015.
-
Spin orbit coupling controlled spin pumping effect
Authors:
L. Ma,
H. A. Zhou,
L. Wang,
X. L. Fan,
W. J. Fan,
D. S. Xue,
K. Xia,
G. Y. Guo,
S. M. Zhou
Abstract:
Effective spin mixing conductance (ESMC) across the nonmagnetic metal (NM)/ferromagnet interface, spin Hall conductivity (SHC) and spin diffusion length (SDL) in the NM layer govern the functionality and performance of pure spin current devices with spin pumping technique. We show that all three parameters can be tuned significantly by the spin orbit coupling (SOC) strength of the NM layer in systems consisting of ferromagnetic insulating Y3Fe5O12 layer and metallic Pd1-xPtx layer. Surprisingly, the ESMC is observed to increase significantly with x changing from 0 to 1.0. The SHC in PdPt alloys, dominated by the intrinsic term, is enhanced notably with increasing x. Meanwhile, the SDL is found to decrease when Pd atoms are replaced by heavier Pt atoms, validating the SOC induced spin flip scattering model in polyvalent PdPt alloys. The capabilities of both spin current generation and spin charge conversion are largely heightened via the SOC. These findings highlight the multifold tuning effects of the SOC in developing the new generation of spintronic devices.
Submitted 17 September, 2015; v1 submitted 3 August, 2015;
originally announced August 2015.
-
Study of Temporal Evolution of Emission Spectrum in a Steeply Rising Submillimeter Burst
Authors:
J. P. Li,
A. H. Zhou,
X. D. Wang
Abstract:
In this paper the spectral temporal evolution of a steeply rising submillimeter (THz) burst that occurred on 2003 November 2 is investigated in detail for the first time. Observations show that the flux density of the THz spectrum increased steeply with frequency above 200 GHz. Its average rising rate reached a value of 235 sfu/GHz (corresponding to a spectral index $α$ of 4.8) during the burst. The flux densities reached about 4,000 and 70,000 sfu at 212 and 405 GHz at maximum phase, respectively. The emission at 405 GHz remained at a continuously high level, largely exceeding the peak values of the microwave (MW) spectra during the main phase. Our studies, based on numerical simulations of burst spectra in the nonuniform magnetic field case, suggest that only energetic electrons with a low-energy cutoff of $\sim$1 MeV and number density of $\sim$$10^{6}$--$10^{8}$ cm$^{-3}$ can produce such a strong and steeply rising THz component via gyrosynchrotron (GS) radiation. The electron number density $N$, derived from our numerical fits to the THz temporal evolution spectra, increased substantially from $8\times10^{6}$ to $4\times10^{8}$ cm$^{-3}$, i.e., the $N$ value increased 50 times during the rise phase. During the decay phase it decreased to $7\times10^{7}$ cm$^{-3}$, i.e., decreased about five times from the maximum phase. The total electron number decreased by an order of magnitude from the maximum phase to the decay phase. Nevertheless the variation amplitude of $N$ is only about one time in the MW emission source during this burst, and the total electron number did not decrease but increased by about 20$\%$ during the decay phase. Interestingly, we find that the THz source radius decreased by about 24$\%$ while the MW source radius, on the contrary, increased by 28$\%$ during the decay phase.
Submitted 17 June, 2015;
originally announced June 2015.
-
Hybrid Pulsators -- Pulsating Stars with Multiple Identities
Authors:
A. -Y. Zhou
Abstract:
We have carried out a statistical survey of pulsating variable stars with multiple identities. These stars have been identified in the literature as exhibiting two types of pulsation or multiple light variability types, and are usually called hybrid pulsators. We extracted the hybrid information from the Simbad database; in fact, all the variables with multiple identities are retrieved. The survey covers various pulsating stars across the Hertzsprung-Russell diagram. We aim to provide guidance for selecting interesting targets for further observation. Hybrid pulsators are excellent targets for asteroseismology. An important implication of such stars is their potential in advancing the theories of both stellar evolution and pulsation. By presenting the statistics, we address the open questions and prospects regarding the current status of hybrid pulsation studies.
Submitted 21 January, 2015;
originally announced January 2015.
-
Completely positive tensor decomposition
Authors:
Jinyan Fan,
Anwa Zhou
Abstract:
A symmetric tensor, which has a symmetric nonnegative decomposition, is called a completely positive tensor. We consider the completely positive tensor decomposition problem. A semidefinite algorithm is presented for checking whether a symmetric tensor is completely positive. If it is not completely positive, a certificate for it can be obtained; if it is completely positive, a nonnegative decomposition can be obtained.
Submitted 19 November, 2014;
originally announced November 2014.
-
The CP-matrix Approximation Problem
Authors:
Jinyan Fan,
Anwa Zhou
Abstract:
A symmetric matrix $A$ is completely positive (CP) if there exists an entrywise nonnegative matrix $V$ such that $A = V V^T$. In this paper, we study the CP-matrix approximation problem of projecting a matrix onto the intersection of a set of linear constraints and the cone of CP matrices. We formulate the problem as a linear optimization problem with the norm cone and the cone of moments. A semidefinite algorithm is presented for the problem. A CP-decomposition of the projection matrix can also be obtained if the problem is feasible.
Submitted 4 November, 2014;
originally announced November 2014.