-
A restricted memory quasi-Newton bundle method for nonsmooth optimization on Riemannian manifolds
Authors:
Chunming Tang,
Shajie Xing,
Wen Huang,
**bao Jian
Abstract:
In this paper, a restricted memory quasi-Newton bundle method for minimizing a locally Lipschitz function over a Riemannian manifold is proposed, which extends the classical one in Euclidean spaces to the manifold setting. The curvature information of the objective function is approximated by applying the Riemannian version of the quasi-Newton updating formulas. The subgradient aggregation techniq…
▽ More
In this paper, a restricted memory quasi-Newton bundle method for minimizing a locally Lipschitz function over a Riemannian manifold is proposed, which extends the classical one in Euclidean spaces to the manifold setting. The curvature information of the objective function is approximated by applying the Riemannian version of the quasi-Newton updating formulas. The subgradient aggregation technique is used to avoid solving the time-consuming quadratic programming subproblem when calculating the candidate descent direction. Moreover, a new Riemannian line search procedure is proposed to generate the stepsizes, and the process is finitely terminated under a new version of the Riemannian semismooth assumption. Global convergence of the proposed method is established: if the serious iteration steps are finite, then the last serious iterate is stationary; otherwise, every accumulation point of the serious iteration sequence is stationary. Finally, some preliminary numerical results show that the proposed method is efficient.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Toward a Team of AI-made Scientists for Scientific Discovery from Gene Expression Data
Authors:
Haoyang Liu,
Yijiang Li,
**glin Jian,
Yuxuan Cheng,
Jianrong Lu,
Shuyi Guo,
**glei Zhu,
Mianchen Zhang,
Miantong Zhang,
Haohan Wang
Abstract:
Machine learning has emerged as a powerful tool for scientific discovery, enabling researchers to extract meaningful insights from complex datasets. For instance, it has facilitated the identification of disease-predictive genes from gene expression data, significantly advancing healthcare. However, the traditional process for analyzing such datasets demands substantial human effort and expertise…
▽ More
Machine learning has emerged as a powerful tool for scientific discovery, enabling researchers to extract meaningful insights from complex datasets. For instance, it has facilitated the identification of disease-predictive genes from gene expression data, significantly advancing healthcare. However, the traditional process for analyzing such datasets demands substantial human effort and expertise for the data selection, processing, and analysis. To address this challenge, we introduce a novel framework, a Team of AI-made Scientists (TAIS), designed to streamline the scientific discovery pipeline. TAIS comprises simulated roles, including a project manager, data engineer, and domain expert, each represented by a Large Language Model (LLM). These roles collaborate to replicate the tasks typically performed by data scientists, with a specific focus on identifying disease-predictive genes. Furthermore, we have curated a benchmark dataset to assess TAIS's effectiveness in gene identification, demonstrating our system's potential to significantly enhance the efficiency and scope of scientific exploration. Our findings represent a solid step towards automating scientific discovery through large language models.
△ Less
Submitted 20 February, 2024; v1 submitted 15 February, 2024;
originally announced February 2024.
-
Criteria for nilpotency of fusion systems
Authors:
Jie Jian,
Jun Liao,
Heguo Liu
Abstract:
Let $p$ be an odd prime and let $\mathcal{F}$ be a fusion system over a finite $p$-group $P$. A fusion system $\mathcal{F}$ is said to be nilpotent if $\mathcal{F}=\mathcal{F}_{P}(P)$. In this paper we provide new criteria for saturated fusion systems $\mathcal{F}$ to be nilpotent, which can be viewed as extension of the $p$-nilpotency theorem of Glauberman and Thompson for fusion systems attribut…
▽ More
Let $p$ be an odd prime and let $\mathcal{F}$ be a fusion system over a finite $p$-group $P$. A fusion system $\mathcal{F}$ is said to be nilpotent if $\mathcal{F}=\mathcal{F}_{P}(P)$. In this paper we provide new criteria for saturated fusion systems $\mathcal{F}$ to be nilpotent, which can be viewed as extension of the $p$-nilpotency theorem of Glauberman and Thompson for fusion systems attributed to Kessar and Linckelmann.
△ Less
Submitted 18 February, 2024;
originally announced February 2024.
-
FDA-MIMO-based Integrated Sensing and Communication System with Frequency Offset Permutation Index Modulation
Authors:
Jiangwei Jian,
Qimao Huang,
Bang Huang,
Wen-Qin Wang
Abstract:
Considering that frequency diverse array multiple-input multiple-output (FDA-MIMO) possesses extra range information to enhance sensing performance, this paper explores the FDA-MIMO-based integrated sensing and communication (ISAC) system. To reinforce the system communication capability, we propose the frequency offset permutation index modulation (FOPIM) scheme, which conveys extra information b…
▽ More
Considering that frequency diverse array multiple-input multiple-output (FDA-MIMO) possesses extra range information to enhance sensing performance, this paper explores the FDA-MIMO-based integrated sensing and communication (ISAC) system. To reinforce the system communication capability, we propose the frequency offset permutation index modulation (FOPIM) scheme, which conveys extra information bits by selecting and permutating frequency offsets from a frequency offsets pool. For the system communication sub-functionality, considering the fact that the traditional maximum likelihood detection method suffers from high complexity and bit error rate (BER), the maximum likelihood-based two-stage detection (MLTSD) approach is presented to overcome this issue. For the system sensing sub-function, we employ the two-step maximum likelihood estimator (TSMLE) to stepwise estimate the angle and range of the interested target. Furthermore, we derive the closed-form expressions for the tight upper bound on the communication BER, along with the sensing Cramér-Rao bound (CRB). The simulation results validate the theoretical analysis, demonstrating that the proposed system exhibits lower BER and superior range resolution than independent MIMO communication and MIMO sensing modules.
△ Less
Submitted 22 December, 2023;
originally announced December 2023.
-
Understanding the Impact of Seasonal Climate Change on Canada's Economy by Region and Sector
Authors:
Shiyu He,
Trang Bui,
Yuying Huang,
Wenling Zhang,
Jie Jian,
Samuel W. K. Wong,
Tony S. Wirjanto
Abstract:
To assess the impact of climate change on the Canadian economy, we investigate and model the relationship between seasonal climate variables and economic growth across provinces and economic sectors. We further provide projections of climate change impacts up to the year 2050, taking into account the diverse climate change patterns and economic conditions across Canada. Our results indicate that r…
▽ More
To assess the impact of climate change on the Canadian economy, we investigate and model the relationship between seasonal climate variables and economic growth across provinces and economic sectors. We further provide projections of climate change impacts up to the year 2050, taking into account the diverse climate change patterns and economic conditions across Canada. Our results indicate that rising Fall temperature anomalies have a notable adverse impact on Canadian economic growth. Province-wide, Saskatchewan and Manitoba are anticipated to experience the most substantial declines, whereas British Columbia and the Maritime provinces will be less impacted. Industry-wide, Mining is projected to see the greatest benefits, while Agriculture and Manufacturing are projected to have the most significant downturns. The disparities of climate change effects between provinces and industries highlight the need for governments to tailor their policies accordingly, and offer targeted assistance to regions and industries that are particularly vulnerable in the face of climate change. Targeted approaches to climate change mitigation are likely to be more effective than one-size-fits-all policies for the whole economy.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Restricted Tweedie Stochastic Block Models
Authors:
Jie Jian,
Mu Zhu,
Peijun Sang
Abstract:
The stochastic block model (SBM) is a widely used framework for community detection in networks, where the network structure is typically represented by an adjacency matrix. However, conventional SBMs are not directly applicable to an adjacency matrix that consists of non-negative zero-inflated continuous edge weights. To model the international trading network, where edge weights represent tradin…
▽ More
The stochastic block model (SBM) is a widely used framework for community detection in networks, where the network structure is typically represented by an adjacency matrix. However, conventional SBMs are not directly applicable to an adjacency matrix that consists of non-negative zero-inflated continuous edge weights. To model the international trading network, where edge weights represent trading values between countries, we propose an innovative SBM based on a restricted Tweedie distribution. Additionally, we incorporate nodal information, such as the geographical distance between countries, and account for its dynamic effect on edge weights. Notably, we show that given a sufficiently large number of nodes, estimating this covariate effect becomes independent of community labels of each node when computing the maximum likelihood estimator of parameters in our model. This result enables the development of an efficient two-step algorithm that separates the estimation of covariate effects from other parameters. We demonstrate the effectiveness of our proposed method through extensive simulation studies and an application to real-world international trading data.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
A Partially Feasible Distributed SQO Method for Two-block General Linearly Constrained Smooth Optimization
Authors:
**bao jian,
Wenrui Chen,
Chunming Tang,
Jianghua Yin
Abstract:
This paper discusses a class of two-block smooth large-scale optimization problems with both linear equality and linear inequality constraints, which have a wide range of applications, such as economic power dispatch, data mining, signal processing, etc.Our goal is to develop a novel partially feasible distributed (PFD) sequential quadratic optimization (SQO) method (PFD-SQO method) for this kind…
▽ More
This paper discusses a class of two-block smooth large-scale optimization problems with both linear equality and linear inequality constraints, which have a wide range of applications, such as economic power dispatch, data mining, signal processing, etc.Our goal is to develop a novel partially feasible distributed (PFD) sequential quadratic optimization (SQO) method (PFD-SQO method) for this kind of problems. The design of the method is based on the ideas of SQO method and augmented Lagrangian Jacobian splitting scheme as well as feasible direction method,which decomposes the quadratic optimization (QO) subproblem into two small-scale QOs that can be solved independently and parallelly. A novel disturbance contraction term that can be suitably adjusted is introduced into the inequality constraints so that the feasible step size along the search direction can be increased to 1. The new iteration points are generated by the Armijo line search and the partially augmented Lagrangian function that only contains equality constraints as the merit function. The iteration points always satisfy all the inequality constraints of the problem. The theoretical properties, such as global convergence, iterative complexity, superlinear and quadratic rates of convergence of the proposed PFD-SQO method are analyzed under appropriate assumptions, respectively. Finally, the numerical effectiveness of the method is tested on a class of academic examples and an economic power dispatch problem, which shows that the proposed method is quite promising.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
AffordPose: A Large-scale Dataset of Hand-Object Interactions with Affordance-driven Hand Pose
Authors:
Juntao Jian,
** Liu,
Manyi Li,
Ruizhen Hu,
Jian Liu
Abstract:
How human interact with objects depends on the functional roles of the target objects, which introduces the problem of affordance-aware hand-object interaction. It requires a large number of human demonstrations for the learning and understanding of plausible and appropriate hand-object interactions. In this work, we present AffordPose, a large-scale dataset of hand-object interactions with afford…
▽ More
How human interact with objects depends on the functional roles of the target objects, which introduces the problem of affordance-aware hand-object interaction. It requires a large number of human demonstrations for the learning and understanding of plausible and appropriate hand-object interactions. In this work, we present AffordPose, a large-scale dataset of hand-object interactions with affordance-driven hand pose. We first annotate the specific part-level affordance labels for each object, e.g. twist, pull, handle-grasp, etc, instead of the general intents such as use or handover, to indicate the purpose and guide the localization of the hand-object interactions. The fine-grained hand-object interactions reveal the influence of hand-centered affordances on the detailed arrangement of the hand poses, yet also exhibit a certain degree of diversity. We collect a total of 26.7K hand-object interactions, each including the 3D object shape, the part-level affordance label, and the manually adjusted hand poses. The comprehensive data analysis shows the common characteristics and diversity of hand-object interactions per affordance via the parameter statistics and contacting computation. We also conduct experiments on the tasks of hand-object affordance understanding and affordance-oriented hand-object interaction generation, to validate the effectiveness of our dataset in learning the fine-grained hand-object interactions. Project page: https://github.com/GentlesJan/AffordPose.
△ Less
Submitted 16 September, 2023;
originally announced September 2023.
-
Graphene/silicon heterojunction for reconfigurable phase-relevant activation function in coherent optical neural networks
Authors:
Chuyu Zhong,
Kun Liao,
Tianxiang Dai,
Maoliang Wei,
Hui Ma,
Jianghong Wu,
Zhibin Zhang,
Yuting Ye,
Ye Luo,
Zequn Chen,
Jialing Jian,
Chulei Sun,
Bo Tang,
Peng Zhang,
Ruonan Liu,
Junying Li,
Jianyi Yang,
Lan Li,
Kaihui Liu,
Xiaoyong Hu,
Hongtao Lin
Abstract:
Optical neural networks (ONNs) herald a new era in information and communication technologies and have implemented various intelligent applications. In an ONN, the activation function (AF) is a crucial component determining the network performances and on-chip AF devices are still in development. Here, we first demonstrate on-chip reconfigurable AF devices with phase activation fulfilled by dual-f…
▽ More
Optical neural networks (ONNs) herald a new era in information and communication technologies and have implemented various intelligent applications. In an ONN, the activation function (AF) is a crucial component determining the network performances and on-chip AF devices are still in development. Here, we first demonstrate on-chip reconfigurable AF devices with phase activation fulfilled by dual-functional graphene/silicon (Gra/Si) heterojunctions. With optical modulation and detection in one device, time delays are shorter, energy consumption is lower, reconfigurability is higher and the device footprint is smaller than other on-chip AF strategies. The experimental modulation voltage (power) of our Gra/Si heterojunction achieves as low as 1 V (0.5 mW), superior to many pure silicon counterparts. In the photodetection aspect, a high responsivity of over 200 mA/W is realized. Special nonlinear functions generated are fed into a complex-valued ONN to challenge handwritten letters and image recognition tasks, showing improved accuracy and potential of high-efficient, all-component-integration on-chip ONN. Our results offer new insights for on-chip ONN devices and pave the way to high-performance integrated optoelectronic computing circuits.
△ Less
Submitted 13 July, 2023;
originally announced July 2023.
-
Convergence Rate of LQG Mean Field Games with Common Noise
Authors:
Jiamin Jian,
Qingshuo Song,
Jiaxuan Ye
Abstract:
This paper focuses on exploring the convergence properties of a generic player's trajectory and empirical measures in an N-player Linear-Quadratic-Gaussian Nash game, where Brownian motion serves as the common noise. The study establishes three distinct convergence rates concerning the representative player and empirical measure. To investigate the convergence, the methodology relies on a specific…
▽ More
This paper focuses on exploring the convergence properties of a generic player's trajectory and empirical measures in an N-player Linear-Quadratic-Gaussian Nash game, where Brownian motion serves as the common noise. The study establishes three distinct convergence rates concerning the representative player and empirical measure. To investigate the convergence, the methodology relies on a specific decomposition of the equilibrium path in the N-player game and utilizes the associated Mean Field Game framework.
△ Less
Submitted 2 July, 2023;
originally announced July 2023.
-
A descent method for nonsmooth multiobjective optimization problems on Riemannian manifolds
Authors:
Chunming Tang,
Hao He,
**bao Jian,
Miantao Chao
Abstract:
In this paper, a descent method for nonsmooth multiobjective optimization problems on complete Riemannian manifolds is proposed. The objective functions are only assumed to be locally Lipschitz continuous instead of convexity used in existing methods. A necessary condition for Pareto optimality in Euclidean space is generalized to the Riemannian setting. At every iteration, an acceptable descent d…
▽ More
In this paper, a descent method for nonsmooth multiobjective optimization problems on complete Riemannian manifolds is proposed. The objective functions are only assumed to be locally Lipschitz continuous instead of convexity used in existing methods. A necessary condition for Pareto optimality in Euclidean space is generalized to the Riemannian setting. At every iteration, an acceptable descent direction is obtained by constructing a convex hull of some Riemannian $\varepsilon$-subgradients. And then a Riemannian Armijo-type line search is executed to produce the next iterate. The convergence result is established in the sense that a point satisfying the necessary condition for Pareto optimality can be generated by the algorithm in a finite number of iterations. Finally, some preliminary numerical results are reported, which show that the proposed method is efficient.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
The kernels of powers of linear operator via Weyr characteristic
Authors:
Jie Jian,
Jun Liao,
Heguo Liu
Abstract:
The adjoint of a matrix in the Lie algebra associated with a matrix algebra is a fundamental operator, which can be generalized to a more general operator $\varphi_{AB}: X\rightarrow AX-XB$ by two matrices $A$ and $B$. The kernel of the operator is very well-known and it can be found in Gantmacher's book. The formulas for the dimensions of the kernels of arbitrary powers of the operator…
▽ More
The adjoint of a matrix in the Lie algebra associated with a matrix algebra is a fundamental operator, which can be generalized to a more general operator $\varphi_{AB}: X\rightarrow AX-XB$ by two matrices $A$ and $B$. The kernel of the operator is very well-known and it can be found in Gantmacher's book. The formulas for the dimensions of the kernels of arbitrary powers of the operator $\varphi_{AB}$ were given in terms of the Segre characteristics of these two matrices by the second and third authors in this paper and their collaborators. This paper provides an alternative approach to this problem via the Weyr characteristic in a more essential method. We obtain formulas for the dimensions of the kernels of arbitrary powers of the operator in terms of the Weyr characteristics. Furthermore, the basis for kernel of each power of the operator is described explicitly. As a consequence, for arbitrary square matrices $A$ and $B$ over an algebraically closed field, the dimension of the kernel of each power of the operator $\varphi_{A-λI,B}$ for eigenvalues $λ$ of $\varphi_{AB}$ can be viewed as a similarity invariant of the operator $\varphi_{AB}$, so we characterise the operator within similarity, which should be of interest to a number of people (including physicists).
△ Less
Submitted 17 February, 2024; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Monotone Splitting SQP Algorithms for Two-block Nonconvex Optimization Problems with General Linear Constraints and Applications
Authors:
**bao Jian,
Guodong Ma,
Xiao Xu,
Daolan Han
Abstract:
In this work, based on the ideas of alternating direction method with multipliers (ADMM) and sequential quadratic programming (SQP), as well as Armijo line search technology, monotone splitting SQP algorithms for two-block nonconvex optimization problems with linear equality, inequality and box constraints are discussed. Firstly, the discussed problem is transformed into an optimization problem wi…
▽ More
In this work, based on the ideas of alternating direction method with multipliers (ADMM) and sequential quadratic programming (SQP), as well as Armijo line search technology, monotone splitting SQP algorithms for two-block nonconvex optimization problems with linear equality, inequality and box constraints are discussed. Firstly, the discussed problem is transformed into an optimization problem with only linear equality and box constraints by introducing slack variables. Secondly, we use the idea of ADMM to decompose the quadratic programming (QP) subproblem. Especially, the QP subproblem corresponding to the introducing slack variable is simple, and it has an explicit optimal solution without increasing computational cost. Thirdly, the search direction is generated by the optimal solutions of the subproblems, and the new iteration point is yielded by Armijo line search with augmented Lagrange function. And the global convergence of the algorithm is analyzed under weaker assumptions. In addition, box constraints are extended to general nonempty closed convex sets, moreover, the global convergence of the corresponding algorithm is also proved. Finally, some preliminary numerical experiments and applications in the mid-to-large-scale economic dispatch problems for power systems are reported, and these show that our proposed algorithm is promising.
△ Less
Submitted 30 January, 2023;
originally announced January 2023.
-
TIDAL: Topology-Inferred Drug Addiction Learning
Authors:
Zhu Zailiang,
Dou Bozheng,
Cao Yukang,
Jiang Jian,
Zhu Yueying,
Chen Dong,
Feng Hongsong,
Liu Jie,
Zhang Bengong,
Zhou Tianshou,
Wei Guowei
Abstract:
Drug addiction or drug overdose is a global public health crisis, and the design of anti-addiction drugs remains a major challenge due to intricate mechanisms. Since experimental drug screening and optimization are too time-consuming and expensive, there is urgent need to develop innovative artificial intelligence (AI) methods for addressing the challenge. We tackle this challenge by topology-infe…
▽ More
Drug addiction or drug overdose is a global public health crisis, and the design of anti-addiction drugs remains a major challenge due to intricate mechanisms. Since experimental drug screening and optimization are too time-consuming and expensive, there is urgent need to develop innovative artificial intelligence (AI) methods for addressing the challenge. We tackle this challenge by topology-inferred drug addiction learning (TIDAL) built from integrating topological Laplacian, deep bidirectional transformer, and ensemble-assisted neural networks (EANNs). The topological Laplacian is a novel algebraic topology tool that embeds molecular topological invariants and algebraic invariants into its harmonic spectra and non-harmonic spectra, respectively. These invariants complement sequence information extracted from a bidirectional transformer. We validate the proposed TIDAL framework on 22 drug addiction related, 4 hERG, and 12 DAT datasets, showing that TIDAL is a state-of-the-art framework for the modeling and analysis of drug addiction data. We carry out cross-target analysis of the current drug addiction candidates to alert their side effects and identify their repurposing potentials, revealing drugmediated linear and bilinear target correlations. Finally, TIDAL is applied to shed light on relative efficacy, repurposing potential, and potential side effects of 12 existing anti-addiction medications. Our results suggest that TIDAL provides a new computational strategy for pressingly-needed anti-substance addiction drug development.
△ Less
Submitted 2 December, 2022;
originally announced December 2022.
-
Syntactic Substitutability as Unsupervised Dependency Syntax
Authors:
Jasper Jian,
Siva Reddy
Abstract:
Syntax is a latent hierarchical structure which underpins the robust and compositional nature of human language. In this work, we explore the hypothesis that syntactic dependencies can be represented in language model attention distributions and propose a new method to induce these structures theory-agnostically. Instead of modeling syntactic relations as defined by annotation schemata, we model a…
▽ More
Syntax is a latent hierarchical structure which underpins the robust and compositional nature of human language. In this work, we explore the hypothesis that syntactic dependencies can be represented in language model attention distributions and propose a new method to induce these structures theory-agnostically. Instead of modeling syntactic relations as defined by annotation schemata, we model a more general property implicit in the definition of dependency relations, syntactic substitutability. This property captures the fact that words at either end of a dependency can be substituted with words from the same category. Substitutions can be used to generate a set of syntactically invariant sentences whose representations are then used for parsing. We show that increasing the number of substitutions used improves parsing accuracy on natural data. On long-distance subject-verb agreement constructions, our method achieves 79.5% recall compared to 8.9% using a previous method. Our method also provides improvements when transferred to a different parsing setup, demonstrating that it generalizes.
△ Less
Submitted 20 October, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Effects of network topology and trait distribution on collective decision making
Authors:
Pengyu Liu,
Jie Jian
Abstract:
Social networks play an important role in analyzing the impact of individual-level interactions on societal or economic outcomes. We model interactive decision making for a community of individuals with different traits, represented by a social network with trait-attributed nodes. We develop a deterministic process generating a sequence of choices for each individual based on a trait-attributed so…
▽ More
Social networks play an important role in analyzing the impact of individual-level interactions on societal or economic outcomes. We model interactive decision making for a community of individuals with different traits, represented by a social network with trait-attributed nodes. We develop a deterministic process generating a sequence of choices for each individual based on a trait-attributed social network, initial choices of individuals and a set of predetermined trait-dependent rules for making decisions. The object of interest is the sequence of cumulative sum of choices over all individuals, which we call the cumulative sequence and consider as an index of collective decisions. We observe that, in a time period, a cumulative sequence can be unpredictable or predictable showing a repeated pattern either escalating to an extreme or constantly oscillating. We consider that predictable cumulative sequences represent unstable collective decisions of communities either extremizing or internally conflicting, while unpredictable cumulative sequences show stable changes. We analyze the effects of network topology and trait distribution on the probability of cumulative sequences being predictable, escalating and oscillating by simulations. Our findings include that unstable collective decisions are more probable as network density increases, that centralized networks are more likely to have unstable collective decisions and that networks with excessively clustered or scattered conformists and rebels tend to produce unstable cumulative sequences. We discuss the potential of the model as a framework for studying individuals with different traits on a social network directly and indirectly interacting in decision making.
△ Less
Submitted 7 August, 2022;
originally announced August 2022.
-
Efficient Convolutional Neural Networks on Raspberry Pi for Image Classification
Authors:
Rui-Yang Ju,
Ting-Yu Lin,
Jia-Hao Jian,
Jen-Shiun Chiang
Abstract:
With the good performance of deep learning algorithms in the field of computer vision (CV), the convolutional neural network (CNN) architecture has become a main backbone of the computer vision task. With the widespread use of mobile devices, neural network models based on platforms with low computing power are gradually being paid attention. However, due to the limitation of computing power, deep…
▽ More
With the good performance of deep learning algorithms in the field of computer vision (CV), the convolutional neural network (CNN) architecture has become a main backbone of the computer vision task. With the widespread use of mobile devices, neural network models based on platforms with low computing power are gradually being paid attention. However, due to the limitation of computing power, deep learning algorithms are usually not available on mobile devices. This paper proposes a lightweight convolutional neural network, TripleNet, which can operate easily on Raspberry Pi. Adopted from the concept of block connections in ThreshNet, the newly proposed network model compresses and accelerates the network model, reduces the amount of parameters of the network, and shortens the inference time of each image while ensuring the accuracy. Our proposed TripleNet and other state-of-the-art (SOTA) neural networks perform image classification experiments with the CIFAR-10 and SVHN datasets on Raspberry Pi. The experimental results show that, compared with GhostNet, MobileNet, ThreshNet, EfficientNet, and HarDNet, the inference time of TripleNet per image is shortened by 15%, 16%, 17%, 24%, and 30%, respectively. The detail codes of this work are available at https://github.com/RuiyangJu/TripleNet.
△ Less
Submitted 19 November, 2022; v1 submitted 2 April, 2022;
originally announced April 2022.
-
Predicting the longevity of resources shared in scientific publications
Authors:
Daniel E. Acuna,
Jian Jian,
Tong Zeng,
Lizhen Liang,
Han Zhuang
Abstract:
Research has shown that most resources shared in articles (e.g., URLs to code or data) are not kept up to date and mostly disappear from the web after some years (Zeng et al., 2019). Little is known about the factors that differentiate and predict the longevity of these resources. This article explores a range of explanatory features related to the publication venue, authors, references, and where…
▽ More
Research has shown that most resources shared in articles (e.g., URLs to code or data) are not kept up to date and mostly disappear from the web after some years (Zeng et al., 2019). Little is known about the factors that differentiate and predict the longevity of these resources. This article explores a range of explanatory features related to the publication venue, authors, references, and where the resource is shared. We analyze an extensive repository of publications and, through web archival services, reconstruct how they looked at different time points. We discover that the most important factors are related to where and how the resource is shared, and surprisingly little is explained by the author's reputation or prestige of the journal. By examining the places where long-lasting resources are shared, we suggest that it is critical to disseminate and create standards with modern technologies. Finally, we discuss implications for reproducibility and recognizing scientific datasets as first-class citizens.
△ Less
Submitted 23 March, 2022;
originally announced March 2022.
-
Aggregated Pyramid Vision Transformer: Split-transform-merge Strategy for Image Recognition without Convolutions
Authors:
Rui-Yang Ju,
Ting-Yu Lin,
Jen-Shiun Chiang,
Jia-Hao Jian,
Yu-Shian Lin,
Liu-Rui-Yi Huang
Abstract:
With the achievements of Transformer in the field of natural language processing, the encoder-decoder and the attention mechanism in Transformer have been applied to computer vision. Recently, in multiple tasks of computer vision (image classification, object detection, semantic segmentation, etc.), state-of-the-art convolutional neural networks have introduced some concepts of Transformer. This p…
▽ More
With the achievements of Transformer in the field of natural language processing, the encoder-decoder and the attention mechanism in Transformer have been applied to computer vision. Recently, in multiple tasks of computer vision (image classification, object detection, semantic segmentation, etc.), state-of-the-art convolutional neural networks have introduced some concepts of Transformer. This proves that Transformer has a good prospect in the field of image recognition. After Vision Transformer was proposed, more and more works began to use self-attention to completely replace the convolutional layer. This work is based on Vision Transformer, combined with the pyramid architecture, using Split-transform-merge to propose the group encoder and name the network architecture Aggregated Pyramid Vision Transformer (APVT). We perform image classification tasks on the CIFAR-10 dataset and object detection tasks on the COCO 2017 dataset. Compared with other network architectures that use Transformer as the backbone, APVT has excellent results while reducing the computational cost. We hope this improved strategy can provide a reference for future Transformer research in computer vision.
△ Less
Submitted 2 March, 2022;
originally announced March 2022.
-
Two Gaussian regularization methods for time-varying networks
Authors:
Jie Jian,
Peijun Sang,
Mu Zhu
Abstract:
We model time-varying network data as realizations from multivariate Gaussian distributions with precision matrices that change over time. To facilitate parameter estimation, we require not only that each precision matrix at any given time point be sparse, but also that precision matrices at neighboring time points be similar. We accomplish this with two different algorithms, by generalizing the e…
▽ More
We model time-varying network data as realizations from multivariate Gaussian distributions with precision matrices that change over time. To facilitate parameter estimation, we require not only that each precision matrix at any given time point be sparse, but also that precision matrices at neighboring time points be similar. We accomplish this with two different algorithms, by generalizing the elastic net and the fused LASSO, respectively. Our main focuses are efficient computational algorithms and convenient degree-of-freedom formulae for choosing tuning parameters. We illustrate our methods with two simulation studies. By applying them to an fMRI data set, we also detect some interesting differences in brain connectivity between healthy individuals and ADHD patients.
△ Less
Submitted 9 March, 2022; v1 submitted 14 February, 2022;
originally announced February 2022.
-
Tunable Chirality-dependent Nonlinear Electrical Responses in 2D Tellurium
Authors:
Chang Niu,
Gang Qiu,
Yixiu Wang,
Pukun Tan,
Mingyi Wang,
Jie Jian,
Haiyan Wang,
Wenzhuo Wu,
Peide D. Ye
Abstract:
Tellurium (Te) is an elemental semiconductor with a simple chiral crystal structure. Te in a two-dimensional (2D) form synthesized by solution-based method shows excellent electrical, optical, and thermal properties. In this work, the chirality of hydrothermally grown 2D Te is identified and analyzed by hot sulfuric acid etching and high-angle tilted high-resolution scanning transmission electron…
▽ More
Tellurium (Te) is an elemental semiconductor with a simple chiral crystal structure. Te in a two-dimensional (2D) form synthesized by solution-based method shows excellent electrical, optical, and thermal properties. In this work, the chirality of hydrothermally grown 2D Te is identified and analyzed by hot sulfuric acid etching and high-angle tilted high-resolution scanning transmission electron microscopy. The gate-tunable nonlinear electrical responses, including the nonreciprocal electrical transport in the longitudinal direction and the nonlinear planar Hall effect in the transverse direction, are observed in 2D Te under a magnetic field. Moreover, the nonlinear electrical responses have opposite signs in left- and right-handed 2D Te due to the opposite spin polarizations ensured by the chiral symmetry. The fundamental relationship between the spin-orbit coupling and the crystal symmetry in two enantiomers provides a viable platform for realizing chirality-based electronic devices by introducing the chirality degree of freedom into electron transport.
△ Less
Submitted 12 September, 2023; v1 submitted 21 January, 2022;
originally announced January 2022.
-
ThreshNet: An Efficient DenseNet Using Threshold Mechanism to Reduce Connections
Authors:
Rui-Yang Ju,
Ting-Yu Lin,
Jia-Hao Jian,
Jen-Shiun Chiang,
Wei-Bin Yang
Abstract:
With the continuous development of neural networks for computer vision tasks, more and more network architectures have achieved outstanding success. As one of the most advanced neural network architectures, DenseNet shortcuts all feature maps to solve the model depth problem. Although this network architecture has excellent accuracy with low parameters, it requires an excessive inference time. To…
▽ More
With the continuous development of neural networks for computer vision tasks, more and more network architectures have achieved outstanding success. As one of the most advanced neural network architectures, DenseNet shortcuts all feature maps to solve the model depth problem. Although this network architecture has excellent accuracy with low parameters, it requires an excessive inference time. To solve this problem, HarDNet reduces the connections between the feature maps, making the remaining connections resemble harmonic waves. However, this compression method may result in a decrease in the model accuracy and an increase in the parameters and model size. This network architecture may reduce the memory access time, but its overall performance can still be improved. Therefore, we propose a new network architecture, ThreshNet, using a threshold mechanism to further optimize the connection method. Different numbers of connections for different convolution layers are discarded to accelerate the inference of the network. The proposed network has been evaluated with image classification using CIFAR 10 and SVHN datasets under platforms of NVIDIA RTX 3050 and Raspberry Pi 4. The experimental results show that, compared with HarDNet68, GhostNet, MobileNetV2, ShuffleNet, and EfficientNet, the inference time of the proposed ThreshNet79 is 5%, 9%, 10%, 18%, and 20% faster, respectively. The number of parameters of ThreshNet95 is 55% less than that of HarDNet85. The new model compression and model acceleration methods can speed up the inference time, enabling network models to operate on mobile devices.
△ Less
Submitted 7 August, 2022; v1 submitted 9 January, 2022;
originally announced January 2022.
-
The convergence rate of the equilibrium measure for the hybrid LQG Mean Field Game
Authors:
Jiamin Jian,
Peiyao Lai,
Qingshuo Song,
Jiaxuan Ye
Abstract:
In this work, we study the convergence rate of the $N$-player LQG game with a Markov chain common noise towards its asymptotic Mean Field Game. By postulating a Markovian structure via two auxiliary processes for the first and second moments of the Mean Field Game equilibrium and applying the fixed point condition in Mean Field Game, we first provide the characterization of the equilibrium measure…
▽ More
In this work, we study the convergence rate of the $N$-player LQG game with a Markov chain common noise towards its asymptotic Mean Field Game. By postulating a Markovian structure via two auxiliary processes for the first and second moments of the Mean Field Game equilibrium and applying the fixed point condition in Mean Field Game, we first provide the characterization of the equilibrium measure in Mean Field Game with a finite-dimensional Riccati system of ODEs. Additionally, with an explicit coupling of the optimal trajectory of the $N$-player game driven by $N$ dimensional Brownian motion and Mean Field Game counterpart driven by one-dimensional Brownian motion, we obtain the convergence rate $O(N^{-1/2})$ with respect to 2-Wasserstein distance.
△ Less
Submitted 28 August, 2023; v1 submitted 8 June, 2021;
originally announced June 2021.
-
Maximum Covering Subtrees for Phylogenetic Networks
Authors:
Nathan Davidov,
Amanda Hernandez,
Justin Jian,
Patrick McKenna,
K. A. Medlin,
Roadra Mojumder,
Megan Owen,
Andrew Quijano,
Amanda Rodriguez,
Katherine St. John,
Katherine Thai,
Meliza Uraga
Abstract:
Tree-based phylogenetic networks, which may be roughly defined as leaf-labeled networks built by adding arcs only between the original tree edges, have elegant properties for modeling evolutionary histories. We answer an open question of Francis, Semple, and Steel about the complexity of determining how far a phylogenetic network is from being tree-based, including non-binary phylogenetic networks…
▽ More
Tree-based phylogenetic networks, which may be roughly defined as leaf-labeled networks built by adding arcs only between the original tree edges, have elegant properties for modeling evolutionary histories. We answer an open question of Francis, Semple, and Steel about the complexity of determining how far a phylogenetic network is from being tree-based, including non-binary phylogenetic networks. We show that finding a phylogenetic tree covering the maximum number of nodes in a phylogenetic network can be be computed in polynomial time via an encoding into a minimum-cost maximum flow problem.
△ Less
Submitted 24 November, 2020; v1 submitted 25 September, 2020;
originally announced September 2020.
-
On the Graphon Mean Field Game Equations: Individual Agent Affine Dynamics and Mean Field Dependent Performance Functions
Authors:
Peter E. Caines,
Daniel W. C. HO,
Minyi Huang,
Jiamin Jian,
Qingshuo Song
Abstract:
This paper establishes unique solvability of a class of Graphon Mean Field Game equations. The special case of a constant graphon yields the result for the Mean Field Game equations.
This paper establishes unique solvability of a class of Graphon Mean Field Game equations. The special case of a constant graphon yields the result for the Mean Field Game equations.
△ Less
Submitted 10 March, 2022; v1 submitted 25 September, 2020;
originally announced September 2020.
-
High-performance Coherent Optical Modulators based on Thin-film Lithium Niobate Platform
Authors:
Mengyue Xu,
Mingbo He,
Hongguang Zhang,
Jian Jian,
Ying Pan,
Xiaoyue Liu,
Lifeng Chen,
Xiangyu Meng,
Hui Chen,
Zhaohui Li,
Xi Xiao,
Shaohua Yu,
Siyuan Yu,
Xinlun Cai
Abstract:
The coherent transmission technology using digital signal processing and advanced modulation formats, is bringing networks closer to the theoretical capacity limit of optical fibres, the Shannon limit. The in-phase quadrature electro-optic modulator that encodes information on both the amplitude and the phase of light, is one of the underpinning devices for the coherent transmission technology. Id…
▽ More
The coherent transmission technology using digital signal processing and advanced modulation formats, is bringing networks closer to the theoretical capacity limit of optical fibres, the Shannon limit. The in-phase quadrature electro-optic modulator that encodes information on both the amplitude and the phase of light, is one of the underpinning devices for the coherent transmission technology. Ideally, such modulator should feature low loss, low drive voltage, large bandwidth, low chirp and compact footprint. However, these requirements have been only met on separate occasions. Here, we demonstrate integrated thin-film lithium niobate in-phase/quadrature modulators that fulfil these requirements simultaneously. The presented devices exhibit greatly improved overall performance (half-wave voltage, bandwidth and optical loss) over traditional lithium niobate counterparts, and support modulation data rate up to 320 Gbit s-1. Our devices pave new routes for future high-speed, energy-efficient, and cost-effective communication networks.
△ Less
Submitted 28 June, 2020;
originally announced June 2020.
-
Raman Response and Transport Properties of One-Dimensional van der Waals Tellurium Nanowires
Authors:
**g-Kai Qin,
Pai-Ying Liao,
Mengwei Si,
Shiyuan Gao,
Gang Qiu,
Jie Jian,
Qingxiao Wang,
Si-Qi Zhang,
Shouyuan Huang,
Adam Charnas,
Yixiu Wang,
Moon J. Kim,
Wenzhuo Wu,
Xianfan Xu,
Hai-Yan Wang,
Li Yang,
Yoke Khin Yap,
Peide D. Ye
Abstract:
Tellurium can form nanowires of helical atomic chains. Given their unique one-dimensional van der Waals structure, these nanowires are expected to show remarkably different physical and electronic properties than bulk tellurium. Here we show that few-chain and single-chain van der Waals tellurium nanowires can be isolated using carbon nanotube and boron nitride nanotube encapsulation. With the app…
▽ More
Tellurium can form nanowires of helical atomic chains. Given their unique one-dimensional van der Waals structure, these nanowires are expected to show remarkably different physical and electronic properties than bulk tellurium. Here we show that few-chain and single-chain van der Waals tellurium nanowires can be isolated using carbon nanotube and boron nitride nanotube encapsulation. With the approach, the number of atomic chains can be controlled by the inner diameter of the nanotube. The Raman response of the structures suggests that the interaction between a single-atomic tellurium chain and a carbon nanotube is weak, and that the inter-chain interaction becomes stronger as the number of chains increases. Compared with bare tellurium nanowires on SiO2, nanowires encapsulated in boron nitride nanotubes exhibit a dramatically enhanced current-carrying capacity, with a current density of 1.5*10^8 A cm-2, which exceeds that of most semiconducting nanowires. We also use our tellurium nanowires encapsulated in boron nitride nanotubes to create field-effect transistors that have a diameter of only 2 nm.
△ Less
Submitted 15 January, 2020;
originally announced January 2020.
-
A systematic TMRT observational study of Galactic $^{12}$C/$^{13}$C ratios from Formaldehyde
Authors:
Y. T. Yan,
J. S. Zhang,
C. Henkel,
T. Mufakharov,
L. W. Jia,
X. D. Tang,
Y. J. Wu,
J. Li,
Z. A. Zeng,
Y. X. Wang,
Y. Q. Li,
J. Huang,
J. M. Jian
Abstract:
We present observations of the C-band $1_{10}-1_{11}$ (4.8 GHz) and Ku-band $2_{11}-2_{12}$ (14.5 GHz) K-doublet lines of H$_2$CO and the C-band $1_{10}-1_{11}$ (4.6 GHz) line of H$_2$$^{13}$CO toward a large sample of Galactic molecular clouds, through the Shanghai Tianma 65-m radio telescope (TMRT). Our sample with 112 sources includes strong H$_2$CO sources from the TMRT molecular line survey a…
▽ More
We present observations of the C-band $1_{10}-1_{11}$ (4.8 GHz) and Ku-band $2_{11}-2_{12}$ (14.5 GHz) K-doublet lines of H$_2$CO and the C-band $1_{10}-1_{11}$ (4.6 GHz) line of H$_2$$^{13}$CO toward a large sample of Galactic molecular clouds, through the Shanghai Tianma 65-m radio telescope (TMRT). Our sample with 112 sources includes strong H$_2$CO sources from the TMRT molecular line survey at C-band and other known H$_2$CO sources. All three lines are detected toward 38 objects (43 radial velocity components) yielding a detection rate of 34\%. Complementary observations of their continuum emission at both C- and Ku-bands were performed. Combining spectral line parameters and continuum data, we calculate the column densities, the optical depths and the isotope ratio H$_2$$^{12}$CO/H$_2$$^{13}$CO for each source. To evaluate photon trap** caused by sometimes significant opacities in the main isotopologue's rotational mm-wave lines connecting our measured K-doublets, and to obtain $^{12}$C/$^{13}$C abundance ratios, we used the RADEX non-LTE model accounting for radiative transfer effects. This implied the use of the new collision rates from \citet{Wiesenfeld2013}. Also implementing distance values from trigonometric parallax measurements for our sources, we obtain a linear fit of $^{12}$C/$^{13}$C = (5.08$\pm$1.10)D$_{GC}$ + (11.86$\pm$6.60), with a correlation coefficient of 0.58. D$_{GC}$ refers to Galactocentric distances. Our $^{12}$C/$^{13}$C ratios agree very well with the ones deduced from CN and C$^{18}$O but are lower than those previously reported on the basis of H$_2$CO, tending to suggest that the bulk of the H$_2$CO in our sources was formed on dust grain mantles and not in the gas phase.
△ Less
Submitted 8 April, 2019;
originally announced April 2019.
-
Room Temperature Electrocaloric Effect in Layered Ferroelectric CuInP2S6 for Solid State Refrigeration
Authors:
Mengwei Si,
Atanu K. Saha,
Pai-Ying Liao,
Shengjie Gao,
Sabine M. Neumayer,
Jie Jian,
**gkai Qin,
Nina Balke,
Haiyan Wang,
Petro Maksymovych,
Wenzhuo Wu,
Sumeet K. Gupta,
Peide D. Ye
Abstract:
A material with reversible temperature change capability under an external electric field, known as the electrocaloric effect (ECE), has long been considered as a promising solid-state cooling solution. However, electrocaloric (EC) performance of EC materials generally is not sufficiently high for real cooling applications. As a result, exploring EC materials with high performance is of great inte…
▽ More
A material with reversible temperature change capability under an external electric field, known as the electrocaloric effect (ECE), has long been considered as a promising solid-state cooling solution. However, electrocaloric (EC) performance of EC materials generally is not sufficiently high for real cooling applications. As a result, exploring EC materials with high performance is of great interest and importance. Here, we report on the ECE of ferroelectric materials with van der Waals layered structure (CuInP2S6 or CIPS in this work in particular). Over 60% polarization charge change is observed within a temperature change of only 10 K at Curie temperature. Large adiabatic temperature change (|ΔT|) of 3.3 K, isothermal entropy change (|ΔS|) of 5.8 J kg-1 K-1 at |ΔE|=142.0 kV cm-1 at 315 K (above and near room temperature) are achieved, with a large EC strength (|ΔT|/|ΔE|) of 29.5 mK cm kV-1. The ECE of CIPS is also investigated theoretically by numerical simulation and a further EC performance projection is provided.
△ Less
Submitted 13 September, 2019; v1 submitted 19 January, 2019;
originally announced January 2019.
-
A Ferroelectric Semiconductor Field-Effect Transistor
Authors:
Mengwei Si,
Atanu K. Saha,
Shengjie Gao,
Gang Qiu,
**gkai Qin,
Yuqin Duan,
Jie Jian,
Chang Niu,
Haiyan Wang,
Wenzhuo Wu,
Sumeet K. Gupta,
Peide D. Ye
Abstract:
Ferroelectric field-effect transistors employ a ferroelectric material as a gate insulator, the polarization state of which can be detected using the channel conductance of the device. As a result, the devices are of potential to use in non-volatile memory technology, but suffer from short retention times, which limits their wider application. Here we report a ferroelectric semiconductor field-eff…
▽ More
Ferroelectric field-effect transistors employ a ferroelectric material as a gate insulator, the polarization state of which can be detected using the channel conductance of the device. As a result, the devices are of potential to use in non-volatile memory technology, but suffer from short retention times, which limits their wider application. Here we report a ferroelectric semiconductor field-effect transistor in which a two-dimensional ferroelectric semiconductor, indium selenide (α-In2Se3), is used as the channel material in the device. α-In2Se3 was chosen due to its appropriate bandgap, room temperature ferroelectricity, ability to maintain ferroelectricity down to a few atomic layers, and potential for large-area growth. A passivation method based on the atomic-layer deposition of aluminum oxide (Al2O3) was developed to protect and enhance the performance of the transistors. With 15-nm-thick hafnium oxide (HfO2) as a scaled gate dielectric, the resulting devices offer high performance with a large memory window, a high on/off ratio of over 108, a maximum on-current of 862 μA μm-1, and a low supply voltage.
△ Less
Submitted 9 January, 2020; v1 submitted 7 December, 2018;
originally announced December 2018.
-
High-Performance Hybrid Silicon and Lithium Niobate Mach-Zehnder Modulators for 100 Gbit/s and Beyond
Authors:
Mingbo He,
Mengyue Xu,
Yuxuan Ren,
Jian Jian,
Ziliang Ruan,
Yongsheng Xu,
Shengqian Gao,
Shihao Sun,
Xueqin Wen,
Lidan Zhou,
Lin Liu,
Changjian Guo,
Hui Chen,
Siyuan Yu,
Liu Liu,
Xinlun Cai
Abstract:
Optical modulators are at the heart of optical communication links. Ideally, they should feature low insertion loss, low drive voltage, large modulation bandwidth, high linearity, compact footprint and low manufacturing cost. Unfortunately, these criteria have only been achieved on separate occasions.Based on a Silicon and Lithium Niobate hybrid integration platform, we demonstrate Mach-Zehnder mo…
▽ More
Optical modulators are at the heart of optical communication links. Ideally, they should feature low insertion loss, low drive voltage, large modulation bandwidth, high linearity, compact footprint and low manufacturing cost. Unfortunately, these criteria have only been achieved on separate occasions.Based on a Silicon and Lithium Niobate hybrid integration platform, we demonstrate Mach-Zehnder modulators that simultaneously fulfill these criteria. The presented device exhibits an insertion loss of 2.5 dB, voltage-length product of 2.2 Vcm, high linearity, electro-optic bandwidth of at least 70 GHz and modulation rates up to 112 Gbit/s. The high-performance modulator is realized by seamless integration of high-contrast waveguide based on Lithium Niobate - the most mature modulator material - with compact, low-loss silicon circuits. The hybrid platform demonstrated here allows for the combination of 'best-in-breed' active and passive components, opening up new avenues for enabling future high-speed, energy efficient and cost-effective optical communication networks.
△ Less
Submitted 2 November, 2018; v1 submitted 7 July, 2018;
originally announced July 2018.
-
Controlled Growth of a Large-Size 2D Selenium Nanosheet and Its Electronic and Optoelectronic Applications
Authors:
**gkai Qin,
Gang Qiu,
Jie Jian,
Hong Zhou,
Lingming Yang,
Adam Charnas,
Dmitry Y Zemlyanov,
Cheng-Yan Xu,
Xianfan Xu,
Wenzhuo Wu,
Haiyan Wang,
Peide D Ye
Abstract:
Selenium has attracted intensive attention as a promising material candidate for future optoelectronic applications. However, selenium has a strong tendency to grow into nanowire forms due to its anisotropic atomic structure, which has largely hindered the exploration of its potential applications. In this work, using a physical vapor deposition method, we have demonstrated the synthesis of large-…
▽ More
Selenium has attracted intensive attention as a promising material candidate for future optoelectronic applications. However, selenium has a strong tendency to grow into nanowire forms due to its anisotropic atomic structure, which has largely hindered the exploration of its potential applications. In this work, using a physical vapor deposition method, we have demonstrated the synthesis of large-size, high-quality 2D selenium nanosheets, the minimum thickness of which could be as thin as 5 nm. The Se nanosheet exhibits a strong in-plane anisotropic property, which is determined by angle-resolved Raman spectroscopy. Back-gating field-effect transistors based on a Se nanosheet exhibit p-type transport behaviors with on-state current density around 20 mA/mm at Vds = 3 V. Four-terminal field effect devices are also fabricated to evaluate the intrinsic hole mobility of the selenium nanosheet, and the value is determined to be 0.26 cm2 Vs at 300 K. The selenium nanosheet phototransistors show an excellent photoresponsivity of up to 263 A/W, with a rise time of 0.1 s and fall time of 0.12 s. These results suggest that crystal selenium as a 2D form of a 1D van der Waals solid opens up the possibility to explore device applications.
△ Less
Submitted 2 November, 2017;
originally announced November 2017.
-
A First Transients Survey with JWST: the FLARE project
Authors:
Lifan Wang,
D. Baade,
E. Baron,
S. Bernard,
V. Bromm,
P. Brown,
G. Clayton,
J. Cooke,
D. Croton,
C. Curtin,
M. Drout,
M. Doi,
I. Dominguez,
S. Finkelstein,
A. Gal-Yam,
P. Geil,
A. Heger,
P. Hoeflich,
J. Jian,
K. Krisciunas,
A. Koekemoer,
R. Lunnan,
K. Maeda,
J. Maund,
M. Modjaz
, et al. (21 additional authors not shown)
Abstract:
JWST was conceived and built to answer one of the most fundamental questions that humans can address empirically: "How did the Universe make its first stars?". Our First Lights At REionization (FLARE) project transforms the quest for the epoch of reionization from the static to the time domain. It targets the complementary question: "What happened to those first stars?". It will be answered by obs…
▽ More
JWST was conceived and built to answer one of the most fundamental questions that humans can address empirically: "How did the Universe make its first stars?". Our First Lights At REionization (FLARE) project transforms the quest for the epoch of reionization from the static to the time domain. It targets the complementary question: "What happened to those first stars?". It will be answered by observations of the most luminous events: supernovae and accretion on to black holes formed by direct collapse from the primordial gas clouds. These transients provide direct constraints on star-formation rates (SFRs) and the truly initial Initial Mass Function (IMF), and they may identify possible stellar seeds of supermassive black holes (SMBHs). Furthermore, our knowledge of the physics of these events at ultra-low metallicity will be much expanded. JWST's unique capabilities will detect these most luminous and earliest cosmic messengers easily in fairly shallow observations. However, these events are very rare at the dawn of cosmic structure formation and so require large area coverage. Time domain astronomy can be advanced to an unprecedented depth by means of a shallow field of JWST reaching 27 mag AB in 2 and 4.4 microns over a field as large as 0.1 square degree visited multiple times each year. Such a survey may set strong constraints or detect massive Pop III SNe at redshifts beyond 10, pinpointing the redshift of the first stars, or at least their death. Based on our current knowledge of superluminous supernovae (SLSNe), such a survey will find one or more SLSNe at redshifts above 6 in five years and possibly several direct collapse black holes. Although JWST is not designed as a wide field survey telescope, we show that such a wide field survey is possible with JWST and is critical in addressing several of its key scientific goals.
△ Less
Submitted 26 November, 2017; v1 submitted 19 October, 2017;
originally announced October 2017.
-
Solution to dynamic economic dispatch with prohibited operating zones via MILP
Authors:
Shanshan Pan,
**bao Jian,
Linfeng Yang
Abstract:
Dynamic economic dispatch (DED) problem considering prohibited operating zones (POZ), ramp rate constraints, transmission losses and spinning reserve constraints is a complicated non-linear problem which is difficult to solve efficiently. In this paper, a mixed integer linear programming (MILP) method is proposed to solve such a DED problem. Firstly, a novel MILP formulation for DED problem withou…
▽ More
Dynamic economic dispatch (DED) problem considering prohibited operating zones (POZ), ramp rate constraints, transmission losses and spinning reserve constraints is a complicated non-linear problem which is difficult to solve efficiently. In this paper, a mixed integer linear programming (MILP) method is proposed to solve such a DED problem. Firstly, a novel MILP formulation for DED problem without considering the transmission losses, denoted by MILP-1, is presented by using perspective cut reformulation technique. When the transmission losses are considered, the quadratic terms in the transmission losses are replaced by their first order Taylor expansions, and then an MILP formulation for DED considering the transmission losses, denoted by MILP-2, is obtained. Based on MILP-1 and MILP-2, an MILP-iteration algorithm (MILP-IA) is proposed to solve the complicated DED problem. The effectiveness of the MILP-1 and MILP-IA are assessed by several cases and the simulation results show that both of them can solve to competitive solutions in a short time.
△ Less
Submitted 6 April, 2017;
originally announced April 2017.
-
A Hybrid MILP and IPM for Dynamic Economic Dispatch with Valve Point Effect
Authors:
Shanshan Pan,
**bao Jian,
Linfeng Yang
Abstract:
Dynamic economic dispatch with valve-point effect (DED-VPE) is a non-convex and non-differentiable optimization problem which is difficult to solve efficiently. In this paper, a hybrid mixed integer linear programming (MILP) and interior point method (IPM), denoted by MILP-IPM, is proposed to solve such a DED-VPE problem, where the complicated transmission loss is also included. Due to the non-dif…
▽ More
Dynamic economic dispatch with valve-point effect (DED-VPE) is a non-convex and non-differentiable optimization problem which is difficult to solve efficiently. In this paper, a hybrid mixed integer linear programming (MILP) and interior point method (IPM), denoted by MILP-IPM, is proposed to solve such a DED-VPE problem, where the complicated transmission loss is also included. Due to the non-differentiable characteristic of DED-VPE, the classical derivative-based optimization methods can not be used any more. With the help of model reformulation, a differentiable non-linear programming (NLP) formulation which can be directly solved by IPM is derived. However, if the DED-VPE is solved by IPM in a single step, the optimization will easily trap in a poor local optima due to its non-convex and multiple local minima characteristics. To exploit a better solution, an MILP method is required to solve the DED-VPE without transmission loss, yielding a good initial point for IPM to improve the quality of the solution. Simulation results demonstrate the validity and effectiveness of the proposed MILP-IPM in solving DED-VPE.
△ Less
Submitted 10 March, 2017;
originally announced March 2017.
-
A Mixed Integer Linear Programming Method for Dynamic Economic Dispatch with Valve Point Effect
Authors:
Shanshan Pan,
**bao Jian,
Linfeng Yang
Abstract:
In this paper, a mixed integer linear programming (MILP) formulation is proposed to solve the dynamic economic dispatch with valve-point effect (DED-VPE). Based on piecewise linearization technique, the non-convex and non-smooth generation cost is reformulated into a linear lower approximation which is better than the quadratic one, yielding an MILP formulation for the DED-VPE. When the segment pa…
▽ More
In this paper, a mixed integer linear programming (MILP) formulation is proposed to solve the dynamic economic dispatch with valve-point effect (DED-VPE). Based on piecewise linearization technique, the non-convex and non-smooth generation cost is reformulated into a linear lower approximation which is better than the quadratic one, yielding an MILP formulation for the DED-VPE. When the segment parameter is set appropriately, the MILP formulation can be solved by a mixed integer programming (MIP) solver directly and efficiently. Thus, a global optimal solution within a preset tolerance can be guaranteed for the MILP formulation. Simulation results show that the proposed MILP formulation can be solved to reliable solutions in reasonable time.
△ Less
Submitted 16 February, 2017;
originally announced February 2017.
-
A Novel Projected Two Binary Variables Formulation for Unit Commitment Problem
Authors:
Linfeng Yang,
Chen Zhang,
**bao Jian,
Ke Meng,
Zhaoyang Dong
Abstract:
The thermal unit commitment (UC) problem often can be formulated as a mixed integer quadratic programming (MIQP), which is difficult to solve efficiently, especially for large-scale instances. In this paper, with projecting unit generation level onto [0,1] and reformulation techniques, a novel two binary (2-bin) variables MIQP formulation for UC problem is presented. We show that 2-bin formulation…
▽ More
The thermal unit commitment (UC) problem often can be formulated as a mixed integer quadratic programming (MIQP), which is difficult to solve efficiently, especially for large-scale instances. In this paper, with projecting unit generation level onto [0,1] and reformulation techniques, a novel two binary (2-bin) variables MIQP formulation for UC problem is presented. We show that 2-bin formulation is more compact than the state-of-the-art one binary (1-bin) variable formulation and three binary (3-bin) variables formulation. Moreover, 2-bin formulation is tighter than 1-bin and 3-bin formulations in quadratic cost function, and it is tighter than 1-bin formulation in linear constraints. Three mixed integer linear programming (MILP) formulations can be obtained from three UC MIQPs by replacing the quadratic terms in the objective functions by a sequence of piece-wise perspective-cuts. 2-bin MILP is also the best one due to the similar reasons of MIQP. The simulation results for realistic instances that range in size from 10 to 200 units over a scheduling period of 24 hours show that the proposed 2-bin formulations are competitive with currently state-of-the-art formulations and promising for large-scale UC problems.
△ Less
Submitted 7 February, 2017; v1 submitted 17 June, 2016;
originally announced June 2016.
-
A New Superlinearly Convergent Algorithm of Combining QP Subproblem with System of Linear Equations for Nonlinear Optimization
Authors:
**-Bao Jian,
Chuan-Hao Guo,
Chun-Ming Tang,
Yan-Qin Bai
Abstract:
In this paper, a class of optimization problems with nonlinear inequality constraints is discussed. Based on the ideas of sequential quadratic programming algorithm and the method of strongly sub-feasible directions, a new superlinearly convergent algorithm is proposed. The initial iteration point can be chosen arbitrarily for the algorithm. At each iteration, the new algorithm solves one quadrati…
▽ More
In this paper, a class of optimization problems with nonlinear inequality constraints is discussed. Based on the ideas of sequential quadratic programming algorithm and the method of strongly sub-feasible directions, a new superlinearly convergent algorithm is proposed. The initial iteration point can be chosen arbitrarily for the algorithm. At each iteration, the new algorithm solves one quadratic programming subproblem which is always feasible, and one or two systems of linear equations with a common coefficient matrix. Moreover, the coefficient matrix is uniformly nonsingular. After finite iterations, the iteration points can always enter into the feasible set of the problem, and the search direction is obtained by solving one quadratic programming subproblem and only one system of linear equations. The new algorithm possesses global and superlinear convergence under some suitable assumptions without the strict complementarity. Finally, some preliminary numerical experiments are reported to show that the algorithm is promising.
△ Less
Submitted 27 June, 2012;
originally announced June 2012.
-
An Improved Sequential Quadratic Programming Algorithm for Solving General Nonlinear Programming Problems
Authors:
Chuan-Hao Guo,
Yan-Qin Bai,
**-Bao Jian
Abstract:
In this paper, a class of general nonlinear programming problems with inequality and equality constraints is discussed. Firstly, the original problem is transformed into an associated simpler equivalent problem with only inequality constraints. Then, inspired by the ideals of sequential quadratic programming (SQP) method and the method of system of linear equations (SLE), a new type of SQP algorit…
▽ More
In this paper, a class of general nonlinear programming problems with inequality and equality constraints is discussed. Firstly, the original problem is transformed into an associated simpler equivalent problem with only inequality constraints. Then, inspired by the ideals of sequential quadratic programming (SQP) method and the method of system of linear equations (SLE), a new type of SQP algorithm for solving the original problem is proposed. At each iteration, the search direction is generated by the combination of two directions, which are obtained by solving an always feasible quadratic programming (QP) subproblem and a SLE, respectively. Moreover, in order to overcome the Maratos effect, the higher-order correction direction is obtained by solving another SLE. The two SLEs have the same coefficient matrices, and we only need to solve the one of them after a finite number of iterations. By a new line search technique, the proposed algorithm possesses global and superlinear convergence under some suitable assumptions without the strict complementarity. Finally, some comparative numerical results are reported to show that the proposed algorithm is effective and promising.
△ Less
Submitted 18 December, 2012; v1 submitted 14 June, 2012;
originally announced June 2012.
-
Sublimation of the Martian CO2 Seasonal South Polar Cap
Authors:
Frederic Schmidt,
Bernard Schmitt,
Sylvain Doute,
Francois Forget,
Jeng-Jong Jian,
Patrick Martin,
Yves Langevin,
Jean-Pierre Bibring,
the OMEGA Team
Abstract:
The polar condensation/sublimation of CO2, that involve about one fourth of the atmosphere mass, is the major Martian climatic cycle. Early observations in visible and thermal infrared have shown that the sublimation of the Seasonal South Polar Cap (SSPC) is not symmetric around the geographic South Pole. Here we use observations by OMEGA/Mars Express in the near-infrared to detect unambiguously t…
▽ More
The polar condensation/sublimation of CO2, that involve about one fourth of the atmosphere mass, is the major Martian climatic cycle. Early observations in visible and thermal infrared have shown that the sublimation of the Seasonal South Polar Cap (SSPC) is not symmetric around the geographic South Pole. Here we use observations by OMEGA/Mars Express in the near-infrared to detect unambiguously the presence of CO2 at the surface, and to estimate albedo. Second, we estimate the sublimation of CO2 released in the atmosphere and show that there is a two-step process. From Ls=180° to 220°, the sublimation is nearly symmetric with a slight advantage for the cryptic region. After Ls=220° the anti-cryptic region sublimation is stronger. Those two phases are not balanced such that there is 22% +/- 9 more mass the anti-cryptic region, arguing for more snow precipitation. We compare those results with the MOLA height measurements. Finally we discuss implications for the Martian atmosphere about general circulation and gas tracers, e.g. Ar.
△ Less
Submitted 22 July, 2010; v1 submitted 23 March, 2010;
originally announced March 2010.