-
Augmented Quaternion and Augmented Unit Quaternion Optimization
Authors:
Liqun Qi,
Xiangke Wang,
Chunfeng Cui
Abstract:
In this paper, we introduce and explore augmented quaternions and augmented unit quaternions, and present an augmented unit quaternion optimization model. An augmented quaternion consist of a quaternion and a translation vector. The multiplication rule of augmented quaternion is defined. An augmented unit quaternion consists of a unit quaternion and a translation vector. The augmented unit quatern…
▽ More
In this paper, we introduce and explore augmented quaternions and augmented unit quaternions, and present an augmented unit quaternion optimization model. An augmented quaternion consist of a quaternion and a translation vector. The multiplication rule of augmented quaternion is defined. An augmented unit quaternion consists of a unit quaternion and a translation vector. The augmented unit quaternions form a Lie group. By means of augmented unit quaternions, we study the error model and kinematics. Then we formulate two classical problems in robot research, i.e., the hand-eye calibration problem and the simultaneous localization and map** (SLAM) problem as augmented unit quaternion optimization problems, which are actually real smooth spherical equality constrained optimization problems. Comparing with the corresponding unit dual quaternion optimization model, the augmented unit quaternion optimization model has less variables and removes the orthogonality constraints.
△ Less
Submitted 27 February, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
eVAE: Evolutionary Variational Autoencoder
Authors:
Zhangkai Wu,
Longbing Cao,
Lei Qi
Abstract:
The surrogate loss of variational autoencoders (VAEs) poses various challenges to their training, inducing the imbalance between task fitting and representation inference. To avert this, the existing strategies for VAEs focus on adjusting the tradeoff by introducing hyperparameters, deriving a tighter bound under some mild assumptions, or decomposing the loss components per certain neural settings…
▽ More
The surrogate loss of variational autoencoders (VAEs) poses various challenges to their training, inducing the imbalance between task fitting and representation inference. To avert this, the existing strategies for VAEs focus on adjusting the tradeoff by introducing hyperparameters, deriving a tighter bound under some mild assumptions, or decomposing the loss components per certain neural settings. VAEs still suffer from uncertain tradeoff learning.We propose a novel evolutionary variational autoencoder (eVAE) building on the variational information bottleneck (VIB) theory and integrative evolutionary neural learning. eVAE integrates a variational genetic algorithm into VAE with variational evolutionary operators including variational mutation, crossover, and evolution. Its inner-outer-joint training mechanism synergistically and dynamically generates and updates the uncertain tradeoff learning in the evidence lower bound (ELBO) without additional constraints. Apart from learning a lossy compression and representation of data under the VIB assumption, eVAE presents an evolutionary paradigm to tune critical factors of VAEs and deep neural networks and addresses the premature convergence and random search problem by integrating evolutionary optimization into deep learning. Experiments show that eVAE addresses the KL-vanishing problem for text generation with low reconstruction loss, generates all disentangled factors with sharp images, and improves the image generation quality,respectively. eVAE achieves better reconstruction loss, disentanglement, and generation-inference balance than its competitors.
△ Less
Submitted 1 January, 2023;
originally announced January 2023.
-
Linear analysis and crossphase dynamics in the $\nabla T_e$-driven CTEM fluid model
Authors:
M. Leconte,
Lei Qi,
J. Anderson
Abstract:
Collisionless trapped-electron mode (CTEM) turbulence is an important contributor to heat and particle transport in fusion devices. The ITG/TEM fluid models are rarely treated analytically, due to the large number of transport channels involved, e.g. particle and ion/electron heat transport. The $\nabla T_e$-driven CTEM fluid model [Anderson et al, Plasma Phys. Control. Fusion 48, 651 (2006)] prov…
▽ More
Collisionless trapped-electron mode (CTEM) turbulence is an important contributor to heat and particle transport in fusion devices. The ITG/TEM fluid models are rarely treated analytically, due to the large number of transport channels involved, e.g. particle and ion/electron heat transport. The $\nabla T_e$-driven CTEM fluid model [Anderson et al, Plasma Phys. Control. Fusion 48, 651 (2006)] provides a simplified model, in the regime where the density gradient drive is negligeable compared to the electron temperature gradient drive ($\nabla T_e$). This provides an interesting model to study mechanisms associated to linear waves, such as crossphase dynamics, and its possible role in the formation of $E\times B$ staircase. Here, the $\nabla T_e$-driven CTEM fluid model is rigourously derived from the more general ITG/TEM model, and its linear dynamics is first analyzed and compared with CTEM gyrokinetic simulations with bounce-averaged kinetic electrons, while nonlinear analysis is left for future work. Comparisons of linear ITG spectrum are also made with other analytical models.
△ Less
Submitted 29 December, 2022;
originally announced December 2022.
-
Motion, Unit Dual Quaternion and Motion Optimization
Authors:
Liqun Qi
Abstract:
We introduce motions as real six-dimensional vectors. A motion means a rotation and a translation. We define a motion operator which maps unit dual quaternions to motions, and a UDQ operator which maps motions to unit dual quaternions. By these operators, we present the formulation of motion optimization, which is actually a real unconstrained optimization formulation. Then we formulate two classi…
▽ More
We introduce motions as real six-dimensional vectors. A motion means a rotation and a translation. We define a motion operator which maps unit dual quaternions to motions, and a UDQ operator which maps motions to unit dual quaternions. By these operators, we present the formulation of motion optimization, which is actually a real unconstrained optimization formulation. Then we formulate two classical problems in robot research, i.e., the hand-eye calibration problem and the simultaneous localization and map** (SLAM) problem as motion optimization problems. This opens a new way to solve these problems via real unconstrained optimization.
△ Less
Submitted 26 December, 2022; v1 submitted 22 December, 2022;
originally announced December 2022.
-
Adiabatically controlled motional states of a ground-state cooled CaO$^{+}$ and Ca$^{+}$ trapped ion chain
Authors:
Lu Qi,
Evan C. Reed,
Kenneth R. Brown
Abstract:
Control of the external degree of freedom of trapped molecular ions is a prerequisite for their promising applications to spectroscopy, precision measurements of fundamental constants, and quantum information technology. Here, we demonstrate near ground-state cooling of the axial motional modes of a calcium mono-oxide ion via sympathetic sideband cooling with a co-trapped calcium ion. We also show…
▽ More
Control of the external degree of freedom of trapped molecular ions is a prerequisite for their promising applications to spectroscopy, precision measurements of fundamental constants, and quantum information technology. Here, we demonstrate near ground-state cooling of the axial motional modes of a calcium mono-oxide ion via sympathetic sideband cooling with a co-trapped calcium ion. We also show that the phonon state of the axial out-of-phase mode of the ion chain is maintained while the mode frequency is adiabatically ramped up and down. The adiabatic ram** of the motional mode frequency is a prerequisite for searching for the proposed molecular dipole-phonon interaction.
△ Less
Submitted 9 December, 2022;
originally announced December 2022.
-
TSGP: Two-Stage Generative Prompting for Unsupervised Commonsense Question Answering
Authors:
Yueqing Sun,
Yu Zhang,
Le Qi,
Qi Shi
Abstract:
Unsupervised commonsense question answering requires mining effective commonsense knowledge without the rely on the labeled task data. Previous methods typically retrieved from traditional knowledge bases or used pre-trained language models (PrLMs) to generate fixed types of knowledge, which have poor generalization ability. In this paper, we aim to address the above limitation by leveraging the i…
▽ More
Unsupervised commonsense question answering requires mining effective commonsense knowledge without the rely on the labeled task data. Previous methods typically retrieved from traditional knowledge bases or used pre-trained language models (PrLMs) to generate fixed types of knowledge, which have poor generalization ability. In this paper, we aim to address the above limitation by leveraging the implicit knowledge stored in PrLMs and propose a two-stage prompt-based unsupervised commonsense question answering framework (TSGP). Specifically, we first use knowledge generation prompts to generate the knowledge required for questions with unlimited types and possible candidate answers independent of specified choices. Then, we further utilize answer generation prompts to generate possible candidate answers independent of specified choices. Experimental results and analysis on three different commonsense reasoning tasks, CommonsenseQA, OpenBookQA, and SocialIQA, demonstrate that TSGP significantly improves the reasoning ability of language models in unsupervised settings. Our code is available at: https://github.com/Yueqing-Sun/TSGP.
△ Less
Submitted 24 November, 2022;
originally announced November 2022.
-
High-Quality Entity Segmentation
Authors:
Lu Qi,
Jason Kuen,
Weidong Guo,
Tiancheng Shen,
Jiuxiang Gu,
Jiaya Jia,
Zhe Lin,
Ming-Hsuan Yang
Abstract:
Dense image segmentation tasks e.g., semantic, panoptic) are useful for image editing, but existing methods can hardly generalize well in an in-the-wild setting where there are unrestricted image domains, classes, and image resolution and quality variations. Motivated by these observations, we construct a new entity segmentation dataset, with a strong focus on high-quality dense segmentation in th…
▽ More
Dense image segmentation tasks e.g., semantic, panoptic) are useful for image editing, but existing methods can hardly generalize well in an in-the-wild setting where there are unrestricted image domains, classes, and image resolution and quality variations. Motivated by these observations, we construct a new entity segmentation dataset, with a strong focus on high-quality dense segmentation in the wild. The dataset contains images spanning diverse image domains and entities, along with plentiful high-resolution images and high-quality mask annotations for training and testing. Given the high-quality and -resolution nature of the dataset, we propose CropFormer which is designed to tackle the intractability of instance-level segmentation on high-resolution images. It improves mask prediction by fusing high-res image crops that provide more fine-grained image details and the full image. CropFormer is the first query-based Transformer architecture that can effectively fuse mask predictions from multiple image views, by learning queries that effectively associate the same entities across the full image and its crop. With CropFormer, we achieve a significant AP gain of $1.9$ on the challenging entity segmentation task. Furthermore, CropFormer consistently improves the accuracy of traditional segmentation tasks and datasets. The dataset and code will be released at http://luqi.info/entityv2.github.io/.
△ Less
Submitted 2 April, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
PalGAN: Image Colorization with Palette Generative Adversarial Networks
Authors:
Yi Wang,
Menghan Xia,
Lu Qi,
**g Shao,
Yu Qiao
Abstract:
Multimodal ambiguity and color bleeding remain challenging in colorization. To tackle these problems, we propose a new GAN-based colorization approach PalGAN, integrated with palette estimation and chromatic attention. To circumvent the multimodality issue, we present a new colorization formulation that estimates a probabilistic palette from the input gray image first, then conducts color assignme…
▽ More
Multimodal ambiguity and color bleeding remain challenging in colorization. To tackle these problems, we propose a new GAN-based colorization approach PalGAN, integrated with palette estimation and chromatic attention. To circumvent the multimodality issue, we present a new colorization formulation that estimates a probabilistic palette from the input gray image first, then conducts color assignment conditioned on the palette through a generative model. Further, we handle color bleeding with chromatic attention. It studies color affinities by considering both semantic and intensity correlation. In extensive experiments, PalGAN outperforms state-of-the-arts in quantitative evaluation and visual comparison, delivering notable diverse, contrastive, and edge-preserving appearances. With the palette design, our method enables color transfer between images even with irrelevant contexts.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
CATCH: Chasing All Transients Constellation Hunters Space Mission
Authors:
Pan** Li,
Qian-Qing Yin,
Zhengwei Li,
Lian Tao,
Xiangyang Wen,
Shuang-Nan Zhang,
Liqiang Qi,
Juan Zhang,
Donghua Zhao,
Dalin Li,
Xizheng Yu,
Qingcui Bu,
Wen Chen,
Yupeng Chen,
Yiming Huang,
Yue Huang,
Ge **,
Gang Li,
Hongbang Liu,
Xiao**g Liu,
Ruican Ma,
Wenxi Peng,
Rui**g Tang,
Yusa Wang,
**gyu Xiao
, et al. (12 additional authors not shown)
Abstract:
In time-domain astronomy, a substantial number of transients will be discovered by multi-wavelength and multi-messenger observatories, posing a great challenge for follow-up capabilities. We have thus proposed an intelligent X-ray constellation, the Chasing All Transients Constellation Hunters (CATCH) space mission. Consisting of 126 micro-satellites in three types, CATCH will have the capability…
▽ More
In time-domain astronomy, a substantial number of transients will be discovered by multi-wavelength and multi-messenger observatories, posing a great challenge for follow-up capabilities. We have thus proposed an intelligent X-ray constellation, the Chasing All Transients Constellation Hunters (CATCH) space mission. Consisting of 126 micro-satellites in three types, CATCH will have the capability to perform follow-up observations for a large number of different types of transients simultaneously. Each satellite in the constellation will carry lightweight X-ray optics and use a deployable mast to increase the focal length. The combination of different optics and detector systems enables different types of satellites to have multiform observation capabilities, including timing, spectroscopy, imaging, and polarization. Controlled by the intelligent system, different satellites can cooperate to perform uninterrupted monitoring, all-sky follow-up observations, and scanning observations with a flexible field of view (FOV) and multi-dimensional observations. Therefore, CATCH will be a powerful mission to study the dynamic universe. Here, we present the current design of the spacecraft, optics, detector system, constellation configuration and observing modes, as well as the development plan.
△ Less
Submitted 16 November, 2022; v1 submitted 12 October, 2022;
originally announced October 2022.
-
Prototype Design and Efficiency Analysis of a Novel Robot Drive Based on 3K-H-V Topology
Authors:
Le Qi,
Dapeng Yang,
Baoshi Cao,
Zhiqi Li,
Yikun Gu,
Zongwu Xie,
Hong Liu
Abstract:
Robot actuators directly affect the performance of robots, and robot drives directly affect the performance of robot actuators. With the development of robotics, robots have put higher requirements on robot drives, such as high stiffness, high accuracy, high loading, high efficiency, low backlash, compact size, and hollow structure. In order to meet the demand development of robot actuators, this…
▽ More
Robot actuators directly affect the performance of robots, and robot drives directly affect the performance of robot actuators. With the development of robotics, robots have put higher requirements on robot drives, such as high stiffness, high accuracy, high loading, high efficiency, low backlash, compact size, and hollow structure. In order to meet the demand development of robot actuators, this research base proposes a new robot drive based on 3K-H-V topology using involute and cycloidal gear shapes, planetary cycloidal drive, from the perspective of drive topology and through the design idea of decoupling. In this study, the reduction ratio and the efficiency model of the 3K-H-V topology were analyzed, and a prototype planetary cycloidal actuator was designed. The feasibility of the drive is initially verified by experimentally concluding that the PCA has a hollow structure, compact size, and high torque density (69 kg/Nm).
△ Less
Submitted 5 October, 2022;
originally announced October 2022.
-
Systematic calculations of cluster radioactivity half-lives in trans-lead nuclei
Authors:
Lin**g Qi,
DongMeng Zhang,
Song Luo,
XiaoHua Li,
XiJun Wu,
ChunTian Liang
Abstract:
In the present work, based on Wentzel-Kramers-Brillouin (WKB) theory, considering the cluster preformation probability (Pc), we systematically investigate the cluster radioactivity half-lives of 22 trans-lead nuclei ranging from 221Fr to 242Cm. As for Pc, when the mass number of the emitted cluster Ac<28, it is obtained by the exponential relationship of Pc to the alpha decay preformation probabil…
▽ More
In the present work, based on Wentzel-Kramers-Brillouin (WKB) theory, considering the cluster preformation probability (Pc), we systematically investigate the cluster radioactivity half-lives of 22 trans-lead nuclei ranging from 221Fr to 242Cm. As for Pc, when the mass number of the emitted cluster Ac<28, it is obtained by the exponential relationship of Pc to the alpha decay preformation probability Pa proposed by R.Blendowskeis et al. [Phys. Rev. Lett. 61, 1930 (1988)], while Pa is calculated through cluster-formation model (CFM). Whereas Ac >= 28, it is achieved through the charge-number dependence of Pc on the decay products proposed by Ren et al. [Phys. Rev. C 70, 034304 (2004)]. The half-lives of cluster radioactivity have been calculated by the density-dependent cluster model [Phys. Rev. C 70, 034304 (2004)] and by the unified formula of half-lives for alpha decay and cluster radioactivity [Phys. Rev. C 78, 044310 (2008)]. For comparison, a universal decay law (UDL) proposed by Qi et al. [Phys. Rev. C 80, 044326 (2009)], a semi-empirical model for both alpha decay and cluster radioactivity proposed by Santhosh [J. Phys. G: Nucl. Part. Phys. 35, 085102 (2008)] and a unified formula of half-lives for alpha decay and cluster radioactivity [Phys. Rev. C 78, 044310 (2008)] are also used. The calculated results in our work, Ni's formula as well as UDL can well reproduce the experimental data and are better than those in Santhosh's model. In addition, we extend this model to predict the half-lives for 51 nuclei, whose cluster radioactivity is energetically allowed or observed but yet not quantified in NUBASE2020.
△ Less
Submitted 4 October, 2022;
originally announced October 2022.
-
Intrinsically Motivated Reinforcement Learning based Recommendation with Counterfactual Data Augmentation
Authors:
Xiaocong Chen,
Siyu Wang,
Lina Yao,
Lianyong Qi,
Yong Li
Abstract:
Deep reinforcement learning (DRL) has been proven its efficiency in capturing users' dynamic interests in recent literature. However, training a DRL agent is challenging, because of the sparse environment in recommender systems (RS), DRL agents could spend times either exploring informative user-item interaction trajectories or using existing trajectories for policy learning. It is also known as t…
▽ More
Deep reinforcement learning (DRL) has been proven its efficiency in capturing users' dynamic interests in recent literature. However, training a DRL agent is challenging, because of the sparse environment in recommender systems (RS), DRL agents could spend times either exploring informative user-item interaction trajectories or using existing trajectories for policy learning. It is also known as the exploration and exploitation trade-off which affects the recommendation performance significantly when the environment is sparse. It is more challenging to balance the exploration and exploitation in DRL RS where RS agent need to deeply explore the informative trajectories and exploit them efficiently in the context of recommender systems. As a step to address this issue, We design a novel intrinsically ,otivated reinforcement learning method to increase the capability of exploring informative interaction trajectories in the sparse environment, which are further enriched via a counterfactual augmentation strategy for more efficient exploitation. The extensive experiments on six offline datasets and three online simulation platforms demonstrate the superiority of our model to a set of existing state-of-the-art methods.
△ Less
Submitted 16 September, 2022;
originally announced September 2022.
-
A regularization-patching dual quaternion optimization method for solving the hand-eye calibration problem
Authors:
Zhongming Chen,
Chen Ling,
Liqun Qi,
Hong Yan
Abstract:
The hand-eye calibration problem is an important application problem in robot research. Based on the 2-norm of dual quaternion vectors, we propose a new dual quaternion optimization method for the hand-eye calibration problem. The dual quaternion optimization problem is decomposed to two quaternion optimization subproblems. The first quaternion optimization subproblem governs the rotation of the r…
▽ More
The hand-eye calibration problem is an important application problem in robot research. Based on the 2-norm of dual quaternion vectors, we propose a new dual quaternion optimization method for the hand-eye calibration problem. The dual quaternion optimization problem is decomposed to two quaternion optimization subproblems. The first quaternion optimization subproblem governs the rotation of the robot hand. It can be solved efficiently by the eigenvalue decomposition or singular value decomposition. If the optimal value of the first quaternion optimization subproblem is zero, then the system is rotationwise noiseless, i.e., there exists a ``perfect'' robot hand motion which meets all the testing poses rotationwise exactly. In this case, we apply the regularization technique for solving the second subproblem to minimize the distance of the translation. Otherwise we apply the patching technique to solve the second quaternion optimization subproblem. Then solving the second quaternion optimization subproblem turns out to be solving a quadratically constrained quadratic program. In this way, we give a complete description for the solution set of hand-eye calibration problems. This is new in the hand-eye calibration literature. The numerical results are also presented to show the efficiency of the proposed method.
△ Less
Submitted 16 September, 2022;
originally announced September 2022.
-
EV Charging Station Wholesale Market Participation: A Strategic Bidding and Pricing Approach
Authors:
Mohammad Mousavi,
Li "Lisa" Qi,
Alexander Brissette,
Meng Wu
Abstract:
This paper presents a framework for simultaneous bidding and pricing strategy for wholesale market participation of electric vehicle (EV) charging stations aggregator. The proposed framework incorporates the EV charging stations' technical constraints as well as EV owners' preferences. A bi-level optimization is adopted to model the problem. In the upper level, the total profit of the EV charging…
▽ More
This paper presents a framework for simultaneous bidding and pricing strategy for wholesale market participation of electric vehicle (EV) charging stations aggregator. The proposed framework incorporates the EV charging stations' technical constraints as well as EV owners' preferences. A bi-level optimization is adopted to model the problem. In the upper level, the total profit of the EV charging station aggregator is maximized. In the lower-level problem, the EV owner's utility function is maximized. The EV owners' preferences are modeled using the quadratic utility function. The bi-level optimization problem which is non-convex and hard to solve is converted to a mixed-integer convex quadratic programming model by writing the optimal conditions of the lower-level problem that is solvable with commercial solvers. The effectiveness of the proposed framework is investigated by implementing simulation results.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation
Authors:
Lihe Yang,
Lei Qi,
Litong Feng,
Wayne Zhang,
Yinghuan Shi
Abstract:
In this work, we revisit the weak-to-strong consistency framework, popularized by FixMatch from semi-supervised classification, where the prediction of a weakly perturbed image serves as supervision for its strongly perturbed version. Intriguingly, we observe that such a simple pipeline already achieves competitive results against recent advanced works, when transferred to our segmentation scenari…
▽ More
In this work, we revisit the weak-to-strong consistency framework, popularized by FixMatch from semi-supervised classification, where the prediction of a weakly perturbed image serves as supervision for its strongly perturbed version. Intriguingly, we observe that such a simple pipeline already achieves competitive results against recent advanced works, when transferred to our segmentation scenario. Its success heavily relies on the manual design of strong data augmentations, however, which may be limited and inadequate to explore a broader perturbation space. Motivated by this, we propose an auxiliary feature perturbation stream as a supplement, leading to an expanded perturbation space. On the other, to sufficiently probe original image-level augmentations, we present a dual-stream perturbation technique, enabling two strong views to be simultaneously guided by a common weak view. Consequently, our overall Unified Dual-Stream Perturbations approach (UniMatch) surpasses all existing methods significantly across all evaluation protocols on the Pascal, Cityscapes, and COCO benchmarks. Its superiority is also demonstrated in remote sensing interpretation and medical image analysis. We hope our reproduced FixMatch and our results can inspire more future works. Code and logs are available at https://github.com/LiheYoung/UniMatch.
△ Less
Submitted 26 March, 2023; v1 submitted 21 August, 2022;
originally announced August 2022.
-
MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization
Authors:
Lei Qi,
Hongpeng Yang,
Yinghuan Shi,
Xin Geng
Abstract:
Domain generalization (DG) aims at learning a model on source domains to well generalize on the unseen target domain. Although it has achieved great success, most of existing methods require the label information for all training samples in source domains, which is time-consuming and expensive in the real-world application. In this paper, we resort to solving the semi-supervised domain generalizat…
▽ More
Domain generalization (DG) aims at learning a model on source domains to well generalize on the unseen target domain. Although it has achieved great success, most of existing methods require the label information for all training samples in source domains, which is time-consuming and expensive in the real-world application. In this paper, we resort to solving the semi-supervised domain generalization (SSDG) task, where there are a few label information in each source domain. To address the task, we first analyze the theory of the multi-domain learning, which highlights that 1) mitigating the impact of domain gap and 2) exploiting all samples to train the model can effectively reduce the generalization error in each source domain so as to improve the quality of pseudo-labels. According to the analysis, we propose MultiMatch, i.e., extending FixMatch to the multi-task learning framework, producing the high-quality pseudo-label for SSDG. To be specific, we consider each training domain as a single task (i.e., local task) and combine all training domains together (i.e., global task) to train an extra task for the unseen test domain. In the multi-task framework, we utilize the independent BN and classifier for each task, which can effectively alleviate the interference from different domains during pseudo-labeling. Also, most of parameters in the framework are shared, which can be trained by all training samples sufficiently. Moreover, to further boost the pseudo-label accuracy and the model's generalization, we fuse the predictions from the global task and local task during training and testing, respectively. A series of experiments validate the effectiveness of the proposed method, and it outperforms the existing semi-supervised methods and the SSDG method on several benchmark DG datasets.
△ Less
Submitted 29 April, 2024; v1 submitted 11 August, 2022;
originally announced August 2022.
-
Convexity of multiplicities of filtrations on local rings
Authors:
Harold Blum,
Yuchen Liu,
Lu Qi
Abstract:
We prove that the multiplicity of a filtration of a local ring satisfies various convexity properties. In particular, we show the multiplicity is convex along geodesics. As a consequence, we prove that the volume of a valuation is log convex on simplices of quasi-monomial valuations and give a new proof of a theorem of Xu and Zhuang on the uniqueness of normalized volume minimizers. In another dir…
▽ More
We prove that the multiplicity of a filtration of a local ring satisfies various convexity properties. In particular, we show the multiplicity is convex along geodesics. As a consequence, we prove that the volume of a valuation is log convex on simplices of quasi-monomial valuations and give a new proof of a theorem of Xu and Zhuang on the uniqueness of normalized volume minimizers. In another direction, we generalize a theorem of Rees on multiplicities of ideals to filtrations and characterize when the Minkowski inequality for filtrations is an equality under mild assumptions.
△ Less
Submitted 20 March, 2024; v1 submitted 9 August, 2022;
originally announced August 2022.
-
RDA: Reciprocal Distribution Alignment for Robust Semi-supervised Learning
Authors:
Yue Duan,
Lei Qi,
Lei Wang,
Lu** Zhou,
Yinghuan Shi
Abstract:
In this work, we propose Reciprocal Distribution Alignment (RDA) to address semi-supervised learning (SSL), which is a hyperparameter-free framework that is independent of confidence threshold and works with both the matched (conventionally) and the mismatched class distributions. Distribution mismatch is an often overlooked but more general SSL scenario where the labeled and the unlabeled data do…
▽ More
In this work, we propose Reciprocal Distribution Alignment (RDA) to address semi-supervised learning (SSL), which is a hyperparameter-free framework that is independent of confidence threshold and works with both the matched (conventionally) and the mismatched class distributions. Distribution mismatch is an often overlooked but more general SSL scenario where the labeled and the unlabeled data do not fall into the identical class distribution. This may lead to the model not exploiting the labeled data reliably and drastically degrade the performance of SSL methods, which could not be rescued by the traditional distribution alignment. In RDA, we enforce a reciprocal alignment on the distributions of the predictions from two classifiers predicting pseudo-labels and complementary labels on the unlabeled data. These two distributions, carrying complementary information, could be utilized to regularize each other without any prior of class distribution. Moreover, we theoretically show that RDA maximizes the input-output mutual information. Our approach achieves promising performance in SSL under a variety of scenarios of mismatched distributions, as well as the conventional matched SSL setting. Our code is available at: https://github.com/NJUyued/RDA4RobustSSL.
△ Less
Submitted 12 August, 2022; v1 submitted 9 August, 2022;
originally announced August 2022.
-
Generalizable Medical Image Segmentation via Random Amplitude Mixup and Domain-Specific Image Restoration
Authors:
Ziqi Zhou,
Lei Qi,
Yinghuan Shi
Abstract:
For medical image analysis, segmentation models trained on one or several domains lack generalization ability to unseen domains due to discrepancies between different data acquisition policies. We argue that the degeneration in segmentation performance is mainly attributed to overfitting to source domains and domain shift. To this end, we present a novel generalizable medical image segmentation me…
▽ More
For medical image analysis, segmentation models trained on one or several domains lack generalization ability to unseen domains due to discrepancies between different data acquisition policies. We argue that the degeneration in segmentation performance is mainly attributed to overfitting to source domains and domain shift. To this end, we present a novel generalizable medical image segmentation method. To be specific, we design our approach as a multi-task paradigm by combining the segmentation model with a self-supervision domain-specific image restoration (DSIR) module for model regularization. We also design a random amplitude mixup (RAM) module, which incorporates low-level frequency information of different domain images to synthesize new images. To guide our model be resistant to domain shift, we introduce a semantic consistency loss. We demonstrate the performance of our method on two public generalizable segmentation benchmarks in medical images, which validates our method could achieve the state-of-the-art performance.
△ Less
Submitted 7 August, 2022;
originally announced August 2022.
-
Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning
Authors:
Lu Zhang,
Lu Qi,
Xu Yang,
Hong Qiao,
Ming-Hsuan Yang,
Zhiyong Liu
Abstract:
This paper tackles the problem of novel category discovery (NCD), which aims to discriminate unknown categories in large-scale image collections. The NCD task is challenging due to the closeness to the real-world scenarios, where we have only encountered some partial classes and images. Unlike other works on the NCD, we leverage the prototypes to emphasize the importance of category discrimination…
▽ More
This paper tackles the problem of novel category discovery (NCD), which aims to discriminate unknown categories in large-scale image collections. The NCD task is challenging due to the closeness to the real-world scenarios, where we have only encountered some partial classes and images. Unlike other works on the NCD, we leverage the prototypes to emphasize the importance of category discrimination and alleviate the issue of missing annotations of novel classes. Concretely, we propose a novel adaptive prototype learning method consisting of two main stages: prototypical representation learning and prototypical self-training. In the first stage, we obtain a robust feature extractor, which could serve for all images with base and novel categories. This ability of instance and category discrimination of the feature extractor is boosted by self-supervised learning and adaptive prototypes. In the second stage, we utilize the prototypes again to rectify offline pseudo labels and train a final parametric classifier for category clustering. We conduct extensive experiments on four benchmark datasets and demonstrate the effectiveness and robustness of the proposed method with state-of-the-art performance.
△ Less
Submitted 1 August, 2022;
originally announced August 2022.
-
Mesoscopic transport in KSTAR plasmas: avalanches and the $E \times B$ staircase
Authors:
Minjun J. Choi,
Jae-Min Kwon,
Lei Qi,
P. H. Diamond,
T. S. Hahm,
Hogun Jhang,
Juhyung Kim,
Michael Leconte,
Hyun-Seok Kim,
Jisung Kang,
Byoung-Ho Park,
**il Chung,
Jaehyun Lee,
Minho Kim,
Gunsu S. Yun,
Y. U. Nam,
Jaewook Kim,
Won-Ha Ko,
K. D. Lee,
J. W. Juhn,
the KSTAR team
Abstract:
The self-organization is one of the most interesting phenomena in the non-equilibrium complex system, generating ordered structures of different sizes and durations. In tokamak plasmas, various self-organized phenomena have been reported, and two of them, coexisting in the near-marginal (interaction dominant) regime, are avalanches and the $E \times B$ staircase. Avalanches mean the ballistic flux…
▽ More
The self-organization is one of the most interesting phenomena in the non-equilibrium complex system, generating ordered structures of different sizes and durations. In tokamak plasmas, various self-organized phenomena have been reported, and two of them, coexisting in the near-marginal (interaction dominant) regime, are avalanches and the $E \times B$ staircase. Avalanches mean the ballistic flux propagation event through successive interactions as it propagates, and the $E \times B$ staircase means a globally ordered pattern of self-organized zonal flow layers. Various models have been suggested to understand their characteristics and relation, but experimental researches have been mostly limited to the demonstration of their existence. Here we report detailed analyses of their dynamics and statistics and explain their relation. Avalanches influence the formation and the width distribution of the $E \times B$ staircase, while the $E \times B$ staircase confines avalanches within its mesoscopic width until dissipated or penetrated. Our perspective to consider them the self-organization phenomena enhances our fundamental understanding of them as well as links our findings with the self-organization of mesoscopic structures in various complex systems.
△ Less
Submitted 20 February, 2024; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Standard Dual Quaternion Optimization and Its Applications in Hand-Eye Calibration and SLAM
Authors:
Liqun Qi
Abstract:
Several common dual quaternion functions, such as the power function, the magnitude function, the $2$-norm function and the $k$th largest eigenvalue of a dual quaternion Hermitian matrix, are standard dual quaternion functions, i.e., the standard parts of their function values depend upon only the standard parts of their dual quaternion variables. Furthermore, the sum, product, minimum, maximum an…
▽ More
Several common dual quaternion functions, such as the power function, the magnitude function, the $2$-norm function and the $k$th largest eigenvalue of a dual quaternion Hermitian matrix, are standard dual quaternion functions, i.e., the standard parts of their function values depend upon only the standard parts of their dual quaternion variables. Furthermore, the sum, product, minimum, maximum and composite functions of two standard dual functions, the logarithm and the exponential of standard unit dual quaternion functions, are still standard dual quaternion functions. On the other hand, the dual quaternion optimization problem, where objective and constraint function values are dual numbers but variables are dual quaternions, naturally arises from applications. We show that to solve an equality constrained dual quaternion optimization problem, we only need to solve two quaternion optimization problems. If the involved dual quaternion functions are all standard, the optimization problem is called a standard dual quaternion optimization problem, and some better results hold. Then, we show that the dual quaternion optimization problems arising from the hand-eye calibration problem and the simultaneous localization and map** (SLAM) problem are equality constrained standard dual quaternion optimization problems.
△ Less
Submitted 7 August, 2022; v1 submitted 29 June, 2022;
originally announced June 2022.
-
fETSmcs: Feature-based ETS model component selection
Authors:
Lingzhi Qi,
Xixi Li,
Qiang Wang,
Suling Jia
Abstract:
The well-developed ETS (ExponenTial Smoothing or Error, Trend, Seasonality) method incorporating a family of exponential smoothing models in state space representation has been widely used for automatic forecasting. The existing ETS method uses information criteria for model selection by choosing an optimal model with the smallest information criterion among all models fitted to a given time serie…
▽ More
The well-developed ETS (ExponenTial Smoothing or Error, Trend, Seasonality) method incorporating a family of exponential smoothing models in state space representation has been widely used for automatic forecasting. The existing ETS method uses information criteria for model selection by choosing an optimal model with the smallest information criterion among all models fitted to a given time series. The ETS method under such a model selection scheme suffers from computational complexity when applied to large-scale time series data. To tackle this issue, we propose an efficient approach for ETS model selection by training classifiers on simulated data to predict appropriate model component forms for a given time series. We provide a simulation study to show the model selection ability of the proposed approach on simulated data. We evaluate our approach on the widely used forecasting competition data set M4, in terms of both point forecasts and prediction intervals. To demonstrate the practical value of our method, we showcase the performance improvements from our approach on a monthly hospital data set.
△ Less
Submitted 26 June, 2022;
originally announced June 2022.
-
Quantum transport in a one-dimensional quasicrystal with mobility edges
Authors:
Yan Xing,
Lu Qi,
Xuedong Zhao,
Zhe Lü,
Shutian Liu,
Shou Zhang,
Hong-Fu Wang
Abstract:
Quantum transport in a one-dimensional (1D) quasiperiodic lattice with mobility edges is explored. We first investigate the adiabatic pum** between left and right edge modes by resorting to two edge-bulk-edge channels and demonstrate that the success or failure of the adiabatic pum** depends on whether the corresponding bulk subchannel undergoes a localization-delocalization transition. Compar…
▽ More
Quantum transport in a one-dimensional (1D) quasiperiodic lattice with mobility edges is explored. We first investigate the adiabatic pum** between left and right edge modes by resorting to two edge-bulk-edge channels and demonstrate that the success or failure of the adiabatic pum** depends on whether the corresponding bulk subchannel undergoes a localization-delocalization transition. Compared with the paradigmatic Aubry-André (AA) model, the introduction of mobility edges triggers an opposite outcome for successful pum** in the two channels, showing a discrepancy of critical condition, and facilitates the robustness of the adiabatic pum** against quasidisorder. We also consider the transfer between excitations at both boundaries of the lattice and an anomalous phenomenon characterized by the enhanced quasidisorder contributing to the excitation transfer is found. Furthermore, there exists a parametric regime where a nonreciprocal effect emerges in the presence of mobility edges, which leads to a unidirectional transport for the excitation transfer and enables potential applications in the engineering of quantum diodes.
△ Less
Submitted 15 June, 2022;
originally announced June 2022.
-
Mechanism of Local Lattice Distortion Effects on Vacancy Migration Barriers in FCC Alloys
Authors:
Zhucong Xi,
Mingfei Zhang,
Louis G. Hector Jr.,
Amit Misra,
Liang Qi
Abstract:
Accurate prediction of vacancy migration energy barriers, $ΔE_a$, in multi-component alloys is extremely challenging yet critical for the development of diffusional transformation kinetics needed to model alloy behavior in many technological applications. Here, results from $ΔE_a$ and the energy driving force $ΔE$ of many (>1000) vacancy migration events calculated using density functional theory…
▽ More
Accurate prediction of vacancy migration energy barriers, $ΔE_a$, in multi-component alloys is extremely challenging yet critical for the development of diffusional transformation kinetics needed to model alloy behavior in many technological applications. Here, results from $ΔE_a$ and the energy driving force $ΔE$ of many (>1000) vacancy migration events calculated using density functional theory and nudged elastic band method show large changes (~1eV) of $ΔE_a$ in different local chemical environments of the model face-centered cubic Al-Mg-Zn alloys. Due to local lattice distortion effects induced by solute atoms (such as Mg) with different sizes than the matrix element (Al), the changes of $ΔE_a$ for one type of migrating atoms originate primarily from fluctuations of $Δe_a\equiv ΔE_a - \frac{1}{2}ΔE$. To understand the fluctuations, a quartic function is shown to accurately describe the energy landscape of the minimum energy path (MEP) for each vacancy migration event. Analyses of the quartic function show that $Δe_a$ can be approximated with $Δe_a \approx αk_fD^2$, where $α\sim 0.022$ is a constant of all types of migrating atoms. Here $D$ is the distance of a migrating atom between two adjacent equilibrium positions and $k_f$ is the average vibration spring constant of this atom at these two equilibrium positions. $k_f$ and $D$ quantitatively describe the lattice distortion effects on the curvatures and locations of the MEP at its initial and final states in different local chemical environments. We also used the local lattice occupations as inputs to train surrogate models to predict coefficients of the quartic function, which accurately and efficiently output both $ΔE_a$ and $ΔE$ as the necessary inputs for the mesoscale studies of diffusional transformation in Al-Mg-Zn alloys.
△ Less
Submitted 6 June, 2022;
originally announced June 2022.
-
Observation of Non-Vanishing Optical Helicity in Thermal Radiation from Symmetry-Broken Metasurfaces
Authors:
Xueji Wang,
Tyler Sentz,
Sathwik Bharadwaj,
Subir Ray,
Yifan Wang,
Dan Jiao,
Limei Qi,
Zubin Jacob
Abstract:
Spinning thermal radiation is a unique phenomenon observed in condensed astronomical objects including the Wolf-Rayet star EZ-CMa and the red degenerate star G99-47, due to existence of strong magnetic fields. Here, by designing symmetry-broken metasurfaces, we demonstrate that spinning thermal radiation with a non-vanishing optical helicity can be realized even without applying a magnetic field.…
▽ More
Spinning thermal radiation is a unique phenomenon observed in condensed astronomical objects including the Wolf-Rayet star EZ-CMa and the red degenerate star G99-47, due to existence of strong magnetic fields. Here, by designing symmetry-broken metasurfaces, we demonstrate that spinning thermal radiation with a non-vanishing optical helicity can be realized even without applying a magnetic field. We design non-vanishing optical helicity by engineering a dispersionless band which emits omnidirectional spinning thermal radiation, where our design reaches 39% of the fundamental limit. Our results firmly suggest metasurfaces can impart spin coherence in the incoherent radiation excited by thermal fluctuations. The symmetry-based design strategy also provides a general pathway for comprehensively controlling thermal radiation in its temporal and spin coherence.
△ Less
Submitted 7 January, 2023; v1 submitted 12 May, 2022;
originally announced May 2022.
-
Multi-mode Tensor Train Factorization with Spatial-spectral Regularization for Remote Sensing Images Recovery
Authors:
Gaohang Yu,
Shaochun Wan,
Liqun Qi,
Yanwei Xu
Abstract:
Tensor train (TT) factorization and corresponding TT rank, which can well express the low-rankness and mode correlations of higher-order tensors, have attracted much attention in recent years. However, TT factorization based methods are generally not sufficient to characterize low-rankness along each mode of third-order tensor. Inspired by this, we generalize the tensor train factorization to the…
▽ More
Tensor train (TT) factorization and corresponding TT rank, which can well express the low-rankness and mode correlations of higher-order tensors, have attracted much attention in recent years. However, TT factorization based methods are generally not sufficient to characterize low-rankness along each mode of third-order tensor. Inspired by this, we generalize the tensor train factorization to the mode-k tensor train factorization and introduce a corresponding multi-mode tensor train (MTT) rank. Then, we proposed a novel low-MTT-rank tensor completion model via multi-mode TT factorization and spatial-spectral smoothness regularization. To tackle the proposed model, we develop an efficient proximal alternating minimization (PAM) algorithm. Extensive numerical experiment results on visual data demonstrate that the proposed MTTD3R method outperforms compared methods in terms of visual and quantitative measures.
△ Less
Submitted 5 May, 2022;
originally announced May 2022.
-
von Neumann type trace inequality for dual quaternion matrices
Authors:
Chen Ling,
Hong** He,
Liqun Qi,
Tingting Feng
Abstract:
Dual quaternion matrices have important applications in multi-agent formation control. In this paper, we first address the concept of spectral norm of dual quaternion matrices. Then, we introduce a von Neumann type trace inequality and a Hoffman-Wielandt type inequality for general dual quaternion matrices, where the latter characterizes a simultaneous perturbation bound on all singular values of…
▽ More
Dual quaternion matrices have important applications in multi-agent formation control. In this paper, we first address the concept of spectral norm of dual quaternion matrices. Then, we introduce a von Neumann type trace inequality and a Hoffman-Wielandt type inequality for general dual quaternion matrices, where the latter characterizes a simultaneous perturbation bound on all singular values of a dual quaternion matrix. In particular, we also present two variants of the above two inequalities expressed by eigenvalues of dual quaternion Hermitian matrices. Our results are helpful for the further study of dual quaternion matrix theory, algorithmic design, and applications.
△ Less
Submitted 12 April, 2023; v1 submitted 20 April, 2022;
originally announced April 2022.
-
Label Distribution Learning for Generalizable Multi-source Person Re-identification
Authors:
Lei Qi,
Jiaying Shen,
Jiaqi Liu,
Yinghuan Shi,
Xin Geng
Abstract:
Person re-identification (Re-ID) is a critical technique in the video surveillance system, which has achieved significant success in the supervised setting. However, it is difficult to directly apply the supervised model to arbitrary unseen domains due to the domain gap between the available source domains and unseen target domains. In this paper, we propose a novel label distribution learning (LD…
▽ More
Person re-identification (Re-ID) is a critical technique in the video surveillance system, which has achieved significant success in the supervised setting. However, it is difficult to directly apply the supervised model to arbitrary unseen domains due to the domain gap between the available source domains and unseen target domains. In this paper, we propose a novel label distribution learning (LDL) method to address the generalizable multi-source person Re-ID task (i.e., there are multiple available source domains, and the testing domain is unseen during training), which aims to explore the relation of different classes and mitigate the domain-shift across different domains so as to improve the discrimination of the model and learn the domain-invariant feature, simultaneously. Specifically, during the training process, we produce the label distribution via the online manner to mine the relation information of different classes, thus it is beneficial for extracting the discriminative feature. Besides, for the label distribution of each class, we further revise it to give more and equal attention to the other domains that the class does not belong to, which can effectively reduce the domain gap across different domains and obtain the domain-invariant feature. Furthermore, we also give the theoretical analysis to demonstrate that the proposed method can effectively deal with the domain-shift issue. Extensive experiments on multiple benchmark datasets validate the effectiveness of the proposed method and show that the proposed method can outperform the state-of-the-art methods. Besides, further analysis also reveals the superiority of the proposed method.
△ Less
Submitted 24 August, 2022; v1 submitted 12 April, 2022;
originally announced April 2022.
-
Dual Quaternion Matrices in Multi-Agent Formation Control
Authors:
Liqun Qi,
Xiangke Wang,
Ziyan Luo
Abstract:
Three kinds of dual quaternion matrices associated with the mutual visibility graph, namely the relative configuration adjacency matrix, the logarithm adjacency matrix and the relative twist adjacency matrix, play important roles in multi-agent formation control. In this paper, we study their properties and applications. We show that the relative configuration adjacency matrix and the logarithm ad…
▽ More
Three kinds of dual quaternion matrices associated with the mutual visibility graph, namely the relative configuration adjacency matrix, the logarithm adjacency matrix and the relative twist adjacency matrix, play important roles in multi-agent formation control. In this paper, we study their properties and applications. We show that the relative configuration adjacency matrix and the logarithm adjacency matrix are all Hermitian matrices, and thus have very nice spectral properties. We introduce dual quaternion Laplacian matrices, and prove a Gershgorin-type theorem for square dual quaternion Hermitian matrices, for studying properties of dual quaternion Laplacian matrices. The role of the dual quaternion Laplacian matrices in formation control is discussed.
△ Less
Submitted 20 December, 2022; v1 submitted 4 April, 2022;
originally announced April 2022.
-
MAT: Mask-Aware Transformer for Large Hole Image Inpainting
Authors:
Wenbo Li,
Zhe Lin,
Kun Zhou,
Lu Qi,
Yi Wang,
Jiaya Jia
Abstract:
Recent studies have shown the importance of modeling long-range interactions in the inpainting problem. To achieve this goal, existing approaches exploit either standalone attention techniques or transformers, but usually under a low resolution in consideration of computational cost. In this paper, we present a novel transformer-based model for large hole inpainting, which unifies the merits of tr…
▽ More
Recent studies have shown the importance of modeling long-range interactions in the inpainting problem. To achieve this goal, existing approaches exploit either standalone attention techniques or transformers, but usually under a low resolution in consideration of computational cost. In this paper, we present a novel transformer-based model for large hole inpainting, which unifies the merits of transformers and convolutions to efficiently process high-resolution images. We carefully design each component of our framework to guarantee the high fidelity and diversity of recovered images. Specifically, we customize an inpainting-oriented transformer block, where the attention module aggregates non-local information only from partial valid tokens, indicated by a dynamic mask. Extensive experiments demonstrate the state-of-the-art performance of the new model on multiple benchmark datasets. Code is released at https://github.com/fenglinglwb/MAT.
△ Less
Submitted 26 June, 2022; v1 submitted 29 March, 2022;
originally announced March 2022.
-
MutexMatch: Semi-Supervised Learning with Mutex-Based Consistency Regularization
Authors:
Yue Duan,
Zhen Zhao,
Lei Qi,
Lei Wang,
Lu** Zhou,
Yinghuan Shi,
Yang Gao
Abstract:
The core issue in semi-supervised learning (SSL) lies in how to effectively leverage unlabeled data, whereas most existing methods tend to put a great emphasis on the utilization of high-confidence samples yet seldom fully explore the usage of low-confidence samples. In this paper, we aim to utilize low-confidence samples in a novel way with our proposed mutex-based consistency regularization, nam…
▽ More
The core issue in semi-supervised learning (SSL) lies in how to effectively leverage unlabeled data, whereas most existing methods tend to put a great emphasis on the utilization of high-confidence samples yet seldom fully explore the usage of low-confidence samples. In this paper, we aim to utilize low-confidence samples in a novel way with our proposed mutex-based consistency regularization, namely MutexMatch. Specifically, the high-confidence samples are required to exactly predict "what it is" by conventional True-Positive Classifier, while the low-confidence samples are employed to achieve a simpler goal -- to predict with ease "what it is not" by True-Negative Classifier. In this sense, we not only mitigate the pseudo-labeling errors but also make full use of the low-confidence unlabeled data by consistency of dissimilarity degree. MutexMatch achieves superior performance on multiple benchmark datasets, i.e., CIFAR-10, CIFAR-100, SVHN, STL-10, mini-ImageNet and Tiny-ImageNet. More importantly, our method further shows superiority when the amount of labeled data is scarce, e.g., 92.23% accuracy with only 20 labeled data on CIFAR-10. Our code and model weights have been released at https://github.com/NJUyued/MutexMatch4SSL.
△ Less
Submitted 21 December, 2022; v1 submitted 27 March, 2022;
originally announced March 2022.
-
A bond counting model for accurate prediction of lattice parameter of bcc solid solution alloys
Authors:
Chris Tandoc,
Liang Qi,
Yong-Jie Hu
Abstract:
Lattice Parameter is an important material feature in High Entropy Alloy (HEA) Design. Vegards Law is typically used to estimate lattice parameters but is often inaccurate for metal alloys due to an inability to account for charge transfer which can affect atomic volumes. The present study used ab initio simulation to calculate bond lengths between atoms of dissimilar elements in B2 intermetallic…
▽ More
Lattice Parameter is an important material feature in High Entropy Alloy (HEA) Design. Vegards Law is typically used to estimate lattice parameters but is often inaccurate for metal alloys due to an inability to account for charge transfer which can affect atomic volumes. The present study used ab initio simulation to calculate bond lengths between atoms of dissimilar elements in B2 intermetallic compounds which was then combined with a bond counting model to produce a model to estimate the lattice parameters of Refractory BCC HEAS. The model was tested using a supercell method which modeled various random solid solution HEAs. The proposed model produced lattice parameters with superior accuracy to Vegards Law without the need for large DFT calculations or fitting parameters. The proposed model had a root mean squared error (RMSE) of 0.006 Angstroms which is half that of Vegards Law RMSE (0.012 Angstrom).
△ Less
Submitted 17 March, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
SSCU-Net: Spatial-Spectral Collaborative Unmixing Network for Hyperspectral Images
Authors:
Lin Qi,
Feng Gao,
Junyu Dong,
Xinbo Gao,
Qian Du
Abstract:
Linear spectral unmixing is an essential technique in hyperspectral image processing and interpretation. In recent years, deep learning-based approaches have shown great promise in hyperspectral unmixing, in particular, unsupervised unmixing methods based on autoencoder networks are a recent trend. The autoencoder model, which automatically learns low-dimensional representations (abundances) and r…
▽ More
Linear spectral unmixing is an essential technique in hyperspectral image processing and interpretation. In recent years, deep learning-based approaches have shown great promise in hyperspectral unmixing, in particular, unsupervised unmixing methods based on autoencoder networks are a recent trend. The autoencoder model, which automatically learns low-dimensional representations (abundances) and reconstructs data with their corresponding bases (endmembers), has achieved superior performance in hyperspectral unmixing. In this article, we explore the effective utilization of spatial and spectral information in autoencoder-based unmixing networks. Important findings on the use of spatial and spectral information in the autoencoder framework are discussed. Inspired by these findings, we propose a spatial-spectral collaborative unmixing network, called SSCU-Net, which learns a spatial autoencoder network and a spectral autoencoder network in an end-to-end manner to more effectively improve the unmixing performance. SSCU-Net is a two-stream deep network and shares an alternating architecture, where the two autoencoder networks are efficiently trained in a collaborative way for estimation of endmembers and abundances. Meanwhile, we propose a new spatial autoencoder network by introducing a superpixel segmentation method based on abundance information, which greatly facilitates the employment of spatial information and improves the accuracy of unmixing network. Moreover, extensive ablation studies are carried out to investigate the performance gain of SSCU-Net. Experimental results on both synthetic and real hyperspectral data sets illustrate the effectiveness and competitiveness of the proposed SSCU-Net compared with several state-of-the-art hyperspectral unmixing methods.
△ Less
Submitted 8 August, 2022; v1 submitted 12 March, 2022;
originally announced March 2022.
-
Multi-scale Investigation of Chemical Short-Range Order and Dislocation Glide in the MoNbTi and TaNbTi Refractory Multi-Principal Element Alloys
Authors:
Hui Zheng,
Lauren T. W. Fey,
Xiang-Guo Li,
Yong-Jie Hu,
Liang Qi,
Chi Chen,
Shuozhi Xu,
Irene J. Beyerlein,
Shyue ** Ong
Abstract:
Refractory multi-principal element alloys (RMPEAs) are promising materials for high-temperature structural applications. Here, we investigate the role of chemical short-range ordering (CSRO) on dislocation glide in two model RMPEAs - TaNbTi and MoNbTi - using a multi-scale modeling approach. A highly accurate machine learning interatomic potential was developed for the Mo-Ta-Nb-Ti system and used…
▽ More
Refractory multi-principal element alloys (RMPEAs) are promising materials for high-temperature structural applications. Here, we investigate the role of chemical short-range ordering (CSRO) on dislocation glide in two model RMPEAs - TaNbTi and MoNbTi - using a multi-scale modeling approach. A highly accurate machine learning interatomic potential was developed for the Mo-Ta-Nb-Ti system and used to demonstrate that MoNbTi exhibits a much greater degree of SRO than TaNbTi and the local composition has a direct effect on the unstable stacking fault energies (USFE). From mesoscale phase-field dislocation dynamics simulations, we find that increasing SRO leads to higher mean USFEs, thereby increasing the stress required for dislocation glide. The gliding dislocations experience significant hardening due to pinning and depinning caused by random compositional fluctuations, with higher SRO decreasing the degree of USFE dispersion and hence, amount of hardening. Finally, we show how the morphology of an expanding dislocation loop is affected by the applied stress, with higher SRO requiring higher applied stresses to achieve smooth screw dislocation glide.
△ Less
Submitted 7 March, 2022;
originally announced March 2022.
-
Minimax principle for right eigenvalues of dual quaternion matrices and their generalized inverses
Authors:
Chen Ling,
Liqun Qi,
Hong Yan
Abstract:
Dual quaternions can represent rigid body motion in 3D spaces, and have found wide applications in robotics, 3D motion modelling and control, and computer graphics. In this paper, we introduce three different right linear independency for a set of dual quaternion vectors, and study some related basic properties for the set of dual quaternion vectors and dual quaternion matrices. We present a minim…
▽ More
Dual quaternions can represent rigid body motion in 3D spaces, and have found wide applications in robotics, 3D motion modelling and control, and computer graphics. In this paper, we introduce three different right linear independency for a set of dual quaternion vectors, and study some related basic properties for the set of dual quaternion vectors and dual quaternion matrices. We present a minimax principle for right eigenvalues of dual quaternion Hermitian matrices. Based upon a newly established Cauchy-Schwarz inequality for dual quaternion vectors and singular value decomposition of dual quaternion matrices, we propose an important inequality for singular values of dual quaternion matrices. We finally introduce the concept of generalized inverse of dual quaternion matrices, and present the necessary and sufficient conditions for a dual quaternion matrix to be one of four types of generalized inverses of another dual quaternion matrix.
△ Less
Submitted 4 May, 2023; v1 submitted 7 March, 2022;
originally announced March 2022.
-
Low Rank Approximation of Dual Complex Matrices
Authors:
Liqun Qi,
David M. Alexander,
Zhongming Chen,
Chen Ling,
Ziyan Luo
Abstract:
Dual complex numbers can represent rigid body motion in 2D spaces. Dual complex matrices are linked with screw theory, and have potential applications in various areas. In this paper, we study low rank approximation of dual complex matrices. We define $2$-norm for dual complex vectors, and Frobenius norm for dual complex matrices. These norms are nonnegative dual numbers. We establish the unitary…
▽ More
Dual complex numbers can represent rigid body motion in 2D spaces. Dual complex matrices are linked with screw theory, and have potential applications in various areas. In this paper, we study low rank approximation of dual complex matrices. We define $2$-norm for dual complex vectors, and Frobenius norm for dual complex matrices. These norms are nonnegative dual numbers. We establish the unitary invariance property of dual complex matrices. We study eigenvalues of square dual complex matrices, and show that an $n \times n$ dual complex Hermitian matrix has exactly $n$ eigenvalues, which are dual numbers. We present a singular value decomposition (SVD) theorem for dual complex matrices, define ranks and appreciable ranks for dual complex matrices, and study their properties. We establish an Eckart-Young like theorem for dual complex matrices, and present an algorithm framework for low rank approximation of dual complex matrices via truncated SVD. The SVD of dual complex matrices also provides a basic tool for Principal Component Analysis (PCA) via these matrices. Numerical experiments are reported.
△ Less
Submitted 30 January, 2022;
originally announced January 2022.
-
A Novel Mix-normalization Method for Generalizable Multi-source Person Re-identification
Authors:
Lei Qi,
Lei Wang,
Yinghuan Shi,
Xin Geng
Abstract:
Person re-identification (Re-ID) has achieved great success in the supervised scenario. However, it is difficult to directly transfer the supervised model to arbitrary unseen domains due to the model overfitting to the seen source domains. In this paper, we aim to tackle the generalizable multi-source person Re-ID task (i.e., there are multiple available source domains, and the testing domain is u…
▽ More
Person re-identification (Re-ID) has achieved great success in the supervised scenario. However, it is difficult to directly transfer the supervised model to arbitrary unseen domains due to the model overfitting to the seen source domains. In this paper, we aim to tackle the generalizable multi-source person Re-ID task (i.e., there are multiple available source domains, and the testing domain is unseen during training) from the data augmentation perspective, thus we put forward a novel method, termed MixNorm, which consists of domain-aware mix-normalization (DMN) and domain-ware center regularization (DCR). Different from the conventional data augmentation, the proposed domain-aware mix-normalization to enhance the diversity of features during training from the normalization view of the neural network, which can effectively alleviate the model overfitting to the source domains, so as to boost the generalization capability of the model in the unseen domain. To better learn the domain-invariant model, we further develop the domain-aware center regularization to better map the produced diverse features into the same space. Extensive experiments on multiple benchmark datasets validate the effectiveness of the proposed method and show that the proposed method can outperform the state-of-the-art methods. Besides, further analysis also reveals the superiority of the proposed method.
△ Less
Submitted 12 June, 2022; v1 submitted 24 January, 2022;
originally announced January 2022.
-
MVDG: A Unified Multi-view Framework for Domain Generalization
Authors:
Jian Zhang,
Lei Qi,
Yinghuan Shi,
Yang Gao
Abstract:
To generalize the model trained in source domains to unseen target domains, domain generalization (DG) has recently attracted lots of attention. Since target domains can not be involved in training, overfitting source domains is inevitable. As a popular regularization technique, the meta-learning training scheme has shown its ability to resist overfitting. However, in the training stage, current m…
▽ More
To generalize the model trained in source domains to unseen target domains, domain generalization (DG) has recently attracted lots of attention. Since target domains can not be involved in training, overfitting source domains is inevitable. As a popular regularization technique, the meta-learning training scheme has shown its ability to resist overfitting. However, in the training stage, current meta-learning-based methods utilize only one task along a single optimization trajectory, which might produce a biased and noisy optimization direction. Beyond the training stage, overfitting could also cause unstable prediction in the test stage. In this paper, we propose a novel multi-view DG framework to effectively reduce the overfitting in both the training and test stage. Specifically, in the training stage, we develop a multi-view regularized meta-learning algorithm that employs multiple optimization trajectories to produce a suitable optimization direction for model updating. We also theoretically show that the generalization bound could be reduced by increasing the number of tasks in each trajectory. In the test stage, we utilize multiple augmented images to yield a multi-view prediction to alleviate unstable prediction, which significantly promotes model reliability. Extensive experiments on three benchmark datasets validate that our method can find a flat minimum to enhance generalization and outperform several state-of-the-art approaches.
△ Less
Submitted 8 August, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Generalizable Cross-modality Medical Image Segmentation via Style Augmentation and Dual Normalization
Authors:
Ziqi Zhou,
Lei Qi,
Xin Yang,
Dong Ni,
Yinghuan Shi
Abstract:
For medical image segmentation, imagine if a model was only trained using MR images in source domain, how about its performance to directly segment CT images in target domain? This setting, namely generalizable cross-modality segmentation, owning its clinical potential, is much more challenging than other related settings, e.g., domain adaptation. To achieve this goal, we in this paper propose a n…
▽ More
For medical image segmentation, imagine if a model was only trained using MR images in source domain, how about its performance to directly segment CT images in target domain? This setting, namely generalizable cross-modality segmentation, owning its clinical potential, is much more challenging than other related settings, e.g., domain adaptation. To achieve this goal, we in this paper propose a novel dual-normalization model by leveraging the augmented source-similar and source-dissimilar images during our generalizable segmentation. To be specific, given a single source domain, aiming to simulate the possible appearance change in unseen target domains, we first utilize a nonlinear transformation to augment source-similar and source-dissimilar images. Then, to sufficiently exploit these two types of augmentations, our proposed dual-normalization based model employs a shared backbone yet independent batch normalization layer for separate normalization. Afterward, we put forward a style-based selection scheme to automatically choose the appropriate path in the test stage. Extensive experiments on three publicly available datasets, i.e., BraTS, Cross-Modality Cardiac, and Abdominal Multi-Organ datasets, have demonstrated that our method outperforms other state-of-the-art domain generalization methods. Code is available at https://github.com/zzzqzhou/Dual-Normalization.
△ Less
Submitted 28 March, 2022; v1 submitted 21 December, 2021;
originally announced December 2021.
-
CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation
Authors:
Lu Qi,
Jason Kuen,
Zhe Lin,
Jiuxiang Gu,
Fengyun Rao,
Dian Li,
Weidong Guo,
Zhen Wen,
Ming-Hsuan Yang,
Jiaya Jia
Abstract:
To improve instance-level detection/segmentation performance, existing self-supervised and semi-supervised methods extract either task-unrelated or task-specific training signals from unlabeled data. We show that these two approaches, at the two extreme ends of the task-specificity spectrum, are suboptimal for the task performance. Utilizing too little task-specific training signals causes underfi…
▽ More
To improve instance-level detection/segmentation performance, existing self-supervised and semi-supervised methods extract either task-unrelated or task-specific training signals from unlabeled data. We show that these two approaches, at the two extreme ends of the task-specificity spectrum, are suboptimal for the task performance. Utilizing too little task-specific training signals causes underfitting to the ground-truth labels of downstream tasks, while the opposite causes overfitting to the ground-truth labels. To this end, we propose a novel Class-Agnostic Semi-Supervised Learning (CA-SSL) framework to achieve a more favorable task-specificity balance in extracting training signals from unlabeled data. CA-SSL has three training stages that act on either ground-truth labels (labeled data) or pseudo labels (unlabeled data). This decoupling strategy avoids the complicated scheme in traditional SSL methods that balances the contributions from both data types. Especially, we introduce a warmup training stage to achieve a more optimal balance in task specificity by ignoring class information in the pseudo labels, while preserving localization training signals. As a result, our warmup model can better avoid underfitting/overfitting when fine-tuned on the ground-truth labels in detection and segmentation tasks. Using 3.6M unlabeled data, we achieve a significant performance gain of 4.7% over ImageNet-pretrained baseline on FCOS object detection. In addition, our warmup model demonstrates excellent transferability to other detection and segmentation frameworks.
△ Less
Submitted 19 July, 2022; v1 submitted 9 December, 2021;
originally announced December 2021.
-
PLACE dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization
Authors:
**tao Guo,
Lei Qi,
Yinghuan Shi,
Yang Gao
Abstract:
Domain generalization (DG) aims to learn a generic model from multiple observed source domains that generalizes well to arbitrary unseen target domains without further training. The major challenge in DG is that the model inevitably faces a severe overfitting issue due to the domain gap between source and target domains. To mitigate this problem, some dropout-based methods have been proposed to re…
▽ More
Domain generalization (DG) aims to learn a generic model from multiple observed source domains that generalizes well to arbitrary unseen target domains without further training. The major challenge in DG is that the model inevitably faces a severe overfitting issue due to the domain gap between source and target domains. To mitigate this problem, some dropout-based methods have been proposed to resist overfitting by discarding part of the representation of the intermediate layers. However, we observe that most of these methods only conduct the dropout operation in some specific layers, leading to an insufficient regularization effect on the model. We argue that applying dropout at multiple layers can produce stronger regularization effects, which could alleviate the overfitting problem on source domains more adequately than previous layer-specific dropout methods. In this paper, we develop a novel layer-wise and channel-wise dropout for DG, which randomly selects one layer and then randomly selects its channels to conduct dropout. Particularly, the proposed method can generate a variety of data variants to better deal with the overfitting issue. We also provide theoretical analysis for our dropout method and prove that it can effectively reduce the generalization error bound. Besides, we leverage the progressive scheme to increase the dropout ratio with the training progress, which can gradually boost the difficulty of training the model to enhance its robustness. Extensive experiments on three standard benchmark datasets have demonstrated that our method outperforms several state-of-the-art DG methods. Our code is available at https://github.com/lingeringlight/PLACEdropout.
△ Less
Submitted 17 September, 2023; v1 submitted 7 December, 2021;
originally announced December 2021.
-
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering
Authors:
Yueqing Sun,
Qi Shi,
Le Qi,
Yu Zhang
Abstract:
Existing KG-augmented models for commonsense question answering primarily focus on designing elaborate Graph Neural Networks (GNNs) to model knowledge graphs (KGs). However, they ignore (i) the effectively fusing and reasoning over question context representations and the KG representations, and (ii) automatically selecting relevant nodes from the noisy KGs during reasoning. In this paper, we prop…
▽ More
Existing KG-augmented models for commonsense question answering primarily focus on designing elaborate Graph Neural Networks (GNNs) to model knowledge graphs (KGs). However, they ignore (i) the effectively fusing and reasoning over question context representations and the KG representations, and (ii) automatically selecting relevant nodes from the noisy KGs during reasoning. In this paper, we propose a novel model, JointLK, which solves the above limitations through the joint reasoning of LM and GNN and the dynamic KGs pruning mechanism. Specifically, JointLK performs joint reasoning between LM and GNN through a novel dense bidirectional attention module, in which each question token attends on KG nodes and each KG node attends on question tokens, and the two modal representations fuse and update mutually by multi-step interactions. Then, the dynamic pruning module uses the attention weights generated by joint reasoning to prune irrelevant KG nodes recursively. We evaluate JointLK on the CommonsenseQA and OpenBookQA datasets, and demonstrate its improvements to the existing LM and LM+KG models, as well as its capability to perform interpretable reasoning.
△ Less
Submitted 2 May, 2022; v1 submitted 5 December, 2021;
originally announced December 2021.
-
Unsupervised Domain Generalization for Person Re-identification: A Domain-specific Adaptive Framework
Authors:
Lei Qi,
Jiaqi Liu,
Lei Wang,
Yinghuan Shi,
Xin Geng
Abstract:
Domain generalization (DG) has attracted much attention in person re-identification (ReID) recently. It aims to make a model trained on multiple source domains generalize to an unseen target domain. Although achieving promising progress, existing methods usually need the source domains to be labeled, which could be a significant burden for practical ReID tasks. In this paper, we turn to investigat…
▽ More
Domain generalization (DG) has attracted much attention in person re-identification (ReID) recently. It aims to make a model trained on multiple source domains generalize to an unseen target domain. Although achieving promising progress, existing methods usually need the source domains to be labeled, which could be a significant burden for practical ReID tasks. In this paper, we turn to investigate unsupervised domain generalization for ReID, by assuming that no label is available for any source domains.
To address this challenging setting, we propose a simple and efficient domain-specific adaptive framework, and realize it with an adaptive normalization module designed upon the batch and instance normalization techniques. In doing so, we successfully yield reliable pseudo-labels to implement training and also enhance the domain generalization capability of the model as required. In addition, we show that our framework can even be applied to improve person ReID under the settings of supervised domain generalization and unsupervised domain adaptation, demonstrating competitive performance with respect to relevant methods. Extensive experimental study on benchmark datasets is conducted to validate the proposed framework. A significance of our work lies in that it shows the potential of unsupervised domain generalization for person ReID and sets a strong baseline for the further research on this topic.
△ Less
Submitted 23 March, 2023; v1 submitted 29 November, 2021;
originally announced November 2021.
-
High Quality Segmentation for Ultra High-resolution Images
Authors:
Tiancheng Shen,
Yuechen Zhang,
Lu Qi,
Jason Kuen,
Xingyu Xie,
Jianlong Wu,
Zhe Lin,
Jiaya Jia
Abstract:
To segment 4K or 6K ultra high-resolution images needs extra computation consideration in image segmentation. Common strategies, such as down-sampling, patch crop**, and cascade model, cannot address well the balance issue between accuracy and computation cost. Motivated by the fact that humans distinguish among objects continuously from coarse to precise levels, we propose the Continuous Refine…
▽ More
To segment 4K or 6K ultra high-resolution images needs extra computation consideration in image segmentation. Common strategies, such as down-sampling, patch crop**, and cascade model, cannot address well the balance issue between accuracy and computation cost. Motivated by the fact that humans distinguish among objects continuously from coarse to precise levels, we propose the Continuous Refinement Model~(CRM) for the ultra high-resolution segmentation refinement task. CRM continuously aligns the feature map with the refinement target and aggregates features to reconstruct these images' details. Besides, our CRM shows its significant generalization ability to fill the resolution gap between low-resolution training images and ultra high-resolution testing ones. We present quantitative performance evaluation and visualization to show that our proposed method is fast and effective on image segmentation refinement. Code will be released at https://github.com/dvlab-research/Entity.
△ Less
Submitted 26 December, 2021; v1 submitted 29 November, 2021;
originally announced November 2021.
-
Crowdsourcing-based Multi-Device Communication Cooperation for Mobile High-Quality Video Enhancement
Authors:
Xiaotong Wu,
Lianyong Qi,
Xiaolong Xu,
Shui Yu,
Wanchun Dou,
Xuyun Zhang
Abstract:
The widespread use of mobile devices propels the development of new-fashioned video applications like 3D (3-Dimensional) stereo video and mobile cloud game via web or App, exerting more pressure on current mobile access network. To address this challenge, we adopt the crowdsourcing paradigm to offer some incentive for guiding the movement of recruited crowdsourcing users and facilitate the optimiz…
▽ More
The widespread use of mobile devices propels the development of new-fashioned video applications like 3D (3-Dimensional) stereo video and mobile cloud game via web or App, exerting more pressure on current mobile access network. To address this challenge, we adopt the crowdsourcing paradigm to offer some incentive for guiding the movement of recruited crowdsourcing users and facilitate the optimization of the movement control decision. In this paper, based on a practical 4G (4th-Generation) network throughput measurement study, we formulate the movement control decision as a cost-constrained user recruitment optimization problem. Considering the intractable complexity of this problem, we focus first on a single crowdsourcing user case and propose a pseudo-polynomial time complexity optimal solution. Then, we apply this solution to solve the more general problem of multiple users and propose a graph-partition-based algorithm. Extensive experiments show that our solutions can improve the efficiency of real-time D2D communication for mobile videos.
△ Less
Submitted 29 November, 2021;
originally announced November 2021.
-
Estimate of the Background and Sensitivity of theFollow-up X-ray Telescope onboard Einstein Probe
Authors:
Juan Zhang,
Liqiang Qi,
Yanji Yang,
Juan Wang,
Yuan Liu,
Weiwei Cui,
Donghua Zhao,
Shumie Jia,
Tianming Li,
Tianxiang Chen,
Gang Li,
Xiaofan Zhao,
Yong Chen,
Huaqiu Liu,
Congying Bao,
Ju Guan,
Liming Song,
Weimin Yuan
Abstract:
As a space X-ray imaging mission dedicated to time-domain astrophysics, the Einstein Probe (EP) carries two kinds of scientific payloads, the wide-field X-ray telescope (WXT) and the follow-up X-ray telescope (FXT). FXT utilizes Wolter-I type mirrors and the pn-CCD detectors. In this work, we investigate the in-orbit background of FXT based on Geant4 simulation. The impact of various space compone…
▽ More
As a space X-ray imaging mission dedicated to time-domain astrophysics, the Einstein Probe (EP) carries two kinds of scientific payloads, the wide-field X-ray telescope (WXT) and the follow-up X-ray telescope (FXT). FXT utilizes Wolter-I type mirrors and the pn-CCD detectors. In this work, we investigate the in-orbit background of FXT based on Geant4 simulation. The impact of various space components present in the EP orbital environment are considered, such as the cosmic photon background, cosmic ray primary and secondary particles (e.g. protons, electrons and positrons), albedo gamma rays, and the low-energy protons near the geomagnetic equator. The obtained instrumental background at 0.5-10 keV, which is mainly induced by cosmic ray protons and cosmic photon background, corresponds to a level of $\sim$3.1$\times$10$^{-2}$ counts s$^{-1}$ keV$^{-1}$ in the imaging area of the focal plane detector (FPD), i.e. 3.7$\times$10$^{-3}$ counts s$^{-1}$ keV$^{-1}$ cm$^{-2}$ after normalization. Compared with the instrumental background, the field of view (FOV) background, which is induced by cosmic photons reflected by the optical mirror, dominates below 2 keV. Based on the simulated background level within the focal spot (a 30$^{\prime\prime}$-radius circle), the sensitivity of FXT is calculated, which could theoretically achieve several $μ$crab (in the order of 10$^{-14}$ erg cm$^{-2}$ s$^{-1}$) in 0.5-2 keV and several tens of $μ$crab (in the order of 10$^{-13}$ erg cm$^{-2}$ s$^{-1}$) in 2-10 keV for a pointed observation with an exposure of 25 minutes. This sensitivity becomes worse by a factor of $\sim2$ if additional 10% systematic uncertainty of the background subtraction is included.
△ Less
Submitted 25 November, 2021;
originally announced November 2021.
-
Eigenvalues and Singular Values of Dual Quaternion Matrices
Authors:
Liqun Qi,
Ziyan Luo
Abstract:
The poses of $m$ robotics in $n$ time points may be represented by an $m \times n$ dual quaternion matrix. In this paper, we study the spectral theory of dual quaternion matrices. We introduce right and left eigenvalues for square dual quaternion matrices. If a right eigenvalue is a dual number, then it is also a left eigenvalue. In this case, this dual number is called an eigenvalue of that dual…
▽ More
The poses of $m$ robotics in $n$ time points may be represented by an $m \times n$ dual quaternion matrix. In this paper, we study the spectral theory of dual quaternion matrices. We introduce right and left eigenvalues for square dual quaternion matrices. If a right eigenvalue is a dual number, then it is also a left eigenvalue. In this case, this dual number is called an eigenvalue of that dual quaternion matrix. We show that the right eigenvalues of a dual quaternion Hermitian matrix are dual numbers. Thus, they are eigenvalues. An $n \times n$ dual quaternion Hermitian matrix is shown to have exactly $n$ eigenvalues. It is positive semidefinite, or positive definite, if and only if all of its eigenvalues are nonnegative, or positive and appreciable, dual numbers, respectively. We present a unitary decomposition of a dual quaternion Hermitian matrix, and the singular value decomposition for a general dual quaternion matrix. The singular values of a dual quaternion matrix are nonnegative dual numbers.
△ Less
Submitted 30 November, 2021; v1 submitted 23 November, 2021;
originally announced November 2021.
-
Dual Quaternions and Dual Quaternion Vectors
Authors:
Liqun Qi,
Chen Ling,
Hong Yan
Abstract:
We introduce a total order and the absolute value function for dual numbers. The absolute value function of dual numbers are with dual number values, and have properties similar to the properties of the absolute value function of real numbers. We define the magnitude of a dual quaternion, as a dual number. Based upon these, we extended $1$-norm, $\infty$-norm and $2$-norm to dual quaternion vector…
▽ More
We introduce a total order and the absolute value function for dual numbers. The absolute value function of dual numbers are with dual number values, and have properties similar to the properties of the absolute value function of real numbers. We define the magnitude of a dual quaternion, as a dual number. Based upon these, we extended $1$-norm, $\infty$-norm and $2$-norm to dual quaternion vectors.
△ Less
Submitted 23 November, 2021; v1 submitted 8 November, 2021;
originally announced November 2021.
-
SO{U}RCERER: Developer-Driven Security Testing Framework for Android Apps
Authors:
Muhammad Sajidur Rahman,
Blas Kojusner,
Ryon Kennedy,
Prerit Pathak,
Lin Qi,
Byron Williams
Abstract:
Frequently advised secure development recommendations often fall short in practice for app developers. Tool-driven (e.g., using static analysis tools) approaches lack context and domain-specific requirements of an app being tested. App developers struggle to find an actionable and prioritized list of vulnerabilities from a laundry list of security warnings reported by static analysis tools. Proces…
▽ More
Frequently advised secure development recommendations often fall short in practice for app developers. Tool-driven (e.g., using static analysis tools) approaches lack context and domain-specific requirements of an app being tested. App developers struggle to find an actionable and prioritized list of vulnerabilities from a laundry list of security warnings reported by static analysis tools. Process-driven (e.g., applying threat modeling methods) approaches require substantial resources (e.g., security testing team, budget) and security expertise, which small to medium-scale app dev teams could barely afford. To help app developers securing their apps, we propose SO{U}RCERER, a guiding framework for Android app developers for security testing. SO{U}RCERER guides developers to identify domain-specific assets of an app, detect and prioritize vulnerabilities, and mitigate those vulnerabilities based on secure development guidelines. We evaluated SO{U}RCERER with a case study on analyzing and testing 36 Android mobile money apps. We found that by following activities guided by SO{U}RCERER, an app developer could get a concise and actionable list of vulnerabilities (24-61% fewer security warnings produced by SO{U}RCERER than a standalone static analyzer), directly affecting a mobile money app's critical assets, and devise a mitigation plan. Our findings from this preliminary study indicate a viable approach to Android app security testing without being overwhelmingly complex for app developers.
△ Less
Submitted 2 November, 2021; v1 submitted 2 November, 2021;
originally announced November 2021.