-
Physics-Coupled Spatio-Temporal Active Learning for Dynamical Systems
Authors:
Yu Huang,
Yufei Tang,
Xingquan Zhu,
Min Shi,
Ali Muhamed Ali,
Hanqi Zhuang,
Laurent Cherubin
Abstract:
Spatio-temporal forecasting is of great importance in a wide range of dynamical systems applications from atmospheric science, to recent COVID-19 spread modeling. These applications rely on accurate predictions of spatio-temporal structured data reflecting real-world phenomena. A stunning characteristic is that the dynamical system is not only driven by some physics laws but also impacted by the l…
▽ More
Spatio-temporal forecasting is of great importance in a wide range of dynamical systems applications from atmospheric science, to recent COVID-19 spread modeling. These applications rely on accurate predictions of spatio-temporal structured data reflecting real-world phenomena. A stunning characteristic is that the dynamical system is not only driven by some physics laws but also impacted by the localized factor in spatial and temporal regions. One of the major challenges is to infer the underlying causes, which generate the perceived data stream and propagate the involved causal dynamics through the distributed observing units. Another challenge is that the success of machine learning based predictive models requires massive annotated data for model training. However, the acquisition of high-quality annotated data is objectively manual and tedious as it needs a considerable amount of human intervention, making it infeasible in fields that require high levels of expertise. To tackle these challenges, we advocate a spatio-temporal physics-coupled neural networks (ST-PCNN) model to learn the underlying physics of the dynamical system and further couple the learned physics to assist the learning of the recurring dynamics. To deal with data-acquisition constraints, an active learning mechanism with Kriging for actively acquiring the most informative data is proposed for ST-PCNN training in a partially observable environment. Our experiments on both synthetic and real-world datasets exhibit that the proposed ST-PCNN with active learning converges to near optimal accuracy with substantially fewer instances.
△ Less
Submitted 11 August, 2021;
originally announced August 2021.
-
Characterization of the boundedness of generalized fractional integral and maximal operators on Orlicz-Morrey and weak Orlicz-Morrey spaces
Authors:
Ryota Kawasumi,
Eiichi Nakai,
Minglei Shi
Abstract:
We give necessary and sufficient conditions for the boundedness of generalized fractional integral and maximal operators on Orlicz-Morrey and weak Orlicz-Morrey spaces. To do this we prove the weak-weak type modular inequality of the Hardy-Littlewood maximal operator with respect to the Young function. Orlicz-Morrey spaces contain $L^p$ spaces ($1\le p\le\infty$), Orlicz spaces and generalized Mor…
▽ More
We give necessary and sufficient conditions for the boundedness of generalized fractional integral and maximal operators on Orlicz-Morrey and weak Orlicz-Morrey spaces. To do this we prove the weak-weak type modular inequality of the Hardy-Littlewood maximal operator with respect to the Young function. Orlicz-Morrey spaces contain $L^p$ spaces ($1\le p\le\infty$), Orlicz spaces and generalized Morrey spaces as special cases. Hence we get necessary and sufficient conditions on these function spaces as corollaries.
△ Less
Submitted 22 July, 2021;
originally announced July 2021.
-
Evolution of transport properties in FeSe thin flakes with thickness approaching the two-dimensional limit
Authors:
C. S. Zhu,
B. Lei,
Z. L. Sun,
J. H. Cui,
M. Z. Shi,
W. Z. Zhuo,
X. G. Luo,
X. H. Chen
Abstract:
Electronic properties of FeSe can be tuned by various routes. Here, we present a comprehensive study on the evolution of the superconductivity and nematicity in FeSe with thickness from bulk single crystal down to bilayer ($\sim$ 1.1 nm) through exfoliation. With decreasing flake thickness, both the structural transition temperature $T_{\rm s}$ and the superconducting transition temperature…
▽ More
Electronic properties of FeSe can be tuned by various routes. Here, we present a comprehensive study on the evolution of the superconductivity and nematicity in FeSe with thickness from bulk single crystal down to bilayer ($\sim$ 1.1 nm) through exfoliation. With decreasing flake thickness, both the structural transition temperature $T_{\rm s}$ and the superconducting transition temperature $T_{\rm c}^{\rm zero}$ are greatly suppressed. The magnetic field ($B$) dependence of Hall resistance $R_{xy}$ at 15 K changes from $B$-nonlinear to $B$-linear behavior up to 9 T, as the thickness ($d$) is reduced to 13 nm. $T_{\rm c}$ is linearly dependent on the inverse of flake thickness (1/$d$) when $d\le$ 13 nm, and a clear drop of $T_{\rm c}$ appears with thickness smaller than 27 nm. The $I$-$V$ characteristic curves in ultrathin flakes reveal the signature of Berezinskii-Kosterlitz-Thouless (BKT) transition, indicating the presence of two-dimensional superconductivity. Anisotropic magnetoresistance measurements further support 2D superconductivity in few-layer FeSe. Increase of disorder scattering, anisotropic strains and dimensionality effect with reducing the thickness of FeSe flakes, might be taken into account for understanding these behaviors. Our study provides systematic insights into the evolution of the superconducting properties, structural transition and Hall resistance of a superconductor FeSe with flakes thickness and provides an effective way to find two-dimensional superconductivity as well as other 2D novel phenomena.
△ Less
Submitted 22 July, 2021;
originally announced July 2021.
-
Learning to Recommend Items to Wikidata Editors
Authors:
Kholoud AlGhamdi,
Miao**g Shi,
Elena Simperl
Abstract:
Wikidata is an open knowledge graph built by a global community of volunteers. As it advances in scale, it faces substantial challenges around editor engagement. These challenges are in terms of both attracting new editors to keep up with the sheer amount of work and retaining existing editors. Experience from other online communities and peer-production systems, including Wikipedia, suggests that…
▽ More
Wikidata is an open knowledge graph built by a global community of volunteers. As it advances in scale, it faces substantial challenges around editor engagement. These challenges are in terms of both attracting new editors to keep up with the sheer amount of work and retaining existing editors. Experience from other online communities and peer-production systems, including Wikipedia, suggests that personalised recommendations could help, especially newcomers, who are sometimes unsure about how to contribute best to an ongoing effort. For this reason, we propose a recommender system WikidataRec for Wikidata items. The system uses a hybrid of content-based and collaborative filtering techniques to rank items for editors relying on both item features and item-editor previous interaction. A neural network, named a neural mixture of representations, is designed to learn fine weights for the combination of item-based representations and optimize them with editor-based representation by item-editor interaction. To facilitate further research in this space, we also create two benchmark datasets, a general-purpose one with 220,000 editors responsible for 14 million interactions with 4 million items and a second one focusing on the contributions of more than 8,000 more active editors. We perform an offline evaluation of the system on both datasets with promising results. Our code and datasets are available at https://github.com/WikidataRec-developer/Wikidata_Recommender.
△ Less
Submitted 30 July, 2021; v1 submitted 13 July, 2021;
originally announced July 2021.
-
Observation of a singular Weyl point surrounded by charged nodal walls in PtGa
Authors:
J. -Z. Ma,
Q. -S. Wu,
M. Song,
S. -N. Zhang,
E. B. Guedes,
S. A. Ekahana,
M. Krivenkov,
M. Y. Yao,
S. -Y. Gao,
W. -H. Fan,
T. Qian,
H. Ding,
N. C. Plumb,
M. Radovic,
J. H. Dil,
Y. -M. Xiong,
K. Manna,
C. Felser,
O. V. Yazyev,
M. Shi
Abstract:
Constrained by the Nielsen-Ninomiya no-go theorem, in all so-far experimentally determined Weyl semimetals (WSMs) the Weyl points (WPs) always appear in pairs in the momentum space with no exception. As a consequence, Fermi arcs occur on surfaces which connect the projections of the WPs with opposite chiral charges. However, this situation can be circumvented in the case of unpaired WP, without re…
▽ More
Constrained by the Nielsen-Ninomiya no-go theorem, in all so-far experimentally determined Weyl semimetals (WSMs) the Weyl points (WPs) always appear in pairs in the momentum space with no exception. As a consequence, Fermi arcs occur on surfaces which connect the projections of the WPs with opposite chiral charges. However, this situation can be circumvented in the case of unpaired WP, without relevant surface Fermi arc connecting its surface projection, appearing singularly, while its Berry curvature field is absorbed by nontrivial charged nodal walls. Here, combining angle-resolved photoemission spectroscopy with density functional theory calculations, we show experimentally that a singular Weyl point emerges in PtGa at the center of the Brillouin zone (BZ), which is surrounded by closed Weyl nodal walls located at the BZ boundaries and there is no Fermi arc connecting its surface projection. Our results reveal that nontrivial band crossings of different dimensionalities can emerge concomitantly in condensed matter, while their coexistence ensures the net topological charge of different dimensional topological objects to be zero. Our observation extends the applicable range of the original Nielsen-Ninomiya no-go theorem which was derived from zero dimensional paired WPs with opposite chirality.
△ Less
Submitted 2 July, 2021;
originally announced July 2021.
-
Self-orthogonal codes over a non-unital ring and combinatorial matrices
Authors:
Minjia Shi,
Shukai Wang,
Jon-Lark Kim,
Patrick Solé
Abstract:
There is a local ring $E$ of order $4,$ without identity for the multiplication, defined by generators and relations as $E=\langle a,b \mid 2a=2b=0,\, a^2=a,\, b^2=b,\,ab=a,\, ba=b\rangle.$
We study a special construction of self-orthogonal codes over $E,$ based on combinatorial matrices related to two-class association schemes, Strongly Regular Graphs (SRG), and Doubly Regular Tournaments (DRT)…
▽ More
There is a local ring $E$ of order $4,$ without identity for the multiplication, defined by generators and relations as $E=\langle a,b \mid 2a=2b=0,\, a^2=a,\, b^2=b,\,ab=a,\, ba=b\rangle.$
We study a special construction of self-orthogonal codes over $E,$ based on combinatorial matrices related to two-class association schemes, Strongly Regular Graphs (SRG), and Doubly Regular Tournaments (DRT).
We construct quasi self-dual codes over $E,$ and Type IV codes, that is, quasi self-dual codes whose all codewords have even Hamming weight. All these codes can be represented as formally self-dual additive codes over $\F_4.$ The classical invariant theory bound for the weight enumerators of this class of codesimproves the known bound on the minimum distance of Type IV codes over $E.$
△ Less
Submitted 13 June, 2021;
originally announced June 2021.
-
Rich Nature of Van Hove Singularities in Kagome Superconductor CsV$_3$Sb$_5$
Authors:
Yong Hu,
Xianxin Wu,
Brenden R. Ortiz,
Sailong Ju,
Xinlong Han,
J. Z. Ma,
N. C. Plumb,
Milan Radovic,
Ronny Thomale,
S. D. Wilson,
Andreas P. Schnyder,
M. Shi
Abstract:
The recently discovered layered kagome metals AV$_3$Sb$_5$ (A=K, Rb, Cs) exhibit diverse correlated phenomena, which are intertwined with a topological electronic structure with multiple van Hove singularities (VHSs) in the vicinity of the Fermi level. As the VHSs with their large density of states enhance correlation effects, it is of crucial importance to determine their nature and properties. H…
▽ More
The recently discovered layered kagome metals AV$_3$Sb$_5$ (A=K, Rb, Cs) exhibit diverse correlated phenomena, which are intertwined with a topological electronic structure with multiple van Hove singularities (VHSs) in the vicinity of the Fermi level. As the VHSs with their large density of states enhance correlation effects, it is of crucial importance to determine their nature and properties. Here, we combine polarization-dependent angle-resolved photoemission spectroscopy with density functional theory to directly reveal the sublattice properties of 3d-orbital VHSs in CsV$_3$Sb$_5$. Four VHSs are identified around the M point and three of them are close to the Fermi level, with two having sublattice-pure and one sublattice-mixed nature. Remarkably, the VHS just below the Fermi level displays an extremely flat dispersion along MK, establishing the experimental discovery of higher-order VHS. The characteristic intensity modulation of Dirac cones around K further demonstrates the sublattice interference embedded in the electronic structure. The crucial insights into the electronic structure, revealed by our work, provide a solid starting point for the understanding of the intriguing correlation phenomena in the kagome metals AV$_3$Sb$_5$.
△ Less
Submitted 10 June, 2021;
originally announced June 2021.
-
Analysis of Magnetohydrodynamic Perturbations in Radial-field Solar Wind from Parker Solar Probe Observations
Authors:
S. Q. Zhao,
Huirong Yan,
Terry Z. Liu,
Mingzhe Liu,
Mijie Shi
Abstract:
We report analysis of sub-Alfvénic magnetohydrodynamic (MHD) perturbations in the low-\b{eta} radial-field solar wind using the Parker Solar Probe spacecraft data from 31 October to 12 November 2018. We calculate wave vectors using the singular value decomposition method and separate the MHD perturbations into three types of linear eigenmodes (Alfvén, fast, and slow modes) to explore the propertie…
▽ More
We report analysis of sub-Alfvénic magnetohydrodynamic (MHD) perturbations in the low-\b{eta} radial-field solar wind using the Parker Solar Probe spacecraft data from 31 October to 12 November 2018. We calculate wave vectors using the singular value decomposition method and separate the MHD perturbations into three types of linear eigenmodes (Alfvén, fast, and slow modes) to explore the properties of the sub-Alfvénic perturbations and the role of compressible perturbations in solar wind heating. The MHD perturbations there show a high degree of Alfvénicity in the radial-field solar wind, with the energy fraction of Alfvén modes dominating (~45%-83%) over those of fast modes (~16%-43%) and slow modes (~1%-19%). We present a detailed analysis of a representative event on 10 November 2018. Observations show that fast modes dominate magnetic compressibility, whereas slow modes dominate density compressibility. The energy dam** rate of compressible modes is comparable to the heating rate, suggesting the collisionless dam** of compressible modes could be significant for solar wind heating. These results are valuable for further studies of the imbalanced turbulence near the Sun and possible heating effects of compressible modes at MHD scales in low-\b{eta} plasma.
△ Less
Submitted 25 October, 2021; v1 submitted 7 June, 2021;
originally announced June 2021.
-
Distinct band reconstructions in kagome superconductor CsV$_3$Sb$_5$
Authors:
Yang Luo,
Shuting Peng,
Samuel M. L. Teicher,
Linwei Huai,
Yong Hu,
Brenden R. Ortiz,
Zhiyuan Wei,
Jianchang Shen,
Zhipeng Ou,
Bingqian Wang,
Yu Miao,
Mingyao Guo,
M. Shi,
Stephen D. Wilson,
J. -F. He
Abstract:
The new two-dimensional (2D) kagome superconductor CsV$_3$Sb$_5$ has attracted much recent attention due to the coexistence of superconductivity, charge order, topology and kagome physics. A key issue in this field is to unveil the unique reconstructed electronic structure, which successfully accommodates different orders and interactions to form a fertile ground for emergent phenomena. Here, we r…
▽ More
The new two-dimensional (2D) kagome superconductor CsV$_3$Sb$_5$ has attracted much recent attention due to the coexistence of superconductivity, charge order, topology and kagome physics. A key issue in this field is to unveil the unique reconstructed electronic structure, which successfully accommodates different orders and interactions to form a fertile ground for emergent phenomena. Here, we report angle-resolved photoemission spectroscopy (ARPES) evidence for two distinct band reconstructions in CsV$_3$Sb$_5$. The first one is characterized by the appearance of new electron energy band at low temperature. The new band is theoretically reproduced when the three dimensionality of the charge order is considered for a band-folding along the out-of-plane direction. The second reconstruction is identified as a surface induced orbital-selective shift of the electron energy band. Our results provide the first evidence for the three dimensionality of the charge order in single-particle spectral function, highlighting the importance of long-range out-of-plane electronic correlations in this layered kagome superconductor. They also point to the feasibility of orbital-selective control of the band structure via surface modification, which would open a new avenue for manipulating exotic phenomena in this system, including superconductivity.
△ Less
Submitted 2 June, 2021;
originally announced June 2021.
-
Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report
Authors:
Andrey Ignatov,
Grigory Malivenko,
David Plowman,
Samarth Shukla,
Radu Timofte,
Ziyu Zhang,
Yicheng Wang,
Zilong Huang,
Guozhong Luo,
Gang Yu,
Bin Fu,
Yiran Wang,
Xingyi Li,
Min Shi,
Ke Xian,
Zhiguo Cao,
**-Hua Du,
Pei-Lin Wu,
Chao Ge,
Jiaoyang Yao,
Fangwen Tu,
Bo Li,
Jung Eun Yoo,
Kwanggyoon Seo,
Jialei Xu
, et al. (13 additional authors not shown)
Abstract:
Depth estimation is an important computer vision problem with many practical applications to mobile devices. While many solutions have been proposed for this task, they are usually very computationally expensive and thus are not applicable for on-device inference. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based d…
▽ More
Depth estimation is an important computer vision problem with many practical applications to mobile devices. While many solutions have been proposed for this task, they are usually very computationally expensive and thus are not applicable for on-device inference. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based depth estimation solutions that can demonstrate a nearly real-time performance on smartphones and IoT platforms. For this, the participants were provided with a new large-scale dataset containing RGB-depth image pairs obtained with a dedicated stereo ZED camera producing high-resolution depth maps for objects located at up to 50 meters. The runtime of all models was evaluated on the popular Raspberry Pi 4 platform with a mobile ARM-based Broadcom chipset. The proposed solutions can generate VGA resolution depth maps at up to 10 FPS on the Raspberry Pi 4 while achieving high fidelity results, and are compatible with any Android or Linux-based mobile devices. A detailed description of all models developed in the challenge is provided in this paper.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
Designs, permutations, and transitive groups
Authors:
Minjia Shi,
XiaoXiao Li,
Patrick Solé
Abstract:
A notion of $t$-designs in the symmetric group on $n$ letters was introduced by Godsil in 1988. In particular $t$-transitive sets of permutations form a $t$-design. We derive special lower bounds for $t=1$ and $t=2$ by a power moment method. For general $n,t$ we give a %linear programming lower bound . For $n\ge 4$ and $t=2,$ this bound is strong enough to show a lower bound on the size of such…
▽ More
A notion of $t$-designs in the symmetric group on $n$ letters was introduced by Godsil in 1988. In particular $t$-transitive sets of permutations form a $t$-design. We derive special lower bounds for $t=1$ and $t=2$ by a power moment method. For general $n,t$ we give a %linear programming lower bound . For $n\ge 4$ and $t=2,$ this bound is strong enough to show a lower bound on the size of such $t$-designs of $n(n-1)\dots (n-t+1),$ which is best possible when sharply $t$-transitive sets of permutations exist. This shows, in particular, that tight $2$-designs do not exist.
△ Less
Submitted 24 June, 2023; v1 submitted 17 May, 2021;
originally announced May 2021.
-
Nodeless superconductivity in the centro- and noncentrosymmetric rhenium-boron superconductors
Authors:
T. Shang,
W. Xie,
J. Z. Zhao,
Y. Chen,
D. J. Gawryluk,
M. Medarde,
M. Shi,
H. Q. Yuan,
E. Pomjakushina,
T. Shiroka
Abstract:
We report a comprehensive study of the centrosymmetric Re$_3$B and noncentrosymmetric Re$_7$B$_3$ superconductors. At a macroscopic level, their bulk superconductivity (SC), with $T_c$ = 5.1 K (Re$_3$B) and 3.3 K (Re$_7$B$_3$), was characterized via electrical-resistivity, magnetization, and heat-capacity measurements, while their microscopic superconducting properties were investigated by means o…
▽ More
We report a comprehensive study of the centrosymmetric Re$_3$B and noncentrosymmetric Re$_7$B$_3$ superconductors. At a macroscopic level, their bulk superconductivity (SC), with $T_c$ = 5.1 K (Re$_3$B) and 3.3 K (Re$_7$B$_3$), was characterized via electrical-resistivity, magnetization, and heat-capacity measurements, while their microscopic superconducting properties were investigated by means of muon-spin rotation/relaxation ($μ$SR). In both Re$_3$B and Re$_7$B$_3$ the low-$T$ zero-field electronic specific heat and the superfluid density (determined via tranverse-field $μ$SR) suggest a nodeless SC. Both compounds exhibit some features of multigap SC, as evidenced by temperature-dependent upper critical fields $H_\mathrm{c2}(T)$, as well as by electronic band-structure calculations. The absence of spontaneous magnetic fields below the onset of SC, as determined from zero-field $μ$SR measurements, indicates a preserved time-reversal symmetry in the superconducting state of both Re$_3$B and Re$_7$B$_3$. Our results suggest that a lack of inversion symmetry and the accompanying antisymmetric spin-orbit coupling effects are not essential for the occurrence of multigap SC in these rhenium-boron compounds.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
A programmable $k\cdot p$ Hamiltonian method and application to magnetic topological insulator MnBi$_2$Te$_4$
Authors:
Guohui Zhan,
Minji Shi,
Zhilong Yang,
Haijun Zhang
Abstract:
In the band theory, first-principles calculations, the tight-binding method and the effective $k\cdot p$ model are usually employed to investigate the electronic structure of condensed matters. The effective $k\cdot p$ model has a compact form with a clear physical picture, and first-principles calculations can give more accurate results. Nowadays, it has been widely recognized to combine the…
▽ More
In the band theory, first-principles calculations, the tight-binding method and the effective $k\cdot p$ model are usually employed to investigate the electronic structure of condensed matters. The effective $k\cdot p$ model has a compact form with a clear physical picture, and first-principles calculations can give more accurate results. Nowadays, it has been widely recognized to combine the $k\cdot p$ model and first-principles calculations to explore topological materials. However, the traditional method to derive the $k\cdot p$ Hamiltonian is complicated and time-consuming by hand. In this work, we independently develop a programmable algorithm to construct effective $k\cdot p$ Hamiltonians. Symmetries and orbitals are used as the input information to produce the one-/two-/three-dimension $k\cdot p$ Hamiltonian in our method, and the open-source code can be directly downloaded online. At last, we also demonstrate the application to MnBi$_2$Te$_4$-family magnetic topological materials.
△ Less
Submitted 8 May, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.
-
Charge-order-assisted topological surface states and flat bands in the kagome superconductor CsV$_3$Sb$_5$
Authors:
Yong Hu,
Samuel M. L. Teicher,
Brenden R. Ortiz,
Yang Luo,
Shuting Peng,
Linwei Huai,
J. Z. Ma,
N. C. Plumb,
Stephen D. Wilson,
J. -F. He,
M. Shi
Abstract:
The diversity of emergent phenomena in quantum materials often arises from the interplay between different physical energy scales or broken symmetries. Cooperative interactions among them are rare; however, when they do occur, they often stabilize fundamentally new ground states or phase behaviors. For instance, a pair density wave can form when the superconducting order parameter borrows spatial…
▽ More
The diversity of emergent phenomena in quantum materials often arises from the interplay between different physical energy scales or broken symmetries. Cooperative interactions among them are rare; however, when they do occur, they often stabilize fundamentally new ground states or phase behaviors. For instance, a pair density wave can form when the superconducting order parameter borrows spatial periodical variation from charge order; a topological superconductor can arise when topologically nontrivial electronic states proximitize with or participate in the formation of the superconducting condensate. Here, we report spectroscopic evidence for a unique synergy of topology and correlation effects in the kagome superconductor CsV$_3$Sb$_5$ - one where topologically nontrivial surface states are pushed below the Fermi energy (E$_F$) by charge order, making the topological physics active near E$_F$ upon entering the superconducting state. Flat bands are observed, indicating that electron correlation effects are also at play in this system. Our results reveal the peculiar electronic structure of CsV$_3$Sb$_5$, which holds the potential for realizing Majorana zero modes and anomalous superconducting states in kagome lattices. They also establish CsV$_3$Sb$_5$ as a unique platform for exploring the cooperation between the charge order, topology, correlation effects and superconductivity.
△ Less
Submitted 26 April, 2021;
originally announced April 2021.
-
Real-time Forecast Models for TBM Load Parameters Based on Machine Learning Methods
Authors:
Xianjie Gao,
Xueguan Song,
Maolin Shi,
Chao Zhang,
Hongwei Zhang
Abstract:
Because of the fast advance rate and the improved personnel safety, tunnel boring machines (TBMs) have been widely used in a variety of tunnel construction projects. The dynamic modeling of TBM load parameters (including torque, advance rate and thrust) plays an essential part in the design, safe operation and fault prognostics of this complex engineering system. In this paper, based on in-situ TB…
▽ More
Because of the fast advance rate and the improved personnel safety, tunnel boring machines (TBMs) have been widely used in a variety of tunnel construction projects. The dynamic modeling of TBM load parameters (including torque, advance rate and thrust) plays an essential part in the design, safe operation and fault prognostics of this complex engineering system. In this paper, based on in-situ TBM operational data, we use the machine-learning (ML) methods to build the real-time forecast models for TBM load parameters, which can instantaneously provide the future values of the TBM load parameters as long as the current data are collected. To decrease the model complexity and improve the generalization, we also apply the least absolute shrinkage and selection (Lasso) method to extract the essential features of the forecast task. The experimental results show that the forecast models based on deep-learning methods, {\it e.g.}, recurrent neural network and its variants, outperform the ones based on the shallow-learning methods, {\it e.g.}, support vector regression and random forest. Moreover, the Lasso-based feature extraction significantly improves the performance of the resultant models.
△ Less
Submitted 12 April, 2021;
originally announced April 2021.
-
Deep Attributed Network Representation Learning via Attribute Enhanced Neighborhood
Authors:
Cong Li,
Min Shi,
Bo Qu,
Xiang Li
Abstract:
Attributed network representation learning aims at learning node embeddings by integrating network structure and attribute information. It is a challenge to fully capture the microscopic structure and the attribute semantics simultaneously, where the microscopic structure includes the one-step, two-step and multi-step relations, indicating the first-order, second-order and high-order proximity of…
▽ More
Attributed network representation learning aims at learning node embeddings by integrating network structure and attribute information. It is a challenge to fully capture the microscopic structure and the attribute semantics simultaneously, where the microscopic structure includes the one-step, two-step and multi-step relations, indicating the first-order, second-order and high-order proximity of nodes, respectively. In this paper, we propose a deep attributed network representation learning via attribute enhanced neighborhood (DANRL-ANE) model to improve the robustness and effectiveness of node representations. The DANRL-ANE model adopts the idea of the autoencoder, and expands the decoder component to three branches to capture different order proximity. We linearly combine the adjacency matrix with the attribute similarity matrix as the input of our model, where the attribute similarity matrix is calculated by the cosine similarity between the attributes based on the social homophily. In this way, we preserve the second-order proximity to enhance the robustness of DANRL-ANE model on sparse networks, and deal with the topological and attribute information simultaneously. Moreover, the sigmoid cross-entropy loss function is extended to capture the neighborhood character, so that the first-order proximity is better preserved. We compare our model with the state-of-the-art models on five real-world datasets and two network analysis tasks, i.e., link prediction and node classification. The DANRL-ANE model performs well on various networks, even on sparse networks or networks with isolated nodes given the attribute information is sufficient.
△ Less
Submitted 12 April, 2021;
originally announced April 2021.
-
Discovery of Ĉ$_2$ rotation anomaly in topological crystalline insulator SrPb
Authors:
Wenhui Fan,
Simin Nie,
Cuixiang Wang,
Binbin Fu,
Changjiang Yi,
Shunye Gao,
Zhicheng Rao,
Dayu Yan,
Junzhang Ma,
Ming Shi,
Yaobo Huang,
Youguo Shi,
Zhijun Wang,
Tian Qian,
Hong Ding
Abstract:
Topological crystalline insulators (TCIs) are insulating electronic states with nontrivial topology protected by crystalline symmetries. Recently, theory has proposed new classes of TCIs protected by rotation symmetries Ĉ$_n$, which have surface rotation anomaly evading the fermion doubling theorem, i.e. n instead of 2n Dirac cones on the surface preserving the rotation symmetry. Here, we report t…
▽ More
Topological crystalline insulators (TCIs) are insulating electronic states with nontrivial topology protected by crystalline symmetries. Recently, theory has proposed new classes of TCIs protected by rotation symmetries Ĉ$_n$, which have surface rotation anomaly evading the fermion doubling theorem, i.e. n instead of 2n Dirac cones on the surface preserving the rotation symmetry. Here, we report the first realization of the Ĉ$_2$ rotation anomaly in a binary compound SrPb. Our first-principles calculations reveal two massless Dirac fermions protected by the combination of time-reversal symmetry T and Ĉ$_{2y}$ on the (010) surface. Using angle-resolved photoemission spectroscopy, we identify two Dirac surface states inside the bulk band gap of SrPb, confirming the Ĉ$_2$ rotation anomaly in the new classes of TCIs. The findings enrich the classification of topological phases, which pave the way for exploring exotic behaviour of the new classes of TCIs.
△ Less
Submitted 10 April, 2021;
originally announced April 2021.
-
Quantized State Feedback Stabilization of Nonlinear Systems under Denial-of-Service
Authors:
Mingming Shi,
Shuai Feng,
Hideaki Ishii
Abstract:
This paper studies the resilient control of networked systems in the presence of cyber attacks. In particular, we consider the state feedback stabilization problem for nonlinear systems when the state measurement is sent to the controller via a communication channel that only has a finite transmitting rate and is moreover subject to cyber attacks in the form of Denial-of-Service (DoS). We use a dy…
▽ More
This paper studies the resilient control of networked systems in the presence of cyber attacks. In particular, we consider the state feedback stabilization problem for nonlinear systems when the state measurement is sent to the controller via a communication channel that only has a finite transmitting rate and is moreover subject to cyber attacks in the form of Denial-of-Service (DoS). We use a dynamic quantization method to update the quantization range of the encoder/decoder and characterize the number of bits for quantization needed to stabilize the system under a given level of DoS attacks in terms of duration and frequency. Our theoretical result shows that under DoS attacks, the required data bits to stabilize nonlinear systems by state feedback control are larger than those without DoS since the communication interruption induced by DoS makes the quantization uncertainty expand more between two successful transmissions. Even so, in the simulation, we show that the actual quantization bits can be much smaller than the theoretical value.
△ Less
Submitted 9 April, 2021;
originally announced April 2021.
-
Are energy savings the only reason for the emergence of bird echelon formation?
Authors:
Mingming Shi,
Julien M. Hendrickx
Abstract:
We analyze the conditions under which the emergence of frequently observed echelon formation can be explained solely by the maximization of energy savings. We consider a two-dimensional multi-agent echelon formation, where each agent receives a benefit that depends on its position relative to the others, and adjusts its position to increase this benefit. We analyze the selfish case where each agen…
▽ More
We analyze the conditions under which the emergence of frequently observed echelon formation can be explained solely by the maximization of energy savings. We consider a two-dimensional multi-agent echelon formation, where each agent receives a benefit that depends on its position relative to the others, and adjusts its position to increase this benefit. We analyze the selfish case where each agent maximizes its own benefit, leading to a Nash-equilibrium problem, and the collaborative case in which agents maximize the global benefit of the group. We provide conditions on the benefit function under which the frequently observed echelon formations cannot be Nash equilbriums or group optimums.
We then show that these conditions are satisfied by the conventionally used fixed-wing wake benefit model. This implies that energy saving alone is not sufficient to explain the emergence of the migratory formations observed, based on the fixed-wing model. Hence, either non-aerodynamic aspects or a more accurate model of bird dynamics should be considered to construct such formations.
△ Less
Submitted 24 March, 2021;
originally announced March 2021.
-
Gradient Policy on "CartPole" game and its' expansibility to F1Tenth Autonomous Vehicles
Authors:
Mingwei Shi
Abstract:
Policy gradient is an effective way to estimate continuous action on the environment. This paper, it about explaining the mathematical formula and code implementation. In the end, comparing between the rotation angle of the stick on CartPole , and the angle of the Autonomous vehicle when turning, and utilizing the Bicycle Model, a simple Kinematic dynamic model, are the purpose to discover the sim…
▽ More
Policy gradient is an effective way to estimate continuous action on the environment. This paper, it about explaining the mathematical formula and code implementation. In the end, comparing between the rotation angle of the stick on CartPole , and the angle of the Autonomous vehicle when turning, and utilizing the Bicycle Model, a simple Kinematic dynamic model, are the purpose to discover the similarity between these two models, so as to facilitate the model transfer from CartPole to the F1tenth Autonomous vehicle.
△ Less
Submitted 15 March, 2021;
originally announced March 2021.
-
Time-reversal symmetry breaking driven topological phase transition in EuB$_6$
Authors:
Shun-Ye Gao,
Sheng Xu,
Hang Li,
Chang-Jiang Yi,
Si-Min Nie,
Zhi- Cheng Rao,
Huan Wang,
Quan-Xin Hu,
Xue-Zhi Chen,
Wen-Hui Fan,
Jie- Rui Huang,
Yao-Bo Huang,
Nini Pryds,
Ming Shi,
Zhi-Jun Wang,
You-Guo Shi,
Tian-Long Xia,
Tian Qian,
Hong Ding
Abstract:
The interplay between time-reversal symmetry (TRS) and band topology plays a crucial role in topological states of quantum matter. In time-reversal-invariant (TRI) systems, the inversion of spin-degenerate bands with opposite parity leads to nontrivial topological states, such as topological insulators and Dirac semimetals. When the TRS is broken, the exchange field induces spin splitting of the b…
▽ More
The interplay between time-reversal symmetry (TRS) and band topology plays a crucial role in topological states of quantum matter. In time-reversal-invariant (TRI) systems, the inversion of spin-degenerate bands with opposite parity leads to nontrivial topological states, such as topological insulators and Dirac semimetals. When the TRS is broken, the exchange field induces spin splitting of the bands. The inversion of a pair of spin-splitting subbands can generate more exotic topological states, such as quantum anomalous Hall insulators and magnetic Weyl semimetals. So far, such topological phase transitions driven by the TRS breaking have not been visualized. In this work, using angle-resolved photoemission spectroscopy, we have demonstrated that the TRS breaking induces a band inversion of a pair of spin-splitting subbands at the TRI points of Brillouin zone in EuB$_6$, when a long-range ferromagnetic order is developed. The dramatic changes in the electronic structure result in a topological phase transition from a TRI ordinary insulator state to a TRS-broken topological semimetal (TSM) state. Remarkably, the magnetic TSM state has an ideal electronic structure, in which the band crossings are located at the Fermi level without any interference from other bands. Our findings not only reveal the topological phase transition driven by the TRS breaking, but also provide an excellent platform to explore novel physical behavior in the magnetic topological states of quantum matter.
△ Less
Submitted 8 March, 2021;
originally announced March 2021.
-
LCD Codes from tridiagonal Toeplitz matrice
Authors:
Minjia Shi,
Ferruh Özbudak,
Li Xu,
Patrick Solé
Abstract:
Double Toeplitz (DT) codes are codes with a generator matrix of the form $(I,T)$ with $T$ a Toeplitz matrix, that is to say constant on the diagonals parallel to the main. When $T$ is tridiagonal and symmetric we determine its spectrum explicitly by using Dickson polynomials, and deduce from there conditions for the code to be LCD. Using a special concatenation process, we construct optimal or qua…
▽ More
Double Toeplitz (DT) codes are codes with a generator matrix of the form $(I,T)$ with $T$ a Toeplitz matrix, that is to say constant on the diagonals parallel to the main. When $T$ is tridiagonal and symmetric we determine its spectrum explicitly by using Dickson polynomials, and deduce from there conditions for the code to be LCD. Using a special concatenation process, we construct optimal or quasi-optimal examples of binary and ternary LCD codes from DT codes over extension fields.
△ Less
Submitted 7 March, 2021;
originally announced March 2021.
-
Dynamic Offloading Loading Optimization in distributed Fault Diagnosis system with Deep Reinforcement Learning Approach
Authors:
Liang Yu,
Qixin Guo,
Rui Wang,
Minyan Shi,
Fucheng Yan,
Ran Wang
Abstract:
Artificial intelligence and distributed algorithms have been widely used in mechanical fault diagnosis with the explosive growth of diagnostic data. A novel intelligent fault diagnosis system framework that allows intelligent terminals to offload computational tasks to Mobile edge computing (MEC) servers is provided in this paper, which can effectively address the problems of task processing delay…
▽ More
Artificial intelligence and distributed algorithms have been widely used in mechanical fault diagnosis with the explosive growth of diagnostic data. A novel intelligent fault diagnosis system framework that allows intelligent terminals to offload computational tasks to Mobile edge computing (MEC) servers is provided in this paper, which can effectively address the problems of task processing delays and enhanced computational complexity. As the resources at the MEC and intelligent terminals are limited, performing reasonable resource allocation optimization can improve the performance, especially for a multi-terminals offloading system. In this study, to minimize the task computation delay, we jointly optimize the local content splitting ratio, the transmission/computation power allocation, and the MEC server selection under a dynamic environment with stochastic task arrivals. The challenging dynamic joint optimization problem is formulated as a reinforcement learning (RL) problem, which is designed as the computational offloading policies to minimize the long-term average delay cost. Two deep RL strategies, deep Q-learning network (DQN) and deep deterministic policy gradient (DDPG), are adopted to learn the computational offloading policies adaptively and efficiently. The proposed DQN strategy takes the MEC selection as a unique action while using the convex optimization approach to obtain the local content splitting ratio and the transmission/computation power allocation. Simultaneously, the actions of the DDPG strategy are selected as all dynamic variables, including the local content splitting ratio, the transmission/computation power allocation, and the MEC server selection. Numerical results demonstrate that both proposed strategies perform better than the traditional non-learning schemes.
△ Less
Submitted 15 February, 2023; v1 submitted 2 March, 2021;
originally announced March 2021.
-
On the Numerical Performance of Derivative-Free Optimization Methods Based on Finite-Difference Approximations
Authors:
Hao-Jun Michael Shi,
Melody Qiming Xuan,
Figen Oztoprak,
Jorge Nocedal
Abstract:
The goal of this paper is to investigate an approach for derivative-free optimization that has not received sufficient attention in the literature and is yet one of the simplest to implement and parallelize. It consists of computing gradients of a smoothed approximation of the objective function (and constraints), and employing them within established codes. These gradient approximations are calcu…
▽ More
The goal of this paper is to investigate an approach for derivative-free optimization that has not received sufficient attention in the literature and is yet one of the simplest to implement and parallelize. It consists of computing gradients of a smoothed approximation of the objective function (and constraints), and employing them within established codes. These gradient approximations are calculated by finite differences, with a differencing interval determined by the noise level in the functions and a bound on the second or third derivatives. It is assumed that noise level is known or can be estimated by means of difference tables or sampling. The use of finite differences has been largely dismissed in the derivative-free optimization literature as too expensive in terms of function evaluations and/or as impractical when the objective function contains noise. The test results presented in this paper suggest that such views should be re-examined and that the finite-difference approach has much to be recommended. The tests compared NEWUOA, DFO-LS and COBYLA against the finite-difference approach on three classes of problems: general unconstrained problems, nonlinear least squares, and general nonlinear programs with equality constraints.
△ Less
Submitted 19 February, 2021;
originally announced February 2021.
-
On isodual double Toeplitz codes
Authors:
Minjia Shi,
Li Xu,
Patrick Solé
Abstract:
Double Toeplitz (shortly DT) codes are introduced here as a generalization of double circulant codes. We show that such a code is isodual, hence formally self-dual. Self-dual DT codes are characterized as double circulant or double negacirculant. Likewise, even DT binary codes are characterized as double circulants. Numerical examples obtained by exhaustive search show that the codes constructed h…
▽ More
Double Toeplitz (shortly DT) codes are introduced here as a generalization of double circulant codes. We show that such a code is isodual, hence formally self-dual. Self-dual DT codes are characterized as double circulant or double negacirculant. Likewise, even DT binary codes are characterized as double circulants. Numerical examples obtained by exhaustive search show that the codes constructed have best-known minimum distance, up to one unit, amongst formally self-dual codes, and sometimes improve on the known values. Over $\F_4$ an explicit construction of DT codes, based on quadratic residues in a prime field, performs equally well. We show that DT codes are asymptotically good over $\F_q$. Specifically, we construct DT codes arbitrarily close to the asymptotic varshamov-Gilbert bound for codes of rate one half.
△ Less
Submitted 18 February, 2021;
originally announced February 2021.
-
Consistent Right-Invariant Fixed-Lag Smoother with Application to Visual Inertial SLAM
Authors:
Jianzhu Huai,
Yukai Lin,
Yuan Zhuang,
Min Shi
Abstract:
State estimation problems without absolute position measurements routinely arise in navigation of unmanned aerial vehicles, autonomous ground vehicles, etc., whose proper operation relies on accurate state estimates and reliable covariances. Unaware of absolute positions, these problems have immanent unobservable directions. Traditional causal estimators, however, usually gain spurious information…
▽ More
State estimation problems without absolute position measurements routinely arise in navigation of unmanned aerial vehicles, autonomous ground vehicles, etc., whose proper operation relies on accurate state estimates and reliable covariances. Unaware of absolute positions, these problems have immanent unobservable directions. Traditional causal estimators, however, usually gain spurious information on the unobservable directions, leading to over-confident covariance inconsistent with actual estimator errors. The consistency problem of fixed-lag smoothers (FLSs) has only been attacked by the first estimate Jacobian (FEJ) technique because of the complexity to analyze their observability property. But the FEJ has several drawbacks hampering its wide adoption. To ensure the consistency of a FLS, this paper introduces the right invariant error formulation into the FLS framework. To our knowledge, we are the first to analyze the observability of a FLS with the right invariant error. Our main contributions are twofold. As the first novelty, to bypass the complexity of analysis with the classic observability matrix, we show that observability analysis of FLSs can be done equivalently on the linearized system. Second, we prove that the inconsistency issue in the traditional FLS can be elegantly solved by the right invariant error formulation without artificially correcting Jacobians. By applying the proposed FLS to the monocular visual inertial simultaneous localization and map** (SLAM) problem, we confirm that the method consistently estimates covariance similarly to a batch smoother in simulation and that our method achieved comparable accuracy as traditional FLSs on real data.
△ Less
Submitted 21 March, 2021; v1 submitted 17 February, 2021;
originally announced February 2021.
-
Designs in finite metric spaces: a probabilistic approach
Authors:
Minjia Shi,
Olivier Rioul,
Patrick Solé
Abstract:
A finite metric space is called here distance degree regular if its distance degree sequence is the same for every vertex. A notion of designs in such spaces is introduced that generalizes that of designs in $Q$-polynomial distance-regular graphs. An approximation of their cumulative distribution function, based on the notion of Christoffel function in approximation theory is given. As an applicat…
▽ More
A finite metric space is called here distance degree regular if its distance degree sequence is the same for every vertex. A notion of designs in such spaces is introduced that generalizes that of designs in $Q$-polynomial distance-regular graphs. An approximation of their cumulative distribution function, based on the notion of Christoffel function in approximation theory is given. As an application we derive limit laws on the weight distributions of binary orthogonal arrays of strength going to infinity. An analogous result for combinatorial designs of strength going to infinity is given.
△ Less
Submitted 16 February, 2021;
originally announced February 2021.
-
Unexpected Suppression of Leidenfrost Phenomenon on Superhydrophobic Surfaces
Authors:
Meng Shi,
Ratul Das,
Sankara Arunachalam,
Himanshu Mishra
Abstract:
The Leidenfrost phenomenon entails the levitation of a liquid droplet over a superheated surface, cushioned by its vapor layer. For water, superhydrophobic surfaces are believed to suppress the Leidenfrost point ($\it{T}$$_{\rm L}$)-the temperature at which this phenomenon occurs. The vapor film obstructs boiling heat transfer in heat exchangers, thereby compromising energy efficiency and safety.…
▽ More
The Leidenfrost phenomenon entails the levitation of a liquid droplet over a superheated surface, cushioned by its vapor layer. For water, superhydrophobic surfaces are believed to suppress the Leidenfrost point ($\it{T}$$_{\rm L}$)-the temperature at which this phenomenon occurs. The vapor film obstructs boiling heat transfer in heat exchangers, thereby compromising energy efficiency and safety. Thus, it is desirable to realize superhydrophobicity without suppressing $\it{T}$$_{\rm L}$. Here we demonstrate that the $\it{T}$$_{\rm L}$ of water on microtextured superhydrophobic surfaces comprising doubly reentrant pillars (DRPs) can exceed those on hydrophilic and even superhydrophilic surfaces. We disentangle the contributions of microtexture, heat transfer, and surface chemistry on $\it{T}$$_{\rm L}$ and reveal how superhydrophobicity can be realized without suppressing $\it{T}$$_{\rm L}$. For instance, silica surfaces with DRPs facilitate ~300% greater heat transfer to water droplets at 200$^{\circ}$C in comparison with silica surfaces coated with perfluorinated-nanoparticles. Thus, superhydrophobic surfaces could be harnessed for energy efficient thermal machinery.
△ Less
Submitted 4 February, 2021;
originally announced February 2021.
-
Zero sum sets in abelian groups
Authors:
Minjia Shi,
Denis S. Krotov,
Xiaoxiao Li,
Patrick Solé
Abstract:
The distribution of cardinalities of zero-sum sets in abelian groups is completely determined. A complex summation involving the Möbius function is given for the general abelian group, while in many special cases, including the case of elementary abelian groups, solved earlier by Li and Wan, it has a compact form. The proof involves two different Möbius transforms, on positive integers and on set…
▽ More
The distribution of cardinalities of zero-sum sets in abelian groups is completely determined. A complex summation involving the Möbius function is given for the general abelian group, while in many special cases, including the case of elementary abelian groups, solved earlier by Li and Wan, it has a compact form. The proof involves two different Möbius transforms, on positive integers and on set partitions.
△ Less
Submitted 7 February, 2021; v1 submitted 29 January, 2021;
originally announced February 2021.
-
Superconductivity at 40 K in lithiation-processed [(Fe,Al)(OH)2][FeSe]1.2 with a layered structure
Authors:
Guobing Hu,
Mengzhu Shi,
Wenxiang Wang,
Changsheng Zhu,
Zeliang Sun,
Jianhua Cui,
Weizhuang Zhuo,
Fanghang Yu,
Xigang Luo,
Xianhui Chen
Abstract:
Exploration of new superconductors has always been one of the research directions in condensed matter physics. We report here a new layered heterostructure of [(Fe,Al)(OH)2][FeSe]1.2, which is synthesized by the hydrothermal ion-exchange technique. The structure is suggested by a combination of X-ray powder diffraction and the electron diffraction (ED). [(Fe,Al)(OH)2][FeSe]1.2 is composed of the a…
▽ More
Exploration of new superconductors has always been one of the research directions in condensed matter physics. We report here a new layered heterostructure of [(Fe,Al)(OH)2][FeSe]1.2, which is synthesized by the hydrothermal ion-exchange technique. The structure is suggested by a combination of X-ray powder diffraction and the electron diffraction (ED). [(Fe,Al)(OH)2][FeSe]1.2 is composed of the alternating stacking of tetragonal FeSe layer and hexagonal (Fe,Al)(OH)2 layer. In [(Fe,Al)(OH)2][FeSe]1.2, there exists mismatch between the FeSe sub-layer and (Fe,Al)(OH)2 sub-layer, and the lattice of the layered heterostructure is quasi-commensurate. The as-synthesized [(Fe,Al)(OH)2][FeSe]1.2 is non-superconducting due to the Fe vacancies in the FeSe layer. The superconductivity with a Tc of 40 K can be achieved after a lithiation process, which is due to the elimination of the Fe vacancies in the FeSe layer. The Tc is nearly the same as that of (Li,Fe)OHFeSe although the structure of [(Fe,Al)(OH)2][FeSe]1.2 is quite different from that of (Li,Fe)OHFeSe. The new layered heterostructure of [(Fe,Al)(OH)2][FeSe]1.2 contains an iron selenium tetragonal lattice interleaved with a hexagonal metal hydroxide lattice. These results indicate that the superconductivity is very robust for FeSe-based superconductors. It opens a path for exploring super-conductivity in iron-base superconductors.
△ Less
Submitted 26 January, 2021;
originally announced January 2021.
-
Surface state at $BaSnO_3$ evidenced by angle-resolved photoemission spectroscopy and ab initio calculations
Authors:
Muntaser Naamneh,
Abhinav Prakash,
Eduardo B. Guedes,
W. H. Brito,
Ming Shi,
Nicholas C. Plumb,
Bharat Jalan,
Milan Radović
Abstract:
Perovskite alkaline earth stannates, such as $BaSnO_3$ and $SrSnO_3$, showing light transparency and high electrical conductivity (when doped), have become promising candidates for novel optoelectrical devices. Such devices are mostly based on hetero-structures and understanding of their electronic structure, which usually deviates from the bulk, is mandatory for exploring a full application poten…
▽ More
Perovskite alkaline earth stannates, such as $BaSnO_3$ and $SrSnO_3$, showing light transparency and high electrical conductivity (when doped), have become promising candidates for novel optoelectrical devices. Such devices are mostly based on hetero-structures and understanding of their electronic structure, which usually deviates from the bulk, is mandatory for exploring a full application potential. Employing angle-resolved photoemission spectroscopy and ab initio calculations we reveal the existence of a 2-dimensional metallic state at the $SnO_2$-terminated surface of a 1\% La-doped $BaSnO_3$ thin film. The observed surface state is characterized by distinct carrier density and a smaller effective mass in comparison with the corresponding bulk values. The small surface effective mass of about $0.12m_e$ can cause an improvement of the electrical conductivity of BSO based heterostructures.
△ Less
Submitted 9 January, 2021;
originally announced January 2021.
-
The First 3D Coronal Loop Model Heated by MHD Waves against Radiative Losses
Authors:
Mijie Shi,
Tom Van Doorsselaere,
Mingzhe Guo,
Konstantinos Karampelas,
Bo Li,
Patrick Antolin
Abstract:
In the quest to solve the long-standing coronal heating problem, it has been suggested half a century ago that coronal loops could be heated by waves. Despite the accumulating observational evidence of the possible importance of coronal waves, still no 3D MHD simulations exist that show significant heating by MHD waves. Here we report on the first 3D coronal loop model heating the plasma against r…
▽ More
In the quest to solve the long-standing coronal heating problem, it has been suggested half a century ago that coronal loops could be heated by waves. Despite the accumulating observational evidence of the possible importance of coronal waves, still no 3D MHD simulations exist that show significant heating by MHD waves. Here we report on the first 3D coronal loop model heating the plasma against radiative cooling. The coronal loop is driven at the footpoint by transverse oscillations and subsequently the induced Kelvin-Helmholtz instability deforms the loop cross-section and generates small-scale structures. Wave energy is transfered to smaller scales where it is dissipated, overcoming the internal energy losses by radiation. These results open up a new avenue to address the coronal heating problem.
△ Less
Submitted 25 February, 2021; v1 submitted 4 January, 2021;
originally announced January 2021.
-
Knowledge Graphs Evolution and Preservation -- A Technical Report from ISWS 2019
Authors:
Nacira Abbas,
Kholoud Alghamdi,
Mortaza Alinam,
Francesca Alloatti,
Glenda Amaral,
Claudia d'Amato,
Luigi Asprino,
Martin Beno,
Felix Bensmann,
Russa Biswas,
Ling Cai,
Riley Capshaw,
Valentina Anita Carriero,
Irene Celino,
Amine Dadoun,
Stefano De Giorgis,
Harm Delva,
John Domingue,
Michel Dumontier,
Vincent Emonet,
Marieke van Erp,
Paola Espinoza Arias,
Omaima Fallatah,
Sebastián Ferrada,
Marc Gallofré Ocaña
, et al. (49 additional authors not shown)
Abstract:
One of the grand challenges discussed during the Dagstuhl Seminar "Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web" and described in its report is that of a: "Public FAIR Knowledge Graph of Everything: We increasingly see the creation of knowledge graphs that capture information about the entirety of a class of entities. [...] This grand challenge extends this fur…
▽ More
One of the grand challenges discussed during the Dagstuhl Seminar "Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web" and described in its report is that of a: "Public FAIR Knowledge Graph of Everything: We increasingly see the creation of knowledge graphs that capture information about the entirety of a class of entities. [...] This grand challenge extends this further by asking if we can create a knowledge graph of "everything" ranging from common sense concepts to location based entities. This knowledge graph should be "open to the public" in a FAIR manner democratizing this mass amount of knowledge." Although linked open data (LOD) is one knowledge graph, it is the closest realisation (and probably the only one) to a public FAIR Knowledge Graph (KG) of everything. Surely, LOD provides a unique testbed for experimenting and evaluating research hypotheses on open and FAIR KG. One of the most neglected FAIR issues about KGs is their ongoing evolution and long term preservation. We want to investigate this problem, that is to understand what preserving and supporting the evolution of KGs means and how these problems can be addressed. Clearly, the problem can be approached from different perspectives and may require the development of different approaches, including new theories, ontologies, metrics, strategies, procedures, etc. This document reports a collaborative effort performed by 9 teams of students, each guided by a senior researcher as their mentor, attending the International Semantic Web Research School (ISWS 2019). Each team provides a different perspective to the problem of knowledge graph evolution substantiated by a set of research questions as the main subject of their investigation. In addition, they provide their working definition for KG preservation and evolution.
△ Less
Submitted 22 December, 2020;
originally announced December 2020.
-
Link between superconductivity and a Lifshitz transition in intercalated Bi$_2$Se$_3$
Authors:
A. Almoalem,
I. Silber,
S. Sandik,
M. Lotem,
A. Ribak,
Y. Nitzav,
A. Yu. Kuntsevich,
O. A. Sobolevskiy,
Yu. G. Selivanov,
V. A. Prudkoglyad,
M. Shi,
L. Petaccia,
M. Goldstein,
Y. Dagan,
A. Kanigel
Abstract:
Topological superconductivity is an exotic phase of matter in which the fully gapped superconducting bulk hosts gapless Majorana surface states protected by topology. Intercalation of copper, strontium or niobium between the quintuple layers of the topological insulator Bi$_2$Se$_3$ increases the carrier density and leads to superconductivity that is suggested to be topological. Here we study the…
▽ More
Topological superconductivity is an exotic phase of matter in which the fully gapped superconducting bulk hosts gapless Majorana surface states protected by topology. Intercalation of copper, strontium or niobium between the quintuple layers of the topological insulator Bi$_2$Se$_3$ increases the carrier density and leads to superconductivity that is suggested to be topological. Here we study the electronic structure of strontium-intercalated Bi$_2$Se$_3$ using angle resolved photoemission spectroscopy (ARPES) and Shubnikov-de Haas (SdH) oscillations. Despite the apparent low Hall number of $\sim2 \times 10 ^{19}$cm$^{-3}$, we show that the Fermi surface is shaped as an open cylinder with a larger carrier density of $\sim 10 ^{20}$cm$^{-3}$. We suggest that superconductivity in intercalated Bi$_2$Se$_3$ emerges with the appearance of a quasi-2D open Fermi surface.
△ Less
Submitted 19 December, 2020; v1 submitted 12 December, 2020;
originally announced December 2020.
-
Anomalous Hall resistivity and possible topological Hall effect in the EuAl$_4$ antiferromagnet
Authors:
T. Shang,
Y. Xu,
D. J. Gawryluk,
J. Z. Ma,
T. Shiroka,
M. Shi,
E. Pomjakushina
Abstract:
We report the observation of anomalous Hall resistivity in single crystals of EuAl$_4$, a centrosymmetric tetragonal compound, which exhibits coexisting antiferromagnetic (AFM) and charge-density-wave (CDW) orders with onset at $T_\mathrm{N} \sim 15.6$ K and $T_\mathrm{CDW} \sim 140$ K, respectively. In the AFM state, when the magnetic field is applied along the $c$-axis direction, EuAl$_4$ underg…
▽ More
We report the observation of anomalous Hall resistivity in single crystals of EuAl$_4$, a centrosymmetric tetragonal compound, which exhibits coexisting antiferromagnetic (AFM) and charge-density-wave (CDW) orders with onset at $T_\mathrm{N} \sim 15.6$ K and $T_\mathrm{CDW} \sim 140$ K, respectively. In the AFM state, when the magnetic field is applied along the $c$-axis direction, EuAl$_4$ undergoes a series of metamagnetic transitions. Within this field range, we observe a clear hump-like anomaly in the Hall resistivity, representing part of the anomalous Hall resistivity. By considering different scenarios, we conclude that such a hump-like feature is most likely a manifestation of the topological Hall effect, normally occurring in noncentrosymmetric materials known to host nontrivial topological spin textures. In view of this, EuAl$_4$ would represent a rare case where the topological Hall effect not only arises in a centrosymmetric structure, but it also coexists with CDW order.
△ Less
Submitted 10 December, 2020;
originally announced December 2020.
-
Detecting Human-Object Interaction with Mixed Supervision
Authors:
Suresh Kirthi Kumaraswamy,
Miao**g Shi,
Ewa Kijak
Abstract:
Human object interaction (HOI) detection is an important task in image understanding and reasoning. It is in a form of HOI triplet <human; verb; object>, requiring bounding boxes for human and object, and action between them for the task completion. In other words, this task requires strong supervision for training that is however hard to procure. A natural solution to overcome this is to pursue w…
▽ More
Human object interaction (HOI) detection is an important task in image understanding and reasoning. It is in a form of HOI triplet <human; verb; object>, requiring bounding boxes for human and object, and action between them for the task completion. In other words, this task requires strong supervision for training that is however hard to procure. A natural solution to overcome this is to pursue weakly-supervised learning, where we only know the presence of certain HOI triplets in images but their exact location is unknown. Most weakly-supervised learning methods do not make provision for leveraging data with strong supervision, when they are available; and indeed a naïve combination of this two paradigms in HOI detection fails to make contributions to each other. In this regard we propose a mixed-supervised HOI detection pipeline: thanks to a specific design of momentum-independent learning that learns seamlessly across these two types of supervision. Moreover, in light of the annotation insufficiency in mixed supervision, we introduce an HOI element swap** technique to synthesize diverse and hard negatives across images and improve the robustness of the model. Our method is evaluated on the challenging HICO-DET dataset. It performs close to or even better than many fully-supervised methods by using a mixed amount of strong and weak annotations; furthermore, it outperforms representative state of the art weakly and fully-supervised methods under the same supervision.
△ Less
Submitted 12 November, 2020; v1 submitted 10 November, 2020;
originally announced November 2020.
-
Fast Fourier Intrinsic Network
Authors:
Yanlin Qian,
Miao**g Shi,
Joni-Kristian Kämäräinen,
Jiri Matas
Abstract:
We address the problem of decomposing an image into albedo and shading. We propose the Fast Fourier Intrinsic Network, FFI-Net in short, that operates in the spectral domain, splitting the input into several spectral bands. Weights in FFI-Net are optimized in the spectral domain, allowing faster convergence to a lower error. FFI-Net is lightweight and does not need auxiliary networks for training.…
▽ More
We address the problem of decomposing an image into albedo and shading. We propose the Fast Fourier Intrinsic Network, FFI-Net in short, that operates in the spectral domain, splitting the input into several spectral bands. Weights in FFI-Net are optimized in the spectral domain, allowing faster convergence to a lower error. FFI-Net is lightweight and does not need auxiliary networks for training. The network is trained end-to-end with a novel spectral loss which measures the global distance between the network prediction and corresponding ground truth. FFI-Net achieves state-of-the-art performance on MPI-Sintel, MIT Intrinsic, and IIW datasets.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
-
Limit theorems and ergodicity for general bootstrap random walks
Authors:
A. Collevecchio,
K. Hamza,
M. Shi,
R. J. Williams
Abstract:
Given the increments of a simple symmetric random walk $(X_n)_{n\ge0}$, we characterize all possible ways of recycling these increments into a simple symmetric random walk $(Y_n)_{n\ge0}$ adapted to the filtration of $(X_n)_{n\ge0}$. We study the long term behavior of a suitably normalized two-dimensional process $((X_n,Y_n))_{n\ge0}$. In particular, we provide necessary and sufficient conditions…
▽ More
Given the increments of a simple symmetric random walk $(X_n)_{n\ge0}$, we characterize all possible ways of recycling these increments into a simple symmetric random walk $(Y_n)_{n\ge0}$ adapted to the filtration of $(X_n)_{n\ge0}$. We study the long term behavior of a suitably normalized two-dimensional process $((X_n,Y_n))_{n\ge0}$. In particular, we provide necessary and sufficient conditions for the process to converge to a two-dimensional Brownian motion (possibly degenerate). We also discuss cases in which the limit is not Gaussian. Finally, we provide a simple necessary and sufficient condition for the ergodicity of the recycling transformation, thus generalizing results from Dubins and Smorodinsky (1992) and Fujita (2008), and solving the discrete version of the open problem of the ergodicity of the general Lévy transformation (see Mansuy and Yor, 2006).
△ Less
Submitted 30 June, 2021; v1 submitted 29 October, 2020;
originally announced October 2020.
-
Restoring Negative Information in Few-Shot Object Detection
Authors:
Yukuan Yang,
Fangyun Wei,
Miao**g Shi,
Guoqi Li
Abstract:
Few-shot learning has recently emerged as a new challenge in the deep learning field: unlike conventional methods that train the deep neural networks (DNNs) with a large number of labeled data, it asks for the generalization of DNNs on new classes with few annotated samples. Recent advances in few-shot learning mainly focus on image classification while in this paper we focus on object detection.…
▽ More
Few-shot learning has recently emerged as a new challenge in the deep learning field: unlike conventional methods that train the deep neural networks (DNNs) with a large number of labeled data, it asks for the generalization of DNNs on new classes with few annotated samples. Recent advances in few-shot learning mainly focus on image classification while in this paper we focus on object detection. The initial explorations in few-shot object detection tend to simulate a classification scenario by using the positive proposals in images with respect to certain object class while discarding the negative proposals of that class. Negatives, especially hard negatives, however, are essential to the embedding space learning in few-shot object detection. In this paper, we restore the negative information in few-shot object detection by introducing a new negative- and positive-representative based metric learning framework and a new inference scheme with negative and positive representatives. We build our work on a recent few-shot pipeline RepMet with several new modules to encode negative information for both training and testing. Extensive experiments on ImageNet-LOC and PASCAL VOC show our method substantially improves the state-of-the-art few-shot object detection solutions. Our code is available at https://github.com/yang-yk/NP-RepMet.
△ Less
Submitted 25 October, 2020; v1 submitted 22 October, 2020;
originally announced October 2020.
-
A Noise-Tolerant Quasi-Newton Algorithm for Unconstrained Optimization
Authors:
Hao-Jun Michael Shi,
Yuchen Xie,
Richard Byrd,
Jorge Nocedal
Abstract:
This paper describes an extension of the BFGS and L-BFGS methods for the minimization of a nonlinear function subject to errors. This work is motivated by applications that contain computational noise, employ low-precision arithmetic, or are subject to statistical noise. The classical BFGS and L-BFGS methods can fail in such circumstances because the updating procedure can be corrupted and the lin…
▽ More
This paper describes an extension of the BFGS and L-BFGS methods for the minimization of a nonlinear function subject to errors. This work is motivated by applications that contain computational noise, employ low-precision arithmetic, or are subject to statistical noise. The classical BFGS and L-BFGS methods can fail in such circumstances because the updating procedure can be corrupted and the line search can behave erratically. The proposed method addresses these difficulties and ensures that the BFGS update is stable by employing a lengthening procedure that spaces out the points at which gradient differences are collected. A new line search, designed to tolerate errors, guarantees that the Armijo-Wolfe conditions are satisfied under most reasonable conditions, and works in conjunction with the lengthening procedure. The proposed methods are shown to enjoy convergence guarantees for strongly convex functions. Detailed implementations of the methods are presented, together with encouraging numerical results.
△ Less
Submitted 8 September, 2021; v1 submitted 8 October, 2020;
originally announced October 2020.
-
Evolutionary Architecture Search for Graph Neural Networks
Authors:
Min Shi,
David A. Wilson,
Xingquan Zhu,
Yu Huang,
Yuan Zhuang,
Jianxun Liu,
Yufei Tang
Abstract:
Automated machine learning (AutoML) has seen a resurgence in interest with the boom of deep learning over the past decade. In particular, Neural Architecture Search (NAS) has seen significant attention throughout the AutoML research community, and has pushed forward the state-of-the-art in a number of neural models to address grid-like data such as texts and images. However, very litter work has b…
▽ More
Automated machine learning (AutoML) has seen a resurgence in interest with the boom of deep learning over the past decade. In particular, Neural Architecture Search (NAS) has seen significant attention throughout the AutoML research community, and has pushed forward the state-of-the-art in a number of neural models to address grid-like data such as texts and images. However, very litter work has been done about Graph Neural Networks (GNN) learning on unstructured network data. Given the huge number of choices and combinations of components such as aggregator and activation function, determining the suitable GNN structure for a specific problem normally necessitates tremendous expert knowledge and laborious trails. In addition, the slight variation of hyper parameters such as learning rate and dropout rate could dramatically hurt the learning capacity of GNN. In this paper, we propose a novel AutoML framework through the evolution of individual models in a large GNN architecture space involving both neural structures and learning parameters. Instead of optimizing only the model structures with fixed parameter settings as existing work, an alternating evolution process is performed between GNN structures and learning parameters to dynamically find the best fit of each other. To the best of our knowledge, this is the first work to introduce and evaluate evolutionary architecture search for GNN models. Experiments and validations demonstrate that evolutionary NAS is capable of matching existing state-of-the-art reinforcement learning approaches for both the semi-supervised transductive and inductive node representation learning and classification.
△ Less
Submitted 21 September, 2020;
originally announced September 2020.
-
Multiple mobile excitons manifested as sidebands in quasi-one-dimensional metallic TaSe3
Authors:
Junzhang Ma,
Simin Nie,
Xin Gui,
Muntaser Naamneh,
Jasmin Jandke,
Chuanying Xi,
**glei Zhang,
Tian Shang,
Yimin Xiong,
Itzik Kapon,
Neeraj Kumar,
Yona Soh,
Daniel Gosálbez-Martínez,
Oleg V. Yazyev,
Wenhui Fan,
Hannes Hübener,
Umberto De Giovannini,
Nicholas Clark Plumb,
Milan Radovic,
Michael Andreas Sentef,
Weiwei Xie,
Zhijun Wang,
Christopher Mudry,
Markus Müller,
Ming Shi
Abstract:
Charge neutrality and their expected itinerant nature makes excitons potential transmitters of information. However, exciton mobility remains inaccessible to traditional optical experiments that only create and detect excitons with negligible momentum. Here, using angle-resolved photoemission spectroscopy, we detect dispersing excitons in the quasi-one-dimensional metallic trichalcogenide, TaSe3.…
▽ More
Charge neutrality and their expected itinerant nature makes excitons potential transmitters of information. However, exciton mobility remains inaccessible to traditional optical experiments that only create and detect excitons with negligible momentum. Here, using angle-resolved photoemission spectroscopy, we detect dispersing excitons in the quasi-one-dimensional metallic trichalcogenide, TaSe3. The low density of conduction electrons and the low dimensionality in TaSe3 combined with a polaronic renormalization of the conduction band and the poorly screened interaction between these polarons and photo-induced valence holes leads to various excitonic bound states that we interpret as intrachain and interchain excitons, and possibly trions. The thresholds for the formation of a photo-hole together with an exciton appear as side valence bands with dispersions nearly parallel to the main valence band, but shifted to lower excitation energies. The energy separation between side and main valence bands can be controlled by surface do**, enabling the tuning of certain exciton properties.
△ Less
Submitted 24 February, 2022; v1 submitted 15 September, 2020;
originally announced September 2020.
-
Experimental Evidence of Stable 2$H$ Phase on the Surface of Layered 1$T'$-TaTe$_2$
Authors:
Indrani Kar,
Kapildeb Dolui,
Luminita Harnagea,
Y. Kushnirenko,
G. Shipunov,
N. C. Plumb,
M. Shi,
B. Büchner,
S. Thirupathaiah
Abstract:
We report on the low-energy electronic structure of Tantalum ditelluride (1$T'$-TaTe$_2$), one of the charge density wave (CDW) materials from the group V transition metal dichalcogenides using angle-resolved photoemission spectroscopy (ARPES) and density functional theory (DFT). We find that the Fermi surface topology of TaTe$_2$ is quite complicated compared to its isovalent compounds such as Ta…
▽ More
We report on the low-energy electronic structure of Tantalum ditelluride (1$T'$-TaTe$_2$), one of the charge density wave (CDW) materials from the group V transition metal dichalcogenides using angle-resolved photoemission spectroscopy (ARPES) and density functional theory (DFT). We find that the Fermi surface topology of TaTe$_2$ is quite complicated compared to its isovalent compounds such as TaS$_2$, TaSe$_2$, and isostructural compound NbTe$_2$. More importantly, we discover that the surface electronic structure of 1$T'$-TaTe$_2$ has more resemblance to the 2$H$-TaTe$_2$, while the bulk electronic structure has more resemblance to the hypothetical 1$T$-TaTe$_2$. These experimental observations are thoroughly compared with our DFT calculations performed on 1$T$-, 2$H$- and 2$H$ (monolayer)/1$T$- TaTe$_2$. We further notice that the Fermi surface topology is temperature independent up to 180 K, confirming that the 2$H$ phase on the surface is stable up to 180 K and the CDW order is not due to the Fermi surface nesting.
△ Less
Submitted 22 December, 2020; v1 submitted 2 September, 2020;
originally announced September 2020.
-
Search for a moving target in a competitive environment
Authors:
Benoit Duvocelle,
János Flesch,
Hui Min Shi,
Dries Vermeulen
Abstract:
We consider a discrete-time dynamic search game in which a number of players compete to find an invisible object that is moving according to a time-varying Markov chain. We examine the subgame perfect equilibria of these games. The main result of the paper is that the set of subgame perfect equilibria is exactly the set of greedy strategy profiles, i.e. those strategy profiles in which the players…
▽ More
We consider a discrete-time dynamic search game in which a number of players compete to find an invisible object that is moving according to a time-varying Markov chain. We examine the subgame perfect equilibria of these games. The main result of the paper is that the set of subgame perfect equilibria is exactly the set of greedy strategy profiles, i.e. those strategy profiles in which the players always choose an action that maximizes their probability of immediately finding the object. We discuss various variations and extensions of the model.
△ Less
Submitted 25 August, 2020; v1 submitted 21 August, 2020;
originally announced August 2020.
-
Unconventional transverse transport above and below the magnetic transition temperature in Weyl semimetal EuCd$_2$As$_2$
Authors:
Y. Xu,
L. Das,
J. Z. Ma,
C. J. Yi,
S. M. Nie,
Y. G. Shi,
A. Tiwari,
S. S. Tsirkin,
T. Neupert,
M. Medarde,
M. Shi,
J. Chang,
T. Shang
Abstract:
As exemplified by the growing interest in the quantum anomalous Hall effect, the research on topology as an organizing principle of quantum matter is greatly enriched from the interplay with magnetism. In this vein, we present a combined electrical and thermoelectrical transport study on the magnetic Weyl semimetal EuCd$_2$As$_2$. Unconventional contribution to the anomalous Hall and anomalous Ner…
▽ More
As exemplified by the growing interest in the quantum anomalous Hall effect, the research on topology as an organizing principle of quantum matter is greatly enriched from the interplay with magnetism. In this vein, we present a combined electrical and thermoelectrical transport study on the magnetic Weyl semimetal EuCd$_2$As$_2$. Unconventional contribution to the anomalous Hall and anomalous Nernst effects were observed both above and below the magnetic transition temperature of EuCd$_2$As$_2$, indicating the existence of significant Berry curvature. EuCd$_2$As$_2$ represents a rare case in which this unconventional transverse transport emerges both above and below the magnetic transition temperature in the same material. The transport properties evolve with temperature and field in the antiferromagnetic phase in a different manner than in the paramagnetic phase, suggesting different mechanisms to their origin. Our results indicate EuCd$_2$As$_2$ is a fertile playground for investigating the interplay between magnetism and topology, and potentially a plethora of topologically nontrivial phases rooted in this interplay.
△ Less
Submitted 27 January, 2021; v1 submitted 12 August, 2020;
originally announced August 2020.
-
Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer
Authors:
Yuting Liu,
Zheng Wang,
Miao**g Shi,
Shin'ichi Satoh,
Qijun Zhao,
Hongyu Yang
Abstract:
Unsupervised crowd counting is a challenging yet not largely explored task. In this paper, we explore it in a transfer learning setting where we learn to detect and count persons in an unlabeled target set by transferring bi-knowledge learnt from regression- and detection-based models in a labeled source set. The dual source knowledge of the two models is heterogeneous and complementary as they ca…
▽ More
Unsupervised crowd counting is a challenging yet not largely explored task. In this paper, we explore it in a transfer learning setting where we learn to detect and count persons in an unlabeled target set by transferring bi-knowledge learnt from regression- and detection-based models in a labeled source set. The dual source knowledge of the two models is heterogeneous and complementary as they capture different modalities of the crowd distribution. We formulate the mutual transformations between the outputs of regression- and detection-based models as two scene-agnostic transformers which enable knowledge distillation between the two models. Given the regression- and detection-based models and their mutual transformers learnt in the source, we introduce an iterative self-supervised learning scheme with regression-detection bi-knowledge transfer in the target. Extensive experiments on standard crowd counting benchmarks, ShanghaiTech, UCF\_CC\_50, and UCF\_QNRF demonstrate a substantial improvement of our method over other state-of-the-arts in the transfer learning setting.
△ Less
Submitted 27 September, 2020; v1 submitted 12 August, 2020;
originally announced August 2020.
-
Defending Adversarial Examples via DNN Bottleneck Reinforcement
Authors:
Wenqing Liu,
Miao**g Shi,
Teddy Furon,
Li Li
Abstract:
This paper presents a DNN bottleneck reinforcement scheme to alleviate the vulnerability of Deep Neural Networks (DNN) against adversarial attacks. Typical DNN classifiers encode the input image into a compressed latent representation more suitable for inference. This information bottleneck makes a trade-off between the image-specific structure and class-specific information in an image. By reinfo…
▽ More
This paper presents a DNN bottleneck reinforcement scheme to alleviate the vulnerability of Deep Neural Networks (DNN) against adversarial attacks. Typical DNN classifiers encode the input image into a compressed latent representation more suitable for inference. This information bottleneck makes a trade-off between the image-specific structure and class-specific information in an image. By reinforcing the former while maintaining the latter, any redundant information, be it adversarial or not, should be removed from the latent representation. Hence, this paper proposes to jointly train an auto-encoder (AE) sharing the same encoding weights with the visual classifier. In order to reinforce the information bottleneck, we introduce the multi-scale low-pass objective and multi-scale high-frequency communication for better frequency steering in the network. Unlike existing approaches, our scheme is the first reforming defense per se which keeps the classifier structure untouched without appending any pre-processing head and is trained with clean images only. Extensive experiments on MNIST, CIFAR-10 and ImageNet demonstrate the strong defense of our method against various adversarial attacks.
△ Less
Submitted 12 August, 2020;
originally announced August 2020.
-
Multigap superconductivity in the Mo$_5$PB$_2$ boron-phosphorus compound
Authors:
T. Shang,
W. Xie,
D. J. Gawryluk,
R. Khasanov,
J. Z. Zhao,
M. Medarde,
M. Shi,
H. Q. Yuan,
E. Pomjakushina,
T. Shiroka
Abstract:
The tetragonal Mo$_5$PB$_2$ compound was recently reported to show superconductivity with a critical temperature up to 9.2 K. In search of evidence for multiple superconducting gaps in Mo$_5$PB$_2$, comprehensive measurements, including magnetic susceptibility, electrical resistivity, heat capacity, and muon-spin rotation and relaxation ($μ$SR) measurements were carried out. Data from both low-tem…
▽ More
The tetragonal Mo$_5$PB$_2$ compound was recently reported to show superconductivity with a critical temperature up to 9.2 K. In search of evidence for multiple superconducting gaps in Mo$_5$PB$_2$, comprehensive measurements, including magnetic susceptibility, electrical resistivity, heat capacity, and muon-spin rotation and relaxation ($μ$SR) measurements were carried out. Data from both low-temperature superfluid density and electronic specific heat suggest a nodeless superconducting ground state in Mo$_5$PB$_2$. Two superconducting energy gaps $Δ_0$ = 1.02 meV (25%) and 1.49 meV (75%) are required to describe the low-$T$ electronic specific-heat data. The multigap features are clearly evidenced by the field dependence of the electronic specific-heat coefficient and the Gaussian relaxation rate in the superconducting state (i.e., superfluid density), as well as by the temperature dependence of the upper critical field. By combining our extensive experimental results with numerical band-structure calculations, we provide compelling evidence of multigap superconductivity in Mo$_5$PB$_2$.
△ Less
Submitted 5 August, 2020;
originally announced August 2020.
-
DBS: Dynamic Batch Size For Distributed Deep Neural Network Training
Authors:
Qing Ye,
Yuhao Zhou,
Mingjia Shi,
Yanan Sun,
Jiancheng Lv
Abstract:
Synchronous strategies with data parallelism, such as the Synchronous StochasticGradient Descent (S-SGD) and the model averaging methods, are widely utilizedin distributed training of Deep Neural Networks (DNNs), largely owing to itseasy implementation yet promising performance. Particularly, each worker ofthe cluster hosts a copy of the DNN and an evenly divided share of the datasetwith the fixed…
▽ More
Synchronous strategies with data parallelism, such as the Synchronous StochasticGradient Descent (S-SGD) and the model averaging methods, are widely utilizedin distributed training of Deep Neural Networks (DNNs), largely owing to itseasy implementation yet promising performance. Particularly, each worker ofthe cluster hosts a copy of the DNN and an evenly divided share of the datasetwith the fixed mini-batch size, to keep the training of DNNs convergence. In thestrategies, the workers with different computational capability, need to wait foreach other because of the synchronization and delays in network transmission,which will inevitably result in the high-performance workers wasting computation.Consequently, the utilization of the cluster is relatively low. To alleviate thisissue, we propose the Dynamic Batch Size (DBS) strategy for the distributedtraining of DNNs. Specifically, the performance of each worker is evaluatedfirst based on the fact in the previous epoch, and then the batch size and datasetpartition are dynamically adjusted in consideration of the current performanceof the worker, thereby improving the utilization of the cluster. To verify theeffectiveness of the proposed strategy, extensive experiments have been conducted,and the experimental results indicate that the proposed strategy can fully utilizethe performance of the cluster, reduce the training time, and have good robustnesswith disturbance by irrelevant tasks. Furthermore, rigorous theoretical analysis hasalso been provided to prove the convergence of the proposed strategy.
△ Less
Submitted 3 November, 2022; v1 submitted 23 July, 2020;
originally announced July 2020.
-
Active Crowd Counting with Limited Supervision
Authors:
Zhen Zhao,
Miao**g Shi,
Xiaoxiao Zhao,
Li Li
Abstract:
To learn a reliable people counter from crowd images, head center annotations are normally required. Annotating head centers is however a laborious and tedious process in dense crowds. In this paper, we present an active learning framework which enables accurate crowd counting with limited supervision: given a small labeling budget, instead of randomly selecting images to annotate, we first introd…
▽ More
To learn a reliable people counter from crowd images, head center annotations are normally required. Annotating head centers is however a laborious and tedious process in dense crowds. In this paper, we present an active learning framework which enables accurate crowd counting with limited supervision: given a small labeling budget, instead of randomly selecting images to annotate, we first introduce an active labeling strategy to annotate the most informative images in the dataset and learn the counting model upon them. The process is repeated such that in every cycle we select the samples that are diverse in crowd density and dissimilar to previous selections. In the last cycle when the labeling budget is met, the large amount of unlabeled data are also utilized: a distribution classifier is introduced to align the labeled data with unlabeled data; furthermore, we propose to mix up the distribution labels and latent representations of data in the network to particularly improve the distribution alignment in-between training samples. We follow the popular density estimation pipeline for crowd counting. Extensive experiments are conducted on standard benchmarks i.e. ShanghaiTech, UCF CC 50, MAll, TRANCOS, and DCC. By annotating limited number of images (e.g. 10% of the dataset), our method reaches levels of performance not far from the state of the art which utilize full annotations of the dataset.
△ Less
Submitted 14 July, 2020; v1 submitted 13 July, 2020;
originally announced July 2020.