-
Application and modeling of an online distillation method to reduce krypton and argon in XENON1T
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
A. Bernard,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino
, et al. (129 additional authors not shown)
Abstract:
A novel online distillation technique was developed for the XENON1T dark matter experiment to reduce intrinsic background components more volatile than xenon, such as krypton or argon, while the detector was operating. The method is based on a continuous purification of the gaseous volume of the detector system using the XENON1T cryogenic distillation column. A krypton-in-xenon concentration of…
▽ More
A novel online distillation technique was developed for the XENON1T dark matter experiment to reduce intrinsic background components more volatile than xenon, such as krypton or argon, while the detector was operating. The method is based on a continuous purification of the gaseous volume of the detector system using the XENON1T cryogenic distillation column. A krypton-in-xenon concentration of $(360 \pm 60)$ ppq was achieved. It is the lowest concentration measured in the fiducial volume of an operating dark matter detector to date. A model was developed and fit to the data to describe the krypton evolution in the liquid and gas volumes of the detector system for several operation modes over the time span of 550 days, including the commissioning and science runs of XENON1T. The online distillation was also successfully applied to remove Ar-37 after its injection for a low energy calibration in XENON1T. This makes the usage of Ar-37 as a regular calibration source possible in the future. The online distillation can be applied to next-generation experiments to remove krypton prior to, or during, any science run. The model developed here allows further optimization of the distillation strategy for future large scale detectors.
△ Less
Submitted 14 June, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Emission of Single and Few Electrons in XENON1T and Limits on Light Dark Matter
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
A. Bernard,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino
, et al. (130 additional authors not shown)
Abstract:
Delayed single- and few-electron emissions plague dual-phase time projection chambers, limiting their potential to search for light-mass dark matter. This paper examines the origins of these events in the XENON1T experiment. Characterization of the intensity of delayed electron backgrounds shows that the resulting emissions are correlated, in time and position, with high-energy events and can effe…
▽ More
Delayed single- and few-electron emissions plague dual-phase time projection chambers, limiting their potential to search for light-mass dark matter. This paper examines the origins of these events in the XENON1T experiment. Characterization of the intensity of delayed electron backgrounds shows that the resulting emissions are correlated, in time and position, with high-energy events and can effectively be vetoed. In this work we extend previous S2-only analyses down to a single electron. From this analysis, after removing the correlated backgrounds, we observe rates < 30 events/(electron*kg*day) in the region of interest spanning 1 to 5 electrons. We derive 90% confidence upper limits for dark matter-electron scattering, first direct limits on the electric dipole, magnetic dipole, and anapole interactions, and bosonic dark matter models, where we exclude new parameter space for dark photons and solar dark photons.
△ Less
Submitted 28 June, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Entanglement between superconducting qubits and a tardigrade
Authors:
K. S. Lee,
Y. P. Tan,
L. H. Nguyen,
R. P. Budoyo,
K. H. Park,
C. Hufnagel,
Y. S. Yap,
N. Møbjerg,
V. Vedral,
T. Paterek,
R. Dumke
Abstract:
Quantum and biological systems are seldom discussed together as they seemingly demand opposing conditions. Life is complex, "hot and wet" whereas quantum objects are small, cold and well controlled. Here, we overcome this barrier with a tardigrade -- a microscopic multicellular organism known to tolerate extreme physiochemical conditions via a latent state of life known as cryptobiosis. We observe…
▽ More
Quantum and biological systems are seldom discussed together as they seemingly demand opposing conditions. Life is complex, "hot and wet" whereas quantum objects are small, cold and well controlled. Here, we overcome this barrier with a tardigrade -- a microscopic multicellular organism known to tolerate extreme physiochemical conditions via a latent state of life known as cryptobiosis. We observe coupling between the animal in cryptobiosis and a superconducting quantum bit and prepare a highly entangled state between this combined system and another qubit. The tardigrade itself is shown to be entangled with the remaining subsystems. The animal is then observed to return to its active form after 420 hours at sub 10 mK temperatures and pressure of $6\times 10^{-6}$ mbar, setting a new record for the conditions that a complex form of life can survive.
△ Less
Submitted 16 December, 2021; v1 submitted 15 December, 2021;
originally announced December 2021.
-
Material radiopurity control in the XENONnT experiment
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino,
M. Clark
, et al. (128 additional authors not shown)
Abstract:
The selection of low-radioactive construction materials is of the utmost importance for rare-event searches and thus critical to the XENONnT experiment. Results of an extensive radioassay program are reported, in which material samples have been screened with gamma-ray spectroscopy, mass spectrometry, and $^{222}$Rn emanation measurements. Furthermore, the cleanliness procedures applied to remove…
▽ More
The selection of low-radioactive construction materials is of the utmost importance for rare-event searches and thus critical to the XENONnT experiment. Results of an extensive radioassay program are reported, in which material samples have been screened with gamma-ray spectroscopy, mass spectrometry, and $^{222}$Rn emanation measurements. Furthermore, the cleanliness procedures applied to remove or mitigate surface contamination of detector materials are described. Screening results, used as inputs for a XENONnT Monte Carlo simulation, predict a reduction of materials background ($\sim$17%) with respect to its predecessor XENON1T. Through radon emanation measurements, the expected $^{222}$Rn activity concentration in XENONnT is determined to be 4.2$\,(^{+0.5}_{-0.7})\,μ$Bq/kg, a factor three lower with respect to XENON1T. This radon concentration will be further suppressed by means of the novel radon distillation system.
△ Less
Submitted 26 January, 2023; v1 submitted 10 December, 2021;
originally announced December 2021.
-
ICARUS-Q: Integrated Control and Readout Unit for Scalable Quantum Processors
Authors:
Kun Hee Park,
Yung Szen Yap,
Yuanzheng Paul Tan,
Christoph Hufnagel,
Long Hoang Nguyen,
Karn Hwa Lau,
Patrick Bore,
Stavros Efthymiou,
Stefano Carrazza,
Rangga P. Budoyo,
Rainer Dumke
Abstract:
We present a control and measurement setup for superconducting qubits based on Xilinx 16-channel radio-frequency system-on-chip (RFSoC) device. The proposed setup consists of four parts: multiple RFSoC boards, a setup to synchronise every digital to analog converter (DAC), and analog to digital converter (ADC) channel across multiple boards, a low-noise direct current (DC) supply for tuning the qu…
▽ More
We present a control and measurement setup for superconducting qubits based on Xilinx 16-channel radio-frequency system-on-chip (RFSoC) device. The proposed setup consists of four parts: multiple RFSoC boards, a setup to synchronise every digital to analog converter (DAC), and analog to digital converter (ADC) channel across multiple boards, a low-noise direct current (DC) supply for tuning the qubit frequency and cloud access for remotely performing experiments. We also design the setup to be free of physical mixers. The RFSoC boards directly generate microwave pulses using sixteen DAC channels up to the third Nyquist zone which are directly sampled by its eight ADC channels between the fifth and the ninth zones.
△ Less
Submitted 1 September, 2022; v1 submitted 6 December, 2021;
originally announced December 2021.
-
GB-CosFace: Rethinking Softmax-based Face Recognition from the Perspective of Open Set Classification
Authors:
Lizhe Liu,
Mingqiang Chen,
Xiaohao Chen,
Siyu Zhu,
** Tan
Abstract:
State-of-the-art face recognition methods typically take the multi-classification pipeline and adopt the softmax-based loss for optimization. Although these methods have achieved great success, the softmax-based loss has its limitation from the perspective of open set classification: the multi-classification objective in the training phase does not strictly match the objective of open set classifi…
▽ More
State-of-the-art face recognition methods typically take the multi-classification pipeline and adopt the softmax-based loss for optimization. Although these methods have achieved great success, the softmax-based loss has its limitation from the perspective of open set classification: the multi-classification objective in the training phase does not strictly match the objective of open set classification testing. In this paper, we derive a new loss named global boundary CosFace (GB-CosFace). Our GB-CosFace introduces an adaptive global boundary to determine whether two face samples belong to the same identity so that the optimization objective is aligned with the testing process from the perspective of open set classification. Meanwhile, since the loss formulation is derived from the softmax-based loss, our GB-CosFace retains the excellent properties of the softmax-based loss, and CosFace is proved to be a special case of the proposed loss. We analyze and explain the proposed GB-CosFace geometrically. Comprehensive experiments on multiple face recognition benchmarks indicate that the proposed GB-CosFace outperforms current state-of-the-art face recognition losses in mainstream face recognition tasks. Compared to CosFace, our GB-CosFace improves 1.58%, 0.57%, and 0.28% at TAR@FAR=1e-6, 1e-5, 1e-4 on IJB-C benchmark.
△ Less
Submitted 10 February, 2023; v1 submitted 22 November, 2021;
originally announced November 2021.
-
Handshakes AI Research at CASE 2021 Task 1: Exploring different approaches for multilingual tasks
Authors:
Vivek Kalyan,
Paul Tan,
Shaun Tan,
Martin Andrews
Abstract:
The aim of the CASE 2021 Shared Task 1 (Hürriyetoğlu et al., 2021) was to detect and classify socio-political and crisis event information at document, sentence, cross-sentence, and token levels in a multilingual setting, with each of these subtasks being evaluated separately in each test language. Our submission contained entries in all of the subtasks, and the scores obtained validated our resea…
▽ More
The aim of the CASE 2021 Shared Task 1 (Hürriyetoğlu et al., 2021) was to detect and classify socio-political and crisis event information at document, sentence, cross-sentence, and token levels in a multilingual setting, with each of these subtasks being evaluated separately in each test language. Our submission contained entries in all of the subtasks, and the scores obtained validated our research finding: That the multilingual aspect of the tasks should be embraced, so that modeling and training regimes use the multilingual nature of the tasks to their mutual benefit, rather than trying to tackle the different languages separately. Our code is available at https://github.com/HandshakesByDC/case2021/
△ Less
Submitted 29 October, 2021;
originally announced October 2021.
-
A random batch Ewald method for charged particles in the isothermal-isobaric ensemble
Authors:
Jiuyang Liang,
Pan Tan,
Liang Hong,
Shi **,
Zhenli Xu,
Lei Li
Abstract:
We develop an accurate, highly efficient and scalable random batch Ewald (RBE) method to conduct simulations in the isothermal-isobaric ensemble (the NPT ensemble) for charged particles in a periodic box. After discretizing the Langevin equations of motion derived using suitable Lagrangians, the RBE method builds the mini-batch strategy into the Fourier space in the Ewald summation for the pressur…
▽ More
We develop an accurate, highly efficient and scalable random batch Ewald (RBE) method to conduct simulations in the isothermal-isobaric ensemble (the NPT ensemble) for charged particles in a periodic box. After discretizing the Langevin equations of motion derived using suitable Lagrangians, the RBE method builds the mini-batch strategy into the Fourier space in the Ewald summation for the pressure and forces so the computational cost is reduced from $\mathcal{O}(N^2)$ to $\mathcal{O}(N)$ per time step. We implement the method in the LAMMPS package and report accurate simulation results for both dynamical quantities and statistics for equilibrium for typical systems including all-atom bulk water and a semi-isotropic membrane system. Numerical simulations on massive supercomputing cluster are also performed to show promising CPU efficiency of RBE.
△ Less
Submitted 27 October, 2021;
originally announced October 2021.
-
Quaternions over Galois rings and their codes
Authors:
Pierre Lance Tan,
Virgilio Sison
Abstract:
It is shown in this paper that, if $R$ is a Frobenius ring, then the quaternion ring $\mathcal{H}_{a,b}(R)$ is a Frobenius ring for all units $a,b \in R$. In particular, if $q$ is an odd prime power then $\mathcal{H}_{a,b}(\mathbb{F}_q)$ is the semisimple non-commutative matrix ring $M_2(\mathbb{F}_q)$. Consequently, a homogeneous weight that depends on the field size $q$ is obtained. On the other…
▽ More
It is shown in this paper that, if $R$ is a Frobenius ring, then the quaternion ring $\mathcal{H}_{a,b}(R)$ is a Frobenius ring for all units $a,b \in R$. In particular, if $q$ is an odd prime power then $\mathcal{H}_{a,b}(\mathbb{F}_q)$ is the semisimple non-commutative matrix ring $M_2(\mathbb{F}_q)$. Consequently, a homogeneous weight that depends on the field size $q$ is obtained. On the other hand, the homogeneous weight of a finite Frobenius ring with a unique minimal ideal is derived in terms of the size of the ideal. This is illustrated by the quaternions over the Galois ring $GR(2^r,m)$. Finally, one-sided linear block codes over the quaternions over Galois rings are constructed, and certain bounds on the homogeneous distance of the images of these codes are proved. These bounds are based on the Hamming distance of the quaternion code and the parameters of the Galois ring. Good examples of one-sided rate-2/6, 3-quasi-cyclic quaternion codes and their images are generated. One of these codes meets the Singleton bound and is therefore a maximum distance separable code.
△ Less
Submitted 2 September, 2021;
originally announced September 2021.
-
DV-Det: Efficient 3D Point Cloud Object Detection with Dynamic Voxelization
Authors:
Zhaoyu Su,
Pin Siang Tan,
Yu-Hsing Wang
Abstract:
In this work, we propose a novel two-stage framework for the efficient 3D point cloud object detection. Instead of transforming point clouds into 2D bird eye view projections, we parse the raw point cloud data directly in the 3D space yet achieve impressive efficiency and accuracy. To achieve this goal, we propose dynamic voxelization, a method that voxellizes points at local scale on-the-fly. By…
▽ More
In this work, we propose a novel two-stage framework for the efficient 3D point cloud object detection. Instead of transforming point clouds into 2D bird eye view projections, we parse the raw point cloud data directly in the 3D space yet achieve impressive efficiency and accuracy. To achieve this goal, we propose dynamic voxelization, a method that voxellizes points at local scale on-the-fly. By doing so, we preserve the point cloud geometry with 3D voxels, and therefore waive the dependence on expensive MLPs to learn from point coordinates. On the other hand, we inherently still follow the same processing pattern as point-wise methods (e.g., PointNet) and no longer suffer from the quantization issue like conventional convolutions. For further speed optimization, we propose the grid-based downsampling and voxelization method, and provide different CUDA implementations to accommodate to the discrepant requirements during training and inference phases. We highlight our efficiency on KITTI 3D object detection dataset with 75 FPS and on Waymo Open dataset with 25 FPS inference speed with satisfactory accuracy.
△ Less
Submitted 27 July, 2021;
originally announced July 2021.
-
Superscalability of the random batch Ewald method
Authors:
Jiuyang Liang,
Pan Tan,
Yue Zhao,
Lei Li,
Shi **,
Liang Hong,
Zhenli Xu
Abstract:
Coulomb interaction, following an inverse-square force-law, quantifies the amount of force between two stationary and electrically charged particles. The long-range nature of Coulomb interactions poses a major challenge to molecular dynamics simulations which are major tools for problems at the nano-/micro- scale. Various algorithms are developed to calculate the pairwise Coulomb interactions to a…
▽ More
Coulomb interaction, following an inverse-square force-law, quantifies the amount of force between two stationary and electrically charged particles. The long-range nature of Coulomb interactions poses a major challenge to molecular dynamics simulations which are major tools for problems at the nano-/micro- scale. Various algorithms are developed to calculate the pairwise Coulomb interactions to a linear scaling but the poor scalability limits the size of simulated systems. Here, we conduct an efficient molecular dynamics algorithm with the random batch Ewald method on all-atom systems where the complete Fourier components in the Coulomb interaction are replaced by randomly selected mini-batches. By simulating the $N$-body systems up to 100 million particles using $10$ thousand CPU cores, we show that this algorithm furnishes $O(N)$ complexity, almost perfect scalability and an order of magnitude faster computational speed when compared to the existing state-of-the-art algorithms. Further examinations of our algorithm on distinct systems, including pure water, micro-phase-separated electrolyte and protein solution demonstrate that the spatiotemporal information on all time and length scales investigated and thermodynamic quantities derived from our algorithm are in perfect agreement with those obtained from the existing algorithms. Therefore, our algorithm provides a breakthrough solution on scalability of computing the Coulomb interaction. It is particularly useful and cost-effective to simulate ultra-large systems, which was either impossible or very costing to conduct using existing algorithms, thus would benefit the broad community of sciences.
△ Less
Submitted 10 October, 2021; v1 submitted 10 June, 2021;
originally announced June 2021.
-
FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol Spotting
Authors:
Zhiwen Fan,
Lingjie Zhu,
Honghua Li,
Xiaohao Chen,
Siyu Zhu,
** Tan
Abstract:
Access to large and diverse computer-aided design (CAD) drawings is critical for develo** symbol spotting algorithms. In this paper, we present FloorPlanCAD, a large-scale real-world CAD drawing dataset containing over 10,000 floor plans, ranging from residential to commercial buildings. CAD drawings in the dataset are all represented as vector graphics, which enable us to provide line-grained a…
▽ More
Access to large and diverse computer-aided design (CAD) drawings is critical for develo** symbol spotting algorithms. In this paper, we present FloorPlanCAD, a large-scale real-world CAD drawing dataset containing over 10,000 floor plans, ranging from residential to commercial buildings. CAD drawings in the dataset are all represented as vector graphics, which enable us to provide line-grained annotations of 30 object categories. Equipped by such annotations, we introduce the task of panoptic symbol spotting, which requires to spot not only instances of countable things, but also the semantic of uncountable stuff. Aiming to solve this task, we propose a novel method by combining Graph Convolutional Networks (GCNs) with Convolutional Neural Networks (CNNs), which captures both non-Euclidean and Euclidean features and can be trained end-to-end. The proposed CNN-GCN method achieved state-of-the-art (SOTA) performance on the task of semantic symbol spotting, and help us build a baseline network for the panoptic symbol spotting task. Our contributions are three-fold: 1) to the best of our knowledge, the presented CAD drawing dataset is the first of its kind; 2) the panoptic symbol spotting task considers the spotting of both thing instances and stuff semantic as one recognition problem; and 3) we presented a baseline solution to the panoptic symbol spotting task based on a novel CNN-GCN method, which achieved SOTA performance on semantic symbol spotting. We believe that these contributions will boost research in related areas.
△ Less
Submitted 29 November, 2021; v1 submitted 15 May, 2021;
originally announced May 2021.
-
Silicon Photonics in Optical Access Networks for 5G Communications
Authors:
Xun Guan,
Wei Shi,
Jia Liu,
Peng Tan,
Jim Slevinsky,
Leslie A. Rusch
Abstract:
Only radio access networks can provide connectivity across multiple antenna sites to achieve the great leap forward in capacity targeted by 5G. Optical fronthaul remains a sticking point in that connectivity, and we make the case for analog radio over fiber signals and an optical access network smartedge to achieve the potential of radio access networks. The edge of the network would house the int…
▽ More
Only radio access networks can provide connectivity across multiple antenna sites to achieve the great leap forward in capacity targeted by 5G. Optical fronthaul remains a sticking point in that connectivity, and we make the case for analog radio over fiber signals and an optical access network smartedge to achieve the potential of radio access networks. The edge of the network would house the intelligence that coordinates wireless transmissions to minimize interference and maximize throughput. As silicon photonics provides a hardware platform well adapted to support optical fronthaul, it is poised to drive smart edge adoption. We draw out the issues in adopting oursolution, propose a strategy for network densification, and cite recent demonstrations to support our approach.
△ Less
Submitted 12 July, 2021; v1 submitted 12 May, 2021;
originally announced May 2021.
-
CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution
Authors:
Lizhe Liu,
Xiaohao Chen,
Siyu Zhu,
** Tan
Abstract:
Modern deep-learning-based lane detection methods are successful in most scenarios but struggling for lane lines with complex topologies. In this work, we propose CondLaneNet, a novel top-to-down lane detection framework that detects the lane instances first and then dynamically predicts the line shape for each instance. Aiming to resolve lane instance-level discrimination problem, we introduce a…
▽ More
Modern deep-learning-based lane detection methods are successful in most scenarios but struggling for lane lines with complex topologies. In this work, we propose CondLaneNet, a novel top-to-down lane detection framework that detects the lane instances first and then dynamically predicts the line shape for each instance. Aiming to resolve lane instance-level discrimination problem, we introduce a conditional lane detection strategy based on conditional convolution and row-wise formulation. Further, we design the Recurrent Instance Module(RIM) to overcome the problem of detecting lane lines with complex topologies such as dense lines and fork lines. Benefit from the end-to-end pipeline which requires little post-process, our method has real-time efficiency. We extensively evaluate our method on three benchmarks of lane detection. Results show that our method achieves state-of-the-art performance on all three benchmark datasets. Moreover, our method has the coexistence of accuracy and efficiency, e.g. a 78.14 F1 score and 220 FPS on CULane. Our code is available at https://github.com/aliyun/conditional-lane-detection.
△ Less
Submitted 10 February, 2023; v1 submitted 11 May, 2021;
originally announced May 2021.
-
Superconductivity in Layered van der Waals Hydrogenated Germanene at High Pressure
Authors:
Yilian Xi,
Xiaoling **g,
Zhongfei Xu,
Nana Liu,
Yani Liu,
Miao-Ling Lin,
Ming Yang,
Ying Sun,
**cheng Zhuang,
Xun Xu,
Weichang Hao,
Yanchun Li,
Xiaodong Li,
**-Heng Tan,
Quanjun Li,
Bingbing Liu,
Shi Xue Dou,
Yi Du
Abstract:
Structural and superconducting transitions of layered van der Waals (vdW) hydrogenated germanene (GeH) were observed under high-pressure compression and decompression processes. GeH possesses a superconducting transition at critical temperature (Tc) of 5.41 K at 8.39 GPa. A crystalline to amorphous transition occurs at 16.80 GPa while superconductivity remains. An abnormally increased Tc up to 6.1…
▽ More
Structural and superconducting transitions of layered van der Waals (vdW) hydrogenated germanene (GeH) were observed under high-pressure compression and decompression processes. GeH possesses a superconducting transition at critical temperature (Tc) of 5.41 K at 8.39 GPa. A crystalline to amorphous transition occurs at 16.80 GPa while superconductivity remains. An abnormally increased Tc up to 6.1 K has been observed in the decompression process while the GeH remained amorphous. Thorough in-situ high-pressure synchrotron X-ray diffraction and in-situ high-pressure Raman spectroscopy with the density functional theory simulations suggest that the superconductivity of GeH should be attributed to the increased density of states at the Fermi level as well as the enhanced electron-phonon coupling effect under high pressure. The decompression-driven superconductivity enhancement arises from pressure-induced phonon softening related to an in-plane Ge-Ge phonon mode. As an amorphous metal hydride superconductor, GeH provides a platform to study amorphous hydride superconductivity in layered vdW materials.
△ Less
Submitted 3 June, 2021; v1 submitted 9 May, 2021;
originally announced May 2021.
-
OCRTOC: A Cloud-Based Competition and Benchmark for Robotic Gras** and Manipulation
Authors:
Ziyuan Liu,
Wei Liu,
Yuzhe Qin,
Fanbo Xiang,
Minghao Gou,
Songyan Xin,
Maximo A. Roa,
Berk Calli,
Hao Su,
Yu Sun,
** Tan
Abstract:
In this paper, we propose a cloud-based benchmark for robotic gras** and manipulation, called the OCRTOC benchmark. The benchmark focuses on the object rearrangement problem, specifically table organization tasks. We provide a set of identical real robot setups and facilitate remote experiments of standardized table organization scenarios in varying difficulties. In this workflow, users upload t…
▽ More
In this paper, we propose a cloud-based benchmark for robotic gras** and manipulation, called the OCRTOC benchmark. The benchmark focuses on the object rearrangement problem, specifically table organization tasks. We provide a set of identical real robot setups and facilitate remote experiments of standardized table organization scenarios in varying difficulties. In this workflow, users upload their solutions to our remote server and their code is executed on the real robot setups and scored automatically. After each execution, the OCRTOC team resets the experimental setup manually. We also provide a simulation environment that researchers can use to develop and test their solutions. With the OCRTOC benchmark, we aim to lower the barrier of conducting reproducible research on robotic gras** and manipulation and accelerate progress in this field. Executing standardized scenarios on identical real robot setups allows us to quantify algorithm performances and achieve fair comparisons. Using this benchmark we held a competition in the 2020 International Conference on Intelligence Robots and Systems (IROS 2020). In total, 59 teams took part in this competition worldwide. We present the results and our observations of the 2020 competition, and discuss our adjustments and improvements for the upcoming OCRTOC 2021 competition. The homepage of the OCRTOC competition is www.ocrtoc.org, and the OCRTOC software package is available at https://github.com/OCRTOC/OCRTOC_software_package.
△ Less
Submitted 18 July, 2021; v1 submitted 23 April, 2021;
originally announced April 2021.
-
Ghost factors in Gauss-sum factorization with transmon qubits
Authors:
Lin Htoo Zaw,
Yuanzheng Paul Tan,
Long Hoang Nguyen,
Rangga P. Budoyo,
Kun Hee Park,
Zhi Yang Koh,
Alessandro Landra,
Christoph Hufnagel,
Yung Szen Yap,
Teck Seng Koh,
Rainer Dumke
Abstract:
A challenge in the Gauss sums factorization scheme is the presence of ghost factors - non-factors that behave similarly to actual factors of an integer - which might lead to the misidentification of non-factors as factors or vice versa, especially in the presence of noise. We investigate Type II ghost factors, which are the class of ghost factors that cannot be suppressed with techniques previousl…
▽ More
A challenge in the Gauss sums factorization scheme is the presence of ghost factors - non-factors that behave similarly to actual factors of an integer - which might lead to the misidentification of non-factors as factors or vice versa, especially in the presence of noise. We investigate Type II ghost factors, which are the class of ghost factors that cannot be suppressed with techniques previously laid out in the literature. The presence of Type II ghost factors and the coherence time of the qubit set an upper limit for the total experiment time, and hence the largest factorizable number with this scheme. Discernability is a figure of merit introduced to characterize this behavior. We introduce preprocessing as a strategy to increase the discernability of a system, and demonstrate the technique with a transmon qubit. This can bring the total experiment time of the system closer to its decoherence limit, and increase the largest factorizable number.
△ Less
Submitted 8 December, 2021; v1 submitted 22 April, 2021;
originally announced April 2021.
-
Stereo Matching by Self-supervision of Multiscopic Vision
Authors:
Weihao Yuan,
Yazhan Zhang,
Bingkun Wu,
Siyu Zhu,
** Tan,
Michael Yu Wang,
Qifeng Chen
Abstract:
Self-supervised learning for depth estimation possesses several advantages over supervised learning. The benefits of no need for ground-truth depth, online fine-tuning, and better generalization with unlimited data attract researchers to seek self-supervised solutions. In this work, we propose a new self-supervised framework for stereo matching utilizing multiple images captured at aligned camera…
▽ More
Self-supervised learning for depth estimation possesses several advantages over supervised learning. The benefits of no need for ground-truth depth, online fine-tuning, and better generalization with unlimited data attract researchers to seek self-supervised solutions. In this work, we propose a new self-supervised framework for stereo matching utilizing multiple images captured at aligned camera positions. A cross photometric loss, an uncertainty-aware mutual-supervision loss, and a new smoothness loss are introduced to optimize the network in learning disparity maps end-to-end without ground-truth depth information. To train this framework, we build a new multiscopic dataset consisting of synthetic images rendered by 3D engines and real images captured by real cameras. After being trained with only the synthetic images, our network can perform well in unseen outdoor scenes. Our experiment shows that our model obtains better disparity maps than previous unsupervised methods on the KITTI dataset and is comparable to supervised methods when generalized to unseen data. Our source code and dataset are available at https://sites.google.com/view/multiscopic.
△ Less
Submitted 16 August, 2021; v1 submitted 8 April, 2021;
originally announced April 2021.
-
Riggable 3D Face Reconstruction via In-Network Optimization
Authors:
Ziqian Bai,
Zhaopeng Cui,
Xiaoming Liu,
** Tan
Abstract:
This paper presents a method for riggable 3D face reconstruction from monocular images, which jointly estimates a personalized face rig and per-image parameters including expressions, poses, and illuminations. To achieve this goal, we design an end-to-end trainable network embedded with a differentiable in-network optimization. The network first parameterizes the face rig as a compact latent code…
▽ More
This paper presents a method for riggable 3D face reconstruction from monocular images, which jointly estimates a personalized face rig and per-image parameters including expressions, poses, and illuminations. To achieve this goal, we design an end-to-end trainable network embedded with a differentiable in-network optimization. The network first parameterizes the face rig as a compact latent code with a neural decoder, and then estimates the latent code as well as per-image parameters via a learnable optimization. By estimating a personalized face rig, our method goes beyond static reconstructions and enables downstream applications such as video retargeting. In-network optimization explicitly enforces constraints derived from the first principles, thus introduces additional priors than regression-based methods. Finally, data-driven priors from deep learning are utilized to constrain the ill-posed monocular setting and ease the optimization difficulty. Experiments demonstrate that our method achieves SOTA reconstruction accuracy, reasonable robustness and generalization ability, and supports standard face rig applications.
△ Less
Submitted 7 April, 2021;
originally announced April 2021.
-
Phonon-related monochromatic THz radiation and its magneto-modulation in 2D ferromagnetic Cr2Ge2Te6
Authors:
Long Cheng,
Hui** Li,
Gaoting Lin,
Jian Yan,
Lei Zhang,
Cheng Yang,
Wei Tong,
Zhuang Ren,
Wang Zhu,
Xin Cong,
**g**g Gao,
**heng Tan,
Xuan Luo,
Yu** sun,
Wenguang Zhu,
Zhigao Sheng
Abstract:
Searching multiple types of terahertz (THz) irradiation source is crucial for the THz technology. Here, by utilizing a two-dimensional (2D) ferromagnetic Cr2Ge2Te6 crystal, we firstly demonstrate a magneto-tunable monochromatic THz irradiation source. With a low-photonic-energy broadband THz pump, a strong THz irradiation with frequency ~0.9 THz and bandwidth ~0.25 THz can be generated and its con…
▽ More
Searching multiple types of terahertz (THz) irradiation source is crucial for the THz technology. Here, by utilizing a two-dimensional (2D) ferromagnetic Cr2Ge2Te6 crystal, we firstly demonstrate a magneto-tunable monochromatic THz irradiation source. With a low-photonic-energy broadband THz pump, a strong THz irradiation with frequency ~0.9 THz and bandwidth ~0.25 THz can be generated and its conversion efficiency could even reach 2.1% at 160 K. Moreover, it is intriguing to find that such monochromatic THz irradiation can be efficiently modulated by the magnetic field below 160 K. According to both experimental and theoretical analyses, the emergent THz irradiation is identified as the emission from the phonon-polariton and its temperature and magnetic field dependent behaviors confirmed the large spin-lattice coupling in this 2D ferromagnetic crystal. These observations provide a new route for the creation of tunable monochromatic THz source which may have great practical interests in future applications in photonic and spintronic devices.
△ Less
Submitted 12 May, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Learning Camera Localization via Dense Scene Matching
Authors:
Shitao Tang,
Chengzhou Tang,
Rui Huang,
Siyu Zhu,
** Tan
Abstract:
Camera localization aims to estimate 6 DoF camera poses from RGB images. Traditional methods detect and match interest points between a query image and a pre-built 3D model. Recent learning-based approaches encode scene structures into a specific convolutional neural network (CNN) and thus are able to predict dense coordinates from RGB images. However, most of them require re-training or re-adapti…
▽ More
Camera localization aims to estimate 6 DoF camera poses from RGB images. Traditional methods detect and match interest points between a query image and a pre-built 3D model. Recent learning-based approaches encode scene structures into a specific convolutional neural network (CNN) and thus are able to predict dense coordinates from RGB images. However, most of them require re-training or re-adaption for a new scene and have difficulties in handling large-scale scenes due to limited network capacity. We present a new method for scene agnostic camera localization using dense scene matching (DSM), where a cost volume is constructed between a query image and a scene. The cost volume and the corresponding coordinates are processed by a CNN to predict dense coordinates. Camera poses can then be solved by PnP algorithms. In addition, our method can be extended to temporal domain, which leads to extra performance boost during testing time. Our scene-agnostic approach achieves comparable accuracy as the existing scene-specific approaches, such as KFNet, on the 7scenes and Cambridge benchmark. This approach also remarkably outperforms state-of-the-art scene-agnostic dense coordinate regression network SANet. The Code is available at https://github.com/Tangshitao/Dense-Scene-Matching.
△ Less
Submitted 30 March, 2021;
originally announced March 2021.
-
HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences
Authors:
Feitong Tan,
Danhang Tang,
Mingsong Dou,
Kaiwen Guo,
Rohit Pandey,
Cem Keskin,
Ruofei Du,
Deqing Sun,
Sofien Bouaziz,
Sean Fanello,
** Tan,
Yinda Zhang
Abstract:
In this paper, we address the problem of building dense correspondences between human images under arbitrary camera viewpoints and body poses. Prior art either assumes small motion between frames or relies on local descriptors, which cannot handle large motion or visually ambiguous body parts, e.g., left vs. right hand. In contrast, we propose a deep learning framework that maps each pixel to a fe…
▽ More
In this paper, we address the problem of building dense correspondences between human images under arbitrary camera viewpoints and body poses. Prior art either assumes small motion between frames or relies on local descriptors, which cannot handle large motion or visually ambiguous body parts, e.g., left vs. right hand. In contrast, we propose a deep learning framework that maps each pixel to a feature space, where the feature distances reflect the geodesic distances among pixels as if they were projected onto the surface of a 3D human scan. To this end, we introduce novel loss functions to push features apart according to their geodesic distances on the surface. Without any semantic annotation, the proposed embeddings automatically learn to differentiate visually similar parts and align different subjects into an unified feature space. Extensive experiments show that the learned embeddings can produce accurate correspondences between images with remarkable generalization capabilities on both intra and inter subjects.
△ Less
Submitted 29 March, 2021;
originally announced March 2021.
-
AR Map**: Accurate and Efficient Map** for Augmented Reality
Authors:
Rui Huang,
Chuan Fang,
Kejie Qiu,
Le Cui,
Zilong Dong,
Siyu Zhu,
** Tan
Abstract:
Augmented reality (AR) has gained increasingly attention from both research and industry communities. By overlaying digital information and content onto the physical world, AR enables users to experience the world in a more informative and efficient manner. As a major building block for AR systems, localization aims at determining the device's pose from a pre-built "map" consisting of visual and d…
▽ More
Augmented reality (AR) has gained increasingly attention from both research and industry communities. By overlaying digital information and content onto the physical world, AR enables users to experience the world in a more informative and efficient manner. As a major building block for AR systems, localization aims at determining the device's pose from a pre-built "map" consisting of visual and depth information in a known environment. While the localization problem has been widely studied in the literature, the "map" for AR systems is rarely discussed. In this paper, we introduce the AR Map for a specific scene to be composed of 1) color images with 6-DOF poses; 2) dense depth maps for each image and 3) a complete point cloud map. We then propose an efficient end-to-end solution to generating and evaluating AR Maps. Firstly, for efficient data capture, a backpack scanning device is presented with a unified calibration pipeline. Secondly, we propose an AR map** pipeline which takes the input from the scanning device and produces accurate AR Maps. Finally, we present an approach to evaluating the accuracy of AR Maps with the help of the highly accurate reconstruction result from a high-end laser scanner. To the best of our knowledge, it is the first time to present an end-to-end solution to efficient and accurate map** for AR applications.
△ Less
Submitted 27 March, 2021;
originally announced March 2021.
-
Compact 3D Map-Based Monocular Localization Using Semantic Edge Alignment
Authors:
Kejie Qiu,
Shenzhou Chen,
Jiahui Zhang,
Rui Huang,
Le Cui,
Siyu Zhu,
** Tan
Abstract:
Accurate localization is fundamental to a variety of applications, such as navigation, robotics, autonomous driving, and Augmented Reality (AR). Different from incremental localization, global localization has no drift caused by error accumulation, which is desired in many application scenarios. In addition to GPS used in the open air, 3D maps are also widely used as alternative global localizatio…
▽ More
Accurate localization is fundamental to a variety of applications, such as navigation, robotics, autonomous driving, and Augmented Reality (AR). Different from incremental localization, global localization has no drift caused by error accumulation, which is desired in many application scenarios. In addition to GPS used in the open air, 3D maps are also widely used as alternative global localization references. In this paper, we propose a compact 3D map-based global localization system using a low-cost monocular camera and an IMU (Inertial Measurement Unit). The proposed compact map consists of two types of simplified elements with multiple semantic labels, which is well adaptive to various man-made environments like urban environments. Also, semantic edge features are used for the key image-map registration, which is robust against occlusion and long-term appearance changes in the environments. To further improve the localization performance, the key semantic edge alignment is formulated as an optimization problem based on initial poses predicted by an independent VIO (Visual-Inertial Odometry) module. The localization system is realized with modular design in real time. We evaluate the localization accuracy through real-world experimental results compared with ground truth, long-term localization performance is also demonstrated.
△ Less
Submitted 27 March, 2021;
originally announced March 2021.
-
Learning Efficient Photometric Feature Transform for Multi-view Stereo
Authors:
Kaizhang Kang,
Cihui Xie,
Ruisheng Zhu,
Xiaohe Ma,
** Tan,
Hongzhi Wu,
Kun Zhou
Abstract:
We present a novel framework to learn to convert the perpixel photometric information at each view into spatially distinctive and view-invariant low-level features, which can be plugged into existing multi-view stereo pipeline for enhanced 3D reconstruction. Both the illumination conditions during acquisition and the subsequent per-pixel feature transform can be jointly optimized in a differentiab…
▽ More
We present a novel framework to learn to convert the perpixel photometric information at each view into spatially distinctive and view-invariant low-level features, which can be plugged into existing multi-view stereo pipeline for enhanced 3D reconstruction. Both the illumination conditions during acquisition and the subsequent per-pixel feature transform can be jointly optimized in a differentiable fashion. Our framework automatically adapts to and makes efficient use of the geometric information available in different forms of input data. High-quality 3D reconstructions of a variety of challenging objects are demonstrated on the data captured with an illumination multiplexing device, as well as a point light. Our results compare favorably with state-of-the-art techniques.
△ Less
Submitted 26 March, 2021;
originally announced March 2021.
-
DRO: Deep Recurrent Optimizer for Video to Depth
Authors:
Xiaodong Gu,
Weihao Yuan,
Zuozhuo Dai,
Siyu Zhu,
Chengzhou Tang,
Zilong Dong,
** Tan
Abstract:
There are increasing interests of studying the video-to-depth (V2D) problem with machine learning techniques. While earlier methods directly learn a map** from images to depth maps and camera poses, more recent works enforce multi-view geometry constraints through optimization embedded in the learning framework. This paper presents a novel optimization method based on recurrent neural networks t…
▽ More
There are increasing interests of studying the video-to-depth (V2D) problem with machine learning techniques. While earlier methods directly learn a map** from images to depth maps and camera poses, more recent works enforce multi-view geometry constraints through optimization embedded in the learning framework. This paper presents a novel optimization method based on recurrent neural networks to further exploit the potential of neural networks in V2D. Specifically, our neural optimizer alternately updates the depth and camera poses through iterations to minimize a feature-metric cost, and two gated recurrent units iteratively improve the results by tracing historical information. Extensive experimental results demonstrate that our method outperforms previous methods and is more efficient in computation and memory consumption than cost-volume-based methods. In particular, our self-supervised method outperforms previous supervised methods on the KITTI and ScanNet datasets. Our source code is available at https://github.com/aliyun/dro-sfm.
△ Less
Submitted 7 March, 2023; v1 submitted 24 March, 2021;
originally announced March 2021.
-
Single-Shot is Enough: Panoramic Infrastructure Based Calibration of Multiple Cameras and 3D LiDARs
Authors:
Chuan Fang,
Shuai Ding,
Zilong Dong,
Honghua Li,
Siyu Zhu,
** Tan
Abstract:
The integration of multiple cameras and 3D Li- DARs has become basic configuration of augmented reality devices, robotics, and autonomous vehicles. The calibration of multi-modal sensors is crucial for a system to properly function, but it remains tedious and impractical for mass production. Moreover, most devices require re-calibration after usage for certain period of time. In this paper, we pro…
▽ More
The integration of multiple cameras and 3D Li- DARs has become basic configuration of augmented reality devices, robotics, and autonomous vehicles. The calibration of multi-modal sensors is crucial for a system to properly function, but it remains tedious and impractical for mass production. Moreover, most devices require re-calibration after usage for certain period of time. In this paper, we propose a single-shot solution for calibrating extrinsic transformations among multiple cameras and 3D LiDARs. We establish a panoramic infrastructure, in which a camera or LiDAR can be robustly localized using data from single frame. Experiments are conducted on three devices with different camera-LiDAR configurations, showing that our approach achieved comparable calibration accuracy with the state-of-the-art approaches but with much greater efficiency.
△ Less
Submitted 10 July, 2021; v1 submitted 23 March, 2021;
originally announced March 2021.
-
Cluster Contrast for Unsupervised Person Re-Identification
Authors:
Zuozhuo Dai,
Guangyuan Wang,
Weihao Yuan,
Xiaoli Liu,
Siyu Zhu,
** Tan
Abstract:
State-of-the-art unsupervised re-ID methods train the neural networks using a memory-based non-parametric softmax loss. Instance feature vectors stored in memory are assigned pseudo-labels by clustering and updated at instance level. However, the varying cluster sizes leads to inconsistency in the updating progress of each cluster. To solve this problem, we present Cluster Contrast which stores fe…
▽ More
State-of-the-art unsupervised re-ID methods train the neural networks using a memory-based non-parametric softmax loss. Instance feature vectors stored in memory are assigned pseudo-labels by clustering and updated at instance level. However, the varying cluster sizes leads to inconsistency in the updating progress of each cluster. To solve this problem, we present Cluster Contrast which stores feature vectors and computes contrast loss at the cluster level. Our approach employs a unique cluster representation to describe each cluster, resulting in a cluster-level memory dictionary. In this way, the consistency of clustering can be effectively maintained throughout the pipline and the GPU memory consumption can be significantly reduced. Thus, our method can solve the problem of cluster inconsistency and be applicable to larger data sets. In addition, we adopt different clustering algorithms to demonstrate the robustness and generalization of our framework. The application of Cluster Contrast to a standard unsupervised re-ID pipeline achieves considerable improvements of 9.9%, 8.3%, 12.1% compared to state-of-the-art purely unsupervised re-ID methods and 5.5%, 4.8%, 4.4% mAP compared to the state-of-the-art unsupervised domain adaptation re-ID methods on the Market, Duke, and MSMT17 datasets. Code is available at https://github.com/alibaba/cluster-contrast.
△ Less
Submitted 10 February, 2023; v1 submitted 21 March, 2021;
originally announced March 2021.
-
Quantum circuits with many photons on a programmable nanophotonic chip
Authors:
J. M. Arrazola,
V. Bergholm,
K. Brádler,
T. R. Bromley,
M. J. Collins,
I. Dhand,
A. Fumagalli,
T. Gerrits,
A. Goussev,
L. G. Helt,
J. Hundal,
T. Isacsson,
R. B. Israel,
J. Izaac,
S. Jahangiri,
R. Janik,
N. Killoran,
S. P. Kumar,
J. Lavoie,
A. E. Lita,
D. H. Mahler,
M. Menotti,
B. Morrison,
S. W. Nam,
L. Neuhaus
, et al. (14 additional authors not shown)
Abstract:
Growing interest in quantum computing for practical applications has led to a surge in the availability of programmable machines for executing quantum algorithms. Present day photonic quantum computers have been limited either to non-deterministic operation, low photon numbers and rates, or fixed random gate sequences. Here we introduce a full-stack hardware-software system for executing many-phot…
▽ More
Growing interest in quantum computing for practical applications has led to a surge in the availability of programmable machines for executing quantum algorithms. Present day photonic quantum computers have been limited either to non-deterministic operation, low photon numbers and rates, or fixed random gate sequences. Here we introduce a full-stack hardware-software system for executing many-photon quantum circuits using integrated nanophotonics: a programmable chip, operating at room temperature and interfaced with a fully automated control system. It enables remote users to execute quantum algorithms requiring up to eight modes of strongly squeezed vacuum initialized as two-mode squeezed states in single temporal modes, a fully general and programmable four-mode interferometer, and genuine photon number-resolving readout on all outputs. Multi-photon detection events with photon numbers and rates exceeding any previous quantum optical demonstration on a programmable device are made possible by strong squeezing and high sampling rates. We verify the non-classicality of the device output, and use the platform to carry out proof-of-principle demonstrations of three quantum algorithms: Gaussian boson sampling, molecular vibronic spectra, and graph similarity.
△ Less
Submitted 2 March, 2021;
originally announced March 2021.
-
Solving Inverse Problems by Joint Posterior Maximization with Autoencoding Prior
Authors:
Mario González,
Andrés Almansa,
Pauline Tan
Abstract:
In this work we address the problem of solving ill-posed inverse problems in imaging where the prior is a variational autoencoder (VAE). Specifically we consider the decoupled case where the prior is trained once and can be reused for many different log-concave degradation models without retraining. Whereas previous MAP-based approaches to this problem lead to highly non-convex optimization algori…
▽ More
In this work we address the problem of solving ill-posed inverse problems in imaging where the prior is a variational autoencoder (VAE). Specifically we consider the decoupled case where the prior is trained once and can be reused for many different log-concave degradation models without retraining. Whereas previous MAP-based approaches to this problem lead to highly non-convex optimization algorithms, our approach computes the joint (space-latent) MAP that naturally leads to alternate optimization algorithms and to the use of a stochastic encoder to accelerate computations. The resulting technique (JPMAP) performs Joint Posterior Maximization using an Autoencoding Prior. We show theoretical and experimental evidence that the proposed objective function is quite close to bi-convex. Indeed it satisfies a weak bi-convexity property which is sufficient to guarantee that our optimization scheme converges to a stationary point. We also highlight the importance of correctly training the VAE using a denoising criterion, in order to ensure that the encoder generalizes well to out-of-distribution images, without affecting the quality of the generative model. This simple modification is key to providing robustness to the whole procedure. Finally we show how our joint MAP methodology relates to more common MAP approaches, and we propose a continuation scheme that makes use of our JPMAP algorithm to provide more robust MAP estimates. Experimental results also show the higher quality of the solutions obtained by our JPMAP approach with respect to other non-convex MAP approaches which more often get stuck in spurious local optima.
△ Less
Submitted 25 April, 2022; v1 submitted 2 March, 2021;
originally announced March 2021.
-
s-d coupling enhanced phonon anharmonicity in copper-based compounds
Authors:
Kaike Yang,
Huai Yang,
Yujia Sun,
Zhongming Wei,
Jun Zhang,
Jun-Wei Luo,
**-Heng Tan,
Shu-Shen Li,
Su-Huai Wei,
Hui-Xiong Deng
Abstract:
Materials with ultralow thermal conductivity are of great interest for efficient energy conversion and thermal barrier coating. Copper-based semiconductors such as copper chalcogenides and copper halides are known to possess extreme low thermal conductivity, whereas the fundamental origin of the low thermal conductivity observed in the copper-based materials remains elusive. Here, we reveal that s…
▽ More
Materials with ultralow thermal conductivity are of great interest for efficient energy conversion and thermal barrier coating. Copper-based semiconductors such as copper chalcogenides and copper halides are known to possess extreme low thermal conductivity, whereas the fundamental origin of the low thermal conductivity observed in the copper-based materials remains elusive. Here, we reveal that s-d coupling induced giant phonon anharmonicity is the fundamental mechanism responsible for the ultralow thermal conductivity of copper compounds. The symmetry controlled strong coupling of high-lying occupied copper 3d orbital with the unoccupied 4s state under thermal vibration remarkably lowers the lattice potential barrier, which enhances anharmonic scattering between phonons. This understanding is confirmed by temperature-dependent Raman spectra measurements. Our study offers an insight at atomic level connecting electronic structures with phonon vibration modes, and thus sheds light on materials properties that rely on electron-phonon coupling, such as thermoelectricity and superconductivity.
△ Less
Submitted 24 February, 2021;
originally announced February 2021.
-
Learning Deep Neural Networks under Agnostic Corrupted Supervision
Authors:
Boyang Liu,
Mengying Sun,
Ding Wang,
Pang-Ning Tan,
Jiayu Zhou
Abstract:
Training deep neural models in the presence of corrupted supervision is challenging as the corrupted data points may significantly impact the generalization performance. To alleviate this problem, we present an efficient robust algorithm that achieves strong guarantees without any assumption on the type of corruption and provides a unified framework for both classification and regression problems.…
▽ More
Training deep neural models in the presence of corrupted supervision is challenging as the corrupted data points may significantly impact the generalization performance. To alleviate this problem, we present an efficient robust algorithm that achieves strong guarantees without any assumption on the type of corruption and provides a unified framework for both classification and regression problems. Unlike many existing approaches that quantify the quality of the data points (e.g., based on their individual loss values), and filter them accordingly, the proposed algorithm focuses on controlling the collective impact of data points on the average gradient. Even when a corrupted data point failed to be excluded by our algorithm, the data point will have a very limited impact on the overall loss, as compared with state-of-the-art filtering methods based on loss values. Extensive experiments on multiple benchmark datasets have demonstrated the robustness of our algorithm under different types of corruption.
△ Less
Submitted 12 February, 2021;
originally announced February 2021.
-
The minimum linear locality of linear codes
Authors:
Pan Tan,
Cuiling Fan,
Cunsheng Ding,
Zhengchun Zhou
Abstract:
Locally recoverable codes (LRCs) were proposed for the recovery of data in distributed and cloud storage systems about nine years ago. A lot of progress on the study of LRCs has been made by now. However, there is a lack of general theory on the minimum linear locality of linear codes. In addition, the minimum linear locality of many known families of linear codes is not studied in the literature.…
▽ More
Locally recoverable codes (LRCs) were proposed for the recovery of data in distributed and cloud storage systems about nine years ago. A lot of progress on the study of LRCs has been made by now. However, there is a lack of general theory on the minimum linear locality of linear codes. In addition, the minimum linear locality of many known families of linear codes is not studied in the literature. Motivated by these two facts, this paper develops some general theory about the minimum linear locality of linear codes, and investigates the minimum linear locality of a number of families of linear codes, such as $q$-ary Hamming codes, $q$-ary Simplex codes, generalized Reed-Muller codes, ovoid codes, maximum arc codes, the extended hyperoval codes, and near MDS codes. Many classes of both distance-optimal and dimension-optimal LRCs are presented in this paper. The minimum linear locality of many families of linear codes are settled with the general theory developed in this paper.
△ Less
Submitted 31 January, 2021;
originally announced February 2021.
-
Measuring bulk and surface acoustic modes in diamond by angle-resolved Brillouin spectroscopy
Authors:
Ya-Ru Xie,
Shu-Liang Ren,
Yuan-Fei Gao,
Xue-Lu Liu,
**-Heng Tan,
Jun Zhang
Abstract:
The acoustic modes of diamond not only are of profound significance for studying its thermal conductivity, mechanical properties and optical properties, but also play a determined role in the performance of high-frequency and high-power acoustic wave devices. Here we report the bulk acoustic waves (BAWs) and surface acoustic waves (SAWs) of single crystal diamond by using the angle-resolved Brillo…
▽ More
The acoustic modes of diamond not only are of profound significance for studying its thermal conductivity, mechanical properties and optical properties, but also play a determined role in the performance of high-frequency and high-power acoustic wave devices. Here we report the bulk acoustic waves (BAWs) and surface acoustic waves (SAWs) of single crystal diamond by using the angle-resolved Brillouin light scattering (BLS) spectroscopy. We identify two high-velocity surface skimming bulk waves, with sound velocities of 1.277x10^6 and 1.727x10^6 cm/s, respectively. Furthermore, we conduct the relationship among the refractive index, incident angle and the velocities of BAWs propagating along an arbitrary direction. Our results may provide a valuable reference for fundamental researches and devices engineering in the community of diamond-based acoustic study.
△ Less
Submitted 9 January, 2021;
originally announced January 2021.
-
AIM 2020 Challenge on Learned Image Signal Processing Pipeline
Authors:
Andrey Ignatov,
Radu Timofte,
Zhilu Zhang,
Ming Liu,
Haolin Wang,
Wangmeng Zuo,
Jiawei Zhang,
Ruimao Zhang,
Zhanglin Peng,
Sijie Ren,
Linhui Dai,
Xiaohong Liu,
Chengqi Li,
Jun Chen,
Yuichi Ito,
Bhavya Vasudeva,
Puneesh Deora,
Umapada Pal,
Zhenyu Guo,
Yu Zhu,
Tian Liang,
Chenghua Li,
Cong Leng,
Zhihong Pan,
Baopu Li
, et al. (14 additional authors not shown)
Abstract:
This paper reviews the second AIM learned ISP challenge and provides the description of the proposed solutions and results. The participating teams were solving a real-world RAW-to-RGB map** problem, where to goal was to map the original low-quality RAW images captured by the Huawei P20 device to the same photos obtained with the Canon 5D DSLR camera. The considered task embraced a number of com…
▽ More
This paper reviews the second AIM learned ISP challenge and provides the description of the proposed solutions and results. The participating teams were solving a real-world RAW-to-RGB map** problem, where to goal was to map the original low-quality RAW images captured by the Huawei P20 device to the same photos obtained with the Canon 5D DSLR camera. The considered task embraced a number of complex computer vision subtasks, such as image demosaicing, denoising, white balancing, color and contrast correction, demoireing, etc. The target metric used in this challenge combined fidelity scores (PSNR and SSIM) with solutions' perceptual results measured in a user study. The proposed solutions significantly improved the baseline results, defining the state-of-the-art for practical image signal processing pipeline modeling.
△ Less
Submitted 10 November, 2020;
originally announced November 2020.
-
Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning
Authors:
Cong Zhang,
Wen Song,
Zhiguang Cao,
Jie Zhang,
Puay Siew Tan,
Chi Xu
Abstract:
Priority dispatching rule (PDR) is widely used for solving real-world Job-shop scheduling problem (JSSP). However, the design of effective PDRs is a tedious task, requiring a myriad of specialized knowledge and often delivering limited performance. In this paper, we propose to automatically learn PDRs via an end-to-end deep reinforcement learning agent. We exploit the disjunctive graph representat…
▽ More
Priority dispatching rule (PDR) is widely used for solving real-world Job-shop scheduling problem (JSSP). However, the design of effective PDRs is a tedious task, requiring a myriad of specialized knowledge and often delivering limited performance. In this paper, we propose to automatically learn PDRs via an end-to-end deep reinforcement learning agent. We exploit the disjunctive graph representation of JSSP, and propose a Graph Neural Network based scheme to embed the states encountered during solving. The resulting policy network is size-agnostic, effectively enabling generalization on large-scale instances. Experiments show that the agent can learn high-quality PDRs from scratch with elementary raw features, and demonstrates strong performance against the best existing PDRs. The learned policies also perform well on much larger instances that are unseen in training.
△ Less
Submitted 23 October, 2020;
originally announced October 2020.
-
Hyperbolic jigsaws and families of pseudomodular groups II
Authors:
Beicheng Lou,
Ser Peow Tan,
Anh Duc Vo
Abstract:
In our previous paper, we introduced a hyperbolic jigsaw construction and constructed infinitely many non-commensurable, non-uniform, non-arithmetic lattices of $\mathrm{PSL}(2, \mathbb{R})$ with cusp set $\mathbb{Q} \cup \{\infty\}$ (called pseudomodular groups by Long and Reid), thus answering a question posed by Long and Reid. In this paper, we continue with our study of these jigsaw groups exp…
▽ More
In our previous paper, we introduced a hyperbolic jigsaw construction and constructed infinitely many non-commensurable, non-uniform, non-arithmetic lattices of $\mathrm{PSL}(2, \mathbb{R})$ with cusp set $\mathbb{Q} \cup \{\infty\}$ (called pseudomodular groups by Long and Reid), thus answering a question posed by Long and Reid. In this paper, we continue with our study of these jigsaw groups exploring questions of arithmeticity, pseudomodularity, and also related pseudo-euclidean and continued fraction algorithms arising from these groups. We also answer another question of Long and Reid by demonstrating a recursive formula for the tessellation of the hyperbolic plane arising from Weierstrass groups which generalizes the well-known "Farey addition" used to generate the Farey tessellation.
△ Less
Submitted 20 October, 2020;
originally announced October 2020.
-
MeshMVS: Multi-View Stereo Guided Mesh Reconstruction
Authors:
Rakesh Shrestha,
Zhiwen Fan,
Qingkun Su,
Zuozhuo Dai,
Siyu Zhu,
** Tan
Abstract:
Deep learning based 3D shape generation methods generally utilize latent features extracted from color images to encode the semantics of objects and guide the shape generation process. These color image semantics only implicitly encode 3D information, potentially limiting the accuracy of the generated shapes. In this paper we propose a multi-view mesh generation method which incorporates geometry…
▽ More
Deep learning based 3D shape generation methods generally utilize latent features extracted from color images to encode the semantics of objects and guide the shape generation process. These color image semantics only implicitly encode 3D information, potentially limiting the accuracy of the generated shapes. In this paper we propose a multi-view mesh generation method which incorporates geometry information explicitly by using the features from intermediate depth representations of multi-view stereo and regularizing the 3D shapes against these depth images. First, our system predicts a coarse 3D volume from the color images by probabilistically merging voxel occupancy grids from the prediction of individual views. Then the depth images from multi-view stereo along with the rendered depth images of the coarse shape are used as a contrastive input whose features guide the refinement of the coarse shape through a series of graph convolution networks. Notably, we achieve superior results than state-of-the-art multi-view shape generation methods with 34% decrease in Chamfer distance to ground truth and 14% increase in F1-score on ShapeNet dataset.Our source code is available at https://git.io/Jmalg
△ Less
Submitted 11 April, 2021; v1 submitted 16 October, 2020;
originally announced October 2020.
-
Dynamic fingerprint of fractionalized excitations in single-crystalline Cu$_3$Zn(OH)$_6$FBr
Authors:
Ying Fu,
Miao-Ling Lin,
Le Wang,
Qiye Liu,
Lianglong Huang,
Wenrui Jiang,
Zhanyang Hao,
Cai Liu,
Hu Zhang,
Xingqiang Shi,
Jun Zhang,
Junfeng Dai,
Dapeng Yu,
Fei Ye,
Patrick A. Lee,
**-Heng Tan,
Jia-Wei Mei
Abstract:
Quantum spin liquid (QSL) represents a new class of condensed matter states characterized by the long-range many-body entanglement of topological orders. The most prominent feature of the elusive QSL state is the existence of fractionalized spin excitations. Subject to the strong quantum fluctuations, the spin-1/2 antiferromagnetic system on a kagome lattice is the promising candidate for hosting…
▽ More
Quantum spin liquid (QSL) represents a new class of condensed matter states characterized by the long-range many-body entanglement of topological orders. The most prominent feature of the elusive QSL state is the existence of fractionalized spin excitations. Subject to the strong quantum fluctuations, the spin-1/2 antiferromagnetic system on a kagome lattice is the promising candidate for hosting a QSL ground state, but the structurally ideal realization is rare. Here, we report Raman scattering on the single crystalline Cu$_3$Zn(OH)$_6$FBr, and confirm that the ideal kagome structure remains down to low temperatures without any lattice distortion by the angle-resolved polarized Raman responses and second-harmonic-generation measurements. Furthermore, at low temperatures the Raman scattering reveals a continuum of the spin excitations in Cu$_3$Zn(OH)$_6$FBr, in contrast to the sharp magnon peak in the ordered kagome antiferromagnet EuCu$_3$(OH)$_6$Cl$_3$. Such magnetic Raman continuum, in particular, the substantial low-energy one-pair spinon excitation serves as strong evidence for fractionalized spin excitations in Cu$_3$Zn(OH)$_6$FBr.
△ Less
Submitted 15 October, 2020;
originally announced October 2020.
-
Fairness Perception from a Network-Centric Perspective
Authors:
Farzan Masrour,
Pang-Ning Tan,
Abdol-Hossein Esfahanian
Abstract:
Algorithmic fairness is a major concern in recent years as the influence of machine learning algorithms becomes more widespread. In this paper, we investigate the issue of algorithmic fairness from a network-centric perspective. Specifically, we introduce a novel yet intuitive function known as network-centric fairness perception and provide an axiomatic approach to analyze its properties. Using a…
▽ More
Algorithmic fairness is a major concern in recent years as the influence of machine learning algorithms becomes more widespread. In this paper, we investigate the issue of algorithmic fairness from a network-centric perspective. Specifically, we introduce a novel yet intuitive function known as network-centric fairness perception and provide an axiomatic approach to analyze its properties. Using a peer-review network as case study, we also examine its utility in terms of assessing the perception of fairness in paper acceptance decisions. We show how the function can be extended to a group fairness metric known as fairness visibility and demonstrate its relationship to demographic parity. We also illustrate a potential pitfall of the fairness visibility measure that can be exploited to mislead individuals into perceiving that the algorithmic decisions are fair. We demonstrate how the problem can be alleviated by increasing the local neighborhood size of the fairness perception function.
△ Less
Submitted 7 October, 2020;
originally announced October 2020.
-
Phonon Renormalization in Reconstructed MoS$_2$ Moiré Superlattices
Authors:
Jiamin Quan,
Lukas Linhart,
Miao-Ling Lin,
Daehun Lee,
Jihang Zhu,
Chun-Yuan Wang,
Wei-Ting Hsu,
Junho Choi,
Jacob Embley,
Carter Young,
Takashi Taniguchi,
Kenji Watanabe,
Chih-Kang Shih,
Keji Lai,
Allan H. MacDonald,
**-Heng Tan,
Florian Libisch,
Xiaoqin Li
Abstract:
In moiré crystals formed by stacking van der Waals (vdW) materials, surprisingly diverse correlated electronic phases and optical properties can be realized by a subtle change in the twist angle. Here, we discover that phonon spectra are also renormalized in MoS$_2$ twisted bilayers, adding a new perspective to moiré physics. Over a range of small twist angles, the phonon spectra evolve rapidly du…
▽ More
In moiré crystals formed by stacking van der Waals (vdW) materials, surprisingly diverse correlated electronic phases and optical properties can be realized by a subtle change in the twist angle. Here, we discover that phonon spectra are also renormalized in MoS$_2$ twisted bilayers, adding a new perspective to moiré physics. Over a range of small twist angles, the phonon spectra evolve rapidly due to ultra-strong coupling between different phonon modes and atomic reconstructions of the moiré pattern. We develop a new low-energy continuum model for phonons that overcomes the outstanding challenge of calculating properties of large moiré supercells and successfully captures essential experimental observations. Remarkably, simple optical spectroscopy experiments can provide information on strain and lattice distortions in moiré crystals with nanometer-size supercells. The newly developed theory promotes a comprehensive and unified understanding of structural, optical, and electronic properties of moiré superlattices.
△ Less
Submitted 22 September, 2020;
originally announced September 2020.
-
DV-ConvNet: Fully Convolutional Deep Learning on Point Clouds with Dynamic Voxelization and 3D Group Convolution
Authors:
Zhaoyu Su,
Pin Siang Tan,
Junkang Chow,
Jimmy Wu,
Yehur Cheong,
Yu-Hsing Wang
Abstract:
3D point cloud interpretation is a challenging task due to the randomness and sparsity of the component points. Many of the recently proposed methods like PointNet and PointCNN have been focusing on learning shape descriptions from point coordinates as point-wise input features, which usually involves complicated network architectures. In this work, we draw attention back to the standard 3D convol…
▽ More
3D point cloud interpretation is a challenging task due to the randomness and sparsity of the component points. Many of the recently proposed methods like PointNet and PointCNN have been focusing on learning shape descriptions from point coordinates as point-wise input features, which usually involves complicated network architectures. In this work, we draw attention back to the standard 3D convolutions towards an efficient 3D point cloud interpretation. Instead of converting the entire point cloud into voxel representations like the other volumetric methods, we voxelize the sub-portions of the point cloud only at necessary locations within each convolution layer on-the-fly, using our dynamic voxelization operation with self-adaptive voxelization resolution. In addition, we incorporate 3D group convolution into our dense convolution kernel implementation to further exploit the rotation invariant features of point cloud. Benefiting from its simple fully-convolutional architecture, our network is able to run and converge at a considerably fast speed, while yields on-par or even better performance compared with the state-of-the-art methods on several benchmark datasets.
△ Less
Submitted 27 July, 2021; v1 submitted 7 September, 2020;
originally announced September 2020.
-
A wide-range wavelength-tunable photon-pair source for characterizing single-photon detectors
Authors:
Lijiong Shen,
Jianwei Lee,
Antony Winata Hartanto,
Pengkian Tan,
Christian Kurtsiefer
Abstract:
The temporal response of single-photon detectors is usually obtained by measuring their impulse response to short-pulsed laser sources. In this work, we present an alternative approach using time-correlated photon pairs generated in spontaneous parametric down-conversion (SPDC). By measuring the cross-correlation between the detection times recorded with an unknown and a reference photodetector, t…
▽ More
The temporal response of single-photon detectors is usually obtained by measuring their impulse response to short-pulsed laser sources. In this work, we present an alternative approach using time-correlated photon pairs generated in spontaneous parametric down-conversion (SPDC). By measuring the cross-correlation between the detection times recorded with an unknown and a reference photodetector, the temporal response function of the unknown detector can be extracted. Changing the critical phase-matching conditions of the SPDC process provides a wavelength-tunable source of photon pairs. We demonstrate a continuous wavelength-tunability from 526 nm to 661 nm for one photon of the pair, and 1050 nm to 1760 nm for the other photon. The source allows, in principle, to access an even wider wavelength range by simply changing the pump laser of the SPDC-based source. As an initial demonstration, we characterize single photon avalance detectors sensitive to the two distinct wavelength bands, one based on Silicon, the other based on Indim Gallium Arsenide.
△ Less
Submitted 4 September, 2020;
originally announced September 2020.
-
Electronic Raman Scattering in Suspended Semiconducting Carbon Nanotubes
Authors:
Yuecong Hu,
Shaochuang Chen,
Daqi Zhang,
Xin Cong,
Sida Sun,
Jiangbin Wu,
Feng Yang,
Juan Yang,
**-heng Tan,
Yan Li
Abstract:
The electronic Raman scattering (ERS) features of single-walled carbon nanotubes (SWNTs) can reveal a wealth of information about their electronic structures, but have previously been thought to appear exclusively in metallic (M-) but not in semiconducting (S-) SWNTs. We report the experimental observation of the ERS features with an accuracy of 1 meV in suspended S-SWNTs, the processes of which a…
▽ More
The electronic Raman scattering (ERS) features of single-walled carbon nanotubes (SWNTs) can reveal a wealth of information about their electronic structures, but have previously been thought to appear exclusively in metallic (M-) but not in semiconducting (S-) SWNTs. We report the experimental observation of the ERS features with an accuracy of 1 meV in suspended S-SWNTs, the processes of which are accomplished via the available high-energy electron-hole pairs. The ERS features can facilitate further systematic studies on the properties of SWNT, both metallic and semiconducting, with defined chirality.
△ Less
Submitted 3 September, 2020; v1 submitted 1 September, 2020;
originally announced September 2020.
-
Observation of nonreciprocal magneto-optical scattering in nonencapsulated few-layered CrI3
Authors:
Zhen Liu,
Kai Guo,
Guangwei Hu,
Zhongtai Shi,
Yue Li,
Linbo Zhang,
Haiyan Chen,
Li Zhang,
Peiheng Zhou,
Haipeng Lu,
Miao-Ling Lin,
Sizhao Liu,
Yingchun Cheng,
Xue Lu Liu,
Jianliang Xie,
Lei Bi,
**-Heng Tan,
Longjiang Deng,
Cheng-Wei Qiu,
Bo Peng
Abstract:
Magneto-optical effect refers to a rotation of polarization plane, which has been widely studied in traditional ferromagnetic metal and insulator films and scarcely in two-dimensional layered materials. Here we uncover a new nonreciprocal magneto-inelastic light scattering effect in ferromagnetic few-layer CrI3. We observed a rotation of the polarization plane of inelastic light scattering between…
▽ More
Magneto-optical effect refers to a rotation of polarization plane, which has been widely studied in traditional ferromagnetic metal and insulator films and scarcely in two-dimensional layered materials. Here we uncover a new nonreciprocal magneto-inelastic light scattering effect in ferromagnetic few-layer CrI3. We observed a rotation of the polarization plane of inelastic light scattering between -20o and +60o that are tunable by an out-of-plane magnetic field from -2.5 to 2.5 T. It is experimentally observed that the degree of polarization can be magnetically manipulated between -20% and 85%. This work raises a new magneto-optical phenomenon and could create opportunities of applying 2D ferromagnetic materials in Raman lasing, topological photonics, and magneto-optical modulator for information transport and storage.
△ Less
Submitted 9 October, 2020; v1 submitted 28 August, 2020;
originally announced August 2020.
-
Understanding angle-resolved polarized Raman scattering from black phosphorus at normal and oblique laser incidences
Authors:
Miao-Ling Lin,
Yu-Chen Leng,
Xin Cong,
Da Meng,
Jiahong Wang,
Xiao-Li Li,
Binlu Yu,
Xue-Lu Liu,
Xue-Feng Yu,
**-Heng Tan
Abstract:
The selection rule for angle-resolved polarized Raman (ARPR) intensity of phonons from standard group-theoretical method in isotropic materials would break down in anisotropic layered materials (ALMs) due to birefringence and linear dichroism effects. The two effects result in depth-dependent polarization and intensity of incident laser and scattered signal inside ALMs and thus make a challenge to…
▽ More
The selection rule for angle-resolved polarized Raman (ARPR) intensity of phonons from standard group-theoretical method in isotropic materials would break down in anisotropic layered materials (ALMs) due to birefringence and linear dichroism effects. The two effects result in depth-dependent polarization and intensity of incident laser and scattered signal inside ALMs and thus make a challenge to predict ARPR intensity at any laser incidence direction. Herein, taking in-plane anisotropic black phosphorus as a prototype, we developed a so-called birefringence-linear-dichroism (BLD) model to quantitatively understand its ARPR intensity at both normal and oblique laser incidences by the same set of real Raman tensors for certain laser excitation. No fitting parameter is needed, once the birefringence and linear dichroism effects are considered with the complex refractive indexes. An approach was proposed to experimentally determine real Raman tensor and complex refractive indexes, respectively, from the relative Raman intensity along its principle axes and incident-angle resolved reflectivity by Fresnel$'$s law. The results suggest that the previously reported ARPR intensity of ultrathin ALM flakes deposited on a multilayered substrate at normal laser incidence can be also understood based on the BLD model by considering the depth-dependent polarization and intensity of incident laser and scattered Raman signal induced by both birefringence and linear dichroism effects within ALM flakes and the interference effects in the multilayered structures, which are dependent on the excitation wavelength, thickness of ALM flakes and dielectric layers of the substrate. This work can be generally applicable to any opaque anisotropic crystals, offering a promising route to predict and manipulate the polarized behaviors of related phonons.
△ Less
Submitted 22 August, 2020; v1 submitted 9 August, 2020;
originally announced August 2020.
-
Weakly positive and directed Anosov representations
Authors:
Sungwoon Kim,
Ser Peow Tan,
Tengren Zhang
Abstract:
Given a finitely generated group $Γ$, a directed graph $Λ$, and a map $R:Λ\toΓ$, we introduce the notion of an $(R,Λ)$-directed Anosov representation. This is a weakening of the notion of Anosov representations. Our main theorem gives a procedure to construct $(R,Λ)$-directed Anosov representations using Fock-Goncharov positivity. As an application of our main theorem, we construct large families…
▽ More
Given a finitely generated group $Γ$, a directed graph $Λ$, and a map $R:Λ\toΓ$, we introduce the notion of an $(R,Λ)$-directed Anosov representation. This is a weakening of the notion of Anosov representations. Our main theorem gives a procedure to construct $(R,Λ)$-directed Anosov representations using Fock-Goncharov positivity. As an application of our main theorem, we construct large families of primitive stable representations from $F_2$ to $\mathrm{PGL}(V)$, including non-discrete and non-faithful examples.
△ Less
Submitted 22 July, 2022; v1 submitted 5 August, 2020;
originally announced August 2020.
-
Coupling between particle shape and long-range interaction in the high-density regime
Authors:
Can-can Zhou,
Hongchuan Shen,
Hua Tong,
Ning Xu,
Peng Tan
Abstract:
By using long-range interacting polygons, we experimentally probe the coupling between particle shape and long-range interaction. For two typical space-filling polygons, square and triangle, we find two types of coupling modes that predominantly control the structure formation. Specifically, the rotational ordering of squares brings a lattice deformation that produces a hexagonal-to-rhombic transi…
▽ More
By using long-range interacting polygons, we experimentally probe the coupling between particle shape and long-range interaction. For two typical space-filling polygons, square and triangle, we find two types of coupling modes that predominantly control the structure formation. Specifically, the rotational ordering of squares brings a lattice deformation that produces a hexagonal-to-rhombic transition in the high-density regime, whereas the alignment of triangles introduces a large geometric frustration that causes an order-to-disorder transition. Moreover, the two coupling modes lead to small and large "internal roughness" of the two systems, and thus predominantly control their structure relaxations. Our study thus provides a physical picture to the coupling between long-range interaction effect and short-range shape effect in the high-density regime unexplored before.
△ Less
Submitted 5 August, 2020;
originally announced August 2020.
-
Interpretable Foreground Object Search As Knowledge Distillation
Authors:
Boren Li,
Po-Yu Zhuang,
Jian Gu,
Mingyang Li,
** Tan
Abstract:
This paper proposes a knowledge distillation method for foreground object search (FoS). Given a background and a rectangle specifying the foreground location and scale, FoS retrieves compatible foregrounds in a certain category for later image composition. Foregrounds within the same category can be grouped into a small number of patterns. Instances within each pattern are compatible with any quer…
▽ More
This paper proposes a knowledge distillation method for foreground object search (FoS). Given a background and a rectangle specifying the foreground location and scale, FoS retrieves compatible foregrounds in a certain category for later image composition. Foregrounds within the same category can be grouped into a small number of patterns. Instances within each pattern are compatible with any query input interchangeably. These instances are referred to as interchangeable foregrounds. We first present a pipeline to build pattern-level FoS dataset containing labels of interchangeable foregrounds. We then establish a benchmark dataset for further training and testing following the pipeline. As for the proposed method, we first train a foreground encoder to learn representations of interchangeable foregrounds. We then train a query encoder to learn query-foreground compatibility following a knowledge distillation framework. It aims to transfer knowledge from interchangeable foregrounds to supervise representation learning of compatibility. The query feature representation is projected to the same latent space as interchangeable foregrounds, enabling very efficient and interpretable instance-level search. Furthermore, pattern-level search is feasible to retrieve more controllable, reasonable and diverse foregrounds. The proposed method outperforms the previous state-of-the-art by 10.42% in absolute difference and 24.06% in relative improvement evaluated by mean average precision (mAP). Extensive experimental results also demonstrate its efficacy from various aspects. The benchmark dataset and code will be release shortly.
△ Less
Submitted 21 July, 2020; v1 submitted 19 July, 2020;
originally announced July 2020.
-
Dispersion of speech aerosols in the context of physical distancing recommendations
Authors:
Vrishank Raghav,
Zu Puayen Tan,
Surya P. Bhatt
Abstract:
High-speed particle image velocimetry (PIV) was used to quantify the dispersion of aerosol-laden gas clouds generated during phonetic vocalization by a human subject at different sound intensity levels. The measured PIV data was used to quantify the initial penetration depth. Using classical pulsed jet scaling laws propagation distances were computed for time periods beyond the measured duration.…
▽ More
High-speed particle image velocimetry (PIV) was used to quantify the dispersion of aerosol-laden gas clouds generated during phonetic vocalization by a human subject at different sound intensity levels. The measured PIV data was used to quantify the initial penetration depth. Using classical pulsed jet scaling laws propagation distances were computed for time periods beyond the measured duration. Our results indicate that the penetration distance was comparable between loud intensity speech (for example during singing, classroom lectures, parties etc.) and moderate intensity cough. Based on theoretical aerosol propagation distance and time, the 6 feet physical distancing recommendations are likely sufficient to avoid incidental exposure by the initial penetration of the aerosol cloud, but insufficient for prolonged exposure to slow propagating aerosol clouds.
△ Less
Submitted 7 July, 2020;
originally announced July 2020.