-
Knowledge Graph Embedding: An Overview
Authors:
Xiou Ge,
Yun-Cheng Wang,
Bin Wang,
C. -C. Jay Kuo
Abstract:
Many mathematical models have been leveraged to design embeddings for representing Knowledge Graph (KG) entities and relations for link prediction and many downstream tasks. These mathematically-inspired models are not only highly scalable for inference in large KGs, but also have many explainable advantages in modeling different relation patterns that can be validated through both formal proofs a…
▽ More
Many mathematical models have been leveraged to design embeddings for representing Knowledge Graph (KG) entities and relations for link prediction and many downstream tasks. These mathematically-inspired models are not only highly scalable for inference in large KGs, but also have many explainable advantages in modeling different relation patterns that can be validated through both formal proofs and empirical results. In this paper, we make a comprehensive overview of the current state of research in KG completion. In particular, we focus on two main branches of KG embedding (KGE) design: 1) distance-based methods and 2) semantic matching-based methods. We discover the connections between recently proposed models and present an underlying trend that might help researchers invent novel and more effective models. Next, we delve into CompoundE and CompoundE3D, which draw inspiration from 2D and 3D affine operations, respectively. They encompass a broad spectrum of techniques including distance-based and semantic-based methods. We will also discuss an emerging approach for KG completion which leverages pre-trained language models (PLMs) and textual descriptions of entities and relations and offer insights into the integration of KGE embedding methods with PLMs for KG completion.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Preparing pure $^{43}$Ca$^+$ samples in an ion trap with photoionization and parametric excitations
Authors:
C. -H. Kuo,
Y. -C. Hsiao,
C. -Y. Jhang,
Y. -D. Chen,
S. Tung
Abstract:
We present a practical scheme for the efficient preparation of laser-cooled $^{43}$Ca$^+$ ions in an ion trap. Our approach integrates two well-established methods: isotope-selective photoionization and isotope-specific parametric excitation. Drawing inspiration from the individual merits of each method, we have successfully integrated these techniques to prepare extended chains of $^{43}$Ca$^+$ i…
▽ More
We present a practical scheme for the efficient preparation of laser-cooled $^{43}$Ca$^+$ ions in an ion trap. Our approach integrates two well-established methods: isotope-selective photoionization and isotope-specific parametric excitation. Drawing inspiration from the individual merits of each method, we have successfully integrated these techniques to prepare extended chains of $^{43}$Ca$^+$ ions, overcoming the challenge posed by their low natural abundance of 0.135\% in a natural source. Furthermore, we explore the subtleties of our scheme, focusing on the influence of different factors on the purification process. Our investigation contributes to a broader understanding of the technique and highlights the adaptability of established methods in addressing specific isotopic challenges.
△ Less
Submitted 1 February, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Unsupervised Green Object Tracker (GOT) without Offline Pre-training
Authors:
Zhiruo Zhou,
Suya You,
C. -C. Jay Kuo
Abstract:
Supervised trackers trained on labeled data dominate the single object tracking field for superior tracking accuracy. The labeling cost and the huge computational complexity hinder their applications on edge devices. Unsupervised learning methods have also been investigated to reduce the labeling cost but their complexity remains high. Aiming at lightweight high-performance tracking, feasibility w…
▽ More
Supervised trackers trained on labeled data dominate the single object tracking field for superior tracking accuracy. The labeling cost and the huge computational complexity hinder their applications on edge devices. Unsupervised learning methods have also been investigated to reduce the labeling cost but their complexity remains high. Aiming at lightweight high-performance tracking, feasibility without offline pre-training, and algorithmic transparency, we propose a new single object tracking method, called the green object tracker (GOT), in this work. GOT conducts an ensemble of three prediction branches for robust box tracking: 1) a global object-based correlator to predict the object location roughly, 2) a local patch-based correlator to build temporal correlations of small spatial units, and 3) a superpixel-based segmentator to exploit the spatial information of the target frame. GOT offers competitive tracking accuracy with state-of-the-art unsupervised trackers, which demand heavy offline pre-training, at a lower computation cost. GOT has a tiny model size (<3k parameters) and low inference complexity (around 58M FLOPs per frame). Since its inference complexity is between 0.1%-10% of DL trackers, it can be easily deployed on mobile and edge devices.
△ Less
Submitted 16 September, 2023;
originally announced September 2023.
-
Bias and Fairness in Chatbots: An Overview
Authors:
**tang Xue,
Yun-Cheng Wang,
Chengwei Wei,
Xiaofeng Liu,
Jonghye Woo,
C. -C. Jay Kuo
Abstract:
Chatbots have been studied for more than half a century. With the rapid development of natural language processing (NLP) technologies in recent years, chatbots using large language models (LLMs) have received much attention nowadays. Compared with traditional ones, modern chatbots are more powerful and have been used in real-world applications. There are however, bias and fairness concerns in mode…
▽ More
Chatbots have been studied for more than half a century. With the rapid development of natural language processing (NLP) technologies in recent years, chatbots using large language models (LLMs) have received much attention nowadays. Compared with traditional ones, modern chatbots are more powerful and have been used in real-world applications. There are however, bias and fairness concerns in modern chatbot design. Due to the huge amounts of training data, extremely large model sizes, and lack of interpretability, bias mitigation and fairness preservation of modern chatbots are challenging. Thus, a comprehensive overview on bias and fairness in chatbot systems is given in this paper. The history of chatbots and their categories are first reviewed. Then, bias sources and potential harms in applications are analyzed. Considerations in designing fair and unbiased chatbot systems are examined. Finally, future research directions are discussed.
△ Less
Submitted 10 December, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
-
AsyncET: Asynchronous Learning for Knowledge Graph Entity Ty** with Auxiliary Relations
Authors:
Yun-Cheng Wang,
Xiou Ge,
Bin Wang,
C. -C. Jay Kuo
Abstract:
Knowledge graph entity ty** (KGET) is a task to predict the missing entity types in knowledge graphs (KG). Previously, KG embedding (KGE) methods tried to solve the KGET task by introducing an auxiliary relation, 'hasType', to model the relationship between entities and their types. However, a single auxiliary relation has limited expressiveness for diverse entity-type patterns. We improve the e…
▽ More
Knowledge graph entity ty** (KGET) is a task to predict the missing entity types in knowledge graphs (KG). Previously, KG embedding (KGE) methods tried to solve the KGET task by introducing an auxiliary relation, 'hasType', to model the relationship between entities and their types. However, a single auxiliary relation has limited expressiveness for diverse entity-type patterns. We improve the expressiveness of KGE methods by introducing multiple auxiliary relations in this work. Similar entity types are grouped to reduce the number of auxiliary relations and improve their capability to model entity-type patterns with different granularities. With the presence of multiple auxiliary relations, we propose a method adopting an Asynchronous learning scheme for Entity Ty**, named AsyncET, which updates the entity and type embeddings alternatively to keep the learned entity embedding up-to-date and informative for entity type prediction. Experiments are conducted on two commonly used KGET datasets to show that the performance of KGE methods on the KGET task can be substantially improved by the proposed multiple auxiliary relations and asynchronous embedding learning. Furthermore, our method has a significant advantage over state-of-the-art methods in model sizes and time complexity.
△ Less
Submitted 30 August, 2023;
originally announced August 2023.
-
A search for pulsars around Sgr A* in the first Event Horizon Telescope dataset
Authors:
Pablo Torne,
Kuo Liu,
Ralph P. Eatough,
Jompoj Wongphechauxsorn,
James M. Cordes,
Gregory Desvignes,
Mariafelicia De Laurentis,
Michael Kramer,
Scott M. Ransom,
Shami Chatterjee,
Robert Wharton,
Ramesh Karuppusamy,
Lindy Blackburn,
Michael Janssen,
Chi-kwan Chan,
Geoffrey B. Crew,
Lynn D. Matthews,
Ciriaco Goddi,
Helge Rottmann,
Jan Wagner,
Salvador Sanchez,
Ignacio Ruiz,
Federico Abbate,
Geoffrey C. Bower,
Juan J. Salamanca
, et al. (261 additional authors not shown)
Abstract:
The Event Horizon Telescope (EHT) observed in 2017 the supermassive black hole at the center of the Milky Way, Sagittarius A* (Sgr A*), at a frequency of 228.1 GHz ($λ$=1.3 mm). The fundamental physics tests that even a single pulsar orbiting Sgr A* would enable motivate searching for pulsars in EHT datasets. The high observing frequency means that pulsars - which typically exhibit steep emission…
▽ More
The Event Horizon Telescope (EHT) observed in 2017 the supermassive black hole at the center of the Milky Way, Sagittarius A* (Sgr A*), at a frequency of 228.1 GHz ($λ$=1.3 mm). The fundamental physics tests that even a single pulsar orbiting Sgr A* would enable motivate searching for pulsars in EHT datasets. The high observing frequency means that pulsars - which typically exhibit steep emission spectra - are expected to be very faint. However, it also negates pulse scattering, an effect that could hinder pulsar detections in the Galactic Center. Additionally, magnetars or a secondary inverse Compton emission could be stronger at millimeter wavelengths than at lower frequencies. We present a search for pulsars close to Sgr A* using the data from the three most-sensitive stations in the EHT 2017 campaign: the Atacama Large Millimeter/submillimeter Array, the Large Millimeter Telescope and the IRAM 30 m Telescope. We apply three detection methods based on Fourier-domain analysis, the Fast-Folding-Algorithm and single pulse search targeting both pulsars and burst-like transient emission; using the simultaneity of the observations to confirm potential candidates. No new pulsars or significant bursts were found. Being the first pulsar search ever carried out at such high radio frequencies, we detail our analysis methods and give a detailed estimation of the sensitivity of the search. We conclude that the EHT 2017 observations are only sensitive to a small fraction ($\lesssim$2.2%) of the pulsars that may exist close to Sgr A*, motivating further searches for fainter pulsars in the region.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Fe substitution in URu$_2$Si$_2$: singlet magnetism in an extended Doniach phase diagram
Authors:
Andrea Marino,
Denise S. Christovam,
Chun-Fu Chang,
Johannes Falke,
Chang-Yang Kuo,
Chi-Nan Wu,
Martin Sundermann,
Andrea Amorese,
Hlynur Gretarsson,
Eric Lee Wong,
Camilla M. Moir,
Yuang Deng,
M. Brian Maple,
Peter Thalmeier,
Liu Hao Tjeng,
Andrea Severing
Abstract:
The application of pressure as well as the successive substitution of Ru with Fe in the hidden order (HO) compound URu$_2$Si$_2$ leads to the formation of the large moment antiferromagnetic phase (LMAFM). Here we have investigated the substitution series URu$_{2-x}$Fe$_x$Si$_2$ from $x$\,=\,0.0 to 2.0 by U\,4$f$ core-level photoelectron spectroscopy and have observed non-monotonic changes in the s…
▽ More
The application of pressure as well as the successive substitution of Ru with Fe in the hidden order (HO) compound URu$_2$Si$_2$ leads to the formation of the large moment antiferromagnetic phase (LMAFM). Here we have investigated the substitution series URu$_{2-x}$Fe$_x$Si$_2$ from $x$\,=\,0.0 to 2.0 by U\,4$f$ core-level photoelectron spectroscopy and have observed non-monotonic changes in the spectra. The initial increase and subsequent decrease of the spectral weight of the 4$f$ core level satellite with increasing $x$ stands for a non-monotonic 5$f$ filling across the substitution series. The competition of chemical pressure and increase of the density of states at the Fermi energy, both due to substitution of Ru with Fe, can explain such a behavior. An extended Doniach phase diagram including the $x$ dependence of the density of states is proposed. Also in URu$_{2-x}$Fe$_x$Si$_2$ the ground state is a singlet or quasi-doublet state consisting of two singlets. Hence, the formation of magnetic order in the URu$_{2-x}$Fe$_x$Si$_2$ substitution series must be explained within a singlet magnetism model.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
A Measurement of Gravitational Lensing of the Cosmic Microwave Background Using SPT-3G 2018 Data
Authors:
Z. Pan,
F. Bianchini,
W. L. K. Wu,
P. A. R. Ade,
Z. Ahmed,
E. Anderes,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
K. Aylor,
L. Balkenhol,
P. S. Barry,
R. Basu Thakur,
K. Benabed,
A. N. Bender,
B. A. Benson,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
K. Byrum,
E. Camphuis,
J. E. Carlstrom,
F. W. Carter,
T. W. Cecil,
C. L. Chang
, et al. (111 additional authors not shown)
Abstract:
We present a measurement of gravitational lensing over 1500 deg$^2$ of the Southern sky using SPT-3G temperature data at 95 and 150 GHz taken in 2018. The lensing amplitude relative to a fiducial Planck 2018 $Λ$CDM cosmology is found to be $1.020\pm0.060$, excluding instrumental and astrophysical systematic uncertainties. We conduct extensive systematic and null tests to check the robustness of th…
▽ More
We present a measurement of gravitational lensing over 1500 deg$^2$ of the Southern sky using SPT-3G temperature data at 95 and 150 GHz taken in 2018. The lensing amplitude relative to a fiducial Planck 2018 $Λ$CDM cosmology is found to be $1.020\pm0.060$, excluding instrumental and astrophysical systematic uncertainties. We conduct extensive systematic and null tests to check the robustness of the lensing measurements, and report a minimum-variance combined lensing power spectrum over angular multipoles of $50<L<2000$, which we use to constrain cosmological models. When analyzed alone and jointly with primary cosmic microwave background (CMB) spectra within the $Λ$CDM model, our lensing amplitude measurements are consistent with measurements from SPT-SZ, SPTpol, ACT, and Planck. Incorporating loose priors on the baryon density and other parameters including uncertainties on a foreground bias template, we obtain a $1σ$ constraint on $σ_8 Ω_{\rm m}^{0.25}=0.595 \pm 0.026$ using the SPT-3G 2018 lensing data alone, where $σ_8$ is a common measure of the amplitude of structure today and $Ω_{\rm m}$ is the matter density parameter. Combining SPT-3G 2018 lensing measurements with baryon acoustic oscillation (BAO) data, we derive parameter constraints of $σ_8 = 0.810 \pm 0.033$, $S_8 \equiv σ_8(Ω_{\rm m}/0.3)^{0.5}= 0.836 \pm 0.039$, and Hubble constant $H_0 =68.8^{+1.3}_{-1.6}$ km s$^{-1}$ Mpc$^{-1}$. Using CMB anisotropy and lensing measurements from SPT-3G only, we provide independent constraints on the spatial curvature of $Ω_{K} = 0.014^{+0.023}_{-0.026}$ (95% C.L.) and the dark energy density of $Ω_Λ= 0.722^{+0.031}_{-0.026}$ (68% C.L.). When combining SPT-3G lensing data with SPT-3G CMB anisotropy and BAO data, we find an upper limit on the sum of the neutrino masses of $\sum m_ν< 0.30$ eV (95% C.L.).
△ Less
Submitted 29 January, 2024; v1 submitted 22 August, 2023;
originally announced August 2023.
-
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection
Authors:
Tao Tu,
Shun-Po Chuang,
Yu-Lun Liu,
Cheng Sun,
Ke Zhang,
Donna Roy,
Cheng-Hao Kuo,
Min Sun
Abstract:
We propose ImGeoNet, a multi-view image-based 3D object detection framework that models a 3D space by an image-induced geometry-aware voxel representation. Unlike previous methods which aggregate 2D features into 3D voxels without considering geometry, ImGeoNet learns to induce geometry from multi-view images to alleviate the confusion arising from voxels of free space, and during the inference ph…
▽ More
We propose ImGeoNet, a multi-view image-based 3D object detection framework that models a 3D space by an image-induced geometry-aware voxel representation. Unlike previous methods which aggregate 2D features into 3D voxels without considering geometry, ImGeoNet learns to induce geometry from multi-view images to alleviate the confusion arising from voxels of free space, and during the inference phase, only images from multiple views are required. Besides, a powerful pre-trained 2D feature extractor can be leveraged by our representation, leading to a more robust performance. To evaluate the effectiveness of ImGeoNet, we conduct quantitative and qualitative experiments on three indoor datasets, namely ARKitScenes, ScanNetV2, and ScanNet200. The results demonstrate that ImGeoNet outperforms the current state-of-the-art multi-view image-based method, ImVoxelNet, on all three datasets in terms of detection accuracy. In addition, ImGeoNet shows great data efficiency by achieving results comparable to ImVoxelNet with 100 views while utilizing only 40 views. Furthermore, our studies indicate that our proposed image-induced geometry-aware representation can enable image-based methods to attain superior detection accuracy than the seminal point cloud-based method, VoteNet, in two practical scenarios: (1) scenarios where point clouds are sparse and noisy, such as in ARKitScenes, and (2) scenarios involve diverse object classes, particularly classes of small objects, as in the case in ScanNet200.
△ Less
Submitted 17 August, 2023;
originally announced August 2023.
-
Surface Second Harmonic Generation from Topological Dirac Semimetal PdTe$_2$
Authors:
Syed Mohammed Faizanuddin,
Ching-Hang Chien,
Yao-Jui Chan,
Si-Tong Liu,
Chia-Nung Kuo,
Chin Shuan Lue,
Yu-Chieh Wen
Abstract:
Recent experiments and calculations in topological semimetals have observed anomalously strong second-order optical nonlinearity, but yet whether the enhancement also occurs at surfaces of topological semimetals in general remains an open question. In this work, we tackle this problem by measuring polarization-dependent and rotational-anisotropy optical second harmonic generation (SHG) from centro…
▽ More
Recent experiments and calculations in topological semimetals have observed anomalously strong second-order optical nonlinearity, but yet whether the enhancement also occurs at surfaces of topological semimetals in general remains an open question. In this work, we tackle this problem by measuring polarization-dependent and rotational-anisotropy optical second harmonic generation (SHG) from centrosymmetric type-II Dirac semimetal PdTe$_2$. We found the SHG to follow C$_{3v}$ surface symmetry with a time-varying intensity dictated by the oxidation kinetics of the material after its surface cleavage, indicating the surface origin of SHG. Quantitative characterization of the surface nonlinear susceptibility indicates a large out-of-plane response of PdTe$_2$ with $|χ_{ccc}^{(2)}|$ up to 25 $\times$ 10$^{-18}$ m$^2$/V. Our results support the topological surfaces/interfaces as a new route toward applications of nonlinear optical effects with released symmetry constraints, and demonstrate SHG as a viable means to in situ study of kinetics of topological surfaces.
△ Less
Submitted 17 August, 2023;
originally announced August 2023.
-
A Comprehensive Overview of Computational Nuclei Segmentation Methods in Digital Pathology
Authors:
Vasileios Magoulianitis,
Catherine A. Alexander,
C. -C. Jay Kuo
Abstract:
In the cancer diagnosis pipeline, digital pathology plays an instrumental role in the identification, staging, and grading of malignant areas on biopsy tissue specimens. High resolution histology images are subject to high variance in appearance, sourcing either from the acquisition devices or the H\&E staining process. Nuclei segmentation is an important task, as it detects the nuclei cells over…
▽ More
In the cancer diagnosis pipeline, digital pathology plays an instrumental role in the identification, staging, and grading of malignant areas on biopsy tissue specimens. High resolution histology images are subject to high variance in appearance, sourcing either from the acquisition devices or the H\&E staining process. Nuclei segmentation is an important task, as it detects the nuclei cells over background tissue and gives rise to the topology, size, and count of nuclei which are determinant factors for cancer detection. Yet, it is a fairly time consuming task for pathologists, with reportedly high subjectivity. Computer Aided Diagnosis (CAD) tools empowered by modern Artificial Intelligence (AI) models enable the automation of nuclei segmentation. This can reduce the subjectivity in analysis and reading time. This paper provides an extensive review, beginning from earlier works use traditional image processing techniques and reaching up to modern approaches following the Deep Learning (DL) paradigm. Our review also focuses on the weak supervision aspect of the problem, motivated by the fact that annotated data is scarce. At the end, the advantages of different models and types of supervision are thoroughly discussed. Furthermore, we try to extrapolate and envision how future research lines will potentially be, so as to minimize the need for labeled data while maintaining high performance. Future methods should emphasize efficient and explainable models with a transparent underlying process so that physicians can trust their output.
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
Calibration and Physics with ARA Station 1: A Unique Askaryan Radio Array Detector
Authors:
M. F. H Seikh,
D. Z. Besson,
S. Ali,
P. Allison,
S. Archambault,
J. J. Beatty,
A. Bishop,
P. Chen,
Y. C. Chen,
B. A. Clark,
W. Clay,
A. Connolly,
K. Couberly,
L. Cremonesi,
A. Cummings,
P. Dasgupta,
R. Debolt,
S. De Kockere,
K. D. de Vries,
C. Deaconu,
M. A. DuVernois,
J. Flaherty,
E. Friedman,
R. Gaior,
P. Giri
, et al. (48 additional authors not shown)
Abstract:
The Askaryan Radio Array Station 1 (A1), the first among five autonomous stations deployed for the ARA experiment at the South Pole, is a unique ultra-high energy neutrino (UHEN) detector based on the Askaryan effect that uses Antarctic ice as the detector medium. Its 16 radio antennas (distributed across 4 strings, each with 2 Vertically Polarized (VPol), 2 Horizontally Polarized (HPol) receivers…
▽ More
The Askaryan Radio Array Station 1 (A1), the first among five autonomous stations deployed for the ARA experiment at the South Pole, is a unique ultra-high energy neutrino (UHEN) detector based on the Askaryan effect that uses Antarctic ice as the detector medium. Its 16 radio antennas (distributed across 4 strings, each with 2 Vertically Polarized (VPol), 2 Horizontally Polarized (HPol) receivers), and 2 strings of transmitting antennas (calibration pulsers, CPs), each with 1 VPol and 1 HPol channel, are deployed at depths less than 100 m within the shallow firn zone of the 2.8 km thick South Pole (SP) ice. We apply different methods to calibrate its Ice Ray Sampler second generation (IRS2) chip for timing offset and ADC-to-Voltage conversion factors using a known continuous wave input signal to the digitizer, and achieve a precision of sub-nanoseconds. We achieve better calibration for odd, compared to even samples, and also find that the HPols under-perform relative to the VPol channels. Our timing calibrated data is subsequently used to calibrate the ADC-to-Voltage conversion as well as precise antenna locations, as a precursor to vertex reconstruction. The calibrated data will then be analyzed for UHEN signals in the final step of data compression. The ability of A1 to scan the firn region of SP ice sheet will contribute greatly towards a 5-station analysis and will inform the design of the planned IceCube Gen-2 radio array.
△ Less
Submitted 14 August, 2023;
originally announced August 2023.
-
ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation
Authors:
Xuefeng Hu,
Ke Zhang,
Lu Xia,
Albert Chen,
Jiajia Luo,
Yuyin Sun,
Ken Wang,
Nan Qiao,
Xiao Zeng,
Min Sun,
Cheng-Hao Kuo,
Ram Nevatia
Abstract:
Large-scale Pre-Training Vision-Language Model such as CLIP has demonstrated outstanding performance in zero-shot classification, e.g. achieving 76.3% top-1 accuracy on ImageNet without seeing any example, which leads to potential benefits to many tasks that have no labeled data. However, while applying CLIP to a downstream target domain, the presence of visual and text domain gaps and cross-modal…
▽ More
Large-scale Pre-Training Vision-Language Model such as CLIP has demonstrated outstanding performance in zero-shot classification, e.g. achieving 76.3% top-1 accuracy on ImageNet without seeing any example, which leads to potential benefits to many tasks that have no labeled data. However, while applying CLIP to a downstream target domain, the presence of visual and text domain gaps and cross-modality misalignment can greatly impact the model performance. To address such challenges, we propose ReCLIP, the first source-free domain adaptation method for vision-language models, which does not require any source data or target labeled data. ReCLIP first learns a projection space to mitigate the misaligned visual-text embeddings and learns pseudo labels, and then deploys cross-modality self-training with the pseudo labels, to update visual and text encoders, refine labels and reduce domain gaps and misalignments iteratively. With extensive experiments, we demonstrate ReCLIP reduces the average error rate of CLIP from 30.17% to 25.06% on 22 image classification benchmarks. Code available at https://github.com/michiganleon/ReCLIP_WACV.
△ Less
Submitted 13 December, 2023; v1 submitted 4 August, 2023;
originally announced August 2023.
-
The Greenland Telescope: Construction, Commissioning, and Operations in Pituffik
Authors:
Ming-Tang Chen,
Keiichi Asada,
Satoki Matsushita,
Philippe Raffin,
Makoto Inoue,
Paul T. P. Ho,
Chih-Chiang Han,
Derek Kubo,
Timothy Norton,
Nimesh A. Patel,
George Nystrom,
Chih-Wei L. Huang,
Pierre Martin-Cocher,
Jun Yi Koay,
Cristina Romero-Cañizales,
Ching-Tang Liu,
Teddy Huang,
Kuan-Yu Liu,
Tashun Wei,
Shu-Hao Chang,
Ryan Chilson,
Peter Oshiro,
Homin Jiang,
Chao-Te Li,
Geoffrey Bower
, et al. (29 additional authors not shown)
Abstract:
In 2018, the Greenland Telescope (GLT) started scientific observation in Greenland. Since then, we have completed several significant improvements and added new capabilities to the telescope system. This paper presents a full review of the GLT system, a summary of our observation activities since 2018, the lessons learned from the operations in the Arctic regions, and the prospect of the telescope…
▽ More
In 2018, the Greenland Telescope (GLT) started scientific observation in Greenland. Since then, we have completed several significant improvements and added new capabilities to the telescope system. This paper presents a full review of the GLT system, a summary of our observation activities since 2018, the lessons learned from the operations in the Arctic regions, and the prospect of the telescope.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
An Overview on Generative AI at Scale with Edge-Cloud Computing
Authors:
Yun-Cheng Wang,
**tang Xue,
Chengwei Wei,
C. -C. Jay Kuo
Abstract:
As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing fram…
▽ More
As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing framework due to the need for large computation resources. However, such services will encounter high latency because of data transmission and a high volume of requests. On the other hand, edge-cloud computing can provide adequate computation power and low latency at the same time through the collaboration between edges and the cloud. Thus, it is attractive to build GenAI systems at scale by leveraging the edge-cloud computing paradigm. In this overview paper, we review recent developments in GenAI and edge-cloud computing, respectively. Then, we use two exemplary GenAI applications to discuss technical challenges in scaling up their solutions using edge-cloud collaborative systems. Finally, we list design considerations for training and deploying GenAI systems at scale and point out future research directions.
△ Less
Submitted 9 July, 2023; v1 submitted 2 June, 2023;
originally announced June 2023.
-
Blind Video Quality Assessment at the Edge
Authors:
Zhanxuan Mei,
Yun-Cheng Wang,
C. -C. Jay Kuo
Abstract:
Owing to the proliferation of user-generated videos on the Internet, blind video quality assessment (BVQA) at the edge attracts growing attention. The usage of deep-learning-based methods is restricted to be applied at the edge due to their large model sizes and high computational complexity. In light of this, a novel lightweight BVQA method called GreenBVQA is proposed in this work. GreenBVQA fea…
▽ More
Owing to the proliferation of user-generated videos on the Internet, blind video quality assessment (BVQA) at the edge attracts growing attention. The usage of deep-learning-based methods is restricted to be applied at the edge due to their large model sizes and high computational complexity. In light of this, a novel lightweight BVQA method called GreenBVQA is proposed in this work. GreenBVQA features a small model size, low computational complexity, and high performance. Its processing pipeline includes: video data crop**, unsupervised representation generation, supervised feature selection, and mean-opinion-score (MOS) regression and ensembles. We conduct experimental evaluations on three BVQA datasets and show that GreenBVQA can offer state-of-the-art performance in PLCC and SROCC metrics while demanding significantly smaller model sizes and lower computational complexity. Thus, GreenBVQA is well-suited for edge devices.
△ Less
Submitted 29 October, 2023; v1 submitted 17 June, 2023;
originally announced June 2023.
-
Filamentary Dust Polarization and the Morphology of Neutral Hydrogen Structures
Authors:
George Halal,
Susan E. Clark,
Ari Cukierman,
Dominic Beck,
Chao-Lin Kuo
Abstract:
Filamentary structures in neutral hydrogen (H I) emission are well-aligned with the interstellar magnetic field, so H I emission morphology can be used to construct templates that strongly correlate with measurements of polarized thermal dust emission. We explore how the quantification of filament morphology affects this correlation. We introduce a new implementation of the Rolling Hough Transform…
▽ More
Filamentary structures in neutral hydrogen (H I) emission are well-aligned with the interstellar magnetic field, so H I emission morphology can be used to construct templates that strongly correlate with measurements of polarized thermal dust emission. We explore how the quantification of filament morphology affects this correlation. We introduce a new implementation of the Rolling Hough Transform (RHT) using spherical harmonic convolutions, which enables efficient quantification of filamentary structure on the sphere. We use this spherical RHT algorithm along with a Hessian-based method to construct H I-based polarization templates. We discuss improvements to each algorithm relative to similar implementations in the literature and compare their outputs. By exploring the parameter space of filament morphologies with the spherical RHT, we find that the most informative H I structures for modeling the magnetic field structure are the thinnest resolved filaments. For this reason, we find a $\sim10\%$ enhancement in the $B$-mode correlation with dust polarization with higher-resolution H I observations. We demonstrate that certain interstellar morphologies can produce parity-violating signatures, i.e., nonzero $TB$ and $EB$, even under the assumption that filaments are locally aligned with the magnetic field. Finally, we demonstrate that $B$ modes from interstellar dust filaments are mostly affected by the topology of the filaments with respect to one another and their relative polarized intensities, whereas $E$ modes are mostly sensitive to the shapes of individual filaments.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
Imaging emergent exotic quasiparticle state in a frustrated transition metal oxide
Authors:
Yuita Fujisawa,
Anjana Krishnadas,
Chia-Hsiu Hsu,
Takahito Takeda,
Sheng Liu,
Markel Pardo-Almanza,
Yukiko Obata,
Dyon van Dinter,
Kohei Yamagami,
Guoqing Chang,
Masaki Kobayashi,
Chang-Yang Kuo,
Yoshinori Okada
Abstract:
The existence of rich Fermiology in anomalous metal phase in exotic superconductors has attracted considerable interests, as exemplified in copper, iron-based, and intermetallic frustrated kagome-based compounds. A common feature in these cases is pseudo-gap opening or long-range lattice/electronic ordering above superconducting critical temperature Tc. As yet developed area is the potential exist…
▽ More
The existence of rich Fermiology in anomalous metal phase in exotic superconductors has attracted considerable interests, as exemplified in copper, iron-based, and intermetallic frustrated kagome-based compounds. A common feature in these cases is pseudo-gap opening or long-range lattice/electronic ordering above superconducting critical temperature Tc. As yet developed area is the potential existence of exotic Fermiology in superconducting transition metal oxides on a geometrically frustrated lattice. Here, we focus on the spinel oxide superconductor LiTi2O4, which can be viewed as the hole-doped side of the orbital ordered 3d1 Mott system on the Ti-derived pyrochlore frustrated network. By the in-situ combination of angle-resolved photoemission spectroscopy (ARPES) and epitaxial thin film growth, we discovered the abrupt flattening of near Fermi energy dispersion below the characteristic temperature T* ~ 150 K. While the emergent negative thermal expansion below T* strongly supports a distinct phase at low-temperature, absence of energy gap opening, splitting/folding of bands, nor long-range lattice distortion are seen across T*. We propose that the competition between growing instability towards orbital ordering and its inherent geometric frustration in the Ti-pyrochlore network results in a new quantum state of matter with robust high entropic nature below T*. Our findings collectively point to a unique Fermiology in frustrated three-dimensional transition metal oxides, and its connection to superconductivity below Tc is open as an interesting future challenge. Also, a potential guideline is unexpectedly provided for designing zero thermal expansion metal to develop future solid-state devices.
△ Less
Submitted 11 June, 2023;
originally announced June 2023.
-
Hidden Hydroxides in KOH-Grown BaNiO3 Crystals: A Potential Link to Their Catalytic Behavior
Authors:
Lun **,
Haozhe Wang,
Xianghan Xu,
Danrui Ni,
Chen Yang,
Yu-Chieh Ku,
Cheng-En Liu,
Chang-Yang Kuo,
Chun-Fu Chang,
Raimundas Sereika,
Wenli Bi,
Weiwei Xie,
Robert. J. Cava
Abstract:
The hexagonal perovskite BaNiO3, prepared via non-ceramic approaches, is known to act as a good catalyst for the oxygen-evolution reaction (OER) in alkaline media. Here we report our observation that BaNiO3 synthesized via KOH flux growth and high O2 pressure ceramic synthesis have different magnetic properties. We show that this is because the KOH flux-grown crystals made in open-air are actually…
▽ More
The hexagonal perovskite BaNiO3, prepared via non-ceramic approaches, is known to act as a good catalyst for the oxygen-evolution reaction (OER) in alkaline media. Here we report our observation that BaNiO3 synthesized via KOH flux growth and high O2 pressure ceramic synthesis have different magnetic properties. We show that this is because the KOH flux-grown crystals made in open-air are actually a hydroxide-containing form of BaNiO3 that can be dried upon annealing in O2 flow. This work not only unveils a previously unknown aspect of the BaNiO3 OER catalyst and offers some insights into the underlying mechanism, but also suggests that hydroxide ions may be present in other hexagonal perovskite oxides prepared in wet conditions.
△ Less
Submitted 27 October, 2023; v1 submitted 8 June, 2023;
originally announced June 2023.
-
Green Steganalyzer: A Green Learning Approach to Image Steganalysis
Authors:
Yao Zhu,
Xinyu Wang,
Hong-Shuo Chen,
Ronald Salloum,
C. -C. Jay Kuo
Abstract:
A novel learning solution to image steganalysis based on the green learning paradigm, called Green Steganalyzer (GS), is proposed in this work. GS consists of three modules: 1) pixel-based anomaly prediction, 2) embedding location detection, and 3) decision fusion for image-level detection. In the first module, GS decomposes an image into patches, adopts Saab transforms for feature extraction, and…
▽ More
A novel learning solution to image steganalysis based on the green learning paradigm, called Green Steganalyzer (GS), is proposed in this work. GS consists of three modules: 1) pixel-based anomaly prediction, 2) embedding location detection, and 3) decision fusion for image-level detection. In the first module, GS decomposes an image into patches, adopts Saab transforms for feature extraction, and conducts self-supervised learning to predict an anomaly score of their center pixel. In the second module, GS analyzes the anomaly scores of a pixel and its neighborhood to find pixels of higher embedding probabilities. In the third module, GS focuses on pixels of higher embedding probabilities and fuses their anomaly scores to make final image-level classification. Compared with state-of-the-art deep-learning models, GS achieves comparable detection performance against S-UNIWARD, WOW and HILL steganography schemes with significantly lower computational complexity and a smaller model size, making it attractive for mobile/edge applications. Furthermore, GS is mathematically transparent because of its modular design.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
The ABJM Amplituhedron
Authors:
Song He,
Yu-tin Huang,
Chia-Kai Kuo
Abstract:
In this paper, we take a major step towards the construction and applications of an all-loop, all-multiplicity amplituhedron for three-dimensional planar $\mathcal{N}=6$ Chern-Simons matter theory, or the $\textit{ABJM amplituhedron}$. We show that by simply changing the overall sign of the positive region of the original amplituhedron for four-dimensional planar $\mathcal{N}=4$ super-Yang-Mills (…
▽ More
In this paper, we take a major step towards the construction and applications of an all-loop, all-multiplicity amplituhedron for three-dimensional planar $\mathcal{N}=6$ Chern-Simons matter theory, or the $\textit{ABJM amplituhedron}$. We show that by simply changing the overall sign of the positive region of the original amplituhedron for four-dimensional planar $\mathcal{N}=4$ super-Yang-Mills (sYM) and performing a symplectic reduction, only three-dimensional kinematics in the middle sector of even-multiplicity survive. The resulting form of the geometry, combined with its parity images, gives the full loop integrand. This simple modification geometrically enforces the vanishing of odd-multiplicity cuts, and manifests the correct soft cuts as well as two-particle unitarity cuts. Furthermore, the so-called ``bipartite structures" of four-point all-loop negative geometries also directly generalize to all multiplicities. We introduce a novel approach for triangulating loop amplituhedra based on the kinematics of the tree region, resulting in local integrands tailored to ``prescriptive unitarity". This construction sheds fascinating new light on the interplay between loop and tree amplituhedra for both ABJM and $\mathcal{N}=4$ sYM: the loop geometry demands that the tree region must be dissected into $\textit{chambers}$, defined by the simultaneous positivity of maximal cuts. The loop geometry is then the ``fibration" of the tree region. Using the new construction, we give explicit results of one-loop integrands up to ten points and two-loop integrands up to eight points by computing the canonical form of ABJM loop amplituhedron.
△ Less
Submitted 25 September, 2023; v1 submitted 1 June, 2023;
originally announced June 2023.
-
HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning
Authors:
Chia-Wen Kuo,
Zsolt Kira
Abstract:
A great deal of progress has been made in image captioning, driven by research into how to encode the image using pre-trained models. This includes visual encodings (e.g. image grid features or detected objects) and more recently textual encodings (e.g. image tags or text descriptions of image regions). As more advanced encodings are available and incorporated, it is natural to ask: how to efficie…
▽ More
A great deal of progress has been made in image captioning, driven by research into how to encode the image using pre-trained models. This includes visual encodings (e.g. image grid features or detected objects) and more recently textual encodings (e.g. image tags or text descriptions of image regions). As more advanced encodings are available and incorporated, it is natural to ask: how to efficiently and effectively leverage the heterogeneous set of encodings? In this paper, we propose to regard the encodings as augmented views of the input image. The image captioning model encodes each view independently with a shared encoder efficiently, and a contrastive loss is incorporated across the encoded views in a novel way to improve their representation quality and the model's data efficiency. Our proposed hierarchical decoder then adaptively weighs the encoded views according to their effectiveness for caption generation by first aggregating within each view at the token level, and then across views at the view level. We demonstrate significant performance improvements of +5.6% CIDEr on MS-COCO and +12.9% CIDEr on Flickr30k compared to state of the arts, and conduct rigorous analyses to demonstrate the importance of each part of our design.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
CLIP-GCD: Simple Language Guided Generalized Category Discovery
Authors:
Rabah Ouldnoughi,
Chia-Wen Kuo,
Zsolt Kira
Abstract:
Generalized Category Discovery (GCD) requires a model to both classify known categories and cluster unknown categories in unlabeled data. Prior methods leveraged self-supervised pre-training combined with supervised fine-tuning on the labeled data, followed by simple clustering methods. In this paper, we posit that such methods are still prone to poor performance on out-of-distribution categories,…
▽ More
Generalized Category Discovery (GCD) requires a model to both classify known categories and cluster unknown categories in unlabeled data. Prior methods leveraged self-supervised pre-training combined with supervised fine-tuning on the labeled data, followed by simple clustering methods. In this paper, we posit that such methods are still prone to poor performance on out-of-distribution categories, and do not leverage a key ingredient: Semantic relationships between object categories. We therefore propose to leverage multi-modal (vision and language) models, in two complementary ways. First, we establish a strong baseline by replacing uni-modal features with CLIP, inspired by its zero-shot performance. Second, we propose a novel retrieval-based mechanism that leverages CLIP's aligned vision-language representations by mining text descriptions from a text corpus for the labeled and unlabeled set. We specifically use the alignment between CLIP's visual encoding of the image and textual encoding of the corpus to retrieve top-k relevant pieces of text and incorporate their embeddings to perform joint image+text semi-supervised clustering. We perform rigorous experimentation and ablations (including on where to retrieve from, how much to retrieve, and how to combine information), and validate our results on several datasets including out-of-distribution domains, demonstrating state-of-art results.
△ Less
Submitted 17 May, 2023;
originally announced May 2023.
-
A ring-like accretion structure in M87 connecting its black hole and jet
Authors:
Ru-Sen Lu,
Keiichi Asada,
Thomas P. Krichbaum,
Jongho Park,
Fumie Tazaki,
Hung-Yi Pu,
Masanori Nakamura,
Andrei Lobanov,
Kazuhiro Hada,
Kazunori Akiyama,
Jae-Young Kim,
Ivan Marti-Vidal,
José L. Gómez,
Tomohisa Kawashima,
Feng Yuan,
Eduardo Ros,
Walter Alef,
Silke Britzen,
Michael Bremer,
Avery E. Broderick,
Akihiro Doi,
Gabriele Giovannini,
Marcello Giroletti,
Paul T. P. Ho,
Mareki Honma
, et al. (96 additional authors not shown)
Abstract:
The nearby radio galaxy M87 is a prime target for studying black hole accretion and jet formation^{1,2}. Event Horizon Telescope observations of M87 in 2017, at a wavelength of 1.3 mm, revealed a ring-like structure, which was interpreted as gravitationally lensed emission around a central black hole^3. Here we report images of M87 obtained in 2018, at a wavelength of 3.5 mm, showing that the comp…
▽ More
The nearby radio galaxy M87 is a prime target for studying black hole accretion and jet formation^{1,2}. Event Horizon Telescope observations of M87 in 2017, at a wavelength of 1.3 mm, revealed a ring-like structure, which was interpreted as gravitationally lensed emission around a central black hole^3. Here we report images of M87 obtained in 2018, at a wavelength of 3.5 mm, showing that the compact radio core is spatially resolved. High-resolution imaging shows a ring-like structure of 8.4_{-1.1}^{+0.5} Schwarzschild radii in diameter, approximately 50% larger than that seen at 1.3 mm. The outer edge at 3.5 mm is also larger than that at 1.3 mm. This larger and thicker ring indicates a substantial contribution from the accretion flow with absorption effects in addition to the gravitationally lensed ring-like emission. The images show that the edge-brightened jet connects to the accretion flow of the black hole. Close to the black hole, the emission profile of the jet-launching region is wider than the expected profile of a black-hole-driven jet, suggesting the possible presence of a wind associated with the accretion flow.
△ Less
Submitted 25 April, 2023;
originally announced April 2023.
-
Unsupervised Synthetic Image Refinement via Contrastive Learning and Consistent Semantic-Structural Constraints
Authors:
Ganning Zhao,
Tingwei Shen,
Suya You,
C. -C. Jay Kuo
Abstract:
Ensuring the realism of computer-generated synthetic images is crucial to deep neural network (DNN) training. Due to different semantic distributions between synthetic and real-world captured datasets, there exists semantic mismatch between synthetic and refined images, which in turn results in the semantic distortion. Recently, contrastive learning (CL) has been successfully used to pull correlat…
▽ More
Ensuring the realism of computer-generated synthetic images is crucial to deep neural network (DNN) training. Due to different semantic distributions between synthetic and real-world captured datasets, there exists semantic mismatch between synthetic and refined images, which in turn results in the semantic distortion. Recently, contrastive learning (CL) has been successfully used to pull correlated patches together and push uncorrelated ones apart. In this work, we exploit semantic and structural consistency between synthetic and refined images and adopt CL to reduce the semantic distortion. Besides, we incorporate hard negative mining to improve the performance furthermore. We compare the performance of our method with several other benchmarking methods using qualitative and quantitative measures and show that our method offers the state-of-the-art performance.
△ Less
Submitted 26 April, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Knowledge Graph Embedding with 3D Compound Geometric Transformations
Authors:
Xiou Ge,
Yun-Cheng Wang,
Bin Wang,
C. -C. Jay Kuo
Abstract:
The cascade of 2D geometric transformations were exploited to model relations between entities in a knowledge graph (KG), leading to an effective KG embedding (KGE) model, CompoundE. Furthermore, the rotation in the 3D space was proposed as a new KGE model, Rotate3D, by leveraging its non-commutative property. Inspired by CompoundE and Rotate3D, we leverage 3D compound geometric transformations, i…
▽ More
The cascade of 2D geometric transformations were exploited to model relations between entities in a knowledge graph (KG), leading to an effective KG embedding (KGE) model, CompoundE. Furthermore, the rotation in the 3D space was proposed as a new KGE model, Rotate3D, by leveraging its non-commutative property. Inspired by CompoundE and Rotate3D, we leverage 3D compound geometric transformations, including translation, rotation, scaling, reflection, and shear and propose a family of KGE models, named CompoundE3D, in this work. CompoundE3D allows multiple design variants to match rich underlying characteristics of a KG. Since each variant has its own advantages on a subset of relations, an ensemble of multiple variants can yield superior performance. The effectiveness and flexibility of CompoundE3D are experimentally verified on four popular link prediction datasets.
△ Less
Submitted 1 April, 2023;
originally announced April 2023.
-
Hot QCD White Paper
Authors:
M. Arslandok,
S. A. Bass,
A. A. Baty,
I. Bautista,
C. Beattie,
F. Becattini,
R. Bellwied,
Y. Berdnikov,
A. Berdnikov,
J. Bielcik,
J. T. Blair,
F. Bock,
B. Boimska,
H. Bossi,
H. Caines,
Y. Chen,
Y. -T. Chien,
M. Chiu,
M. E. Connors,
M. Csanád,
C. L. da Silva,
A. P. Dash,
G. David,
K. Dehmelt,
V. Dexheimer
, et al. (149 additional authors not shown)
Abstract:
Hot QCD physics studies the nuclear strong force under extreme temperature and densities. Experimentally these conditions are achieved via high-energy collisions of heavy ions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). In the past decade, a unique and substantial suite of data was collected at RHIC and the LHC, probing hydrodynamics at the nucleon scale, the…
▽ More
Hot QCD physics studies the nuclear strong force under extreme temperature and densities. Experimentally these conditions are achieved via high-energy collisions of heavy ions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). In the past decade, a unique and substantial suite of data was collected at RHIC and the LHC, probing hydrodynamics at the nucleon scale, the temperature dependence of the transport properties of quark-gluon plasma, the phase diagram of nuclear matter, the interaction of quarks and gluons at different scales and much more. This document, as part of the 2023 nuclear science long range planning process, was written to review the progress in hot QCD since the 2015 Long Range Plan for Nuclear Science, as well as highlight the realization of previous recommendations, and present opportunities for the next decade, building on the accomplishments and investments made in theoretical developments and the construction of new detectors. Furthermore, this document provides additional context to support the recommendations voted on at the Joint Hot and Cold QCD Town Hall Meeting, which are reported in a separate document.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
Lightweight High-Performance Blind Image Quality Assessment
Authors:
Zhanxuan Mei,
Yun-Cheng Wang,
Xingze He,
Yong Yan,
C. -C. Jay Kuo
Abstract:
Blind image quality assessment (BIQA) is a task that predicts the perceptual quality of an image without its reference. Research on BIQA attracts growing attention due to the increasing amount of user-generated images and emerging mobile applications where reference images are unavailable. The problem is challenging due to the wide range of content and mixed distortion types. Many existing BIQA me…
▽ More
Blind image quality assessment (BIQA) is a task that predicts the perceptual quality of an image without its reference. Research on BIQA attracts growing attention due to the increasing amount of user-generated images and emerging mobile applications where reference images are unavailable. The problem is challenging due to the wide range of content and mixed distortion types. Many existing BIQA methods use deep neural networks (DNNs) to achieve high performance. However, their large model sizes hinder their applicability to edge or mobile devices. To meet the need, a novel BIQA method with a small model, low computational complexity, and high performance is proposed and named "GreenBIQA" in this work. GreenBIQA includes five steps: 1) image crop**, 2) unsupervised representation generation, 3) supervised feature selection, 4) distortion-specific prediction, and 5) regression and decision ensemble. Experimental results show that the performance of GreenBIQA is comparable with that of state-of-the-art deep-learning (DL) solutions while demanding a much smaller model size and significantly lower computational complexity.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
Tomography Scan of Charge Density Wave in NbSe2
Authors:
Jyun-Yu Wu,
Yung-Ting Lee,
Guan-Hao Chen,
Zheng-Hong Li,
Chang-Tsan Lee,
Jie-Yu Hsu,
Chia-Nung Kuo,
Juhn-Jong Lin,
Wen-Hao Chang,
Chin-Shan Lue,
Po-Tuan Cheng,
Cheng-Tien Chiang,
Chien-Cheng Kuo,
Chien-Te Wu,
Chi-Cheng Lee,
Ming-Chiang Chung,
Hung-Chung Hsueh,
Chun-Liang Lin
Abstract:
Charge density wave (CDW) resulted from a small distortion in the lattice is able to create new orders beyond the original lattice. In 2H-NbSe2, one of the layered transition metal dichalcogenides (TMD), the 3x3 charge order appears in two-dimensional (2D) layers. Although CDW is usually described by a sine wave, the spatial distribution within a 2D layer has never been systematically visualized.…
▽ More
Charge density wave (CDW) resulted from a small distortion in the lattice is able to create new orders beyond the original lattice. In 2H-NbSe2, one of the layered transition metal dichalcogenides (TMD), the 3x3 charge order appears in two-dimensional (2D) layers. Although CDW is usually described by a sine wave, the spatial distribution within a 2D layer has never been systematically visualized. Here by using scanning tunneling microscopy (STM) and density functional theory (DFT), we have monitored the evolution of 3x3 CDW along c-axis and realized a nearly tomography scan of CDW of the topmost layer. The results show that the strength of 3x3 charge order varies while increasing the tunneling current. The 3x3 charge order is relatively strong at the outermost Se level and decreases while probing in between Se and Nb levels. Interestingly, the 3x3 charge order gets strong again as reaching Nb level but along with a phase shift. We further calculated the orbital charge distributions and found that both CDW intensity modulation and phase shift are strongly correlated with the distribution of Se p orbitals and Nb d orbitals.
△ Less
Submitted 21 March, 2023;
originally announced March 2023.
-
Comparison of Polarized Radiative Transfer Codes used by the EHT Collaboration
Authors:
Ben S. Prather,
Jason Dexter,
Monika Moscibrodzka,
Hung-Yi Pu,
Thomas Bronzwaer,
Jordy Davelaar,
Ziri Younsi,
Charles F. Gammie,
Roman Gold,
George N. Wong,
Kazunori Akiyama,
Antxon Alberdi,
Walter Alef,
Juan Carlos Algaba,
Richard Anantua,
Keiichi Asada,
Rebecca Azulay,
Uwe Bach,
Anne-Kathrin Baczko,
David Ball,
Mislav Baloković,
John Barrett,
Michi Bauböck,
Bradford A. Benson,
Dan Bintley
, et al. (248 additional authors not shown)
Abstract:
Interpretation of resolved polarized images of black holes by the Event Horizon Telescope (EHT) requires predictions of the polarized emission observable by an Earth-based instrument for a particular model of the black hole accretion system. Such predictions are generated by general relativistic radiative transfer (GRRT) codes, which integrate the equations of polarized radiative transfer in curve…
▽ More
Interpretation of resolved polarized images of black holes by the Event Horizon Telescope (EHT) requires predictions of the polarized emission observable by an Earth-based instrument for a particular model of the black hole accretion system. Such predictions are generated by general relativistic radiative transfer (GRRT) codes, which integrate the equations of polarized radiative transfer in curved spacetime. A selection of ray-tracing GRRT codes used within the EHT collaboration is evaluated for accuracy and consistency in producing a selection of test images, demonstrating that the various methods and implementations of radiative transfer calculations are highly consistent. When imaging an analytic accretion model, we find that all codes produce images similar within a pixel-wise normalized mean squared error (NMSE) of 0.012 in the worst case. When imaging a snapshot from a cell-based magnetohydrodynamic simulation, we find all test images to be similar within NMSEs of 0.02, 0.04, 0.04, and 0.12 in Stokes I, Q, U , and V respectively. We additionally find the values of several image metrics relevant to published EHT results to be in agreement to much better precision than measurement uncertainties.
△ Less
Submitted 21 March, 2023;
originally announced March 2023.
-
A Tiny Machine Learning Model for Point Cloud Object Classification
Authors:
Min Zhang,
**tang Xue,
Pranav Kadam,
Hardik Prajapati,
Shan Liu,
C. -C. Jay Kuo
Abstract:
The design of a tiny machine learning model, which can be deployed in mobile and edge devices, for point cloud object classification is investigated in this work. To achieve this objective, we replace the multi-scale representation of a point cloud object with a single-scale representation for complexity reduction, and exploit rich 3D geometric information of a point cloud object for performance i…
▽ More
The design of a tiny machine learning model, which can be deployed in mobile and edge devices, for point cloud object classification is investigated in this work. To achieve this objective, we replace the multi-scale representation of a point cloud object with a single-scale representation for complexity reduction, and exploit rich 3D geometric information of a point cloud object for performance improvement. The proposed solution is named Green-PointHop due to its low computational complexity. We evaluate the performance of Green-PointHop on ModelNet40 and ScanObjectNN two datasets. Green-PointHop has a model size of 64K parameters. It demands 2.3M floating-point operations (FLOPs) to classify a ModelNet40 object of 1024 down-sampled points. Its classification performance gaps against the state-of-the-art DGCNN method are 3% and 7% for ModelNet40 and ScanObjectNN, respectively. On the other hand, the model size and inference complexity of DGCNN are 42X and 1203X of those of Green-PointHop, respectively.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
An Overview on Language Models: Recent Developments and Outlook
Authors:
Chengwei Wei,
Yun-Cheng Wang,
Bin Wang,
C. -C. Jay Kuo
Abstract:
Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine translation, etc. Conventional language models (CLMs) aim to predict the probability of linguistic sequences in a causal manner, while pre-trained language models (PLMs) c…
▽ More
Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine translation, etc. Conventional language models (CLMs) aim to predict the probability of linguistic sequences in a causal manner, while pre-trained language models (PLMs) cover broader concepts and can be used in both causal sequential modeling and fine-tuning for downstream applications. PLMs have their own training paradigms (usually self-supervised) and serve as foundation models in modern NLP systems. This overview paper provides an introduction to both CLMs and PLMs from five aspects, i.e., linguistic units, architectures, training methods, evaluation methods, and applications. Furthermore, we discuss the relationship between CLMs and PLMs and shed light on the future directions of language modeling in the pre-trained era.
△ Less
Submitted 3 July, 2023; v1 submitted 10 March, 2023;
originally announced March 2023.
-
Forecasts of CMB lensing reconstruction of AliCPT-1 from the foreground cleaned polarization data
Authors:
Jiakang Han,
Bin Hu,
Shamik Ghosh,
Siyu Li,
Jiazheng Dou,
Jacques Delabrouille,
**g **,
Hong Li,
Yang Liu,
Mathieu Remazeilles,
Wen Zhao,
Pengjie Zhang,
Zheng-Wei Li,
Cong-Zhan Liu,
Yong-jie Zhang,
Chao-Lin Kuo,
Xinmin Zhang
Abstract:
Cosmic microwave background radiation (CMB) observations are unavoidably contaminated by emission from various extra-galactic foregrounds, which must be removed to obtain reliable measurements of the cosmological signal. In this paper, we demonstrate CMB lensing reconstruction in AliCPT-1 after foreground removal, combine the two bands of AliCPT-1 (90 and 150~GHz) with Planck HFI bands (100, 143,…
▽ More
Cosmic microwave background radiation (CMB) observations are unavoidably contaminated by emission from various extra-galactic foregrounds, which must be removed to obtain reliable measurements of the cosmological signal. In this paper, we demonstrate CMB lensing reconstruction in AliCPT-1 after foreground removal, combine the two bands of AliCPT-1 (90 and 150~GHz) with Planck HFI bands (100, 143, 217 and 353~GHz) and with the WMAP-K band (23~GHz). In order to balance contamination by instrumental noise and foreground residual bias, we adopt the Needlet Internal Linear Combination (NILC) method to clean the E-map and the constrained Internal Linear Combination (cILC) method to clean the B-map. The latter utilizes additional constraints on average frequency scaling of the dust and synchrotron to remove foregrounds at the expense of somewhat noisier maps. Assuming 4 modules observing 1 season from simulation data, the resulting effective residual noise in E- and B-map are roughly $15~μ{\rm K}\cdot{\rm arcmin}$ and $25~μ{\rm K}\cdot{\rm arcmin}$, respectively. As a result, the CMB lensing reconstruction signal-to-noise ratio (SNR) from polarization data is about SNR$\,\approx\,$4.5. This lensing reconstruction capability is comparable to that of other stage-III small aperture millimeter CMB telescopes.
△ Less
Submitted 10 March, 2023;
originally announced March 2023.
-
Emergent unitarity, all-loop cuts and integrations from the ABJM amplituhedron
Authors:
Song He,
Chia-Kai Kuo,
Zhenjie Li,
Yao-Qi Zhang
Abstract:
We elaborate on aspects of a new positive geometry proposed recently, which was conjectured to be the four-point amplituhedron for ABJM theory. We study generalized unitarity cuts from the geometry, and in particular we prove that (1) the four-point integrand satisfies perturbative unitarity (or optical theorem) to all loops, which follows directly from the geometry, and (2) vanishing cuts involvi…
▽ More
We elaborate on aspects of a new positive geometry proposed recently, which was conjectured to be the four-point amplituhedron for ABJM theory. We study generalized unitarity cuts from the geometry, and in particular we prove that (1) the four-point integrand satisfies perturbative unitarity (or optical theorem) to all loops, which follows directly from the geometry, and (2) vanishing cuts involving odd-point amplitudes follow from the ``bipartite" nature of the associated ``negative geometries", which justifies their appearance in ABJM theory. We also take a first step in integrating the forms of these negative geometries and obtain an infrared-finite quantity up to two loops, from which we extract the cusp anomalous dimension at leading order.
△ Less
Submitted 7 June, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
PointFlowHop: Green and Interpretable Scene Flow Estimation from Consecutive Point Clouds
Authors:
Pranav Kadam,
Jiahao Gu,
Shan Liu,
C. -C. Jay Kuo
Abstract:
An efficient 3D scene flow estimation method called PointFlowHop is proposed in this work. PointFlowHop takes two consecutive point clouds and determines the 3D flow vectors for every point in the first point cloud. PointFlowHop decomposes the scene flow estimation task into a set of subtasks, including ego-motion compensation, object association and object-wise motion estimation. It follows the g…
▽ More
An efficient 3D scene flow estimation method called PointFlowHop is proposed in this work. PointFlowHop takes two consecutive point clouds and determines the 3D flow vectors for every point in the first point cloud. PointFlowHop decomposes the scene flow estimation task into a set of subtasks, including ego-motion compensation, object association and object-wise motion estimation. It follows the green learning (GL) pipeline and adopts the feedforward data processing path. As a result, its underlying mechanism is more transparent than deep-learning (DL) solutions based on end-to-end optimization of network parameters. We conduct experiments on the stereoKITTI and the Argoverse LiDAR point cloud datasets and demonstrate that PointFlowHop outperforms deep-learning methods with a small model size and less training time. Furthermore, we compare the Floating Point Operations (FLOPs) required by PointFlowHop and other learning-based methods in inference, and show its big savings in computational complexity.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
LSR: A Light-Weight Super-Resolution Method
Authors:
Wei Wang,
Xue**g Lei,
Yueru Chen,
Ming-Sui Lee,
C. -C. Jay Kuo
Abstract:
A light-weight super-resolution (LSR) method from a single image targeting mobile applications is proposed in this work. LSR predicts the residual image between the interpolated low-resolution (ILR) and high-resolution (HR) images using a self-supervised framework. To lower the computational complexity, LSR does not adopt the end-to-end optimization deep networks. It consists of three modules: 1)…
▽ More
A light-weight super-resolution (LSR) method from a single image targeting mobile applications is proposed in this work. LSR predicts the residual image between the interpolated low-resolution (ILR) and high-resolution (HR) images using a self-supervised framework. To lower the computational complexity, LSR does not adopt the end-to-end optimization deep networks. It consists of three modules: 1) generation of a pool of rich and diversified representations in the neighborhood of a target pixel via unsupervised learning, 2) selecting a subset from the representation pool that is most relevant to the underlying super-resolution task automatically via supervised learning, 3) predicting the residual of the target pixel via regression. LSR has low computational complexity and reasonable model size so that it can be implemented on mobile/edge platforms conveniently. Besides, it offers better visual quality than classical exemplar-based methods in terms of PSNR/SSIM measures.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
S3I-PointHop: SO(3)-Invariant PointHop for 3D Point Cloud Classification
Authors:
Pranav Kadam,
Hardik Prajapati,
Min Zhang,
**tang Xue,
Shan Liu,
C. -C. Jay Kuo
Abstract:
Many point cloud classification methods are developed under the assumption that all point clouds in the dataset are well aligned with the canonical axes so that the 3D Cartesian point coordinates can be employed to learn features. When input point clouds are not aligned, the classification performance drops significantly. In this work, we focus on a mathematically transparent point cloud classific…
▽ More
Many point cloud classification methods are developed under the assumption that all point clouds in the dataset are well aligned with the canonical axes so that the 3D Cartesian point coordinates can be employed to learn features. When input point clouds are not aligned, the classification performance drops significantly. In this work, we focus on a mathematically transparent point cloud classification method called PointHop, analyze its reason for failure due to pose variations, and solve the problem by replacing its pose dependent modules with rotation invariant counterparts. The proposed method is named SO(3)-Invariant PointHop (or S3I-PointHop in short). We also significantly simplify the PointHop pipeline using only one single hop along with multiple spatial aggregation techniques. The idea of exploiting more spatial information is novel. Experiments on the ModelNet40 dataset demonstrate the superiority of S3I-PointHop over traditional PointHop-like methods.
△ Less
Submitted 22 February, 2023;
originally announced February 2023.
-
Spin State Disproportionation in Insulating Ferromagnetic LaCoO3 Epitaxial Thin Films
Authors:
Shanquan Chen,
Jhong-Yi Chang,
Qinghua Zhang,
Qiuyue Li,
Ting Lin,
Fanqi Meng,
Haoliang Huang,
Shengwei Zeng,
Xinmao Yin,
My Ngoc Duong,
Yalin Lu,
Lang Chen,
Er-Jia Guo,
Hanghui Chen,
Chun-Fu Chang,
Chang-Yang Kuo,
Zuhuang Chen
Abstract:
The origin of insulating ferromagnetism in epitaxial LaCoO3 films under tensile strain remains elusive despite extensive research efforts have been devoted. Surprisingly, the spin state of its Co ions, the main parameter of its ferromagnetism, is still to be determined. Here, we have systematically investigated the spin state in epitaxial LaCoO3 thin films to clarify the mechanism of strain induce…
▽ More
The origin of insulating ferromagnetism in epitaxial LaCoO3 films under tensile strain remains elusive despite extensive research efforts have been devoted. Surprisingly, the spin state of its Co ions, the main parameter of its ferromagnetism, is still to be determined. Here, we have systematically investigated the spin state in epitaxial LaCoO3 thin films to clarify the mechanism of strain induced ferromagnetism using element-specific x-ray absorption spectroscopy and dichroism. Combining with the configuration interaction cluster calculations, we unambiguously demonstrate that Co3+ in LaCoO3 films under compressive strain (on LaAlO3 substrate) are practically a low spin state, whereas Co3+ in LaCoO3 films under tensile strain (on SrTiO3 substrate) have mixed high spin and low spin states with a ratio close to 1:3. From the identification of this spin state ratio, we infer that the dark strips observed by high-resolution scanning transmission electron microscopy indicate the position of Co3+ high spin state, i.e., an observation of a spin state disproportionation in tensile-strained LaCoO3 films. This consequently explains the nature of ferromagnetism in LaCoO3 films.
△ Less
Submitted 12 February, 2023;
originally announced February 2023.
-
gpcgc: a green point cloud geometry coding method
Authors:
Qingyang Zhou,
Shan Liu,
C. -C. Jay Kuo
Abstract:
A low-complexity point cloud compression method called the Green Point Cloud Geometry Codec (GPCGC), is proposed to encode the 3D spatial coordinates of static point clouds efficiently. GPCGC consists of two modules. In the first module, point coordinates of input point clouds are hierarchically organized into an octree structure. Points at each leaf node are projected along one of three axes to y…
▽ More
A low-complexity point cloud compression method called the Green Point Cloud Geometry Codec (GPCGC), is proposed to encode the 3D spatial coordinates of static point clouds efficiently. GPCGC consists of two modules. In the first module, point coordinates of input point clouds are hierarchically organized into an octree structure. Points at each leaf node are projected along one of three axes to yield image maps. In the second module, the occupancy map is clustered into 9 modes while the depth map is coded by a low-complexity high-efficiency image codec, called the green image codec (GIC). GIC is a multi-resolution codec based on vector quantization (VQ). Its complexity is significantly lower than HEVC-Intra. Furthermore, the rate-distortion optimization (RDO) technique is used to select the optimal coding parameters. GPCGC is a progressive codec, and it offers a coding performance competitive with MPEG's V-PCC and G-PCC standards at significantly lower complexity.
△ Less
Submitted 12 February, 2023;
originally announced February 2023.
-
Van der Waals device integration beyond the limits of van der Waals forces via adhesive matrix transfer
Authors:
Peter F. Satterthwaite,
Weikun Zhu,
Patricia Jastrzebska-Perfect,
Melbourne Tang,
Hongze Gao,
Hikari Kitadai,
Ang-Yu Lu,
Qishuo Tan,
Shin-Yi Tang,
Yu-Lun Chueh,
Chia-Nung Kuo,
Chin Shan Lue,
**g Kong,
Xi Ling,
Farnaz Niroui
Abstract:
Pristine van der Waals (vdW) interfaces between two-dimensional (2D) and other materials are core to emerging optical and electronic devices. Their direct fabrication is, however, challenged as the vdW forces are weak and cannot be tuned to accommodate integration of arbitrary layers without solvents, sacrificial-layers or high-temperatures, steps that can introduce damage. To address these limita…
▽ More
Pristine van der Waals (vdW) interfaces between two-dimensional (2D) and other materials are core to emerging optical and electronic devices. Their direct fabrication is, however, challenged as the vdW forces are weak and cannot be tuned to accommodate integration of arbitrary layers without solvents, sacrificial-layers or high-temperatures, steps that can introduce damage. To address these limitations, we introduce a single-step 2D material-to-device integration approach in which forces promoting transfer are decoupled from the vdW forces at the interface of interest. We use this adhesive matrix transfer to demonstrate conventionally-forbidden direct integration of diverse 2D materials (MoS2, WSe2, PtS2, GaS) with dielectrics (SiO2, Al2O3), and scalable, aligned heterostructure formation, both foundational to device development. We then demonstrate a single-step integration of monolayer-MoS2 into arrays of transistors. With no exposure to polymers or solvents, clean interfaces and pristine surfaces are preserved, which can be further engineered to demonstrate both n- and p-type behavior. Beyond serving as a platform to probe the intrinsic properties of sensitive nanomaterials without the influence of processing steps, our technique allows efficient formation of unconventional device form-factors, with an example of flexible transistors demonstrated.
△ Less
Submitted 12 February, 2023;
originally announced February 2023.
-
183 GHz water megamasers in active galactic nuclei: a new accretion disk tracer
Authors:
Dominic W. Pesce,
James A. Braatz,
Christian Henkel,
Elizabeth M. L. Humphreys,
C. M. Violette Impellizzeri,
Cheng-Yu Kuo
Abstract:
We present the results of an ALMA survey to identify 183 GHz H$_2$O maser emission from AGN already known to host 22 GHz megamaser systems. Out of 20 sources observed, we detect significant 183 GHz maser emission from 13; this survey thus increases the number of AGN known to host (sub)millimeter megamasers by a factor of 5. We find that the 183 GHz emission is systematically fainter than the 22 GH…
▽ More
We present the results of an ALMA survey to identify 183 GHz H$_2$O maser emission from AGN already known to host 22 GHz megamaser systems. Out of 20 sources observed, we detect significant 183 GHz maser emission from 13; this survey thus increases the number of AGN known to host (sub)millimeter megamasers by a factor of 5. We find that the 183 GHz emission is systematically fainter than the 22 GHz emission from the same targets, with typical flux densities being roughly an order of magnitude lower at 183 GHz than at 22 GHz. However, the isotropic luminosities of the detected 183 GHz sources are comparable to their 22 GHz values. For two of our sources -- ESO 269-G012 and the Circinus galaxy -- we detect rich 183 GHz spectral structure containing multiple line complexes. The 183 GHz spectrum of ESO 269-G012 exhibits the triple-peaked structure characteristic of an edge-on AGN disk system. The Circinus galaxy contains the strongest 183 GHz emission detected in our sample, peaking at a flux density of nearly 5 Jy. The high signal-to-noise ratios achieved by these strong lines enable a coarse map** of the 183 GHz maser system, in which the masers appear to be distributed similarly to those seen in VLBI maps of the 22 GHz system in the same galaxy and may be tracing the circumnuclear accretion disk at larger orbital radii than are occupied by the 22 GHz masers. This newly identified population of AGN disk megamasers presents a motivation for develo** VLBI capabilities at 183 GHz.
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
Observation of highly anisotropic bulk dispersion and spin-polarized topological surface states in CoTe2
Authors:
Atasi Chakraborty,
Jun Fujii,
Chia-Nung Kuo,
Chin Shan Lue,
Antonio Politano,
Ivana Vobornik,
Amit Agarwal
Abstract:
We present CoTe2 as a new type-II Dirac semimetal supporting Lorentz symmetry violating Dirac fermions in the vicinity of the Fermi energy. By combining first principle ab-initio calculations with experimental angle-resolved photo-emission spectroscopy results, we show the CoTe2 hosts a pair of type-II Dirac fermions around 90 meV above the Fermi energy. In addition to the bulk Dirac fermions, we…
▽ More
We present CoTe2 as a new type-II Dirac semimetal supporting Lorentz symmetry violating Dirac fermions in the vicinity of the Fermi energy. By combining first principle ab-initio calculations with experimental angle-resolved photo-emission spectroscopy results, we show the CoTe2 hosts a pair of type-II Dirac fermions around 90 meV above the Fermi energy. In addition to the bulk Dirac fermions, we find several topological band inversions in bulk CoTe2, which gives rise to a ladder of spin-polarized surface states over a wide range of energies. In contrast to the surface states which typically display Rashba-type in-plane spin splitting, we find that CoTe2 hosts novel out-of-plane spin polarization as well. Our work establishes CoTe2 as a potential candidate for the exploration of Dirac fermiology and applications in spintronic devices, infrared plasmonics, and ultrafast optoelectronics.
△ Less
Submitted 27 January, 2023;
originally announced January 2023.
-
Successive Subspace Learning for Cardiac Disease Classification with Two-phase Deformation Fields from Cine MRI
Authors:
Xiaofeng Liu,
Fangxu Xing,
Hanna K. Gaggin,
C. -C. Jay Kuo,
Georges El Fakhri,
Jonghye Woo
Abstract:
Cardiac cine magnetic resonance imaging (MRI) has been used to characterize cardiovascular diseases (CVD), often providing a noninvasive phenoty** tool.~While recently flourished deep learning based approaches using cine MRI yield accurate characterization results, the performance is often degraded by small training samples. In addition, many deep learning models are deemed a ``black box," for w…
▽ More
Cardiac cine magnetic resonance imaging (MRI) has been used to characterize cardiovascular diseases (CVD), often providing a noninvasive phenoty** tool.~While recently flourished deep learning based approaches using cine MRI yield accurate characterization results, the performance is often degraded by small training samples. In addition, many deep learning models are deemed a ``black box," for which models remain largely elusive in how models yield a prediction and how reliable they are. To alleviate this, this work proposes a lightweight successive subspace learning (SSL) framework for CVD classification, based on an interpretable feedforward design, in conjunction with a cardiac atlas. Specifically, our hierarchical SSL model is based on (i) neighborhood voxel expansion, (ii) unsupervised subspace approximation, (iii) supervised regression, and (iv) multi-level feature integration. In addition, using two-phase 3D deformation fields, including end-diastolic and end-systolic phases, derived between the atlas and individual subjects as input offers objective means of assessing CVD, even with small training samples. We evaluate our framework on the ACDC2017 database, comprising one healthy group and four disease groups. Compared with 3D CNN-based approaches, our framework achieves superior classification performance with 140$\times$ fewer parameters, which supports its potential value in clinical use.
△ Less
Submitted 21 January, 2023;
originally announced January 2023.
-
TeD-Q: a tensor network enhanced distributed hybrid quantum machine learning framework
Authors:
Yaocheng Chen,
Xingyao Wu,
Chung-Yun Kuo,
Yuxuan Du,
Dacheng Tao
Abstract:
TeD-Q is an open-source software framework for quantum machine learning, variational quantum algorithm (VQA), and simulation of quantum computing. It seamlessly integrates classical machine learning libraries with quantum simulators, giving users the ability to leverage the power of classical machine learning while training quantum machine learning models. TeD-Q supports auto-differentiation that…
▽ More
TeD-Q is an open-source software framework for quantum machine learning, variational quantum algorithm (VQA), and simulation of quantum computing. It seamlessly integrates classical machine learning libraries with quantum simulators, giving users the ability to leverage the power of classical machine learning while training quantum machine learning models. TeD-Q supports auto-differentiation that provides backpropagation, parameters shift, and finite difference methods to obtain gradients. With tensor contraction, simulation of quantum circuits with large number of qubits is possible. TeD-Q also provides a graphical mode in which the quantum circuit and the training progress can be visualized in real-time.
△ Less
Submitted 13 January, 2023;
originally announced January 2023.
-
Design and Control of a Novel Variable Stiffness Series Elastic Actuator
Authors:
Emre Sariyildiz,
Rahim Mutlu,
Jon Roberts,
Chin-Hsing Kuo,
Barkan Ugurlu
Abstract:
This paper expounds the design and control of a new Variable Stiffness Series Elastic Actuator (VSSEA). It is established by employing a modular mechanical design approach that allows us to effectively optimise the stiffness modulation characteristics and power density of the actuator. The proposed VSSEA possesses the following features: i) no limitation in the work-range of output link, ii) a wid…
▽ More
This paper expounds the design and control of a new Variable Stiffness Series Elastic Actuator (VSSEA). It is established by employing a modular mechanical design approach that allows us to effectively optimise the stiffness modulation characteristics and power density of the actuator. The proposed VSSEA possesses the following features: i) no limitation in the work-range of output link, ii) a wide range of stiffness modulation (~20Nm/rad to ~1KNm/rad), iii) low-energy-cost stiffness modulation at equilibrium and non-equilibrium positions, iv) compact design and high torque density (~36Nm/kg), and v) high-speed stiffness modulation (~3000Nm/rad/s). Such features can help boost the safety and performance of many advanced robotic systems, e.g., a cobot that physically interacts with unstructured environments and an exoskeleton that provides physical assistance to human users. These features can also enable us to utilise variable stiffness property to attain various regulation and trajectory tracking control tasks only by employing conventional controllers, eliminating the need for synthesising complex motion control systems in compliant actuation. To this end, it is experimentally demonstrated that the proposed VSSEA is capable of precisely tracking desired position and force control references through the use of conventional Proportional-Integral-Derivative (PID) controllers.
△ Less
Submitted 2 January, 2023;
originally announced January 2023.
-
SupeRGB-D: Zero-shot Instance Segmentation in Cluttered Indoor Environments
Authors:
Evin Pınar Örnek,
Aravindhan K Krishnan,
Shreekant Gayaka,
Cheng-Hao Kuo,
Arnie Sen,
Nassir Navab,
Federico Tombari
Abstract:
Object instance segmentation is a key challenge for indoor robots navigating cluttered environments with many small objects. Limitations in 3D sensing capabilities often make it difficult to detect every possible object. While deep learning approaches may be effective for this problem, manually annotating 3D data for supervised learning is time-consuming. In this work, we explore zero-shot instanc…
▽ More
Object instance segmentation is a key challenge for indoor robots navigating cluttered environments with many small objects. Limitations in 3D sensing capabilities often make it difficult to detect every possible object. While deep learning approaches may be effective for this problem, manually annotating 3D data for supervised learning is time-consuming. In this work, we explore zero-shot instance segmentation (ZSIS) from RGB-D data to identify unseen objects in a semantic category-agnostic manner. We introduce a zero-shot split for Tabletop Objects Dataset (TOD-Z) to enable this study and present a method that uses annotated objects to learn the ``objectness'' of pixels and generalize to unseen object categories in cluttered indoor environments. Our method, SupeRGB-D, groups pixels into small patches based on geometric cues and learns to merge the patches in a deep agglomerative clustering fashion. SupeRGB-D outperforms existing baselines on unseen objects while achieving similar performance on seen objects. We further show competitive results on the real dataset OCID. With its lightweight design (0.4 MB memory requirement), our method is extremely suitable for mobile and robotic applications. Additional DINO features can increase performance with a higher memory requirement. The dataset split and code are available at https://github.com/evinpinar/supergb-d.
△ Less
Submitted 25 May, 2023; v1 submitted 22 December, 2022;
originally announced December 2022.
-
SALVE: Self-supervised Adaptive Low-light Video Enhancement
Authors:
Zohreh Azizi,
C. -C. Jay Kuo
Abstract:
A self-supervised adaptive low-light video enhancement method, called SALVE, is proposed in this work. SALVE first enhances a few key frames of an input low-light video using a retinex-based low-light image enhancement technique. For each keyframe, it learns a map** from low-light image patches to enhanced ones via ridge regression. These map**s are then used to enhance the remaining frames in…
▽ More
A self-supervised adaptive low-light video enhancement method, called SALVE, is proposed in this work. SALVE first enhances a few key frames of an input low-light video using a retinex-based low-light image enhancement technique. For each keyframe, it learns a map** from low-light image patches to enhanced ones via ridge regression. These map**s are then used to enhance the remaining frames in the low-light video. The combination of traditional retinex-based image enhancement and learning-based ridge regression leads to a robust, adaptive and computationally inexpensive solution to enhance low-light videos. Our extensive experiments along with a user study show that 87% of participants prefer SALVE over prior work.
△ Less
Submitted 21 February, 2023; v1 submitted 22 December, 2022;
originally announced December 2022.
-
A Measurement of the CMB Temperature Power Spectrum and Constraints on Cosmology from the SPT-3G 2018 TT/TE/EE Data Set
Authors:
L. Balkenhol,
D. Dutcher,
A. Spurio Mancini,
A. Doussot,
K. Benabed,
S. Galli,
P. A. R. Ade,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
P. M. Chichura,
T. -L. Chou,
A. Coerver,
T. M. Crawford
, et al. (62 additional authors not shown)
Abstract:
We present a sample-variance-limited measurement of the temperature power spectrum ($TT$) of the cosmic microwave background (CMB) using observations of a $\sim\! 1500 \,\mathrm{deg}^2$ field made by SPT-3G in 2018. We report multifrequency power spectrum measurements at 95, 150, and 220GHz covering the angular multipole range $750 \leq \ell < 3000$. We combine this $TT$ measurement with the publi…
▽ More
We present a sample-variance-limited measurement of the temperature power spectrum ($TT$) of the cosmic microwave background (CMB) using observations of a $\sim\! 1500 \,\mathrm{deg}^2$ field made by SPT-3G in 2018. We report multifrequency power spectrum measurements at 95, 150, and 220GHz covering the angular multipole range $750 \leq \ell < 3000$. We combine this $TT$ measurement with the published polarization power spectrum measurements from the 2018 observing season and update their associated covariance matrix to complete the SPT-3G 2018 $TT/TE/EE$ data set. This is the first analysis to present cosmological constraints from SPT $TT$, $TE$, and $EE$ power spectrum measurements jointly. We blind the cosmological results and subject the data set to a series of consistency tests at the power spectrum and parameter level. We find excellent agreement between frequencies and spectrum types and our results are robust to the modeling of astrophysical foregrounds. We report results for $Λ$CDM and a series of extensions, drawing on the following parameters: the amplitude of the gravitational lensing effect on primary power spectra $A_\mathrm{L}$, the effective number of neutrino species $N_{\mathrm{eff}}$, the primordial helium abundance $Y_{\mathrm{P}}$, and the baryon clum** factor due to primordial magnetic fields $b$. We find that the SPT-3G 2018 $T/TE/EE$ data are well fit by $Λ$CDM with a probability-to-exceed of $15\%$. For $Λ$CDM, we constrain the expansion rate today to $H_0 = 68.3 \pm 1.5\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$ and the combined structure growth parameter to $S_8 = 0.797 \pm 0.042$. The SPT-based results are effectively independent of Planck, and the cosmological parameter constraints from either data set are within $<1\,σ$ of each other. (abridged)
△ Less
Submitted 27 July, 2023; v1 submitted 11 December, 2022;
originally announced December 2022.
-
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
Authors:
Chia-Wen Kuo,
Chih-Yao Ma,
Judy Hoffman,
Zsolt Kira
Abstract:
In Vision-and-Language Navigation (VLN), researchers typically take an image encoder pre-trained on ImageNet without fine-tuning on the environments that the agent will be trained or tested on. However, the distribution shift between the training images from ImageNet and the views in the navigation environments may render the ImageNet pre-trained image encoder suboptimal. Therefore, in this paper,…
▽ More
In Vision-and-Language Navigation (VLN), researchers typically take an image encoder pre-trained on ImageNet without fine-tuning on the environments that the agent will be trained or tested on. However, the distribution shift between the training images from ImageNet and the views in the navigation environments may render the ImageNet pre-trained image encoder suboptimal. Therefore, in this paper, we design a set of structure-encoding auxiliary tasks (SEA) that leverage the data in the navigation environments to pre-train and improve the image encoder. Specifically, we design and customize (1) 3D jigsaw, (2) traversability prediction, and (3) instance classification to pre-train the image encoder. Through rigorous ablations, our SEA pre-trained features are shown to better encode structural information of the scenes, which ImageNet pre-trained features fail to properly encode but is crucial for the target navigation task. The SEA pre-trained features can be easily plugged into existing VLN agents without any tuning. For example, on Test-Unseen environments, the VLN agents combined with our SEA pre-trained features achieve absolute success rate improvement of 12% for Speaker-Follower, 5% for Env-Dropout, and 4% for AuxRN.
△ Less
Submitted 20 November, 2022;
originally announced November 2022.
-
Improving Federated Learning Communication Efficiency with Global Momentum Fusion for Gradient Compression Schemes
Authors:
Chun-Chih Kuo,
Ted Tsei Kuo,
Chia-Yu Lin
Abstract:
Communication costs within Federated learning hinder the system scalability for reaching more data from more clients. The proposed FL adopts a hub-and-spoke network topology. All clients communicate through the central server. Hence, reducing communication overheads via techniques such as data compression has been proposed to mitigate this issue. Another challenge of federated learning is unbalanc…
▽ More
Communication costs within Federated learning hinder the system scalability for reaching more data from more clients. The proposed FL adopts a hub-and-spoke network topology. All clients communicate through the central server. Hence, reducing communication overheads via techniques such as data compression has been proposed to mitigate this issue. Another challenge of federated learning is unbalanced data distribution, data on each client are not independent and identically distributed (non-IID) in a typical federated learning setting. In this paper, we proposed a new compression compensation scheme called Global Momentum Fusion (GMF) which reduces communication overheads between FL clients and the server and maintains comparable model accuracy in the presence of non-IID data. GitHub repository: https://github.com/tony92151/global-momentum-fusion-fl
△ Less
Submitted 16 November, 2022;
originally announced November 2022.