-
GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond
Authors:
Kelvin C. K. Chan,
Xiangyu Xu,
Xintao Wang,
**wei Gu,
Chen Change Loy
Abstract:
We show that pre-trained Generative Adversarial Networks (GANs) such as StyleGAN and BigGAN can be used as a latent bank to improve the performance of image super-resolution. While most existing perceptual-oriented approaches attempt to generate realistic outputs through learning with adversarial loss, our method, Generative LatEnt bANk (GLEAN), goes beyond existing practices by directly leveragin…
▽ More
We show that pre-trained Generative Adversarial Networks (GANs) such as StyleGAN and BigGAN can be used as a latent bank to improve the performance of image super-resolution. While most existing perceptual-oriented approaches attempt to generate realistic outputs through learning with adversarial loss, our method, Generative LatEnt bANk (GLEAN), goes beyond existing practices by directly leveraging rich and diverse priors encapsulated in a pre-trained GAN. But unlike prevalent GAN inversion methods that require expensive image-specific optimization at runtime, our approach only needs a single forward pass for restoration. GLEAN can be easily incorporated in a simple encoder-bank-decoder architecture with multi-resolution skip connections. Employing priors from different generative models allows GLEAN to be applied to diverse categories (\eg~human faces, cats, buildings, and cars). We further present a lightweight version of GLEAN, named LightGLEAN, which retains only the critical components in GLEAN. Notably, LightGLEAN consists of only 21% of parameters and 35% of FLOPs while achieving comparable image quality. We extend our method to different tasks including image colorization and blind image restoration, and extensive experiments show that our proposed models perform favorably in comparison to existing methods. Codes and models are available at https://github.com/open-mmlab/mmediting.
△ Less
Submitted 29 July, 2022;
originally announced July 2022.
-
Simulating the Impact of Dynamic Rerouting on Metropolitan-Scale Traffic Systems
Authors:
Cy Chan,
Anu Kuncheria,
Jane Macfarlane
Abstract:
The rapid introduction of mobile navigation aides that use real-time road network information to suggest alternate routes to drivers is making it more difficult for researchers and government transportation agencies to understand and predict the dynamics of congested transportation systems. Computer simulation is a key capability for these organizations to analyze hypothetical scenarios; however,…
▽ More
The rapid introduction of mobile navigation aides that use real-time road network information to suggest alternate routes to drivers is making it more difficult for researchers and government transportation agencies to understand and predict the dynamics of congested transportation systems. Computer simulation is a key capability for these organizations to analyze hypothetical scenarios; however, the complexity of transportation systems makes it challenging for them to simulate very large geographical regions, such as multi-city metropolitan areas. In this paper, we describe the Mobiliti traffic simulator, which includes mechanisms to capture congestion delays, timing constraints, and link storage capacity constraints. The simulator is designed to support distributed memory parallel execution and be scalable on high-performance computing platforms. We introduce a method to model dynamic rerouting behavior with the addition of vehicle controller actors and reroute request events. We demonstrate the potential of the simulator by analyzing the impact of varying the population penetration rate of dynamic rerouting on the San Francisco Bay Area road network. Using high-performance parallel computing, we can simulate a day of the San Francisco Bay Area with 19 million vehicle trips with 50 percent dynamic rerouting penetration over a road network with 0.5 million nodes and 1 million links in less than three minutes. We present an analysis of system-level impacts when changing the dynamic rerouting penetration rate and examine the varying effects on different functional classes and geographical regions. Finally, we present a validation of the simulation results compared to real world data.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
Exploring CLIP for Assessing the Look and Feel of Images
Authors:
Jianyi Wang,
Kelvin C. K. Chan,
Chen Change Loy
Abstract:
Measuring the perception of visual content is a long-standing problem in computer vision. Many mathematical models have been developed to evaluate the look or quality of an image. Despite the effectiveness of such tools in quantifying degradations such as noise and blurriness levels, such quantification is loosely coupled with human language. When it comes to more abstract perception about the fee…
▽ More
Measuring the perception of visual content is a long-standing problem in computer vision. Many mathematical models have been developed to evaluate the look or quality of an image. Despite the effectiveness of such tools in quantifying degradations such as noise and blurriness levels, such quantification is loosely coupled with human language. When it comes to more abstract perception about the feel of visual content, existing methods can only rely on supervised models that are explicitly trained with labeled data collected via laborious user study. In this paper, we go beyond the conventional paradigms by exploring the rich visual language prior encapsulated in Contrastive Language-Image Pre-training (CLIP) models for assessing both the quality perception (look) and abstract perception (feel) of images in a zero-shot manner. In particular, we discuss effective prompt designs and show an effective prompt pairing strategy to harness the prior. We also provide extensive experiments on controlled datasets and Image Quality Assessment (IQA) benchmarks. Our results show that CLIP captures meaningful priors that generalize well to different perceptual assessments. Code is avaliable at https://github.com/IceClear/CLIP-IQA.
△ Less
Submitted 23 November, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
AI Fairness: from Principles to Practice
Authors:
Arash Bateni,
Matthew C. Chan,
Ray Eitel-Porter
Abstract:
This paper summarizes and evaluates various approaches, methods, and techniques for pursuing fairness in artificial intelligence (AI) systems. It examines the merits and shortcomings of these measures and proposes practical guidelines for defining, measuring, and preventing bias in AI. In particular, it cautions against some of the simplistic, yet common, methods for evaluating bias in AI systems,…
▽ More
This paper summarizes and evaluates various approaches, methods, and techniques for pursuing fairness in artificial intelligence (AI) systems. It examines the merits and shortcomings of these measures and proposes practical guidelines for defining, measuring, and preventing bias in AI. In particular, it cautions against some of the simplistic, yet common, methods for evaluating bias in AI systems, and offers more sophisticated and effective alternatives. The paper also addresses widespread controversies and confusions in the field by providing a common language among different stakeholders of high-impact AI systems. It describes various trade-offs involving AI fairness, and provides practical recommendations for balancing them. It offers techniques for evaluating the costs and benefits of fairness targets, and defines the role of human judgment in setting these targets. This paper provides discussions and guidelines for AI practitioners, organization leaders, and policymakers, as well as various links to additional materials for a more technical audience. Numerous real-world examples are provided to clarify the concepts, challenges, and recommendations from a practical perspective.
△ Less
Submitted 20 July, 2022;
originally announced July 2022.
-
Language models show human-like content effects on reasoning tasks
Authors:
Ishita Dasgupta,
Andrew K. Lampinen,
Stephanie C. Y. Chan,
Hannah R. Sheahan,
Antonia Creswell,
Dharshan Kumaran,
James L. McClelland,
Felix Hill
Abstract:
Abstract reasoning is a key ability for an intelligent system. Large language models (LMs) achieve above-chance performance on abstract reasoning tasks, but exhibit many imperfections. However, human abstract reasoning is also imperfect. For example, human reasoning is affected by our real-world knowledge and beliefs, and shows notable "content effects"; humans reason more reliably when the semant…
▽ More
Abstract reasoning is a key ability for an intelligent system. Large language models (LMs) achieve above-chance performance on abstract reasoning tasks, but exhibit many imperfections. However, human abstract reasoning is also imperfect. For example, human reasoning is affected by our real-world knowledge and beliefs, and shows notable "content effects"; humans reason more reliably when the semantic content of a problem supports the correct logical inferences. These content-entangled reasoning patterns play a central role in debates about the fundamental nature of human intelligence. Here, we investigate whether language models $\unicode{x2014}$ whose prior expectations capture some aspects of human knowledge $\unicode{x2014}$ similarly mix content into their answers to logical problems. We explored this question across three logical reasoning tasks: natural language inference, judging the logical validity of syllogisms, and the Wason selection task. We evaluate state of the art large language models, as well as humans, and find that the language models reflect many of the same patterns observed in humans across these tasks $\unicode{x2014}$ like humans, models answer more accurately when the semantic content of a task supports the logical inferences. These parallels are reflected both in answer patterns, and in lower-level features like the relationship between model answer distributions and human response times. Our findings have implications for understanding both these cognitive effects in humans, and the factors that contribute to language model performance.
△ Less
Submitted 30 October, 2023; v1 submitted 14 July, 2022;
originally announced July 2022.
-
Millimeter light curves of Sagittarius A* observed during the 2017 Event Horizon Telescope campaign
Authors:
Maciek Wielgus,
Nicola Marchili,
Ivan Marti-Vidal,
Garrett K. Keating,
Venkatessh Ramakrishnan,
Paul Tiede,
Ed Fomalont,
Sara Issaoun,
Joey Neilsen,
Michael A. Nowak,
Lindy Blackburn,
Charles F. Gammie,
Ciriaco Goddi,
Daryl Haggard,
Daeyoung Lee,
Monika Moscibrodzka,
Alexandra J. Tetarenko,
Geoffrey C. Bower,
Chi-Kwan Chan,
Koushik Chatterjee,
Paul M. Chesler,
Jason Dexter,
Sheperd S. Doeleman,
Boris Georgiev,
Mark Gurwell
, et al. (6 additional authors not shown)
Abstract:
The Event Horizon Telescope (EHT) observed the compact radio source, Sagittarius A* (Sgr A*), in the Galactic Center on 2017 April 5-11 in the 1.3 millimeter wavelength band. At the same time, interferometric array data from the Atacama Large Millimeter/submillimeter Array and the Submillimeter Array were collected, providing Sgr A* light curves simultaneous with the EHT observations. These data s…
▽ More
The Event Horizon Telescope (EHT) observed the compact radio source, Sagittarius A* (Sgr A*), in the Galactic Center on 2017 April 5-11 in the 1.3 millimeter wavelength band. At the same time, interferometric array data from the Atacama Large Millimeter/submillimeter Array and the Submillimeter Array were collected, providing Sgr A* light curves simultaneous with the EHT observations. These data sets, complementing the EHT very-long-baseline interferometry, are characterized by a cadence and signal-to-noise ratio previously unattainable for Sgr A* at millimeter wavelengths, and they allow for the investigation of source variability on timescales as short as a minute. While most of the light curves correspond to a low variability state of Sgr A*, the April 11 observations follow an X-ray flare, and exhibit strongly enhanced variability. All of the light curves are consistent with a red noise process, with a power spectral density (PSD) slope measured to be between -2 and -3 on timescales between 1 min and several hours. Our results indicate a steepening of the PSD slope for timescales shorter than 0.3 h. The spectral energy distribution is flat at 220 GHz and there are no time-lags between the 213 and 229 GHz frequency bands, suggesting low optical depth for the event horizon scale source. We characterize Sgr A*'s variability, highlighting the different behavior observed just after the X-ray flare, and use Gaussian process modeling to extract a decorrelation timescale and a PSD slope. We also investigate the systematic calibration uncertainties by analyzing data from independent data reduction pipelines.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Electrically Conductive 2D Material Coatings for Flexible & Stretchable Electronics: A Comparative Review of Graphenes & MXenes
Authors:
Vicente Orts Mercadillo,
Kai Chio Chan,
Mario Caironi,
Athanassia Athanassiou,
Ian A. Kinloch,
Mark Bissett,
Pietro Cataldi
Abstract:
There is growing interest in transitioning electronic components and circuitry from stiff and rigid substrates to more flexible and stretchable platforms, such as thin plastics, textiles, and foams. In parallel, the push for more sustainable, biocompatible, and cost-efficient conductive inks to coat these substrates, has led to the development of formulations with novel nanomaterials. Among these,…
▽ More
There is growing interest in transitioning electronic components and circuitry from stiff and rigid substrates to more flexible and stretchable platforms, such as thin plastics, textiles, and foams. In parallel, the push for more sustainable, biocompatible, and cost-efficient conductive inks to coat these substrates, has led to the development of formulations with novel nanomaterials. Among these, 2D materials, and particularly graphenes and MXenes, have received intense research interest due to their increasingly facile and scalable production, high electrical conductivity, and compatibility with existing manufacturing techniques. They enable a range of electronic devices, including strain and pressure sensors, supercapacitors, thermoelectric generators, and heaters. These new flexible and stretchable electronic devices developed with 2D material coatings are poised to unlock exciting applications in the wearable, healthcare and Internet of Things sectors. This review has surveyed key data from more than 200 articles published over the last 6 years, to provide a quantitative analysis of recent progress in the field and shade light on future directions and prospects of this technology. We find that despite the different chemical origins of graphenes and MXenes, their shared electrical properties and 2D morphology, guarantee intriguing performance in end applications, leaving plenty of space for shared progress and advancements in the future.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
First Dark Matter Search Results from the LUX-ZEPLIN (LZ) Experiment
Authors:
J. Aalbers,
D. S. Akerib,
C. W. Akerlof,
A. K. Al Musalhi,
F. Alder,
A. Alqahtani,
S. K. Alsum,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
S. Azadi,
A. J. Bailey,
A. Baker,
J. Balajthy,
S. Balashov,
J. Bang,
J. W. Bargemann,
M. J. Barry,
J. Barthel,
D. Bauer,
A. Baxter
, et al. (322 additional authors not shown)
Abstract:
The LUX-ZEPLIN experiment is a dark matter detector centered on a dual-phase xenon time projection chamber operating at the Sanford Underground Research Facility in Lead, South Dakota, USA. This Letter reports results from LUX-ZEPLIN's first search for weakly interacting massive particles (WIMPs) with an exposure of 60~live days using a fiducial mass of 5.5 t. A profile-likelihood ratio analysis s…
▽ More
The LUX-ZEPLIN experiment is a dark matter detector centered on a dual-phase xenon time projection chamber operating at the Sanford Underground Research Facility in Lead, South Dakota, USA. This Letter reports results from LUX-ZEPLIN's first search for weakly interacting massive particles (WIMPs) with an exposure of 60~live days using a fiducial mass of 5.5 t. A profile-likelihood ratio analysis shows the data to be consistent with a background-only hypothesis, setting new limits on spin-independent WIMP-nucleon, spin-dependent WIMP-neutron, and spin-dependent WIMP-proton cross sections for WIMP masses above 9 GeV/c$^2$. The most stringent limit is set for spin-independent scattering at 36 GeV/c$^2$, rejecting cross sections above 9.2$\times 10^{-48}$ cm$^2$ at the 90% confidence level.
△ Less
Submitted 2 August, 2023; v1 submitted 8 July, 2022;
originally announced July 2022.
-
Nonparametric Estimation of the Potential Impact Fraction and Population Attributable Fraction with Individual-Level and Aggregated Data
Authors:
Colleen E. Chan,
Rodrigo Zepeda-Tello,
Dalia Camacho-García-Formentí,
Frederick Cudhea,
Rafael Meza,
Eliane Rodrigues,
Donna Spiegelman,
Tonatiuh Barrientos-Gutierrez,
Xin Zhou
Abstract:
The estimation of the potential impact fraction (including the population attributable fraction) with continuous exposure data frequently relies on strong distributional assumptions. However, these assumptions are often violated if the underlying exposure distribution is unknown or if the same distribution is assumed across time or space. Nonparametric methods to estimate the potential impact frac…
▽ More
The estimation of the potential impact fraction (including the population attributable fraction) with continuous exposure data frequently relies on strong distributional assumptions. However, these assumptions are often violated if the underlying exposure distribution is unknown or if the same distribution is assumed across time or space. Nonparametric methods to estimate the potential impact fraction are available for cohort data, but no alternatives exist for cross-sectional data. In this article, we discuss the impact of distributional assumptions in the estimation of the population impact fraction, showing that under an infinite set of possibilities, distributional violations lead to biased estimates. We propose nonparametric methods to estimate the potential impact fraction for aggregated (mean and standard deviation) or individual data (e.g. observations from a cross-sectional population survey), and develop simulation scenarios to compare their performance against standard parametric procedures. We illustrate our methodology on an application of sugar-sweetened beverage consumption on incidence of type 2 diabetes. We also present an R package pifpaf to implement these methods.
△ Less
Submitted 24 January, 2023; v1 submitted 7 July, 2022;
originally announced July 2022.
-
Systematic Investigation of Millimeter-Wave Optic Modulation Performance in Thin-Film Lithium Niobate
Authors:
Yiwen Zhang,
Linbo Shao,
**gwei Yang,
Zhaoxi Chen,
Ke Zhang,
Kam-Man Shum,
Di Zhu,
Chi Hou Chan,
Marko Lončar,
Cheng Wang
Abstract:
Millimeter-wave (mmWave) band (30 - 300 GHz) is an emerging spectrum range for wireless communication, short-range radar and sensor applications. mmWave-optic modulators that could efficiently convert mmWave signals into optical domain are crucial components for long-haul transmission of mmWave signals through optical networks. At these ultrahigh frequencies, however, the modulation performances a…
▽ More
Millimeter-wave (mmWave) band (30 - 300 GHz) is an emerging spectrum range for wireless communication, short-range radar and sensor applications. mmWave-optic modulators that could efficiently convert mmWave signals into optical domain are crucial components for long-haul transmission of mmWave signals through optical networks. At these ultrahigh frequencies, however, the modulation performances are highly sensitive to the transmission line loss as well as the velocity- and impedance-matching conditions, while precise measurements and modeling of these parameters are often non-trivial. Here we present a systematic investigation of the mmWave-optic modulation performances of thin-film lithium niobate modulators through theoretical modeling, electrical verifications and electro-optic measurements at frequencies up to 325 GHz. Based on our experimentally verified model, we demonstrate thin-film lithium niobate mmWave-optic modulators with a measured 3-dB electro-optic bandwidth of 170 GHz and a 6-dB bandwidth of 295 GHz. The device also shows a low RF half-wave voltage of 7.3 V measured at an ultrahigh modulation frequency of 250 GHz. This work provides a comprehensive guideline for the design and characterization of mmWave-optic modulators and paves the way toward future integrated mmWave photonic systems for beyond-5G communication and radar applications.
△ Less
Submitted 5 July, 2022; v1 submitted 28 June, 2022;
originally announced June 2022.
-
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Authors:
Shangchen Zhou,
Kelvin C. K. Chan,
Chongyi Li,
Chen Change Loy
Abstract:
Blind face restoration is a highly ill-posed problem that often requires auxiliary guidance to 1) improve the map** from degraded inputs to desired outputs, or 2) complement high-quality details lost in the inputs. In this paper, we demonstrate that a learned discrete codebook prior in a small proxy space largely reduces the uncertainty and ambiguity of restoration map** by casting blind face…
▽ More
Blind face restoration is a highly ill-posed problem that often requires auxiliary guidance to 1) improve the map** from degraded inputs to desired outputs, or 2) complement high-quality details lost in the inputs. In this paper, we demonstrate that a learned discrete codebook prior in a small proxy space largely reduces the uncertainty and ambiguity of restoration map** by casting blind face restoration as a code prediction task, while providing rich visual atoms for generating high-quality faces. Under this paradigm, we propose a Transformer-based prediction network, named CodeFormer, to model the global composition and context of the low-quality faces for code prediction, enabling the discovery of natural faces that closely approximate the target faces even when the inputs are severely degraded. To enhance the adaptiveness for different degradation, we also propose a controllable feature transformation module that allows a flexible trade-off between fidelity and quality. Thanks to the expressive codebook prior and global modeling, CodeFormer outperforms the state of the arts in both quality and fidelity, showing superior robustness to degradation. Extensive experimental results on synthetic and real-world datasets verify the effectiveness of our method.
△ Less
Submitted 31 October, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Fast and Accurate Variational Inference for Large Bayesian VARs with Stochastic Volatility
Authors:
Joshua C. C. Chan,
Xuewen Yu
Abstract:
We propose a new variational approximation of the joint posterior distribution of the log-volatility in the context of large Bayesian VARs. In contrast to existing approaches that are based on local approximations, the new proposal provides a global approximation that takes into account the entire support of the joint distribution. In a Monte Carlo study we show that the new global approximation i…
▽ More
We propose a new variational approximation of the joint posterior distribution of the log-volatility in the context of large Bayesian VARs. In contrast to existing approaches that are based on local approximations, the new proposal provides a global approximation that takes into account the entire support of the joint distribution. In a Monte Carlo study we show that the new global approximation is over an order of magnitude more accurate than existing alternatives. We illustrate the proposed methodology with an application of a 96-variable VAR with stochastic volatility to measure global bank network connectedness.
△ Less
Submitted 16 June, 2022;
originally announced June 2022.
-
Holographic Amplitude-Modulated (AM) Leaky-Wave Antennas for Near-Field and Far-Field Applications
Authors:
Geng-Bo Wu,
Ka Fai Chan,
Chi Hou Chan
Abstract:
Amplitude-modulated (AM) leaky-wave antenna (LWA), a concept following amplitude modulation technique from classical communications theory, is a promising structure that enables transforming traveling wave into the radiating wave. In this paper, we provide a different perspective based on the classical holographic theory to gain insight into the physical mechanism of AM LWA and design novel LWAs.…
▽ More
Amplitude-modulated (AM) leaky-wave antenna (LWA), a concept following amplitude modulation technique from classical communications theory, is a promising structure that enables transforming traveling wave into the radiating wave. In this paper, we provide a different perspective based on the classical holographic theory to gain insight into the physical mechanism of AM LWA and design novel LWAs. In analogy to the classical optical Gabor hologram, we demonstrate that only the amplitude variation of the traveling wave is needed to record both the amplitude and phase information of the object wave. The consistency between the holography theory and previous spatial spectrum approach for explaining AM LWA operating mechanism is also demonstrated. For validation purpose, two novel millimeter-wave (mmW) holographic AM LWAs based on the substrate integrated inset dielectric waveguide (IDW) are designed. The first one is for far-field high-gain applications while the second is for near-field focusing (NFF) applications. Both simulated and measured results demonstrate the effectiveness of the AM holography theory for AM LWAs analysis and design.
△ Less
Submitted 16 May, 2022;
originally announced May 2022.
-
Mass Testing and Characterization of 20-inch PMTs for JUNO
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
Joao Pedro Athayde Marcondes de Andre,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Antonio Bergnoli
, et al. (541 additional authors not shown)
Abstract:
Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program whic…
▽ More
Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).
△ Less
Submitted 17 September, 2022; v1 submitted 17 May, 2022;
originally announced May 2022.
-
High-resolution ALMA study of CO (2-1) line and dust continuum emissions in cluster galaxies at z = 1.46
Authors:
Ryota Ikeda,
Ken-ichi Tadaki,
Daisuke Iono,
Tadayuki Kodama,
Jeffrey C. C. Chan,
Bunyo Hatsukade,
Masao Hayashi,
Takuma Izumi,
Kotaro Kohno,
Yusei Koyama,
Rhythm Shimakawa,
Tomoko L. Suzuki,
Yoichi Tamura,
Ichi Tanaka
Abstract:
We present new Atacama Large Millimeter/submillimeter Array (ALMA) results obtained from spatially resolved CO $J$=2-1 line ($0.4''$ resolution) and 870 $μ$m continuum ($0.2''$ resolution) observations of cluster galaxies in XMMXCS J2215.9-1738 at $z=1.46$. Our sample comprises 17 galaxies within $\sim0.5$ Mpc ($0.6R_{200}$) of the cluster center, all of which have previously been detected in the…
▽ More
We present new Atacama Large Millimeter/submillimeter Array (ALMA) results obtained from spatially resolved CO $J$=2-1 line ($0.4''$ resolution) and 870 $μ$m continuum ($0.2''$ resolution) observations of cluster galaxies in XMMXCS J2215.9-1738 at $z=1.46$. Our sample comprises 17 galaxies within $\sim0.5$ Mpc ($0.6R_{200}$) of the cluster center, all of which have previously been detected in the CO $J$=2-1 line at a lower resolution. The effective radii of both the CO $J$=2-1 line and 870 $μ$m dust continuum emissions are robustly measured for nine galaxies by modeling the visibilities. We find that the CO $J$=2-1 line emission in all of the nine galaxies is more extended than the dust continuum emission by a factor of $2.8\pm1.4$. We investigate the spatially resolved Kennicutt-Schmidt (KS) relation in two regions within the interstellar medium of the galaxies. The relation for our sample reveals that the central region ($0<r<R_{e,{\rm 870μm}}$) of galaxies tends to have a shorter gas depletion timescale, i.e., a higher star formation efficiency, compared to the extended region ($R_{e,{\rm 870μm}}<r<R_{e,{\rm CO}}$). Overall, our result suggests that star formation activities are concentrated inside the extended gas reservoir, possibly resulting in the formation of a bulge structure. We find consistency between the ALMA 870 $μ$m radii of star-forming members and the Hubble Space Telescope/1.6 $μ$m radii of passive members in a mass-size distribution, which suggests a transition from star-forming to passive members within $\sim0.5$ Gyr. In addition, no clear differences in the KS relation nor in the sizes are found between galaxies with and without a close companion.
△ Less
Submitted 23 June, 2022; v1 submitted 11 May, 2022;
originally announced May 2022.
-
Data Distributional Properties Drive Emergent In-Context Learning in Transformers
Authors:
Stephanie C. Y. Chan,
Adam Santoro,
Andrew K. Lampinen,
Jane X. Wang,
Aaditya Singh,
Pierre H. Richemond,
Jay McClelland,
Felix Hill
Abstract:
Large transformer-based models are able to perform in-context few-shot learning, without being explicitly trained for it. This observation raises the question: what aspects of the training regime lead to this emergent behavior? Here, we show that this behavior is driven by the distributions of the training data itself. In-context learning emerges when the training data exhibits particular distribu…
▽ More
Large transformer-based models are able to perform in-context few-shot learning, without being explicitly trained for it. This observation raises the question: what aspects of the training regime lead to this emergent behavior? Here, we show that this behavior is driven by the distributions of the training data itself. In-context learning emerges when the training data exhibits particular distributional properties such as burstiness (items appear in clusters rather than being uniformly distributed over time) and having large numbers of rarely occurring classes. In-context learning also emerges more strongly when item meanings or interpretations are dynamic rather than fixed. These properties are exemplified by natural language, but are also inherent to naturalistic data in a wide range of other domains. They also depart significantly from the uniform, i.i.d. training distributions typically used for standard supervised learning. In our initial experiments, we found that in-context learning traded off against more conventional weight-based learning, and models were unable to achieve both simultaneously. However, our later experiments uncovered that the two modes of learning could co-exist in a single model when it was trained on data following a skewed Zipfian distribution -- another common property of naturalistic data, including language. In further experiments, we found that naturalistic data distributions were only able to elicit in-context learning in transformers, and not in recurrent models. In sum, our findings indicate how the transformer architecture works together with particular properties of the training data to drive the intriguing emergent in-context learning behaviour of large language models, and how future work might encourage both in-context and in-weights learning in domains beyond language.
△ Less
Submitted 17 November, 2022; v1 submitted 22 April, 2022;
originally announced May 2022.
-
Sub-percent Precision Measurement of Neutrino Oscillation Parameters with JUNO
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato
, et al. (581 additional authors not shown)
Abstract:
JUNO is a multi-purpose neutrino observatory under construction in the south of China. This publication presents new sensitivity estimates for the measurement of the $Δm^2_{31}$, $Δm^2_{21}$, $\sin^2 θ_{12}$, and $\sin^2 θ_{13}$ oscillation parameters using reactor antineutrinos, which is one of the primary physics goals of the experiment. The sensitivities are obtained using the best knowledge av…
▽ More
JUNO is a multi-purpose neutrino observatory under construction in the south of China. This publication presents new sensitivity estimates for the measurement of the $Δm^2_{31}$, $Δm^2_{21}$, $\sin^2 θ_{12}$, and $\sin^2 θ_{13}$ oscillation parameters using reactor antineutrinos, which is one of the primary physics goals of the experiment. The sensitivities are obtained using the best knowledge available to date on the location and overburden of the experimental site, the nuclear reactors in the surrounding area and beyond, the detector response uncertainties, and the reactor antineutrino spectral shape constraints expected from the TAO satellite detector. It is found that the $Δm^2_{31}$, $Δm^2_{21}$, and $\sin^2 θ_{12}$ oscillation parameters will be determined to better than 0.5% precision in six years of data collection, which represents approximately an order of magnitude improvement over existing constraints.
△ Less
Submitted 27 April, 2022;
originally announced April 2022.
-
Power Bundle Adjustment for Large-Scale 3D Reconstruction
Authors:
Simon Weber,
Nikolaus Demmel,
Tin Chon Chan,
Daniel Cremers
Abstract:
We introduce Power Bundle Adjustment as an expansion type algorithm for solving large-scale bundle adjustment problems. It is based on the power series expansion of the inverse Schur complement and constitutes a new family of solvers that we call inverse expansion methods. We theoretically justify the use of power series and we prove the convergence of our approach. Using the real-world BAL datase…
▽ More
We introduce Power Bundle Adjustment as an expansion type algorithm for solving large-scale bundle adjustment problems. It is based on the power series expansion of the inverse Schur complement and constitutes a new family of solvers that we call inverse expansion methods. We theoretically justify the use of power series and we prove the convergence of our approach. Using the real-world BAL dataset we show that the proposed solver challenges the state-of-the-art iterative methods and significantly accelerates the solution of the normal equation, even for reaching a very high accuracy. This easy-to-implement solver can also complement a recently presented distributed bundle adjustment framework. We demonstrate that employing the proposed Power Bundle Adjustment as a sub-problem solver significantly improves speed and accuracy of the distributed optimization.
△ Less
Submitted 17 April, 2023; v1 submitted 27 April, 2022;
originally announced April 2022.
-
Symmetry-protected topological exceptional chains in non-Hermitian crystals
Authors:
Ruo-Yang Zhang,
Xiaohan Cui,
Wen-Jie Chen,
Zhao-Qing Zhang,
C. T. Chan
Abstract:
In non-Hermitian systems, the defective band degeneracies, so-called exceptional points (EPs), can form robust exceptional lines (ELs) in 3D momentum space in the absence of any symmetries. Here, we show that a natural orientation can be assigned to every EL according to the eigenenergy braiding around it, and prove the source-free principle of ELs as a corollary of the generalized Fermion doublin…
▽ More
In non-Hermitian systems, the defective band degeneracies, so-called exceptional points (EPs), can form robust exceptional lines (ELs) in 3D momentum space in the absence of any symmetries. Here, we show that a natural orientation can be assigned to every EL according to the eigenenergy braiding around it, and prove the source-free principle of ELs as a corollary of the generalized Fermion doubling theorem for EPs on an arbitrary closed oriented surface, which indicates that if several ELs flow into a junction, the same number of outflow ELs from the junction must exist. Based on this principle, we discover three different mechanisms that can stabilize the junction of ELs and therefore guarantee the formation of various types of exceptional chains (ECs) under the protection of mirror, mirror-adjoint, or ${C}_2\mathcal{T}$ symmetries. Furthermore, we analyze the thresholdless perturbations to a Hermitian nodal line and map out all possible EC configurations that can be evolved. By strategically designing the structure and materials, we further exhibit that these exotic ECs can be readily observed in non-Hermitian photonic crystals. Our results directly manifest the combined effect of spatial symmetry and topology on the non-Hermitian singularities and pave the way for manipulating the morphology of ELs in non-Hermitian crystalline systems.
△ Less
Submitted 11 December, 2022; v1 submitted 17 April, 2022;
originally announced April 2022.
-
A Comparative Study of Faithfulness Metrics for Model Interpretability Methods
Authors:
Chun Sik Chan,
Huanqi Kong,
Guanqing Liang
Abstract:
Interpretation methods to reveal the internal reasoning processes behind machine learning models have attracted increasing attention in recent years. To quantify the extent to which the identified interpretations truly reflect the intrinsic decision-making mechanisms, various faithfulness evaluation metrics have been proposed. However, we find that different faithfulness metrics show conflicting p…
▽ More
Interpretation methods to reveal the internal reasoning processes behind machine learning models have attracted increasing attention in recent years. To quantify the extent to which the identified interpretations truly reflect the intrinsic decision-making mechanisms, various faithfulness evaluation metrics have been proposed. However, we find that different faithfulness metrics show conflicting preferences when comparing different interpretations. Motivated by this observation, we aim to conduct a comprehensive and comparative study of the widely adopted faithfulness metrics. In particular, we introduce two assessment dimensions, namely diagnosticity and time complexity. Diagnosticity refers to the degree to which the faithfulness metric favours relatively faithful interpretations over randomly generated ones, and time complexity is measured by the average number of model forward passes. According to the experimental results, we find that sufficiency and comprehensiveness metrics have higher diagnosticity and lower time complexity than the other faithfulness metric
△ Less
Submitted 12 April, 2022;
originally announced April 2022.
-
On the Generalization of BasicVSR++ to Video Deblurring and Denoising
Authors:
Kelvin C. K. Chan,
Shangchen Zhou,
Xiangyu Xu,
Chen Change Loy
Abstract:
The exploitation of long-term information has been a long-standing problem in video restoration. The recent BasicVSR and BasicVSR++ have shown remarkable performance in video super-resolution through long-term propagation and effective alignment. Their success has led to a question of whether they can be transferred to different video restoration tasks. In this work, we extend BasicVSR++ to a gene…
▽ More
The exploitation of long-term information has been a long-standing problem in video restoration. The recent BasicVSR and BasicVSR++ have shown remarkable performance in video super-resolution through long-term propagation and effective alignment. Their success has led to a question of whether they can be transferred to different video restoration tasks. In this work, we extend BasicVSR++ to a generic framework for video restoration tasks. In tasks where inputs and outputs possess identical spatial size, the input resolution is reduced by strided convolutions to maintain efficiency. With only minimal changes from BasicVSR++, the proposed framework achieves compelling performance with great efficiency in various video restoration tasks including video deblurring and denoising. Notably, BasicVSR++ achieves comparable performance to Transformer-based approaches with up to 79% of parameter reduction and 44x speedup. The promising results demonstrate the importance of propagation and alignment in video restoration tasks beyond just video super-resolution. Code and models are available at https://github.com/ckkelvinchan/BasicVSR_PlusPlus.
△ Less
Submitted 18 June, 2022; v1 submitted 11 April, 2022;
originally announced April 2022.
-
Semantic Exploration from Language Abstractions and Pretrained Representations
Authors:
Allison C. Tam,
Neil C. Rabinowitz,
Andrew K. Lampinen,
Nicholas A. Roy,
Stephanie C. Y. Chan,
DJ Strouse,
Jane X. Wang,
Andrea Banino,
Felix Hill
Abstract:
Effective exploration is a challenge in reinforcement learning (RL). Novelty-based exploration methods can suffer in high-dimensional state spaces, such as continuous partially-observable 3D environments. We address this challenge by defining novelty using semantically meaningful state abstractions, which can be found in learned representations shaped by natural language. In particular, we evaluat…
▽ More
Effective exploration is a challenge in reinforcement learning (RL). Novelty-based exploration methods can suffer in high-dimensional state spaces, such as continuous partially-observable 3D environments. We address this challenge by defining novelty using semantically meaningful state abstractions, which can be found in learned representations shaped by natural language. In particular, we evaluate vision-language representations, pretrained on natural image captioning datasets. We show that these pretrained representations drive meaningful, task-relevant exploration and improve performance on 3D simulated environments. We also characterize why and how language provides useful abstractions for exploration by considering the impacts of using representations from a pretrained model, a language oracle, and several ablations. We demonstrate the benefits of our approach in two very different task domains -- one that stresses the identification and manipulation of everyday objects, and one that requires navigational exploration in an expansive world. Our results suggest that using language-shaped representations could improve exploration for various algorithms and agents in challenging environments.
△ Less
Submitted 26 April, 2023; v1 submitted 8 April, 2022;
originally announced April 2022.
-
Search for continuous gravitational wave emission from the Milky Way center in O3 LIGO--Virgo data
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Allocca,
P. A. Altin
, et al. (1645 additional authors not shown)
Abstract:
We present a directed search for continuous gravitational wave (CW) signals emitted by spinning neutron stars located in the inner parsecs of the Galactic Center (GC). Compelling evidence for the presence of a numerous population of neutron stars has been reported in the literature, turning this region into a very interesting place to look for CWs. In this search, data from the full O3 LIGO--Virgo…
▽ More
We present a directed search for continuous gravitational wave (CW) signals emitted by spinning neutron stars located in the inner parsecs of the Galactic Center (GC). Compelling evidence for the presence of a numerous population of neutron stars has been reported in the literature, turning this region into a very interesting place to look for CWs. In this search, data from the full O3 LIGO--Virgo run in the detector frequency band $[10,2000]\rm~Hz$ have been used. No significant detection was found and 95$\%$ confidence level upper limits on the signal strain amplitude were computed, over the full search band, with the deepest limit of about $7.6\times 10^{-26}$ at $\simeq 142\rm~Hz$. These results are significantly more constraining than those reported in previous searches. We use these limits to put constraints on the fiducial neutron star ellipticity and r-mode amplitude. These limits can be also translated into constraints in the black hole mass -- boson mass plane for a hypothetical population of boson clouds around spinning black holes located in the GC.
△ Less
Submitted 9 April, 2022;
originally announced April 2022.
-
Revealing directed effective connectivity of cortical neuronal networks from measurements
Authors:
Chumin Sun,
K. C. Lin,
C. Y. Yeung,
Emily S. C. Ching,
Yu-Ting Huang,
Pik-Yin Lai,
C. K. Chan
Abstract:
In the study of biological networks, one of the major challenges is to understand the relationships between network structure and dynamics. In this paper, we model in vitro cortical neuronal cultures as stochastic dynamical systems and apply a method that reconstructs directed networks from dynamics [Ching and Tam, Phys. Rev. E 95, 010301(R), 2017] to reveal directed effective connectivity, namely…
▽ More
In the study of biological networks, one of the major challenges is to understand the relationships between network structure and dynamics. In this paper, we model in vitro cortical neuronal cultures as stochastic dynamical systems and apply a method that reconstructs directed networks from dynamics [Ching and Tam, Phys. Rev. E 95, 010301(R), 2017] to reveal directed effective connectivity, namely the directed links and synaptic weights, of the neuronal cultures from voltage measurements recorded by a multielectrode array. The effective connectivity so obtained reproduces several features of cortical regions in rats and monkeys and has similar network properties as the synaptic network of the nematode C. elegans, the only organism whose entire nervous system has been mapped out as of today. The distribution of the incoming degree is bimodal and the distributions of the average incoming and outgoing synaptic strength are non-Gaussian with long tails. The effective connectivity captures different information from the commonly studied functional connectivity, estimated using statistical correlation between spiking activities. The average synaptic strengths of excitatory incoming and outgoing links are found to increase with the spiking activity in the estimated effective connectivity but not in the functional connectivity estimated using the same sets of voltage measurements. These results thus demonstrate that the reconstructed effective connectivity can capture the general properties of synaptic connections and better reveal relationships between network structure and dynamics.
△ Less
Submitted 6 April, 2022;
originally announced April 2022.
-
Solid-state heteronuclear multiple-quantum spectroscopy under a magic-angle spinning frequency of 150 kHz
Authors:
Eric Chung-Yueh Yuan,
Po-Wen Chen,
Shing-Jong Huang,
Mai-Liis Org,
Ago Samoson,
Jerry Chun Chung Chan
Abstract:
We hereby demonstrate that 1H detected 15N-1H heteronuclear multiple-quantum spectroscopy can be carried out at a magic angle spinning frequency of 150 kHz. While the 15N-1H multiple-quantum coherences can be directly excited from the dipolar order created by the method of adiabatic demagnetization in the rotating frame, it is technically more advantageous to acquire the chemical shift evolution o…
▽ More
We hereby demonstrate that 1H detected 15N-1H heteronuclear multiple-quantum spectroscopy can be carried out at a magic angle spinning frequency of 150 kHz. While the 15N-1H multiple-quantum coherences can be directly excited from the dipolar order created by the method of adiabatic demagnetization in the rotating frame, it is technically more advantageous to acquire the chemical shift evolution of the heteronuclear multiple-quantum coherence by two separate chemical shift evolution periods for 1H and 15N. We also show that the heteronuclear multiple-quantum correlation spectrum can be obtained by shearing the corresponding heteronuclear single-quantum correlation spectrum.
△ Less
Submitted 6 April, 2022;
originally announced April 2022.
-
Can language models learn from explanations in context?
Authors:
Andrew K. Lampinen,
Ishita Dasgupta,
Stephanie C. Y. Chan,
Kory Matthewson,
Michael Henry Tessler,
Antonia Creswell,
James L. McClelland,
Jane X. Wang,
Felix Hill
Abstract:
Language Models (LMs) can perform new tasks by adapting to a few in-context examples. For humans, explanations that connect examples to task principles can improve learning. We therefore investigate whether explanations of few-shot examples can help LMs. We annotate questions from 40 challenging tasks with answer explanations, and various matched control explanations. We evaluate how different typ…
▽ More
Language Models (LMs) can perform new tasks by adapting to a few in-context examples. For humans, explanations that connect examples to task principles can improve learning. We therefore investigate whether explanations of few-shot examples can help LMs. We annotate questions from 40 challenging tasks with answer explanations, and various matched control explanations. We evaluate how different types of explanations, instructions, and controls affect zero- and few-shot performance. We analyze these results using statistical multilevel modeling techniques that account for the nested dependencies among conditions, tasks, prompts, and models. We find that explanations can improve performance -- even without tuning. Furthermore, explanations hand-tuned for performance on a small validation set offer substantially larger benefits, and building a prompt by selecting examples and explanations together substantially improves performance over selecting examples alone. Finally, even untuned explanations outperform carefully matched controls, suggesting that the benefits are due to the link between an example and its explanation, rather than lower-level features. However, only large models benefit. In summary, explanations can support the in-context learning of large LMs on challenging tasks.
△ Less
Submitted 10 October, 2022; v1 submitted 5 April, 2022;
originally announced April 2022.
-
Equity, diversity, and inclusion in sports analytics
Authors:
Craig Fernandes,
Jason D. Vescovi,
Richard Norman,
Cheri L. Bradish,
Nathan Taback,
Timothy C. Y. Chan
Abstract:
This paper presents a landmark study of equity, diversity and inclusion (EDI) in the field of sports analytics. We developed a survey that examined personal and job-related demographics, as well as individual perceptions and experiences about EDI in the workplace. We sent the survey to individuals in the five major North American professional leagues, representatives from the Olympic and Paralympi…
▽ More
This paper presents a landmark study of equity, diversity and inclusion (EDI) in the field of sports analytics. We developed a survey that examined personal and job-related demographics, as well as individual perceptions and experiences about EDI in the workplace. We sent the survey to individuals in the five major North American professional leagues, representatives from the Olympic and Paralympic Committees in Canada and the U.S., the NCAA Division I programs, companies in sports tech/analytics, and university research groups. Our findings indicate the presence of a clear dominant group in sports analytics identifying as: young (72.0%), White (69.5%), heterosexual (89.7%) and male (82.0%). Within professional sports, males in management positions earned roughly 30,000 USD (27%) more on average compared to females. A smaller but equally alarming pay gap of 17,000 USD (14%) was found between White and non-White management personnel. Of concern, females were nearly five times as likely to experience discrimination and twice as likely to have considered leaving their job due to isolation or feeling unwelcome. While they had similar levels of agreement regarding fair processes for rewards and compensation, females "strongly agreed" less often than males regarding equitable support, equitable workload, having a voice, and being taken seriously. Over one third (36.3%) of females indicated that they "strongly agreed" that they must work harder than others to be valued equally, compared to 9.8% of males. We conclude the paper with concrete recommendations that could be considered to create a more equitable, diverse and inclusive environment for individuals working within the sports analytics sector.
△ Less
Submitted 14 June, 2022; v1 submitted 2 April, 2022;
originally announced April 2022.
-
Extremely Low-light Image Enhancement with Scene Text Restoration
Authors:
Pohao Hsu,
Che-Tsung Lin,
Chun Chet Ng,
Jie-Long Kew,
Mei Yih Tan,
Shang-Hong Lai,
Chee Seng Chan,
Christopher Zach
Abstract:
Deep learning-based methods have made impressive progress in enhancing extremely low-light images - the image quality of the reconstructed images has generally improved. However, we found out that most of these methods could not sufficiently recover the image details, for instance, the texts in the scene. In this paper, a novel image enhancement framework is proposed to precisely restore the scene…
▽ More
Deep learning-based methods have made impressive progress in enhancing extremely low-light images - the image quality of the reconstructed images has generally improved. However, we found out that most of these methods could not sufficiently recover the image details, for instance, the texts in the scene. In this paper, a novel image enhancement framework is proposed to precisely restore the scene texts, as well as the overall quality of the image simultaneously under extremely low-light images conditions. Mainly, we employed a self-regularised attention map, an edge map, and a novel text detection loss. In addition, leveraging synthetic low-light images is beneficial for image enhancement on the genuine ones in terms of text detection. The quantitative and qualitative experimental results have shown that the proposed model outperforms state-of-the-art methods in image restoration, text detection, and text spotting on See In the Dark and ICDAR15 datasets.
△ Less
Submitted 1 April, 2022;
originally announced April 2022.
-
The restriction problem on the ellipsoid
Authors:
Chi Hin Chan,
Magdalena Czubak,
Tsuyoshi Yoneda
Abstract:
Following a restriction argument in the Euclidean space, we derive a geometric invariant formula for a possible viscosity operator for an incompressible fluid flow on an ellipsoid embedded in $\mathbb R^3$. We also give an asymptotic expansion of the formula in terms of the eccentricity associated with the ellipsoid.
Following a restriction argument in the Euclidean space, we derive a geometric invariant formula for a possible viscosity operator for an incompressible fluid flow on an ellipsoid embedded in $\mathbb R^3$. We also give an asymptotic expansion of the formula in terms of the eccentricity associated with the ellipsoid.
△ Less
Submitted 30 March, 2022;
originally announced March 2022.
-
SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder Bottlenecks
Authors:
Chak Ho Chan,
Kaizhi Qian,
Yang Zhang,
Mark Hasegawa-Johnson
Abstract:
SpeechSplit can perform aspect-specific voice conversion by disentangling speech into content, rhythm, pitch, and timbre using multiple autoencoders in an unsupervised manner. However, SpeechSplit requires careful tuning of the autoencoder bottlenecks, which can be time-consuming and less robust. This paper proposes SpeechSplit 2.0, which constrains the information flow of the speech component to…
▽ More
SpeechSplit can perform aspect-specific voice conversion by disentangling speech into content, rhythm, pitch, and timbre using multiple autoencoders in an unsupervised manner. However, SpeechSplit requires careful tuning of the autoencoder bottlenecks, which can be time-consuming and less robust. This paper proposes SpeechSplit 2.0, which constrains the information flow of the speech component to be disentangled on the autoencoder input using efficient signal processing methods instead of bottleneck tuning. Evaluation results show that SpeechSplit 2.0 achieves comparable performance to SpeechSplit in speech disentanglement and superior robustness to the bottleneck size variations. Our code is available at https://github.com/biggytruck/SpeechSplit2.
△ Less
Submitted 26 March, 2022;
originally announced March 2022.
-
Learning to generate line drawings that convey geometry and semantics
Authors:
Caroline Chan,
Fredo Durand,
Phillip Isola
Abstract:
This paper presents an unpaired method for creating line drawings from photographs. Current methods often rely on high quality paired datasets to generate line drawings. However, these datasets often have limitations due to the subjects of the drawings belonging to a specific domain, or in the amount of data collected. Although recent work in unsupervised image-to-image translation has shown much…
▽ More
This paper presents an unpaired method for creating line drawings from photographs. Current methods often rely on high quality paired datasets to generate line drawings. However, these datasets often have limitations due to the subjects of the drawings belonging to a specific domain, or in the amount of data collected. Although recent work in unsupervised image-to-image translation has shown much progress, the latest methods still struggle to generate compelling line drawings. We observe that line drawings are encodings of scene information and seek to convey 3D shape and semantic meaning. We build these observations into a set of objectives and train an image translation to map photographs into line drawings. We introduce a geometry loss which predicts depth information from the image features of a line drawing, and a semantic loss which matches the CLIP features of a line drawing with its corresponding photograph. Our approach outperforms state-of-the-art unpaired image translation and line drawing generation methods on creating line drawings from arbitrary photographs. For code and demo visit our webpage carolineec.github.io/informative_drawings
△ Less
Submitted 28 March, 2022; v1 submitted 23 March, 2022;
originally announced March 2022.
-
Search for Gravitational Waves Associated with Fast Radio Bursts Detected by CHIME/FRB During the LIGO--Virgo Observing Run O3a
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
the CHIME/FRB Collaboration,
:,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
N. Adhikari,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
A. Allocca
, et al. (1633 additional authors not shown)
Abstract:
We search for gravitational-wave transients associated with fast radio bursts (FRBs) detected by the Canadian Hydrogen Intensity Map** Experiment Fast Radio Burst Project (CHIME/FRB), during the first part of the third observing run of Advanced LIGO and Advanced Virgo (1 April 2019 15:00 UTC-1 Oct 2019 15:00 UTC). Triggers from 22 FRBs were analyzed with a search that targets compact binary coal…
▽ More
We search for gravitational-wave transients associated with fast radio bursts (FRBs) detected by the Canadian Hydrogen Intensity Map** Experiment Fast Radio Burst Project (CHIME/FRB), during the first part of the third observing run of Advanced LIGO and Advanced Virgo (1 April 2019 15:00 UTC-1 Oct 2019 15:00 UTC). Triggers from 22 FRBs were analyzed with a search that targets compact binary coalescences with at least one neutron star component. A targeted search for generic gravitational-wave transients was conducted on 40 FRBs. We find no significant evidence for a gravitational-wave association in either search. Given the large uncertainties in the distances of the FRBs inferred from the dispersion measures in our sample, however, this does not conclusively exclude any progenitor models that include emission of a gravitational wave of the types searched for from any of these FRB events. We report $90\%$ confidence lower bounds on the distance to each FRB for a range of gravitational-wave progenitor models. By combining the inferred maximum distance information for each FRB with the sensitivity of the gravitational-wave searches, we set upper limits on the energy emitted through gravitational waves for a range of emission scenarios. We find values of order $10^{51}$-$10^{57}$ erg for a range of different emission models with central gravitational wave frequencies in the range 70-3560 Hz. Finally, we also found no significant coincident detection of gravitational waves with the repeater, FRB 20200120E, which is the closest known extragalactic FRB.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Forecast of Neutrino Cosmology from the CSST Photometric Galaxy Clustering and Cosmic Shear Surveys
Authors:
Hengjie Lin,
Yan Gong,
Xuelei Chen,
Kwan Chuen Chan,
Zuhui Fan,
Hu Zhan
Abstract:
China Space Station Telescope (CSST) is a forthcoming powerful Stage IV space-based optical survey equipment. It is expected to explore a number of important cosmological problems in extremely high precision. In this work, we focus on investigating the constraints on neutrino mass and other cosmological parameters under the model of cold dark matter with a constant equation of state of dark energy…
▽ More
China Space Station Telescope (CSST) is a forthcoming powerful Stage IV space-based optical survey equipment. It is expected to explore a number of important cosmological problems in extremely high precision. In this work, we focus on investigating the constraints on neutrino mass and other cosmological parameters under the model of cold dark matter with a constant equation of state of dark energy ($w$CDM), using the mock data from the CSST photometric galaxy clustering and cosmic shear surveys (i.e. 3$\times$2pt). The systematics from galaxy bias, photometric redshift uncertainties, intrinsic alignment, shear calibration, baryonic feedback, non-linear, and instrumental effects are also included in the analysis. We generate the mock data based on the COSMOS catalog considering the instrumental and observational effects of the CSST, and make use of the Markov Chain Monte Carlo (MCMC) method to perform the constraints. Comparing to the results from current similar measurements, we find that CSST 3$\times$2pt surveys can improve the constraints on the cosmological parameters by one order of magnitude at least. We can obtain an upper limit for the sum of neutrino mass $Σm_ν \lesssim 0.36$ (0.56) eV at 68\% (95\%) confidence level, and $Σm_ν \lesssim 0.23$ (0.29) eV at 68\% (95\%) confidence level if ignore the baryonic effect, which is comparable to the {\it Planck} results and much better than the current photometric surveys. This indicates that the CSST photometric surveys can provide stringent constraints on the neutrino mass and other cosmological parameters, and the results also can be further improved by including data from other kinds of CSST cosmological surveys.
△ Less
Submitted 26 July, 2022; v1 submitted 21 March, 2022;
originally announced March 2022.
-
Zipfian environments for Reinforcement Learning
Authors:
Stephanie C. Y. Chan,
Andrew K. Lampinen,
Pierre H. Richemond,
Felix Hill
Abstract:
As humans and animals learn in the natural world, they encounter distributions of entities, situations and events that are far from uniform. Typically, a relatively small set of experiences are encountered frequently, while many important experiences occur only rarely. The highly-skewed, heavy-tailed nature of reality poses particular learning challenges that humans and animals have met by evolvin…
▽ More
As humans and animals learn in the natural world, they encounter distributions of entities, situations and events that are far from uniform. Typically, a relatively small set of experiences are encountered frequently, while many important experiences occur only rarely. The highly-skewed, heavy-tailed nature of reality poses particular learning challenges that humans and animals have met by evolving specialised memory systems. By contrast, most popular RL environments and benchmarks involve approximately uniform variation of properties, objects, situations or tasks. How will RL algorithms perform in worlds (like ours) where the distribution of environment features is far less uniform? To explore this question, we develop three complementary RL environments where the agent's experience varies according to a Zipfian (discrete power law) distribution. On these benchmarks, we find that standard Deep RL architectures and algorithms acquire useful knowledge of common situations and tasks, but fail to adequately learn about rarer ones. To understand this failure better, we explore how different aspects of current approaches may be adjusted to help improve performance on rare events, and show that the RL objective function, the agent's memory system and self-supervised learning objectives can all influence an agent's ability to learn from uncommon experiences. Together, these results show that learning robustly from skewed experience is a critical challenge for applying Deep RL methods beyond simulations or laboratories, and our Zipfian environments provide a basis for measuring future progress towards this goal.
△ Less
Submitted 8 August, 2022; v1 submitted 15 March, 2022;
originally announced March 2022.
-
Locally refined quad meshing for linear elasticity problems based on convolutional neural networks
Authors:
Chiu Ling Chan,
Felix Scholz,
Thomas Takacs
Abstract:
In this paper we propose a method to generate suitably refined finite element meshes using neural networks. As a model problem we consider a linear elasticity problem on a planar domain (possibly with holes) having a polygonal boundary. We impose boundary conditions by fixing the position of a part of the boundary and applying a force on another part of the boundary. The resulting displacement and…
▽ More
In this paper we propose a method to generate suitably refined finite element meshes using neural networks. As a model problem we consider a linear elasticity problem on a planar domain (possibly with holes) having a polygonal boundary. We impose boundary conditions by fixing the position of a part of the boundary and applying a force on another part of the boundary. The resulting displacement and distribution of stresses depend on the geometry of the domain and on the boundary conditions. When applying a standard Galerkin discretization using quadrilateral finite elements, one usually has to perform adaptive refinement to properly resolve maxima of the stress distribution. Such an adaptive scheme requires a local error estimator and a corresponding local refinement strategy. The overall costs of such a strategy are high. We propose to reduce the costs of obtaining a suitable discretization by training a neural network whose evaluation replaces this adaptive refinement procedure. We set up a single network for a large class of possible domains and boundary conditions and not on a single domain of interest. The computational domain and boundary conditions are interpreted as images, which are suitable inputs for convolution neural networks. We use the U-net architecture and we devise training strategies by dividing the possible inputs into different categories based on their overall geometric complexity. Thus, we compare different training strategies based on varying geometric complexity. One of the advantages of the proposed approach is the interpretation of input and output as images, which do not depend on the underlying discretization scheme. Another is the generalizability and geometric flexibility. The network can be applied to previously unseen geometries, even with different topology and level of detail. Thus, training can easily be extended to other classes of geometries.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models
Authors:
Ning Ding,
Yujia Qin,
Guang Yang,
Fuchao Wei,
Zonghan Yang,
Yusheng Su,
Shengding Hu,
Yulin Chen,
Chi-Min Chan,
Weize Chen,
**g Yi,
Weilin Zhao,
Xiaozhi Wang,
Zhiyuan Liu,
Hai-Tao Zheng,
Jianfei Chen,
Yang Liu,
Jie Tang,
Juanzi Li,
Maosong Sun
Abstract:
Despite the success, the process of fine-tuning large-scale PLMs brings prohibitive adaptation costs. In fact, fine-tuning all the parameters of a colossal model and retaining separate instances for different tasks are practically infeasible. This necessitates a new branch of research focusing on the parameter-efficient adaptation of PLMs, dubbed as delta tuning in this paper. In contrast with the…
▽ More
Despite the success, the process of fine-tuning large-scale PLMs brings prohibitive adaptation costs. In fact, fine-tuning all the parameters of a colossal model and retaining separate instances for different tasks are practically infeasible. This necessitates a new branch of research focusing on the parameter-efficient adaptation of PLMs, dubbed as delta tuning in this paper. In contrast with the standard fine-tuning, delta tuning only fine-tunes a small portion of the model parameters while kee** the rest untouched, largely reducing both the computation and storage costs. Recent studies have demonstrated that a series of delta tuning methods with distinct tuned parameter selection could achieve performance on a par with full-parameter fine-tuning, suggesting a new promising way of stimulating large-scale PLMs. In this paper, we first formally describe the problem of delta tuning and then comprehensively review recent delta tuning approaches. We also propose a unified categorization criterion that divide existing delta tuning methods into three groups: addition-based, specification-based, and reparameterization-based methods. Though initially proposed as an efficient method to steer large models, we believe that some of the fascinating evidence discovered along with delta tuning could help further reveal the mechanisms of PLMs and even deep neural networks. To this end, we discuss the theoretical principles underlying the effectiveness of delta tuning and propose frameworks to interpret delta tuning from the perspective of optimization and optimal control, respectively. Furthermore, we provide a holistic empirical study of representative methods, where results on over 100 NLP tasks demonstrate a comprehensive performance comparison of different approaches. The experimental results also cover the analysis of combinatorial, scaling and transferable properties of delta tuning.
△ Less
Submitted 14 March, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Unimon qubit
Authors:
Eric Hyyppä,
Suman Kundu,
Chun Fai Chan,
András Gunyhó,
Juho Hotari,
David Janzso,
Kristinn Juliusson,
Olavi Kiuru,
Janne Kotilahti,
Alessandro Landra,
Wei Liu,
Fabian Marxer,
Akseli Mäkinen,
Jean-Luc Orgiazzi,
Mario Palma,
Mykhailo Savytskyi,
Francesca Tosto,
Jani Tuorila,
Vasilii Vadimov,
Tianyi Li,
Caspar Ockeloen-Korppi,
Johannes Heinsoo,
Kuan Yen Tan,
Juha Hassel,
Mikko Möttönen
Abstract:
Superconducting qubits are one of the most promising candidates to implement quantum computers. The superiority of superconducting quantum computers over any classical device in simulating random but well-determined quantum circuits has already been shown in two independent experiments and important steps have been taken in quantum error correction. However, the currently wide-spread qubit designs…
▽ More
Superconducting qubits are one of the most promising candidates to implement quantum computers. The superiority of superconducting quantum computers over any classical device in simulating random but well-determined quantum circuits has already been shown in two independent experiments and important steps have been taken in quantum error correction. However, the currently wide-spread qubit designs do not yet provide high enough performance to enable practical applications or efficient scaling of logical qubits owing to one or several following issues: sensitivity to charge or flux noise leading to decoherence, too weak non-linearity preventing fast operations, undesirably dense excitation spectrum, or complicated design vulnerable to parasitic capacitance. Here, we introduce and demonstrate a superconducting-qubit type, the unimon, which combines the desired properties of high non-linearity, full insensitivity to dc charge noise, insensitivity to flux noise, and a simple structure consisting only of a single Josephson junction in a resonator. We measure the qubit frequency, $ω_{01}/(2π)$, and anharmonicity $α$ over the full dc-flux range and observe, in agreement with our quantum models, that the qubit anharmonicity is greatly enhanced at the optimal operation point, yielding, for example, 99.9% and 99.8% fidelity for 13-ns single-qubit gates on two qubits with $(ω_{01},α)=(4.49~\mathrm{GHz}, 434~\mathrm{ MHz})\times 2π$ and $(3.55~\mathrm{GHz}, 744~\mathrm{ MHz})\times 2π$, respectively. The energy relaxation time $T_1\lesssim 10~μ\mathrm{s}$ is stable for hours and seems to be limited by dielectric losses. Thus, future improvements of the design, materials, and gate time may promote the unimon to break the 99.99% fidelity target for efficient quantum error correction and possible quantum advantage with noisy systems.
△ Less
Submitted 5 April, 2022; v1 submitted 11 March, 2022;
originally announced March 2022.
-
A Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics
Authors:
J. Aalbers,
K. Abe,
V. Aerne,
F. Agostini,
S. Ahmed Maouloud,
D. S. Akerib,
D. Yu. Akimov,
J. Akshat,
A. K. Al Musalhi,
F. Alder,
S. K. Alsum,
L. Althueser,
C. S. Amarasinghe,
F. D. Amaro,
A. Ames,
T. J. Anderson,
B. Andrieu,
N. Angelides,
E. Angelino,
J. Angevaare,
V. C. Antochi,
D. Antón Martin,
B. Antunovic,
E. Aprile,
H. M. Araújo
, et al. (572 additional authors not shown)
Abstract:
The nature of dark matter and properties of neutrinos are among the most pressing issues in contemporary particle physics. The dual-phase xenon time-projection chamber is the leading technology to cover the available parameter space for Weakly Interacting Massive Particles (WIMPs), while featuring extensive sensitivity to many alternative dark matter candidates. These detectors can also study neut…
▽ More
The nature of dark matter and properties of neutrinos are among the most pressing issues in contemporary particle physics. The dual-phase xenon time-projection chamber is the leading technology to cover the available parameter space for Weakly Interacting Massive Particles (WIMPs), while featuring extensive sensitivity to many alternative dark matter candidates. These detectors can also study neutrinos through neutrinoless double-beta decay and through a variety of astrophysical sources. A next-generation xenon-based detector will therefore be a true multi-purpose observatory to significantly advance particle physics, nuclear physics, astrophysics, solar physics, and cosmology. This review article presents the science cases for such a detector.
△ Less
Submitted 4 March, 2022;
originally announced March 2022.
-
First joint observation by the underground gravitational-wave detector, KAGRA, with GEO600
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Allocca,
P. A. Altin
, et al. (1647 additional authors not shown)
Abstract:
We report the results of the first joint observation of the KAGRA detector with GEO600. KAGRA is a cryogenic and underground gravitational-wave detector consisting of a laser interferometer with three-kilometer arms, and located in Kamioka, Gifu, Japan. GEO600 is a British--German laser interferometer with 600 m arms, and located near Hannover, Germany. GEO600 and KAGRA performed a joint observing…
▽ More
We report the results of the first joint observation of the KAGRA detector with GEO600. KAGRA is a cryogenic and underground gravitational-wave detector consisting of a laser interferometer with three-kilometer arms, and located in Kamioka, Gifu, Japan. GEO600 is a British--German laser interferometer with 600 m arms, and located near Hannover, Germany. GEO600 and KAGRA performed a joint observing run from April 7 to 20, 2020. We present the results of the joint analysis of the GEO--KAGRA data for transient gravitational-wave signals, including the coalescence of neutron-star binaries and generic unmodeled transients. We also perform dedicated searches for binary coalescence signals and generic transients associated with gamma-ray burst events observed during the joint run. No gravitational-wave events were identified. We evaluate the minimum detectable amplitude for various types of transient signals and the spacetime volume for which the network is sensitive to binary neutron-star coalescences. We also place lower limits on the distances to the gamma-ray bursts analysed based on the non-detection of an associated gravitational-wave signal for several signal models, including binary coalescences. These analyses demonstrate the feasibility and utility of KAGRA as a member of the global gravitational-wave detector network.
△ Less
Submitted 19 August, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Constrained tandem neural network assisted inverse design of metasurfaces for microwave absorption
Authors:
Xiangxu He,
Xiaohan Cui,
C. T. Chan
Abstract:
Designing microwave absorbers with customized spectrums is an attractive topic in both scientific and engineering communities. However, due to the massive number of design parameters involved, the design process is typically time-consuming and computationally expensive. To address this challenge, machine learning has emerged as a powerful tool for optimizing design parameters. In this work, we pre…
▽ More
Designing microwave absorbers with customized spectrums is an attractive topic in both scientific and engineering communities. However, due to the massive number of design parameters involved, the design process is typically time-consuming and computationally expensive. To address this challenge, machine learning has emerged as a powerful tool for optimizing design parameters. In this work, we present an analytical model for an absorber composed of a multi-layered metasurface and propose a novel inverse design method based on a constrained tandem neural network. The network can provide structural and material parameters optimized for a given absorption spectrum, without requiring professional knowledge. Furthermore, additional physical attributes, such as absorber thickness, can be optimized when soft constraints are applied. As an illustrative example, we use the neural network to design broadband microwave absorbers with a thickness close to the causality limit imposed by the Kramers-Kronig relation. Our approach provides new insights into the reverse engineering of physical devices.
△ Less
Submitted 23 November, 2023; v1 submitted 22 February, 2022;
originally announced March 2022.
-
Dynamic Control of Service Systems with Returns: Application to Design of Post-Discharge Hospital Readmission Prevention Programs
Authors:
Timothy C. Y. Chan,
Simon Y. Huang,
Vahid Sarhangian
Abstract:
We study a control problem for queueing systems where customers may return for additional episodes of service after their initial service completion. At each service completion epoch, the decision maker can choose to reduce the probability of return for the departing customer but at a cost that is convex increasing in the amount of reduction in the return probability. Other costs are incurred as c…
▽ More
We study a control problem for queueing systems where customers may return for additional episodes of service after their initial service completion. At each service completion epoch, the decision maker can choose to reduce the probability of return for the departing customer but at a cost that is convex increasing in the amount of reduction in the return probability. Other costs are incurred as customers wait in the queue and every time they return for service. Our primary motivation comes from post-discharge Quality Improvement (QI) interventions (e.g., follow up phone-calls, appointments) frequently used in a variety of healthcare settings to reduce unplanned hospital readmissions. Our objective is to understand how the cost of interventions should be balanced with the reductions in congestion and service costs. To this end, we consider a fluid approximation of the queueing system and characterize the structure of optimal long-run average and bias-optimal transient control policies for the fluid model. Our structural results motivate the design of intuitive surge protocols whereby different intensities of interventions (corresponding to different levels of reduction in the return probability) are provided based on the congestion in the system. Through extensive simulation experiments, we study the performance of the fluid policy for the stochastic system and identify parameter regimes where it leads to significant cost savings compared to a fixed long-run average optimal policy that ignores holding costs and a simple policy that uses the highest level of intervention whenever the queue is non-empty. In particular, we find that in a parameter regime relevant to our motivating application, dynamically adjusting the intensity of interventions could result in up to 25.4% reduction in long-run average cost and 33.7% in finite-horizon costs compared to the simple aggressive policy.
△ Less
Submitted 10 June, 2024; v1 submitted 28 February, 2022;
originally announced March 2022.
-
Markov Chain Monte Carlo-Based Machine Unlearning: Unlearning What Needs to be Forgotten
Authors:
Quoc Phong Nguyen,
Ryutaro Oikawa,
Dinil Mon Divakaran,
Mun Choon Chan,
Bryan Kian Hsiang Low
Abstract:
As the use of machine learning (ML) models is becoming increasingly popular in many real-world applications, there are practical challenges that need to be addressed for model maintenance. One such challenge is to 'undo' the effect of a specific subset of dataset used for training a model. This specific subset may contain malicious or adversarial data injected by an attacker, which affects the mod…
▽ More
As the use of machine learning (ML) models is becoming increasingly popular in many real-world applications, there are practical challenges that need to be addressed for model maintenance. One such challenge is to 'undo' the effect of a specific subset of dataset used for training a model. This specific subset may contain malicious or adversarial data injected by an attacker, which affects the model performance. Another reason may be the need for a service provider to remove data pertaining to a specific user to respect the user's privacy. In both cases, the problem is to 'unlearn' a specific subset of the training data from a trained model without incurring the costly procedure of retraining the whole model from scratch. Towards this goal, this paper presents a Markov chain Monte Carlo-based machine unlearning (MCU) algorithm. MCU helps to effectively and efficiently unlearn a trained model from subsets of training dataset. Furthermore, we show that with MCU, we are able to explain the effect of a subset of a training dataset on the model prediction. Thus, MCU is useful for examining subsets of data to identify the adversarial data to be removed. Similarly, MCU can be used to erase the lineage of a user's personal data from trained ML models, thus upholding a user's "right to be forgotten". We empirically evaluate the performance of our proposed MCU algorithm on real-world phishing and diabetes datasets. Results show that MCU can achieve a desirable performance by efficiently removing the effect of a subset of training dataset and outperform an existing algorithm that utilizes the remaining dataset.
△ Less
Submitted 28 February, 2022;
originally announced February 2022.
-
PATOKA: Simulating Electromagnetic Observables of Black Hole Accretion
Authors:
George N. Wong,
Ben S. Prather,
Vedant Dhruv,
Benjamin R. Ryan,
Monika Moscibrodzka,
Chi-kwan Chan,
Abhishek V. Joshi,
Ricardo Yarza,
Angelo Ricarte,
Hotaka Shiokawa,
Joshua C. Dolence,
Scott C. Noble,
Jonathan C. McKinney,
Charles F. Gammie
Abstract:
The Event Horizon Telescope (EHT) has released analyses of reconstructed images of horizon-scale millimeter emission near the supermassive black hole at the center of the M87 galaxy. Parts of the analyses made use of a large library of synthetic black hole images and spectra, which were produced using numerical general relativistic magnetohydrodynamics fluid simulations and polarized ray tracing.…
▽ More
The Event Horizon Telescope (EHT) has released analyses of reconstructed images of horizon-scale millimeter emission near the supermassive black hole at the center of the M87 galaxy. Parts of the analyses made use of a large library of synthetic black hole images and spectra, which were produced using numerical general relativistic magnetohydrodynamics fluid simulations and polarized ray tracing. In this article, we describe the PATOKA pipeline, which was used to generate the Illinois contribution to the EHT simulation library. We begin by describing the relevant accretion systems and radiative processes. We then describe the details of the three numerical codes we use, iharm, ipole, and igrmonty, paying particular attention to differences between the current generation of the codes and the originally published versions. Finally, we provide a brief overview of simulated data as produced by PATOKA and conclude with a discussion of limitations and future directions.
△ Less
Submitted 11 March, 2022; v1 submitted 23 February, 2022;
originally announced February 2022.
-
Observation of boundary induced chiral anomaly bulk states and their transport properties
Authors:
Mudi Wang,
Qiyun Ma,
Shan Liu,
Ruo-Yang Zhang,
Lei Zhang,
Manzhu Ke,
Zhengyou Liu,
C. T. Chan
Abstract:
The robust transport of edge modes is perhaps the most useful property of topological materials. The existence of edge modes is guaranteed by the bulk-edge correspondence, which states that the number of topological edge modes is determined by the bulk topological invariants. To obtain robust transport on the edge, we need to make volumetric changes to many bulk atoms to control the properties of…
▽ More
The robust transport of edge modes is perhaps the most useful property of topological materials. The existence of edge modes is guaranteed by the bulk-edge correspondence, which states that the number of topological edge modes is determined by the bulk topological invariants. To obtain robust transport on the edge, we need to make volumetric changes to many bulk atoms to control the properties of a few edge atoms in a lower dimension. We suggest here that we can do the reverse in some cases: the properties of the edge can guarantee chiral transport phenomena in some bulk modes, achieving phenomena that are essentially the same as those observed in topological valley-Hall systems. Specifically, we show that a topologically trivial 2D hexagonal phononic crystal slab (waveguide) bounded by hardwall boundaries guarantees the existence of bulk modes with chiral anomaly inside a pseudogap. We experimentally observed robust valley-selected transport, complete valley state conversion, and valley focusing of the chiral anomaly bulk states (CABSs) in such phononic crystal waveguides.
△ Less
Submitted 22 February, 2022;
originally announced February 2022.
-
OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines
Authors:
Aaron Babier,
Rafid Mahmood,
Binghao Zhang,
Victor G. L. Alves,
Ana Maria Barragán-Montero,
Joel Beaudry,
Carlos E. Cardenas,
Yankui Chang,
Zijie Chen,
Jaehee Chun,
Kelly Diaz,
Harold David Eraso,
Erik Faustmann,
Sibaji Gaj,
Skylar Gay,
Mary Gronberg,
Bingqi Guo,
Junjun He,
Gerd Heilemann,
Sanchit Hira,
Yuliang Huang,
Fuxin Ji,
Dashan Jiang,
Jean Carlo Jimenez Giraldo,
Hoyeon Lee
, et al. (34 additional authors not shown)
Abstract:
We establish an open framework for develo** plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization mode…
▽ More
We establish an open framework for develo** plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization models to form 76 unique KBP pipelines that generated 7600 plans. The predictions and plans were compared to the reference plans via: dose score, which is the average mean absolute voxel-by-voxel difference in dose a model achieved; the deviation in dose-volume histogram (DVH) criterion; and the frequency of clinical planning criteria satisfaction. We also performed a theoretical investigation to justify our dose mimicking models. The range in rank order correlation of the dose score between predictions and their KBP pipelines was 0.50 to 0.62, which indicates that the quality of the predictions is generally positively correlated with the quality of the plans. Additionally, compared to the input predictions, the KBP-generated plans performed significantly better (P<0.05; one-sided Wilcoxon test) on 18 of 23 DVH criteria. Similarly, each optimization model generated plans that satisfied a higher percentage of criteria than the reference plans. Lastly, our theoretical investigation demonstrated that the dose mimicking models generated plans that are also optimal for a conventional planning model. This was the largest international effort to date for evaluating the combination of KBP prediction and optimization models. In the interest of reproducibility, our data and code is freely available at https://github.com/ababier/open-kbp-opt.
△ Less
Submitted 16 February, 2022;
originally announced February 2022.
-
ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning
Authors:
Jia Huei Tan,
Ying Hua Tan,
Chee Seng Chan,
Joon Huang Chuah
Abstract:
Recent research that applies Transformer-based architectures to image captioning has resulted in state-of-the-art image captioning performance, capitalising on the success of Transformers on natural language tasks. Unfortunately, though these models work well, one major flaw is their large model sizes. To this end, we present three parameter reduction methods for image captioning Transformers: Rad…
▽ More
Recent research that applies Transformer-based architectures to image captioning has resulted in state-of-the-art image captioning performance, capitalising on the success of Transformers on natural language tasks. Unfortunately, though these models work well, one major flaw is their large model sizes. To this end, we present three parameter reduction methods for image captioning Transformers: Radix Encoding, cross-layer parameter sharing, and attention parameter sharing. By combining these methods, our proposed ACORT models have 3.7x to 21.6x fewer parameters than the baseline model without compromising test performance. Results on the MS-COCO dataset demonstrate that our ACORT models are competitive against baselines and SOTA approaches, with CIDEr score >=126. Finally, we present qualitative results and ablation studies to demonstrate the efficacy of the proposed changes further. Code and pre-trained models are publicly available at https://github.com/jiahuei/sparse-image-captioning.
△ Less
Submitted 11 February, 2022;
originally announced February 2022.
-
Deep Learning for Computational Cytology: A Survey
Authors:
Hao Jiang,
Yanning Zhou,
Yi Lin,
Ronald CK Chan,
Jiang Liu,
Hao Chen
Abstract:
Computational cytology is a critical, rapid-develo**, yet challenging topic in the field of medical image computing which analyzes the digitized cytology image by computer-aided technologies for cancer screening. Recently, an increasing number of deep learning (DL) algorithms have made significant progress in medical image analysis, leading to the boosting publications of cytological studies. To…
▽ More
Computational cytology is a critical, rapid-develo**, yet challenging topic in the field of medical image computing which analyzes the digitized cytology image by computer-aided technologies for cancer screening. Recently, an increasing number of deep learning (DL) algorithms have made significant progress in medical image analysis, leading to the boosting publications of cytological studies. To investigate the advanced methods and comprehensive applications, we survey more than 120 publications of DL-based cytology image analysis in this article. We first introduce various deep learning methods, including fully supervised, weakly supervised, unsupervised, and transfer learning. Then, we systematically summarize the public datasets, evaluation metrics, versatile cytology image analysis applications including classification, detection, segmentation, and other related tasks. Finally, we discuss current challenges and potential research directions of computational cytology.
△ Less
Submitted 16 February, 2022; v1 submitted 10 February, 2022;
originally announced February 2022.
-
Demonstration of non-Abelian frame charge flow in photonic crystals
Authors:
Dongyang Wang,
Z. Q. Zhang,
C. T. Chan
Abstract:
In PT symmetric systems, the notion of non-Abelian frame charges enables multiband topological characterization of the degeneracy nodes through examining the eigenvector frame rotations. Interestingly, some features of these frame charges can be viewed as an analogue of electric charges confined in conducting wires, only that they flow in momentum space along nodal lines. However, these frame char…
▽ More
In PT symmetric systems, the notion of non-Abelian frame charges enables multiband topological characterization of the degeneracy nodes through examining the eigenvector frame rotations. Interestingly, some features of these frame charges can be viewed as an analogue of electric charges confined in conducting wires, only that they flow in momentum space along nodal lines. However, these frame charges are not integers, and non-Abelian signatures emerge when braiding between adjacent band nodal lines occurs, which flips the direction of the flow. In photonic systems, we discover that the photonic Γ point serves as the source or sink of such frame charge flow due to a hidden braiding induced by the often-ignored electrostatic mode at zero-frequency. We use biaxial photonic crystals as examples and show how complex nodal line configurations can be explained as the topological consequences of the frame charge flow from the Γ point to the Brillouin zone boundaries. We further designed and fabricated meta-crystals to experimentally observe these line nodes as manifestation of the non-Abelian frame charge flow.
△ Less
Submitted 28 October, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
Topological Data Analysis of Black Hole Images
Authors:
Pierre Christian,
Chi-kwan Chan,
Anthony Hsu,
Feryal Ozel,
Dimitrios Psaltis,
Iniyan Natarajan
Abstract:
Features such as photon rings, jets, or hot. spots can leave particular topological signatures in a black hole image. As such, topological data analysis can be used to characterize images resulting from high resolution observations (synthetic or real) of black holes in the electromagnetic sector. We demonstrate that persistent homology allows for this characterization to be made automatically by c…
▽ More
Features such as photon rings, jets, or hot. spots can leave particular topological signatures in a black hole image. As such, topological data analysis can be used to characterize images resulting from high resolution observations (synthetic or real) of black holes in the electromagnetic sector. We demonstrate that persistent homology allows for this characterization to be made automatically by counting the number of connected components and one-dimensional holes. Further, persistent homology also allows for the distance between connected components or diameter of holes to be extracted from the image. In order to apply persistent homology on synthetic black hole images, we also introduce metronization, a new algorithm to prepare black hole images into a form that is suitable for topological analysis.
△ Less
Submitted 8 October, 2022; v1 submitted 1 February, 2022;
originally announced February 2022.
-
Search for gravitational waves from Scorpius X-1 with a hidden Markov model in O3 LIGO data
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Allocca,
P. A. Altin
, et al. (1647 additional authors not shown)
Abstract:
Results are presented for a semi-coherent search for continuous gravitational waves from the low-mass X-ray binary Scorpius X-1, using a hidden Markov model (HMM) to allow for spin wandering. This search improves on previous HMM-based searches of Laser Interferometer Gravitational-wave Observatory (LIGO) data by including the orbital period in the search template grid, and by analyzing data from t…
▽ More
Results are presented for a semi-coherent search for continuous gravitational waves from the low-mass X-ray binary Scorpius X-1, using a hidden Markov model (HMM) to allow for spin wandering. This search improves on previous HMM-based searches of Laser Interferometer Gravitational-wave Observatory (LIGO) data by including the orbital period in the search template grid, and by analyzing data from the latest (third) observing run (O3). In the frequency range searched, from 60 to 500 Hz, we find no evidence of gravitational radiation. This is the most sensitive search for Scorpius X-1 using a HMM to date. For the most sensitive sub-band, starting at $256.06$Hz, we report an upper limit on gravitational wave strain (at $95 \%$ confidence) of $h_{0}^{95\%}=6.16\times10^{-26}$, assuming the orbital inclination angle takes its electromagnetically restricted value $ι=44^{\circ}$. The upper limits on gravitational wave strain reported here are on average a factor of $\sim 3$ lower than in the O2 HMM search. This is the first Scorpius X-1 HMM search with upper limits that reach below the indirect torque-balance limit for certain sub-bands, assuming $ι=44^{\circ}$.
△ Less
Submitted 25 January, 2022;
originally announced January 2022.