Search | arXiv e-print repository

Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model

Authors: Qi Song, Ziyuan Luo, Ka Chun Cheung, Simon See, Renjie Wan

Abstract: Neural Radiance Fields (NeRFs) have become a key method for 3D scene representation. With the rising prominence and influence of NeRF, safeguarding its intellectual property has become increasingly important. In this paper, we propose \textbf{NeRFProtector}, which adopts a plug-and-play strategy to protect NeRF's copyright during its creation. NeRFProtector utilizes a pre-trained watermarking base… ▽ More Neural Radiance Fields (NeRFs) have become a key method for 3D scene representation. With the rising prominence and influence of NeRF, safeguarding its intellectual property has become increasingly important. In this paper, we propose \textbf{NeRFProtector}, which adopts a plug-and-play strategy to protect NeRF's copyright during its creation. NeRFProtector utilizes a pre-trained watermarking base model, enabling NeRF creators to embed binary messages directly while creating their NeRF. Our plug-and-play property ensures NeRF creators can flexibly choose NeRF variants without excessive modifications. Leveraging our newly designed progressive distillation, we demonstrate performance on par with several leading-edge neural rendering methods. Our project is available at: \url{https://qsong2001.github.io/NeRFProtector}. △ Less

Submitted 10 July, 2024; originally announced July 2024.

Comments: Accepted by ECCV2024

arXiv:2406.17245 [pdf, other]

Unlocking Continual Learning Abilities in Language Models

Authors: Wenyu Du, Shuang Cheng, Tongxu Luo, Zihan Qiu, Zeyu Huang, Ka Chun Cheung, Reynold Cheng, Jie Fu

Abstract: Language models (LMs) exhibit impressive performance and generalization capabilities. However, LMs struggle with the persistent challenge of catastrophic forgetting, which undermines their long-term sustainability in continual learning (CL). Existing approaches usually address the issue by incorporating old task data or task-wise inductive bias into LMs. However, old data and accurate task informa… ▽ More Language models (LMs) exhibit impressive performance and generalization capabilities. However, LMs struggle with the persistent challenge of catastrophic forgetting, which undermines their long-term sustainability in continual learning (CL). Existing approaches usually address the issue by incorporating old task data or task-wise inductive bias into LMs. However, old data and accurate task information are often unavailable or costly to collect, hindering the availability of current CL approaches for LMs. To address this limitation, we introduce $\textbf{MIGU}$ ($\textbf{M}$agn$\textbf{I}$tude-based $\textbf{G}$radient $\textbf{U}$pdating for continual learning), a rehearsal-free and task-label-free method that only updates the model parameters with large magnitudes of output in LMs' linear layers. MIGU is based on our observation that the L1-normalized magnitude distribution of the output in LMs' linear layers is different when the LM models deal with different task data. By imposing this simple constraint on the gradient update process, we can leverage the inherent behaviors of LMs, thereby unlocking their innate CL abilities. Our experiments demonstrate that MIGU is universally applicable to all three LM architectures (T5, RoBERTa, and Llama2), delivering state-of-the-art or on-par performance across continual finetuning and continual pre-training settings on four CL benchmarks. For example, MIGU brings a 15.2% average accuracy improvement over conventional parameter-efficient finetuning baselines in a 15-task CL benchmark. MIGU can also seamlessly integrate with all three existing CL types to further enhance performance. Code is available at \href{https://github.com/wenyudu/MIGU}{this https URL}. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: preprint, 19 pages

arXiv:2405.15724 [pdf, other]

Reconfiguration Algorithms for Cubic Modular Robots with Realistic Movement Constraints

Authors: MIT--NASA Space Robots Team, Josh Brunner, Kenneth C. Cheung, Erik D. Demaine, Jenny Diomidova, Christine Gregg, Della H. Hendrickson, Irina Kostitsyna

Abstract: We introduce and analyze a model for self-reconfigurable robots made up of unit-cube modules. Compared to past models, our model aims to newly capture two important practical aspects of real-world robots. First, modules often do not occupy an exact unit cube, but rather have features like bumps extending outside the allotted space so that modules can interlock. Thus, for example, our model forbids… ▽ More We introduce and analyze a model for self-reconfigurable robots made up of unit-cube modules. Compared to past models, our model aims to newly capture two important practical aspects of real-world robots. First, modules often do not occupy an exact unit cube, but rather have features like bumps extending outside the allotted space so that modules can interlock. Thus, for example, our model forbids modules from squeezing in between two other modules that are one unit distance apart. Second, our model captures the practical scenario of many passive modules assembled by a single robot, instead of requiring all modules to be able to move on their own. We prove two universality results. First, with a supply of auxiliary modules, we show that any connected polycube structure can be constructed by a carefully aligned plane sweep. Second, without additional modules, we show how to construct any structure for which a natural notion of external feature size is at least a constant; this property largely consolidates forbidden-pattern properties used in previous works on reconfigurable modular robots. △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2403.02330 [pdf, other]

RegionGPT: Towards Region Understanding Vision Language Model

Authors: Qiushan Guo, Shalini De Mello, Hongxu Yin, Wonmin Byeon, Ka Chun Cheung, Yizhou Yu, ** Luo, Sifei Liu

Abstract: Vision language models (VLMs) have experienced rapid advancements through the integration of large language models (LLMs) with image-text pairs, yet they struggle with detailed regional visual understanding due to limited spatial awareness of the vision encoder, and the use of coarse-grained training data that lacks detailed, region-specific captions. To address this, we introduce RegionGPT (short… ▽ More Vision language models (VLMs) have experienced rapid advancements through the integration of large language models (LLMs) with image-text pairs, yet they struggle with detailed regional visual understanding due to limited spatial awareness of the vision encoder, and the use of coarse-grained training data that lacks detailed, region-specific captions. To address this, we introduce RegionGPT (short as RGPT), a novel framework designed for complex region-level captioning and understanding. RGPT enhances the spatial awareness of regional representation with simple yet effective modifications to existing visual encoders in VLMs. We further improve performance on tasks requiring a specific output scope by integrating task-guided instruction prompts during both training and inference phases, while maintaining the model's versatility for general-purpose tasks. Additionally, we develop an automated region caption data generation pipeline, enriching the training set with detailed region-level captions. We demonstrate that a universal RGPT model can be effectively applied and significantly enhancing performance across a range of region-level tasks, including but not limited to complex region descriptions, reasoning, object classification, and referring expressions comprehension. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: Accepted by CVPR 2024

arXiv:2401.15977 [pdf, other]

Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

Authors: Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

Abstract: We introduce Motion-I2V, a novel framework for consistent and controllable image-to-video generation (I2V). In contrast to previous methods that directly learn the complicated image-to-video map**, Motion-I2V factorizes I2V into two stages with explicit motion modeling. For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the ref… ▽ More We introduce Motion-I2V, a novel framework for consistent and controllable image-to-video generation (I2V). In contrast to previous methods that directly learn the complicated image-to-video map**, Motion-I2V factorizes I2V into two stages with explicit motion modeling. For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the reference image's pixels. For the second stage, we propose motion-augmented temporal attention to enhance the limited 1-D temporal attention in video latent diffusion models. This module can effectively propagate reference image's feature to synthesized frames with the guidance of predicted trajectories from the first stage. Compared with existing methods, Motion-I2V can generate more consistent videos even at the presence of large motion and viewpoint variation. By training a sparse trajectory ControlNet for the first stage, Motion-I2V can support users to precisely control motion trajectories and motion regions with sparse trajectory and region annotations. This offers more controllability of the I2V process than solely relying on textual instructions. Additionally, Motion-I2V's second stage naturally supports zero-shot video-to-video translation. Both qualitative and quantitative comparisons demonstrate the advantages of Motion-I2V over prior approaches in consistent and controllable image-to-video generation. Please see our project page at https://xiaoyushi97.github.io/Motion-I2V/. △ Less

Submitted 31 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

Comments: Project page: https://xiaoyushi97.github.io/Motion-I2V/

arXiv:2401.14619 [pdf, other]

Resilient Practical Test-Time Adaptation: Soft Batch Normalization Alignment and Entropy-driven Memory Bank

Authors: Xingzhi Zhou, Zhiliang Tian, Ka Chun Cheung, Simon See, Nevin L. Zhang

Abstract: Test-time domain adaptation effectively adjusts the source domain model to accommodate unseen domain shifts in a target domain during inference. However, the model performance can be significantly impaired by continuous distribution changes in the target domain and non-independent and identically distributed (non-i.i.d.) test samples often encountered in practical scenarios. While existing memory… ▽ More Test-time domain adaptation effectively adjusts the source domain model to accommodate unseen domain shifts in a target domain during inference. However, the model performance can be significantly impaired by continuous distribution changes in the target domain and non-independent and identically distributed (non-i.i.d.) test samples often encountered in practical scenarios. While existing memory bank methodologies use memory to store samples and mitigate non-i.i.d. effects, they do not inherently prevent potential model degradation. To address this issue, we propose a resilient practical test-time adaptation (ResiTTA) method focused on parameter resilience and data quality. Specifically, we develop a resilient batch normalization with estimation on normalization statistics and soft alignments to mitigate overfitting and model degradation. We use an entropy-driven memory bank that accounts for timeliness, the persistence of over-confident samples, and sample uncertainty for high-quality data in adaptation. Our framework periodically adapts the source domain model using a teacher-student model through a self-training loss on the memory samples, incorporating soft alignment losses on batch normalization. We empirically validate ResiTTA across various benchmark datasets, demonstrating state-of-the-art performance. △ Less

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2308.12925 [pdf, other]

doi 10.1109/MLSP55844.2023.10285979

Low-count Time Series Anomaly Detection

Authors: Philipp Renz, Kurt Cutajar, Niall Twomey, Gavin K. C. Cheung, Hanting Xie

Abstract: Low-count time series describe sparse or intermittent events, which are prevalent in large-scale online platforms that capture and monitor diverse data types. Several distinct challenges surface when modelling low-count time series, particularly low signal-to-noise ratios (when anomaly signatures are provably undetectable), and non-uniform performance (when average metrics are not representative o… ▽ More Low-count time series describe sparse or intermittent events, which are prevalent in large-scale online platforms that capture and monitor diverse data types. Several distinct challenges surface when modelling low-count time series, particularly low signal-to-noise ratios (when anomaly signatures are provably undetectable), and non-uniform performance (when average metrics are not representative of local behaviour). The time series anomaly detection community currently lacks explicit tooling and processes to model and reliably detect anomalies in these settings. We address this gap by introducing a novel generative procedure for creating benchmark datasets comprising of low-count time series with anomalous segments. Via a mixture of theoretical and empirical analysis, our work explains how widely-used algorithms struggle with the distribution overlap between normal and anomalous segments. In order to mitigate this shortcoming, we then leverage our findings to demonstrate how anomaly score smoothing consistently improves performance. The practical utility of our analysis and recommendation is validated on a real-world dataset containing sales data for retail stores. △ Less

Submitted 24 August, 2023; originally announced August 2023.

Comments: 6 pages, 7 figures, to be published in IEEE 2023 Workshop on Machine Learning for Signal Processing (MLSP)

Journal ref: 2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP)

arXiv:2307.11526 [pdf, other]

CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields

Authors: Ziyuan Luo, Qing Guo, Ka Chun Cheung, Simon See, Renjie Wan

Abstract: Neural Radiance Fields (NeRF) have the potential to be a major representation of media. Since training a NeRF has never been an easy task, the protection of its model copyright should be a priority. In this paper, by analyzing the pros and cons of possible copyright protection solutions, we propose to protect the copyright of NeRF models by replacing the original color representation in NeRF with… ▽ More Neural Radiance Fields (NeRF) have the potential to be a major representation of media. Since training a NeRF has never been an easy task, the protection of its model copyright should be a priority. In this paper, by analyzing the pros and cons of possible copyright protection solutions, we propose to protect the copyright of NeRF models by replacing the original color representation in NeRF with a watermarked color representation. Then, a distortion-resistant rendering scheme is designed to guarantee robust message extraction in 2D renderings of NeRF. Our proposed method can directly protect the copyright of NeRF models while maintaining high rendering quality and bit accuracy when compared among optional solutions. △ Less

Submitted 29 July, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

Comments: 11 pages, 6 figures, accepted by ICCV 2023 non-camera-ready version

arXiv:2306.05888 [pdf, other]

TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses

Authors: Xuesong Chen, Shaoshuai Shi, Chao Zhang, Ben** Zhu, Qiang Wang, Ka Chun Cheung, Simon See, Hongsheng Li

Abstract: 3D multi-object tracking (MOT) is vital for many applications including autonomous driving vehicles and service robots. With the commonly used tracking-by-detection paradigm, 3D MOT has made important progress in recent years. However, these methods only use the detection boxes of the current frame to obtain trajectory-box association results, which makes it impossible for the tracker to recover o… ▽ More 3D multi-object tracking (MOT) is vital for many applications including autonomous driving vehicles and service robots. With the commonly used tracking-by-detection paradigm, 3D MOT has made important progress in recent years. However, these methods only use the detection boxes of the current frame to obtain trajectory-box association results, which makes it impossible for the tracker to recover objects missed by the detector. In this paper, we present TrajectoryFormer, a novel point-cloud-based 3D MOT framework. To recover the missed object by detector, we generates multiple trajectory hypotheses with hybrid candidate boxes, including temporally predicted boxes and current-frame detection boxes, for trajectory-box association. The predicted boxes can propagate object's history trajectory information to the current frame and thus the network can tolerate short-term miss detection of the tracked objects. We combine long-term object motion feature and short-term object appearance feature to create per-hypothesis feature embedding, which reduces the computational overhead for spatial-temporal encoding. Additionally, we introduce a Global-Local Interaction Module to conduct information interaction among all hypotheses and models their spatial relations, leading to accurate estimation of hypotheses. Our TrajectoryFormer achieves state-of-the-art performance on the Waymo 3D MOT benchmarks. Code is available at https://github.com/poodarchu/EFG . △ Less

Submitted 18 August, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

Comments: Accepted by ICCV 2023

arXiv:2303.08340 [pdf, other]

VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

Authors: Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

Abstract: We introduce VideoFlow, a novel optical flow estimation framework for videos. In contrast to previous methods that learn to estimate optical flow from two frames, VideoFlow concurrently estimates bi-directional optical flows for multiple frames that are available in videos by sufficiently exploiting temporal cues. We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directiona… ▽ More We introduce VideoFlow, a novel optical flow estimation framework for videos. In contrast to previous methods that learn to estimate optical flow from two frames, VideoFlow concurrently estimates bi-directional optical flows for multiple frames that are available in videos by sufficiently exploiting temporal cues. We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directional optical flows for the center frame in a three-frame manner. The information of the frame triplet is iteratively fused onto the center frame. To extend TROF for handling more frames, we further propose a MOtion Propagation (MOP) module that bridges multiple TROFs and propagates motion features between adjacent TROFs. With the iterative flow estimation refinement, the information fused in individual TROFs can be propagated into the whole sequence via MOP. By effectively exploiting video information, VideoFlow presents extraordinary performance, ranking 1st on all public benchmarks. On the Sintel benchmark, VideoFlow achieves 1.649 and 0.991 average end-point-error (AEPE) on the final and clean passes, a 15.1% and 7.6% error reduction from the best-published results (1.943 and 1.073 from FlowFormer++). On the KITTI-2015 benchmark, VideoFlow achieves an F1-all error of 3.65%, a 19.2% error reduction from the best-published result (4.52% from FlowFormer++). Code is released at \url{https://github.com/XiaoyuShi97/VideoFlow}. △ Less

Submitted 20 August, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

arXiv:2303.01237 [pdf, other]

FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation

Authors: Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

Abstract: FlowFormer introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance. The core component of FlowFormer is the transformer-based cost-volume encoder. Inspired by the recent success of masked autoencoding (MAE) pretraining in unleashing transformers' capacity of encoding visual representation, we propose Masked Cost Volume Autoencoding (MCVA) to enh… ▽ More FlowFormer introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance. The core component of FlowFormer is the transformer-based cost-volume encoder. Inspired by the recent success of masked autoencoding (MAE) pretraining in unleashing transformers' capacity of encoding visual representation, we propose Masked Cost Volume Autoencoding (MCVA) to enhance FlowFormer by pretraining the cost-volume encoder with a novel MAE scheme. Firstly, we introduce a block-sharing masking strategy to prevent masked information leakage, as the cost maps of neighboring source pixels are highly correlated. Secondly, we propose a novel pre-text reconstruction task, which encourages the cost-volume encoder to aggregate long-range information and ensures pretraining-finetuning consistency. We also show how to modify the FlowFormer architecture to accommodate masks during pretraining. Pretrained with MCVA, FlowFormer++ ranks 1st among published methods on both Sintel and KITTI-2015 benchmarks. Specifically, FlowFormer++ achieves 1.07 and 1.94 average end-point error (AEPE) on the clean and final pass of Sintel benchmark, leading to 7.76\% and 7.18\% error reductions from FlowFormer. FlowFormer++ obtains 4.52 F1-all on the KITTI-2015 test set, improving FlowFormer by 0.16. △ Less

Submitted 2 March, 2023; originally announced March 2023.

arXiv:2211.12759 [pdf, other]

NAS-LID: Efficient Neural Architecture Search with Local Intrinsic Dimension

Authors: Xin He, Jiangchao Yao, Yuxin Wang, Zhenheng Tang, Ka Chu Cheung, Simon See, Bo Han, Xiaowen Chu

Abstract: One-shot neural architecture search (NAS) substantially improves the search efficiency by training one supernet to estimate the performance of every possible child architecture (i.e., subnet). However, the inconsistency of characteristics among subnets incurs serious interference in the optimization, resulting in poor performance ranking correlation of subnets. Subsequent explorations decompose su… ▽ More One-shot neural architecture search (NAS) substantially improves the search efficiency by training one supernet to estimate the performance of every possible child architecture (i.e., subnet). However, the inconsistency of characteristics among subnets incurs serious interference in the optimization, resulting in poor performance ranking correlation of subnets. Subsequent explorations decompose supernet weights via a particular criterion, e.g., gradient matching, to reduce the interference; yet they suffer from huge computational cost and low space separability. In this work, we propose a lightweight and effective local intrinsic dimension (LID)-based method NAS-LID. NAS-LID evaluates the geometrical properties of architectures by calculating the low-cost LID features layer-by-layer, and the similarity characterized by LID enjoys better separability compared with gradients, which thus effectively reduces the interference among subnets. Extensive experiments on NASBench-201 indicate that NAS-LID achieves superior performance with better efficiency. Specifically, compared to the gradient-driven method, NAS-LID can save up to 86% of GPU memory overhead when searching on NASBench-201. We also demonstrate the effectiveness of NAS-LID on ProxylessNAS and OFA spaces. Source code: https://github.com/marsggbo/NAS-LID. △ Less

Submitted 24 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

Comments: Accepted by AAAI2023, AutoML, NAS

arXiv:2211.08760 [pdf, other]

doi 10.1109/SSCI51031.2022.10022281

SVD-PINNs: Transfer Learning of Physics-Informed Neural Networks via Singular Value Decomposition

Authors: Yihang Gao, Ka Chun Cheung, Michael K. Ng

Abstract: Physics-informed neural networks (PINNs) have attracted significant attention for solving partial differential equations (PDEs) in recent years because they alleviate the curse of dimensionality that appears in traditional methods. However, the most disadvantage of PINNs is that one neural network corresponds to one PDE. In practice, we usually need to solve a class of PDEs, not just one. With the… ▽ More Physics-informed neural networks (PINNs) have attracted significant attention for solving partial differential equations (PDEs) in recent years because they alleviate the curse of dimensionality that appears in traditional methods. However, the most disadvantage of PINNs is that one neural network corresponds to one PDE. In practice, we usually need to solve a class of PDEs, not just one. With the explosive growth of deep learning, many useful techniques in general deep learning tasks are also suitable for PINNs. Transfer learning methods may reduce the cost for PINNs in solving a class of PDEs. In this paper, we proposed a transfer learning method of PINNs via kee** singular vectors and optimizing singular values (namely SVD-PINNs). Numerical experiments on high dimensional PDEs (10-d linear parabolic equations and 10-d Allen-Cahn equations) show that SVD-PINNs work for solving a class of PDEs with different but close right-hand-side functions. △ Less

Submitted 14 March, 2024; v1 submitted 16 November, 2022; originally announced November 2022.

Comments: Accepted to The 2022 IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2022)

arXiv:2210.13459 [pdf, other]

Adaptive Label Smoothing with Self-Knowledge in Natural Language Generation

Authors: Dongkyu Lee, Ka Chun Cheung, Nevin L. Zhang

Abstract: Overconfidence has been shown to impair generalization and calibration of a neural network. Previous studies remedy this issue by adding a regularization term to a loss function, preventing a model from making a peaked distribution. Label smoothing smoothes target labels with a pre-defined prior label distribution; as a result, a model is learned to maximize the likelihood of predicting the soft l… ▽ More Overconfidence has been shown to impair generalization and calibration of a neural network. Previous studies remedy this issue by adding a regularization term to a loss function, preventing a model from making a peaked distribution. Label smoothing smoothes target labels with a pre-defined prior label distribution; as a result, a model is learned to maximize the likelihood of predicting the soft label. Nonetheless, the amount of smoothing is the same in all samples and remains fixed in training. In other words, label smoothing does not reflect the change in probability distribution mapped by a model over the course of training. To address this issue, we propose a regularization scheme that brings dynamic nature into the smoothing parameter by taking model probability distribution into account, thereby varying the parameter per instance. A model in training self-regulates the extent of smoothing on the fly during forward propagation. Furthermore, inspired by recent work in bridging label smoothing and knowledge distillation, our work utilizes self-knowledge as a prior label distribution in softening target labels, and presents theoretical support for the regularization effect by knowledge distillation and the dynamic smoothing parameter. Our regularizer is validated comprehensively, and the result illustrates marked improvements in model generalization and calibration, enhancing robustness and trustworthiness of a model. △ Less

Submitted 22 October, 2022; originally announced October 2022.

Comments: EMNLP 2022

arXiv:2210.12427 [pdf, other]

Hard Gate Knowledge Distillation -- Leverage Calibration for Robust and Reliable Language Model

Authors: Dongkyu Lee, Zhiliang Tian, Yingxiu Zhao, Ka Chun Cheung, Nevin L. Zhang

Abstract: In knowledge distillation, a student model is trained with supervisions from both knowledge from a teacher and observations drawn from a training data distribution. Knowledge of a teacher is considered a subject that holds inter-class relations which send a meaningful supervision to a student; hence, much effort has been put to find such knowledge to be distilled. In this paper, we explore a quest… ▽ More In knowledge distillation, a student model is trained with supervisions from both knowledge from a teacher and observations drawn from a training data distribution. Knowledge of a teacher is considered a subject that holds inter-class relations which send a meaningful supervision to a student; hence, much effort has been put to find such knowledge to be distilled. In this paper, we explore a question that has been given little attention: "when to distill such knowledge." The question is answered in our work with the concept of model calibration; we view a teacher model not only as a source of knowledge but also as a gauge to detect miscalibration of a student. This simple and yet novel view leads to a hard gate knowledge distillation scheme that switches between learning from a teacher model and training data. We verify the gating mechanism in the context of natural language generation at both the token-level and the sentence-level. Empirical comparisons with strong baselines show that hard gate knowledge distillation not only improves model generalization, but also significantly lowers model calibration error. △ Less

Submitted 22 October, 2022; originally announced October 2022.

Comments: EMNLP 2022

arXiv:2209.08896 [pdf, other]

NeuralMarker: A Framework for Learning General Marker Correspondence

Authors: Zhaoyang Huang, Xiaokun Pan, Weihong Pan, Weikang Bian, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li

Abstract: We tackle the problem of estimating correspondences from a general marker, such as a movie poster, to an image that captures such a marker. Conventionally, this problem is addressed by fitting a homography model based on sparse feature matching. However, they are only able to handle plane-like markers and the sparse features do not sufficiently utilize appearance information. In this paper, we pro… ▽ More We tackle the problem of estimating correspondences from a general marker, such as a movie poster, to an image that captures such a marker. Conventionally, this problem is addressed by fitting a homography model based on sparse feature matching. However, they are only able to handle plane-like markers and the sparse features do not sufficiently utilize appearance information. In this paper, we propose a novel framework NeuralMarker, training a neural network estimating dense marker correspondences under various challenging conditions, such as marker deformation, harsh lighting, etc. Besides, we also propose a novel marker correspondence evaluation method circumstancing annotations on real marker-image pairs and create a new benchmark. We show that NeuralMarker significantly outperforms previous methods and enables new interesting applications, including Augmented Reality (AR) and video editing. △ Less

Submitted 19 September, 2022; originally announced September 2022.

Comments: Accepted by ToG (SIGGRAPH Asia 2022). Project Page: https://drinkingcoder.github.io/publication/neuralmarker/

arXiv:2208.05244 [pdf, other]

Learning Degradation Representations for Image Deblurring

Authors: Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li

Abstract: In various learning-based image restoration tasks, such as image denoising and image super-resolution, the degradation representations were widely used to model the degradation process and handle complicated degradation patterns. However, they are less explored in learning-based image deblurring as blur kernel estimation cannot perform well in real-world challenging cases. We argue that it is part… ▽ More In various learning-based image restoration tasks, such as image denoising and image super-resolution, the degradation representations were widely used to model the degradation process and handle complicated degradation patterns. However, they are less explored in learning-based image deblurring as blur kernel estimation cannot perform well in real-world challenging cases. We argue that it is particularly necessary for image deblurring to model degradation representations since blurry patterns typically show much larger variations than noisy patterns or high-frequency textures.In this paper, we propose a framework to learn spatially adaptive degradation representations of blurry images. A novel joint image reblurring and deblurring learning process is presented to improve the expressiveness of degradation representations. To make learned degradation representations effective in reblurring and deblurring, we propose a Multi-Scale Degradation Injection Network (MSDI-Net) to integrate them into the neural networks. With the integration, MSDI-Net can handle various and complicated blurry patterns adaptively. Experiments on the GoPro and RealBlur datasets demonstrate that our proposed deblurring framework with the learned degradation representations outperforms state-of-the-art methods with appealing improvements. The code is released at https://github.com/dasongli1/Learning_degradation. △ Less

Submitted 10 August, 2022; originally announced August 2022.

Comments: Accepted to ECCV 2022

Journal ref: ECCV 2022

arXiv:2206.10810 [pdf, other]

A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift

Authors: Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li

Abstract: Video restoration, which aims to restore clear frames from degraded videos, has numerous important applications. The key to video restoration depends on utilizing inter-frame information. However, existing deep learning methods often rely on complicated network architectures, such as optical flow estimation, deformable convolution, and cross-frame self-attention layers, resulting in high computati… ▽ More Video restoration, which aims to restore clear frames from degraded videos, has numerous important applications. The key to video restoration depends on utilizing inter-frame information. However, existing deep learning methods often rely on complicated network architectures, such as optical flow estimation, deformable convolution, and cross-frame self-attention layers, resulting in high computational costs. In this study, we propose a simple yet effective framework for video restoration. Our approach is based on grouped spatial-temporal shift, which is a lightweight and straightforward technique that can implicitly capture inter-frame correspondences for multi-frame aggregation. By introducing grouped spatial shift, we attain expansive effective receptive fields. Combined with basic 2D convolution, this simple framework can effectively aggregate inter-frame information. Extensive experiments demonstrate that our framework outperforms the previous state-of-the-art method, while using less than a quarter of its computational cost, on both video deblurring and video denoising tasks. These results indicate the potential for our approach to significantly reduce computational overhead while maintaining high-quality results. Code is avaliable at https://github.com/dasongli1/Shift-Net. △ Less

Submitted 22 May, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

Comments: Accepted to CVPR2023

Journal ref: 2023 Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

arXiv:2205.05979 [pdf, other]

MPPNet: Multi-Frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection

Authors: Xuesong Chen, Shaoshuai Shi, Ben** Zhu, Ka Chun Cheung, Hang Xu, Hongsheng Li

Abstract: Accurate and reliable 3D detection is vital for many applications including autonomous driving vehicles and service robots. In this paper, we present a flexible and high-performance 3D detection framework, named MPPNet, for 3D temporal object detection with point cloud sequences. We propose a novel three-hierarchy framework with proxy points for multi-frame feature encoding and interactions to ach… ▽ More Accurate and reliable 3D detection is vital for many applications including autonomous driving vehicles and service robots. In this paper, we present a flexible and high-performance 3D detection framework, named MPPNet, for 3D temporal object detection with point cloud sequences. We propose a novel three-hierarchy framework with proxy points for multi-frame feature encoding and interactions to achieve better detection. The three hierarchies conduct per-frame feature encoding, short-clip feature fusion, and whole-sequence feature aggregation, respectively. To enable processing long-sequence point clouds with reasonable computational resources, intra-group feature mixing and inter-group feature attention are proposed to form the second and third feature encoding hierarchies, which are recurrently applied for aggregating multi-frame trajectory features. The proxy points not only act as consistent object representations for each frame, but also serve as the courier to facilitate feature interaction between frames. The experiments on large Waymo Open dataset show that our approach outperforms state-of-the-art methods with large margins when applied to both short (e.g., 4-frame) and long (e.g., 16-frame) point cloud sequences. Code is available at https://github.com/open-mmlab/OpenPCDet. △ Less

Submitted 2 September, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

Comments: Accepted by ECCV 2022

arXiv:2204.02990 [pdf, ps, other]

doi 10.1007/JHEP08(2022)082

M5-branes wrapped on four-dimensional orbifolds

Authors: K. C. Matthew Cheung, Jacob H. T. Fry, Jerome P. Gauntlett, James Sparks

Abstract: We construct supersymmetric $AdS_3$ solutions of $D=11$ supergravity, dual to $d=2$, $\mathcal{N}=(0,2)$ SCFTs, that are associated with M5-branes wrap** two different four-dimensional orbifolds. In one case the orbifold is a spindle fibred over another spindle, while in the other it is a spindle fibred over a Riemann surface with genus $g>1$. We show that the central charges of the $d=2$ SCFTs… ▽ More We construct supersymmetric $AdS_3$ solutions of $D=11$ supergravity, dual to $d=2$, $\mathcal{N}=(0,2)$ SCFTs, that are associated with M5-branes wrap** two different four-dimensional orbifolds. In one case the orbifold is a spindle fibred over another spindle, while in the other it is a spindle fibred over a Riemann surface with genus $g>1$. We show that the central charges of the $d=2$ SCFTs calculated from the gravity solutions agree with field theory computations using anomaly polynomials. The new $D=11$ solutions are obtained after constructing a new consistent Kaluza-Klein truncation of maximal $D=7$ gauged supergravity reduced on a spindle down to $D=5$ minimal gauged supergravity. △ Less

Submitted 5 August, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

Comments: 37 pages. Very minor changes. Published version

Report number: Imperial/TP/2022/JG/01

arXiv:2203.16194 [pdf, other]

FlowFormer: A Transformer Architecture for Optical Flow

Authors: Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li

Abstract: We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built from an image pair, encodes the cost tokens into a cost memory with alternate-group transformer (AGT) layers in a novel latent space, and decodes the cost memory via a recurrent transformer decoder with dynamic positio… ▽ More We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built from an image pair, encodes the cost tokens into a cost memory with alternate-group transformer (AGT) layers in a novel latent space, and decodes the cost memory via a recurrent transformer decoder with dynamic positional cost queries. On the Sintel benchmark, FlowFormer achieves 1.159 and 2.088 average end-point-error (AEPE) on the clean and final pass, a 16.5% and 15.5% error reduction from the best published result (1.388 and 2.47). Besides, FlowFormer also achieves strong generalization performance. Without being trained on Sintel, FlowFormer achieves 1.01 AEPE on the clean pass of Sintel training set, outperforming the best published result (1.29) by 21.7%. △ Less

Submitted 21 September, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

Comments: Accepted to ECCV 2022. Project Page: https://drinkingcoder.github.io/publication/flowformer/

arXiv:2203.15114 [pdf, ps, other]

doi 10.1007/JHEP06(2022)051

Type IIA embeddings of $D=5$ minimal gauged supergravity via Non-Abelian T-duality

Authors: K. C. Matthew Cheung, Rahim Leung

Abstract: In this note, we construct explicit Type IIA uplifts of $D=5$ minimal gauged supergravity, by T-dualising known Type IIB uplifts on $N_5 = S^5$, $T^{1,1}$ and $Y^{p,q}$ along their $SU(2)$ isometries. When the $D=5$ gauge field is set to zero, our uplifts recover precisely the known non-Abelian T-duals of the $AdS_5\times N_5$ solutions. As an application, we obtain new supersymmetric… ▽ More In this note, we construct explicit Type IIA uplifts of $D=5$ minimal gauged supergravity, by T-dualising known Type IIB uplifts on $N_5 = S^5$, $T^{1,1}$ and $Y^{p,q}$ along their $SU(2)$ isometries. When the $D=5$ gauge field is set to zero, our uplifts recover precisely the known non-Abelian T-duals of the $AdS_5\times N_5$ solutions. As an application, we obtain new supersymmetric $AdS_3\timesΣ\times M_5$ solutions in Type IIA, where $Σ= \mathbb{WCP}^1_{[n_-,n_+]}$ is a weighted projective space. Existing holographic results of T-dualised AdS solutions suggest that our solutions capture features of $d = 2$ SCFTs with $\mathcal{N}=(0, 2)$ supersymmetry. △ Less

Submitted 28 March, 2022; originally announced March 2022.

Comments: 41 pages, 1 figure

arXiv:2202.05388 [pdf, other]

Massively parallel pixel-by-pixel nanophotonic optimization using a Green's function formalism

Authors: Jiahui Wang, Alfred K. C. Cheung, Aleksandra Spyra, Ian A. D. Williamson, Jian Guan, Martin F. Schubert

Abstract: We introduce an efficient parallelization scheme to implement pixel-by-pixel nanophotonic optimization using a Green's function based formalism. The crucial insight in our proposal is the reframing of the optimization algorithm as a large-scale data processing pipeline, which allows for the efficient distribution of computational tasks across thousands of workers. We demonstrate the utility of our… ▽ More We introduce an efficient parallelization scheme to implement pixel-by-pixel nanophotonic optimization using a Green's function based formalism. The crucial insight in our proposal is the reframing of the optimization algorithm as a large-scale data processing pipeline, which allows for the efficient distribution of computational tasks across thousands of workers. We demonstrate the utility of our implementation by exercising it to optimize a high numerical aperture focusing metalens at problem sizes that would otherwise be far out of reach for the Green's function based method. Finally, we highlight the connection to powerful ideas from reinforcement learning as a natural corollary of reinterpreting the nanophotonic inverse design problem as a graph traversal enabled by the pixel-by-pixel optimization paradigm. △ Less

Submitted 10 February, 2022; originally announced February 2022.

Comments: 10 pages, 7 figures

arXiv:2201.12965 [pdf, other]

doi 10.1021/acsphotonics.2c00313

Inverse design of photonic devices with strict foundry fabrication constraints

Authors: Martin F. Schubert, Alfred K. C. Cheung, Ian A. D. Williamson, Aleksandra Spyra, David H. Alexander

Abstract: We introduce a new method for inverse design of nanophotonic devices which guarantees that resulting designs satisfy strict length scale constraints - including minimum width and spacing constraints required by commercial semiconductor foundries. The method adopts several concepts from machine learning to transform the problem of topology optimization with strict length scale constraints to an unc… ▽ More We introduce a new method for inverse design of nanophotonic devices which guarantees that resulting designs satisfy strict length scale constraints - including minimum width and spacing constraints required by commercial semiconductor foundries. The method adopts several concepts from machine learning to transform the problem of topology optimization with strict length scale constraints to an unconstrained stochastic gradient optimization problem. Specifically, we introduce a conditional generator for feasible designs and adopt a straight-through estimator for backpropagation of gradients to a latent design. We demonstrate the performance and reliability of our method by designing several common integrated photonic components. △ Less

Submitted 13 June, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

Comments: 16 pages, 17 figures

Journal ref: ACS Photonics, vol. 9, no. 7, pp. 2327-2336, Jun. 2022

arXiv:2109.03409 [pdf, other]

doi 10.1137/21M1444369

A kernel-based least-squares collocation method for surface diffusion

Authors: Meng Chen, Ka Chun Cheung, Leevan Ling

Abstract: There are plenty of applications and analysis for time-independent elliptic partial differential equations in the literature hinting at the benefits of overtesting by using more collocation conditions than the number of basis functions. Overtesting not only reduces the problem size, but is also known to be necessary for stability and convergence of widely used unsymmetric Kansa-type strong-form co… ▽ More There are plenty of applications and analysis for time-independent elliptic partial differential equations in the literature hinting at the benefits of overtesting by using more collocation conditions than the number of basis functions. Overtesting not only reduces the problem size, but is also known to be necessary for stability and convergence of widely used unsymmetric Kansa-type strong-form collocation methods. We consider kernel-based meshfree methods, which is a method of lines with collocation and overtesting spatially, for solving parabolic partial differential equations on surfaces without parametrization. In this paper, we extend the time-independent convergence theories for overtesting techniques to the parabolic equations on smooth and closed surfaces. △ Less

Submitted 4 May, 2023; v1 submitted 7 September, 2021; originally announced September 2021.

Comments: 4 figures, 21 pages

MSC Class: 65D15; 65N35; 65N40; 41A63

Journal ref: SIAM Journal on Numerical Analysis,2023,Vol.61(3): 1386-1404

arXiv:2106.11318 [pdf, ps, other]

doi 10.1007/JHEP09(2021)052

Wrapped NS5-Branes, Consistent Truncations and Inönü-Wigner Contractions

Authors: K. C. Matthew Cheung, Rahim Leung

Abstract: We construct consistent Kaluza-Klein truncations of type IIA supergravity on (i) $Σ_2\times S^3$ and (ii) $Σ_3\times S^3$, where $Σ_2 = S^2/Γ$, $\mathbb{R}^2/Γ$, or $\mathbb{H}^2/Γ$, and $Σ_3 = S^3/Γ$, $\mathbb{R}^3/Γ$, or $\mathbb{H}^3/Γ$, with $Γ$ a discrete group of symmetries, corresponding to NS5-branes wrapped on $Σ_2$ and $Σ_3$. The resulting theories are a $D=5$, $\mathcal{N}=4$ gauged sup… ▽ More We construct consistent Kaluza-Klein truncations of type IIA supergravity on (i) $Σ_2\times S^3$ and (ii) $Σ_3\times S^3$, where $Σ_2 = S^2/Γ$, $\mathbb{R}^2/Γ$, or $\mathbb{H}^2/Γ$, and $Σ_3 = S^3/Γ$, $\mathbb{R}^3/Γ$, or $\mathbb{H}^3/Γ$, with $Γ$ a discrete group of symmetries, corresponding to NS5-branes wrapped on $Σ_2$ and $Σ_3$. The resulting theories are a $D=5$, $\mathcal{N}=4$ gauged supergravity coupled to three vector multiplets with scalar manifold $SO(1,1)\times SO(5,3)/(SO(5)\times SO(3))$ and gauge group $SO(2)\times\left(SO(2)\ltimes_{Σ_2}\mathbb{R}^4\right)$ which depends on the curvature of $Σ_2$, and a $D=4$, $\mathcal{N}=2$ gauged supergravity coupled to one vector multiplet and two hypermultiplets with scalar manifold $SU(1,1)/U(1)\times G_{2(2)}/SO(4)$ and gauge group $\mathbb{R}^+\times\mathbb{R}^+$ for truncations (i) and (ii) respectively. Instead of carrying out the truncations at the 10-dimensional level, we show that they can be obtained directly by performing Inönü-Wigner contractions on the 5 and 4-dimensional gauged supergravity theories that come from consistent truncations of 11-dimensional supergravity associated with M5-branes wrap** $Σ_2$ and $Σ_3$. This suggests the existence of a broader class of lower-dimensional gauged supergravity theories related by group contractions that have a 10 or 11-dimensional origin. △ Less

Submitted 11 September, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

Comments: 2+73 pages, 1 figure; very minor changes, reference added, published version

arXiv:2106.10523 [pdf, other]

doi 10.1016/j.jcp.2022.111380

Learning Rays via Deep Neural Network in a Ray-based IPDG Method for High-Frequency Helmholtz Equations in Inhomogeneous Media

Authors: Tak Shing Au Yeung, Ka Chun Cheung, Eric T. Chung, Shubin Fu, Jianliang Qian

Abstract: We develop a deep learning approach to extract ray directions at discrete locations by analyzing highly oscillatory wave fields. A deep neural network is trained on a set of local plane-wave fields to predict ray directions at discrete locations. The resulting deep neural network is then applied to a reduced-frequency Helmholtz solution to extract the directions, which are further incorporated int… ▽ More We develop a deep learning approach to extract ray directions at discrete locations by analyzing highly oscillatory wave fields. A deep neural network is trained on a set of local plane-wave fields to predict ray directions at discrete locations. The resulting deep neural network is then applied to a reduced-frequency Helmholtz solution to extract the directions, which are further incorporated into a ray-based interior-penalty discontinuous Galerkin (IPDG) method to solve the Helmholtz equations at higher frequencies. In this way, we observe no apparent pollution effects in the resulting Helmholtz solutions in inhomogeneous media. Our 2D and 3D numerical results show that the proposed scheme is very efficient and yields highly accurate solutions. △ Less

Submitted 19 June, 2021; originally announced June 2021.

Comments: 30 pages

arXiv:2104.13298 [pdf, other]

Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification

Authors: Yixiao Ge, Xiao Zhang, Ching Lam Choi, Ka Chun Cheung, Peipei Zhao, Feng Zhu, Xiaogang Wang, Rui Zhao, Hongsheng Li

Abstract: The recent studies of knowledge distillation have discovered that ensembling the "dark knowledge" from multiple teachers or students contributes to creating better soft targets for training, but at the cost of significantly more computations and/or parameters. In this work, we present BAtch Knowledge Ensembling (BAKE) to produce refined soft targets for anchor images by propagating and ensembling… ▽ More The recent studies of knowledge distillation have discovered that ensembling the "dark knowledge" from multiple teachers or students contributes to creating better soft targets for training, but at the cost of significantly more computations and/or parameters. In this work, we present BAtch Knowledge Ensembling (BAKE) to produce refined soft targets for anchor images by propagating and ensembling the knowledge of the other samples in the same mini-batch. Specifically, for each sample of interest, the propagation of knowledge is weighted in accordance with the inter-sample affinities, which are estimated on-the-fly with the current network. The propagated knowledge can then be ensembled to form a better soft target for distillation. In this way, our BAKE framework achieves online knowledge ensembling across multiple samples with only a single network. It requires minimal computational and memory overhead compared to existing knowledge ensembling methods. Extensive experiments demonstrate that the lightweight yet effective BAKE consistently boosts the classification performance of various architectures on multiple datasets, e.g., a significant +0.7% gain of Swin-T on ImageNet with only +1.5% computational overhead and zero additional parameters. BAKE does not only improve the vanilla baselines, but also surpasses the single-network state-of-the-arts on all the benchmarks. △ Less

Submitted 20 November, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

Comments: Project Page: https://geyixiao.com/projects/bake

arXiv:2104.03097 [pdf, other]

LIFE: Lighting Invariant Flow Estimation

Authors: Zhaoyang Huang, Xiaokun Pan, Runsen Xu, Yan Xu, Ka chun Cheung, Guofeng Zhang, Hongsheng Li

Abstract: We tackle the problem of estimating flow between two images with large lighting variations. Recent learning-based flow estimation frameworks have shown remarkable performance on image pairs with small displacement and constant illuminations, but cannot work well on cases with large viewpoint change and lighting variations because of the lack of pixel-wise flow annotations for such cases. We observ… ▽ More We tackle the problem of estimating flow between two images with large lighting variations. Recent learning-based flow estimation frameworks have shown remarkable performance on image pairs with small displacement and constant illuminations, but cannot work well on cases with large viewpoint change and lighting variations because of the lack of pixel-wise flow annotations for such cases. We observe that via the Structure-from-Motion (SfM) techniques, one can easily estimate relative camera poses between image pairs with large viewpoint change and lighting variations. We propose a novel weakly supervised framework LIFE to train a neural network for estimating accurate lighting-invariant flows between image pairs. Sparse correspondences are conventionally established via feature matching with descriptors encoding local image contents. However, local image contents are inevitably ambiguous and error-prone during the cross-image feature matching process, which hinders downstream tasks. We propose to guide feature matching with the flows predicted by LIFE, which addresses the ambiguous matching by utilizing abundant context information in the image pairs. We show that LIFE outperforms previous flow learning frameworks by large margins in challenging scenarios, consistently improves feature matching, and benefits downstream tasks. △ Less

Submitted 19 April, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

Comments: Project page: https://drinkingcoder.github.io/publication/life/

arXiv:2101.07264 [pdf, other]

doi 10.1007/JHEP05(2021)222

A new family of $AdS_4$ S-folds in type IIB string theory

Authors: Igal Arav, K. C. Matthew Cheung, Jerome P. Gauntlett, Matthew M. Roberts, Christopher Rosen

Abstract: We construct infinite new classes of $AdS_4\times S^1\times S^5$ solutions of type IIB string theory which have non-trivial $SL(2,\mathbb{Z})$ monodromy along the $S^1$ direction. The solutions are supersymmetric and holographically dual, generically, to $\mathcal{N}=1$ SCFTs in $d=3$. The solutions are first constructed as $AdS_4\times \mathbb{R}$ solutions in $D=5$ $SO(6)$ gauged supergravity an… ▽ More We construct infinite new classes of $AdS_4\times S^1\times S^5$ solutions of type IIB string theory which have non-trivial $SL(2,\mathbb{Z})$ monodromy along the $S^1$ direction. The solutions are supersymmetric and holographically dual, generically, to $\mathcal{N}=1$ SCFTs in $d=3$. The solutions are first constructed as $AdS_4\times \mathbb{R}$ solutions in $D=5$ $SO(6)$ gauged supergravity and then uplifted to $D=10$. Unlike the known $AdS_4\times \mathbb{R}$ S-fold solutions, there is no continuous symmetry associated with the $\mathbb{R}$ direction. The solutions all arise as limiting cases of Janus solutions of $d=4$, $\mathcal{N}=4$ SYM theory which are supported both by a different value of the coupling constant on either side of the interface, as well as by fermion and boson mass deformations. As special cases, the construction recovers three known S-fold constructions, preserving $\mathcal{N}=1,2$ and 4 supersymmetry, as well as a recently constructed $\mathcal{N}=1$ $AdS_4\times S^1\times S^5$ solution (not S-folded). We also present some novel "one-sided Janus" solutions that are non-singular. △ Less

Submitted 23 May, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

Comments: 56 pages, 13 figures; very minor changes, published version

Report number: Imperial/TP/2021/JG/01; ICCUB-20-XXX

arXiv:2008.06432 [pdf, other]

doi 10.1007/JHEP02(2021)100

$DK$ $I=0,$ $D\bar{K}$ $I=0,1$ scattering and the $D_{s0}^\ast(2317)$ from lattice QCD

Authors: Gavin K. C. Cheung, Christopher E. Thomas, David J. Wilson, Graham Moir, Michael Peardon, Sinéad M. Ryan

Abstract: Elastic scattering amplitudes for $I=0$ $DK$ and $I=0,1$ $D\bar{K}$ are computed in $S$, $P$ and $D$ partial waves using lattice QCD with light-quark masses corresponding to $m_π= 239$ MeV and $m_π= 391$ MeV. The $S$-waves contain interesting features including a near-threshold $J^P=0^+$ bound state in $I=0$ $DK$, corresponding to the $D_{s0}^\ast(2317)$, with an effect that is clearly visible abo… ▽ More Elastic scattering amplitudes for $I=0$ $DK$ and $I=0,1$ $D\bar{K}$ are computed in $S$, $P$ and $D$ partial waves using lattice QCD with light-quark masses corresponding to $m_π= 239$ MeV and $m_π= 391$ MeV. The $S$-waves contain interesting features including a near-threshold $J^P=0^+$ bound state in $I=0$ $DK$, corresponding to the $D_{s0}^\ast(2317)$, with an effect that is clearly visible above threshold, and suggestions of a $0^+$ virtual bound state in $I=0$ $D\bar{K}$. The $S$-wave $I=1$ $D\bar{K}$ amplitude is found to be weakly repulsive. The computed finite-volume spectra also contain a deeply-bound $D^\ast$ vector resonance, but negligibly small $P$-wave $DK$ interactions are observed in the energy region considered; the $P$ and $D$-wave $D\bar{K}$ amplitudes are also small. There is some evidence of $1^+$ and $2^+$ resonances in $I=0$ $DK$ at higher energies. △ Less

Submitted 19 February, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

Comments: 53 pages, 22 figures, small changes to match published version

Journal ref: JHEP 02 (2021) 100

arXiv:2007.15095 [pdf, other]

doi 10.1007/JHEP11(2020)156

Spatially modulated and supersymmetric mass deformations of $\mathcal{N}=4$ SYM

Authors: Igal Arav, K. C. Matthew Cheung, Jerome P. Gauntlett, Matthew M. Roberts, Christopher Rosen

Abstract: We study mass deformations of $\mathcal{N}=4$, $d=4$ SYM theory that are spatially modulated in one spatial dimension and preserve some residual supersymmetry. We focus on generalisations of $\mathcal{N}=1^*$ theories and show that it is also possible, for suitably chosen supersymmetric masses, to preserve $d=3$ conformal symmetry associated with a co-dimension one interface. Holographic solutions… ▽ More We study mass deformations of $\mathcal{N}=4$, $d=4$ SYM theory that are spatially modulated in one spatial dimension and preserve some residual supersymmetry. We focus on generalisations of $\mathcal{N}=1^*$ theories and show that it is also possible, for suitably chosen supersymmetric masses, to preserve $d=3$ conformal symmetry associated with a co-dimension one interface. Holographic solutions can be constructed using $D=5$ theories of gravity that arise from consistent truncations of $SO(6)$ gauged supergravity and hence type IIB supergravity. For the mass deformations that preserve $d=3$ superconformal symmetry we construct a rich set of Janus solutions of $\mathcal{N}=4$ SYM theory which have the same coupling constant on either side of the interface. Limiting classes of these solutions give rise to RG interface solutions with $\mathcal{N}=4$ SYM on one side of the interface and the Leigh-Strassler (LS) SCFT on the other, and also to a Janus solution for the LS theory. Another limiting solution is a new supersymmetric $AdS_4\times S^1\times S^5$ solution of type IIB supergravity. △ Less

Submitted 21 November, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

Comments: 78 pages, 19 figures. Minor changes, references added

Report number: Imperial/TP/2020/JG/03; ICCUB-20-XXX

arXiv:2007.07891 [pdf, other]

doi 10.1007/JHEP11(2020)168

Superconformal RG interfaces in holography

Authors: Igal Arav, K. C. Matthew Cheung, Jerome P. Gauntlett, Matthew M. Roberts, Christopher Rosen

Abstract: We construct gravitational solutions that holographically describe two different $d=4$ SCFTs joined together at a co-dimension one, planar RG interface and preserving $d=3$ superconformal symmetry. The RG interface joins $\mathcal{N}=4$ SYM theory on one side with the $\mathcal{N}=1$ Leigh-Strassler SCFT on the other. We construct a family of such solutions, which in general are associated with sp… ▽ More We construct gravitational solutions that holographically describe two different $d=4$ SCFTs joined together at a co-dimension one, planar RG interface and preserving $d=3$ superconformal symmetry. The RG interface joins $\mathcal{N}=4$ SYM theory on one side with the $\mathcal{N}=1$ Leigh-Strassler SCFT on the other. We construct a family of such solutions, which in general are associated with spatially dependent mass deformations on the $\mathcal{N}=4$ SYM side, but there is a particular solution for which these deformations vanish. We also construct a Janus solution with the Leigh-Strassler SCFT on either side of the interface. Gravitational solutions associated with superconformal interfaces involving ABJM theory and two $d=3$ $\mathcal{N}=1$ SCFTs with $G_2$ symmetry are also discussed and shown to have similar properties, but they also exhibit some new features. △ Less

Submitted 25 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

Comments: 33 pages, 12 pages; references added, typos fixed. Published version

Report number: Imperial/TP/2020/JG/02; ICCUB-20-XXX

arXiv:1911.08772 [pdf, other]

Understanding Top-k Sparsification in Distributed Deep Learning

Authors: Shaohuai Shi, Xiaowen Chu, Ka Chun Cheung, Simon See

Abstract: Distributed stochastic gradient descent (SGD) algorithms are widely deployed in training large-scale deep learning models, while the communication overhead among workers becomes the new system bottleneck. Recently proposed gradient sparsification techniques, especially Top-$k$ sparsification with error compensation (TopK-SGD), can significantly reduce the communication traffic without an obvious i… ▽ More Distributed stochastic gradient descent (SGD) algorithms are widely deployed in training large-scale deep learning models, while the communication overhead among workers becomes the new system bottleneck. Recently proposed gradient sparsification techniques, especially Top-$k$ sparsification with error compensation (TopK-SGD), can significantly reduce the communication traffic without an obvious impact on the model accuracy. Some theoretical studies have been carried out to analyze the convergence property of TopK-SGD. However, existing studies do not dive into the details of Top-$k$ operator in gradient sparsification and use relaxed bounds (e.g., exact bound of Random-$k$) for analysis; hence the derived results cannot well describe the real convergence performance of TopK-SGD. To this end, we first study the gradient distributions of TopK-SGD during the training process through extensive experiments. We then theoretically derive a tighter bound for the Top-$k$ operator. Finally, we exploit the property of gradient distribution to propose an approximate top-$k$ selection algorithm, which is computing-efficient for GPUs, to improve the scaling efficiency of TopK-SGD by significantly reducing the computing overhead. Codes are available at: \url{https://github.com/hclhkbu/GaussianK-SGD}. △ Less

Submitted 20 November, 2019; originally announced November 2019.

Comments: 14 pages

arXiv:1910.09917 [pdf, other]

Folding Polyominoes with Holes into a Cube

Authors: Oswin Aichholzer, Hugo A. Akitaya, Kenneth C. Cheung, Erik D. Demaine, Martin L. Demaine, Sándor P. Fekete, Linda Kleist, Irina Kostitsyna, Maarten Löffler, Zuzana Masárová, Klara Mundilova, Christiane Schmidt

Abstract: When can a polyomino piece of paper be folded into a unit cube? Prior work studied tree-like polyominoes, but polyominoes with holes remain an intriguing open problem. We present sufficient conditions for a polyomino with one or several holes to fold into a cube, and conditions under which cube folding is impossible. In particular, we show that all but five special \emph{simple} holes guarantee fo… ▽ More When can a polyomino piece of paper be folded into a unit cube? Prior work studied tree-like polyominoes, but polyominoes with holes remain an intriguing open problem. We present sufficient conditions for a polyomino with one or several holes to fold into a cube, and conditions under which cube folding is impossible. In particular, we show that all but five special \emph{simple} holes guarantee foldability. △ Less

Submitted 2 July, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

Comments: 24 pages, 21 figures

ACM Class: F.2.2

arXiv:1906.08900 [pdf, other]

doi 10.1088/1361-6382/ab41b3

Consistent KK truncations for M5-branes wrapped on Riemann surfaces

Authors: K. C. Matthew Cheung, Jerome P. Gauntlett, Christopher Rosen

Abstract: We construct a consistent Kaluza-Klein reduction of $D=11$ supergravity on $Σ_2\times S^4$, where $Σ_2=S^2,\mathbb{R}^2$ or $H^2$, or a quotient thereof, at the level of the bosonic fields. The result is a gauged $N=4$, $D=5$ supergravity theory coupled to three vector multiplets, with the gauging lying in an $SO(2)\times SE(3)\subset SO(5,3)$ subgroup of the $SO(1,1)\times SO(5,3)$ global symmetr… ▽ More We construct a consistent Kaluza-Klein reduction of $D=11$ supergravity on $Σ_2\times S^4$, where $Σ_2=S^2,\mathbb{R}^2$ or $H^2$, or a quotient thereof, at the level of the bosonic fields. The result is a gauged $N=4$, $D=5$ supergravity theory coupled to three vector multiplets, with the gauging lying in an $SO(2)\times SE(3)\subset SO(5,3)$ subgroup of the $SO(1,1)\times SO(5,3)$ global symmetry group of the ungauged theory. For $Σ_2=H^2$, the $D=5$ theory has a maximally supersymmetric $AdS_5$ vacuum which uplifts to the known solution of $D=11$ supergravity corresponding to M5-branes wrap** a Riemann surface with genus greater than one and dual to an $N=2$ SCFT in $d=4$. For $Σ_2=S^2$, we find two $AdS_5$ solutions, one of which is new, and both of which are unstable. There is an additional subtruncation to an $N=2$ gauged supergravity coupled to two vector multiplets, with very special real manifold $SO(1,1)\times SO(1,1)$, and a single hypermultiplet, with quaternionic Kähler manifold $SU(2,1)/S[U(2)\times U(1)]$ and gauging associated with an $SO(2)\times\mathbb{R}\subset SU(2,1)$ subgroup. △ Less

Submitted 12 September, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

Comments: 40 pages. Very minor changes, reference added, published version

Report number: Imperial/TP/2019/JG/02

arXiv:1810.05491 [pdf]

Light Reconfigurable Geometric Phase Optical Element with Multi-stable States

Authors: Xiao-Qian Wang, Alwin Ming-Wai Tam, Wei-Qiang Yang, Engle Liao, Ka Chun Cheung, Wei Hu, Dong Shen, Vladimir Chigrinov, Hoi-Sing Kwok, Zhi-gang Zheng, Yanqing Lu, Quan Li

Abstract: We present the design methodology of a light reconfigurable geometric phase optical element with multi-stable diffraction efficiency states, enabled by a photoresponsive self-organized chiral liquid crystal. Experimental demonstration shows the device exhibits a broad diffraction efficiency tunable range that can be smoothly modulated under alternate stimulation of ultraviolet and green lights. Di… ▽ More We present the design methodology of a light reconfigurable geometric phase optical element with multi-stable diffraction efficiency states, enabled by a photoresponsive self-organized chiral liquid crystal. Experimental demonstration shows the device exhibits a broad diffraction efficiency tunable range that can be smoothly modulated under alternate stimulation of ultraviolet and green lights. Distinctive to previous designs, the regulation of diffraction efficiency fundamentally stems from the modulation of geometric phase together with dynamical phase retardation, and any intermediate diffractive state is memorized. Such multi-stability facilitates applications including energy-saving all-optical signal processing in classical and quantum level, and phase hologram for anti-counterfeit. △ Less

Submitted 17 October, 2018; v1 submitted 12 October, 2018; originally announced October 2018.

Comments: 6 pages, 2 figures

arXiv:1809.10714 [pdf, ps, other]

doi 10.1103/PhysRevB.99.024516

Hund's coupling stabilized superconductivity in the presence of spin-orbit interactions

Authors: Alfred K. C. Cheung, D. F. Agterberg

Abstract: The intraorbital repulsive Hubbard interaction cannot lead to attractive superconducting pairing states, except through the Kohn-Luttinger mechanism. This situation may change when we include additional local interactions such as the interorbital repulsion $U^\prime$ and Hund's interactions $J$. Adding these local interactions, we study the nature of the superconducting pairs in systems with tetra… ▽ More The intraorbital repulsive Hubbard interaction cannot lead to attractive superconducting pairing states, except through the Kohn-Luttinger mechanism. This situation may change when we include additional local interactions such as the interorbital repulsion $U^\prime$ and Hund's interactions $J$. Adding these local interactions, we study the nature of the superconducting pairs in systems with tetragonal crystal symmetry including the $d_{xz}$ and $d_{yz}$ orbitals, and in octahedral systems including all three of $d_{xz}$, $d_{yz}$, and $d_{xy}$ orbitals. In the tetragonal case, spin-orbit interactions can stabilize attractive pairing channels containing spin triplet, orbital singlet character. Depending on the form of spin-orbit coupling, pairing channels belonging to degenerate, non-trivial irreducible representations may be stabilized. In the octahedral case, the pairing interactions of superconducting channels are found to depend critically on the number of bands crossing the Fermi energy. △ Less

Submitted 27 September, 2018; originally announced September 2018.

Comments: 9 pages

Journal ref: Phys. Rev. B 99, 024516 (2019)

arXiv:1808.08029 [pdf, other]

doi 10.1103/PhysRevB.98.184507

Residual spin susceptibility in the spin-triplet, orbital-singlet model

Authors: Yue Yu, Alfred K. C. Cheung, S. Raghu, D. F. Agterberg

Abstract: Nuclear magnetic resonance (NMR) and Knight shift measurements are critical tools in the identification of spin-triplet superconductors. We discuss the effects of spin orbit coupling on the Knight shift and susceptibilities for a variety of spin triplet multi-orbital gap functions with orbital-singlet character and compare their responses to "traditional" single band spin-triplet ($p_x+ip_y$) supe… ▽ More Nuclear magnetic resonance (NMR) and Knight shift measurements are critical tools in the identification of spin-triplet superconductors. We discuss the effects of spin orbit coupling on the Knight shift and susceptibilities for a variety of spin triplet multi-orbital gap functions with orbital-singlet character and compare their responses to "traditional" single band spin-triplet ($p_x+ip_y$) superconductors. We observe a non-negligible residual spin-susceptibility at low temperature. △ Less

Submitted 24 August, 2018; originally announced August 2018.

Journal ref: Phys. Rev. B 98, 184507 (2018)

arXiv:1805.00047 [pdf, other]

doi 10.1103/PhysRevLett.121.167003

Superconducting tunneling spectroscopy of spin-orbit coupling and orbital depairing in Nb:SrTiO$_3$

Authors: Adrian G. Swartz, Alfred K. C. Cheung, Hyeok Yoon, Zhuoyu Chen, Yasuyuki Hikita, Srinivas Raghu, Harold Y. Hwang

Abstract: We have examined the intrinsic spin-orbit coupling (SOC) and orbital depairing in thin films of Nb-doped SrTiO$_3$ by superconducting tunneling spectroscopy. The orbital depairing is geometrically suppressed in the two-dimensional limit, enabling a quantitative evaluation of the Fermi level spin-orbit scattering using Maki's theory. The response of the superconducting gap under in-plane magnetic f… ▽ More We have examined the intrinsic spin-orbit coupling (SOC) and orbital depairing in thin films of Nb-doped SrTiO$_3$ by superconducting tunneling spectroscopy. The orbital depairing is geometrically suppressed in the two-dimensional limit, enabling a quantitative evaluation of the Fermi level spin-orbit scattering using Maki's theory. The response of the superconducting gap under in-plane magnetic fields demonstrates short spin-orbit scattering times $τ_{so} \leq 1.1$ ps. Analysis of the orbital depairing indicates that the heavy electron band contributes significantly to pairing. These results suggest that the intrinsic spin-orbit scattering time in SrTiO$_3$ is comparable to those associated with Rashba effects in SrTiO$_3$ interfacial conducting layers and can be considered significant in all forms of superconductivity in SrTiO$_3$. △ Less

Submitted 30 April, 2018; originally announced May 2018.

Journal ref: Phys. Rev. Lett. 121, 167003 (2018)

arXiv:1711.06142 [pdf, ps, other]

High Fidelity Quantum Gates beyond spectral selection

Authors: Kwok Chung Matthew Cheung, Florian Mintert

Abstract: Driving a certain transition without including undesired transitions is an ubiquitous problem in quantum control and the implementation of quantum information processing. This problem gets the more challenging the weaker the desired transition couples to the control field, and the denser the system's spectrum is. With the explicit example of a trapped ion we show how temporally shaped driving help… ▽ More Driving a certain transition without including undesired transitions is an ubiquitous problem in quantum control and the implementation of quantum information processing. This problem gets the more challenging the weaker the desired transition couples to the control field, and the denser the system's spectrum is. With the explicit example of a trapped ion we show how temporally shaped driving helps to increase the fidelity of a gate operation beyond the regular spectral selection of resonantly driven transitions. We chose the explicit example of side-band transitions, since those couple more weakly to a control field than carrier transitions. Driving a sideband transition without carrier excitation thus allows us to test the limits of frequently employed control tools, and we discuss their potential and limitations. △ Less

Submitted 16 July, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

Comments: 20 pages

arXiv:1709.01417 [pdf, other]

doi 10.1007/JHEP11(2017)033

Tetraquark operators in lattice QCD and exotic flavour states in the charm sector

Authors: Gavin K. C. Cheung, Christopher E. Thomas, Jozef J. Dudek, Robert G. Edwards

Abstract: We present a general class of operators resembling compact tetraquarks which have a range of colour-flavour-spin structures, transform irreducibly under the symmetries of the lattice and respect other relevant symmetries. These constructions are demonstrated in lattice QCD calculations with light quarks corresponding to $m_π=$ 391 MeV. Using the distillation framework, correlation functions involv… ▽ More We present a general class of operators resembling compact tetraquarks which have a range of colour-flavour-spin structures, transform irreducibly under the symmetries of the lattice and respect other relevant symmetries. These constructions are demonstrated in lattice QCD calculations with light quarks corresponding to $m_π=$ 391 MeV. Using the distillation framework, correlation functions involving large bases of meson-meson and tetraquark operators are computed in the isospin-1 hidden-charm and doubly-charmed sectors, and finite-volume spectra are extracted with the variational method. We find the spectra are insensitive to the addition of tetraquark operators to the bases of meson-meson operators. For the first time, through using diverse bases of meson-meson operators, the multiple energy levels associated with meson-meson levels which would be degenerate in the non-interacting limit are extracted reliably. The number of energy levels in each spectrum is found to be equal to the number of expected non-interacting meson-meson levels in the energy region considered and the majority of energies lie close to the non-interacting levels. Therefore, there is no strong indication for any bound state or narrow resonance in the channels we study. △ Less

Submitted 16 November, 2017; v1 submitted 5 September, 2017; originally announced September 2017.

Comments: 31 pages, 9 figures, minor changes to reflect published version

Report number: DAMTP-2017-33, JLAB-THY-17-2541

Journal ref: JHEP 11 (2017) 033

arXiv:1703.00044 [pdf, other]

Digital Cellular Solid Pressure Vessels: A Novel Approach for Human Habitation in Space

Authors: Daniel Cellucci, Benjamin Jenett, Kenneth C. Cheung

Abstract: It is widely assumed that human exploration beyond Earth's orbit will require vehicles capable of providing long-duration habitats that simulate an Earthlike environment: consistent artificial gravity, breathable atmosphere, and sufficient living space- while requiring the minimum possible launch mass. This paper examines how the qualities of digital cellular solids - high-performance, repairabili… ▽ More It is widely assumed that human exploration beyond Earth's orbit will require vehicles capable of providing long-duration habitats that simulate an Earthlike environment: consistent artificial gravity, breathable atmosphere, and sufficient living space- while requiring the minimum possible launch mass. This paper examines how the qualities of digital cellular solids - high-performance, repairability, reconfigurability, tunable mechanical response - allow the accomplishment of long-duration habitat objectives at a fraction of the mass required for traditional structural technologies. To illustrate the impact digital cellular solids could make as a replacement to conventional habitat subsystems, we compare recent proposed deep space habitat structural systems with a digital cellular solids pressure vessel design that consists of a carbon fiber reinforced polymer (CFRP) digital cellular solid cylindrical framework that is lined with an ultra-high molecular weight polyethylene (UHMWPE) skin. We use the analytical treatment of a linear specific modulus scaling cellular solid to find the minimum mass pressure vessel for a structure and find that, for equivalent habitable volume and appropriate safety factors, the use of digital cellular solids provides clear methods for producing structures that are not only repairable and reconfigurable, but also higher performance than their conventionally-manufactured counterparts. △ Less

Submitted 21 February, 2017; originally announced March 2017.

Comments: 7 pages, Presented at 2017 IEEE Aeroconference in Big Sky, MT

arXiv:1612.00485 [pdf, other]

doi 10.1016/j.nancom.2017.02.002

Simulating with AcCoRD: Actor-Based Communication via Reaction-Diffusion

Authors: Adam Noel, Karen C. Cheung, Robert Schober, Dimitrios Makrakis, Abdelhakim Hafid

Abstract: This paper introduces AcCoRD (Actor-based Communication via Reaction-Diffusion) version 1.0. AcCoRD is a sandbox reaction-diffusion solver designed for the study of molecular communication systems. It uses a hybrid of microscopic and mesoscopic simulation models that enables scalability via user control of local accuracy. AcCoRD is developed in C as an open source command line tool and includes ut… ▽ More This paper introduces AcCoRD (Actor-based Communication via Reaction-Diffusion) version 1.0. AcCoRD is a sandbox reaction-diffusion solver designed for the study of molecular communication systems. It uses a hybrid of microscopic and mesoscopic simulation models that enables scalability via user control of local accuracy. AcCoRD is developed in C as an open source command line tool and includes utilities to process simulation output in MATLAB. The latest code and links to user documentation can be found at https://github.com/adamjgnoel/AcCoRD/. This paper provides an overview of AcCoRD's design, including the motivation for develo** a specialized reaction-diffusion solver. The corresponding algorithms are presented in detail, including the computational complexity of the microscopic and mesoscopic models. Other novel derivations include the transition rates between adjacent mesoscopic subvolumes of different sizes. Simulation results demonstrate the use of AcCoRD as both an accurate reaction-diffusion solver and one that is catered to the analysis of molecular communication systems. A link is included to videos that demonstrate many of the simulated scenarios. Additional insights from the simulation results include the selection of suitable hybrid model parameters, the impact of reactive surfaces that are in the proximity of a hybrid interface, and the size of a bounded environment that is necessary to assume that it is unbounded. The development of AcCoRD is ongoing, so its future direction is also discussed in order to highlight improvements that will expand its potential areas of application. New features that are being planned at the time of writing include a fluid flow model and more complex actor behavior. △ Less

Submitted 2 February, 2017; v1 submitted 1 December, 2016; originally announced December 2016.

Comments: 42 pages, 21 figures, 2 tables. To appear in Nano Communication Networks

arXiv:1611.08910 [pdf, ps, other]

doi 10.1103/PhysRevB.95.235424

Weiss oscillations and particle-hole symmetry at the half-filled Landau level

Authors: Alfred K. C. Cheung, S. Raghu, Michael Mulligan

Abstract: Particle-hole symmetry in the lowest Landau level of the two-dimensional electron gas requires the electrical Hall conductivity to equal $\pm e^2/2h$ at half-filling. We study the consequences of weakly broken particle-hole symmetry for magnetoresistance oscillations about half-filling in the presence of an applied periodic one-dimensional electrostatic potential using the Dirac composite fermion… ▽ More Particle-hole symmetry in the lowest Landau level of the two-dimensional electron gas requires the electrical Hall conductivity to equal $\pm e^2/2h$ at half-filling. We study the consequences of weakly broken particle-hole symmetry for magnetoresistance oscillations about half-filling in the presence of an applied periodic one-dimensional electrostatic potential using the Dirac composite fermion theory proposed by Son. At fixed electron density, the oscillation minima are asymmetrically biased towards higher magnetic fields, while at fixed magnetic field, the oscillations occur symmetrically as the electron density is varied about half-filling. We find an approximate "sum rule" obeyed for all pairs of oscillation minima that can be tested in experiment. The locations of the magnetoresistance oscillation minima for the composite fermion theory of Halperin, Lee, and Read (HLR) and its particle-hole conjugate agree exactly. Within the current experimental resolution, the locations of the oscillation minima produced by the Dirac composite fermion coincide with those of HLR. These results may indicate that all three composite fermion theories describe the same long wavelength physics. △ Less

Submitted 22 May, 2017; v1 submitted 27 November, 2016; originally announced November 2016.

Comments: 22 pages, 3 figures; v2 corrected discussion of composite fermion theory comparison (thanks to C. Wang, N. Cooper, B. Halperin, and A. Stern for discussions)

Journal ref: Phys. Rev. B 95, 235424 (2017)

arXiv:1610.01073 [pdf, other]

doi 10.1007/JHEP12(2016)089

Excited and exotic charmonium, $D_s$ and $D$ meson spectra for two light quark masses from lattice QCD

Authors: Gavin K. C. Cheung, Cian O'Hara, Graham Moir, Michael Peardon, Sinéad M. Ryan, Christopher E. Thomas, David Tims

Abstract: We present highly-excited charmonium, $D_s$ and $D$ meson spectra from dynamical lattice QCD calculations with light quarks corresponding to $M_π \sim 240$ MeV and compare these to previous results with $M_π \sim 400$ MeV. Utilising the distillation framework, large bases of carefully constructed interpolating operators and a variational procedure, we extract and reliably identify the continuum sp… ▽ More We present highly-excited charmonium, $D_s$ and $D$ meson spectra from dynamical lattice QCD calculations with light quarks corresponding to $M_π \sim 240$ MeV and compare these to previous results with $M_π \sim 400$ MeV. Utilising the distillation framework, large bases of carefully constructed interpolating operators and a variational procedure, we extract and reliably identify the continuum spin of an extensive set of excited mesons. These include states with exotic quantum numbers which, along with a number with non-exotic quantum numbers, we identify as having excited gluonic degrees of freedom and interpret as hybrid mesons. Comparing the spectra at the two different $M_π$, we find only a mild light-quark mass dependence and no change in the overall pattern of states. △ Less

Submitted 5 January, 2017; v1 submitted 4 October, 2016; originally announced October 2016.

Comments: 21 pages, 8 figures, minor changes to reflect published version

Report number: DAMTP-2016-63

Journal ref: JHEP 12 (2016) 089

arXiv:1601.00681 [pdf, ps, other]

Modeling and Simulation of Molecular Communication Systems with a Reversible Adsorption Receiver

Authors: Yansha Deng, Adam Noel, Maged Elkashlan, Arumugam Nallanathan, Karen C. Cheung

Abstract: In this paper, we present an analytical model for the diffusive molecular communication (MC) system with a reversible adsorption receiver in a fluid environment. The widely used concentration shift keying (CSK) is considered for modulation. The time-varying spatial distribution of the information molecules under the reversible adsorption and desorption reaction at the surface of a receiver is anal… ▽ More In this paper, we present an analytical model for the diffusive molecular communication (MC) system with a reversible adsorption receiver in a fluid environment. The widely used concentration shift keying (CSK) is considered for modulation. The time-varying spatial distribution of the information molecules under the reversible adsorption and desorption reaction at the surface of a receiver is analytically characterized. Based on the spatial distribution, we derive the net number of newly-adsorbed information molecules expected in any time duration. We further derive the number of newly-adsorbed molecules expected at the steady state to demonstrate the equilibrium concentration. Given the number of newly-adsorbed information molecules, the bit error probability of the proposed MC system is analytically approximated. Importantly, we present a simulation framework for the proposed model that accounts for the diffusion and reversible reaction. Simulation results show the accuracy of our derived expressions, and demonstrate the positive effect of the adsorption rate and the negative effect of the desorption rate on the error probability of reversible adsorption receiver with last transmit bit-1. Moreover, our analytical results simplify to the special cases of a full adsorption receiver and a partial adsorption receiver, both of which do not include desorption. △ Less

Submitted 22 June, 2016; v1 submitted 4 January, 2016; originally announced January 2016.

Comments: 14 pages, 8 figures, 1 algorithm, submitted

arXiv:1512.08286 [pdf, ps, other]

doi 10.1103/PhysRevB.93.134516

Topological properties of ferromagnetic superconductors

Authors: Alfred K. C. Cheung, S. Raghu

Abstract: A variety of heavy fermion superconductors, such as UCoGe, UGe$_2$, and URhGe exhibit a striking coexistence of bulk ferromagnetism and superconductivity. In these systems, the magnetic moment decreases with pressure, and vanishes at a ferromagnetic quantum critical point (qcp). Remarkably, the superconductivity in UCoGe varies smoothly with pressure across the qcp and exists in both the ferromagn… ▽ More A variety of heavy fermion superconductors, such as UCoGe, UGe$_2$, and URhGe exhibit a striking coexistence of bulk ferromagnetism and superconductivity. In these systems, the magnetic moment decreases with pressure, and vanishes at a ferromagnetic quantum critical point (qcp). Remarkably, the superconductivity in UCoGe varies smoothly with pressure across the qcp and exists in both the ferromagnetic and paramagnetic regimes. We argue that in UCoGe, spin-orbit interactions stabilize a time-reversal invariant odd-parity superconductor in the high pressure paramagnetic regime. Based on a simple phenomenological model, we predict that the transition from the paramagnetic normal state to the phase where superconductivity and ferromagnetism coexist, is a first-order transition. △ Less

Submitted 8 January, 2016; v1 submitted 27 December, 2015; originally announced December 2015.

Comments: 6 pages, 3 figures, References added

Journal ref: Phys. Rev. B 93, 134516 (2016)

arXiv:1512.07512 [pdf, other]

Evaluation of Cellular Solids Derived from Triply Periodic Minimal Surfaces

Authors: Daniel Cellucci, Kenneth C. Cheung

Abstract: Cellular solids are a class of materials that have many interesting engineering applications, including ultralight structural materials. The traditional method for analyzing these solids uses convex uniform polyhedral honeycombs to represent the geometry of the material, and this approach has carried over into the design of digital cellular solids. However, the use of such honeycomb-derived lattic… ▽ More Cellular solids are a class of materials that have many interesting engineering applications, including ultralight structural materials. The traditional method for analyzing these solids uses convex uniform polyhedral honeycombs to represent the geometry of the material, and this approach has carried over into the design of digital cellular solids. However, the use of such honeycomb-derived lattices makes the problem of decomposing a three-dimensional lattice into a library of two-dimensional parts non-trivial. We introduce a method for generating periodic frameworks from triply periodic minimal surfaces, which result in geometries that are easier to decompose into digital parts. Additionally, we perform finite element modelling of two cellular solids generated from two TPMS, the P- and D-Schwarz, and two cellular solids, the Kelvin and Octet honeycombs. We show that the simulated behavior of these TMPS-derived structures shows the expected modulus of the cellular solid scaling linearly with relative density, which matches the behavior of the highest-coordination honeycomb structure, the octet truss. △ Less

Submitted 6 March, 2016; v1 submitted 23 December, 2015; originally announced December 2015.

Comments: Proceedings of the ASME 2016 Manufacturing Science and Engineering Conference, 5 pages, 4 figures

arXiv:1511.09413 [pdf, ps, other]

Molecular Communication with a Reversible Adsorption Receiver

Authors: Yansha Deng, Adam Noel, Maged Elkashlan, Arumugam Nallanathan, Karen C. Cheung

Abstract: In this paper, we present an analytical model for a diffusive molecular communication (MC) system with a reversible adsorption receiver in a fluid environment. The time-varying spatial distribution of the information molecules under the reversible adsorption and desorption reaction at the surface of a bio-receiver is analytically characterized. Based on the spatial distribution, we derive the numb… ▽ More In this paper, we present an analytical model for a diffusive molecular communication (MC) system with a reversible adsorption receiver in a fluid environment. The time-varying spatial distribution of the information molecules under the reversible adsorption and desorption reaction at the surface of a bio-receiver is analytically characterized. Based on the spatial distribution, we derive the number of newly-adsorbed information molecules expected in any time duration. Importantly, we present a simulation framework for the proposed model that accounts for the diffusion and reversible reaction. Simulation results show the accuracy of our derived expressions, and demonstrate the positive effect of the adsorption rate and the negative effect of the desorption rate on the net number of newly-adsorbed information molecules expected. Moreover, our analytical results simplify to the special case of an absorbing receiver. △ Less

Submitted 7 April, 2016; v1 submitted 30 November, 2015; originally announced November 2015.

Comments: Submitted to ICC 2016

Showing 1–50 of 62 results for author: Cheung, K c