Skip to main content

Showing 201–250 of 531 results for author: Fu, Z

.
  1. arXiv:2111.01674  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Minimizing Energy Consumption Leads to the Emergence of Gaits in Legged Robots

    Authors: Zipeng Fu, Ashish Kumar, Jitendra Malik, Deepak Pathak

    Abstract: Legged locomotion is commonly studied and expressed as a discrete set of gait patterns, like walk, trot, gallop, which are usually treated as given and pre-programmed in legged robots for efficient locomotion at different speeds. However, fixing a set of pre-programmed gaits limits the generality of locomotion. Recent animal motor studies show that these conventional gaits are only prevalent in id… ▽ More

    Submitted 25 October, 2021; originally announced November 2021.

    Comments: CoRL 2021. Website at https://energy-locomotion.github.io

  2. arXiv:2111.01276  [pdf, other

    cs.LG

    Multi network InfoMax: A pre-training method involving graph convolutional networks

    Authors: Usman Mahmood, Zening Fu, Vince Calhoun, Sergey Plis

    Abstract: Discovering distinct features and their relations from data can help us uncover valuable knowledge crucial for various tasks, e.g., classification. In neuroimaging, these features could help to understand, classify, and possibly prevent brain disorders. Model introspection of highly performant overparameterized deep learning (DL) models could help find these features and relations. However, to ach… ▽ More

    Submitted 14 February, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  3. arXiv:2111.01271  [pdf, other

    cs.LG

    Brain dynamics via Cumulative Auto-Regressive Self-Attention

    Authors: Usman Mahmood, Zening Fu, Vince Calhoun, Sergey Plis

    Abstract: Multivariate dynamical processes can often be intuitively described by a weighted connectivity graph between components representing each individual time-series. Even a simple representation of this graph as a Pearson correlation matrix may be informative and predictive as demonstrated in the brain imaging literature. However, there is a consensus expectation that powerful graph neural networks (G… ▽ More

    Submitted 14 February, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract. Typos fixed

  4. arXiv:2110.12468  [pdf, other

    cs.LG cs.AI

    False Correlation Reduction for Offline Reinforcement Learning

    Authors: Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Tianyi Zhou, Zhaoran Wang, **g Jiang

    Abstract: Offline reinforcement learning (RL) harnesses the power of massive datasets for resolving sequential decision problems. Most existing papers only discuss defending against out-of-distribution (OOD) actions while we investigate a broader issue, the false correlations between epistemic uncertainty and decision-making, an essential factor that causes suboptimality. In this paper, we propose falSe COr… ▽ More

    Submitted 1 November, 2023; v1 submitted 24 October, 2021; originally announced October 2021.

    Comments: 16 pages, 14 figures

  5. arXiv:2110.11869  [pdf, other

    cs.CL cs.LG

    FLiText: A Faster and Lighter Semi-Supervised Text Classification with Convolution Networks

    Authors: Chen Liu, Mengchao Zhang, Zhibin Fu, Pan Hou, Yu Li

    Abstract: In natural language processing (NLP), state-of-the-art (SOTA) semi-supervised learning (SSL) frameworks have shown great performance on deep pre-trained language models such as BERT, and are expected to significantly reduce the demand for manual labeling. However, our empirical studies indicate that these frameworks are not suitable for lightweight models such as TextCNN, LSTM and etc. In this wor… ▽ More

    Submitted 12 September, 2021; originally announced October 2021.

  6. arXiv:2110.11583  [pdf, other

    cs.CV cs.AI cs.NE

    EvoGAN: An Evolutionary Computation Assisted GAN

    Authors: Feng Liu, HanYang Wang, Jiahao Zhang, Ziwang Fu, Aimin Zhou, Jiayin Qi, Zhibin Li

    Abstract: The image synthesis technique is relatively well established which can generate facial images that are indistinguishable even by human beings. However, all of these approaches uses gradients to condition the output, resulting in the outputting the same image with the same input. Also, they can only generate images with basic expression or mimic an expression instead of generating compound expressi… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: 20 pages, 9 figures, 1 table

  7. arXiv:2110.10823  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Electronic confinement in quantum dots of twisted bilayer graphene

    Authors: Xiao-Feng Zhou, Yi-Wen Liu, Hong-Yi Yan, Zhong-Qiu Fu, Haiwen Liu, Lin He

    Abstract: Electronic properties of quantum dots (QDs) depend sensitively on their parent materials. Therefore, confined electronic states in graphene QDs (GQDs) of monolayer and Bernal-stacked bilayer graphene are quite different. Twisted bilayer graphene (TBG) is distinct from monolayer and Bernal-stacked bilayer graphene because of the new degree of freedom: twist angle. In the past few years, numerous ef… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

  8. arXiv:2110.09777  [pdf, other

    cs.CV cs.AI

    Towards Toxic and Narcotic Medication Detection with Rotated Object Detector

    Authors: Jiao Peng, Feifan Wang, Zhongqiang Fu, Yiying Hu, Zichen Chen, Xinghan Zhou, Lijun Wang

    Abstract: Recent years have witnessed the advancement of deep learning vision technologies and applications in the medical industry. Intelligent devices for special medication management are in great need of, which requires more precise detection algorithms to identify the specifications and locations. In this work, YOLO (You only look once) based object detectors are tailored for toxic and narcotic medicat… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

  9. arXiv:2110.07829  [pdf, other

    cs.LG

    FedSEAL: Semi-Supervised Federated Learning with Self-Ensemble Learning and Negative Learning

    Authors: Jieming Bian, Zhu Fu, Jie Xu

    Abstract: Federated learning (FL), a popular decentralized and privacy-preserving machine learning (FL) framework, has received extensive research attention in recent years. The majority of existing works focus on supervised learning (SL) problems where it is assumed that clients carry labeled datasets while the server has no data. However, in realistic scenarios, clients are often unable to label their dat… ▽ More

    Submitted 8 June, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: 15 pages, 7 figures

  10. arXiv:2110.07110  [pdf, other

    cs.CV

    Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast

    Authors: Ye Du, Zehua Fu, Qingjie Liu, Yunhong Wang

    Abstract: Though image-level weakly supervised semantic segmentation (WSSS) has achieved great progress with Class Activation Maps (CAMs) as the cornerstone, the large supervision gap between classification and segmentation still hampers the model to generate more complete and precise pseudo masks for segmentation. In this study, we propose weakly-supervised pixel-to-prototype contrast that can provide pixe… ▽ More

    Submitted 13 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: 10 pages, 5 figures. Accepted by CVPR'22

  11. arXiv:2110.00121  [pdf, other

    cs.MA cs.AI cs.CY

    Emergence of Theory of Mind Collaboration in Multiagent Systems

    Authors: Luyao Yuan, Zipeng Fu, Linqi Zhou, Kexin Yang, Song-Chun Zhu

    Abstract: Currently, in the study of multiagent systems, the intentions of agents are usually ignored. Nonetheless, as pointed out by Theory of Mind (ToM), people regularly reason about other's mental states, including beliefs, goals, and intentions, to obtain performance advantage in competition, cooperation or coalition. However, due to its intrinsic recursion and intractable modeling of distribution over… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

    Journal ref: Emergent Communication Workshop, 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  12. arXiv:2109.15262  [pdf

    physics.optics eess.SY physics.app-ph physics.class-ph quant-ph

    Non-Hermitian physics and engineering in silicon photonics

    Authors: Changqing Wang, Zhoutian Fu, Lan Yang

    Abstract: Silicon photonics has been studied as an integratable optical platform where numerous applicable devices and systems are created based on modern physics and state-of-the-art nanotechnologies. The implementation of quantum mechanics has been the driving force of the most intriguing design of photonic structures, since the optical systems are found of great capability and potential in realizing the… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

    Comments: 30 pages, 12 figures, 225 references. Link to the published version: https://link.springer.com/chapter/10.1007%2F978-3-030-68222-4_7

    Journal ref: Wang C., Fu Z., Yang L. (2021) Non-Hermitian Physics and Engineering in Silicon Photonics. In: Lockwood D.J., Pavesi L. (eds) Silicon Photonics IV. Topics in Applied Physics, vol 139. Springer, Cham

  13. arXiv:2109.14671  [pdf, other

    cs.CV cs.LG eess.IV

    Segmentation of Roads in Satellite Images using specially modified U-Net CNNs

    Authors: Jonas Bokstaller, Yihang She, Zhehan Fu, Tommaso Macrì

    Abstract: The image classification problem has been deeply investigated by the research community, with computer vision algorithms and with the help of Neural Networks. The aim of this paper is to build an image classifier for satellite images of urban scenes that identifies the portions of the images in which a road is located, separating these portions from the rest. Unlike conventional computer vision al… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: 4 pages, 4 figures

  14. arXiv:2109.11778  [pdf, other

    cs.CV cs.CL

    Dense Contrastive Visual-Linguistic Pretraining

    Authors: Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su

    Abstract: Inspired by the success of BERT, several multimodal representation learning approaches have been proposed that jointly represent image and text. These approaches achieve superior performance by capturing high-level semantic information from large-scale multimodal pretraining. In particular, LXMERT and UNITER adopt visual region feature regression and label classification as pretext tasks. However,… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Accepted by ACM Multimedia 2021. arXiv admin note: text overlap with arXiv:2007.13135

  15. arXiv:2109.05485  [pdf, other

    cs.CV cs.LG eess.IV

    Facial Anatomical Landmark Detection using Regularized Transfer Learning with Application to Fetal Alcohol Syndrome Recognition

    Authors: Zeyu Fu, Jianbo Jiao, Michael Suttie, J. Alison Noble

    Abstract: Fetal alcohol syndrome (FAS) caused by prenatal alcohol exposure can result in a series of cranio-facial anomalies, and behavioral and neurocognitive problems. Current diagnosis of FAS is typically done by identifying a set of facial characteristics, which are often obtained by manual examination. Anatomical landmark detection, which provides rich geometric information, is important to detect the… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

    Comments: To appear in IEEE journal of Biomedical and Health Informatics 2021

  16. arXiv:2109.05166  [pdf

    physics.app-ph

    Ultrabroadband THz/IR upconversion and photovoltaic response in semi-conductor ratchet based upconverter

    Authors: Peng Bai, Ning Yang1, Weidong Chu, Yueheng Zhang, Wenzhong Shen, Zhanglong Fu, Dixiang Shao, Kang Zhou, Zhiyong Tan, Hua Li, Juncheng Cao, Lianhe Li, Edmund Harold Linfield, Yan Xie, Ziran Zhao

    Abstract: An ultrabroadband upconversion device is demonstrated by direct tandem integration of a p-type GaAs/AlxGa1-xAs ratchet photodetector (RP) with a GaAs double heterojunction LED (DH-LED) using the molecular beam epitaxy (MBE). An ultrabroadband photoresponse from terahertz (THz) to near infrared (NIR) region (4-200 THz) was realized that covers a much wider frequency range com-pared with the existin… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

  17. arXiv:2109.03395  [pdf, other

    cs.NI

    From Cloud to Edge: A First Look at Public Edge Platforms

    Authors: Mengwei Xu, Zhe Fu, Xiao Ma, Li Zhang, Yanan Li, Feng Qian, Shangguang Wang, Ke Li, **gyu Yang, Xuanzhe Liu

    Abstract: Public edge platforms have drawn increasing attention from both academia and industry. In this study, we perform a first-of-its-kind measurement study on a leading public edge platform that has been densely deployed in China. Based on this measurement, we quantitatively answer two critical yet unexplored questions. First, from end users' perspective, what is the performance of commodity edge platf… ▽ More

    Submitted 8 November, 2021; v1 submitted 7 September, 2021; originally announced September 2021.

  18. High fidelity entanglement of neutral atoms via a Rydberg-mediated single-modulated-pulse controlled-PHASE gate

    Authors: Zhuo Fu, Peng Xu, Yuan Sun, Yangyang Liu, Xiaodong He, Xiao Li, Min Liu, Runbing Li, ** Wang, Liang Liu, Mingsheng Zhan

    Abstract: Neutral atom platform has become an attractive choice to study the science of quantum information and quantum simulation, where intense efforts have been devoted to the entangling processes between individual atoms. For the development of this area, two-qubit controlled-PHASE gate via Rydberg blockade is one of the most essential elements. Recent theoretical studies have suggested the advantages o… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 4 figures

  19. arXiv:2109.02437  [pdf, other

    physics.ins-det physics.optics

    Force Detection Sensitivity Spectrum Calibration of Levitated Nanomechanical Sensor Using Harmonic Coulomb Force

    Authors: Zhenhai Fu, Shaochong Zhu, Ying Dong, Xingfan Chen, Huizhu Hu, Xiaowen Gao

    Abstract: Oscillators based on levitated particles are promising for the development of ultrasensitive force detectors. The theoretical performance of levitated nanomechanical sensors is usually characterized by the so-called thermal noise limit force detection sensitivity, which does not exhibit spectral specificity in practical measurements. To characterize the actual detection performance, we propose a m… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.

  20. arXiv:2108.13300  [pdf, other

    cs.IR cs.CL

    Deep Natural Language Processing for LinkedIn Search

    Authors: Weiwei Guo, Xiaowei Liu, Sida Wang, Michaeel Kazi, Zhiwei Wang, Zhoutong Fu, Jun Jia, Liang Zhang, Huiji Gao, Bo Long

    Abstract: Many search systems work with large amounts of natural language data, e.g., search queries, user profiles, and documents. Building a successful search system requires a thorough understanding of textual data semantics, where deep learning based natural language processing techniques (deep NLP) can be of great help. In this paper, we introduce a comprehensive study for applying deep NLP techniques… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: 18 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:2108.08252

  21. arXiv:2108.10737  [pdf

    physics.optics

    Spatially homogeneous few-cycle compression of Yb lasers via all-solid-state free-space soliton management

    Authors: Bingbing Zhu, Zongyuan Fu, Yudong Chen, Sainan Peng, Cheng **, Guangyu Fan, Sheng Zhang, Shunjia Wang, Hao Ru, Chuanshan Tian, Yihua Wang, Henry Kapteyn, Margaret Murnane, Zhensheng Tao

    Abstract: The high power and variable repetition rate of Yb femtosecond lasers make them very attractive for ultrafast science. However, for capturing sub-200 fs dynamics, efficient, high-fidelity, and high-stability pulse compression techniques are essential. Spectral broadening using an all-solid-state free-space geometry is particularly attractive, as it is simple, robust, and low-cost. However, spatial… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Journal ref: Optics Express (2022)

  22. arXiv:2108.08765  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation

    Authors: Zhihan Liu, Yufeng Zhang, Zuyue Fu, Zhuoran Yang, Zhaoran Wang

    Abstract: In generative adversarial imitation learning (GAIL), the agent aims to learn a policy from an expert demonstration so that its performance cannot be discriminated from the expert policy on a certain predefined reward set. In this paper, we study GAIL in both online and offline settings with linear function approximation, where both the transition and reward function are linear in the feature maps.… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: 54 pages, in submission

  23. arXiv:2108.08527  [pdf, other

    astro-ph.HE hep-ex

    Limits on astrophysical antineutrinos with the KamLAND experiment

    Authors: S. Abe, S. Asami, A. Gando, Y. Gando, T. Gima, A. Goto, T. Hachiya, K. Hata, S. Hayashida, K. Hosokawa, K. Ichimura, S. Ieki, H. Ikeda, K. Inoue, K. Ishidoshiro, Y. Kamei, N. Kawada, T. Kinoshita, Y. Kishimoto, M. Koga, N. Maemura, T. Mitsui, H. Miyake, K. Nakamura, K. Nakamura , et al. (45 additional authors not shown)

    Abstract: We report on a search for electron antineutrinos ($\barν_e$) from astrophysical sources in the neutrino energy range 8.3 to 30.8 MeV with the KamLAND detector. In an exposure of 6.72 kton-year of the liquid scintillator, we observe 18 candidate events via the inverse beta decay reaction. Although there is a large background uncertainty from neutral current atmospheric neutrino interactions, we fin… ▽ More

    Submitted 22 October, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: 21 pages, 9 figures, 4 tables, accepted for publication in Astrophysical Journal

    Journal ref: The Astrophysical Journal, Volume 925, Number 1, Page 14 (2022)

  24. arXiv:2108.08252  [pdf, other

    cs.CL cs.AI

    Deep Natural Language Processing for LinkedIn Search Systems

    Authors: Weiwei Guo, Xiaowei Liu, Sida Wang, Michaeel Kazi, Zhoutong Fu, Huiji Gao, Jun Jia, Liang Zhang, Bo Long

    Abstract: Many search systems work with large amounts of natural language data, e.g., search queries, user profiles and documents, where deep learning based natural language processing techniques (deep NLP) can be of great help. In this paper, we introduce a comprehensive study of applying deep NLP techniques to five representative tasks in search engines. Through the model design and experiments of the fiv… ▽ More

    Submitted 30 July, 2021; originally announced August 2021.

  25. arXiv:2108.06652  [pdf, other

    cs.RO eess.SY

    Force-feedback based Whole-body Stabilizer for Position-Controlled Humanoid Robots

    Authors: Shunpeng Yang, Hua Chen, Zhen Fu, Wei Zhang

    Abstract: This paper studies stabilizer design for position-controlled humanoid robots. Stabilizers are an essential part for position-controlled humanoids, whose primary objective is to adjust the control input sent to the robot to assist the tracking controller to better follow the planned reference trajectory. To achieve this goal, this paper develops a novel force-feedback based whole-body stabilizer th… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: IROS 2021, 8 pages

  26. arXiv:2108.06033  [pdf

    physics.app-ph

    Realization of ultrabroadband THz/IR photoresponse in a bias-tunable ratchet photodetector

    Authors: Peng Bai, Xiaohong Li, Ning Yang, Weidong Chu, Xueqi Bai, Siheng Huang, Yueheng Zhang, Wenzhong Shen, Zhanglong Fu, Dixiang Shao, Zhiyong Tan, Hua Li, Juncheng Cao, Lianhe Li, Edmund Harold Linfield, Yan Xie, Ziran Zhao

    Abstract: High performance Terahertz (THz) photodetector has drawn wide attention and got great improvement due to its significant application in biomedical, astrophysics, nondestructive inspection, 6th generation communication system as well as national security application. Here we demonstrate a novel broadband photon-type THz/infrared (IR) photodetector based on the GaAs/AlxGa1-xAs ratchet structure. Thi… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

  27. The pickup and delivery problem with synchronized en-route transfers for microtransit planning

    Authors: Zhexi Fu, Joseph Y. J. Chow

    Abstract: Microtransit and other flexible transit fleet services can reduce costs by incorporating transfers. However, transfers are costly to users if they must get off a vehicle and wait at a stop for another pickup. A mixed integer linear programming model (MILP) is proposed to solve pickup and delivery problems with vehicle-synchronized en-route transfers (PDPSET). The transfer location is determined by… ▽ More

    Submitted 19 January, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

    Journal ref: Transportation Research Part E 157, 102562 (2022)

  28. arXiv:2107.04034  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    RMA: Rapid Motor Adaptation for Legged Robots

    Authors: Ashish Kumar, Zipeng Fu, Deepak Pathak, Jitendra Malik

    Abstract: Successful real-world deployment of legged robots would require them to adapt in real-time to unseen scenarios like changing terrains, changing payloads, wear and tear. This paper presents Rapid Motor Adaptation (RMA) algorithm to solve this problem of real-time online adaptation in quadruped robots. RMA consists of two components: a base policy and an adaptation module. The combination of these c… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: RSS 2021. Webpage at https://ashish-kmr.github.io/rma-legged-robots/

  29. TableSense: Spreadsheet Table Detection with Convolutional Neural Networks

    Authors: Haoyu Dong, Shijie Liu, Shi Han, Zhouyu Fu, Dongmei Zhang

    Abstract: Spreadsheet table detection is the task of detecting all tables on a given sheet and locating their respective ranges. Automatic table detection is a key enabling technique and an initial step in spreadsheet data intelligence. However, the detection task is challenged by the diversity of table structures and table layouts on the spreadsheet. Considering the analogy between a cell matrix as spreads… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

  30. arXiv:2106.08148  [pdf, other

    cs.CV

    Weakly-Supervised Photo-realistic Texture Generation for 3D Face Reconstruction

    Authors: Xiangnan Yin, Di Huang, Zehua Fu, Yunhong Wang, Liming Chen

    Abstract: Although much progress has been made recently in 3D face reconstruction, most previous work has been devoted to predicting accurate and fine-grained 3D shapes. In contrast, relatively little work has focused on generating high-fidelity face textures. Compared with the prosperity of photo-realistic 2D face image generation, high-fidelity 3D face texture generation has yet to be studied. In this pap… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

  31. Pixel Sampling for Style Preserving Face Pose Editing

    Authors: Xiangnan Yin, Di Huang, Hongyu Yang, Zehua Fu, Yunhong Wang, Liming Chen

    Abstract: The existing auto-encoder based face pose editing methods primarily focus on modeling the identity preserving ability during pose synthesis, but are less able to preserve the image style properly, which refers to the color, brightness, saturation, etc. In this paper, we take advantage of the well-known frontal/profile optical illusion and present a novel two-stage approach to solve the aforementio… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Journal ref: IJCB,2020,pp. 1-10

  32. arXiv:2105.08629  [pdf, other

    eess.IV cs.CV cs.LG

    Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Kim Byeoung-su, Radu Timofte, Angeline Pouget, Fenglong Song, Cheng Li, Shuai Xiao, Zhongqian Fu, Matteo Maggioni, Yibin Huang, Shen Cheng, Xin Lu, Yifeng Zhou, Liangyu Chen, Donghao Liu, Xiangyu Zhang, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Bin Huang , et al. (7 additional authors not shown)

    Abstract: Image denoising is one of the most critical problems in mobile photo processing. While many solutions have been proposed for this task, they are usually working with synthetic data and are too computationally expensive to run on mobile devices. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based image denoising solut… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.07809, arXiv:2105.07825

  33. arXiv:2105.05320  [pdf, ps, other

    cs.SI cs.AI cs.CV cs.LG

    Seeing All From a Few: Nodes Selection Using Graph Pooling for Graph Clustering

    Authors: Yiming Wang, Dongxia Chang, Zhiqian Fu, Yao Zhao

    Abstract: Recently, there has been considerable research interest in graph clustering aimed at data partition using the graph information. However, one limitation of the most of graph-based methods is that they assume the graph structure to operate is fixed and reliable. And there are inevitably some edges in the graph that are not conducive to graph clustering, which we call spurious edges. This paper is t… ▽ More

    Submitted 7 June, 2021; v1 submitted 30 April, 2021; originally announced May 2021.

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2022

  34. Consistent Multiple Graph Embedding for Multi-View Clustering

    Authors: Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao

    Abstract: Graph-based multi-view clustering aiming to obtain a partition of data across multiple views, has received considerable attention in recent years. Although great efforts have been made for graph-based multi-view clustering, it remains a challenge to fuse characteristics from various views to learn a common representation for clustering. In this paper, we propose a novel Consistent Multiple Graph E… ▽ More

    Submitted 20 December, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Journal ref: IEEE Transactions on Multimedia, 2021

  35. arXiv:2105.04281  [pdf, other

    cs.CV

    Visual Grounding with Transformers

    Authors: Ye Du, Zehua Fu, Qingjie Liu, Yunhong Wang

    Abstract: In this paper, we propose a transformer based approach for visual grounding. Unlike previous proposal-and-rank frameworks that rely heavily on pretrained object detectors or proposal-free frameworks that upgrade an off-the-shelf one-stage detector by fusing textual embeddings, our approach is built on top of a transformer encoder-decoder and is independent of any pretrained detectors or word embed… ▽ More

    Submitted 13 March, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 7 pagrs, 3 figures. Accepted by ICME'22

  36. arXiv:2105.03538  [pdf, other

    math.AP math.NA

    Equivalent formulations of the oxygen depletion problem, other implicit free boundary value problems, and implications for numerical approximation

    Authors: Xinyu Cheng, Zhaohui Fu, Brian Wetton

    Abstract: The Oxygen Depletion problem is an implicit free boundary value problem. The dynamics allow topological changes in the free boundary. We show several mathematical formulations of this model from the literature and give a new formulation based on a gradient flow with constraint. All formulations are shown to be equivalent. We explore the possibilities for the numerical approximation of the problem… ▽ More

    Submitted 20 May, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: 30 pages, 4 figures

    MSC Class: 35R35

  37. Search for Solar Flare Neutrinos with the KamLAND detector

    Authors: S. Abe, S. Asami, A. Gando, Y. Gando, T. Gima, A. Goto, T. Hachiya, K. Hata, S. Hayashida, K. Hosokawa, K. Ichimura, S. Ieki, H. Ikeda, K. Inoue, K. Ishidoshiro, Y. Kamei, N. Kawada, Y. Kishimoto, T. Kinoshita, M. Koga, N. Maemura, T. Mitsui, H. Miyake, K. Nakamura, K. Nakamura , et al. (44 additional authors not shown)

    Abstract: We report the result of a search for neutrinos in coincidence with solar flares from the GOES flare database. The search was performed on a 10.8 kton-year exposure of KamLAND collected from 2002 to 2019. This large exposure allows us to explore previously unconstrained parameter space for solar flare neutrinos. We found no statistical excess of neutrinos and established 90% confidence level upper… ▽ More

    Submitted 26 October, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: 13 pages, 9 figures, accepted October 27, 2021

    Journal ref: The Astrophysical Journal, Volume 924, Number 2, Page 103 (2022)

  38. arXiv:2105.01128  [pdf, other

    cs.LG eess.SP

    Fusing multimodal neuroimaging data with a variational autoencoder

    Authors: Eloy Geenjaar, Noah Lewis, Zening Fu, Rohan Venkatdas, Sergey Plis, Vince Calhoun

    Abstract: Neuroimaging studies often involve the collection of multiple data modalities. These modalities contain both shared and mutually exclusive information about the brain. This work aims at finding a scalable and interpretable method to fuse the information of multiple neuroimaging modalities using a variational autoencoder (VAE). To provide an initial assessment, this work evaluates the representatio… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

  39. arXiv:2104.12308  [pdf, other

    cs.LG

    Auto-weighted low-rank representation for clustering

    Authors: Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang

    Abstract: In this paper, a novel unsupervised low-rank representation model, i.e., Auto-weighted Low-Rank Representation (ALRR), is proposed to construct a more favorable similarity graph (SG) for clustering. In particular, ALRR enhances the discriminability of SG by capturing the multi-subspace structure and extracting the salient features simultaneously. Specifically, an auto-weighted penalty is introduce… ▽ More

    Submitted 25 April, 2021; originally announced April 2021.

  40. Efficient Non-Sampling Knowledge Graph Embedding

    Authors: Zelong Li, Jianchao Ji, Zuohui Fu, Yingqiang Ge, Shuyuan Xu, Chong Chen, Yongfeng Zhang

    Abstract: Knowledge Graph (KG) is a flexible structure that is able to describe the complex relationship between data entities. Currently, most KG embedding models are trained based on negative sampling, i.e., the model aims to maximize some similarity of the connected entities in the KG, while minimizing the similarity of the sampled disconnected entities. Negative sampling helps to reduce the time complex… ▽ More

    Submitted 16 June, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: 10 pages, 3 figures. The first two authors contributed equally to the work. Accepted to WWW 2021

  41. arXiv:2104.10781  [pdf, other

    eess.IV cs.CV

    NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

    Authors: Ren Yang, Radu Timofte, **g Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng li, Thomas Tanay , et al. (47 additional authors not shown)

    Abstract: This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at… ▽ More

    Submitted 31 August, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Corrected the MOS values in Table 2, and corrected some minor typos

  42. arXiv:2104.10671  [pdf, other

    cs.IR cs.AI cs.HC cs.LG cs.SI

    User-oriented Fairness in Recommendation

    Authors: Yunqi Li, Hanxiong Chen, Zuohui Fu, Yingqiang Ge, Yongfeng Zhang

    Abstract: As a highly data-driven application, recommender systems could be affected by data bias, resulting in unfair results for different data groups, which could be a reason that affects the system performance. Therefore, it is important to identify and solve the unfairness issues in recommendation scenarios. In this paper, we address the unfairness problem in recommender systems from the user perspecti… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: Accepted to the 30th Web Conference (WWW 2021)

  43. arXiv:2104.08489  [pdf, other

    cs.LG cs.CV

    Semi-Supervised Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport

    Authors: Yang Yang, Zhao-Yang Fu, De-Chuan Zhan, Zhi-Bin Liu, Yuan Jiang

    Abstract: Complex objects are usually with multiple labels, and can be represented by multiple modal representations, e.g., the complex articles contain text and image information as well as multiple annotations. Previous methods assume that the homogeneous multi-modal data are consistent, while in real applications, the raw data are disordered, e.g., the article constitutes with variable number of inconsis… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

  44. arXiv:2104.08451  [pdf, other

    cs.CL cs.AI

    Context-Aware Interaction Network for Question Matching

    Authors: Zhe Hu, Zuohui Fu, Yu Yin, Gerard de Melo

    Abstract: Impressive milestones have been achieved in text matching by adopting a cross-attention mechanism to capture pertinent semantic connections between two sentence representations. However, regular cross-attention focuses on word-level links between the two input sequences, neglecting the importance of contextual information. We propose a context-aware interaction network (COIN) to properly align two… ▽ More

    Submitted 18 September, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

  45. arXiv:2104.07869  [pdf, other

    cs.IR

    Faithfully Explainable Recommendation via Neural Logic Reasoning

    Authors: Yaxin Zhu, Yikun Xian, Zuohui Fu, Gerard de Melo, Yongfeng Zhang

    Abstract: Knowledge graphs (KG) have become increasingly important to endow modern recommender systems with the ability to generate traceable reasoning paths to explain the recommendation process. However, prior research rarely considers the faithfulness of the derived explanations to justify the decision making process. To the best of our knowledge, this is the first work that models and evaluates faithful… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted in NAACL 2021

  46. arXiv:2104.03743  [pdf, other

    cs.LG cs.CE stat.AP

    Residual Gaussian Process: A Tractable Nonparametric Bayesian Emulator for Multi-fidelity Simulations

    Authors: Wei W. Xing, Akeel A. Shah, Peng Wang, Shandian Zhe Qian Fu, Robert. M. Kirby

    Abstract: Challenges in multi-fidelity modeling relate to accuracy, uncertainty estimation and high-dimensionality. A novel additive structure is introduced in which the highest fidelity solution is written as a sum of the lowest fidelity solution and residuals between the solutions at successive fidelity levels, with Gaussian process priors placed over the low fidelity solution and each of the residuals. T… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

  47. arXiv:2104.00324  [pdf, other

    cs.CV

    STMTrack: Template-free Visual Tracking with Space-time Memory Networks

    Authors: Zhihong Fu, Qingjie Liu, Zehua Fu, Yunhong Wang

    Abstract: Boosting performance of the offline trained siamese trackers is getting harder nowadays since the fixed information of the template cropped from the first frame has been almost thoroughly mined, but they are poorly capable of resisting target appearance changes. Existing trackers with template updating mechanisms rely on time-consuming numerical optimization and complex hand-designed strategies to… ▽ More

    Submitted 2 April, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: Accepted by CVPR 2021

  48. arXiv:2103.16173  [pdf, other

    cs.CV cs.AI

    Contrastive Embedding for Generalized Zero-Shot Learning

    Authors: Zongyan Han, Zhenyong Fu, Shuo Chen, Jian Yang

    Abstract: Generalized zero-shot learning (GZSL) aims to recognize objects from both seen and unseen classes, when only the labeled examples from seen classes are provided. Recent feature generation methods learn a generative model that can synthesize the missing visual features of unseen classes to mitigate the data-imbalance problem in GZSL. However, the original visual feature space is suboptimal for GZSL… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted by CVPR2021

  49. Efficient Multi-Stage Video Denoising with Recurrent Spatio-Temporal Fusion

    Authors: Matteo Maggioni, Yibin Huang, Cheng Li, Shuai Xiao, Zhongqian Fu, Fenglong Song

    Abstract: In recent years, denoising methods based on deep learning have achieved unparalleled performance at the cost of large computational complexity. In this work, we propose an Efficient Multi-stage Video Denoising algorithm, called EMVD, to drastically reduce the complexity while maintaining or even improving the performance. First, a fusion stage reduces the noise through a recursive combination of a… ▽ More

    Submitted 30 March, 2023; v1 submitted 9 March, 2021; originally announced March 2021.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 3465-3474

  50. The Observation of Ferroelastic and Ferrielectric Domains in AgNbO3 Single Crystal

    Authors: Wei Zhao, Zhengqian Fu, Jianming Deng, Song Li, Yifeng Han, Man-Rong Li, Xueyun Wang, Jiawang Hong

    Abstract: Compared to AgNbO3 based ceramics, the experimental investigations on the single crystalline AgNbO3, especially the ground state and ferroic domain structures, are not on the same level. Here in this work, based on successfully synthesized AgNbO3 single crystal using flux method, we observed the coexistence of ferroelastic and ferrielectric domain structures by a combination study of polarized lig… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Journal ref: Chin. Phys. Lett. 2021, 38 (3): 037701