
Showing 51–94 of 94 results for author: Zhou, J T

  1. arXiv:2105.06943  [pdf, other]

    cs.NE

    Efficient Spiking Neural Networks with Radix Encoding

    Authors: Zhehui Wang, Xiaozhe Gu, Rick Goh, Joey Tianyi Zhou, Tao Luo

    Abstract: Spiking neural networks (SNNs) have advantages in latency and energy efficiency over traditional artificial neural networks (ANNs) due to its event-driven computation mechanism and replacement of energy-consuming weight multiplications with additions. However, in order to reach accuracy of its ANN counterpart, it usually requires long spike trains to ensure the accuracy. Traditionally, a spike tra… ▽ More

    Submitted 2 November, 2023; v1 submitted 14 May, 2021; originally announced May 2021.

  2. arXiv:2105.06247  [pdf, other]

    cs.CL cs.CV cs.IR

    Video Corpus Moment Retrieval with Contrastive Learning

    Authors: Hao Zhang, Aixin Sun, Wei Jing, Guoshun Nan, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh

    Abstract: Given a collection of untrimmed and unsegmented videos, video corpus moment retrieval (VCMR) is to retrieve a temporal moment (i.e., a fraction of a video) that semantically corresponds to a given text query. As video and text are from two distinct feature spaces, there are two general approaches to address VCMR: (i) to separately encode each modality representations, then align the two modality r… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

    Comments: 11 pages, 7 figures and 6 tables. Accepted by SIGIR 2021

  3. arXiv:2103.16074  [pdf, other]

    cs.LG cs.CR cs.CV

    PointBA: Towards Backdoor Attacks in 3D Point Cloud

    Authors: Xinke Li, Zhirui Chen, Yue Zhao, Zekun Tong, Yabang Zhao, Andrew Lim, Joey Tianyi Zhou

    Abstract: 3D deep learning has been increasingly more popular for a variety of tasks including many safety-critical applications. However, recently several works raise the security issues of 3D deep models. Although most of them consider adversarial attacks, we identify that backdoor attack is indeed a more serious threat to 3D deep learning systems but remains unexplored. We present the backdoor attacks in… ▽ More

    Submitted 22 August, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted by ICCV 2021

  4. arXiv:2103.14493  [pdf, other]

    cs.LG cs.NE

    RCT: Resource Constrained Training for Edge AI

    Authors: Tian Huang, Tao Luo, Ming Yan, Joey Tianyi Zhou, Rick Goh

    Abstract: Neural networks training on edge terminals is essential for edge AI computing, which needs to be adaptive to evolving environment. Quantised models can efficiently run on edge devices, but existing training methods for these compact models are designed to run on powerful servers with abundant memory and energy budget. For example, quantisation-aware training (QAT) method involves two copies of mod… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: 14 pages

    MSC Class: 68T07 (Primary) 68T05 (Secondary) ACM Class: I.5.1; I.2.6

  5. Natural Language Video Localization: A Revisit in Span-based Question Answering Framework

    Authors: Hao Zhang, Aixin Sun, Wei Jing, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh

    Abstract: Natural Language Video Localization (NLVL) aims to locate a target moment from an untrimmed video that semantically corresponds to a text query. Existing approaches mainly solve the NLVL problem from the perspective of computer vision by formulating it as ranking, anchor, or regression tasks. These methods suffer from large performance degradation when localizing on long videos. In this work, we a… ▽ More

    Submitted 2 March, 2021; v1 submitted 26 February, 2021; originally announced February 2021.

    Comments: 15 pages, 18 figures, and 10 tables. Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). arXiv admin note: substantial text overlap with arXiv:2004.13931

    Report number: TPAMI-2020-09-1337.R1

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021

  6. arXiv:2102.02051  [pdf, other]

    cs.LG cs.CV

    Trusted Multi-View Classification

    Authors: Zongbo Han, Changqing Zhang, Huazhu Fu, Joey Tianyi Zhou

    Abstract: Multi-view classification (MVC) generally focuses on improving classification accuracy by using information from different views, typically integrating them into a unified comprehensive representation for downstream tasks. However, it is also crucial to dynamically assess the quality of a view for different samples in order to provide reliable uncertainty estimations, which indicate whether predic… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: Accepted by ICLR 2021

  7. arXiv:2101.01149  [pdf, ps, other]

    eess.SP

    Deep Learning for Latent Events Forecasting in Twitter Aided Caching Networks

    Authors: Zhong Yang, Yuanwei Liu, Yue Chen, Joey Tianyi Zhou

    Abstract: A novel Twitter context aided content caching (TAC) framework is proposed for enhancing the caching efficiency by taking advantage of the legibility and massive volume of Twitter data. For the purpose of promoting the caching efficiency, three machine learning models are proposed to predict latent events and events popularity, utilizing collect Twitter data with geo-tags and geographic information… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: 30 pages, 15 figures

  8. arXiv:2012.12775  [pdf, other]

    cs.LG

    Adaptive Precision Training for Resource Constrained Devices

    Authors: Tian Huang, Tao Luo, Joey Tianyi Zhou

    Abstract: Learn in-situ is a growing trend for Edge AI. Training deep neural network (DNN) on edge devices is challenging because both energy and memory are constrained. Low precision training helps to reduce the energy cost of a single training iteration, but that does not necessarily translate to energy savings for the whole training process, because low precision could slows down the convergence rate. On… ▽ More

    Submitted 23 December, 2020; originally announced December 2020.

    Comments: 6 pages

  9. arXiv:2011.06170  [pdf, other]

    cs.LG

    Deep Partial Multi-View Learning

    Authors: Changqing Zhang, Yajie Cui, Zongbo Han, Joey Tianyi Zhou, Huazhu Fu, Qinghua Hu

    Abstract: Although multi-view learning has made significant progress over the past few decades, it is still challenging due to the difficulty in modeling complex correlations among different views, especially under the context of view missing. To address the challenge, we propose a novel framework termed Cross Partial Multi-View Networks (CPM-Nets), which aims to fully and flexibly take advantage of m… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

  10. arXiv:2010.11655  [pdf, other]

    cs.LG cs.AI

    Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games

    Authors: Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang

    Abstract: We study reinforcement learning (RL) for text-based games, which are interactive simulations in the context of natural language. While different methods have been developed to represent the environment information and language actions, existing RL agents are not empowered with any reasoning capabilities to deal with textual games. In this work, we aim to conduct explicit reasoning with knowledge g… ▽ More

    Submitted 25 December, 2020; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted by NeurIPS2020

  11. arXiv:2009.11719  [pdf, other]

    cs.LG

    Deep Neural Networks with Short Circuits for Improved Gradient Learning

    Authors: Ming Yan, Xueli Xiao, Joey Tianyi Zhou, Yi Pan

    Abstract: Deep neural networks have achieved great success both in computer vision and natural language processing tasks. However, mostly state-of-art methods highly rely on external training or computing to improve the performance. To alleviate the external reliance, we proposed a gradient enhancement approach, conducted by the short circuit neural connections, to improve the gradient learning of deep neur… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.

  12. arXiv:2009.10465  [pdf, other]

    cs.CV cs.DC

    Deep N-ary Error Correcting Output Codes

    Authors: Hao Zhang, Joey Tianyi Zhou, Tianying Wang, Ivor W. Tsang, Rick Siow Mong Goh

    Abstract: Ensemble learning consistently improves the performance of multi-class classification through aggregating a series of base classifiers. To this end, data-independent ensemble methods like Error Correcting Output Codes (ECOC) attract increasing attention due to its easiness of implementation and parallelization. Specifically, traditional ECOCs and its general extension N-ary ECOC decompose the orig… ▽ More

    Submitted 14 December, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: EAI MOBIMEDIA 2020

  13. arXiv:2009.09687  [pdf, other]

    cs.LG cs.CV stat.ML

    Contrastive Clustering

    Authors: Yunfan Li, Peng Hu, Zitao Liu, Dezhong Peng, Joey Tianyi Zhou, Xi Peng

    Abstract: In this paper, we propose a one-stage online clustering method called Contrastive Clustering (CC) which explicitly performs the instance- and cluster-level contrastive learning. To be specific, for a given dataset, the positive and negative instance pairs are constructed through data augmentations and then projected into a feature space. Therein, the instance- and cluster-level contrastive learnin… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

  14. arXiv:2008.11401  [pdf, other]

    cs.CV

    Point Adversarial Self Mining: A Simple Method for Facial Expression Recognition

    Authors: Ping Liu, Yuewei Lin, Zibo Meng, Lu Lu, Weihong Deng, Joey Tianyi Zhou, Yi Yang

    Abstract: In this paper, we propose a simple yet effective approach, named Point Adversarial Self Mining (PASM), to improve the recognition accuracy in facial expression recognition. Unlike previous works focusing on designing specific architectures or loss functions to solve this problem, PASM boosts the network capability by simulating human learning processes: providing updated learning materials and gui… ▽ More

    Submitted 8 May, 2021; v1 submitted 26 August, 2020; originally announced August 2020.

  15. arXiv:2007.06878  [pdf, other]

    cs.LG stat.ML

    Attentive Graph Neural Networks for Few-Shot Learning

    Authors: Hao Cheng, Joey Tianyi Zhou, Wee Peng Tay, Bihan Wen

    Abstract: Graph Neural Networks (GNN) has demonstrated the superior performance in many challenging applications, including the few-shot learning tasks. Despite its powerful capacity to learn and generalize the model from few samples, GNN usually suffers from severe over-fitting and over-smoothing as the model becomes deep, which limit the scalability. In this work, we propose a novel Attentive GNN to tackl… ▽ More

    Submitted 2 October, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

  16. arXiv:2007.05720  [pdf, other]

    cs.CV

    ECML: An Ensemble Cascade Metric Learning Mechanism towards Face Verification

    Authors: Fu Xiong, Yang Xiao, Zhiguo Cao, Yancheng Wang, Joey Tianyi Zhou, Jianxi Wu

    Abstract: Face verification can be regarded as a 2-class fine-grained visual recognition problem. Enhancing the feature's discriminative power is one of the key problems to improve its performance. Metric learning technology is often applied to address this need, while achieving a good tradeoff between underfitting and overfitting plays the vital role in metric learning. Hence, we propose a novel ensemble c… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: Accepted to IEEE Transaction on Cybernetics

  17. arXiv:2006.16829  [pdf, other]

    cs.CV

    You Only Look Yourself: Unsupervised and Untrained Single Image Dehazing Neural Network

    Authors: Boyun Li, Yuanbiao Gou, Shuhang Gu, Jerry Zitao Liu, Joey Tianyi Zhou, Xi Peng

    Abstract: In this paper, we study two challenging and less-touched problems in single image dehazing, namely, how to make deep learning achieve image dehazing without training on the ground-truth clean image (unsupervised) and a image collection (untrained). An unsupervised neural network will avoid the intensive labor collection of hazy-clean image pairs, and an untrained model is a "real" single image d… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

  18. arXiv:2006.04588  [pdf, ps, other]

    cs.LG stat.ML

    EDCompress: Energy-Aware Model Compression for Dataflows

    Authors: Zhehui Wang, Tao Luo, Joey Tianyi Zhou, Rick Siow Mong Goh

    Abstract: Edge devices demand low energy consumption, cost and small form factor. To efficiently deploy convolutional neural network (CNN) models on edge device, energy-aware model compression becomes extremely important. However, existing work did not study this problem well because the lack of considering the diversity of dataflow types in hardware architectures. In this paper, we propose EDCompress, an E… ▽ More

    Submitted 11 July, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

  19. arXiv:2005.08551  [pdf, other]

    cs.CV

    Omni-supervised Facial Expression Recognition via Distilled Data

    Authors: Ping Liu, Yunchao Wei, Zibo Meng, Weihong Deng, Joey Tianyi Zhou, Yi Yang

    Abstract: Facial expression plays an important role in understanding human emotions. Most recently, deep learning based methods have shown promising for facial expression recognition. However, the performance of the current state-of-the-art facial expression recognition (FER) approaches is directly related to the labeled data for training. To solve this issue, prior works employ the pretrain-and-finetune st… ▽ More

    Submitted 8 December, 2021; v1 submitted 18 May, 2020; originally announced May 2020.

  20. arXiv:2005.07902  [pdf, other]

    eess.IV cs.CV

    The Power of Triply Complementary Priors for Image Compressive Sensing

    Authors: Zhiyuan Zha, Xin Yuan, Joey Tianyi Zhou, Jiantao Zhou, Bihan Wen, Ce Zhu

    Abstract: Recent works that utilized deep models have achieved superior results in various image restoration applications. Such approach is typically supervised which requires a corpus of training images with distribution similar to the images to be recovered. On the other hand, the shallow methods which are usually unsupervised remain promising performance in many inverse problems, e.g., image compressive s… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Journal ref: 2020 International Conference on Image Processing

  21. arXiv:2005.05501  [pdf, other]

    cs.CV

    3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

    Authors: Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

    Abstract: To facilitate depth-based 3D action recognition, 3D dynamic voxel (3DV) is proposed as a novel 3D motion representation. With 3D space voxelization, the key idea of 3DV is to encode 3D motion information within depth video into a regular voxel set (i.e., 3DV) compactly, via temporal rank pooling. Each available 3DV voxel intrinsically involves 3D spatial and motion feature jointly. 3DV is then abs… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: Accepted by CVPR2020

  22. arXiv:2004.14798  [pdf, other]

    cs.CR cs.CV cs.LG eess.IV

    RAIN: A Simple Approach for Robust and Accurate Image Classification Networks

    Authors: Jiawei Du, Hanshu Yan, Vincent Y. F. Tan, Joey Tianyi Zhou, Rick Siow Mong Goh, Jiashi Feng

    Abstract: It has been shown that the majority of existing adversarial defense methods achieve robustness at the cost of sacrificing prediction accuracy. The undesirable severe drop in accuracy adversely affects the reliability of machine learning algorithms and prohibits their deployment in realistic applications. This paper aims to address this dilemma by proposing a novel preprocessing framework, which we… ▽ More

    Submitted 4 November, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

  23. arXiv:2004.13931  [pdf, other]

    cs.CL cs.CV

    Span-based Localizing Network for Natural Language Video Localization

    Authors: Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou

    Abstract: Given an untrimmed video and a text query, natural language video localization (NLVL) is to locate a matching span from the video that semantically corresponds to the query. Existing solutions formulate NLVL either as a ranking task and apply multimodal matching architecture, or as a regression task to directly regress the target video span. In this work, we address NLVL task with a span-based QA… ▽ More

    Submitted 14 June, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: To appear at ACL 2020

  24. arXiv:2004.13303  [pdf, other]

    cs.LG stat.ML

    Heterogeneous Representation Learning: A Review

    Authors: Joey Tianyi Zhou, Xi Peng, Yew-Soon Ong

    Abstract: The real-world data usually exhibits heterogeneous properties such as modalities, views, or resources, which brings some unique challenges wherein the key is Heterogeneous Representation Learning (HRL) termed in this paper. This brief survey covers the topic of HRL, centered around several major learning settings and real-world applications. First of all, from the mathematical perspective, we pres… ▽ More

    Submitted 30 April, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

  25. arXiv:2004.01980  [pdf, other]

    cs.CL cs.AI cs.LG

    Hooks in the Headline: Learning to Generate Headlines with Controlled Styles

    Authors: Di Jin, Zhijing Jin, Joey Tianyi Zhou, Lisa Orii, Peter Szolovits

    Abstract: Current summarization systems only produce plain, factual headlines, but do not meet the practical needs of creating memorable titles to increase exposure. We propose a new task, Stylistic Headline Generation (SHG), to enrich the headlines with three style options (humor, romance and clickbait), in order to attract more readers. With no style-specific article-headline pair (only a standard headlin… ▽ More

    Submitted 28 May, 2020; v1 submitted 4 April, 2020; originally announced April 2020.

    Comments: ACL 2020

    Report number: 12 pages

  26. arXiv:2001.08140  [pdf, other]

    cs.CL cs.LG

    A Simple Baseline to Semi-Supervised Domain Adaptation for Machine Translation

    Authors: Di Jin, Zhijing Jin, Joey Tianyi Zhou, Peter Szolovits

    Abstract: State-of-the-art neural machine translation (NMT) systems are data-hungry and perform poorly on new domains with no supervised data. As data collection is expensive and infeasible in many cases, domain adaptation methods are needed. In this work, we propose a simple but effect approach to the semi-supervised domain adaptation scenario of NMT, where the aim is to improve the performance of a transl… ▽ More

    Submitted 5 June, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: Under review

  27. arXiv:1912.11236  [pdf, other]

    cs.CV

    Ordered or Orderless: A Revisit for Video based Person Re-Identification

    Authors: Le Zhang, Zenglin Shi, Joey Tianyi Zhou, Ming-Ming Cheng, Yun Liu, Jia-Wang Bian, Zeng Zeng, Chunhua Shen

    Abstract: Is recurrent network really necessary for learning a good visual representation for video based person re-identification (VPRe-id)? In this paper, we first show that the common practice of employing recurrent neural networks (RNNs) to aggregate temporal spatial features may not be optimal. Specifically, with a diagnostic analysis, we show that the recurrent structure may not be effective to learn… ▽ More

    Submitted 24 December, 2019; originally announced December 2019.

    Comments: Under Minor Revision in IEEE TPAMI

  28. arXiv:1911.06137  [pdf, other]

    cs.CL cs.LG

    Unsupervised Domain Adaptation on Reading Comprehension

    Authors: Yu Cao, Meng Fang, Baosheng Yu, Joey Tianyi Zhou

    Abstract: Reading comprehension (RC) has been studied in a variety of datasets with the boosted performance brought by deep neural networks. However, the generalization capability of these models across different domains remains unclear. To alleviate this issue, we are going to investigate unsupervised domain adaptation on RC, wherein a model is trained on labeled source domain and to be applied to the targ… ▽ More

    Submitted 26 July, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: 8 pages, 6 figures, 5 tables, Accepted by AAAI 2020

  29. arXiv:1909.06940  [pdf, other]

    cs.LG cs.CV stat.ML

    Multi-graph Fusion for Multi-view Spectral Clustering

    Authors: Zhao Kang, Guoxin Shi, Shudong Huang, Wenyu Chen, Xiaorong Pu, Joey Tianyi Zhou, Zenglin Xu

    Abstract: A panoply of multi-view clustering algorithms has been developed to deal with prevalent multi-view data. Among them, spectral clustering-based methods have drawn much attention and demonstrated promising results recently. Despite progress, there are still two fundamental questions that stay unanswered to date. First, how to fuse different views into one graph. More often than not, the similarities… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

    Comments: submitted to Knowledge-based Systems

  30. arXiv:1908.09999  [pdf, other]

    cs.CV

    A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image

    Authors: Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan

    Abstract: For 3D hand and body pose estimation task in depth image, a novel anchor-based approach termed Anchor-to-Joint regression network (A2J) with the end-to-end learning ability is proposed. Within A2J, anchor points able to capture global-local spatial context information are densely set on depth image as local regressors for the joints. They contribute to predict the positions of the joints in ensemb… ▽ More

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: Accepted by ICCV2019

  31. arXiv:1908.09066  [pdf, other]

    cs.CV

    Robust Regression via Deep Negative Correlation Learning

    Authors: Le Zhang, Zenglin Shi, Ming-Ming Cheng, Yun Liu, Jia-Wang Bian, Joey Tianyi Zhou, Guoyan Zheng, Zeng Zeng

    Abstract: Nonlinear regression has been extensively employed in many computer vision problems (e.g., crowd counting, age estimation, affective computing). Under the umbrella of deep learning, two common solutions exist i) transforming nonlinear regression to a robust loss function which is jointly optimizable with the deep convolutional network, and ii) utilizing ensemble of deep networks. Although some imp… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

  32. arXiv:1907.11932  [pdf, other]

    cs.CL cs.AI cs.LG

    Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment

    Authors: Di Jin, Zhijing Jin, Joey Tianyi Zhou, Peter Szolovits

    Abstract: Machine learning algorithms are often vulnerable to adversarial examples that have imperceptible alterations from the original counterparts but can fool the state-of-the-art models. It is helpful to evaluate or even improve the robustness of these models by exposing the maliciously crafted adversarial examples. In this paper, we present TextFooler, a simple but strong baseline to generate natural… ▽ More

    Submitted 8 April, 2020; v1 submitted 27 July, 2019; originally announced July 2019.

    Comments: AAAI 2020 (Oral)

  33. arXiv:1906.02398  [pdf, other]

    cs.CV

    Query-efficient Meta Attack to Deep Neural Networks

    Authors: Jiawei Du, Hu Zhang, Joey Tianyi Zhou, Yi Yang, Jiashi Feng

    Abstract: Black-box attack methods aim to infer suitable attack patterns to targeted DNN models by only using output feedback of the models and the corresponding input queries. However, due to lack of prior and inefficiency in leveraging the query and feedback information, existing methods are mostly query-intensive for obtaining effective attack patterns. In this work, we propose a meta attack approach tha… ▽ More

    Submitted 14 February, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

  34. Towards Real-time Eyeblink Detection in The Wild: Dataset, Theory and Practices

    Authors: Guilei Hu, Yang Xiao, Zhiguo Cao, Lubin Meng, Zhiwen Fang, Joey Tianyi Zhou, Junsong Yuan

    Abstract: Effective and real-time eyeblink detection is of wide-range applications, such as deception detection, drive fatigue detection, face anti-spoofing, etc. Although numerous of efforts have already been paid, most of them focus on addressing the eyeblink detection problem under the constrained indoor conditions with the relative consistent subject and environment setup. Nevertheless, towards the prac… ▽ More

    Submitted 18 December, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Journal ref: IEEE Transactions on Information Forensics and Security 2019

  35. arXiv:1808.07292  [pdf, other]

    cs.LG stat.ML

    XAI Beyond Classification: Interpretable Neural Clustering

    Authors: Xi Peng, Yunnan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou

    Abstract: In this paper, we study two challenging problems in explainable AI (XAI) and data clustering. The first is how to directly design a neural network with inherent interpretability, rather than giving post-hoc explanations of a black-box model. The second is implementing discrete $k$-means with a differentiable neural network that embraces the advantages of parallel computing, online clustering, and… ▽ More

    Submitted 22 April, 2022; v1 submitted 22 August, 2018; originally announced August 2018.

    Comments: 28 pages

    Journal ref: Journal of Machine Learning Research, 2022

  36. arXiv:1807.11042  [pdf, other]

    cs.CV

    Towards Good Practices on Building Effective CNN Baseline Model for Person Re-identification

    Authors: Fu Xiong, Yang Xiao, Zhiguo Cao, Kaicheng Gong, Zhiwen Fang, Joey Tianyi Zhou

    Abstract: Person re-identification is indeed a challenging visual recognition task due to the critical issues of human pose variation, human body occlusion, camera view variation, etc. To address this, most of the state-of-the-art approaches are proposed based on deep convolutional neural network (CNN), being leveraged by its strong feature learning power and classification boundary fitting capacity. Althou… ▽ More

    Submitted 29 July, 2018; originally announced July 2018.

  37. arXiv:1806.11269  [pdf, other]

    cs.CV

    Action Recognition for Depth Video using Multi-view Dynamic Images

    Authors: Yang Xiao, Jun Chen, Yancheng Wang, Zhiguo Cao, Joey Tianyi Zhou, Xiang Bai

    Abstract: Dynamic imaging is a recently proposed action description paradigm for simultaneously capturing motion and temporal evolution information, particularly in the context of deep convolutional neural networks (CNNs). Compared with optical flow for motion characterization, dynamic imaging exhibits superior efficiency and compactness. Inspired by the success of dynamic imaging in RGB video, this study e… ▽ More

    Submitted 27 December, 2018; v1 submitted 29 June, 2018; originally announced June 2018.

    Comments: accepted by Information Sciences

  38. arXiv:1702.08681  [pdf, other]

    cs.CV

    MIML-FCN+: Multi-instance Multi-label Learning via Fully Convolutional Networks with Privileged Information

    Authors: Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew Soon Ong

    Abstract: Multi-instance multi-label (MIML) learning has many interesting applications in computer visions, including multi-object recognition and automatic image tagging. In these applications, additional information such as bounding-boxes, image captions and descriptions is often available during training phrase, which is referred as privileged information (PI). However, as existing works on learning usin… ▽ More

    Submitted 28 February, 2017; originally announced February 2017.

    Comments: Accepted in CVPR 2017

  39. arXiv:1608.01441  [pdf, other]

    cs.CV

    Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations

    Authors: Hao Yang, Joey Tianyi Zhou, Jianfei Cai

    Abstract: Multi-label learning has attracted significant interests in computer vision recently, finding applications in many vision tasks such as multiple object recognition and automatic image annotation. Associating multiple labels to a complex image is very difficult, not only due to the intricacy of describing the image, but also because of the incompleteness nature of the observed labels. Existing work… ▽ More

    Submitted 4 August, 2016; originally announced August 2016.

    Comments: Accepted in ECCV 2016

  40. arXiv:1605.04034  [pdf, other]

    cs.LG stat.ML

    Transfer Hashing with Privileged Information

    Authors: Joey Tianyi Zhou, Xinxing Xu, Sinno Jialin Pan, Ivor W. Tsang, Zheng Qin, Rick Siow Mong Goh

    Abstract: Most existing learning to hash methods assume that there are sufficient data, either labeled or unlabeled, on the domain of interest (i.e., the target domain) for training. However, this assumption cannot be satisfied in some real-world applications. To address this data sparsity issue in hashing, inspired by transfer learning, we propose a new framework named Transfer Hashing with Privileged Info… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.

    Comments: Accepted by IJCAI-2016

  41. arXiv:1604.01518  [pdf, ps, other]

    cs.LG

    Simple and Efficient Learning using Privileged Information

    Authors: Xinxing Xu, Joey Tianyi Zhou, Ivor W. Tsang, Zheng Qin, Rick Siow Mong Goh, Yong Liu

    Abstract: The Support Vector Machine using Privileged Information (SVM+) has been proposed to train a classifier to utilize the additional privileged information that is only available in the training phase but not available in the test phase. In this work, we propose an efficient solution for SVM+ by simply utilizing the squared hinge loss instead of the hinge loss as in the existing SVM+ formulation, whic… ▽ More

    Submitted 6 April, 2016; originally announced April 2016.

  42. arXiv:1603.05850  [pdf, other]

    cs.LG

    N-ary Error Correcting Coding Scheme

    Authors: Joey Tianyi Zhou, Ivor W. Tsang, Shen-Shyang Ho, Klaus-Robert Muller

    Abstract: The coding matrix design plays a fundamental role in the prediction performance of the error correcting output codes (ECOC)-based multi-class task. In many-class classification problems, e.g., fine-grained categorization, it is difficult to distinguish subtle between-class differences under existing coding schemes due to a limited choices of coding values. In this paper, we investigate whether o… ▽ More

    Submitted 18 March, 2016; originally announced March 2016.

    Comments: Under submission to IEEE Transaction on Information Theory

  43. arXiv:1507.01101  [pdf, other]

    cs.DC

    Utility Optimal Thread Assignment and Resource Allocation in Multi-Server Systems

    Authors: Pan Lai, Rui Fan, Xiao Zhang, Wei Zhang, Fang Liu, Joey Tianyi Zhou

    Abstract: Achieving high performance in many multi-server systems requires finding a good assignment of worker threads to servers and also effectively allocating each server's resources to its assigned threads. The assignment and allocation components of this problem have been studied extensively but largely separately in the literature. In this paper, we introduce the assign and allocate (AA) problem, whic… ▽ More

    Submitted 9 June, 2021; v1 submitted 4 July, 2015; originally announced July 2015.

    Comments: 17 pages

    ACM Class: C.1.4; D.4.2; F.2.1

  44. arXiv:1504.05843  [pdf, other]

    cs.CV cs.LG

    Exploit Bounding Box Annotations for Multi-label Object Recognition

    Authors: Hao Yang, Joey Tianyi Zhou, Yu Zhang, Bin-Bin Gao, Jianxin Wu, Jianfei Cai

    Abstract: Convolutional neural networks (CNNs) have shown great performance as general feature representations for object recognition applications. However, for multi-label images that contain multiple objects from different categories, scales and locations, global CNN features are not optimal. In this paper, we incorporate local information to enhance the feature discriminative power. In particular, we fir… ▽ More

    Submitted 3 June, 2016; v1 submitted 22 April, 2015; originally announced April 2015.

    Comments: Accepted in CVPR 2016