Skip to main content

Showing 1–28 of 28 results for author: Pietikäinen, M

Searching in archive cs. Search in all archives.
.
  1. Lightweight Pixel Difference Networks for Efficient Visual Representation Learning

    Authors: Zhuo Su, Jiehua Zhang, Longguang Wang, Hua Zhang, Zhen Liu, Matti Pietikäinen, Li Liu

    Abstract: Recently, there have been tremendous efforts in develo** lightweight Deep Neural Networks (DNNs) with satisfactory accuracy, which can enable the ubiquitous deployment of DNNs in edge devices. The core challenge of develo** compact and efficient DNNs lies in how to balance the competing goals of achieving high accuracy and high efficiency. In this paper we propose two novel types of convolutio… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: We design a novel lightweight convolutional operator for computer vision tasks. Both full-precision networks and BNNs are developed. Accepted by TPAMI

  2. arXiv:2308.06764  [pdf, other

    cs.LG cs.AI

    Few-shot Class-incremental Learning: A Survey

    Authors: **ghua Zhang, Li Liu, Olli Silvén, Matti Pietikäinen, Dewen Hu

    Abstract: Few-shot Class-Incremental Learning (FSCIL) presents a unique challenge in Machine Learning (ML), as it necessitates the Incremental Learning (IL) of new classes from sparsely labeled training samples without forgetting previous knowledge. While this field has seen recent progress, it remains an active exploration area. This paper aims to provide a comprehensive and systematic review of FSCIL. In… ▽ More

    Submitted 16 December, 2023; v1 submitted 13 August, 2023; originally announced August 2023.

  3. arXiv:2304.06305  [pdf, other

    cs.CV

    Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution

    Authors: Zhuo Su, Jiehua Zhang, Tianpeng Liu, Zhen Liu, Shuanghui Zhang, Matti Pietikäinen, Li Liu

    Abstract: This paper proposes a novel module called middle spectrum grouped convolution (MSGC) for efficient deep convolutional neural networks (DCNNs) with the mechanism of grouped convolution. It explores the broad "middle spectrum" area between channel pruning and conventional grouped convolution. Compared with channel pruning, MSGC can retain most of the information from the input feature maps due to th… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 13 pages, 11 figures, submitted to IEEEE Transactions on xxx

  4. arXiv:2303.08414  [pdf, other

    cs.CV

    From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning

    Authors: Zhuo Su, Matti Pietikäinen, Li Liu

    Abstract: LBP is a successful hand-crafted feature descriptor in computer vision. However, in the deep learning era, deep neural networks, especially convolutional neural networks (CNNs) can automatically learn powerful task-aware features that are more discriminative and of higher representational capacity. To some extent, such hand-crafted features can be safely ignored when designing deep computer vision… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: A small survey paper on Local Binary Pattern (LBP) inspired Deep Learning networks, accepted in SCIA 2023 (Scandinavian Conference on Image Analysis)

  5. arXiv:2211.02292  [pdf, other

    eess.IV cs.CV

    Boosting Binary Neural Networks via Dynamic Thresholds Learning

    Authors: Jiehua Zhang, Xueyang Zhang, Zhuo Su, Zitong Yu, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu

    Abstract: Develo** lightweight Deep Convolutional Neural Networks (DCNNs) and Vision Transformers (ViTs) has become one of the focuses in vision research since the low computational cost is essential for deploying vision models on edge devices. Recently, researchers have explored highly computational efficient Binary Neural Networks (BNNs) by binarizing weights and activations of Full-precision Neural Net… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  6. arXiv:2209.05924  [pdf, other

    cs.CV

    SVNet: Where SO(3) Equivariance Meets Binarization on Point Cloud Representation

    Authors: Zhuo Su, Max Welling, Matti Pietikäinen, Li Liu

    Abstract: Efficiency and robustness are increasingly needed for applications on 3D point clouds, with the ubiquitous use of edge devices in scenarios like autonomous driving and robotics, which often demand real-time and reliable responses. The paper tackles the challenge by designing a general framework to construct 3D learning architectures with SO(3) equivariance and network binarization. However, a naiv… ▽ More

    Submitted 20 September, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted in 3DV 2022. 11 pages including the appendix

  7. arXiv:2205.10839  [pdf, other

    cs.CV

    Deep Learning for Visual Speech Analysis: A Survey

    Authors: Changchong Sheng, Gangyao Kuang, Liang Bai, Chen** Hou, Yulan Guo, Xin Xu, Matti Pietikäinen, Li Liu

    Abstract: Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment. As a powerful AI strategy, deep learning techniques have extensively promoted the development of visual speech learning. Over the past five years, numerous deep learning based methods have bee… ▽ More

    Submitted 14 March, 2024; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: 20 pages, 8 figures. Accepted by IEEE TPAMI

  8. arXiv:2201.01466  [pdf

    cs.AI cs.CV cs.LG

    Challenges of Artificial Intelligence -- From Machine Learning and Computer Vision to Emotional Intelligence

    Authors: Matti Pietikäinen, Olli Silven

    Abstract: Artificial intelligence (AI) has become a part of everyday conversation and our lives. It is considered as the new electricity that is revolutionizing the world. AI is heavily invested in both industry and academy. However, there is also a lot of hype in the current AI debate. AI based on so-called deep learning has achieved impressive results in many problems, but its limits are already visible.… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: 234 pages. Published as an electronic publication at the University of Oulu, Finland, in December 2021, ISBN: 978-952-62-3199-0 link http://jultika.oulu.fi/Record/isbn978-952-62-3199-0

  9. arXiv:2110.05185  [pdf, other

    cs.LG eess.IV

    Dynamic Binary Neural Network by learning channel-wise thresholds

    Authors: Jiehua Zhang, Zhuo Su, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu

    Abstract: Binary neural networks (BNNs) constrain weights and activations to +1 or -1 with limited storage and computational cost, which is hardware-friendly for portable devices. Recently, BNNs have achieved remarkable progress and been adopted into various fields. However, the performance of BNNs is sensitive to activation distribution. The existing BNNs utilized the Sign function with predefined or learn… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: 5 pages, 3 figures

  10. arXiv:2108.07009  [pdf, other

    cs.CV

    Pixel Difference Networks for Efficient Edge Detection

    Authors: Zhuo Su, Wenzhe Liu, Zitong Yu, Dewen Hu, Qing Liao, Qi Tian, Matti Pietikäinen, Li Liu

    Abstract: Recently, deep Convolutional Neural Networks (CNNs) can achieve human-level performance in edge detection with the rich and abstract edge representation capacities. However, the high performance of CNN based edge detection is achieved with a large pretrained CNN backbone, which is memory and energy consuming. In addition, it is surprising that the previous wisdom from the traditional edge detector… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  11. arXiv:2101.10531  [pdf, other

    cs.CV

    Deep Learning for Scene Classification: A Survey

    Authors: Delu Zeng, Minyu Liao, Mohammad Tavakolian, Yulan Guo, Bolei Zhou, Dewen Hu, Matti Pietikäinen, Li Liu

    Abstract: Scene classification, aiming at classifying a scene image to one of the predefined scene categories by comprehending the entire image, is a longstanding, fundamental and challenging problem in computer vision. The rise of large-scale datasets, which constitute the corresponding dense sampling of diverse real-world scenes, and the renaissance of deep learning techniques, which learn powerful featur… ▽ More

    Submitted 19 February, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

  12. arXiv:2010.09294  [pdf, other

    cs.CV

    FTBNN: Rethinking Non-linearity for 1-bit CNNs and Going Beyond

    Authors: Zhuo Su, Linpu Fang, Deke Guo, Dewen Hu, Matti Pietikäinen, Li Liu

    Abstract: Binary neural networks (BNNs), where both weights and activations are binarized into 1 bit, have been widely studied in recent years due to its great benefit of highly accelerated computation and substantially reduced memory footprint that appeal to the development of resource constrained devices. In contrast to previous methods tending to reduce the quantization error for training BNN structures,… ▽ More

    Submitted 30 December, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Openreview: https://openreview.net/forum?id=9wHe4F-lpp

  13. arXiv:2007.04242  [pdf, other

    cs.CV

    Dynamic Group Convolution for Accelerating Convolutional Neural Networks

    Authors: Zhuo Su, Linpu Fang, Wenxiong Kang, Dewen Hu, Matti Pietikäinen, Li Liu

    Abstract: Replacing normal convolutions with group convolutions can significantly increase the computational efficiency of modern deep convolutional networks, which has been widely adopted in compact network architecture designs. However, existing group convolutions undermine the original network structures by cutting off some connections permanently resulting in significant accuracy degradation. In this pa… ▽ More

    Submitted 10 July, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: 21 pages, 10 figures

  14. arXiv:1907.09160  [pdf, other

    cs.CV

    Extended Local Binary Patterns for Efficient and Robust Spontaneous Facial Micro-Expression Recognition

    Authors: Chengyu Guo, **gyun Liang, Geng Zhan, Zhong Liu, Matti Pietikäinen, Li Liu

    Abstract: Facial Micro-Expressions (MEs) are spontaneous, involuntary facial movements when a person experiences an emotion but deliberately or unconsciously attempts to conceal his or her genuine emotions. Recently, ME recognition has attracted increasing attention due to its potential applications such as clinical diagnosis, business negotiation, interrogations, and security. However, it is expensive to b… ▽ More

    Submitted 17 September, 2019; v1 submitted 22 July, 2019; originally announced July 2019.

  15. arXiv:1809.03313  [pdf, other

    cs.CV

    A Global Alignment Kernel based Approach for Group-level Happiness Intensity Estimation

    Authors: Xiaohua Huang, Abhinav Dhall, Roland Goecke, Matti Pietikainen, Guoying Zhao

    Abstract: With the progress in automatic human behavior understanding, analysing the perceived affect of multiple people has been recieved interest in affective computing community. Unlike conventional facial expression analysis, this paper primarily focuses on analysing the behaviour of multiple people in an image. The proposed method is based on support vector regression with the combined global alignment… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

  16. arXiv:1809.02165  [pdf, other

    cs.CV

    Deep Learning for Generic Object Detection: A Survey

    Authors: Li Liu, Wanli Ouyang, Xiaogang Wang, Paul Fieguth, Jie Chen, Xinwang Liu, Matti Pietikäinen

    Abstract: Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful strategy for learning feature representations directly from data and have led to remarkable breakthroughs in the field of generic object detection. Given this p… ▽ More

    Submitted 22 August, 2019; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: IJCV Minor

  17. Texture Classification in Extreme Scale Variations using GANet

    Authors: Li Liu, Jie Chen, Guoying Zhao, Paul Fieguth, Xilin Chen, Matti Pietikäinen

    Abstract: Research in texture recognition often concentrates on recognizing textures with intraclass variations such as illumination, rotation, viewpoint and small scale changes. In contrast, in real-world applications a change in scale can have a dramatic impact on texture appearance, to the point of changing completely from one texture category to another. As a result, texture variations due to changes in… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

    Comments: submitted to IEEE Transactions on Image Processing

  18. arXiv:1801.10324  [pdf, other

    cs.CV cs.LG

    From BoW to CNN: Two Decades of Texture Representation for Texture Classification

    Authors: Li Liu, Jie Chen, Paul Fieguth, Guoying Zhao, Rama Chellappa, Matti Pietikainen

    Abstract: Texture is a fundamental characteristic of many types of images, and texture representation is one of the essential and challenging problems in computer vision and pattern recognition which has attracted extensive research attention. Since 2000, texture representations based on Bag of Words (BoW) and on Convolutional Neural Networks (CNNs) have been extensively studied with impressive performance.… ▽ More

    Submitted 3 October, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

    Comments: Accepted by IJCV

    MSC Class: 68T10

  19. arXiv:1710.00166  [pdf, other

    cs.CV

    PCANet-II: When PCANet Meets the Second Order Pooling

    Authors: Lei Tian, Xiaopeng Hong, Guoying Zhao, Chunxiao Fan, Yue Ming, Matti Pietikäinen

    Abstract: PCANet, as one noticeable shallow network, employs the histogram representation for feature pooling. However, there are three main problems about this kind of pooling method. First, the histogram-based pooling method binarizes the feature maps and leads to inevitable discriminative information loss. Second, it is difficult to effectively combine other visual cues into a compact representation, bec… ▽ More

    Submitted 30 September, 2017; originally announced October 2017.

  20. Two decades of local binary patterns: A survey

    Authors: Matti Pietikäinen, Guoying Zhao

    Abstract: Texture is an important characteristic for many types of images. In recent years very discriminative and computationally efficient local texture descriptors based on local binary patterns (LBP) have been developed, which has led to significant progress in applying texture methods to different problems and applications. Due to this progress, the division between texture descriptors and more generic… ▽ More

    Submitted 15 January, 2017; v1 submitted 20 December, 2016; originally announced December 2016.

    Comments: In Advances in Independent Component Analysis and Learning Machines, Academic Press, 2015, Pages 175-210

  21. arXiv:1610.03640  [pdf, other

    cs.CV

    Analyzing the Affect of a Group of People Using Multi-modal Framework

    Authors: Xiaohua Huang, Abhinav Dhall, Xin Liu, Guoying Zhao, **gang Shi, Roland Goecke, Matti Pietikainen

    Abstract: Millions of images on the web enable us to explore images from social events such as a family party, thus it is of interest to understand and model the affect exhibited by a group of people in images. But analysis of the affect expressed by multiple people is challenging due to varied indoor and outdoor settings, and interactions taking place between various numbers of people. A few existing works… ▽ More

    Submitted 13 October, 2016; v1 submitted 12 October, 2016; originally announced October 2016.

    Comments: 11 pages. Submitted to the IEEE Transactions on Cybernetics

  22. arXiv:1608.02255  [pdf, other

    cs.CV

    Spontaneous Facial Micro-Expression Recognition using Discriminative Spatiotemporal Local Binary Pattern with an Improved Integral Projection

    Authors: Xiaohua Huang, Su**g Wang, Xin Liu, Guoying Zhao, Xiaoyi Feng, Matti Pietikainen

    Abstract: Recently, there are increasing interests in inferring mirco-expression from facial image sequences. Due to subtle facial movement of micro-expressions, feature extraction has become an important and critical issue for spontaneous facial micro-expression recognition. Recent works usually used spatiotemporal local binary pattern for micro-expression analysis. However, the commonly used spatiotempora… ▽ More

    Submitted 7 August, 2016; originally announced August 2016.

    Comments: 13pages, 8 figures, 5 tables, submitted to IEEE Transactions on Image Processing

  23. arXiv:1604.04473  [pdf, ps, other

    cs.CV

    Probing the Intra-Component Correlations within Fisher Vector for Material Classification

    Authors: Xiaopeng Hong, Xianbiao Qi, Guoying Zhao, Matti Pietikäinen

    Abstract: Fisher vector (FV) has become a popular image representation. One notable underlying assumption of the FV framework is that local descriptors are well decorrelated within each cluster so that the covariance matrix for each Gaussian can be simplified to be diagonal. Though the FV usually relies on the Principal Component Analysis (PCA) to decorrelate local features, the PCA is applied to the entire… ▽ More

    Submitted 15 April, 2016; originally announced April 2016.

    Comments: It is manuscript submitted to Neurocomputing on the end of April, 2015 (!). One year past but no review comments we received yet!

  24. Towards Reading Hidden Emotions: A comparative Study of Spontaneous Micro-expression Spotting and Recognition Methods

    Authors: Xiaobai Li, Xiaopeng Hong, Antti Moilanen, Xiaohua Huang, Tomas Pfister, Guoying Zhao, Matti Pietikäinen

    Abstract: Micro-expressions (MEs) are rapid, involuntary facial expressions which reveal emotions that people do not intend to show. Studying MEs is valuable as recognizing them has many important applications, particularly in forensic science and psychotherapy. However, analyzing spontaneous MEs is very challenging due to their short duration and low intensity. Automatic ME analysis includes two tasks: ME… ▽ More

    Submitted 8 February, 2017; v1 submitted 2 November, 2015; originally announced November 2015.

  25. arXiv:1509.02320  [pdf, other

    cs.CV

    HEp-2 Cell Classification: The Role of Gaussian Scale Space Theory as A Pre-processing Approach

    Authors: Xianbiao Qi, Guoying Zhao, Jie Chen, Matti Pietikäinen

    Abstract: \textit{Indirect Immunofluorescence Imaging of Human Epithelial Type 2} (HEp-2) cells is an effective way to identify the presence of Anti-Nuclear Antibody (ANA). Most existing works on HEp-2 cell classification mainly focus on feature extraction, feature encoding and classifier design. Very few efforts have been devoted to study the importance of the pre-processing techniques. In this paper, we a… ▽ More

    Submitted 8 September, 2015; originally announced September 2015.

    Comments: 9 pages, 6 figures

  26. arXiv:1504.05809  [pdf, other

    cs.CV

    LOAD: Local Orientation Adaptive Descriptor for Texture and Material Classification

    Authors: Xianbiao Qi, Guoying Zhao, Linlin Shen, Qingquan Li, Matti Pietikainen

    Abstract: In this paper, we propose a novel local feature, called Local Orientation Adaptive Descriptor (LOAD), to capture regional texture in an image. In LOAD, we proposed to define point description on an Adaptive Coordinate System (ACS), adopt a binary sequence descriptor to capture relationships between one point and its neighbors and use multi-scale strategy to enhance the discriminative power of the… ▽ More

    Submitted 22 April, 2015; originally announced April 2015.

    Comments: 13 pages, 7 figures

  27. arXiv:1502.04658  [pdf, ps, other

    cs.CV

    HEp-2 Cell Classification via Fusing Texture and Shape Information

    Authors: Xianbiao Qi, Guoying Zhao, Chun-Guang Li, Jun Guo, Matti Pietikäinen

    Abstract: Indirect Immunofluorescence (IIF) HEp-2 cell image is an effective evidence for diagnosis of autoimmune diseases. Recently computer-aided diagnosis of autoimmune diseases by IIF HEp-2 cell classification has attracted great attention. However the HEp-2 cell classification task is quite challenging due to large intra-class variation and small between-class variation. In this paper we propose an eff… ▽ More

    Submitted 16 February, 2015; originally announced February 2015.

    Comments: 11 pages, 7 figures

  28. arXiv:1502.00303  [pdf, ps, other

    cs.CV

    Dynamic texture and scene classification by transferring deep image features

    Authors: Xianbiao Qi, Chun-Guang Li, Guoying Zhao, Xiaopeng Hong, Matti Pietikäinen

    Abstract: Dynamic texture and scene classification are two fundamental problems in understanding natural video content. Extracting robust and effective features is a crucial step towards solving these problems. However the existing approaches suffer from the sensitivity to either varying illumination, or viewpoint changing, or even camera motion, and/or the lack of spatial information. Inspired by the succe… ▽ More

    Submitted 1 February, 2015; originally announced February 2015.