Showing 1–17 of 17 results for author: Lew, M

Searching in archive cs.
  1. arXiv:2202.13137

    cs.CV cs.RO

    RONELDv2: A faster, improved lane tracking method

    Authors: Zhe Ming Chng, Joseph Mun Hung Lew, Jimmy Addison Lee

    Abstract: Lane detection is an integral part of control systems in autonomous vehicles and lane departure warning systems as lanes are a key component of the operating environment for road vehicles. In a previous paper, a robust neural network output enhancement for active lane detection (RONELD) method augmenting deep learning lane detection models to improve active, or ego, lane accuracy performance was p…

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: 9 pages, 8 figures, 6 tables

  2. arXiv:2104.04991

    cs.CV

    Integrating Information Theory and Adversarial Learning for Cross-modal Retrieval

    Authors: Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

    Abstract: Accurately matching visual and textual data in cross-modal retrieval has been widely studied in the multimedia community. To address these challenges posited by the heterogeneity gap and the semantic gap, we propose integrating Shannon information theory and adversarial learning. In terms of the heterogeneity gap, we integrate modality classification and information entropy maximization adversaria…

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted by Pattern Recognition

  3. arXiv:2103.12462

    cs.CV

    Lifelong Person Re-Identification via Adaptive Knowledge Accumulation

    Authors: Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

    Abstract: Person ReID methods always learn through a stationary domain that is fixed by the choice of a given dataset. In many contexts (e.g., lifelong learning), those methods are ineffective because the domain is continually changing in which case incremental learning over multiple domains is required potentially. In this work we explore a new and challenging ReID task, namely lifelong person re-identific…

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: 10 pages, 5 figures, Accepted by CVPR2021

  4. arXiv:2103.06583

    cs.CV

    Preprint: Norm Loss: An efficient yet effective regularization method for deep neural networks

    Authors: Theodoros Georgiou, Sebastian Schmitt, Thomas Bäck, Wei Chen, Michael Lew

    Abstract: Convolutional neural network training can suffer from diverse issues like exploding or vanishing gradients, scaling-based weight space symmetry and covariant-shift. In order to address these issues, researchers develop weight regularization methods and activation normalization methods. In this work we propose a weight soft-regularization method based on the Oblique manifold. The proposed method us…

    Submitted 11 March, 2021; originally announced March 2021.

    Journal ref: Proceedings of the International Conference on Pattern Recognition (ICPR) 2020

  5. arXiv:2103.06552

    cs.CV

    PREPRINT: Comparison of deep learning and hand crafted features for mining simulation data

    Authors: Theodoros Georgiou, Sebastian Schmitt, Thomas Bäck, Nan Pu, Wei Chen, Michael Lew

    Abstract: Computational Fluid Dynamics (CFD) simulations are a very important tool for many industrial applications, such as aerodynamic optimization of engineering designs like cars shapes, airplanes parts etc. The output of such simulations, in particular the calculated flow fields, are usually very complex and hard to interpret for realistic three-dimensional real-world applications, especially if time-d…

    Submitted 11 March, 2021; originally announced March 2021.

    Journal ref: Proceedings of the International Conference on Pattern Recognition (ICPR) 2020

  6. arXiv:2101.11282

    cs.CV

    Deep Learning for Instance Retrieval: A Survey

    Authors: Wei Chen, Yu Liu, Weiping Wang, Erwin Bakker, Theodoros Georgiou, Paul Fieguth, Li Liu, Michael S. Lew

    Abstract: In recent years a vast amount of visual content has been generated and shared from many fields, such as social media platforms, medical imaging, and robotics. This abundance of content creation and sharing has introduced new challenges, particularly that of searching databases for similar content-Content Based Image Retrieval (CBIR)-a long-established research area in which improved efficiency and…

    Submitted 30 October, 2022; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence

  7. arXiv:2010.09548

    cs.CV

    RONELD: Robust Neural Network Output Enhancement for Active Lane Detection

    Authors: Zhe Ming Chng, Joseph Mun Hung Lew, Jimmy Addison Lee

    Abstract: Accurate lane detection is critical for navigation in autonomous vehicles, particularly the active lane which demarcates the single road space that the vehicle is currently traveling on. Recent state-of-the-art lane detection algorithms utilize convolutional neural networks (CNNs) to train deep learning models on popular benchmarks such as TuSimple and CULane. While each of these models works part…

    Submitted 2 November, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Fixed typos; Accepted at ICPR 2020, 8 pages, 6 figures, code to be published at http://github.com/czming/RONELD-Lane-Detection

  8. arXiv:2010.08189

    cs.CV

    New Ideas and Trends in Deep Multimodal Content Understanding: A Review

    Authors: Wei Chen, Weiping Wang, Li Liu, Michael S. Lew

    Abstract: The focus of this survey is on the analysis of two modalities of multimodal deep learning: image and text. Unlike classic reviews of deep learning where monomodal image classifiers such as VGG, ResNet and Inception module are central topics, this paper will examine recent multimodal deep models and structures, including auto-encoders, generative adversarial nets and their variants. These models go…

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: Accepted by Neurocomputing

  9. arXiv:2010.08020

    cs.CV

    On the Exploration of Incremental Learning for Fine-grained Image Retrieval

    Authors: Wei Chen, Yu Liu, Weiping Wang, Tinne Tuytelaars, Erwin M. Bakker, Michael Lew

    Abstract: In this paper, we consider the problem of fine-grained image retrieval in an incremental setting, when new categories are added over time. On the one hand, repeatedly training the representation on the extended dataset is time-consuming. On the other hand, fine-tuning the learned representation only with the new classes leads to catastrophic forgetting. To this end, we propose an incremental learn…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: BMVC2020

  10. arXiv:2008.02520

    cs.CV

    Dual Gaussian-based Variational Subspace Disentanglement for Visible-Infrared Person Re-Identification

    Authors: Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

    Abstract: Visible-infrared person re-identification (VI-ReID) is a challenging and essential task in night-time intelligent surveillance systems. Except for the intra-modality variance that RGB-RGB person re-identification mainly overcomes, VI-ReID suffers from additional inter-modality variance caused by the inherent heterogeneous gap. To solve the problem, we present a carefully designed dual Gaussian-bas…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted by ACM MM 2020 poster. 12 pages, 10 appendixes

  11. A Comparison of CNN and Classic Features for Image Retrieval

    Authors: Umut Özaydın, Theodoros Georgiou, Michael Lew

    Abstract: Feature detectors and descriptors have been successfully used for various computer vision tasks, such as video object tracking and content-based image retrieval. Many methods use image gradients in different stages of the detection-description pipeline to describe local image structures. Recently, some, or all, of these stages have been replaced by convolutional neural networks (CNNs), in order to…

    Submitted 25 August, 2019; originally announced August 2019.

    Comments: 5 pages, 3 figures, 3 tables, CBMI 2019

  12. arXiv:1611.05503

    cs.CV

    On the Exploration of Convolutional Fusion Networks for Visual Recognition

    Authors: Yu Liu, Yanming Guo, Michael S. Lew

    Abstract: Despite recent advances in multi-scale deep representations, their limitations are attributed to expensive parameters and weak fusion modules. Hence, we propose an efficient approach to fuse multi-scale deep representations, called convolutional fusion networks (CFN). Owing to using 1$\times$1 convolution and global average pooling, CFN can efficiently generate the side branches while adding few p…

    Submitted 16 November, 2016; originally announced November 2016.

    Comments: 23rd International Conference on MultiMedia Modeling (MMM 2017)

  13. arXiv:1101.0243

    cs.GR

    Across Browsers SVG Implementation

    Authors: Liang Wang, Nies Huijsmans, Michael S. Lew, Dan Tsymbala

    Abstract: In this work SVG will be translated into VML or HTML by using Javascript based on Backbase Client Framework. The target of this project is to implement SVG to be viewed in Internet Explorer without any plug-in and work together with other Backbase Client Framework languages. The result of this project will be added as an extension to the current Backbase Client Framework.

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20090402

  14. arXiv:1101.0242

    cs.CV

    Binary and nonbinary description of hypointensity in human brain MR images

    Authors: Xiaojing Chen, Michael S. Lew

    Abstract: Accumulating evidence has shown that iron is involved in the mechanism underlying many neurodegenerative diseases, such as Alzheimer's disease, Parkinson's disease and Huntington's disease. Abnormal (higher) iron accumulation has been detected in the brains of most neurodegenerative patients, especially in the basal ganglia region. Presence of iron leads to changes in MR signal in both magnitude a…

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20080101

  15. arXiv:1101.0237

    cs.CV

    A Framework for Real-Time Face and Facial Feature Tracking using Optical Flow Pre-estimation and Template Tracking

    Authors: E. R. Gast, Michael S. Lew

    Abstract: This work presents a framework for tracking head movements and capturing the movements of the mouth and both the eyebrows in real-time. We present a head tracker which is a combination of a optical flow and a template based tracker. The estimation of the optical flow head tracker is used as starting point for the template tracker which fine-tunes the head estimation. This approach together with re…

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20100401

  16. arXiv:1101.0235

    cs.HC

    Analysis of Using Browser-native Technology to Build Rich Internet Applications for Image Manipulation

    Authors: Thomas Steenbergen, Michael S. Lew

    Abstract: In this work we investigate whether browser-native technologies can be used to perform photo manipulation tasks e.g. cropping, resizing or rotating an image within the current mainstream browser. By the use of a case study we will analyze problems that have occurred during the implementation of a prototype web application that utilizes browser-native web technology in order to create an online vers…

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20090901

  17. arXiv:1101.0234

    cs.HC

    Dynamic Feature Description in Human Action Recognition

    Authors: Ruoyun Gao, Michael S. Lew, Ling Shao

    Abstract: This work aims to present novel description methods for human action recognition. Generally, a video sequence can be represented as a collection of spatial temporal words by detecting space-time interest points and describing the unique features around the detected points (Bag of Words representation). Interest points as well as the cuboids around them are considered informative for feature descri…

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20090701