Skip to main content

Showing 1–18 of 18 results for author: Iranmanesh, S M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.16476  [pdf, other

    cs.CV

    Pair DETR: Contrastive Learning Speeds Up DETR Training

    Authors: Seyed Mehdi Iranmanesh, Xiaotong Chen, Kuo-Chin Lien

    Abstract: The DETR object detection approach applies the transformer encoder and decoder architecture to detect objects and achieves promising performance. In this paper, we present a simple approach to address the main problem of DETR, the slow convergence, by using representation learning technique. In this approach, we detect an object bounding box as a pair of keypoints, the top-left corner and the cent… ▽ More

    Submitted 11 November, 2022; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2108.06152

  2. arXiv:2201.00080  [pdf, other

    cs.CV

    PatchTrack: Multiple Object Tracking Using Frame Patches

    Authors: Xiaotong Chen, Seyed Mehdi Iranmanesh, Kuo-Chin Lien

    Abstract: Object motion and object appearance are commonly used information in multiple object tracking (MOT) applications, either for associating detections across frames in tracking-by-detection methods or direct track predictions for joint-detection-and-tracking methods. However, not only are these two types of information often considered separately, but also they do not help optimize the usage of visua… ▽ More

    Submitted 31 December, 2021; originally announced January 2022.

    Comments: 11 pages, 4 figures, 2 tables

    MSC Class: ACM-class: I.4.8

  3. arXiv:2112.05827  [pdf, other

    cs.CV cs.LG

    Quality-Aware Multimodal Biometric Recognition

    Authors: Sobhan Soleymani, Ali Dabouei, Fariborz Taherkhani, Seyed Mehdi Iranmanesh, Jeremy Dawson, Nasser M. Nasrabadi

    Abstract: We present a quality-aware multimodal recognition framework that combines representations from multiple biometric traits with varying quality and number of samples to achieve increased recognition accuracy by extracting complimentary identification information based on the quality of the samples. We develop a quality-aware framework for fusing representations of input modalities by weighting their… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: IEEE Transactions on Biometrics, Behavior, and Identity Science

  4. arXiv:2102.03710  [pdf, other

    cs.CV cs.LG

    HGAN: Hybrid Generative Adversarial Network

    Authors: Seyed Mehdi Iranmanesh, Nasser M. Nasrabadi

    Abstract: In this paper, we present a simple approach to train Generative Adversarial Networks (GANs) in order to avoid a \textit {mode collapse} issue. Implicit models such as GANs tend to generate better samples compared to explicit models that are trained on tractable data likelihood. However, GANs overlook the explicit data density characteristics which leads to undesirable quantitative evaluations and… ▽ More

    Submitted 6 February, 2021; originally announced February 2021.

  5. arXiv:2009.01972  [pdf, other

    cs.CV

    Attribute Adaptive Margin Softmax Loss using Privileged Information

    Authors: Seyed Mehdi Iranmanesh, Ali Dabouei, Nasser M. Nasrabadi

    Abstract: We present a novel framework to exploit privileged information for recognition which is provided only during the training phase. Here, we focus on recognition task where images are provided as the main view and soft biometric traits (attributes) are provided as the privileged data (only available during training phase). We demonstrate that more discriminative feature space can be learned by enforc… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

  6. arXiv:2001.03113  [pdf, other

    cs.CV

    Robust Facial Landmark Detection via Aggregation on Geometrically Manipulated Faces

    Authors: Seyed Mehdi Iranmanesh, Ali Dabouei, Sobhan Soleymani, Hadi Kazemi, Nasser M. Nasrabadi

    Abstract: In this work, we present a practical approach to the problem of facial landmark detection. The proposed method can deal with large shape and appearance variations under the rich shape deformation. To handle the shape variations we equip our method with the aggregation of manipulated face images. The proposed framework generates different manipulated faces using only one given face image. The appro… ▽ More

    Submitted 7 January, 2020; originally announced January 2020.

  7. arXiv:1911.12451  [pdf, other

    cs.CV

    Empirical Upper Bound in Object Detection and More

    Authors: Ali Borji, Seyed Mehdi Iranmanesh

    Abstract: Object detection remains as one of the most notorious open problems in computer vision. Despite large strides in accuracy in recent years, modern object detectors have started to saturate on popular benchmarks raising the question of how far we can reach with deep learning tools and tricks. Here, by employing 2 state-of-the-art object detection benchmarks, and analyzing more than 15 models over 4… ▽ More

    Submitted 16 December, 2019; v1 submitted 27 November, 2019; originally announced November 2019.

  8. arXiv:1907.11980  [pdf, other

    cs.CV

    Attribute-Guided Deep Polarimetric Thermal-to-visible Face Recognition

    Authors: Seyed Mehdi Iranmanesh, Nasser M. Nasrabadi

    Abstract: In this paper, we present an attribute-guided deep coupled learning framework to address the problem of matching polarimetric thermal face photos against a gallery of visible faces. The coupled framework contains two sub-networks, one dedicated to the visible spectrum and the second sub-network dedicated to the polarimetric thermal spectrum. Each sub-network is made of a generative adversarial net… ▽ More

    Submitted 27 July, 2019; originally announced July 2019.

  9. arXiv:1811.11979  [pdf, other

    cs.CV

    Unsupervised Image-to-Image Translation Using Domain-Specific Variational Information Bound

    Authors: Hadi Kazemi, Sobhan Soleymani, Fariborz Taherkhani, Seyed Mehdi Iranmanesh, Nasser M. Nasrabadi

    Abstract: Unsupervised image-to-image translation is a class of computer vision problems which aims at modeling conditional distribution of images in the target domain, given a set of unpaired images in the source and target domains. An image in the source domain might have multiple representations in the target domain. Therefore, ambiguity in modeling of the conditional distribution arises, specially when… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: NIPS 2018

  10. arXiv:1811.05621  [pdf, other

    cs.CV

    Style and Content Disentanglement in Generative Adversarial Networks

    Authors: Hadi Kazemi, Seyed Mehdi Iranmanesh, Nasser M. Nasrabadi

    Abstract: Disentangling factors of variation within data has become a very challenging problem for image generation tasks. Current frameworks for training a Generative Adversarial Network (GAN), learn to disentangle the representations of the data in an unsupervised fashion and capture the most significant factors of the data variations. However, these approaches ignore the principle of content and style di… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

    Comments: WACV 2019

  11. arXiv:1808.01026  [pdf, other

    eess.AS cs.CV cs.LG cs.SD

    Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification

    Authors: Sobhan Soleymani, Ali Dabouei, Seyed Mehdi Iranmanesh, Hadi Kazemi, Jeremy Dawson, Nasser M. Nasrabadi

    Abstract: In this paper a novel cross-device text-independent speaker verification architecture is proposed. Majority of the state-of-the-art deep architectures that are used for speaker verification tasks consider Mel-frequency cepstral coefficients. In contrast, our proposed Siamese convolutional neural network architecture uses Mel-frequency spectrogram coefficients to benefit from the dependency of the… ▽ More

    Submitted 31 July, 2018; originally announced August 2018.

    Comments: Accepted in 9th IEEE International Conference on Biometrics: Theory, Applications, and Systems (BTAS 2018)

  12. arXiv:1808.00059  [pdf, other

    cs.CV

    Deep Sketch-Photo Face Recognition Assisted by Facial Attributes

    Authors: Seyed Mehdi Iranmanesh, Hadi Kazemi, Sobhan Soleymani, Ali Dabouei, Nasser M. Nasrabadi

    Abstract: In this paper, we present a deep coupled framework to address the problem of matching sketch image against a gallery of mugshots. Face sketches have the essential in- formation about the spatial topology and geometric details of faces while missing some important facial attributes such as ethnicity, hair, eye, and skin color. We propose a cou- pled deep neural network architecture which utilizes f… ▽ More

    Submitted 31 July, 2018; originally announced August 2018.

  13. arXiv:1808.00035  [pdf, other

    cs.CV

    ID Preserving Generative Adversarial Network for Partial Latent Fingerprint Reconstruction

    Authors: Ali Dabouei, Sobhan Soleymani, Hadi Kazemi, Seyed Mehdi Iranmanesh, Jeremy Dawson, Nasser M. Nasrabadi

    Abstract: Performing recognition tasks using latent fingerprint samples is often challenging for automated identification systems due to poor quality, distortion, and partially missing information from the input samples. We propose a direct latent fingerprint reconstruction model based on conditional generative adversarial networks (cGANs). Two modifications are applied to the cGAN to adapt it for the task… ▽ More

    Submitted 31 July, 2018; originally announced August 2018.

    Comments: Accepted in BTAS 2018

  14. arXiv:1801.01486  [pdf, other

    cs.CV

    Deep Cross Polarimetric Thermal-to-visible Face Recognition

    Authors: Seyed Mehdi Iranmanesh, Ali Dabouei, Hadi Kazemi, Nasser M. Nasrabadi

    Abstract: In this paper, we present a deep coupled learning frame- work to address the problem of matching polarimetric ther- mal face photos against a gallery of visible faces. Polariza- tion state information of thermal faces provides the miss- ing textural and geometrics details in the thermal face im- agery which exist in visible spectrum. we propose a coupled deep neural network architecture which leve… ▽ More

    Submitted 4 January, 2018; originally announced January 2018.

  15. arXiv:1801.01198  [pdf, other

    cs.CV

    Fingerprint Distortion Rectification using Deep Convolutional Neural Networks

    Authors: Ali Dabouei, Hadi Kazemi, Seyed Mehdi Iranmanesh, Jeremi Dawson, Nasser M. Nasrabadi

    Abstract: Elastic distortion of fingerprints has a negative effect on the performance of fingerprint recognition systems. This negative effect brings inconvenience to users in authentication applications. However, in the negative recognition scenario where users may intentionally distort their fingerprints, this can be a serious problem since distortion will prevent recognition system from identifying malic… ▽ More

    Submitted 3 January, 2018; originally announced January 2018.

    Comments: Accepted at ICB 2018

  16. arXiv:1711.02536  [pdf, ps, other

    cs.CV

    Few-Shot Adversarial Domain Adaptation

    Authors: Saeid Motiian, Quinn Jones, Seyed Mehdi Iranmanesh, Gianfranco Doretto

    Abstract: This work provides a framework for addressing the problem of supervised domain adaptation with deep models. The main idea is to exploit adversarial learning to learn an embedded subspace that simultaneously maximizes the confusion between two domains while semantically aligning their embedding. The supervised setting becomes attractive especially when there are only a few target data samples that… ▽ More

    Submitted 5 November, 2017; originally announced November 2017.

    Comments: Accepted to NIPS 2017. arXiv admin note: text overlap with arXiv:1709.10190

  17. 3D Convolutional Neural Networks for Cross Audio-Visual Matching Recognition

    Authors: Amirsina Torfi, Seyed Mehdi Iranmanesh, Nasser M. Nasrabadi, Jeremy Dawson

    Abstract: Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the extracted information from one modality to improve the recognition ability of the other modality by complementing the missing informa… ▽ More

    Submitted 12 August, 2017; v1 submitted 18 June, 2017; originally announced June 2017.

    Journal ref: IEEE Access (Year: 2017, Volume: PP, Issue: 99 )

  18. Polar Coding for Achieving the Capacity of Marginal Channels in Nonbinary-Input Setting

    Authors: Amirsina Torfi, Sobhan Soleymani, Seyed Mehdi Iranmanesh, Hadi Kazemi, Rouzbeh Asghari Shirvani, Vahid Tabataba Vakili

    Abstract: Achieving information-theoretic security using explicit coding scheme in which unlimited computational power for eavesdropper is assumed, is one of the main topics is security consideration. It is shown that polar codes are capacity achieving codes and have a low complexity in encoding and decoding. It has been proven that polar codes reach to secrecy capacity in the binary-input wiretap channels… ▽ More

    Submitted 6 February, 2017; v1 submitted 20 January, 2017; originally announced January 2017.

    Comments: Accepted to be published in "51th Conference on Information Sciences and Systems", Baltimore, Maryland

    Journal ref: 51th Annual Conference on Information Sciences and Systems (CISS), 1-6 (2017)