Showing 1–8 of 8 results for author: Pranata, S

  1. arXiv:2402.03610  [pdf, other]

    cs.LG cs.AI cs.CL

    RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents

    Authors: Tomoyuki Kagaya, Thong Jing Yuan, Yuxuan Lou, Jayashree Karlekar, Sugiri Pranata, Akira Kinose, Koki Oguri, Felix Wick, Yang You

    Abstract: Owing to recent advancements, Large Language Models (LLMs) can now be deployed as agents for increasingly complex decision-making applications in areas including robotics, gaming, and API integration. However, reflecting past experiences in current decision-making processes, an innate human behavior, continues to pose significant challenges. Addressing this, we propose Retrieval-Augmented Planning…

    Submitted 5 February, 2024; originally announced February 2024.

  2. arXiv:2310.14652  [pdf, other]

    cs.CV

    Invariant Feature Regularization for Fair Face Recognition

    Authors: Jiali Ma, Zhongqi Yue, Kagaya Tomoyuki, Suzuki Tomoki, Karlekar Jayashree, Sugiri Pranata, Hanwang Zhang

    Abstract: Fair face recognition is all about learning invariant feature that generalizes to unseen faces in any demographic group. Unfortunately, face datasets inevitably capture the imbalanced demographic attributes that are ubiquitous in real-world observations, and the model learns biased feature that generalizes poorly in the minority group. We point out that the bias arises due to the confounding demog…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted by International Conference on Computer Vision (ICCV) 2023

  3. arXiv:2207.12258  [pdf, other]

    cs.CV cs.AI cs.LG

    Equivariance and Invariance Inductive Bias for Learning from Insufficient Data

    Authors: Tan Wang, Qianru Sun, Sugiri Pranata, Karlekar Jayashree, Hanwang Zhang

    Abstract: We are interested in learning robust models from insufficient data, without the need for any externally pre-trained checkpoints. First, compared to sufficient data, we show why insufficient data renders the model more easily biased to the limited training environments that are usually different from testing. For example, if all the training swan samples are "white", the model may wrongly use the "…

    Submitted 6 September, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022. Code is available on GitHub: https://github.com/Wangt-CN/EqInv

  4. arXiv:1906.00619  [pdf, other]

    cs.CV

    Deep Face Recognition Model Compression via Knowledge Transfer and Distillation

    Authors: Jayashree Karlekar, Jiashi Feng, Zi Sian Wong, Sugiri Pranata

    Abstract: Fully convolutional networks (FCNs) have become de facto tool to achieve very high-level performance for many vision and non-vision tasks in general and face recognition in particular. Such high-level accuracies are normally obtained by very deep networks or their ensemble. However, deploying such high performing models to resource constraint devices or real-time applications is challenging. In th…

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: 7 pages, 5 figures

  5. arXiv:1902.06924  [pdf]

    cs.CV

    Anomaly Detection with Adversarial Dual Autoencoders

    Authors: Ha Son Vu, Daisuke Ueta, Kiyoshi Hashimoto, Kazuki Maeno, Sugiri Pranata, Sheng Mei Shen

    Abstract: Semi-supervised and unsupervised Generative Adversarial Networks (GAN)-based methods have been gaining popularity in anomaly detection task recently. However, GAN training is somewhat challenging and unstable. Inspired from previous work in GAN-based image generation, we introduce a GAN-based anomaly detection framework - Adversarial Dual Autoencoders (ADAE) - consists of two autoencoders as gener…

    Submitted 19 February, 2019; originally announced February 2019.

  6. arXiv:1809.00338  [pdf, other]

    cs.CV cs.LG stat.ML

    Look Across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition

    Authors: Jian Zhao, Yu Cheng, Yi Cheng, Yang Yang, Haochong Lan, Fang Zhao, Lin Xiong, Yan Xu, Jianshu Li, Sugiri Pranata, Shengmei Shen, Junliang Xing, Hengzhu Liu, Shuicheng Yan, Jiashi Feng

    Abstract: Despite the remarkable progress in face recognition related technologies, reliably recognizing faces across ages still remains a big challenge. The appearance of a human face changes substantially over time, resulting in significant intra-class variations. As opposed to current techniques for age-invariant face recognition, which either directly extract age-invariant features for recognition, or f…

    Submitted 3 October, 2018; v1 submitted 2 September, 2018; originally announced September 2018.

  7. arXiv:1803.10630  [pdf, other]

    cs.CV

    Person re-identification with fusion of hand-crafted and deep pose-based body region features

    Authors: Jubin Johnson, Shunsuke Yasugi, Yoichi Sugino, Sugiri Pranata, Shengmei Shen

    Abstract: Person re-identification (re-ID) aims to accurately retrieve a person from a large-scale database of images captured across multiple cameras. Existing works learn deep representations using a large training subset of unique persons. However, identifying unseen persons is critical for a good re-ID algorithm. Moreover, the misalignment between person crops to detection errors or pose variati…

    Submitted 27 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1711.08184, arXiv:1707.00798 by other authors

  8. arXiv:1704.00438  [pdf, other]

    cs.CV

    A Good Practice Towards Top Performance of Face Recognition: Transferred Deep Feature Fusion

    Authors: Lin Xiong, Jayashree Karlekar, Jian Zhao, Yi Cheng, Yan Xu, Jiashi Feng, Sugiri Pranata, Shengmei Shen

    Abstract: Unconstrained face recognition performance evaluations have traditionally focused on Labeled Faces in the Wild (LFW) dataset for imagery and the YouTubeFaces (YTF) dataset for videos in the last couple of years. Spectacular progress in this field has resulted in saturation on verification and identification accuracies for those benchmark datasets. In this paper, we propose a unified learning frame…

    Submitted 9 February, 2018; v1 submitted 3 April, 2017; originally announced April 2017.

    Comments: 13 pages, 10 figures