Showing 1–14 of 14 results for author: Sim, T

  1. arXiv:2403.06381  [pdf, other]

    cs.CV

    Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

    Authors: Yang Zhang, Teoh Tze Tzun, Lim Wei Hern, Tiviatis Sim, Kenji Kawaguchi

    Abstract: Recent advancements in diffusion models have notably improved the perceptual quality of generated images in text-to-image synthesis tasks. However, diffusion models often struggle to produce images that accurately reflect the intended semantics of the associated text prompts. We examine cross-attention layers in diffusion models and observe a propensity for these layers to disproportionately focus…

    Submitted 10 March, 2024; originally announced March 2024.

  2. arXiv:2401.16559  [pdf, other]

    cs.CV

    IEEE BigData 2023 Keystroke Verification Challenge (KVC)

    Authors: Giuseppe Stragapede, Ruben Vera-Rodriguez, Ruben Tolosana, Aythami Morales, Ivan DeAndres-Tame, Naser Damer, Julian Fierrez, Javier Ortega-Garcia, Nahuel Gonzalez, Andrei Shadrikov, Dmitrii Gordin, Leon Schmitt, Daniel Wimmer, Christoph Grossmann, Joerdis Krieger, Florian Heinz, Ron Krestel, Christoffer Mayer, Simon Haberl, Helena Gschrey, Yosuke Yamagishi, Sanjay Saha, Sanka Rasnayaka, Sandareka Wickramanayake, Terence Sim, et al. (4 additional authors not shown)

    Abstract: This paper describes the results of the IEEE BigData 2023 Keystroke Verification Challenge (KVC), that considers the biometric verification performance of Keystroke Dynamics (KD), captured as tweet-long sequences of variable transcript text from over 185,000 subjects. The data are obtained from two of the largest public databases of KD up to date, the Aalto Desktop and Mobile Keystroke Databases,…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 9 pages, 10 tables, 2 figures. arXiv admin note: text overlap with arXiv:2311.06000

  3. arXiv:2305.06564  [pdf, other]

    cs.CV

    Undercover Deepfakes: Detecting Fake Segments in Videos

    Authors: Sanjay Saha, Rashindrie Perera, Sachith Seneviratne, Tamasha Malepathirana, Sanka Rasnayaka, Deshani Geethika, Terence Sim, Saman Halgamuge

    Abstract: The recent renaissance in generative models, driven primarily by the advent of diffusion models and iterative improvement in GAN methods, has enabled many creative applications. However, each advancement is also accompanied by a rise in the potential for misuse. In the arena of the deepfake generation, this is a key societal issue. In particular, the ability to modify segments of videos using such…

    Submitted 24 August, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: ICCV 2023 Workshop and Challenge on DeepFake Analysis and Detection

  4. Is Face Recognition Safe from Realizable Attacks?

    Authors: Sanjay Saha, Terence Sim

    Abstract: Face recognition is a popular form of biometric authentication and due to its widespread use, attacks have become more common as well. Recent studies show that Face Recognition Systems are vulnerable to attacks and can lead to erroneous identification of faces. Interestingly, most of these attacks are white-box, or they are manipulating facial images in ways that are not physically realizable. In…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 2020 IEEE International Joint Conference on Biometrics (IJCB)

    Journal ref: 2020 IEEE International Joint Conference on Biometrics (IJCB), Houston, TX, USA, 2020

  5. arXiv:2202.03639  [pdf, ps, other]

    cs.LG

    Contrastive predictive coding for Anomaly Detection in Multi-variate Time Series Data

    Authors: Theivendiram Pranavan, Terence Sim, Arulmurugan Ambikapathi, Savitha Ramasamy

    Abstract: Anomaly detection in multi-variate time series (MVTS) data is a huge challenge as it requires simultaneous representation of long term temporal dependencies and correlations across multiple variables. More often, this is solved by breaking the complexity through modeling one dependency at a time. In this paper, we propose a Time-series Representational Learning through Contrastive Predictive Codin…

    Submitted 7 February, 2022; originally announced February 2022.

  6. arXiv:2106.05201  [pdf, ps, other]

    math.ST math.PR

    General-order observation-driven models: ergodicity and consistency of the maximum likelihood estimator

    Authors: Tepmony Sim, Randal Douc, François Roueff

    Abstract: The class of observation-driven models (ODMs) includes many models of non-linear time series which, in a fashion similar to, yet different from, hidden Markov models (HMMs), involve hidden variables. Interestingly, in contrast to most HMMs, ODMs enjoy likelihoods that can be computed exactly with computational complexity of the same order as the number of observations, making maximum likelihood es…

    Submitted 8 June, 2021; originally announced June 2021.

  7. arXiv:1904.02893  [pdf, ps, other]

    math.ST

    Necessary and sufficient conditions for the identifiability of observation-driven models

    Authors: François Roueff, Randal Douc, Tepmony Sim

    Abstract: In this contribution we are interested in proving that a given observation-driven model is identifiable. In the case of a GARCH(p, q) model, a simple sufficient condition has been established in [1] for showing the consistency of the quasi-maximum likelihood estimator. It turns out that this condition applies for a much larger class of observation-driven models, that we call the class of linearly…

    Submitted 12 May, 2020; v1 submitted 5 April, 2019; originally announced April 2019.

  8. Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing

    Authors: Jian Zhao, Jianshu Li, Yu Cheng, Li Zhou, Terence Sim, Shuicheng Yan, Jiashi Feng

    Abstract: Despite the noticeable progress in perceptual tasks like detection, instance segmentation and human parsing, computers still perform unsatisfactorily on visually understanding humans in crowded scenes, such as group behavior analysis, person re-identification and autonomous driving, etc. To this end, models need to comprehensively perceive the semantic information and the differences between insta…

    Submitted 6 July, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: The first three authors are with equal contributions

  9. arXiv:1711.06055  [pdf, other]

    cs.CV

    Integrated Face Analytics Networks through Cross-Dataset Hybrid Training

    Authors: Jianshu Li, Shengtao Xiao, Fang Zhao, Jian Zhao, Jianan Li, Jiashi Feng, Shuicheng Yan, Terence Sim

    Abstract: Face analytics benefits many multimedia applications. It consists of a number of tasks, such as facial emotion recognition and face parsing, and most existing approaches generally treat these tasks independently, which limits their deployment in real scenarios. In this paper we propose an integrated Face Analytics Network (iFAN), which is able to perform multiple tasks jointly for face analytics w…

    Submitted 16 November, 2017; originally announced November 2017.

    Comments: 10 pages

  10. arXiv:1705.07206  [pdf, other]

    cs.CV

    Multiple-Human Parsing in the Wild

    Authors: Jianshu Li, Jian Zhao, Yunchao Wei, Congyan Lang, Yidong Li, Terence Sim, Shuicheng Yan, Jiashi Feng

    Abstract: Human parsing is attracting increasing research attention. In this work, we aim to push the frontier of human parsing by introducing the problem of multi-human parsing in the wild. Existing works on human parsing mainly tackle single-person scenarios, which deviates from real-world applications where multiple persons are present simultaneously with interaction and occlusion. To address the multi-h…

    Submitted 14 March, 2018; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: The first two authors are with equal contribution

  11. arXiv:1509.09048  [pdf, ps, other]

    math.ST

    The maximizing set of the asymptotic normalized log-likelihood for partially observed Markov chains

    Authors: Randal Douc, Francois Roueff, Tepmony Sim

    Abstract: This paper deals with a parametrized family of partially observed bivariate Markov chains. We establish that, under very mild assumptions, the limit of the normalized log-likelihood function is maximized when the parameters belong to the equivalence class of the true parameter, which is a key feature for obtaining the consistency of the maximum likelihood estimators (MLEs) in well-specified models…

    Submitted 30 September, 2015; originally announced September 2015.

  12. arXiv:1507.04441   

    cs.HC

    Eye-2-I: Eye-tracking for just-in-time implicit user profiling

    Authors: Keng-Teck Ma, Qianli Xu, Liyuan Li, Terence Sim, Mohan Kankanhalli, Rosary Lim

    Abstract: For many applications, such as targeted advertising and content recommendation, knowing users' traits and interests is a prerequisite. User profiling is a helpful approach for this purpose. However, current methods, i.e. self-reporting, web-activity monitoring and social media mining are either intrusive or require data over long periods of time. Recently, there is growing evidence in cognitive sc…

    Submitted 13 April, 2016; v1 submitted 15 July, 2015; originally announced July 2015.

    Comments: A bug was found in the codes which resulted in information leak. New experimental results will be updated at a later date. I assume all responsibility for this mistake. KT Ma

    ACM Class: H.3.4

  13. arXiv:1506.01831  [pdf, ps, other]

    math.ST

    Handy sufficient conditions for the convergence of the maximum likelihood estimator in observation-driven models

    Authors: Randal Douc, François Roueff, Tepmony Sim

    Abstract: This paper generalizes asymptotic properties obtained in the observation-driven time series models considered by \cite{dou:kou:mou:2013} in the sense that the conditional law of each observation is also permitted to depend on the parameter. The existence of ergodic solutions and the consistency of the Maximum Likelihood Estimator (MLE) are derived under easy-to-check conditions. The obtained cond…

    Submitted 5 June, 2015; originally announced June 2015.

  14. arXiv:1403.7876  [pdf, other]

    cs.CV

    Correlation Filters with Limited Boundaries

    Authors: Hamed Kiani Galoogahi, Terence Sim, Simon Lucey

    Abstract: Correlation filters take advantage of specific properties in the Fourier domain allowing them to be estimated efficiently: O(ND log D) in the frequency domain, versus O(D^3 + ND^2) spatially, where D is the signal length and N is the number of signals. Recent extensions to correlation filters, such as MOSSE, have reignited interest in their use in the vision community due to their robustness and attract…

    Submitted 31 March, 2014; originally announced March 2014.

    Comments: 8 pages, 6 figures, 2 tables