
Showing 1–49 of 49 results for author: Sanchez, V

  1. arXiv:2406.03866  [pdf, other]

    cs.CV

    LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model

    Authors: Yixuan Yang, Junru Lu, Zixiang Zhao, Zhen Luo, James J. Q. Yu, Victor Sanchez, Feng Zheng

    Abstract: Designing 3D indoor layouts is a crucial task with significant applications in virtual reality, interior design, and automated space planning. Existing methods for 3D layout design either rely on diffusion models, which utilize spatial relationship priors, or heavily leverage the inferential capabilities of proprietary Large Language Models (LLMs), which require extensive prompt engineering and in…

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2405.19349  [pdf, other]

    eess.SP cs.CV cs.HC cs.LG

    Beyond Isolated Frames: Enhancing Sensor-Based Human Activity Recognition through Intra- and Inter-Frame Attention

    Authors: Shuai Shao, Yu Guan, Victor Sanchez

    Abstract: Human Activity Recognition (HAR) has become increasingly popular with ubiquitous computing, driven by the popularity of wearable sensors in fields like healthcare and sports. While Convolutional Neural Networks (ConvNets) have significantly contributed to HAR, they often adopt a frame-by-frame analysis, concentrating on individual frames and potentially overlooking the broader temporal dynamics in…

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2405.00823  [pdf, other]

    cs.CL cs.AI cs.MA

    WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting

    Authors: Olly Styles, Sam Miller, Patricio Cerda-Mardini, Tanaya Guha, Victor Sanchez, Bertie Vidgen

    Abstract: We introduce WorkBench: a benchmark dataset for evaluating agents' ability to execute tasks in a workplace setting. WorkBench contains a sandbox environment with five databases, 26 tools, and 690 tasks. These tasks represent common business activities, such as sending emails and scheduling meetings. The tasks in WorkBench are challenging as they require planning, tool selection, and often multiple…

    Submitted 1 May, 2024; originally announced May 2024.

  4. arXiv:2403.09281  [pdf, other]

    cs.CV

    CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification

    Authors: Yiming Ma, Victor Sanchez, Tanaya Guha

    Abstract: The CLIP (Contrastive Language-Image Pretraining) model has exhibited outstanding performance in recognition problems, such as zero-shot image classification and object detection. However, its ability to count remains understudied due to the inherent challenges of transforming counting--a regression task--into a recognition task. In this paper, we investigate CLIP's potential in counting, focusing…

    Submitted 14 March, 2024; originally announced March 2024.

  5. arXiv:2401.04257  [pdf, other]

    cs.CV

    Detecting Face Synthesis Using a Concealed Fusion Model

    Authors: Roberto Leyva, Victor Sanchez, Gregory Epiphaniou, Carsten Maple

    Abstract: Face image synthesis is gaining more attention in computer security due to concerns about its potential negative impacts, including those related to fake biometrics. Hence, building models that can detect the synthesized face images is an important challenge to tackle. In this paper, we propose a fusion-based strategy to detect face image synthesis while providing resiliency to several attacks. Th…

    Submitted 8 January, 2024; originally announced January 2024.

  6. arXiv:2401.04241  [pdf, other]

    cs.CV

    Data-Agnostic Face Image Synthesis Detection Using Bayesian CNNs

    Authors: Roberto Leyva, Victor Sanchez, Gregory Epiphaniou, Carsten Maple

    Abstract: Face image synthesis detection is considerably gaining attention because of the potential negative impact on society that this type of synthetic data brings. In this paper, we propose a data-agnostic solution to detect the face image synthesis process. Specifically, our solution is based on an anomaly detection framework that requires only real data to learn the inference process. It is therefore…

    Submitted 8 January, 2024; originally announced January 2024.

  7. arXiv:2312.11195  [pdf, other]

    cs.CV

    Cross-Age Contrastive Learning for Age-Invariant Face Recognition

    Authors: Haoyi Wang, Victor Sanchez, Chang-Tsun Li

    Abstract: Cross-age facial images are typically challenging and expensive to collect, making noise-free age-oriented datasets relatively small compared to widely-used large-scale facial datasets. Additionally, in real scenarios, images of the same subject at different ages are usually hard or even impossible to obtain. Both of these factors lead to a lack of supervised data, which limits the versatility of…

    Submitted 2 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024

  8. ViKi-HyCo: A Hybrid-Control approach for complex car-like maneuvers

    Authors: Edison P. Velasco Sánchez, Miguel Ángel Muñoz-Bañón, Francisco A. Candelas, Santiago T. Puente, Fernando Torres

    Abstract: While Visual Servoing is deeply studied to perform simple maneuvers, the literature does not commonly address complex cases where the target is far out of the camera's field of view (FOV) during the maneuver. For this reason, in this paper, we present ViKi-HyCo (Visual Servoing and Kinematic Hybrid-Controller). This approach generates the necessary maneuvers for the complex positioning of a non-ho…

    Submitted 16 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: This paper is published at the journal "IEEE Access"

    Journal ref: In IEEE Access, vol. 12, pp. 65428-65443, May. 2024

  9. arXiv:2304.06370  [pdf, other]

    cs.CV

    Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-Attention

    Authors: Yiming Ma, Victor Sanchez, Soodeh Nikan, Devesh Upadhyay, Bhushan Atote, Tanaya Guha

    Abstract: Driver Monitoring Systems (DMSs) are crucial for safe hand-over actions in Level-2+ self-driving vehicles. State-of-the-art DMSs leverage multiple sensors mounted at different locations to monitor the driver and the vehicle's interior scene and employ decision-level fusion to integrate these heterogeneous data. However, this fusion method may not fully utilize the complementarity of different data…

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 9 pages (1 for reference); accepted by the 6th Multimodal Learning and Applications Workshop (MULA) at CVPR 2023

  10. arXiv:2212.02448  [pdf, ps, other]

    cs.IT

    The Multi-cluster Fluctuating Two-Ray Fading Model

    Authors: José David Vega Sánchez, F. Javier López-Martínez, José F. Paris, Juan M. Romero-Jerez

    Abstract: We introduce a new class of fading channels, built as the superposition of two fluctuating specular components with random phases, plus a clustering of scattered waves: the Multi-cluster Fluctuating Two-Ray (MFTR) fading channel. The MFTR model emerges as a natural generalization of both the fluctuating two-ray (FTR) and the $κ$-$μ$ shadowed fading models through a more general yet equally mathema…

    Submitted 15 September, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: This work was submitted to the IEEE for publication on May 31, 2022. Copyright may be transferred without notice, after which this version may no longer be accessible

  11. arXiv:2212.00873  [pdf, other]

    cs.AR

    CONVOLVE: Smart and seamless design of smart edge processors

    Authors: M. Gomony, F. Putter, A. Gebregiorgis, G. Paulin, L. Mei, V. Jain, S. Hamdioui, V. Sanchez, T. Grosser, M. Geilen, M. Verhelst, F. Zenke, F. Gurkaynak, B. Bruin, S. Stuijk, S. Davidson, S. De, M. Ghogho, A. Jimborean, S. Eissa, L. Benini, D. Soudris, R. Bishnoi, S. Ainsworth, F. Corradi, et al. (3 additional authors not shown)

    Abstract: With the rise of Deep Learning (DL), our world braces for AI in every edge device, creating an urgent need for edge-AI SoCs. This SoC hardware needs to support high throughput, reliable and secure AI processing at Ultra Low Power (ULP), with a very short time to market. With its strong legacy in edge solutions and open processing platforms, the EU is well-positioned to become a leader in this SoC…

    Submitted 2 May, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

  12. arXiv:2210.09441  [pdf, other]

    cs.CV cs.HC cs.RO

    Real-Time Driver Monitoring Systems through Modality and View Analysis

    Authors: Yiming Ma, Victor Sanchez, Soodeh Nikan, Devesh Upadhyay, Bhushan Atote, Tanaya Guha

    Abstract: Driver distractions are known to be the dominant cause of road accidents. While monitoring systems can detect non-driving-related activities and facilitate reducing the risks, they must be accurate and efficient to be applicable. Unfortunately, state-of-the-art methods prioritize accuracy while ignoring latency because they leverage cross-view and multimodal videos in which consecutive frames are…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Paper that summarizes our work on the DAD dataset

  13. arXiv:2207.13798  [pdf, other]

    cs.CV

    Look at Adjacent Frames: Video Anomaly Detection without Offline Training

    Authors: Yuqi Ouyang, Guodong Shen, Victor Sanchez

    Abstract: We propose a solution to detect anomalous events in videos without the need to train a model offline. Specifically, our solution is based on a randomly-initialized multilayer perceptron that is optimized online to reconstruct video frames, pixel-by-pixel, from their frequency information. Based on the information shifts between adjacent frames, an incremental learner is used to update parameters o…

    Submitted 22 January, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: Accepted in ECCV 2022 RWS

  14. arXiv:2207.07935  [pdf, other]

    cs.SD cs.LG cs.MM eess.AS

    Visually-aware Acoustic Event Detection using Heterogeneous Graphs

    Authors: Amir Shirian, Krishna Somandepalli, Victor Sanchez, Tanaya Guha

    Abstract: Perception of auditory events is inherently multimodal relying on both audio and visual cues. A large number of existing multimodal approaches process each modality using modality-specific models and then fuse the embeddings to encode the joint information. In contrast, we employ heterogeneous graphs to explicitly capture the spatial and temporal relationships between the modalities and represent…

    Submitted 16 July, 2022; originally announced July 2022.

  15. Video Anomaly Detection via Prediction Network with Enhanced Spatio-Temporal Memory Exchange

    Authors: Guodong Shen, Yuqi Ouyang, Victor Sanchez

    Abstract: Video anomaly detection is a challenging task because most anomalies are scarce and non-deterministic. Many approaches investigate the reconstruction difference between normal and abnormal patterns, but neglect that anomalies do not necessarily correspond to large reconstruction errors. To address this issue, we design a Convolutional LSTM Auto-Encoder prediction framework with enhanced spatio-tem…

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Accepted at ICASSP 2022

  16. Frequency selective extrapolation with residual filtering for image error concealment

    Authors: Ján Koloda, Jürgen Seiler, André Kaup, Victoria Sánchez, Antonio M. Peinado

    Abstract: The purpose of signal extrapolation is to estimate unknown signal parts from known samples. This task is especially important for error concealment in image and video communication. For obtaining a high quality reconstruction, assumptions have to be made about the underlying signal in order to solve this underdetermined problem. Among existent reconstruction algorithms, frequency selective extrapo…

    Submitted 16 May, 2022; originally announced May 2022.

    ACM Class: I.4.3; I.4.5

    Journal ref: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014, pp. 1976-1980

  17. arXiv:2203.08370  [pdf, ps, other]

    cs.IT

    Physical Layer Security of RIS-Assisted Communications under Electromagnetic Interference

    Authors: José David Vega Sánchez, Georges Kaddoum, F. Javier López-Martínez

    Abstract: This work investigates the impact of the ever-present electromagnetic interference (EMI) on the achievable secrecy performance of reconfigurable intelligent surface (RIS)-aided communication systems. We characterize the end-to-end RIS channel by considering key practical aspects such as spatial correlation, transmit beamforming vector, phase-shift noise, the coexistence of direct and indirect chan…

    Submitted 15 March, 2022; originally announced March 2022.

  18. FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion

    Authors: Yiming Ma, Victor Sanchez, Tanaya Guha

    Abstract: State-of-the-art crowd counting models follow an encoder-decoder approach. Images are first processed by the encoder to extract features. Then, to account for perspective distortion, the highest-level feature map is fed to extra components to extract multiscale features, which are the input to the decoder to generate crowd densities. However, in these methods, features extracted at earlier stages…

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: 5 pages, 11 figures, submitted to ICIP

  19. arXiv:2201.09822  [pdf]

    cs.CV

    Spectral-PQ: A Novel Spectral Sensitivity-Orientated Perceptual Compression Technique for RGB 4:4:4 Video Data

    Authors: Lee Prangnell, Victor Sanchez

    Abstract: There exists an intrinsic relationship between the spectral sensitivity of the Human Visual System (HVS) and colour perception; these intertwined phenomena are often overlooked in perceptual compression research. In general, most previously proposed visually lossless compression techniques exploit luminance (luma) masking including luma spatiotemporal masking, luma contrast masking and luma textur…

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:2005.07928

  20. arXiv:2201.06967  [pdf, other]

    cs.CY cs.CL

    Large Scale Analysis of Open MOOC Reviews to Support Learners' Course Selection

    Authors: Manuel J. Gomez, Mario Calderón, Victor Sánchez, Félix J. García Clemente, José A. Ruipérez-Valiente

    Abstract: The recent pandemic has changed the way we see education. It is not surprising that children and college students are not the only ones using online education. Millions of adults have signed up for online classes and courses in recent years, and MOOC providers, such as Coursera or edX, are reporting millions of new users signing up on their platforms. However, students do face some challenges wh…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: 36 pages, 8 figures

  21. Improving Face-Based Age Estimation with Attention-Based Dynamic Patch Fusion

    Authors: Haoyi Wang, Victor Sanchez, Chang-Tsun Li

    Abstract: With the increasing popularity of convolutional neural networks (CNNs), recent works on face-based age estimation employ these networks as the backbone. However, state-of-the-art CNN-based methods treat each facial region equally, thus entirely ignoring the importance of some facial patches that may contain rich age-specific information. In this paper, we propose a face-based age estimation framew…

    Submitted 19 December, 2021; originally announced December 2021.

    Comments: IEEE Transactions on Image Processing (accepted for publication)

  22. arXiv:2108.10543  [pdf, other]

    cs.CV

    Joint Learning Architecture for Multiple Object Tracking and Trajectory Forecasting

    Authors: Oluwafunmilola Kesa, Olly Styles, Victor Sanchez

    Abstract: This paper introduces a joint learning architecture (JLA) for multiple object tracking (MOT) and trajectory forecasting in which the goal is to predict objects' current and future trajectories simultaneously. Motion prediction is widely used in several state-of-the-art MOT methods to refine predictions in the form of bounding boxes. Typically, a Kalman Filter provides short-term estimations to hel…

    Submitted 24 August, 2021; originally announced August 2021.

  23. arXiv:2108.04694  [pdf, other]

    cs.CV

    Multi-Camera Trajectory Forecasting with Trajectory Tensors

    Authors: Olly Styles, Tanaya Guha, Victor Sanchez

    Abstract: We introduce the problem of multi-camera trajectory forecasting (MCTF), which involves predicting the trajectory of a moving object across a network of cameras. While multi-camera setups are widespread for applications such as surveillance and traffic monitoring, existing trajectory forecasting methods typically focus on single-camera trajectory forecasting (SCTF), limiting their use for such appl…

    Submitted 24 August, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence (tPAMI)

  24. On the detection-to-track association for online multi-object tracking

    Authors: Xufeng Lin, Chang-Tsun Li, Victor Sanchez, Carsten Maple

    Abstract: Driven by recent advances in object detection with deep neural networks, the tracking-by-detection paradigm has gained increasing prevalence in the research community of multi-object tracking (MOT). It has long been known that appearance information plays an essential role in the detection-to-track association, which lies at the core of the tracking-by-detection paradigm. While most existing works…

    Submitted 1 July, 2021; originally announced July 2021.

    Journal ref: Pattern Recognition Letters 146 (2021) 200-207

  25. arXiv:2106.06924  [pdf, other]

    cs.MM cs.CV eess.IV

    Deep Learning for Predictive Analytics in Reversible Steganography

    Authors: Ching-Chun Chang, Xu Wang, Sisheng Chen, Isao Echizen, Victor Sanchez, Chang-Tsun Li

    Abstract: Deep learning is regarded as a promising solution for reversible steganography. There is an accelerating trend of representing a reversible stego-system by monolithic neural networks, which bypass intermediate operations in traditional pipelines of reversible steganography. This end-to-end paradigm, however, suffers from imperfect reversibility. By contrast, the modular paradigm that incorporates n…

    Submitted 7 March, 2023; v1 submitted 13 June, 2021; originally announced June 2021.

    Journal ref: IEEE Access (2023), vol. 11, pp. 3494-3510

  26. arXiv:2103.13525  [pdf, ps, other]

    cs.IT

    Expectation-Maximization Learning for Wireless Channel Modeling of Reconfigurable Intelligent Surfaces

    Authors: José David Vega Sánchez, Luis Urquiza-Aguiar, Martha Cecilia Paredes Paredes, F. Javier López-Martínez

    Abstract: Channel modeling is a critical issue when designing or evaluating the performance of reconfigurable intelligent surface (RIS)-assisted communications. Inspired by the promising potential of learning-based methods for characterizing the radio environment, we present a general approach to model the RIS end-to-end equivalent channel using the unsupervised expectation-maximization (EM) learning algori…

    Submitted 10 August, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

  27. arXiv:2012.01468  [pdf, other]

    cs.CV cs.LG eess.IV

    Video Anomaly Detection by Estimating Likelihood of Representations

    Authors: Yuqi Ouyang, Victor Sanchez

    Abstract: Video anomaly detection is a challenging task not only because it involves solving many sub-tasks such as motion representation, object localization and action recognition, but also because it is commonly considered as an unsupervised learning problem that involves detecting outliers. Traditionally, solutions to this task have focused on the mapping between video frames and their low-dimensional f…

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: Accepted to ICPR 2020

  28. arXiv:2007.12859  [pdf, ps, other]

    cs.IT eess.SP

    Physical Layer Security of Large Reflecting Surface Aided Communications with Phase Errors

    Authors: Jose David Vega Sanchez, Pablo Ramirez-Espinosa, F. Javier Lopez-Martinez

    Abstract: The physical layer security (PLS) performance of a wireless communication link through a large reflecting surface (LRS) with phase errors is analyzed. Leveraging recent results that express the LRS-based composite channel as an equivalent scalar fading channel, we show that the eavesdropper's link is Rayleigh distributed and independent of the legitimate link. The different scaling laws of th…

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: This work has been submitted to the IEEE for publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  29. Age-Oriented Face Synthesis with Conditional Discriminator Pool and Adversarial Triplet Loss

    Authors: Haoyi Wang, Victor Sanchez, Chang-Tsun Li

    Abstract: The vanilla Generative Adversarial Networks (GAN) are commonly used to generate realistic images depicting aged and rejuvenated faces. However, the performance of such vanilla GANs in the age-oriented face synthesis task is often compromised by the mode collapse issue, which may result in the generation of faces with minimal variations and a poor synthesis accuracy. In addition, recent age-oriente…

    Submitted 3 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

  30. arXiv:2006.08044  [pdf, ps, other]

    cs.IT eess.SP

    Survey on Physical Layer Security for 5G Wireless Networks

    Authors: José David Vega Sánchez, Luis Urquiza-Aguiar, Martha Cecilia Paredes Paredes, Diana Pamela Moya Osorio

    Abstract: Physical layer security is a promising approach that can benefit traditional encryption methods. The idea of physical layer security is to take advantage of the features of the propagation medium and its impairments to ensure secure communication in the physical layer. This work introduces a comprehensive review of the main information-theoretic metrics used to measure the secrecy performance in p…

    Submitted 14 June, 2020; originally announced June 2020.

  31. arXiv:2006.03898  [pdf, other]

    cs.CV cs.MM eess.IV

    Ensemble Network for Ranking Images Based on Visual Appeal

    Authors: Sachin Singh, Victor Sanchez, Tanaya Guha

    Abstract: We propose a computational framework for ranking images (group photos in particular) taken at the same event within a short time span. The ranking is expected to correspond with human perception of overall appeal of the images. We hypothesize and provide evidence through subjective analysis that the factors that appeal to humans are its emotional content, aesthetics and image quality. We propose a…

    Submitted 6 June, 2020; originally announced June 2020.

  32. arXiv:2005.07930  [pdf]

    eess.IV cs.CV

    HVS-Based Perceptual Color Compression of Image Data

    Authors: Lee Prangnell, Victor Sanchez

    Abstract: In perceptual image coding applications, the main objective is to decrease, as much as possible, Bits Per Pixel (BPP) while avoiding noticeable distortions in the reconstructed image. In this paper, we propose a novel perceptual image coding technique, named Perceptual Color Compression (PCC). PCC is based on a novel model related to Human Visual System (HVS) spectral sensitivity and CIELAB Just N…

    Submitted 9 February, 2021; v1 submitted 16 May, 2020; originally announced May 2020.

    Comments: Preprint: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)

  33. arXiv:2005.07928  [pdf]

    cs.MM eess.IV

    Spatiotemporal Adaptive Quantization for the Perceptual Video Coding of RGB 4:4:4 Data

    Authors: Lee Prangnell, Victor Sanchez

    Abstract: Due to the spectral sensitivity phenomenon of the Human Visual System (HVS), the color channels of raw RGB 4:4:4 sequences contain significant psychovisual redundancies; these redundancies can be perceptually quantized. The default quantization systems in the HEVC standard are known as Uniform Reconstruction Quantization (URQ) and Rate Distortion Optimized Quantization (RDOQ); URQ and RDOQ are not…

    Submitted 16 May, 2020; originally announced May 2020.

  34. arXiv:2005.02441  [pdf, ps, other]

    cs.IT

    Information-Theoretic Security of MIMO Networks under $κ$-$μ$ Shadowed Fading Channels

    Authors: José David Vega Sánchez, D. P. Moya Osorio, F. Javier López-Martínez, Martha Cecilia Paredes Paredes, Luis Urquiza-Aguiar

    Abstract: This paper investigates the impact of realistic propagation conditions on the achievable secrecy performance of multiple-input multiple-output systems in the presence of an eavesdropper. Specifically, we concentrate on the $κ$-$μ$ shadowed fading model because its physical underpinnings capture a wide range of propagation conditions, while, at the same time, it allows for much better tractability…

    Submitted 30 June, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

  35. arXiv:2005.00282  [pdf, other]

    cs.CV

    Multi-Camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras

    Authors: Olly Styles, Tanaya Guha, Victor Sanchez, Alex Kot

    Abstract: We introduce the task of multi-camera trajectory forecasting (MCTF), where the future trajectory of an object is predicted in a network of cameras. Prior works consider forecasting trajectories in a single camera view. Our work is the first to consider the challenging scenario of forecasting across multiple non-overlapping camera views. This has wide applicability in tasks such as re-identificatio…

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: CVPR 2020 Precognition workshop

  36. arXiv:1909.11944  [pdf, other]

    cs.CV

    Multiple Object Forecasting: Predicting Future Object Locations in Diverse Environments

    Authors: Olly Styles, Tanaya Guha, Victor Sanchez

    Abstract: This paper introduces the problem of multiple object forecasting (MOF), in which the goal is to predict future bounding boxes of tracked objects. In contrast to existing works on object trajectory forecasting which primarily consider the problem from a birds-eye perspective, we formulate the problem from an object-level perspective and call for the prediction of full object bounding boxes, rather…

    Submitted 7 January, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: WACV 2020. Code & dataset: https://github.com/olly-styles/Multiple-Object-Forecasting

  37. arXiv:1905.03681  [pdf, other]

    cs.CV cs.RO

    Forecasting Pedestrian Trajectory with Machine-Annotated Training Data

    Authors: Olly Styles, Arun Ross, Victor Sanchez

    Abstract: Reliable anticipation of pedestrian trajectory is imperative for the operation of autonomous vehicles and can significantly enhance the functionality of advanced driver assistance systems. While significant progress has been made in the field of pedestrian detection, forecasting pedestrian trajectories remains a challenging problem due to the unpredictable nature of pedestrians and the huge space…

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: 6 pages, 5 figures. To appear in the proceedings of the 2019 IEEE Intelligent Vehicles Symposium (IV)

  38. arXiv:1902.07847  [pdf, ps, other]

    cs.IT

    On the Statistics of the Ratio of Non-Constrained Arbitrary α-μ Random Variables: a General Framework and Applications

    Authors: J. D. Vega Sánchez, D. P. Moya Osorio, E. E. Benitez Olivo, H. Alves, M. C. P. Paredes, L. Urquiza-Aguiar

    Abstract: In this paper, we derive closed-form exact expressions for the main statistics of the ratio of squared alpha-mu random variables, which are of interest in many scenarios for future wireless networks where generalized distributions are more suitable to fit with field data. Importantly, different from previous proposals, our expressions are general in the sense that they are valid for non constrained arb…

    Submitted 20 February, 2019; originally announced February 2019.

  39. arXiv:1807.10421  [pdf, other]

    cs.CV

    Fusion Network for Face-based Age Estimation

    Authors: Haoyi Wang, Xingjie Wei, Victor Sanchez, Chang-Tsun Li

    Abstract: Convolutional Neural Networks (CNN) have been applied to age-related research as the core framework. Although faces are composed of numerous facial attributes, most works with CNNs still consider a face as a typical object and do not pay enough attention to facial regions that carry age-specific features for this particular task. In this paper, we propose a novel CNN architecture called Fusion Netw…

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: ICIP 2018

  40. arXiv:1803.09875  [pdf, other]

    cs.IR

    A Web Scraping Methodology for Bypassing Twitter API Restrictions

    Authors: A. Hernandez-Suarez, G. Sanchez-Perez, K. Toscano-Medina, V. Martinez-Hernandez, V. Sanchez, H. Perez-Meana

    Abstract: Retrieving information from social networks is the first and primordial step in many data analysis fields such as Natural Language Processing, Sentiment Analysis and Machine Learning. Important data science tasks rely on historical data gathering for further predictive results. Most of the recent works use Twitter API, a public platform for collecting public streams of information, which allows quer…

    Submitted 26 March, 2018; originally announced March 2018.

  41. arXiv:1802.05884  [pdf]

    cs.MM

    Coding Block-Level Perceptual Video Coding for 4:4:4 Data in HEVC

    Authors: Lee Prangnell, Miguel Hernández-Cabronero, Victor Sanchez

    Abstract: There is an increasing consumer demand for high bit-depth 4:4:4 HD video data playback due to its superior perceptual visual quality compared with standard 8-bit subsampled 4:2:0 video data. Due to vast file sizes and associated bitrates, it is desirable to compress raw high bit-depth 4:4:4 HD video sequences as much as possible without incurring a discernible decrease in visual quality. In this p…

    Submitted 16 February, 2018; originally announced February 2018.

    Comments: Preprint: 2017 IEEE International Conference on Image Processing (ICIP 2017)

  42. arXiv:1710.09919  [pdf]

    cs.MM

    JND-Based Perceptual Video Coding for 4:4:4 Screen Content Data in HEVC

    Authors: Lee Prangnell, Victor Sanchez

    Abstract: The JCT-VC standardized Screen Content Coding (SCC) extension in the HEVC HM RExt + SCM reference codec offers an impressive coding efficiency performance when compared with HM RExt alone; however, it is not significantly perceptually optimized. For instance, it does not include advanced HVS-based perceptual coding methods, such as JND-based spatiotemporal masking schemes. In this paper, we propos…

    Submitted 12 February, 2018; v1 submitted 26 October, 2017; originally announced October 2017.

    Comments: Preprint: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018)

  43. arXiv:1612.07893  [pdf]

    cs.MM

    Cross-Color Channel Perceptually Adaptive Quantization for HEVC

    Authors: Lee Prangnell, Miguel Hernández-Cabronero, Victor Sanchez

    Abstract: HEVC includes a Coding Unit (CU) level luminance-based perceptual quantization technique known as AdaptiveQP. AdaptiveQP perceptually adjusts the Quantization Parameter (QP) at the CU level based on the spatial activity of raw input video data in a luma Coding Block (CB). In this paper, we propose a novel cross-color channel adaptive quantization scheme which perceptually adjusts the CU level QP a…

    Submitted 12 February, 2018; v1 submitted 23 December, 2016; originally announced December 2016.

    Comments: Data Compression Conference 2017

  44. arXiv:1610.01381  [pdf, other]

    cs.AI

    The Predictive Context Tree: Predicting Contexts and Interactions

    Authors: Alasdair Thomason, Nathan Griffiths, Victor Sanchez

    Abstract: With a large proportion of people carrying location-aware smartphones, we have an unprecedented platform from which to understand individuals and predict their future actions. This work builds upon the Context Tree data structure that summarises the historical contexts of individuals from augmented geospatial trajectories, and constructs a predictive model for their likely future contexts. The Pre…

    Submitted 5 October, 2016; originally announced October 2016.

  45. arXiv:1609.06442  [pdf]

    cs.MM

    Minimizing Compression Artifacts for High Resolutions with Adaptive Quantization Matrices for HEVC

    Authors: Lee Prangnell, Victor Sanchez

    Abstract: Visual Display Units (VDUs), capable of displaying video data at High Definition (HD) and Ultra HD (UHD) resolutions, are frequently employed in a variety of technological domains. Quantization-induced video compression artifacts, which are usually unnoticeable in low resolution environments, are typically conspicuous on high resolution VDUs and video data. The default quantization matrices (QMs)…

    Submitted 21 September, 2016; originally announced September 2016.

    Comments: PhD Working Paper, University of Warwick, UK. arXiv admin note: substantial text overlap with arXiv:1606.02042

  46. arXiv:1609.06302   

    cs.MM

    Color-Based Coding Unit Level Adaptive Quantization for HEVC

    Authors: Lee Prangnell, Victor Sanchez

    Abstract: HEVC HM 16 includes a Coding Unit (CU) level perceptual quantization technique named AdaptiveQP. AdaptiveQP adjusts the Quantization Parameter (QP) at the CU level based on the spatial activity of samples in the four constituent NxN sub-blocks of the luma Coding Block (CB), which is contained within a 2Nx2N CU. In this paper, we propose C-BAQ, which, in contrast to AdaptiveQP, adjusts the CU level…

    Submitted 6 November, 2016; v1 submitted 15 September, 2016; originally announced September 2016.

    Comments: Some of the textual and mathematical contents in this pre-print working paper have been superseded. Therefore, this pre-print has been removed from arXiv, as requested by the co-author of the paper (at The University of Warwick)

  47. arXiv:1606.04269  [pdf, other]

    cs.DS cs.LG

    Context Trees: Augmenting Geospatial Trajectories with Context

    Authors: Alasdair Thomason, Nathan Griffiths, Victor Sanchez

    Abstract: Exposing latent knowledge in geospatial trajectories has the potential to provide a better understanding of the movements of individuals and groups. Motivated by such a desire, this work presents the context tree, a new hierarchical data structure that summarises the context behind user actions in a single model. We propose a method for context tree construction that augments geospatial trajectori…

    Submitted 14 June, 2016; originally announced June 2016.

    Journal ref: ACM Transactions on Information Systems 2016 35(2) 14:1-14:37

  48. arXiv:1606.02042  [pdf]

    cs.OH

    Adaptive Quantization Matrices for HD and UHD Display Resolutions in Scalable HEVC

    Authors: Lee Prangnell, Victor Sanchez

    Abstract: HEVC contains an option to enable custom quantization matrices, which are designed based on the Human Visual System and a 2D Contrast Sensitivity Function. Visual Display Units, capable of displaying video data at High Definition and Ultra HD display resolutions, are frequently utilized on a global scale. Video compression artifacts that are present due to high levels of quantization, which are ty…

    Submitted 12 June, 2016; v1 submitted 7 June, 2016; originally announced June 2016.

    Comments: Data Compression Conference 2016

  49. arXiv:1207.5545  [pdf, other]

    cs.SE

    An analysis of social network connect services

    Authors: Antonio Tapiador, Víctor Sánchez, Joaquín Salvachúa

    Abstract: Social network platforms are increasingly becoming identity providers and a medium for showing multiple types of activity from third-party web sites. In this article, we analyze the services provided by seven of the most popular social network platforms. Results show OAuth emerging as the authentication and authorization protocol, giving support to three types of APIs, client-side or JavaScript, se…

    Submitted 23 July, 2012; originally announced July 2012.

    Comments: Preprint of article published in Proceedings of WEBIST 2012. Porto, Portugal