Showing 1–22 of 22 results for author: Kotseruba, I

  1. arXiv:2407.00446  [pdf, other]

    cs.CV cs.RO

    Diving Deeper Into Pedestrian Behavior Understanding: Intention Estimation, Action Prediction, and Event Risk Assessment

    Authors: Amir Rasouli, Iuliia Kotseruba

    Abstract: In this paper, we delve into the pedestrian behavior understanding problem from the perspective of three different tasks: intention estimation, action prediction, and event risk assessment. We first define the tasks and discuss how these tasks are represented and annotated in two widely used pedestrian datasets, JAAD and PIE. We then propose a new benchmark based on these definitions, available an…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 8 pages, 5 figures, 6 tables

  2. arXiv:2404.08756  [pdf, other]

    cs.CV

    SCOUT+: Towards Practical Task-Driven Drivers' Gaze Prediction

    Authors: Iuliia Kotseruba, John K. Tsotsos

    Abstract: Accurate prediction of drivers' gaze is an important component of vision-based driver monitoring and assistive systems. Of particular interest are safety-critical episodes, such as performing maneuvers or crossing intersections. In such scenarios, drivers' gaze distribution changes significantly and becomes difficult to predict, especially if the task and context information is represented implici…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted at IEEE Intelligent Vehicles Symposium (IV), 2024

  3. arXiv:2404.08749  [pdf, other]

    cs.CV

    Data Limitations for Modeling Top-Down Effects on Drivers' Attention

    Authors: Iuliia Kotseruba, John K. Tsotsos

    Abstract: Driving is a visuomotor task, i.e., there is a connection between what drivers see and what they do. While some models of drivers' gaze account for top-down effects of drivers' actions, the majority learn only bottom-up correlations between human gaze and driving footage. The crux of the problem is lack of public data with annotations that could be used to train top-down models and evaluate how we…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted at IEEE Intelligent Vehicles Symposium (IV), 2024

  4. arXiv:2310.09275  [pdf, other]

    cs.CV

    Understanding and Modeling the Effects of Task and Context on Drivers' Gaze Allocation

    Authors: Iuliia Kotseruba, John K. Tsotsos

    Abstract: To further advance driver monitoring and assistance systems, it is important to understand how drivers allocate their attention, in other words, where do they tend to look and why. Traditionally, factors affecting human visual attention have been divided into bottom-up (involuntary attraction to salient regions) and top-down (driven by the demands of the task being performed). Although both play a…

    Submitted 12 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted at IEEE Intelligent Vehicles Symposium (IV), 2024

  5. arXiv:2302.03816  [pdf, other]

    cs.AI

    Intend-Wait-Perceive-Cross: Exploring the Effects of Perceptual Limitations on Pedestrian Decision-Making

    Authors: Iuliia Kotseruba, Amir Rasouli

    Abstract: Current research on pedestrian behavior understanding focuses on the dynamics of pedestrians and makes strong assumptions about their perceptual abilities. For instance, it is often presumed that pedestrians have omnidirectional view of the scene around them. In practice, human visual system has a number of limitations, such as restricted field of view (FoV) and range of sensing, which consequentl…

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 6 pages, 5 figures, 2 tables

  6. arXiv:2211.07545  [pdf, ps, other]

    cs.RO cs.CV cs.LG

    NeurIPS 2022 Competition: Driving SMARTS

    Authors: Amir Rasouli, Randy Goebel, Matthew E. Taylor, Iuliia Kotseruba, Soheil Alizadeh, Tianpei Yang, Montgomery Alban, Florian Shkurti, Yuzheng Zhuang, Adam Scibior, Kasra Rezaee, Animesh Garg, David Meger, Jun Luo, Liam Paull, Weinan Zhang, Xinyu Wang, Xi Chen

    Abstract: Driving SMARTS is a regular competition designed to tackle problems caused by the distribution shift in dynamic interaction contexts that are prevalent in real-world autonomous driving (AD). The proposed competition supports methodologically diverse solutions, such as reinforcement learning (RL) and offline learning methods, trained on a combination of naturalistic AD data and open-source simulati…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 10 pages, 8 figures

  7. arXiv:2210.07886  [pdf, other]

    cs.CV cs.RO

    PedFormer: Pedestrian Behavior Prediction via Cross-Modal Attention Modulation and Gated Multitask Learning

    Authors: Amir Rasouli, Iuliia Kotseruba

    Abstract: Predicting pedestrian behavior is a crucial task for intelligent driving systems. Accurate predictions require a deep understanding of various contextual elements that potentially impact the way pedestrians behave. To address this challenge, we propose a novel framework that relies on different data modalities to predict future trajectories and crossing actions of pedestrians from an ego-centric p…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 8 pages, 3 Figures

  8. arXiv:2203.07324  [pdf, other]

    cs.RO cs.HC

    Intend-Wait-Cross: Towards Modeling Realistic Pedestrian Crossing Behavior

    Authors: Amir Rasouli, Iuliia Kotseruba

    Abstract: In this paper, we present a microscopic agent-based pedestrian behavior model Intend-Wait-Cross. The model is comprised of rules representing behaviors of pedestrians as a series of decisions that depend on their individual characteristics (e.g. demographics, walking speed, law obedience) and environmental conditions (e.g. traffic flow, road structure). The model's main focus is on generating real…

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 8 pages, 8 figures, 2 tables

  9. arXiv:2107.04902  [pdf, other]

    cs.CV

    Industry and Academic Research in Computer Vision

    Authors: Iuliia Kotseruba, Manos Papagelis, John K. Tsotsos

    Abstract: This work aims to study the dynamic between research in the industry and academia in computer vision. The results are demonstrated on a set of top-5 vision conferences that are representative of the field. Since data for such analysis was not readily available, significant effort was spent on gathering and processing meta-data from the original publications. First, this study quantifies the share…

    Submitted 17 July, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: 8 pages, 9 Figures, 2 Tables

  10. arXiv:2104.05677  [pdf, other]

    cs.CV cs.RO

    Behavioral Research and Practical Models of Drivers' Attention

    Authors: Iuliia Kotseruba, John K. Tsotsos

    Abstract: Driving is a routine activity for many, but it is far from simple. Drivers deal with multiple concurrent tasks, such as keeping the vehicle in the lane, observing and anticipating the actions of other road users, reacting to hazards, and dealing with distractions inside and outside the vehicle. Failure to notice and respond to the surrounding objects and events can cause accidents. The ongoing imp…

    Submitted 13 December, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: 78 pages, 21 figures, 9 tables

  11. arXiv:2101.01533  [pdf]

    cs.AI cs.CC cs.CV cs.LG q-bio.NC

    On the Control of Attentional Processes in Vision

    Authors: John K. Tsotsos, Omar Abid, Iuliia Kotseruba, Markus D. Solbach

    Abstract: The study of attentional processing in vision has a long and deep history. Recently, several papers have presented insightful perspectives into how the coordination of multiple attentional functions in the brain might occur. These begin with experimental observations and the authors propose structures, processes, and computations that might explain those observations. Here, we consider a perspecti…

    Submitted 5 January, 2021; originally announced January 2021.

  12. arXiv:2005.06583  [pdf, other]

    cs.CV

    Do Saliency Models Detect Odd-One-Out Targets? New Datasets and Evaluations

    Authors: Iuliia Kotseruba, Calden Wloka, Amir Rasouli, John K. Tsotsos

    Abstract: Recent advances in the field of saliency have concentrated on fixation prediction, with benchmarks reaching saturation. However, there is an extensive body of works in psychology and neuroscience that describe aspects of human visual attention that might not be adequately captured by current approaches. Here, we investigate singleton detection, which can be thought of as a canonical example of sal…

    Submitted 5 May, 2021; v1 submitted 13 May, 2020; originally announced May 2020.

    Comments: Published in BMVC 2019. 14 pages, 5 figures

  13. arXiv:2005.06582  [pdf, other]

    cs.CV cs.RO

    Pedestrian Action Anticipation using Contextual Feature Fusion in Stacked RNNs

    Authors: Amir Rasouli, Iuliia Kotseruba, John K. Tsotsos

    Abstract: One of the major challenges for autonomous vehicles in urban environments is to understand and predict other road users' actions, in particular, pedestrians at the point of crossing. The common approach to solving this problem is to use the motion history of the agents to predict their future trajectories. However, pedestrians exhibit highly variable actions most of which cannot be understood with…

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: This paper was accepted and presented at British Machine Vision Conference (BMVC) 2019

  14. arXiv:1908.10933  [pdf, other]

    cs.CV

    A Possible Reason for why Data-Driven Beats Theory-Driven Computer Vision

    Authors: John K. Tsotsos, Iuliia Kotseruba, Alexander Andreopoulos, Yulong Wu

    Abstract: Why do some continue to wonder about the success and dominance of deep learning methods in computer vision and AI? Is it not enough that these methods provide practical solutions to many problems? Well no, it is not enough, at least for those who feel there should be a science that underpins all of this and that we should have a clear understanding of how this success was achieved. Here, this pape…

    Submitted 6 September, 2019; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: 8 pages, 5 figures

  15. Rapid Visual Categorization is not Guided by Early Salience-Based Selection

    Authors: John K. Tsotsos, Iuliia Kotseruba, Calden Wloka

    Abstract: The current dominant visual processing paradigm in both human and machine research is the feedforward, layered hierarchy of neural-like processing elements. Within this paradigm, visual saliency is seen by many to have a specific role, namely that of early selection. Early selection is thought to enable very fast visual performance by limiting processing to only the most salient candidate portions…

    Submitted 30 January, 2020; v1 submitted 15 January, 2019; originally announced January 2019.

    Comments: 22 pages, 9 figures

  16. arXiv:1812.08848  [pdf, other]

    cs.CV

    SMILER: Saliency Model Implementation Library for Experimental Research

    Authors: Calden Wloka, Toni Kunić, Iuliia Kotseruba, Ramin Fahimi, Nicholas Frosst, Neil D. B. Bruce, John K. Tsotsos

    Abstract: The Saliency Model Implementation Library for Experimental Research (SMILER) is a new software package which provides an open, standardized, and extensible framework for maintaining and executing computational saliency models. This work drastically reduces the human effort required to apply saliency algorithms to new tasks and datasets, while also ensuring consistency and procedural correctness fo…

    Submitted 20 December, 2018; originally announced December 2018.

  17. arXiv:1806.11530  [pdf]

    cs.CV

    Visual Attention and its Intimate Links to Spatial Cognition

    Authors: John K. Tsotsos, Iuliia Kotseruba, Amir Rasouli, Markus D. Solbach

    Abstract: It is almost universal to regard attention as the facility that permits an agent, human or machine, to give priority processing resources to relevant stimuli while ignoring the irrelevant. The reality of how this might manifest itself throughout all the forms of perceptual and cognitive processes possessed by humans, however, is not as clear. Here we examine this reality with a broad perspective i…

    Submitted 29 June, 2018; originally announced June 2018.

    Comments: 10 pages, 10 figures

  18. arXiv:1711.10959  [pdf, other]

    cs.CV

    Saccade Sequence Prediction: Beyond Static Saliency Maps

    Authors: Calden Wloka, Iuliia Kotseruba, John K. Tsotsos

    Abstract: Visual attention is a field with a considerable history, with eye movement control and prediction forming an important subfield. Fixation modeling in the past decades has been largely dominated computationally by a number of highly influential bottom-up saliency models, such as the Itti-Koch-Niebur model. The accuracy of such models has dramatically increased recently due to deep learning. However…

    Submitted 29 November, 2017; originally announced November 2017.

  19. arXiv:1711.09464  [pdf, other]

    cs.CV

    STAR-RT: Visual attention for real-time video game playing

    Authors: Iuliia Kotseruba, John K. Tsotsos

    Abstract: In this paper we present STAR-RT - the first working prototype of Selective Tuning Attention Reference (STAR) model and Cognitive Programs (CPs). The Selective Tuning (ST) model received substantial support through psychological and neurophysiological experiments. The STAR framework expands ST and applies it to practical visual tasks. In order to do so, similarly to many cognitive architectures, S…

    Submitted 26 November, 2017; originally announced November 2017.

    Comments: 21 pages, 13 figures

  20. arXiv:1702.03555  [pdf, other]

    cs.RO

    Agreeing to Cross: How Drivers and Pedestrians Communicate

    Authors: Amir Rasouli, Iuliia Kotseruba, John K. Tsotsos

    Abstract: The contribution of this paper is twofold. The first is a novel dataset for studying behaviors of traffic participants while crossing. Our dataset contains more than 650 samples of pedestrian behaviors in various street configurations and weather conditions. These examples were selected from approx. 240 hours of driving in the city, suburban and urban roads. The second contribution is an analysis…

    Submitted 12 February, 2017; originally announced February 2017.

    Comments: 6 pages, 6 figures

  21. arXiv:1610.08602  [pdf, ps, other]

    cs.AI

    A Review of 40 Years of Cognitive Architecture Research: Core Cognitive Abilities and Practical Applications

    Authors: Iuliia Kotseruba, John K. Tsotsos

    Abstract: In this paper we present a broad overview of the last 40 years of research on cognitive architectures. Although the number of existing architectures is nearing several hundred, most of the existing surveys do not reflect this growth and focus on a handful of well-established architectures. Thus, in this survey we wanted to shift the focus towards a more inclusive and high-level overview of the res…

    Submitted 13 January, 2018; v1 submitted 26 October, 2016; originally announced October 2016.

    Comments: 74 pages, 10 figures

  22. arXiv:1609.04741  [pdf, other]

    cs.RO

    Joint Attention in Autonomous Driving (JAAD)

    Authors: Iuliia Kotseruba, Amir Rasouli, John K. Tsotsos

    Abstract: In this paper we present a novel dataset for a critical aspect of autonomous driving, the joint attention that must occur between drivers and pedestrians, cyclists or other drivers. This dataset is produced with the intention of demonstrating the behavioral variability of traffic participants. We also show how visual complexity of the behaviors and scene understanding is affected by various fac…

    Submitted 22 April, 2020; v1 submitted 15 September, 2016; originally announced September 2016.

    Comments: fixed formatting, added references