Showing 1–15 of 15 results for author: Celiktutan, O

  1. arXiv:2403.05701  [pdf, other]

    cs.RO cs.AI cs.HC

    Are Large Language Models Aligned with People's Social Intuitions for Human-Robot Interactions?

    Authors: Lennart Wachowiak, Andrew Coles, Oya Celiktutan, Gerard Canal

    Abstract: Large language models (LLMs) are increasingly used in robotics, especially for high-level action planning. Meanwhile, many robotics applications involve human supervisors or collaborators. Hence, it is crucial for LLMs to generate socially acceptable actions that align with people's preferences and values. In this work, we test whether LLMs capture people's intuitions about behavior judgments and…

    Submitted 9 July, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024

  2. arXiv:2403.02018  [pdf, other]

    cs.RO cs.AI

    Cross Domain Policy Transfer with Effect Cycle-Consistency

    Authors: Ruiqi Zhu, Tianhong Dai, Oya Celiktutan

    Abstract: Training a robotic policy from scratch using deep reinforcement learning methods can be prohibitively expensive due to sample inefficiency. To address this challenge, transferring policies trained in the source domain to the target domain becomes an attractive paradigm. Previous research has typically focused on domains with similar state and action spaces but differing in other aspects. In this p…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted to International Conference on Robotics and Automation (ICRA), 2024

  3. arXiv:2309.07477  [pdf, other]

    cs.RO

    Self-Supervised Prediction of the Intention to Interact with a Service Robot

    Authors: Gabriele Abbate, Alessandro Giusti, Viktor Schmuck, Oya Celiktutan, Antonio Paolillo

    Abstract: A service robot can provide a smoother interaction experience if it has the ability to proactively detect whether a nearby user intends to interact, in order to adapt its behavior e.g. by explicitly showing that it is available to provide a service. In this work, we propose a learning-based approach to predict the probability that a human user will interact with a robot before the interaction actu…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Paper under revision for Robotics and Autonomous Systems journal

  4. arXiv:2307.02889  [pdf, other]

    cs.RO cs.AI cs.LG

    Learning to Solve Tasks with Exploring Prior Behaviours

    Authors: Ruiqi Zhu, Siyuan Li, Tianhong Dai, Chongjie Zhang, Oya Celiktutan

    Abstract: Demonstrations are widely used in Deep Reinforcement Learning (DRL) to facilitate solving tasks with sparse rewards. However, tasks in real-world scenarios can often have initial conditions that differ from the demonstration, which would require additional prior behaviours. For example, consider we are given the demonstration for the task of "picking up an object from an open drawer", but t…

    Submitted 6 July, 2023; originally announced July 2023.

  5. arXiv:2211.13823  [pdf, other]

    cs.CV

    Neural Weight Search for Scalable Task Incremental Learning

    Authors: Jian Jiang, Oya Celiktutan

    Abstract: Task incremental learning aims to enable a system to maintain its performance on previously learned tasks while learning new tasks, solving the problem of catastrophic forgetting. One promising approach is to build an individual network or sub-network for future tasks. However, this leads to an ever-growing memory due to saving extra weights for new tasks and how to address this issue has remained…

    Submitted 24 November, 2022; originally announced November 2022.

  6. arXiv:2210.06766  [pdf, other]

    cs.LG cs.AI

    Policy Gradient With Serial Markov Chain Reasoning

    Authors: Edoardo Cetin, Oya Celiktutan

    Abstract: We introduce a new framework that performs decision-making in reinforcement learning (RL) as an iterative reasoning process. We model agent behavior as the steady-state distribution of a parameterized reasoning Markov chain (RMC), optimized with a new tractable estimate of the policy gradient. We perform action selection by simulating the RMC for enough reasoning steps to approach its steady-state…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  7. arXiv:2209.15370  [pdf, other]

    cs.HC cs.AI cs.CV cs.RO

    Automatic Context-Driven Inference of Engagement in HMI: A Survey

    Authors: Hanan Salam, Oya Celiktutan, Hatice Gunes, Mohamed Chetouani

    Abstract: An integral part of seamless human-human communication is engagement, the process by which two or more participants establish, maintain, and end their perceived connection. Therefore, to develop successful human-centered human-machine interaction applications, automatic engagement inference is one of the tasks required to achieve engaging interactions between humans and machines, and to make machi…

    Submitted 30 September, 2022; originally announced September 2022.

  8. arXiv:2207.00986  [pdf, other]

    cs.LG cs.AI cs.CV

    Stabilizing Off-Policy Deep Reinforcement Learning from Pixels

    Authors: Edoardo Cetin, Philip J. Ball, Steve Roberts, Oya Celiktutan

    Abstract: Off-policy reinforcement learning (RL) from pixel observations is notoriously unstable. As a result, many successful algorithms must combine different domain-specific practices and auxiliary losses to learn meaningful behaviors in complex environments. In this work, we provide novel analysis demonstrating that these instabilities arise from performing temporal-difference learning with a convolutio…

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: Short presentation at ICML 2022

  9. arXiv:2111.04397  [pdf, other]

    cs.CV cs.RO

    GROWL: Group Detection With Link Prediction

    Authors: Viktor Schmuck, Oya Celiktutan

    Abstract: Interaction group detection has been previously addressed with bottom-up approaches which relied on the position and orientation information of individuals. These approaches were primarily based on pairwise affinity matrices and were limited to static, third-person views. This problem can greatly benefit from a holistic approach based on Graph Neural Networks (GNNs) beyond pairwise relationships,…

    Submitted 8 November, 2021; originally announced November 2021.

  10. arXiv:2110.09378  [pdf, other]

    cs.AI

    Forecasting Nonverbal Social Signals during Dyadic Interactions with Generative Adversarial Neural Networks

    Authors: Nguyen Tan Viet Tuyen, Oya Celiktutan

    Abstract: We are approaching a future where social robots will progressively become widespread in many aspects of our daily lives, including education, healthcare, work, and personal use. All such practical applications require that humans and robots collaborate in human environments, where social interaction is unavoidable. Along with verbal communication, successful social interaction is closely couple…

    Submitted 18 October, 2021; originally announced October 2021.

  11. arXiv:2110.03375  [pdf, other]

    cs.LG cs.AI

    Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning

    Authors: Edoardo Cetin, Oya Celiktutan

    Abstract: Popular off-policy deep reinforcement learning algorithms compensate for overestimation bias during temporal-difference learning by utilizing pessimistic estimates of the expected target returns. In this work, we propose a novel learnable penalty to enact such pessimism, based on a new way to quantify the critic's epistemic uncertainty. Furthermore, we propose to learn the penalty alongside the cr…

    Submitted 7 October, 2021; originally announced October 2021.

  12. arXiv:2106.02943  [pdf, other]

    cs.LG

    Learning Routines for Effective Off-Policy Reinforcement Learning

    Authors: Edoardo Cetin, Oya Celiktutan

    Abstract: The performance of reinforcement learning depends upon designing an appropriate action space, where the effect of each action is measurable, yet granular enough to permit flexible behavior. So far, this process has involved non-trivial user choices in terms of the available actions and their execution frequency. We propose a novel framework for reinforcement learning that effectively lifts such const…

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  13. arXiv:2104.10588  [pdf, other]

    cs.CV

    IB-DRR: Incremental Learning with Information-Back Discrete Representation Replay

    Authors: Jian Jiang, Edoardo Cetin, Oya Celiktutan

    Abstract: Incremental learning aims to enable machine learning models to continuously acquire new knowledge given new classes, while maintaining the knowledge already learned for old classes. Saving a subset of training samples of previously seen classes in the memory and replaying them during new training phases is proven to be an efficient and effective way to fulfil this aim. It is evident that the large…

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: CVPR 2021 Workshop on Continual Learning

  14. arXiv:2103.05079  [pdf, other]

    cs.LG cs.AI

    Domain-Robust Visual Imitation Learning with Mutual Information Constraints

    Authors: Edoardo Cetin, Oya Celiktutan

    Abstract: Human beings are able to understand objectives and learn by simply observing others perform a task. Imitation learning methods aim to replicate such capabilities; however, they generally depend on access to a full set of optimal states and actions taken with the agent's actuators and from the agent's point of view. In this paper, we introduce a new algorithm - called Disentangling Generative Adver…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Presented at ICLR 2021

  15. arXiv:1505.00581  [pdf, other]

    cs.CV

    Activity recognition from videos with parallel hypergraph matching on GPUs

    Authors: Eric Lombardi, Christian Wolf, Oya Celiktutan, Bülent Sankur

    Abstract: In this paper, we propose a method for activity recognition from videos based on sparse local features and hypergraph matching. We benefit from special properties of the temporal domain in the data to derive a sequential and fast graph matching algorithm for GPUs. Traditionally, graphs and hypergraphs are frequently used to recognize complex and often non-rigid patterns in computer vision, eithe…

    Submitted 4 May, 2015; originally announced May 2015.