Skip to main content

Showing 1–12 of 12 results for author: Saglam, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.17393  [pdf

    cs.CY

    Designing Chatbots to Support Victims and Survivors of Domestic Abuse

    Authors: Rahime Belen Saglam, Jason R. C. Nurse, Lisa Sugiura

    Abstract: Objective: Domestic abuse cases have risen significantly over the last four years, in part due to the COVID-19 pandemic and the challenges for victims and survivors in accessing support. In this study, we investigate the role that chatbots - Artificial Intelligence (AI) and rule-based - may play in supporting victims/survivors in situations such as these or where direct access to help is limited.… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  2. arXiv:2211.09702  [pdf, other

    cs.NI cs.LG eess.SP

    Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI

    Authors: Baturay Saglam, Doga Gurgunoglu, Suleyman S. Kozat

    Abstract: We introduce a novel deep reinforcement learning (DRL) approach to jointly optimize transmit beamforming and reconfigurable intelligent surface (RIS) phase shifts in a multiuser multiple input single output (MU-MISO) system to maximize the sum downlink rate under the phase-dependent reflection amplitude model. Our approach addresses the challenge of imperfect channel state information (CSI) and ha… ▽ More

    Submitted 29 March, 2023; v1 submitted 10 October, 2022; originally announced November 2022.

    Comments: 2023 IEEE International Conference on Communications Workshops (ICC Workshops)

  3. arXiv:2210.00293  [pdf, other

    cs.LG cs.AI

    Deep Intrinsically Motivated Exploration in Continuous Control

    Authors: Baturay Saglam, Suleyman S. Kozat

    Abstract: In continuous control, exploration is often performed through undirected strategies in which parameters of the networks or selected actions are perturbed by random noise. Although the deep setting of undirected exploration has been shown to improve the performance of on-policy methods, they introduce an excessive computational complexity and are known to fail in the off-policy setting. The intrins… ▽ More

    Submitted 1 October, 2022; originally announced October 2022.

  4. arXiv:2209.00532  [pdf, other

    cs.LG cs.AI

    Actor Prioritized Experience Replay

    Authors: Baturay Saglam, Furkan B. Mutlu, Dogan C. Cicek, Suleyman S. Kozat

    Abstract: A widely-studied deep reinforcement learning (RL) technique known as Prioritized Experience Replay (PER) allows agents to learn from transitions sampled with non-uniform probability proportional to their temporal-difference (TD) error. Although it has been shown that PER is one of the most crucial components for the overall performance of deep RL methods in discrete action domains, many empirical… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 21 pages, 5 figures, 4 tables

  5. arXiv:2208.00755  [pdf, other

    cs.LG cs.AI

    Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach

    Authors: Baturay Saglam, Dogan C. Cicek, Furkan B. Mutlu, Suleyman S. Kozat

    Abstract: Compared to on-policy counterparts, off-policy model-free deep reinforcement learning can improve data efficiency by repeatedly using the previously gathered data. However, off-policy learning becomes challenging when the discrepancy between the underlying distributions of the agent's policy and collected data increases. Although the well-studied importance sampling and off-policy policy gradient… ▽ More

    Submitted 25 September, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

  6. arXiv:2207.13453  [pdf, other

    cs.LG cs.AI

    Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms

    Authors: Baturay Saglam, Dogan C. Cicek, Furkan B. Mutlu, Suleyman S. Kozat

    Abstract: Learning in high dimensional continuous tasks is challenging, mainly when the experience replay memory is very limited. We introduce a simple yet effective experience sharing mechanism for deterministic policies in continuous action domains for the future off-policy deep reinforcement learning applications in which the allocated memory for the experience replay buffer is limited. To overcome the e… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: ICML 2022 Workshop on Responsible Decision Making in Dynamic Environments (poster: http://responsibledecisionmaking.github.io/assets/poster/19.pdf , presentation: http://drive.google.com/file/d/1vjjMh_z51xdOjsQCcGfU5ojAcrrf3dOS/view?usp=sharing )

  7. arXiv:2111.06780  [pdf, other

    cs.LG cs.AI

    AWD3: Dynamic Reduction of the Estimation Bias

    Authors: Dogan C. Cicek, Enes Duran, Baturay Saglam, Kagan Kaya, Furkan B. Mutlu, Suleyman S. Kozat

    Abstract: Value-based deep Reinforcement Learning (RL) algorithms suffer from the estimation bias primarily caused by function approximation and temporal difference (TD) learning. This problem induces faulty state-action value estimates and therefore harms the performance and robustness of the learning algorithms. Although several techniques were proposed to tackle, learning algorithms still suffer from thi… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: Accepted at The 33rd IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2021)

  8. arXiv:2111.01865  [pdf, other

    cs.LG cs.AI

    Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay

    Authors: Dogan C. Cicek, Enes Duran, Baturay Saglam, Furkan B. Mutlu, Suleyman S. Kozat

    Abstract: The experience replay mechanism allows agents to use the experiences multiple times. In prior works, the sampling probability of the transitions was adjusted according to their importance. Reassigning sampling probabilities for every transition in the replay buffer after each iteration is highly inefficient. Therefore, experience replay prioritization algorithms recalculate the significance of a t… ▽ More

    Submitted 12 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Accepted at The 33rd IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2021)

  9. arXiv:2109.11788  [pdf, other

    cs.LG cs.AI stat.ML

    Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients

    Authors: Baturay Saglam, Furkan Burak Mutlu, Dogan Can Cicek, Suleyman Serdar Kozat

    Abstract: Approximation of the value functions in value-based deep reinforcement learning induces overestimation bias, resulting in suboptimal policies. We show that when the reinforcement signals received by the agents have a high variance, deep actor-critic approaches that overcome the overestimation bias lead to a substantial underestimation bias. We first address the detrimental issues in the existing a… ▽ More

    Submitted 19 May, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

  10. Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

    Authors: Baturay Saglam, Enes Duran, Dogan C. Cicek, Furkan B. Mutlu, Suleyman S. Kozat

    Abstract: In value-based deep reinforcement learning methods, approximation of value functions induces overestimation bias and leads to suboptimal policies. We show that in deep actor-critic methods that aim to overcome the overestimation bias, if the reinforcement signals received by the agent have a high variance, a significant underestimation bias arises. To minimize the underestimation, we introduce a p… ▽ More

    Submitted 23 September, 2021; v1 submitted 22 September, 2021; originally announced September 2021.

  11. arXiv:2107.03959  [pdf, other

    cs.CY cs.AI cs.CL cs.HC

    Privacy Concerns in Chatbot Interactions: When to Trust and When to Worry

    Authors: Rahime Belen Saglam, Jason R. C. Nurse, Duncan Hodges

    Abstract: Through advances in their conversational abilities, chatbots have started to request and process an increasing variety of sensitive personal information. The accurate disclosure of sensitive information is essential where it is used to provide advice and support to users in the healthcare and finance sectors. In this study, we explore users' concerns regarding factors associated with the use of se… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Journal ref: 23rd International Conference on Human-Computer Interaction (HCII 2021)

  12. arXiv:2005.12644  [pdf, ps, other

    cs.CY cs.AI cs.HC cs.SE

    Is your chatbot GDPR compliant? Open issues in agent design

    Authors: Rahime Belen Saglam, Jason R. C. Nurse

    Abstract: Conversational agents open the world to new opportunities for human interaction and ubiquitous engagement. As their conversational abilities and knowledge has improved, these agents have begun to have access to an increasing variety of personally identifiable information and intimate details on their user base. This access raises crucial questions in light of regulations as robust as the General D… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

    Journal ref: CUI 2020: International Conference on Conversational User Interfaces, July, 2020