Skip to main content

Showing 1–7 of 7 results for author: İlhan, E

.
  1. arXiv:2308.08650  [pdf, other

    cs.IR cs.AI cs.LG

    AdaptEx: A Self-Service Contextual Bandit Platform

    Authors: William Black, Ercument Ilhan, Andrea Marchini, Vilda Markeviciute

    Abstract: This paper presents AdaptEx, a self-service contextual bandit platform widely used at Expedia Group, that leverages multi-armed bandit algorithms to personalize user experiences at scale. AdaptEx considers the unique context of each visitor to select the optimal variants and learns quickly from every interaction they make. It offers a powerful solution to improve user experiences while minimizing… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  2. arXiv:2302.01991  [pdf, other

    cs.CV

    Offloading Deep Learning Powered Vision Tasks from UAV to 5G Edge Server with Denoising

    Authors: Sedat Ozer, Enes Ilhan, Mehmet Akif Ozkanoglu, Hakan Ali Cirpan

    Abstract: Offloading computationally heavy tasks from an unmanned aerial vehicle (UAV) to a remote server helps improve the battery life and can help reduce resource requirements. Deep learning based state-of-the-art computer vision tasks, such as object segmentation and object detection, are computationally heavy algorithms, requiring large memory and computing power. Many UAVs are using (pretrained) off-t… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: This paper is accepted for publication at IEEE Transactions on Vehicular Technology

    ACM Class: I.4.0

  3. arXiv:2204.07254  [pdf, other

    cs.LG cs.AI cs.MA

    Methodical Advice Collection and Reuse in Deep Reinforcement Learning

    Authors: Sahir, Ercüment İlhan, Srijita Das, Matthew E. Taylor

    Abstract: Reinforcement learning (RL) has shown great success in solving many challenging tasks via use of deep neural networks. Although using deep learning for RL brings immense representational power, it also causes a well-known sample-inefficiency problem. This means that the algorithms are data-hungry and require millions of training samples to converge to an adequate policy. One way to combat this iss… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: To be published in ALA2022: Adaptive and Learning Agents Workshop 2022 at AAMAS

  4. arXiv:2104.08441  [pdf, other

    cs.LG cs.AI

    Action Advising with Advice Imitation in Deep Reinforcement Learning

    Authors: Ercument Ilhan, Jeremy Gow, Diego Perez-Liebana

    Abstract: Action advising is a peer-to-peer knowledge exchange technique built on the teacher-student paradigm to alleviate the sample inefficiency problem in deep reinforcement learning. Recently proposed student-initiated approaches have obtained promising results. However, due to being in the early stages of development, these also have some substantial shortcomings. One of the abilities that are absent… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

  5. arXiv:2104.08440  [pdf, other

    cs.LG cs.AI

    Learning on a Budget via Teacher Imitation

    Authors: Ercument Ilhan, Jeremy Gow, Diego Perez-Liebana

    Abstract: Deep Reinforcement Learning (RL) techniques can benefit greatly from leveraging prior experience, which can be either self-generated or acquired from other entities. Action advising is a framework that provides a flexible way to transfer such knowledge in the form of actions between teacher-student peers. However, due to the realistic concerns, the number of these interactions is limited with a bu… ▽ More

    Submitted 30 June, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

  6. Student-Initiated Action Advising via Advice Novelty

    Authors: Ercument Ilhan, Jeremy Gow, Diego Perez-Liebana

    Abstract: Action advising is a budget-constrained knowledge exchange mechanism between teacher-student peers that can help tackle exploration and sample inefficiency problems in deep reinforcement learning (RL). Most recently, student-initiated techniques that utilise state novelty and uncertainty estimations have obtained promising results. However, the approaches built on these estimations have some poten… ▽ More

    Submitted 27 February, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

  7. arXiv:1905.01357  [pdf, other

    cs.MA cs.LG

    Teaching on a Budget in Multi-Agent Deep Reinforcement Learning

    Authors: Ercüment İlhan, Jeremy Gow, Diego Perez-Liebana

    Abstract: Deep Reinforcement Learning (RL) algorithms can solve complex sequential decision tasks successfully. However, they have a major drawback of having poor sample efficiency which can often be tackled by knowledge reuse. In Multi-Agent Reinforcement Learning (MARL) this drawback becomes worse, but at the same time, a new set of opportunities to leverage knowledge are also presented through agent inte… ▽ More

    Submitted 28 May, 2019; v1 submitted 19 April, 2019; originally announced May 2019.

    Comments: 8 pages