Showing 1–11 of 11 results for author: Akkaya, I

  1. arXiv:2305.08551  [pdf, other]

    cs.CV cs.AI

    Enhancing Performance of Vision Transformers on Small Datasets through Local Inductive Bias Incorporation

    Authors: Ibrahim Batuhan Akkaya, Senthilkumar S. Kathiresan, Elahe Arani, Bahram Zonooz

    Abstract: Vision transformers (ViTs) achieve remarkable performance on large datasets, but tend to perform worse than convolutional neural networks (CNNs) when trained from scratch on smaller datasets, possibly due to a lack of local inductive bias in the architecture. Recent studies have therefore added locality to the architecture and demonstrated that it can help ViTs achieve performance comparable to CN…

    Submitted 15 May, 2023; originally announced May 2023.

  2. arXiv:2303.08774  [pdf, other]

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  3. arXiv:2212.04227  [pdf, other]

    cs.CV cs.LG

    Self-training via Metric Learning for Source-Free Domain Adaptation of Semantic Segmentation

    Authors: Ibrahim Batuhan Akkaya, Ugur Halici

    Abstract: Unsupervised source-free domain adaptation methods aim to train a model for the target domain utilizing a pretrained source-domain model and unlabeled target-domain data, particularly when accessibility to source data is restricted due to intellectual property or privacy concerns. Traditional methods usually use self-training with pseudo-labeling, which is often subjected to thresholding based on…

    Submitted 9 April, 2024; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: This paper is under consideration at Computer Vision and Image Understanding

  4. arXiv:2206.11795  [pdf, other]

    cs.LG cs.AI

    Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

    Authors: Bowen Baker, Ilge Akkaya, Peter Zhokhov, Joost Huizinga, Jie Tang, Adrien Ecoffet, Brandon Houghton, Raul Sampedro, Jeff Clune

    Abstract: Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for training models with broad, general capabilities for text, images, and other modalities. However, for many sequential decision domains such as robotics, video games, and computer use, publicly available data does not contain the labels required to train behavioral priors in the same way. We extend the interne…

    Submitted 23 June, 2022; originally announced June 2022.

  5. Focus-and-Detect: A Small Object Detection Framework for Aerial Images

    Authors: Onur Can Koyun, Reyhan Kevser Keser, İbrahim Batuhan Akkaya, Behçet Uğur Töreyin

    Abstract: Despite recent advances, object detection in aerial images is still a challenging task. Specific problems in aerial images make detection harder, such as small objects, densely packed objects, and objects of different sizes and orientations. To address the small object detection problem, we propose a two-stage object detection framework called "Focus-and-Detect". The first stag…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: 12 pages, 6 figures

    Journal ref: Signal Processing: Image Communication, Volume 104, May 2022, 116675

  6. arXiv:2106.07165  [pdf, ps, other]

    cs.CV cs.LG

    Self-training Guided Adversarial Domain Adaptation For Thermal Imagery

    Authors: Ibrahim Batuhan Akkaya, Fazil Altinel, Ugur Halici

    Abstract: Deep models trained on large-scale RGB image datasets have shown tremendous success. It is important to apply such deep models to real-world problems. However, these models suffer from a performance bottleneck under illumination changes. Thermal IR cameras are more robust against such changes, and thus can be very useful for the real-world problems. In order to investigate efficacy of combining fe…

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted to CVPR 2021 Perception Beyond the Visible Spectrum (PBVS) workshop

  7. arXiv:2101.04882  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    Asymmetric self-play for automatic goal discovery in robotic manipulation

    Authors: OpenAI, Matthias Plappert, Raul Sampedro, Tao Xu, Ilge Akkaya, Vineet Kosaraju, Peter Welinder, Ruben D'Sa, Arthur Petron, Henrique P. d. O. Pinto, Alex Paino, Hyeonwoo Noh, Lilian Weng, Qiming Yuan, Casey Chu, Wojciech Zaremba

    Abstract: We train a single, goal-conditioned policy that can solve many robotic manipulation tasks, including tasks with previously unseen goals and objects. We rely on asymmetric self-play for goal discovery, where two agents, Alice and Bob, play a game. Alice is asked to propose challenging goals and Bob aims to solve them. We show that this method can discover highly diverse and complex goals without an…

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Videos are shown at https://robotics-self-play.github.io

  8. arXiv:2009.00878  [pdf, other]

    cs.CV eess.IV

    GAIT: Gradient Adjusted Unsupervised Image-to-Image Translation

    Authors: Ibrahim Batuhan Akkaya, Ugur Halici

    Abstract: Image-to-image translation (IIT) has made much progress recently with the development of adversarial learning. In most of the recent work, an adversarial loss is utilized to match the distributions of the translated and target image sets. However, this may create artifacts if two domains have different marginal distributions, for example, in uniform areas. In this work, we propose an unsupervised…

    Submitted 2 September, 2020; originally announced September 2020.

    Comments: Accepted by ICIP2020

  9. arXiv:1910.07113  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Solving Rubik's Cube with a Robot Hand

    Authors: OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang

    Abstract: We demonstrate that models trained only in simulation can be used to solve a manipulation problem of unprecedented complexity on a real robot. This is made possible by two key components: a novel algorithm, which we call automatic domain randomization (ADR) and a robot platform built for machine learning. ADR automatically generates a distribution over randomized environments of ever-increasing di…

    Submitted 15 October, 2019; originally announced October 2019.

  10. Control Improvisation with Probabilistic Temporal Specifications

    Authors: Ilge Akkaya, Daniel J. Fremont, Rafael Valle, Alexandre Donzé, Edward A. Lee, Sanjit A. Seshia

    Abstract: We consider the problem of generating randomized control sequences for complex networked systems typically actuated by human agents. Our approach leverages a concept known as control improvisation, which is based on a combination of data-driven learning and controller synthesis from formal specifications. We learn from existing data a generative model (for instance, an explicit-duration hidden Mar…

    Submitted 29 February, 2016; v1 submitted 6 November, 2015; originally announced November 2015.

    Comments: to appear in Proceedings of the 1st IEEE Conference on Internet-of-Things Design and Implementation (IoTDI'16)

  11. arXiv:1008.2867  [pdf, ps, other]

    astro-ph.GA

    CCD UBVRI Photometry of the Galactic open clusters: Be 89, Ru 135, and Be 10

    Authors: Inci Akkaya, William J. Schuster, Raul Michel, Carlos Chavarria-K, Andre Moitinho, Roberto Vazquez, Yuksel Karatas

    Abstract: The fundamental parameters of reddening, metallicity, age, and distance are presented for the poorly studied open clusters Be 89, Ru 135, and Be 10, derived from their CCD UBVRI photometry. By fitting the appropriate isochrones to the observed sequences of the clusters in five different color–magnitude diagrams, the weighted averages of distance moduli and heliocentric distances ($(V_0$--…

    Submitted 17 August, 2010; originally announced August 2010.

    Comments: accepted to RevMexAA