Skip to main content

Showing 1–4 of 4 results for author: Cenggoro, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.06138  [pdf, other

    cs.CL

    Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages

    Authors: Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Rifki Afina Putri, Emmanuel Dave, Jhonson Lee, Nuur Shadieq, Wawan Cenggoro, Salsabil Maulana Akbar, Muhammad Ihza Mahendra, Dea Annisayanti Putri, Bryan Wilie, Genta Indra Winata, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung

    Abstract: Large language models (LLMs) show remarkable human-like capability in various domains and languages. However, a notable quality gap arises in low-resource languages, e.g., Indonesian indigenous languages, rendering them ineffective and inefficient in such linguistic contexts. To bridge this quality gap, we introduce Cendol, a collection of Indonesian LLMs encompassing both decoder-only and encoder… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Cendol models are released under Apache 2.0 license and will be made publicly available soon

  2. arXiv:2309.10661  [pdf, other

    cs.CL cs.AI

    NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages

    Authors: Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Dea Adhista, Emmanuel Dave, Sarah Oktavianti, Salsabil Maulana Akbar, Jhonson Lee, Nuur Shadieq, Tjeng Wawan Cenggoro, Hanung Wahyuning Linuwih, Bryan Wilie, Galih Pradipta Muridan, Genta Indra Winata, David Moeljadi, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung

    Abstract: Democratizing access to natural language processing (NLP) technology is crucial, especially for underrepresented and extremely low-resource languages. Previous research has focused on develo** labeled and unlabeled corpora for these languages through online scra** and document translation. While these methods have proven effective and cost-efficient, we have identified limitations in the resul… ▽ More

    Submitted 19 September, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

  3. arXiv:2004.07136  [pdf, other

    cs.NE

    Transfer-Learning-Aware Neuro-Evolution for Diseases Detection in Chest X-Ray Images

    Authors: Albert Susanto, Herman, Tjeng Wawan Cenggoro, Suharjito, Bens Pardamean

    Abstract: The neural network needs excessive costs of time because of the complexity of architecture when trained on images. Transfer learning and fine-tuning can help improve time and cost efficiency when training a neural network. Yet, Transfer learning and fine-tuning needs a lot of experiment to try with. Therefore, a method to find the best architecture for transfer learning and fine-tuning is needed.… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

  4. arXiv:1807.04585  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Learning for Imbalance Data Classification using Class Expert Generative Adversarial Network

    Authors: Fanny, Tjeng Wawan Cenggoro

    Abstract: Without any specific way for imbalance data classification, artificial intelligence algorithm cannot recognize data from minority classes easily. In general, modifying the existing algorithm by assuming that the training data is imbalanced, is the only way to handle imbalance data. However, for a normal data handling, this way mostly produces a deficient result. In this research, we propose a clas… ▽ More

    Submitted 12 July, 2018; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: Accepted in 3rd International Conference on Computer Science and Computational Intelligence, 7-8 September 2018