Skip to main content

Showing 1–7 of 7 results for author: Wibowo, H A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16524  [pdf, other

    cs.CL

    The Privileged Students: On the Value of Initialization in Multilingual Knowledge Distillation

    Authors: Haryo Akbarianto Wibowo, Thamar Solorio, Alham Fikri Aji

    Abstract: Knowledge distillation (KD) has proven to be a successful strategy to improve the performance of a smaller model in many NLP tasks. However, most of the work in KD only explores monolingual scenarios. In this paper, we investigate the value of KD in multilingual settings. We find the significance of KD and model initialization by analyzing how well the student model acquires multilingual knowledge… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 8 pages

    MSC Class: 68T50

  2. arXiv:2406.05967  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (50 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2311.01012  [pdf, other

    cs.CL

    COPAL-ID: Indonesian Language Reasoning with Local Culture and Nuances

    Authors: Haryo Akbarianto Wibowo, Erland Hilman Fuadi, Made Nindyatama Nityasya, Radityo Eko Prasojo, Alham Fikri Aji

    Abstract: We present COPAL-ID, a novel, public Indonesian language common sense reasoning dataset. Unlike the previous Indonesian COPA dataset (XCOPA-ID), COPAL-ID incorporates Indonesian local and cultural nuances, and therefore, provides a more natural portrayal of day-to-day causal reasoning within the Indonesian cultural sphere. Professionally written by natives from scratch, COPAL-ID is more fluent and… ▽ More

    Submitted 21 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 8 pages, Camera Ready (NAACL 2024 - Main)

    MSC Class: 68T50

  4. arXiv:2306.02870  [pdf, ps, other

    cs.CL

    On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research

    Authors: Made Nindyatama Nityasya, Haryo Akbarianto Wibowo, Alham Fikri Aji, Genta Indra Winata, Radityo Eko Prasojo, Phil Blunsom, Adhiguna Kuncoro

    Abstract: This evidence-based position paper critiques current research practices within the language model pre-training literature. Despite rapid recent progress afforded by increasingly better pre-trained language models (PLMs), current PLM research practices often conflate different possible sources of model improvement, without conducting proper ablation studies and principled comparisons between differ… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted at ACL 2023

  5. arXiv:2201.00558  [pdf, other

    cs.CL

    Which Student is Best? A Comprehensive Knowledge Distillation Exam for Task-Specific BERT Models

    Authors: Made Nindyatama Nityasya, Haryo Akbarianto Wibowo, Rendi Chevi, Radityo Eko Prasojo, Alham Fikri Aji

    Abstract: We perform knowledge distillation (KD) benchmark from task-specific BERT-base teacher models to various student models: BiLSTM, CNN, BERT-Tiny, BERT-Mini, and BERT-Small. Our experiment involves 12 datasets grouped in two tasks: text classification and sequence labeling in the Indonesian language. We also compare various aspects of distillations including the usage of word embeddings and unlabeled… ▽ More

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: 14 pages, 3 figures, submitted to Elsevier

    MSC Class: 68T50 ACM Class: I.2.7; I.2.6

  6. arXiv:2012.08958  [pdf, ps, other

    cs.CL

    Costs to Consider in Adopting NLP for Your Business

    Authors: Made Nindyatama Nityasya, Haryo Akbarianto Wibowo, Radityo Eko Prasojo, Alham Fikri Aji

    Abstract: Recent advances in Natural Language Processing (NLP) have largely pushed deep transformer-based models as the go-to state-of-the-art technique without much regard to the production and utilization cost. Companies planning to adopt these methods into their business face difficulties because of the lack of machine, data, and human resources to build them. We compare both the performance and the cost… ▽ More

    Submitted 14 April, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: 12 pages, 2 figures

    MSC Class: 68T50 ACM Class: I.2.7; I.2.6

  7. arXiv:2011.03286  [pdf, other

    cs.CL

    Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation

    Authors: Haryo Akbarianto Wibowo, Tatag Aziz Prawiro, Muhammad Ihsan, Alham Fikri Aji, Radityo Eko Prasojo, Rahmad Mahendra, Suci Fitriany

    Abstract: In its daily use, the Indonesian language is riddled with informality, that is, deviations from the standard in terms of vocabulary, spelling, and word order. On the other hand, current available Indonesian NLP models are typically developed with the standard Indonesian in mind. In this work, we address a style-transfer from informal to formal Indonesian as a low-resource machine translation probl… ▽ More

    Submitted 22 December, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: 6 pages, Camera ready to be presented at IALP 2020

    MSC Class: 68T50