Skip to main content

Showing 1–8 of 8 results for author: Khodra, M L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.08200  [pdf, other

    cs.CL

    IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation

    Authors: Samuel Cahyawijaya, Genta Indra Winata, Bryan Wilie, Karissa Vincentio, Xiaohong Li, Adhiguna Kuncoro, Sebastian Ruder, Zhi Yuan Lim, Syafri Bahar, Masayu Leylia Khodra, Ayu Purwarianti, Pascale Fung

    Abstract: Natural language generation (NLG) benchmarks provide an important avenue to measure progress and develop better NLG systems. Unfortunately, the lack of publicly available NLG benchmarks for low-resource languages poses a challenging barrier for building NLG systems that work well for languages with limited amounts of data. Here we introduce IndoNLG, the first benchmark to measure natural language… ▽ More

    Submitted 9 October, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted in EMNLP 2021, 10 pages

  2. arXiv:2103.03736  [pdf

    cs.CL

    Multi-document Summarization using Semantic Role Labeling and Semantic Graph for Indonesian News Article

    Authors: Yuly Haruka Berliana Gunawan, Masayu Leylia Khodra

    Abstract: In this paper, we proposed a multi-document summarization system using semantic role labeling (SRL) and semantic graph for Indonesian news articles. In order to improve existing summarizer, our system modified summarizer that employed subject, predicate, object, and adverbial (SVOA) extraction for predicate argument structure (PAS) extraction. SVOA extraction is replaced with SRL model for Indones… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  3. arXiv:2103.03732  [pdf

    cs.CL

    Fine-tuning Pretrained Multilingual BERT Model for Indonesian Aspect-based Sentiment Analysis

    Authors: Annisa Nurul Azhar, Masayu Leylia Khodra

    Abstract: Although previous research on Aspect-based Sentiment Analysis (ABSA) for Indonesian reviews in hotel domain has been conducted using CNN and XGBoost, its model did not generalize well in test data and high number of OOV words contributed to misclassification cases. Nowadays, most state-of-the-art results for wide array of NLP tasks are achieved by utilizing pretrained language representation. In t… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  4. arXiv:2103.03730  [pdf

    cs.CL cs.AI

    Parsing Indonesian Sentence into Abstract Meaning Representation using Machine Learning Approach

    Authors: Adylan Roaffa Ilmy, Masayu Leylia Khodra

    Abstract: Abstract Meaning Representation (AMR) provides many information of a sentence such as semantic relations, coreferences, and named entity relation in one representation. However, research on AMR parsing for Indonesian sentence is fairly limited. In this paper, we develop a system that aims to parse an Indonesian sentence using a machine learning approach. Based on Zhang et al. work, our system cons… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  5. arXiv:1908.04899  [pdf

    cs.CL

    Aspect and Opinion Terms Extraction Using Double Embeddings and Attention Mechanism for Indonesian Hotel Reviews

    Authors: Jordhy Fernando, Masayu Leylia Khodra, Ali Akbar Septiandri

    Abstract: Aspect and opinion terms extraction from review texts is one of the key tasks in aspect-based sentiment analysis. In order to extract aspect and opinion terms for Indonesian hotel reviews, we adapt double embeddings feature and attention mechanism that outperform the best system at SemEval 2015 and 2016. We conduct experiments using 4000 reviews to find the best configuration and show the influenc… ▽ More

    Submitted 19 August, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

  6. arXiv:1902.00624  [pdf

    cs.DB cs.AI

    A Question Answering System Using Graph-Pattern Association Rules (QAGPAR) On YAGO Knowledge Base

    Authors: Wahyudi, Masayu Leylia Khodra, Ary Setijadi Prihatmanto, Carmadi Machbub

    Abstract: A question answering system (QA System) was developed that uses graph-pattern association rules on the YAGO knowledge base. The answer as output of the system is provided based on a user question as input. If the answer is missing or unavailable in the database, then graph-pattern association rules are used to get the answer. The architecture of this question answering system is as follows: questi… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

  7. Handling Imbalanced Dataset in Multi-label Text Categorization using Bagging and Adaptive Boosting

    Authors: Genta Indra Winata, Masayu Leylia Khodra

    Abstract: Imbalanced dataset is occurred due to uneven distribution of data available in the real world such as disposition of complaints on government offices in Bandung. Consequently, multi-label text categorization algorithms may not produce the best performance because classifiers tend to be weighed down by the majority of the data and ignore the minority. In this paper, Bagging and Adaptive Boosting al… ▽ More

    Submitted 11 June, 2019; v1 submitted 27 October, 2018; originally announced October 2018.

    Comments: Accepted in the 2015 International Conference on Electrical Engineering and Informatics (ICEEI), Best Student Paper Award

  8. arXiv:1810.00326  [pdf

    cs.DB

    Using Graph-Pattern Association Rules On Yago Knowledge Base

    Authors: Wahyudi, Masayu Leylia Khodra, Ary Setijadi Prihatmanto, Carmadi Machbub

    Abstract: We propose the use of Graph-Pattern Association Rules (GPARs) on the Yago knowledge base. Extending association rules for itemsets, GPARS can help to discover regularities between entities in knowledge bases. A rule-generated graph pattern (RGGP) algorithm was used for extracting rules from the Yago knowledge base and a graph-pattern association rules algorithm for creating association rules. Our… ▽ More

    Submitted 30 September, 2018; originally announced October 2018.