Showing 1–2 of 2 results for author: Kim, S L

Searching in archive cs.
  1. arXiv:2312.11881  [pdf, other]

    cs.CL cs.AI

    Punctuation restoration Model and Spacing Model for Korean Ancient Document

    Authors: Taehong Jang, Joonmo Ahn, Sojung Lucia Kim

    Abstract: In Korean ancient documents, there is no spacing or punctuation, and they are written in classical Chinese characters. This makes it challenging for modern individuals and translation models to accurately interpret and translate them. While China has models predicting punctuation and spacing, applying them directly to Korean texts is problematic due to data differences. Therefore, we developed the…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 5 Pages, 2 Figures

  2. arXiv:2306.14592  [pdf, other]

    cs.CL cs.DL

    Transfer Learning across Several Centuries: Machine and Historian Integrated Method to Decipher Royal Secretary's Diary

    Authors: Sojung Lucia Kim, Taehong Jang, Joonmo Ahn, Hyungil Lee, Jaehyuk Lee

    Abstract: A named entity recognition and classification plays the first and foremost important role in capturing semantics in data and anchoring in translation as well as downstream study for history. However, NER in historical text has faced challenges such as scarcity of annotated corpus, multilanguage variety, various noise, and different convention far different from the contemporary language model. Thi…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 7 pages, 9 figures