Learning Chinese Word Representations From Glyphs Of Characters

Su, Tzu-Ray; Lee, Hung-Yi

arXiv:1708.04755 (cs)

[Submitted on 16 Aug 2017]

Abstract: In this paper, we propose new methods for learning Chinese word representations. Chinese characters are composed of graphical components that carry rich semantics, and Chinese learners commonly infer the meaning of a word from these components. We therefore propose models that enhance word representations with character glyphs. The glyph features are learned directly from character bitmaps by a convolutional auto-encoder (convAE), and they improve Chinese word representations that are already enhanced by character embeddings. As a further contribution, we created several evaluation datasets in traditional Chinese and made them public.
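The abstract does not spell out how glyph features enter the word representation. As a rough illustration only, the sketch below assumes one simple scheme: each character vector is its character embedding concatenated with a glyph feature, the character vectors are averaged, and the result is concatenated with the word embedding. The function names, the composition rule, and all numeric values are hypothetical and not taken from the paper; the glyph features stand in for what a trained convAE encoder might output from character bitmaps.

```python
from typing import Dict, List

Vector = List[float]


def mean_vector(vectors: List[Vector]) -> Vector:
    """Element-wise mean of equal-length vectors."""
    n = len(vectors)
    return [sum(values) / n for values in zip(*vectors)]


def compose_word_vector(
    word: str,
    word_emb: Dict[str, Vector],
    char_emb: Dict[str, Vector],
    glyph_feat: Dict[str, Vector],
) -> Vector:
    """Hypothetical composition (an assumption, not the paper's model):
    word embedding concatenated with the mean of per-character vectors,
    where each character vector is [character embedding ; glyph feature]."""
    # `+` on lists is concatenation here, not element-wise addition.
    per_char = [char_emb[c] + glyph_feat[c] for c in word]
    return word_emb[word] + mean_vector(per_char)


# Toy placeholder values chosen for readability, not results from the paper.
word_emb = {"森林": [0.5, 0.25]}
char_emb = {"森": [1.0, 0.0], "林": [0.0, 1.0]}
glyph_feat = {"森": [0.75], "林": [0.25]}  # stand-ins for convAE encoder outputs

vec = compose_word_vector("森林", word_emb, char_emb, glyph_feat)
print(vec)
```

In this toy setup the two characters of 森林 ("forest") share the 木 ("tree") component, which is the kind of graphical cue that glyph features are meant to capture.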

Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1708.04755 [cs.CL] (or arXiv:1708.04755v1 [cs.CL] for this version)
DOI: https://doi.org/10.48550/arXiv.1708.04755

Submission history

From: Tzu Ray Su
[v1] Wed, 16 Aug 2017 03:17:57 UTC (1,576 KB)
