Thai rhetorical structure analysis
by Somnuk Sinthupoun
Title: | Thai rhetorical structure analysis |
Author(s): | Somnuk Sinthupoun |
Advisor: | Ohm Sornil, advisor |
Degree name: | Doctor of Philosophy |
Degree level: | Doctoral |
Degree discipline: | Computer Science |
Degree department: | School of Applied Statistics |
Degree grantor: | National Institute of Development Administration |
Issued date: | 2009 |
Digital Object Identifier (DOI): | 10.14457/NIDA.the.2009.139 |
Publisher: | National Institute of Development Administration |
Abstract: |
Rhetorical Structure Analysis (RSA) explores Discourse Relations (DRs) among related Elementary Discourse Units (EDUs) in a Rhetorical Structure Tree (RS tree) to describe meaning in a text. It is very useful in many text processing tasks employing relationships among EDUs to use an input such as text understanding, summarization, discourse parsing, machine translation and question answering. The Thai language, with its distinctive linguistic characteristics, requires a unique technique. Thai linguistic characteristics have no explicit EDU boundaries, referred to as EDU constituent omissions. Within a Thai RS tree, Thai has adjacent markers, implicit markers and marker ambiguities in DR determination. This dissertation proposes a new approach to Thai RSA which consists of three steps. The first is EDU segmentation where EDUs are segmented by phrase and syntactic hidden Markov models which are derived from phrase and syntactic structure rules, respectively. The second is RS tree Construction, where an RS tree is constructed using a clustering technique with its similarity matrix calculated from the three Thai semantic rules: Repetition rules established from the repetition of EDU constituents, Omission rules established from the omission of EDU constituents, and Addition rules established by adding Markers into EDUs. In the final step, DR determination, decision tree learning whose features are derived by relating two EDUs in an RS tree using the semantic rules to determine Discourse Relations. |
Description: |
Thesis (Ph.D. (Computer Science))--National Institute of Development Administration, 2009 |
Subject(s): | Computational linguistics
Natural language processing (Computer science) Thai language -- Rhetoric -- Computer programs |
Resource type: | Dissertation |
Extent: | 99 leaves : ill. ; 30 cm. |
Type: | Text |
File type: | application/pdf |
Language: | eng |
Rights: | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. |
URI: | http://repository.nida.ac.th/handle/662723737/283 |
Files in this item (CONTENT) |
|
View ทรัพยากรสารสนเทศทั้งหมดในคลังปัญญา ใช้เพื่อประโยชน์ทางการเรียนการสอนและการค้นคว้าเท่านั้น และต้องมีการอ้างอิงแหล่งที่มาทุกครั้งที่นำไปใช้ ห้ามดัดแปลงเนื้อหา และทำสำเนาต่อ รวมถึงไม่ให้อนุญาตนำไปใช้ประโยชน์เพื่อการค้า ไม่ว่ากรณีใด ๆ ทั้งสิ้น
|
This item appears in the following Collection(s) |
|
|