Thai rhetorical structure analysis

dc.contributor.advisorOhm Sornil, advisorth
dc.contributor.authorSomnuk Sinthupounth
dc.date.accessioned2014-05-05T08:49:41Z
dc.date.available2014-05-05T08:49:41Z
dc.date.issued2009th
dc.date.issuedBE2552th
dc.descriptionThesis (Ph.D. (Computer Science))--National Institute of Development Administration, 2009th
dc.description.abstractRhetorical Structure Analysis (RSA) explores Discourse Relations (DRs) among related Elementary Discourse Units (EDUs) in a Rhetorical Structure Tree (RS tree) to describe meaning in a text. It is very useful in many text processing tasks employing relationships among EDUs to use an input such as text understanding, summarization, discourse parsing, machine translation and question answering. The Thai language, with its distinctive linguistic characteristics, requires a unique technique. Thai linguistic characteristics have no explicit EDU boundaries, referred to as EDU constituent omissions. Within a Thai RS tree, Thai has adjacent markers, implicit markers and marker ambiguities in DR determination. This dissertation proposes a new approach to Thai RSA which consists of three steps. The first is EDU segmentation where EDUs are segmented by phrase and syntactic hidden Markov models which are derived from phrase and syntactic structure rules, respectively. The second is RS tree Construction, where an RS tree is constructed using a clustering technique with its similarity matrix calculated from the three Thai semantic rules: Repetition rules established from the repetition of EDU constituents, Omission rules established from the omission of EDU constituents, and Addition rules established by adding Markers into EDUs. In the final step, DR determination, decision tree learning whose features are derived by relating two EDUs in an RS tree using the semantic rules to determine Discourse Relations.th
dc.format.extent99 leaves : ill. ; 30 cm.th
dc.format.mimetypeapplication/pdfth
dc.identifier.doi10.14457/NIDA.the.2009.139
dc.identifier.urihttp://repository.nida.ac.th/handle/662723737/283th
dc.language.isoength
dc.publisherNational Institute of Development Administrationth
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.th
dc.subject.lccP 98.3 So55 2009th
dc.subject.otherComputational linguisticsth
dc.subject.otherNatural language processing (Computer science)th
dc.subject.otherThai language -- Rhetoric -- Computer programsth
dc.titleThai rhetorical structure analysisth
dc.typetext--thesis--doctoral thesisth
mods.genreDissertationth
mods.physicalLocationNational Institute of Development Administration. Library and Information Centerth
thesis.degree.departmentSchool of Applied Statisticsth
thesis.degree.disciplineComputer Scienceth
thesis.degree.grantorNational Institute of Development Administrationth
thesis.degree.levelDoctoralth
thesis.degree.nameDoctor of Philosophyth

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
nida-diss-b166399.pdf
Size:
12.18 MB
Format:
Adobe Portable Document Format
Description:
Full Text