Thai soundex using Thai phonetic distance algorithm
dc.contributor.advisor | Ohm Sornil | th |
dc.contributor.author | Chonlasith Jucksriporn | th |
dc.date.accessioned | 2019-01-28T08:50:23Z | |
dc.date.available | 2019-01-28T08:50:23Z | |
dc.date.issued | 2014 | th |
dc.date.issuedBE | 2557 | th |
dc.description | Dissertations (Ph.D. (Computer Science))National Institute of Development Administration, 2014 | th |
dc.description.abstract | Homophones are words with similar sound. Searching for homophones is not only looking for the words with similar spelling, but it should also be looking for the words with similar pronunciation. Specifically in Thai, some consonant clusters can be pronounced in different ways, such as // can be pronounced as // or // depending on cach particular word. This makes Thoi word pronunciation and dictation more dilliculi Thai sounder could not handle these consonant clusters properly: for example, many Thai soundex methods showed cncoded results of a consonant cluster /n/ in the name w as with the letter which does not represent the correct initial consonant sound, lol.Homophones are words with similar sound. Searching for homophones is not only looking for the words with similar spelling, but it should also be looking for the words with similar pronunciation. Specifically in Thai, some consonant clusters can be pronounced in different ways, such as // can be pronounced as // or // depending on cach particular word. This makes Thoi word pronunciation and dictation more dilliculi Thai sounder could not handle these consonant clusters properly: for example, many Thai soundex methods showed cncoded results of a consonant cluster /n/ in the name w as with the letter which does not represent the correct initial consonant sound, lol. | th |
dc.description.abstract | This research proposes a technique to find homophones by calculating the distance between words using their phonetics, instead of spelling. The research also proposes an approach to syllabify word using Thai Minimum Cluster based on trigram statistical model and an improved phonetic representation to resolve ambiguities in Thai standard transcription | th |
dc.description.abstract | The experimental evaluations were performed on a name corpus of Thai people and places. The results show that the proposed method achieves average precision and average recall of 99.8.3% | th |
dc.format.extent | 65 leaves | th |
dc.format.mimetype | application/pdf | th |
dc.identifier.doi | 10.14457/NIDA.the.2014.28 | |
dc.identifier.other | b192203 | th |
dc.identifier.uri | http://repository.nida.ac.th/handle/662723737/4125 | th |
dc.language.iso | eng | th |
dc.publisher | National Institute of Development Administration | th |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | th |
dc.subject | Phonetic distance | th |
dc.subject | Phonetic transcription | th |
dc.subject.other | Phonetic | th |
dc.subject.other | Thai language -- Phonetics | th |
dc.title | Thai soundex using Thai phonetic distance algorithm | th |
dc.type | text--thesis--doctoral thesis | th |
mods.genre | Dissertation | th |
mods.physicalLocation | National Institute of Development Administration. Library and Information Center | th |
thesis.degree.department | School of Applied Statistics | th |
thesis.degree.discipline | Computer Science | th |
thesis.degree.grantor | National Institute of Development Administration | th |
thesis.degree.level | Doctoral | th |
thesis.degree.name | Doctor of Philosophy | th |