Thai soundex using Thai phonetic distance algorithm
Files
Issued Date
2014
Issued Date (B.E.)
2557
Available Date
Copyright Date
Resource Type
Series
Edition
Language
eng
File Type
application/pdf
No. of Pages/File Size
65 leaves
ISBN
ISSN
eISSN
Other identifier(s)
b192203
Identifier(s)
Access Rights
Access Status
Rights
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Rights Holder(s)
Physical Location
National Institute of Development Administration. Library and Information Center
Bibliographic Citation
Citation
Chonlasith Jucksriporn (2014). Thai soundex using Thai phonetic distance algorithm. Retrieved from: http://repository.nida.ac.th/handle/662723737/4125.
Title
Thai soundex using Thai phonetic distance algorithm
Alternative Title(s)
Author(s)
Advisor(s)
Editor(s)
item.page.dc.contrubutor.advisor
Advisor's email
Contributor(s)
Contributor(s)
Abstract
Homophones are words with similar sound. Searching for homophones is not only looking for the words with similar spelling, but it should also be looking for the words with similar pronunciation. Specifically in Thai, some consonant clusters can be pronounced in different ways, such as // can be pronounced as // or // depending on cach particular word. This makes Thoi word pronunciation and dictation more dilliculi Thai sounder could not handle these consonant clusters properly: for example, many Thai soundex methods showed cncoded results of a consonant cluster /n/ in the name w as with the letter which does not represent the correct initial consonant sound, lol.Homophones are words with similar sound. Searching for homophones is not only looking for the words with similar spelling, but it should also be looking for the words with similar pronunciation. Specifically in Thai, some consonant clusters can be pronounced in different ways, such as // can be pronounced as // or // depending on cach particular word. This makes Thoi word pronunciation and dictation more dilliculi Thai sounder could not handle these consonant clusters properly: for example, many Thai soundex methods showed cncoded results of a consonant cluster /n/ in the name w as with the letter which does not represent the correct initial consonant sound, lol.
This research proposes a technique to find homophones by calculating the distance between words using their phonetics, instead of spelling. The research also proposes an approach to syllabify word using Thai Minimum Cluster based on trigram statistical model and an improved phonetic representation to resolve ambiguities in Thai standard transcription
The experimental evaluations were performed on a name corpus of Thai people and places. The results show that the proposed method achieves average precision and average recall of 99.8.3%
This research proposes a technique to find homophones by calculating the distance between words using their phonetics, instead of spelling. The research also proposes an approach to syllabify word using Thai Minimum Cluster based on trigram statistical model and an improved phonetic representation to resolve ambiguities in Thai standard transcription
The experimental evaluations were performed on a name corpus of Thai people and places. The results show that the proposed method achieves average precision and average recall of 99.8.3%
Table of contents
Description
Dissertations (Ph.D. (Computer Science))National Institute of Development Administration, 2014