Multi-objective genetic algorithm for supervised clustering
dc.contributor.advisor | Surapong Auwatanamongkol | |
dc.contributor.author | Vipa Thananant | |
dc.date.accessioned | 2023-05-15T03:32:19Z | |
dc.date.available | 2023-05-15T03:32:19Z | |
dc.date.issued | 2018 | |
dc.date.issuedBE | 2561 | th |
dc.description | Thesis (Ph.D. (Computer Science and Information Systems))--National Institute of Development Administration, 2018 | th |
dc.description.abstract | Supervised clustering organizes data instances into clusters on the basis of similarities between the data instances as well as class labels for the data instances. Supervised clustering seeks to meet multiple objectives, such as compactness of clusters, homogeneity of data in clusters with respect to their class labels, and separateness of clusters. With these objectives in mind, a new supervised clustering algorithm based on a multi-objective crowding genetic algorithm, named SC-MOGA, is proposed in this thesis. The algorithm searches for the optimal clustering solution that simultaneously achieves the three objectives mentioned above. The SC-MOGA performs very well on a small dataset, but for a large dataset it may not be able to converge to an optimal solution or can take a very long running time to converge to a solution. Hence, a data sampling method based on the Bisecting K-Means algorithm is also introduced, to find representatives for supervised clustering. This method groups the data instances of a dataset into small clusters, each containing data instances with the same class label. Data representatives are then randomly selected from each cluster. The experimental results show that SC-MOGA with the proposed data sampling method is very effective. It outperforms three previously proposed supervised clustering algorithms, namely SRIDHCR, LK-Means and SCEC, in terms of four cluster validity indexes. The experimental results show that the proposed data sampling method not only helps to reduce the number of data instances to be clustered by the SC-MOGA, but also enhances the quality of the data clustering results. Moreover, the biased initial approach is proposed in this thesis to find a good initial population to bias. The experimental results show that biased initial population of SCMOGA will improve clustering quality and the more percentage of biased initial population the better clustering quality. | th |
dc.format.extent | 121 leaves | th |
dc.format.mimetype | application/pdf | th |
dc.identifier.doi | 10.14457/NIDA.the.2018.129 | |
dc.identifier.other | b207807 | th |
dc.identifier.uri | https://repository.nida.ac.th/handle/662723737/6424 | |
dc.language.iso | eng | th |
dc.publisher | National Institute of Development Administration | th |
dc.rights | ผลงานนี้เผยแพร่ภายใต้ สัญญาอนุญาตครีเอทีฟคอมมอนส์แบบ แสดงที่มา-ไม่ใช้เพื่อการค้า-ไม่ดัดแปลง 4.0 (CC BY-NC-ND 4.0) | th |
dc.subject | genetic algorithm | th |
dc.subject.other | Computer algorithms | th |
dc.subject.other | Genetic | th |
dc.subject.other | Multiple criteria decision making | th |
dc.subject.other | อัลกอริธึม | th |
dc.title | Multi-objective genetic algorithm for supervised clustering | th |
dc.type | text--thesis--doctoral thesis | th |
mods.genre | Dissertation | th |
mods.physicalLocation | National Institute of Development Administration. Library and Information Center | th |
thesis.degree.department | Graduate School of Applied Statistics | th |
thesis.degree.discipline | Computer Science and Information Systems | th |
thesis.degree.grantor | National Institute of Development Administration | th |
thesis.degree.level | Doctoral | th |
thesis.degree.name | Doctor of Philosophy | th |