Multi-objective genetic algorithm for supervised clustering
Files
Issued Date
2018
Issued Date (B.E.)
2561
Available Date
Copyright Date
Resource Type
Series
Edition
Language
eng
File Type
application/pdf
No. of Pages/File Size
121 leaves
ISBN
ISSN
eISSN
Other identifier(s)
b207807
Identifier(s)
Access Rights
Access Status
Rights
ผลงานนี้เผยแพร่ภายใต้ สัญญาอนุญาตครีเอทีฟคอมมอนส์แบบ แสดงที่มา-ไม่ใช้เพื่อการค้า-ไม่ดัดแปลง 4.0 (CC BY-NC-ND 4.0)
Rights Holder(s)
Physical Location
National Institute of Development Administration. Library and Information Center
Bibliographic Citation
Citation
Vipa Thananant (2018). Multi-objective genetic algorithm for supervised clustering. Retrieved from: https://repository.nida.ac.th/handle/662723737/6424.
Title
Multi-objective genetic algorithm for supervised clustering
Alternative Title(s)
Author(s)
Advisor(s)
Editor(s)
item.page.dc.contrubutor.advisor
Advisor's email
Contributor(s)
Contributor(s)
Abstract
Supervised clustering organizes data instances into clusters on the basis of
similarities between the data instances as well as class labels for the data instances.
Supervised clustering seeks to meet multiple objectives, such as compactness of
clusters, homogeneity of data in clusters with respect to their class labels, and
separateness of clusters. With these objectives in mind, a new supervised clustering
algorithm based on a multi-objective crowding genetic algorithm, named SC-MOGA,
is proposed in this thesis. The algorithm searches for the optimal clustering solution
that simultaneously achieves the three objectives mentioned above. The SC-MOGA
performs very well on a small dataset, but for a large dataset it may not be able to
converge to an optimal solution or can take a very long running time to converge to a
solution. Hence, a data sampling method based on the Bisecting K-Means algorithm is
also introduced, to find representatives for supervised clustering. This method groups
the data instances of a dataset into small clusters, each containing data instances with
the same class label. Data representatives are then randomly selected from each
cluster. The experimental results show that SC-MOGA with the proposed data
sampling method is very effective. It outperforms three previously proposed
supervised clustering algorithms, namely SRIDHCR, LK-Means and SCEC, in terms
of four cluster validity indexes. The experimental results show that the proposed data
sampling method not only helps to reduce the number of data instances to be clustered
by the SC-MOGA, but also enhances the quality of the data clustering results. Moreover, the biased initial approach is proposed in this thesis to find a good initial
population to bias. The experimental results show that biased initial population of SCMOGA will improve clustering quality and the more percentage of biased initial
population the better clustering quality.
Table of contents
Description
Thesis (Ph.D. (Computer Science and Information Systems))--National Institute of Development Administration, 2018