Feature selection using genetic algorithm

Kanyanut Homsapaya

Feature selection using genetic algorithm

dc.contributor.advisor	Ohm Sornil	th
dc.contributor.author	Kanyanut Homsapaya	th
dc.date.accessioned	2022-06-09T06:39:11Z
dc.date.available	2022-06-09T06:39:11Z
dc.date.issued	2017	th
dc.date.issuedBE	2560	th
dc.description.abstract	In this dissertation, a method of feature selection in machine learning, and more particularly supervised learning is presented. Supervised learning is a machine learning task that infers answers from a training data set. In machine learning, training datasets are employed in order to create a model which enables reasonable predictions, while in supervised learning, each training example is a training set consisting of instances and labels, and the learning objective is to be able to predict the label of a new unseen instance with as few errors as possible. In recent years, many proposed learning algorithms that perform fairly well have been proposed. The factors to accomplish successful model building depend on many aspects such as noise and size of data. Most often for learning algorithms, it is assumed that training data is represented by a vector of numerical data for which each measurement is a feature, and an important question related to machine learning is how to represent instances using vectors of these to yield high learning performance.	th
dc.description.abstract	Nowadays, data volumes are tremendously large in terms of aspects such as the number of features and most machine learning and data mining techniques may not be productive for high dimensional data, query accuracy and efficiency lessen swiftly as the dimension increases, the so-called curse of dimensionality. One of the requirements of good representation is conciseness since representation that uses too many features incurs major computational difficulties and may lead to poor prediction performance. Attribute selection is one of the significant methods in which the	th
dc.description.abstract	objective is to choose a small subset to predict the target sufficiently well. Feature selection selects the most importance features, eliminates irrelevant and redundant features from the entire set of attributes, reduces the computational complexity of any learning and prediction algorithm used in the process, and reduces cost by excluding unselected features.	th
dc.description.abstract	A floating search is commonly used for the searching process. They are heuristic search methods which dynamically change the number of attributes included or eliminated at each step; they have produced very good results. The principal improvement of this thesis is focused on filter-based feature selection using genetic algorithm technique. Filters are normally less computationally intensive than wrapper method because wrappers apply a predictive model to score feature subsets. This approach is selected to be fast to reckon, whereas rooted to spot apprehending the goodness of the feature subsets. GA method can help to gain more diversity of population and provides us a way of reducing search space. Moreover, the contributions related to improves the contemporary sequential forward floating selection algorithm. In this thesis, an improving feature step using genetic algorithm is proposed as an additional step in a floating search. The objective is to eliminate weak features and replace a predominant one at each sequential step. From the research observations, the proposed method was discovered to be beneficial in selecting features that can boost the accuracy of data classification. Moreover, the experimental outcomes show that the proposed method with the genetic algorithm enhanced classification correctness and cut down data dimensionality for supervised learning problems.	th
dc.format.extent	110 leaves	th
dc.format.mimetype	application/pdf	th
dc.identifier.doi	10.14457/NIDA.the.2017.43
dc.identifier.other	b201179	th
dc.identifier.uri	https://repository.nida.ac.th/handle/662723737/5874	th
dc.language.iso	eng	th
dc.publisher	National Institute of Development Administration	th
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	th
dc.subject	Supervised learning	th
dc.subject	Machine learning	th
dc.subject.other	Genetic algorithm	th
dc.title	Feature selection using genetic algorithm	th
dc.title.alternative	Feature selection using genetic algorithm	th
dc.type	text--thesis--doctoral thesis	th
mods.genre	Dissertation	th
mods.physicalLocation	National Institute of Development Administration. Library and Information Center	th
thesis.degree.department	School of Applied Statistics	th
thesis.degree.discipline	Computer Science and Information Systems	th
thesis.degree.grantor	National Institute of Development Administration	th
thesis.degree.level	Doctoral	th
thesis.degree.name	Doctor of Philosophy	th