Prediksi Harga Rumah Menggunakan K-Means Dua Tahap

Febrita, Ruth Ema (2019) Prediksi Harga Rumah Menggunakan K-Means Dua Tahap. Magister thesis, Universitas Brawijaya.

Abstract

Rumah merupakan sebuah properti yang banyak dibutuhkan seiring dengan bertambahnya populasi manusia dalam sebuah daerah. Hal tersebut menyebabkan harga rumah mengalami fluktuasi dan sulit untuk diprediksi. Harga sebuah rumah dipengaruhi oleh beberapa faktor, antara lain: atribut fisik sebuah rumah, lokasi, konsep rumah atau reputasi pengembang, serta fenomena ekonomi yang terjadi. Dengan demikian, dataset perumahan merupakan data berdimensi tinggi, karena terdapat banyak faktor yang diamati. Dalam melakukan prediksi, metode berbasis neural network merupakan metode yang paling sering digunakan. Akan tetapi, mekanisme interpolasi pada neural network yang digunakan untuk mengisi nilai atribut yang hilang dapat menyebabkan prediksi harga dalam data perumahan tidak akurat. Oleh karena itu, penelitian ini mencoba melakukan pendekatan clustering berdasarkan kemiripan atribut untuk melakukan prediksi harga. Metode yang diusulkan adalah metode K-Means dua tahap. Tujuan dari penelitian adalah untuk mengevaluasi apakah metode K-Means dua tahap mampu menghasilkan hasil cluster dengan kualitas lebih baik dibandingkan dengan metode clustering lainnya, dan apakah metode ini mampu menghasilkan akurasi hasil prediksi lebih baik. K-Means dua tahap merupakan sebuah metode K-Means yang dijalankan sebanyak dua kali secara bertahap. K-Means tahap pertama digunakan untuk menentukan cluster utama, sedangkan K-Means tahap kedua digunakan untuk menemukan sub cluster yang terdapat dalam tiap-tiap cluster utama. Dengan pembentukan sub cluster ini diharapkan akan membuat prediksi lebih akurat. Kualitas hasil cluster diukur dengan menggunakan Silhouette coefficient dan Davies-Bouldin Index, sedangkan akurasi hasil prediksi diukur dengan menggunakan Mean Absolute Percentage Error (MAPE). Setelah dilakukan pengujian dan analisis, hasil penelitian menunjukkan bahwa K-Means satu tahap dapat menghasilkan cluster dengan kualitas yang lebih baik dibandingkan dengan K-Means dua tahap. Akan tetapi, metode K-Means dua tahap lebih mampu menghasilkan prediksi yang lebih akurat dibandingkan dengan metode K-Means satu tahap dan SOM.

English Abstract

The house is a kind of property that is much needed along with the increasing human population. This causes house prices to fluctuate and is difficult to predict. The price of a house is influenced by several factors, including physical attributes of a house, location, the concept of the house or the reputation of the developer, as well as the economic phenomena occurred. Thus, the housing dataset is a high-dimensional data, because there are many factors observed. In making predictions, neural network-based methods are the most frequently used method. However, the interpolation mechanism of the neural network, which is used to fill the missing attribute values, can cause price predictions in housing data to be inaccurate. Therefore, this study tries to do a clustering approach to predict house prices based on the similarity of attributes. The proposed method is a two-stage K-Means algorithm. The purpose of this research is to evaluate whether the two-stage K-Means algorithm is able to produce better clustering results compared to other clustering methods and whether this method is able to produce better prediction accuracy. Two-stage K-Means is a K-Means method that is carried out twice gradually. The first K-Means stage is used to determine the main cluster of the data, while the second K-Means stage is used to find the sub-clusters contained in each main cluster. With the sub-cluster mechanism, it is expected that the accuracy of the predicted results will increase. The quality of cluster results is measured using the Silhouette coefficient and Davies-Bouldin Index, while the accuracy of the predicted results is measured using the Mean Absolute Percentage Error (MAPE). After doing some the testing and analysis process, the results of the study showed that one-stage K-Means can produce clusters with better quality compared to two-stage K-Means. However, the two-stage K-Means method is more capable of producing more accurate predictions than the one-stage K-Means method and SOM.

Other obstract

Item Type:	Thesis (Magister)
Identification Number:	TES/006.312/FEB/p/2019/041905799
Uncontrolled Keywords:	clustering, k-means, k-means dua tahap, prediksi, rumah, som
Subjects:	000 Computer science, information and general works > 006 Special computer methods > 006.3 Artificial intelligence > 006.31 Machine learning > 006.312 Data mining
Divisions:	S2/S3 > Magister Ilmu Komputer, Fakultas Ilmu Komputer
Depositing User:	Endang Susworini
Date Deposited:	30 May 2022 07:09
Last Modified:	30 May 2022 07:09
URI:	http://repository.ub.ac.id/id/eprint/190785

Text
Ruth Ema Febrita.pdf
Download (3MB)

Actions (login required)

View Item