ARTÍCULO
TITULO

Clustering Analysis with Embedding Vectors: An Application to Real Estate Market Delineation

Changro Lee    

Resumen

Although clustering analysis is a popular tool in unsupervised learning, it is inefficient for the datasets dominated by categorical variables, e.g., real estate datasets. To apply clustering analysis to real estate datasets, this study proposes an entity embedding approach that transforms categorical variables into vector representations. Three variants of a clustering algorithm, i.e., the clustering based on the traditional Euclidean distance, the Gower distance, and the embedding vectors, are applied to the land sales records to delineate the real estate market in Gwacheon-si, Gyeonggi province, South Korea. Then, the relevance of the resultant submarkets is evaluated using the root mean squared errors (RMSE) obtained from a hedonic pricing model. The results show that the RMSE in the embedding vector-based algorithm decreases substantially from 0.076-0.077 to 0.069. This study shows that the clustering algorithm empowered by embedding vectors outperforms the conventional algorithms, thereby enhancing the relevance of the delineated submarkets.

 Artículos similares

       
 
Andrea Adriani, Stefano Serra-Capizzano and Cristina Tablino-Possio    
We consider the Helmholtz equation and the fractional Laplacian in the case of the complex-valued unbounded variable coefficient wave number μ" role="presentation" style="position: relative;">??µ µ , approximated by finite differences. In a rec... ver más
Revista: Algorithms

 
Jiusheng Du, Chengyang Meng and Xingwang Liu    
This study utilizes taxi trajectory data to uncover urban residents? travel patterns, offering critical insights into the spatial and temporal dynamics of urban mobility. A fusion clustering algorithm is introduced, enhancing the clustering accuracy of t... ver más
Revista: Applied Sciences

 
Hongyu Shao, Sizhe Pan, Yufei Song and Quanfu Li    
In the context of rapid product iteration, design conflicts arise from discrepancies in designers? understanding of user needs, influenced by subjective preferences, behavioural stances, and other factors. This paper proposes a product conceptual design ... ver más
Revista: Applied Sciences

 
?tefan Ionescu, Camelia Delcea, Nora Chiri?a and Ionu? Nica    
This research provides a comprehensive analysis of the dynamic interplay between agent-based modeling (ABM) and artificial intelligence (AI) through a meticulous bibliometric study. This study reveals a substantial increase in scholarly interest, particu... ver más
Revista: Algorithms

 
Sasha Petrenko, Daniel B. Hier, Mary A. Bone, Tayo Obafemi-Ajayi, Erik J. Timpson, William E. Marsh, Michael Speight and Donald C. Wunsch II    
Biomedical datasets distill many mechanisms of human diseases, linking diseases to genes and phenotypes (signs and symptoms of disease), genetic mutations to altered protein structures, and altered proteins to changes in molecular functions and biologica... ver más
Revista: Information