Journal of Hebei University (Natural Science Edition) ›› 2010, Vol. 30 ›› Issue (2): 211-215.DOI: 10.3969/j.issn.1000-1565.2010.02.021

Previous Articles     Next Articles

A Incremental Clustering Algorithm in Data Warehouse Environment

WANG Chun-cai,YANG Hua-min,ZHANG Cai-hong,GUO Wei,HAN Gui-dong   

  • Online:2010-03-25 Published:2010-03-25

Abstract: Data warehouse is a challenging field of application for data mining tasks such as clustering. Clustering online requires good result and fast-response ability at the same time. The CURE algorithm can get high-quality clusters but efficiency is relatively low. In this paper, a novel incremental CURE algorithm-InCURE is proposed, after investigating CURE and updates mode of data warehouse. CURE keeps nicely the dynamic clustering characteristic of the original algorithm, while shortens the clustering time consumedly by using the historical clustering results and dealing with added items separately. Performance evaluation of InCURE based on multidimensional data demonstrates that it is well applicable for incremental clustering in data warehouse.

Key words: clustering, data warehouse, incremental clustering, CURE

CLC Number: