TY - GEN
T1 - Algorithm for fuzzy clustering of mixed data with numeric and categorical attributes
AU - Ahmad, Amir
AU - Dey, Lipika
PY - 2005
Y1 - 2005
N2 - In many applications numeric as well as categorical features describe the data objects. A variety of algorithms have been proposed for clustering if fuzzy partitions and descriptive cluster prototypes are desired. However, most of these methods are designed for data sets with variables measured in the same scale type (only categorical, or only numeric). We have developed probabilistic distance measure to compute significance of attributes for numeric data, and distance between two categorical values. We used this distance measure with the cluster center definition proposed by Yasser El-Sonbaty and M. A. Ismail [26] to propose Fuzzy-c mean type clustering algorithm for mixed attributes data. The results of the application of the new algorithm show that new technique is quite encouraging.
AB - In many applications numeric as well as categorical features describe the data objects. A variety of algorithms have been proposed for clustering if fuzzy partitions and descriptive cluster prototypes are desired. However, most of these methods are designed for data sets with variables measured in the same scale type (only categorical, or only numeric). We have developed probabilistic distance measure to compute significance of attributes for numeric data, and distance between two categorical values. We used this distance measure with the cluster center definition proposed by Yasser El-Sonbaty and M. A. Ismail [26] to propose Fuzzy-c mean type clustering algorithm for mixed attributes data. The results of the application of the new algorithm show that new technique is quite encouraging.
UR - http://www.scopus.com/inward/record.url?scp=33744901476&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33744901476&partnerID=8YFLogxK
U2 - 10.1007/11604655_63
DO - 10.1007/11604655_63
M3 - Conference contribution
AN - SCOPUS:33744901476
SN - 3540309993
SN - 9783540309994
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 561
EP - 572
BT - Distributed Computing and Internet Technology - Second International Conference, ICDCIT 2005, Proceedings
T2 - 2nd International Conference on Distributed Computing and Internet Technology, ICDCIT 2005
Y2 - 22 December 2005 through 24 December 2005
ER -