School of Electronics and Information Engineering, Xi’an Jiaotong University, Xi’an 710049, China
xjtu.cloud@gmail.com
Published: 2009-03-05
Abstract
Focusing on the problem that the ant colony algorithm stagnates easily and cannot fully search the solution space, a text clustering approach based on the fusion of the ant colony and genetic algorithms is proposed. The four parameters that influence the performance of the ant colony algorithm are encoded as chromosomes; the fitness function and the selection, crossover and mutation operators are then designed to find the optimal combination of parameters through a number of iterations, and the result is applied to text clustering. Simulation results on 3 test data sets show that, compared with the classical k-means clustering and the basic ant colony clustering algorithm, the proposed algorithm performs better, with the F-Measure value improved by 5.69%, 48.60% and 69.60%, respectively. It is therefore more suitable for processing larger datasets.
Yun ZHANG, Boqin FENG, Shouqiang MA, Lianmeng LIU.
Text clustering based on fusion of ant colony and genetic algorithms.
Front. Electr. Electron. Eng., 2009, 4(1): 15–19. DOI: 10.1007/s11460-009-0019-9
Today, text clustering has become an important research topic for addressing information overload in the field of information processing [1-3].
The ant colony algorithm (ACA) [4] is a novel simulated evolutionary algorithm that shows many promising characteristics [5], such as positive feedback, distributed computing and strong robustness. However, it has shortcomings: it stagnates easily, needs long computation time and cannot fully search the solution space. Moreover, its parameters are set randomly, without theoretical basis. The genetic algorithm (GA) is a global search algorithm that imitates the mechanisms of biological evolution and natural selection. It has the advantage of not easily falling into local optima; even when the defined fitness function is non-continuous, irregular or accompanied by noise, it can find the global optimum with high probability. Furthermore, it can easily be combined with other algorithms to achieve better solutions.
In this article, a text clustering algorithm based on the fusion of the ant colony algorithm and the genetic algorithm is proposed. It utilizes the genetic algorithm's strength on combinatorial optimization problems to obtain the optimal combination of the parameters needed by the ant colony algorithm, and then applies the result to a text clustering task.
Clustering models and measures
Clustering is the classification of objects into different groups, or more precisely, the partitioning of a data set into groups (clusters), so that the data in the same group are similar among themselves, and collectively different from the data of other groups. Generally, the distance measure between objects can be used to determine the similarity between them; a formalized description of the clustering model is as follows:
Given a dataset $X = \{x_1, x_2, \dots, x_n\}$, for $1 \le i \le n$, $x_i = (x_{i1}, x_{i2}, \dots, x_{im})$ is an object in $X$, and for $1 \le l \le m$, $x_{il}$ is an attribute of $x_i$. According to the inner features of the objects, $X$ can be grouped into $k$ clusters $C = \{C_1, C_2, \dots, C_k\}$, where $C_1 \cup C_2 \cup \dots \cup C_k = X$ and $C_i \cap C_j = \emptyset$ for $i \ne j$; $C$ is called a clustering space, and $C_i$ is the $i$th cluster in the clustering space.
There have been several suggestions for a measure of the similarity between two clusterings. Such a measure can be used to compare how well different clustering algorithms perform on a set of data. Generally, the sum of the squared error (SSE) of each object to its cluster center is used as the evaluation function, that is,

$$\mathrm{SSE} = \sum_{j=1}^{k} \sum_{i=1}^{n_j} d(x_i, c_j)^2, \qquad (1)$$

where $n_j$ denotes the number of objects in the $j$th cluster and $d(x_i, c_j)$ is the distance between object $x_i$ and cluster center $c_j$. The smaller the SSE value, the better the clustering results.
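As a concrete illustration, Eq. (1) can be computed directly; the sketch below is a minimal example of ours (NumPy arrays and Euclidean distance are assumptions, not part of the original):

```python
import numpy as np

def sse(X, labels, centers):
    """Sum of squared Euclidean distances from each object to its cluster center."""
    total = 0.0
    for x, j in zip(X, labels):
        total += np.linalg.norm(x - centers[j]) ** 2
    return total
```

For instance, three points on a line assigned to two centers give the sum of their individual squared distances.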
External criteria can also be used to evaluate text clustering. F-Measure [6], which combines precision and recall, is an index of system performance in the information retrieval field. The precision P and recall R of cluster j with respect to classification i are defined as

$$P(i,j) = \frac{N_{ij}}{N_j}, \qquad R(i,j) = \frac{N_{ij}}{N_i}, \qquad (2)$$

where $N_{ij}$ is the number of objects in cluster $j$ subordinated to classification $i$; $N_i$ is the number of all objects in classification $i$; $N_j$ is the number of all objects in cluster $j$. The formula of F-Measure is as follows:

$$F(i,j) = \frac{2\,P(i,j)\,R(i,j)}{P(i,j) + R(i,j)}. \qquad (3)$$
Thus, the evaluation function of the clustering results is defined as

$$F = \sum_{i} \frac{N_i}{N} \max_{j} \{F(i,j)\}, \qquad (4)$$

where $N$ is the total number of objects. A greater F-Measure means better clustering results.
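The F-Measure of Eqs. (2)-(4) can be computed from the class-by-cluster contingency counts; the following is a minimal sketch of ours (the nested-list representation of the counts is an assumption):

```python
def f_measure(counts):
    """counts[i][j] = N_ij, the number of objects of class i placed in cluster j."""
    n_classes = len(counts)
    n_clusters = len(counts[0])
    n_i = [sum(row) for row in counts]  # objects per class (N_i)
    n_j = [sum(counts[i][j] for i in range(n_classes))
           for j in range(n_clusters)]  # objects per cluster (N_j)
    n = sum(n_i)                        # total number of objects (N)
    total = 0.0
    for i in range(n_classes):
        best = 0.0
        for j in range(n_clusters):
            if counts[i][j] == 0:
                continue
            p = counts[i][j] / n_j[j]   # precision, Eq. (2)
            r = counts[i][j] / n_i[i]   # recall, Eq. (2)
            best = max(best, 2 * p * r / (p + r))  # F(i,j), Eq. (3)
        total += n_i[i] / n * best      # weighted best match, Eq. (4)
    return total
```

A perfect clustering (each class isolated in its own cluster) yields an F-Measure of 1.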
Fusions of ant colony and genetic algorithms
Clustering algorithm based on ant-foraging rules
After long-term observation, biologists found that a moving ant may deposit some pheromone (a particular chemical that ants can smell) on the ground while walking, thus marking the path it follows with a trail of this substance. By sensing pheromone trails, foragers can choose the shortest among the available paths from their nest to feeding sources and back. The collective behavior that emerges is a form of autocatalytic behavior: the more ants follow a trail, the more attractive that trail becomes to subsequent ants. The process is thus characterized by a positive feedback loop. The ant colony algorithm is a heuristic algorithm derived from the study of the behavior of real ant colonies.
By applying the ant foraging rules to text clustering, each datum can be viewed as an ant with different attributes, a cluster center can be viewed as a food source searched for by the ants, and the clustering analysis can be considered as the ant foraging process. In each search iteration, every ant decides its next position by computing a transition probability based on the intensity of the trail and heuristic information such as the visibility on each path. At time $t$, the transition probability from datum $x_i$ to cluster center $j$ is defined as

$$p_{ij}(t) = \frac{[\tau_{ij}(t)]^{\alpha}\,[\eta_{ij}]^{\beta}}{\sum_{s=1}^{K} [\tau_{is}(t)]^{\alpha}\,[\eta_{is}]^{\beta}}, \qquad (5)$$

where α is the trail heuristic factor that reflects the relative importance of the accumulated intensity of the trail when ants are walking; β is the visibility heuristic factor that reflects the relative importance of visibility when ants are walking; $\tau_{ij}(t)$ is the intensity of the trail on the path from datum $i$ to cluster center $j$ at time $t$. At the initial time the intensity of the trail is the same on each path, that is, $\tau_{ij}(0)$ is a constant; $\eta_{ij}$ is the visibility, an a priori heuristic factor used when ants are walking, taken inversely proportional to the distance $d(x_i, c_j)$.
After each iteration, the cluster centers are changed, and the trail intensity from each datum to each cluster center is adjusted by the following rule:

$$\tau_{ij}(t+1) = \rho\,\tau_{ij}(t) + \Delta\tau_{ij}, \qquad (6)$$

where ρ is a coefficient such that 1 − ρ represents the evaporation of the trail, and $\Delta\tau_{ij}$ is the quantity per unit of length of the trail substance (pheromone in real ants) laid on the path from datum $i$ to cluster center $j$ between time $t$ and $t+1$. $Q$ is a constant; a greater $Q$ means a faster trail accumulating velocity on the routes covered by ants, which affects the convergence rate of the algorithm to a certain extent. Each time an ant transfers from one cluster center to another, the cluster centers change, which starts the next clustering loop. The loop stops when the clustering results converge.
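Under these definitions, Eqs. (5) and (6) can be sketched as follows (a minimal illustration of ours, not the authors' code; the array shapes and the externally supplied Δτ values are assumptions):

```python
import numpy as np

def transition_probs(tau_i, eta_i, alpha, beta):
    """Eq. (5): p_ij for one datum x_i over all K cluster centers.
    tau_i[j] is the trail intensity to center j; eta_i[j] is the visibility."""
    w = (tau_i ** alpha) * (eta_i ** beta)
    return w / w.sum()  # normalize over all candidate centers

def update_trail(tau, rho, delta):
    """Eq. (6): the old trail evaporates by (1 - rho), new pheromone is added."""
    return rho * tau + delta
```

With uniform trail and visibility, the transition probabilities are uniform as well; increasing α or β sharpens the preference for strong trails or near centers, respectively.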
Fusion of ant colony and genetic algorithms
In the ant foraging based clustering algorithm, many parameters need to be initialized, and they greatly influence the performance of the algorithm. Thus, determining an optimal combination of these parameters to achieve optimal performance is a complex combinatorial optimization problem that has no theoretical basis until now; generally, the parameters are set by experience [4,7,8]. In Ref. [9], the ranges of these parameters when using the ant colony algorithm to solve the TSP are given as follows: 0 ≤ α ≤ 5, 0 ≤ β ≤ 5, 0.1 ≤ ρ ≤ 0.99, 10 ≤ Q ≤ 10000. However, there is no evidence that these ranges are also applicable to other applications. We carried out many experiments on text clustering, and the results were not satisfactory.
Thus, we propose a fusion of the ant colony algorithm and the genetic algorithm in this article. Based on the combinatorial optimization ability of the genetic algorithm, the four parameters used in the ant colony algorithm, that is, α, β, ρ and Q, are encoded as chromosomes in the genetic algorithm. The scope of each chromosome is set large enough. The fitness function and the selection, crossover and mutation operators are designed. Finally, the optimal combination of the parameters is found after repeated iterations, making the ant colony algorithm achieve optimal performance on the specific problem being solved. The fitness function should be designed according to the features of that problem; in this article, the evaluation function F-Measure is used as the fitness function, so each individual chromosome is judged by the quality of the clustering results it produces.
The pseudo-code of the ant colony and genetic clustering algorithm (ACGA) is shown as follows:
Step 1 Initialize parameters and set the scope of each chromosome.
Step 2 Generate the initial population.
For i=1 to PopSize do
    Generate population[i].chrom randomly
Step 3 Use the ant colony algorithm to obtain the fitness value.
For i=1 to PopSize do
{ //for every individual in the population
    (α,β,ρ,Q) = population[i].chrom;
    For j=1 to K do //select the initial centers
        centers[j] = dataset[j];
    While (N < maxNACA && the centers are still changing)
    {
        For every xi in X //for each datum xi in dataset X
        {
            For j=1 to K do
                compute pij by Eq. (5)
            Move xi to Cj, where pij = max{pis, 0 < s ≤ K}
        }
        For j=1 to K do //update centers
            update new centers[j]
        For every datum–center pair (i,j) do //update trail
            compute τij by Eq. (6)
    }
    population[i].fitness = F-Measure, computed by Eq. (4)
}
Step 4 Genetic operations.
Rank all the individuals in the current population by their fitness values, select new individuals by roulette wheel selection, and then generate the new population through single-point crossover and mutation operations.
Step 5 Repeat Steps 3 and 4 until the iteration number reaches maxNGA.
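The genetic operators of Step 4 can be sketched in Python as follows. This is a hedged illustration of ours, not the authors' implementation: the parameter bounds, the list-based chromosome representation and the helper names are assumptions.

```python
import random

# Hypothetical bounds for (alpha, beta, rho, Q); the paper only requires
# the scope of each chromosome to be "large enough".
BOUNDS = [(0.0, 100.0), (0.0, 100.0), (0.0, 1.0), (10.0, 1000.0)]

def roulette_select(population, fitnesses):
    """Pick one chromosome with probability proportional to its fitness."""
    total = sum(fitnesses)
    r = random.uniform(0, total)
    acc = 0.0
    for chrom, fit in zip(population, fitnesses):
        acc += fit
        if acc >= r:
            return chrom
    return population[-1]

def single_point_crossover(a, b):
    """Swap the gene tails of two chromosomes after a random cut point."""
    cut = random.randint(1, len(a) - 1)
    return a[:cut] + b[cut:], b[:cut] + a[cut:]

def mutate(chrom, p_m):
    """With probability p_m per gene, resample it uniformly within its bounds."""
    return [random.uniform(*BOUNDS[k]) if random.random() < p_m else g
            for k, g in enumerate(chrom)]
```

Each generation would select parents by `roulette_select`, recombine them with `single_point_crossover`, mutate the offspring, and re-evaluate fitness by running the ACA with the decoded (α, β, ρ, Q).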
Experiments and results
Dataset
The dataset is extracted from the universal standard text collection Reuters-21578, in which the documents were collected from the Reuters newswire in 1987; 135 different categories have been assigned to the 21578 Reuters documents. We selected several categories from it and constructed 3 datasets, which differ in size and in the number of topics; they are described in Table 1. The vector space model is used here: each document is converted from its original format (i.e., a string of characters) to a word vector $d = (w_1, w_2, \dots, w_m)$, where $w_i$ is the weight of the $i$th word. An information gain (IG) based approach is used for feature selection, with the threshold set to 0.5, meaning that words whose IG value is lower than this threshold are discarded. The popular LTC weighting method is used. Finally, the dimensions of the word vectors in the three datasets are 145, 155 and 173, respectively. F-Measure and SSE are used to evaluate the performance of the k-means algorithm, the ant colony algorithm (ACA) and the ACGA.
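The LTC weighting step can be sketched as follows. This is our own minimal illustration, assuming the standard SMART "ltc" scheme (logarithmic term frequency times inverse document frequency, with cosine normalization); the dictionary-based representation is an assumption:

```python
import math

def ltc_weights(tf_doc, df, n_docs):
    """LTC weighting for one document.
    tf_doc: {term: raw count in the document}; df: {term: document frequency};
    n_docs: total number of documents in the collection."""
    # l*t: logarithmic term frequency times inverse document frequency
    w = {t: (1 + math.log(tf)) * math.log(n_docs / df[t])
         for t, tf in tf_doc.items() if tf > 0}
    # c: cosine normalization so the resulting vector has unit length
    norm = math.sqrt(sum(v * v for v in w.values()))
    return {t: v / norm for t, v in w.items()} if norm > 0 else w
```

After weighting, each document becomes a unit-length vector over the selected features, which is the representation the clustering algorithms operate on.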
Experimental results
Comparison of clustering results
The predefined cluster number K is set as the number of categories of each dataset, and TotalNum is the size of each dataset. The parameters of the ACA are set as follows: α=95, β=45, ρ=0.5, Q=400, and the maximum number of iterations maxNACA=30. The parameters of the ACGA are set as follows: the population size PopSize=30, the crossover probability pC=0.8, the mutation probability pM=0.8, and the maximum number of iterations maxNGA=50. To verify the sensitivity of the various algorithms to the initial cluster centers, two conditions are set for choosing them: initc=1, a poor choice, in which the initial centers are chosen from the same category; and initc=2, a better choice, in which the initial centers are chosen from different categories. The experimental results are illustrated in Tables 2 and 3 and Figs. 1 and 2.
As can be seen from Tables 2 and 3, the clustering performance of the ACGA and the ACA is better than that of the k-means algorithm. The F-Measure of the ACGA is improved by 5.69%, 48.60% and 69.60% compared with the k-means algorithm when the initial cluster centers are poorly chosen. This also indicates that the ACGA is more suitable than the k-means algorithm for dealing with larger datasets. In addition, comparing the results obtained with the different methods of selecting initial cluster centers, we can see that the ACA is sensitive to the initial cluster centers, as is the k-means algorithm: if the initial cluster centers are inappropriately selected, it is difficult to obtain good clustering results. The fusion algorithm of the ant colony and genetic algorithms reduces this sensitivity to improper selection of the initial cluster centers and can obtain better clustering results in either case.
It can also be seen from Fig. 3 that, though the k-means clustering algorithm can achieve a small SSE value, in most cases the ACGA obtains an SSE value smaller than that of k-means as well as the optimum F-Measure value. This further proves that the ACGA is an effective clustering method.
It must be pointed out that the ACGA has lower efficiency than the ACA and the k-means algorithm, because it needs to execute the ACA multiple times. However, this shortcoming is acceptable given the advantages the ACGA brings, such as reduced sensitivity to the initial cluster centers, automatic discovery of the globally optimal combination of the parameters in the ACA, and improved clustering results.
Influence of different scopes of parameters on ant colony algorithm
When the ACA is used to solve classical TSP problems, the scopes of the parameters are set as follows [9]: 0≤α≤5, 0≤β≤5, 0.1≤ρ≤0.99, 10≤Q≤10000. The scopes of the parameters in the ACGA are set as follows: 0≤α≤100, 0≤β≤100, 0≤ρ≤1, 10≤Q≤1000. The other parameters are set the same as in Sect. 4.2.1. Testing on the three datasets separately, the optimal F-Measure values found by the genetic algorithm within 50 generations are illustrated in Table 4.
It can be seen from Table 4 that the scopes of the parameters used when the ACA solves TSP problems are not suitable for all other problems, such as the text clustering problem. Compared with the ACA, the algorithm proposed in this article better addresses the problem of setting multiple parameter values.
Conclusions
The genetic algorithm (GA) is a self-adaptive probabilistic search method for solving optimization problems, which has been applied widely in science and engineering. The ant colony algorithm simulates the social behavior of ant colonies and has become a research hotspot in recent years. The ant colony-genetic fusion clustering algorithm proposed in this article combines the two, using the genetic algorithm's ability to solve combinatorial optimization problems to determine the optimal values of multiple parameters in the ant colony algorithm, and applies the result to the text clustering task to achieve better results.
Future research will focus on further improving the efficiency of the ant colony-genetic fusion algorithm and on studying the setting of the various parameters in the ant colony algorithm theoretically.
[1] Liu Y C, Wang X L, Xu Z M, Guan Y. A survey of document clustering. Journal of Chinese Information Processing, 2006, 20(3): 55–62 (in Chinese)
[2] Sasaki M, Shinnou H. Spam detection using text clustering. In: Proceedings of the 2005 International Conference on Cyberworlds (CW'05), Singapore. 2005, 316–319
[3] He F, Ding X Q. Combining text clustering and retrieval for corpus adaptation. Proceedings of SPIE. 2007, 6500: 65000P1–7
[4] Dorigo M, Blum C. Ant colony optimization theory: a survey. Theoretical Computer Science, 2005, 344(2-3): 243–278
[5] Zhu X L, Li J Z. An ant colony system-based optimization scheme of data mining. In: Proceedings of the 6th International Conference on Intelligent Systems Design and Applications (ISDA'06), Jinan, Shandong, China. 2006, 400–403
[6] van Rijsbergen C J. Information Retrieval. 2nd ed. London: Butterworths, 1979
[7] Wu C M, Chen Z, Jiang M. The research on initialization of ants system and configuration of parameters for different TSP problems in ant algorithm. Acta Electronica Sinica, 2006, 34(8): 1530–1533 (in Chinese)
[8] Huang Y Q, Liang C Y, Zhang X D. Parameter establishment of an ant system based on uniform design. Control and Decision, 2006, 21(1): 93–96 (in Chinese)
[9] Duan H B. Ant Algorithm: Theory and Its Applications. Beijing: Science Press, 2005 (in Chinese)
RIGHTS & PERMISSIONS
Higher Education Press and Springer-Verlag Berlin Heidelberg