Text clustering based on fusion of ant colony and genetic algorithms
Yun ZHANG, Boqin FENG, Shouqiang MA, Lianmeng LIU
Text clustering based on fusion of ant colony and genetic algorithms
Focusing on the problem that the ant colony algorithm gets into stagnation easily and cannot fully search in solution space, a text clustering approach based on the fusion of the ant colony and genetic algorithms is proposed. The four parameters that influence the performance of the ant colony algorithm are encoded as chromosomes, thereby the fitness function, selection, crossover and mutation operator are designed to find the combination of optimal parameters through a number of iteration, and then it is applied to text clustering. The simulation results show that compared with the classical k-means clustering and the basic ant colony clustering algorithm, the proposed algorithm has better performance and the value of F-Measure is enhanced by 5.69%, 48.60% and 69.60%, respectively, in 3 test data sets. Therefore, it is more suitable for processing a larger dataset.
ant colony clustering / genetic algorithm / fusion / text clustering
[1] |
Liu Y C, Wang X L, Xu Z M, Guan Y. A survey of document clustering. Journal of Chinese Information Processing, 2006, 20(3): 55–62 (in Chinese)
|
[2] |
Sasaki M, Shinnou H. Spam detection using text clustering. In: Proceedings of the 2005 International Conference on Cyberworlds (CW’05), Singapore. 2005, 316–319
|
[3] |
He F, Ding X Q. Combining text clustering and retrieval for corpus adaptation. Proceedings of SPIE. 2007, 6500: 65000P1–7
|
[4] |
Dorigo M, Blum C. Ant colony optimization theory: a survey. Theoretical Computer Science, 2005, 344(2-3): 243–278
CrossRef
Google scholar
|
[5] |
Zhu X L, Li J Z. An ant colony system-based optimization scheme of data mining. In: Proceedings of the 6th International Conference on Intelligent Systems Design and Applications (ISDA’06), Jinan, Shandong, China. 2006, 400–403
|
[6] |
van Rijsbergen C J. Information Retrieval. 2nd ed. London: Butterworths, 1979
|
[7] |
Wu C M, Chen Z, Jiang M. The research on initialization of ants system and configuration of parameters for different TSP problems in ant algorithm. Acta Electronica Sinica, 2006, 34(8): 1530–1533 (in Chinese)
|
[8] |
Huang Y Q, Liang C Y, Zhang X D. Parameter establishment of an ant system based on uniform design. Control and Decision, 2006, 21(1): 93–96 (in Chinese)
|
[9] |
Duan H B. Ant Algorithm–Theory and Its Applications. Beijing: Science Press, 2005 (in Chinese)
|
/
〈 | 〉 |