Computer Science ›› 2009, Vol. 36 ›› Issue (10): 217-221.
Previous Articles Next Articles
YAN Peng, ZHENG Xue-feng, ZHU Jian-yong, XIAO Yun-hong
Online:
Published:
Abstract: As one of the most classical TC approaches,k-NN is advantaged in tackling concept drift. However,to avoid curse of dimensionality,it has to employ FS(feature selection) method to reduce dimensionality of feature space and improve operation efficiency. But FS process will generally cause information losing and thus has some side-effects on the whole performance of approach. According to sparsity of text vectors, an optimized k-NN approach was presented in paper. This optimized approach greatly simplified euclidean distance model and reduced the operation cost without any information losing. So it can simultaneously achieve much higher both performance and efficiency than general k-NN approach. It then enhanced the advantage of k-NN in managing concept drift.
Key words: Text categorization,Feature selection,k-NN,Concept drift
YAN Peng, ZHENG Xue-feng, ZHU Jian-yong, XIAO Yun-hong. Optimized k-NN Text Categorization Approach[J].Computer Science, 2009, 36(10): 217-221.
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: https://www.jsjkx.com/EN/
https://www.jsjkx.com/EN/Y2009/V36/I10/217
Cited