2014/2015
Automated text categorization by generalized kernel machines
Proceedings of 2014 IEEE International Conference on Information and Automation(ICIA),(pp. 376-381). IEEE
Author(s) | Jiaqi Tan
Wenye Li Haoming Li |
---|---|
Summary | Text categorization refers to the task of designing methods to automatically classify text documents into different groups. With wide applications in intelligent information processing, it has attracted much recent research attention. The classical support vector machines (SVM) algorithm has obtained significant success on this task. Inspired by the achievements of SVM, a family of related kernel methods is being widely studied. This paper investigates a novel kernel method for text categorization. Different from SVM and related approaches which operate on thousands of word term features of text corpus, our method takes concept features into consideration as well. We use a generalized regularizer which leaves concept features unregularized. With a squared-loss function to measure the empirical error, the method has a simple convex solution. In real evaluations we have verified both the effectiveness and the efficiency of the method in benchmarked text categorization applications. |