Feature selection algorithm for text classification based on improved mutual information
CONG Shuai, ZHANG Ji-bin, XU Zhi-ming, WANG Yu-ying
School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
Abstract:
To address the poor text classification performance obtained with the traditional mutual information (MI) formula, a feature selection algorithm based on improved mutual information is proposed. The algorithm builds on earlier improved MI methods, which strengthen the MI value of negative features and the role of feature frequency, and adds the concepts of concentration degree and dispersion degree. Formulas embodying concentration degree and dispersion degree were constructed, and the improved mutual information was implemented on that basis. The feature selection algorithm was applied to a text classifier based on Biomimetic Pattern Recognition and compared with several other feature selection methods. The experimental results show that the improved MI feature selection method greatly outperforms traditional MI feature selection and also performs better than information gain. By introducing concentration degree and dispersion degree, the improved MI feature selection method greatly improves the performance of the text classification system.
Key words: text classification; feature selection; improved mutual information; Biomimetic Pattern Recognition
DOI:10.11916/j.issn.1005-9113.2011.03.027
CLC Number: TP391.1
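For orientation, the traditional MI baseline that the abstract refers to can be sketched as follows. This is a minimal illustration of the standard textbook estimate MI(t, c) = log(P(t, c) / (P(t) P(c))) computed from document counts, not the authors' improved formula: the abstract does not give the concentration degree and dispersion degree formulas, so they are not implemented here. The function names, the toy corpus, and the choice to take the maximum score over categories are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def mutual_information_scores(docs, labels):
    """Score each term by its maximum MI over categories (traditional MI).

    docs   -- list of token lists, one per document
    labels -- list of category labels, aligned with docs
    MI(t, c) is estimated as log(A * N / (df(t) * n_c)), where A is the
    number of documents in category c containing t, N is the total number
    of documents, df(t) is the document frequency of t, and n_c is the
    number of documents in category c.
    """
    n_docs = len(docs)
    docs_per_class = Counter(labels)
    term_df = Counter()
    term_class_df = defaultdict(Counter)

    for tokens, label in zip(docs, labels):
        for term in set(tokens):  # count each term once per document
            term_df[term] += 1
            term_class_df[term][label] += 1

    scores = {}
    for term, per_class in term_class_df.items():
        best = -math.inf
        # Only categories where the term occurs are visited; the others
        # would contribute log(0) and can never be the maximum.
        for label, a in per_class.items():
            mi = math.log(a * n_docs / (term_df[term] * docs_per_class[label]))
            best = max(best, mi)
        scores[term] = best
    return scores


def select_top_k(scores, k):
    """Return the k highest-scoring terms, ties broken alphabetically."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


if __name__ == "__main__":
    # Hypothetical toy corpus, for illustration only.
    docs = [["ball", "goal"], ["ball", "team"], ["vote", "law"], ["vote", "team"]]
    labels = ["sport", "sport", "politics", "politics"]
    scores = mutual_information_scores(docs, labels)
    print(select_top_k(scores, 2))
```

On this toy corpus the term "goal", which occurs in a single document, receives the same score as "ball", which occurs in every sport document. This frequency-insensitivity is a well-known weakness of traditional MI and is the kind of behavior that frequency-aware variants aim to correct.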