MA Biao, ZHOU Yu, HE Jianjun. Variational Gaussian process classification algorithm for large-scale class-imbalanced data [J]., 2016, 56(3): 279-284 |
Variational Gaussian process classification algorithm for large-scale class-imbalanced data |
|
DOI:10.7511/dllgxb201603009 |
Keywords: class-imbalanced problem; Gaussian process; variational inference; large-scale data classification |
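Funding: National Natural Science Foundation of China (61503058, 61374170); Natural Science Foundation of Liaoning Province (2015020084, 2015020099); Science and Technology Research Project of the Education Department of Liaoning Province (L2014540, L2015127); Fundamental Research Funds for the Central Universities (DC201501055, DC201501060201). |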
|
Abstract views: 1674 |
Full-text downloads: 1772 |
Abstract: |
The variational Gaussian process classifier is a recently proposed, effective fast kernel algorithm for large-scale data classification. On class-imbalanced problems, however, it usually achieves lower accuracy on minority-class samples. To address this, exponential weight coefficients are introduced into the likelihood function and an inducing subset containing equal numbers of positive and negative samples is constructed, which counteracts the shift of the original algorithm's decision boundary toward the minority class. The result is an improved variational Gaussian process classification algorithm that can handle large-scale class-imbalanced problems effectively. Experimental results on ten large-scale UCI datasets show that the improved algorithm achieves much higher accuracy than the original one on class-imbalanced problems. |
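The two ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the weight coefficients are chosen or how the inducing subset is built, so the logistic link, the inverse-class-frequency weights, the uniform random sampling, and all function names below are assumptions made only for illustration.

```python
import math
import random


def inverse_frequency_weights(y):
    """Hypothetical weighting rule (an assumption, not from the paper):
    each class gets weight n / (2 * n_class), so the rarer class weighs more."""
    n = len(y)
    n_pos = sum(1 for y_i in y if y_i == 1)
    n_neg = n - n_pos
    return n / (2.0 * n_pos), n / (2.0 * n_neg)


def weighted_log_likelihood(f, y, w_pos, w_neg):
    """Class-weighted Bernoulli log-likelihood with a logistic link.

    Raising p(y_i | f_i) to the power w_i multiplies its logarithm by w_i,
    so the exponent weights show up as per-sample multipliers.
    f: latent function values; y: labels in {+1, -1}.
    """
    total = 0.0
    for f_i, y_i in zip(f, y):
        w_i = w_pos if y_i == 1 else w_neg
        z = y_i * f_i
        # Numerically stable log(sigmoid(z)).
        if z >= 0:
            log_sig = -math.log1p(math.exp(-z))
        else:
            log_sig = z - math.log1p(math.exp(z))
        total += w_i * log_sig
    return total


def balanced_inducing_indices(y, m, seed=0):
    """Pick up to m indices, the same number from each class, by uniform
    random sampling (the sampling scheme is an assumption)."""
    rng = random.Random(seed)
    pos = [i for i, y_i in enumerate(y) if y_i == 1]
    neg = [i for i, y_i in enumerate(y) if y_i == -1]
    k = min(m // 2, len(pos), len(neg))
    return sorted(rng.sample(pos, k) + rng.sample(neg, k))
```

With these weights, a single minority sample contributes as much to the objective as the whole majority class combined, and the inducing subset no longer mirrors the skewed class ratio of the full dataset. In a real variational Gaussian process classifier the weighted term would replace the expected log-likelihood inside the variational lower bound.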