谢天,,
高超,
李邵梅,
黄瑞阳
国家数字交换系统工程技术研究中心 ??郑州 ??450002
基金项目:国家自然科学基金(61601513)
详细信息
作者简介:陈鸿昶:男,1964年生,教授,博士生导师,研究方向为通信与信息系统,大数据处理分析
谢天:男,1994年生,硕士生,研究方向为机器学习
高超:男,1982年生,博士,研究方向为计算机视觉,机器学习
李邵梅:女,1982年生,博士,研究方向为计算机视觉,机器学习
黄瑞阳:男,1986年生,博士,研究方向为网络大数据分析
通讯作者:谢天 xietianxt@foxmail.com
中图分类号:TP18计量
文章访问数:1734
HTML全文浏览量:937
PDF下载量:51
被引次数:0
出版历程
收稿日期:2018-11-20
修回日期:2019-04-21
网络出版日期:2019-05-16
刊出日期:2019-10-01
Candidate Label-Aware Partial Label Learning Algorithm
Hongchang CHEN,Tian XIE,,
Chao GAO,
Shaomei LI,
Ruiyang HUANG
National Digital Switching System Engineering & Technological R&D Center, Zhengzhou 450002, China
Funds:The National Natural Science Foundation of China (61601513)
摘要
摘要:在偏标记学习中,示例的真实标记隐藏在由一组候选标记组成的标记集中。现有的偏标记学习算法在衡量示例之间的相似度时,只基于示例的特征进行计算,缺乏对候选标记集信息的利用。该文提出一种候选标记感知的偏标记学习算法(CLAPLL),在构建图的阶段有效地结合候选标记集信息来衡量示例之间的相似度。首先,基于杰卡德距离和线性重构,计算出各个示例的标记集之间的相似度,然后结合示例相似度和标记集的相似度构建相似度图,并通过现有的基于图的偏标记学习算法进行学习和预测。3个合成数据集和6个真实数据集上实验结果表明,该文方法相比于基线算法消歧准确率提升了0.3%~16.5%,分类准确率提升了0.2%~2.8%。
关键词:偏标记学习/
弱监督学习/
消歧/
杰卡德距离/
线性重构
Abstract:In partial label learning, the true label of an instance is hidden in a label-set consisting of a group of candidate labels. The existing partial label learning algorithm only measures the similarity between instances based on feature vectors and lacks the utilization of the candidate labelset information. In this paper, a Candidate Label-Aware Partial Label Learning (CLAPLL) method is proposed, which combines effectively candidate label information to measure the similarity between instances during the graph construction phase. First, based on the jaccard distance and linear reconstruction, the similarity between the candidate labelsets of instances is calculated. Then, the similarity graph is constructed by combining the similarity of the instances and the label-sets, and then the existing graph-based partial label learning algorithm is presented for learning and prediction. The experimental results on 3 synthetic datasets and 6 real datasets show that disambiguation accuracy of the proposed method is 0.3%~16.5% higher than baseline algorithm, and the classification accuracy is increased by 0.2%~2.8%.
Key words:Partial label learning/
Weakly supervised learning/
Disambiguation/
Jaccard distance/
Linear reconstruction
PDF全文下载地址:
https://jeit.ac.cn/article/exportPdf?id=960ecfca-c425-4fe3-9ac4-cc25e3842087