
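Harbin Institute of Technology, School of Computer Science and Technology / National Pilot School of Software: Graduate Advisor Profile, Hongzhi Wang (王宏志)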

Site editor, Free考研网 / 2019-05-25

Basic Information

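Hongzhi Wang (王宏志), male, Han ethnicity, born in 1978. Professor and Ph.D. supervisor at the Massive Data Computing Research Center, School of Computer Science, Harbin Institute of Technology.

His research interests include data quality, big data management, graph data management, and Web data management. He has published more than 200 papers, with more than 100 SCI/EI-indexed entries and more than 900 citations by other authors. He has led more than 10 projects, including a key project of the National Natural Science Foundation of China (NSFC) and international cooperation projects, and has taken part as a principal member in NSFC key projects, 863 key projects, 973 projects, a number of provincial and ministerial key projects, and several international cooperation projects. He has received the Microsoft Fellowship, the Chinese Excellent Database Engineer title, and the IBM Ph.D. Fellowship, and as the fourth-ranked awardee he received one Heilongjiang Province Natural Science Award and one Ministry of Education Science and Technology Progress Award for Higher Education Institutions.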

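Work Experience
Time: Position
2015-present: School of Computer Science, Harbin Institute of Technology, Professor
2013-present: School of Computer Science, Harbin Institute of Technology, Ph.D. supervisor
2010-2015: School of Computer Science, Harbin Institute of Technology, Associate Professor
2012.9-2013.9: UC Irvine, Visiting Scholar
2011.4-2011.10: Microsoft Research Asia, "Starring Track" program, Visiting Scholar
2008-2010: School of Computer Science, Harbin Institute of Technology, Lecturer
2006.10-2007.4: National University of Singapore, Intern
2004.5-2005.4: University of New South Wales, Visiting Research Assistant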


Education
2003-2008, Harbin Institute of Technology, Ph.D.
2001-2003, Harbin Institute of Technology, M.S.
1997-2001, Harbin Institute of Technology, B.S.


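Honors and Awards
2018, CSC-IBM China Outstanding Teacher Award

2018, Baosteel Award

2018, Longjiang Scholar (Young Scholar)

2018, "Algorithm Design and Analysis" selected as a course under development by the China University Computer Education MOOC Alliance

2018, "Big Data Algorithms" selected as an outstanding course of the China University Computer Education MOOC Alliance

2018, National University Artificial Intelligence and Big Data Academic Innovation Award

2017, Heilongjiang Province Youth Science and Technology Award

2017, "Big Data Algorithms" named a "2017 National Quality Online Open Course"

2017, "Big Data Algorithms" and "Algorithm Design and Analysis" named "2017 Heilongjiang Province Quality Online Open Courses"

2017, the book "Big Data Algorithms" (《大数据算法》) won First Prize in the 22nd Outstanding Higher Education Research Achievements of the Heilongjiang Higher Education Society

2017, First Prize, 1st National Outstanding Paper Award for Young Teachers in Computer Education at Higher Education Institutions

2016, "Sa Shixuan Best Student Paper" (萨师煊优秀学生论文), National Database Conference

2016, First Prize, Ministry of Education Science and Technology Progress Award

2016, Outstanding Advisor Award for Undergraduate Innovation and Entrepreneurship Education Activities

2016, Outstanding Full-time/Part-time Student Affairs Worker

2015, Model Full-time/Part-time Student Affairs Worker, Harbin Institute of Technology

2015, "Three-Aspect Education" (三育人) Model, Harbin Institute of Technology

2015, "Scientific Chinese" (《科学中国人》) Person of the Year 2014

2014, Outstanding Advisor for Student Activities

2013, Outstanding Advisor for Student Activities

2011, First Prize, Heilongjiang Province Natural Science Award

2011, "Sa Shixuan Best Student Paper" (萨师煊优秀学生论文), National Database Conference

2009, Ph.D. thesis "XML Data Query Processing Techniques" (XML数据查询处理技术) selected as a CCF Outstanding Doctoral Dissertation

2009, Ph.D. thesis "XML Data Query Processing Techniques" (XML数据查询处理技术) selected as a Harbin Institute of Technology Outstanding Doctoral Dissertation

2007, IBM Ph.D. Fellowship

2006, Best Paper, WISE international conference

2006, Chinese Excellent Database Engineer

2005, Microsoft Fellowship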

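Invited Talks
Industrial Big Data: Background and Exploration. Forum on Industrial Big Data Development and Talent Cultivation, 2017.4, Harbin
Big Data Analysis: From Data to Intelligence. Data Science and Artificial Intelligence Forum, 2017.3, Suzhou
Educational Big Data Analysis: Methods and Exploration. China University Online Open Course Forum, 2017.1, Beijing
Big Data Quality Problems and Big Data Governance. ZTE Corporation, 2016.11, Nanjing
Big Data Cleaning: From Theory to Practice. 3rd China International Big Data Conference, 2016.9, Beijing
Exploring Big Data Curriculum Development. 13th Summer Teacher Training Program, Huazhang Branch of China Machine Press, 2016.7, Xi'an
Exploration and Practice of Big Data Cleaning. 2016 China Database Development Workshop, 2016.6, Nanchang
Big Data Management in Integrated Health Services. ICNISC 2016, 2016.4, Wuhan
Data Quality and Data Cleaning. "Cloud Computing and Big Data" Forum, China Telecom Beijing Research Institute, 2016.4, Beijing
Exploration and Practice of Big Data Cleaning. Tianjin University, 2016.4, Tianjin
Manufactory Big Data Analysis. The International Conference on Computer Science and Technology (CST2016), 2016.1, Shenzhen
Greater, Faster, Better, and More Economical Query Processing on Big Graphs. 2nd Lighthouse (灯塔) Big Data Forum, 2015.12, Beijing
Big Data in Industry and Manufacturing. 2015 International Conference on Computer Science and Mechanical Automation (CSMA2015), 2015.10, Hangzhou
Database Research from the Perspective of Data Quality Research. Graduate tutorial lecture, 32nd National Database Conference, 2015.10, Chengdu
Big Data Management and Analysis: Techniques, Applications, and Progress. 航天三院, 2015.9, Beijing
Big Data Problem Solving: Algorithms and Systems. 58th CCF ADL, 2015.7, Beijing
Data Quality Management for Big Data. The 7th International Conference on Computational Intelligence and Software Engineering (CiSE 2015), 2015.5, Beijing
Big Data Cleaning: Challenges and Solutions. The 2015 International Conference on Network and Information Systems for Computers (ICNISC2015), 2015.1, Wuhan
Bigdata Cleaning based on Crowdsourcing. CIKM 2014 Workshop on Interactive Mining for Big Data (ImBig), 2014.11, Shanghai
Algorithm Design and Analysis for Big Data. 2014 International Workshop and Summer School on "Biological Big Bytes", 2014.8, Harbin
Big Data Management: From Quantity to Quality. YOCSEF Harbin report session at Jilin University, 2014.6, Changchun
The Story of Big Data. Heilongjiang Province Training Course on Existing Housing Appraisal, 2014.5, Harbin
Subgraph Matching on Large Graphs. The Computer Science Department, North Texas University, Dallas, USA, September 2013
A Preliminary Study of Big Data Management Techniques Integrating Quantity and Quality. 12th National Doctoral Student Academic Conference, 2014.5
A Preliminary Study on Teaching the Algorithm Design and Analysis Course. 2014 Computer Course Series Workshop for Heilongjiang Universities, 2014.4
Genomix: Genome Assembly with Hyracks. UCI ISG Seminar, Irvine, 2013.2
A Preliminary Study of Sub-pattern Matching Techniques on Massive Graph Data. 3rd CCF Outstanding Doctoral Dissertation Awardees Forum, Dalian, 2012.7
Data Management from Quantity to Quality: An Introduction to Data Quality Management. Computer Research Progress Report, CCF Outstanding Doctoral Dissertation Forum, Shanghai, 2011.7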


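Academic Service
Senior Member, China Computer Federation (CCF)

Member, ACM Data Science Discipline Standards Writing Group

2014-2015, Chair, YOCSEF Harbin

2012-2013, Vice Chair, YOCSEF Harbin

2011-2015, Academic Committee (AC) Member, YOCSEF Harbin

2017-present, Vice Director, Heilongjiang Province Database Technical Committee

2017-present, Honorary Member, YOCSEF Harbin

2018-present, Vice Chair, CCF Harbin Chapter

2016-2017, Secretary General, CCF Harbin Chapter

2016-present, Secretary General, SIGMOD China Chapter

2018-present, Vice Director, Expert Committee of the National University Artificial Intelligence and Big Data Innovation Alliance

Standing Committee Member, CCF Technical Committee on Databases

Member, CCF Technical Committee on Computer Applications

Member, CCF Big Data Expert Committee

Vice Director, Steering Committee of the CCF Harbin Institute of Technology Student Chapter

Advisor, HIT Microsoft Club

Advisor, HIT Database Club

Advisor, HIT Data Intelligence Club

Advisor, HIT EMC Club

International Conference Organizer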

ADMA 2007 Registration Chair

WAIM 2012 Workshop Chair

WAIM 2014 Demo Chair

WASA 2014 Local Organization Chair

ICYCSEE 2015 General Chair

BDQM 2016 PC Chair

Editorial Board

International Journal of Data Science

Symbiosis Center for Information Technology Journal

International Journal of Informatics Researches

Journal of Computer Sciences

PC Member

ACM International Conference on Information and Knowledge Management (CIKM), 2011,2017-2018

International Conference on Conceptual Modeling (ER), 2018,2019
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2016, 2018,2019
International Conference on Database Systems for Advanced Applications(DASFAA) 2015-2016,2019
National Database Conference (NDBC), 2009-2017

China National Computer Congress (CNCC) 2014

CCF Big Data Conference (CCF BigData), 2015-2017

National Conference on Computer Applications, 2015-2017

International Conference on Web-Age Information Management (WAIM), 2011-2017

Asia-Pacific Web Conference (APWeb), 2011-2016

Advanced Data Mining and Applications (ADMA) 2011-2014, 2016
International Conference on Service Science (ICSS) 2015

Asia-Pacific Services Computing Conference(APSCC) 2014-2016
the 10th IEEE International Conference on Big Data Science and Engineering (IEEE BigDataSE-16)

Journal Reviewer

The VLDB Journal

IEEE Transactions on Knowledge and Data Engineering

Data and Knowledge Engineering

Knowledge and Information Systems

Information Sciences

International Journal of Computer Systems Science and Engineering

World Wide Web Journal

Knowledge-based Systems

Journal of Computer Science and Technology

The Computer Journal

Journal of Systems and Software

Journal of Parallel and Distributed Computing

SCIENCE CHINA Information Sciences

Computers and Electrical Engineering

China Communications

Distributed and Parallel Databases

Expert Systems With Applications

Concurrency and Computation: Practice and Experience

Frontiers of Computer Science

TELKOMNIKA Indonesian Journal of Electrical Engineering

Mathematical Problems in Engineering

Physical Sciences Research International

Journal of Computing and Information Technology

Information Fusion
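
Journal of Zhejiang University (浙江大学学报)

Journal of Tsinghua University (清华大学学报)

Journal of Software (软件学报)

Chinese Journal of Computers (计算机学报)

Acta Automatica Sinica (自动化学报)

Journal of University of Electronic Science and Technology of China (电子科技大学学报)

Journal of Nanjing University of Science and Technology (南京理工大学学报)

Journal of Northeastern University (东北大学学报)

Journal of Chinese Information Processing (中文信息学报)

Journal of Northwestern Polytechnical University (西北工业大学学报)

Journal of Huazhong University of Science and Technology (华中科技大学学报)

Organizing Committee Member, 1st China Wireless Sensor Network Conference (CWSN07)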

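Personal News

News title: Elected Honorary Member of CCF YOCSEF Harbin

Published: 2017-01-09

Elected Honorary Member of CCF YOCSEF Harbin at the AC meeting on January 8, 2017.


News title: Elected Vice Director of the Heilongjiang Province Database Technical Committee

Published: 2017-03-20

Elected Vice Director of the Heilongjiang Province Database Technical Committee at the Heilongjiang technical committee meeting on March 19, 2017.


News title: Three supervised teams received support from the "Undergraduate Innovation and Entrepreneurship Training Program"

Published: 2017-5-23

"Research on Extending Large-Scale Knowledge Graphs" and "Adaptive Parameter Tuning for MapReduce-Based Query Tasks" received national-level funding; "Personal Behavior Analysis Based on MOOC Big Data" received provincial-level funding. Congratulations!


News title: Congratulations to 王春楠

Published: 2017-7-1

1. Graduation thesis selected as a Top-100 Outstanding Undergraduate Thesis (百优)

2. First Prize, Outstanding Project, HIT 2017 Undergraduate Innovation and Entrepreneurship Training Program

Warm congratulations!


News title: Congratulations to 齐志鑫, 孙铭, and 魏延杰 on receiving the National Scholarship

Published: 2017-11-17

Congratulations!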



Hongzhi Wang

Professor, Department of Computer Science and Technology, Harbin Institute of Technology.

My research interests include big data management, data quality, graph data management, and web data management. I have published more than 100 papers. I have been PI of more than 10 projects, including three NSFC projects, and co-PI of 973, 863, and NSFC key projects. I was awarded the Microsoft Fellowship, the Chinese Excellent Database Engineer title, and the IBM Ph.D. Fellowship. My Ph.D. thesis was selected as a CCF Outstanding Ph.D. Thesis.

Contact
Hongzhi Wang

Phone:86-**-810
FAX:86-**
E-mail:wangzh [AT] hit.edu.cn
Postal Code:150001
Address:P.O.Box 750, Harbin Institute of Technology


Working Experiences
Time: Position
2015-now: Harbin Institute of Technology, Professor
2013-now: Harbin Institute of Technology, Ph.D. supervisor
2010-2015: Harbin Institute of Technology, Associate Professor
2012.9-2013.9: UC Irvine, Visiting Scholar
2011.4-2011.10: MSRA "Starring Track", Visiting Professor
2008-2010: Harbin Institute of Technology, Assistant Professor
2006.10-2007.4: National University of Singapore, Intern
2004.5-2005.4: University of New South Wales, Visiting Research Associate


Education Background
2003 – 2008, Ph.D., Computer Science, Harbin Institute of Technology, Advisor: Professor Jianzhong Li
2001 – 2003, M.S., Computer Science, Harbin Institute of Technology, Advisor: Professor Jianzhong Li
1997 – 2001, B.S., Computer Science, Harbin Institute of Technology


Awards
2009, CCF Outstanding Ph.D. Thesis

2009, Outstanding Ph.D. Thesis of Harbin Institute of Technology

2007, IBM Ph.D. Fellowship

2006, Best Paper of WISE

2006, Chinese Excellent Database Engineer

2005, Microsoft Fellowship

Professional Services
Senior Member, CCF
2014-2015, Chair, CCF YOCSEF Harbin
2012-2013, Vice Chair, CCF YOCSEF Harbin
Supervisor, HIT-MSClub
ADMA 2007 Registration Chair
WAIM 2012 Workshop Chair
WAIM 2014 Demo Chair
WASA 2014 Local Organization Chair
ICYCSEE 2015 General Chair

Editorial Board
International Journal of Data Science
Symbiosis Center for Information Technology Journal
International Journal of Informatics Researches
Journal of Computer Sciences

PC Member
ACM International Conference on Information and Knowledge Management (CIKM), 2011
National Database Conference (NDBC), 2009-2014
International Conference on Frontier of Computer Science and Technology (FCST), 2010-2012
International Conference on Web-Age Information Management (WAIM), 2011-2014
Asia-Pacific Web Conference (APWeb), 2011-2014
Advanced Data Mining and Applications (ADMA) 2011-2014
International Conference on Information, Process, and Knowledge Management (eKNOW) 2014
International Conference on Information and Knowledge Management (ICIKM), 2013
International Workshop on Data Management for Emerging Network Infrastructures (Damen), 2011
International Workshop on XML Data Management (XML-DM), 2010-2012
International Workshop on Location-Based Social Networks (LBSN), 2013
International Workshop on Management of Spatial Temporal Data (MSTD), 2013
International Conference on Computer Science and Mechanical Engineering (CSME) 2013

Referee
IEEE Transactions on Knowledge and Data Engineering
Data and Knowledge Engineering
Knowledge and Information Systems
Information Sciences
International Journal of Computer Systems Science and Engineering
World Wide Web Journal
Knowledge-based Systems
Journal of Computer Science and Technology
The Computer Journal
Journal of Systems and Software
Computers and Electrical Engineering
China Communications
Distributed and Parallel Databases
Expert Systems With Applications
Concurrency and Computation: Practice and Experience
Frontiers of Computer Science
TELKOMNIKA Indonesian Journal of Electrical Engineering
Mathematical Problems in Engineering
Physical Sciences Research International


Research Interests
Big Data
Big Data Quality
Big Data Management
Big Data Learning
Big Data Analysis and Mining


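Admissions
Ph.D. admissions:
Ph.D. students are recruited in big data, data management, and related areas. Students with enthusiasm and academic interest are welcome to apply.

Master's admissions:
Master's students are recruited in big data, data management, and related areas. Enthusiastic students are welcome to apply.

Undergraduates interested in research and development in big data, data management, and related fields are also welcome to join!

Interested students, please contact wangzh@hit.edu.cn by email.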

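Graduates
Ph.D.

Class of 2018: Amina Belhassena (Algeria development satellite)

Master's

Class of 2014: 冯华宾 (Baidu)

Class of 2015: 叶晨 (Ph.D. study at this center; gold-medal thesis)

Class of 2016: 李佳宁 (昆仑智汇科技), 孔欣欣 (Ping An Securities)

Class of 2017: 王鹤澎 (Baidu), 韩姗珊 (China Merchants Bank), 林一鸣 (UCI)

Class of 2018: 齐志鑫, 孙铭, 宋扬, 魏延杰, 尹薇

Master of Engineering

Class of 2012: 丛大勇, 王谦益

Class of 2013: 李永金, 张宏哲, 刘学良, 郭畅宇

Class of 2015: 王文娟 (Longjiang Bank headquarters), 刘峥宇 (ZTE)

Class of 2016: 孙方媛 (Baidu; outstanding thesis), 马妍娇 (JD.com), 王思澄 (Baidu), 赵冀磊, 曹贞兴 (a unit of the Chinese People's Liberation Army)

Class of 2017: 苏钰 (H3C), 甘小楚, 徐琳

Class of 2018: 刘哲敏, 李飞, 黄炜, 顾威, 王星, 王宁, 孙宇, 李夏南

Undergraduates

Class of 2009: 李默涵 (graduate study at this center), 姜国华 (graduate study at this center; now at Baidu), 张航 (National University of Defense Technology), 李璐 (National University of Singapore (NUS))

Class of 2010: 汪清 (graduate study at this center; outstanding thesis), 边旭 (graduate study at this center), 王昉达 (National University of Singapore (NUS)), 纪鹏 (national defense student), 陈冬梁 (Renren), 朴东升

Class of 2011: 李亚坤 (graduate study at this center), 刘永楠 (graduate study at this center), 李飞 (University of Michigan; outstanding thesis), 姚立 (graduate study at the School of Software), 叶秀慧 (Politecnico di Torino), 刘超亚 (Founder)

Class of 2012: 张晓东 (graduate study at this center), 刘倩 (University of North Carolina), 王玥 (Pennsylvania State University (PSU)), 沈文博 (Hikvision), 张均健 (颗豆互动), 徐妍妍 (co-advised; The Chinese University of Hong Kong)

Class of 2013: 叶晨 (graduate study at this center), 朱乾坤 (The Chinese University of Hong Kong), 胡越 (Carnegie Mellon University (CMU)), 张佳程 (graduate study at Tsinghua University), 夏坤贤

Class of 2014: 张美范 (graduate study at this center), 张丹 (UMASS), 郑凯平 (National University of Singapore (NUS)), 过云燕 (graduate study at this center), 曹贞兴 (stayed at HIT as staff), 陈宇晰 (University of California, Los Angeles (UCLA)), 郭锐 (阿普杜勒国王大学), 施奇奇 (Carnegie Mellon University (CMU))

Class of 2015: 丁小欧 (graduate study at this center; outstanding thesis), 熊风 (graduate study at this center), 林一鸣 (graduate study at this center), 李明达 (University of California, Los Angeles (UCLA)), 柴成亮 (graduate study at Tsinghua University), 唐家声 (Alibaba Research Institute; outstanding thesis), 毛运东 (Sohu), 王欣宇 (Baidu), 姜川, 张笑影 (爱数软件有限公司)

Class of 2016: 宋扬 (graduate study at this center), 魏延杰 (graduate study at this center), 李泽宇 (University of California, Los Angeles (UCLA); outstanding thesis), 王雅萱 (RICE; outstanding thesis), 袁炜东 (CMU), 刘泽明, 张名驰 (Harbin Unicom Software Research Institute)

Class of 2017: 王春楠 (graduate study at this center; outstanding graduate), 苏学斌, 李天宝 (University of Toronto), 王潇雨, 袁芳怡 (graduate study at this center), 汪歆城, 苏佳轩 (graduate study at this center), 唐怡雯 (USC), 石若曦 (University of Alberta), 李斯泽 (graduate study at this center)

Class of 2018: 万晓珑 (graduate study at this center), 耿飞 (graduate study at this center), 李子珏 (graduate study at this center), 魏龑 (graduate study at this center), 邹开发 (graduate study at this center)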

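Courses Taught
Big Data Algorithms: Summer 2014, Summer 2015

Algorithm Design and Analysis (undergraduate): Spring 2007, Spring 2008, Spring 2009, Spring 2010, Fall 2010, Spring 2011, Fall 2011, Spring 2012, Fall 2013, Spring 2014, Fall 2014, Spring 2015, Fall 2015, Spring 2016, Fall 2016, Spring 2017
Big Data Management and Analysis (graduate): Spring 2015-2019

Algorithm Design and Analysis (graduate): Spring 2010, Fall 2010, Spring 2012

Advanced Database Systems: Spring 2014

Compiler Principles: Fall 2006, Fall 2008, Fall 2009, Fall 2010, Fall 2011

Software Design and Practice I: Spring 2012, Spring 2014, Spring 2015, Spring 2016

Web Services: Fall 2006, Fall 2008

Mainstream Databases: Fall 2005

Mathematical Modeling: Fall 2005, Spring 2006

Web Technology: Fall 2003, Fall 2013

Distributed Systems: Fall 2003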

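Supervised Student Science and Technology Innovation Projects
Project | Year | Participants | Funding category
Machine Learning Theory and Key Techniques for Massive Heterogeneous Data | 2015 | 张翔熙, 李沅泽, 唐梦研, 高琦琦 | 2015 Tencent Undergraduate Innovation Practice Project
Construction and Application of a Knowledge Base for the Healthcare Domain | 2015 | 杨志飞, 白广通, 苏佳轩, 王重然, 龚恒 | 2015 Tencent Undergraduate Innovation Practice Project
Data Analysis and Mining Based on Sina Weibo | 2013 | 魏延杰, 李言路, 朱光亚, 冀文欣, 赵怀鹏 | University level
Crowdsourcing-Based Data Cleaning System | 2013 | 李可利, 陈潜, 袁建华, 宋江夺, 袁炜东 | 2014 Tencent Undergraduate Innovation Practice Project
An Efficient, Integrated Parallel Cleaning System for Massive Data | 2013 | 李明达, 张红阳, 李昱晰, 杨丽霞 | National level
Data Market System Based on Quality Evaluation | 2013 | 丁小鸥, 张丹, 成烈南杰, 肖蕾 | University level
Entity-Based Social Network Data Management System | 2012 | 过云燕, 张玮奇, 王佩琪, 徐竟祎 | National level
Social Network Query and Mining Techniques Based on Massive Graph Data Management | 2012 | 李开宇, 孙长滨, 刘燊, 胡冲 | National level
Internet Product Information Extraction, Analysis, and Retrieval System | 2011 | 周小田, 郭翔宇, 胡筱, 董志鑫 | University level
Graph-Based Entity Resolution for Chinese Datasets | 2010 | 丁宇 | University level
Classification Techniques for Complex Categories on Massive E-commerce Data | 2010 | 刘倩, 张晓东, 付建宇 | University level
High-Availability Electronic Marketplace | 2009 | 刘倩, 张晓东, 付建宇 | University level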


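Awards for Supervised Science and Technology Innovation

Project: Internet Product Information Extraction, Analysis, and Retrieval System
Year: 2012
Participants: 周小田, 郭翔宇, 胡筱, 董志鑫
Award: First Prize, Outstanding Project, Undergraduate Innovative Experiment Program

Project: E-commerce Information Retrieval and Integration System on a Windows Cluster (Parallelization of Entity-Based Product Retrieval Data)
Year: 2012
Participants: 张晓东, 陈敏, 陈懿诚
Award: Second place, Microsoft HPC Campus Programming Contest

Project: Entity Resolution Techniques Supporting E-commerce and Their Applications
Year: 2011
Participants: 刘倩, 张晓东, 李飞, 王玥
Award: University Undergraduate Science and Technology Innovation Outstanding Achievement Award

Project: Classification Techniques for Complex Categories on Massive E-commerce Data
Year: 2011
Participants: 刘倩, 张晓东, 付建宇
Award: First Prize, Outstanding Project, Undergraduate Innovative Experiment Program

Project: Entity Resolution and Retrieval Algorithms for Massive Data on a Windows Cluster
Year: 2011
Participants: 李亚坤, 刘永楠, 刘超亚
Award: Winner's Prize (优胜奖), Microsoft HPC Campus Programming Contest

Project: High-Availability Electronic Marketplace
Year: 2010
Participants: 刘倩, 张晓东, 付建宇
Award: Third Prize, Outstanding Project, Undergraduate Innovative Experiment Program

Project: Literature Retrieval System Based on pureXML
Year: 2009
Participants: 黎玲利, 孟啸, 高静
Award: Third place, "Finding pureXML Applications" contest

Project: An Efficient, Integrated Parallel Cleaning System for Massive Data
Year: 2014
Participants: 李明达, 张红阳, 杨丽霞, 李昱昕
Award: First Prize, Outstanding Project, HIT 2014 Undergraduate Innovation and Entrepreneurship Training Program

Project: Entity-Based Social Network Data Management System
Year: 2014
Participants: 过云燕, 张玮奇, 王佩琪, 徐竟祎
Award: Second Prize, Outstanding Project, HIT 2014 Undergraduate Innovation and Entrepreneurship Training Program

Project: Data Market System Based on Quality Evaluation
Year: 2015
Participants: 丁小欧, 成烈南杰, 张丹, 肖蕾
Award: First Prize, Outstanding Project, Harbin Institute of Technology Undergraduate Innovation and Entrepreneurship Training Program

Project: Theory and Techniques of Massive Data Computing
Year: 2011
Participants: 李建中, 樊文飞, 高宏, 王宏志
Award: First Prize, Heilongjiang Province Natural Science Award

Project: Research on XML Data Query Processing Techniques
Year: 2009
Participants: 王宏志
Award: CCF Outstanding Doctoral Dissertation

Project: Research on XML Data Query Processing Techniques
Year: 2009
Participants: 王宏志
Award: Harbin Institute of Technology Outstanding Doctoral Dissertation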



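Supervised First-Year Yearly Projects
Project | Year | Participants | Result
Remote Control of a Computer from a Smartphone | 2013 | 吴菲豪, 安然, 张智, 陈潜 | First Prize
Android Game | 2013 | 王浩宇, 王一舒, 李昊洋, 曲行, 余卓勋 | Second Prize
Taobao Price Comparison Assistant | 2013 | 孙冠雄, 魏鸿焱, 曾晓文 | Passed
Development of a C Language Mistake Collection | 2013 | 陶天一, 燕珂, 张晔, 金亮, 刘晓光 | Passed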



Deadline of Related Conferences&Special Issues
Conference | Deadline
ICDM 2019 | Full paper submissions: June 5, 2019; Demo and tutorial proposals: July 15, 2019
ICDE 2019 | Abstract submission due: June 8, 2019; Submission due: June 15, 2019
SIGMOD 2019 | Abstract submission: July 9, 2019; Paper submission: July 16, 2019



Books & Chapters
Hongzhi Wang (author). Principles and Practice of Big Data Analysis (大数据分析原理与实践). China Machine Press, 2017.4. (In Chinese)

Hongzhi Wang, 林可 (authors). Learning Big Data Algorithms from Scratch (零基础学习大数据算法). Publishing House of Electronics Industry, 2016.7. (In Chinese)

Hongzhi Wang. Chapter 2: Data Collection and Governance (数据采集与治理). In Introduction to Big Data (大数据导论), China Machine Press, 2018.

Hongzhi Wang, 李春静. Hadoop Cluster Programming and Development (Hadoop集群程序设计与开发). Posts & Telecom Press, 2018.

高辉, 李东升, Hongzhi Wang (translators). Development and Implementation of High-Performance Distributed Computing Systems: Based on Hadoop, Scalding, and Spark (高性能分布式计算系统开发与实现:基于Hadoop、Scalding和Spark). China Machine Press, 2018. (In Chinese)

黎玲利, 尹丹, 李默涵, Hongzhi Wang (translators). Recommender Systems: Principles and Practice (推荐系统:原理与实践). China Machine Press, 2018. (In Chinese)

Hongzhi Wang, 黎玲利 (translators). Foundations of Algorithms (算法基础). China Machine Press, 2017. (In Chinese)

Hongzhi Wang (translator). Foundations of Algorithms: Opening the Door to Algorithms (算法基础:打开算法之门). China Machine Press, 2016.1. (In Chinese)

Hongzhi Wang (author). Big Data Algorithms (大数据算法). China Machine Press, 2015.7. (In Chinese)

Hongzhi Wang. Innovative Techniques and Applications of Entity Resolution. IGI Global, 2014.

Hongzhi Wang, Jianzhong Li, Jinbao Wang, Hong Gao: Dirty Data Management in Cloud Database. Grid and Cloud Database Management 2011: 133-150

Hongzhi Wang, Jianzhong Li, Hong Gao. Labelling-Scheme-Based Subgraph Query Processing on Graph Data. Graph Data Management: Techniques and Applications 2011: 142-174.

Hongzhi Wang, Jianzhong Li, Fei Li. Efficient Identification of Similar XML Fragments Based on Tree Edit Distance. XML Data Mining: Models, Methods, and Applications, 2011, 78-97.

Hongzhi Wang (author). XML Data Query Processing Techniques (XML数据查询处理技术). Tianjin Education Press, 2010. (In Chinese)

殷建平, 徐云, 王刚, 刘晓光, 苏明, 邹恒明, Hongzhi Wang (translators). Introduction to Algorithms (算法导论). China Machine Press, 2012.12. (In Chinese)

Zhipeng Cai, Chaokun Wang, Siyao Cheng, Hongzhi Wang, Hong Gao (Eds.): Wireless Algorithms, Systems, and Applications - 9th International Conference, WASA 2014, Harbin, China, June 23-25, 2014. Proceedings. Lecture Notes in Computer Science 8491, Springer 2014, ISBN 978-3-319-07781-9

Hongzhi Wang, Haoliang Qi, Wanxiang Che, Zhaowen Qiu, Leilei Kong, Zhongyuan Han, Junyu Lin, Zeguang Lu (Eds.). Intelligent Computation in Big Data Era - International Conference of Young Computer Scientists, Engineers and Educators, ICYCSEE 2015, Harbin, China, January 10-12, 2015. Proceedings, Communications in Computer and Information Science, Volume 503 2015, ISBN: 978-3-662-46247-8 (Print) 978-3-662-46248-5 (Online).

Wanxiang Che, Qilong Han, Hongzhi Wang, Weipeng Jing, Shaoliang Peng, Junyu Lin, Guanglu Sun, Xianhua Song, Hongtao Song, Zeguang Lu. Social Computing - Second International Conference of Young Computer Scientists, Engineers and Educators, ICYCSEE 2016, Harbin, China, August 20-22, 2016, Proceedings, Part I, Part II.

Beiji Zou, Min Li, Hongzhi Wang, Xianhua Song, Wei Xie, Zeguang Lu: Data Science - Third International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2017, Changsha, China, September 22-24, 2017, Proceedings, Part I. Communications in Computer and Information Science 727, Springer 2017, ISBN 978-981-10-6384-8


International Journals
Hongzhi Wang, Xiaoou Ding, Jianzhong Li, Hong Gao:Rule-Based Entity Resolution on Database with Hidden Temporal Information. IEEE Trans. Knowl. Data Eng. 30(11): 2199-2212 (2018)

Zeyu Li, Hongzhi Wang, Wei Shao, Jianzhong Li, Hong Gao. Repairing Data through Regular Expressions. PVLDB 9(5),2015

Hongzhi Wang, Jianzhong Li, Jizhou Luo, Hong Gao. Hash-base subgraph query processing method for graph-structured XML documents. PVLDB 1(1): 478-489 (2008).

Zhao Sun, Hongzhi Wang, Haixun Wang, Bin Shao, Jianzhong Li. Efficient Subgraph Matching on Billion Node Graphs. PVLDB 5(5), 2012

Wenfei Fan, Jianzhong Li, Shuai Ma, Hongzhi Wang, Yinghui Wu. Graph Homomorphism Revisited for Matching Web Sites. PVLDB 3, 2010

Yiming Lin, Hongzhi Wang, Jianzhong Li, Hong Gao: Data source selection for information integration in big data era. Inf. Sci. 479: 197-213 (2019)

Yan Zhang, Hongzhi Wang, Long Yang, Jianzhong Li: Efficient histogram-based range query estimation for dirty data. Frontiers Comput. Sci. 12(5): 984-999 (2018)

Hiba Abu Ahmad, Hongzhi Wang: An effective weighted rule-based method for entity resolution. Distributed and Parallel Databases 36(3): 593-612 (2018)

Hongzhi Wang, Feng Xiong, Jianing Li, Shengfei Shi, Jianzhong Li, Hong Gao:Data management on new processors: A survey. Parallel Computing 72: 1-13 (2018)

Junxiong Wang, Hongzhi Wang, Chenxu Zhao, Jianzhong Li, Hong Gao:Iteration acceleration for distributed learning systems. Parallel Computing 72: 29-41 (2018)

Hongzhi Wang, Ning Li, Jianzhong Li, Hong Gao:Parallel algorithms for flexible pattern matching on big graphs. Inf. Sci. 436-437: 418-440 (2018)

Zhixin Qi, Hongzhi Wang, Jianzhong Li, Hong Gao: FROG: Inference from knowledge base for missing value imputation. Knowl.-Based Syst. 145: 77-90 (2018)

Zemin Chao, Shengfei Shi, Hong Gao, Jizhou Luo, Hongzhi Wang: A gray-box performance model for Apache Spark. Future Generation Comp. Syst. 89: 58-67 (2018)

Jizhou Luo, Shengfei Shi, Guang Yang, Hong-Zhi Wang, Jian-Zhong Li: O2iJoin: An Efficient Index-Based Algorithm for Overlap Interval Join. J. Comput. Sci. Technol. 33(5): 1023-1038 (2018)

Hongzhi Wang, Shengjun Yin, Ming Sun, Y. E. Wang, Hepeng Wang, Jianzhong Li, Hong Gao: Efficient Computation of Skyline Queries on Incomplete Dynamic Data. IEEE Access 6: 52741-52753 (2018).

Jizhou Luo, Shengfei Shi, Hongzhi Wang, Jianzhong Li:FrepJoin: an efficient partition-based algorithm for edit similarity join. Frontiers of IT & EE 18(10): 1499-1510 (2017)

Hongzhi Wang, Zhixin Qi, Ruoxi Shi, Jian-Zhong Li, Hong Gao: COSSET+: Crowdsourced Missing Value Imputation Optimized by Knowledge Base. J. Comput. Sci. Technol. 32(5): 845-857 (2017)

Xue-Li Liu, Hong-Zhi Wang, Jian-Zhong Li, Hong Gao: EntityManager: Managing Dirty Data Based on Entity Resolution. J. Comput. Sci. Technol. 32(3): 644-662 (2017)

Boya Ren, Hongzhi Wang, Jianzhong Li, Hong Gao: Life-long learning based on dynamic combination model. Appl. Soft Comput. 56: 398-404 (2017)

Xiaoou Ding, Hongzhi Wang, Yitong Gao, Jianzhong Li, Hong Gao. Efficient Currency Determination Algorithms for Dynamic Data. TSINGHUA SCIENCE AND TECHNOLOGY, 22(3), 227-242, 2017.

Fei Li, Hongzhi Wang, Guowen Zhou, Daren Yu, Jianzhong Li, Hong Gao. Anomaly Detection in Gas Turbine Fuel Systems Using a Sequential Symbolic Method. Energies 2017, 10, 724; doi:10.3390/en**.

Hongzhi Wang, Amina Belhassena: Parallel trajectory search based on distributed index. Inf. Sci. 388: 62-83 (2017)

Kaiping Zheng, Hongzhi Wang, Zhixin Qi, Jianzhong Li, Hong Gao. A survey of query result diversification. Knowl. Inf. Syst. 51(1): 1-36 (2017)

Yiming Lin, Hongzhi Wang, Shuo Zhang, Jianzhong Li, Hong Gao. Efficient quality-driven source selection from massive data sources. Journal of Systems and Software. Volume 118, August 2016, Pages 221–233.

Hongzhi Wang, Mingda Li, Yingyi Bu, Jianzhong Li, Hong Gao, Jiacheng Zhang. Cleanix: a Parallel Big Data Cleaning System. Sigmod Record 44(4), 2015.

Jianing Li, Hongzhi Wang, Jianzhong Li, Hong Gao. Skyline for Geo-Textual Data. Geoinformatica. 20(3): 453-469 (2016).

Rui Guo, Hongzhi Wang, Mengwen Chen, Jianzhong Li, Hong Gao:Parallelizing the extraction of fresh information from online social networks. Future Generation Comp. Syst. 59: 33-46 (2016)

Huan Hu, Hongzhi Wang, Jianzhong Li, Hong Gao. An efficient pruning strategy for approximate string matching over suffix tree. Knowledge and Information System. 49(1): 121-141 (2016)

Yue Wang, Hongzhi Wang, Jianzhong Li, Hong Gao:Efficient graph similarity join for information integration on graphs. Frontiers of Computer Science 10(2): 317-329 (2016)

Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Entity Resolution based on Subgraph Cohesion. Knowledge and Information Systems.Volume 46, Issue 2, pp 285-314

Yue Wang, Hongzhi Wang, Liyan Zhang, Yang Wang, Jianzhong Li, Hong Gao. Extend Tree Edit Distance for Effective Object Identification. Knowledge and Information Systems. 46(3): 629-656 (2016).

Yan Zhang, Hongzhi Wang, Hong Gao, Jianzhong Li. Efficient accuracy evaluation for multi-modal sensed data. Journal of Combinatorial Optimization. 32(4): 1068-1088 (2016).

Yan Zhang, Hongzhi Wang, Zhongsheng Yang, Jianzhong Li. Relative Accuracy Evaluation. PLoS ONE 9(8): e103853. doi:10.1371/journal.pone.**.

Xintong Guo, Hongzhi Wang, Yangqiu Song, Gao Hong: Brief survey of crowdsourcing for data mining. Expert Syst. Appl. 41(17): 7987-7994 (2014)

Fei Li, Hongzhi Wang, Jianzhong Li, Hong Gao. Approximate Joins for XML at Label Level. Information Sciences 282 (2014) 237–249

Fangda Wang, Hongzhi Wang, Jianzhong Li, Hong Gao. Graph-based Reference Table Construction to Facilitate Entity Matching. Journal of Systems and Software,Volume 86, Issue 6, 2013, 1679–1688.

Fei Li, Hongzhi Wang, Jianzhong Li, Hong Gao. A Survey on Tree Edit Distance Lower Bound Estimation Techniques for Similarity Join on XML Data. Sigmod Record 42(4), 2013

Yakun Li,Hongzhi Wang,Hong Gao,Jianzhong Li. An Efficient Entity Resolution Method for Large Relations. International Journal of Cooperative Information Systems, Vol. 22, No. 1 (2013), 1–17.

Yakun Li, Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Community Detection with Additive Constrains on Large Networks. Knowledge-Based Systems, 52 (2013) 268-278.

Yue Wang, Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Subgraph Join based on Connectivity Similarity. World Wide Web Journal. World Wide Web 18(4): 871-887 (2015).

Hongzhi Wang, Jianzhong Li, Wei Wang, Xuemin Lin. Coding-based Join Algorithms for Structural Queries on Graph-structured XML Document. World Wide Web Journal 11(4): 153-168 (2008).

Hongqiang Wang, Jianzhong Li, Hongzhi Wang. Clustered Chain Path Index for XML Document: Efficiently Processing Branch Queries. World Wide Web 11(1): 153-168 (2008).

Hongzhi Wang, Jianzhong Li, Jizhou Luo. Data Sources Selection for XML Data Sources. International Journal of Intelligent Information and Database Systems Vol. 2 422-445 (2008).

Jianzhong Qi,Rui Zhang,Kotagiri Ramamohanarao,Hongzhi Wang, Zeyi Wen,Dan Wu. Indexable online time series segmentation with error bound guarantee. World Wide Web 18(2): 359-401 (2015).

Hongzhi Wang, Jianzhong Li. GXQuery: Extending XQuery for Querying Graph-structured XML Data. Journal of Computing and Information Technology 19(2): 83-91 (2011).

Hongzhi Wang, Jianzhong Li, Shuguang Xiong. Efficient join algorithms for distributed information integration based on XML. Int. J. Business Process Integration and Management, Vol. 3, No. 4, pp.271–281.

WANG Hong-zhi, LIU Yong-Qiang, LIU Zhan-yi, SHANG Shou-ting. Mathematical model for detection ground water pollutant. Journal of Harbin Institute of Technology 7(4),2000.

International Conferences
Jinglin Peng, Hongzhi Wang, Jianzhong Li, Hong Gao. Set-based Similarity Search for Time Series. SIGMOD 2016

Wei Wang, Hongzhi Wang, Hongjun Lu, Haifeng Jiang, Xuemin Lin, Jianzhong Li. Efficient Processing of XML Path Queries Using the Disk-based F&B Index. VLDB 2005: 145-156.

Hongzhi Wang, Xiaodong Zhang, Jianzhong Li, Hong Gao: ProductSeeker: entity-based product retrieval for e-commerce. SIGIR 2013: 1085-1086

Hongzhi Wang, Xiaoou Ding, Xiangying Chen, Jianzhong Li, Hong Gao:CleanCloud: Cleaning Big Data on Cloud. CIKM 2017: 2543-2546

Hongzhi Wang, Mingda Li, Yingyi Bu, Jianzhong Li, Hong Gao, Jiacheng Zhang: Cleanix: A Big Data Cleaning Parfait. CIKM 2014: 2024-2026

Lingli Li, Jianzhong Li, Hongzhi Wang, Hong Gao: Context-based entity description rule for entity resolution. CIKM 2011: 1725-1730

Ruoxi Shi, Hongzhi Wang, Tao Wang, Yutai Hou, Yiwen Tang, Jianzhong Li, Hong Gao:Similarity Search Combining Query Relaxation and Diversification. DASFAA (2) 2017: 65-84.

Meifan Zhang, Hongzhi Wang, Jianzhong Li, Hong Gao. One-pass Inconsistency Detection Algorithms for Big Data. DASFAA 2016.

Chen Ye, Hongzhi Wang, Jianzhong Li, Hong Gao, Siyao Cheng. Crowdsourcing-enhanced Missing Values Imputation based on Bayesian Network. DASFAA 2016.

Yaxuan Wang, Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Influence Maximization in Weighted Independent Cascade Model. DASFAA 2016.

Hongzhi Wang, Jianzhong Li, Xianmin Liu, Jizhou Luo. Query Optimization for Complex Path Queries on XML Data. DASFAA 2009: 389-404.

Lingli Li, Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Algorithms for Skyline Top-K Keyword Queries on XML Streams. DASFAA 2009: 283-287.

Rui Guo, Hongzhi Wang, Lucheng Zhong, Jianzhong Li, Hong Gao: Harbinger: An Analyzing and Predicting System for Online Social Network Users's Behavior. DASFAA (2) 2014: 531-534

Zhiyu Liang, Hongzhi Wang, Jianzhong Li, Hong Gao: IMOptimizer: An Online Interactive Parameter Optimization System Based on Big Data. DASFAA 2019 (demo): 581-584

Hongzhi Wang, Jianzhong Li, Ran Huo, Li Jia, Lian Jin, Xueying Men, Hui Xie. HITCleaner: A Light-weight Online Data Cleaning System. Proceedings of DASFAA 2013, 481-484. Demo.

Hongzhi Wang, Xueli Liu, Jianzhong Li, Xing Tong, Long Yang, Yakun Li. EntityManager: An Entity-based Dirty Data Management System. Proceedings of DASFAA 2013. 468-471. Demo.

Hongqiang Wang, Jianzhong Li, Hongzhi Wang. Clustered Chain Path Index for XML Document: Efficiently Processing Branch Queries. WISE 2006: 474-486. (Best Paper)

Xiaoou Ding, Hongzhi Wang, Jiaxuan Su, Aoran Xie, Jianzhong Li, Hong Gao: MobiDis: Relationship Discovery of Mobile Users from Spatial-Temporal Trajectories. ER Workshops 2018: 12-16

Dongsheng Li, Shengfei Shi, Yan Zhang, Hongzhi Wang, Jizhou Luo: An Anomaly Detection Method Based on Learning of "Scores Sequence". ICPCSEE (2) 2018: 296-311

Fanshan Meng, Tianbai Yue, Hongzhi Wang, Hong Gao, Yaping Li: SFSC: Segment Feature Sampling Classifier for Time Series Classification. ICPCSEE (1) 2018: 332-346

Haoran Zhang, Jianzhong Li, Hongzhi Wang: Statistical Learning-Based Prediction of Execution Time of Data-Intensive Program Under Hadoop2.0. ICPCSEE (1) 2018: 403-414

Tianyu Li, Shengfei Shi, Jizhou Luo, Hongzhi Wang: A Method to Identify Spark Important Parameters Based on Machine Learning. ICPCSEE (1) 2018: 525-538

Wei Yin, Tianbai Yue, Hongzhi Wang, Yanhao Huang, Yaping Li: Time Series Cleaning Under Variance Constraints. DASFAA Workshops 2018: 108-113

Fei Li, Guowen Zhou, Xingshuo Li, Linhai Zhu, Hongzhi Wang. A symbolic reasoning based anomaly detection for gas turbine subsystems. 2017 Prognostics and System Health Management Conference (PHM-Harbin)

Yanjie Wei, Hongzhi Wang, Shengfei Shi, Hong Gao, Jianzhong Li: Any-Time Methods for Time-Series Prediction with Missing Observations. BigData Congress 2017: 427-430

Hongzhi Wang, Hong Gao, Shenjun Yin, Jie Zhu:The design of course architecture for big data. ACM TUR-C 2017: 13:1-13:6

Xiaoou Ding, Hongzhi Wang, Yitong Gao, Jianzhong Li, Hong Gao:Determining the currency of dynamic data. ACM TUR-C 2017: 17:1-17:6

Amina Belhassena, Hongzhi Wang: Distributed skyline trajectory query processing. ACM TUR-C 2017: 19:1-19:7

Zhixin Qi, Hongzhi Wang, Fanshan Meng, Jianzhong Li, Hong Gao:Capture Missing Values with Inference on Knowledge Base. DASFAA Workshops 2017: 185-194

Yiwen Tang, Hongzhi Wang, Shiwei Zhang, Huijun Zhang, Ruoxi Shi:Efficient Web-Based Data Imputation with Graph Model. DASFAA Workshops 2017: 213-226

Ming Sun, Hongzhi Wang, Fanshan Meng, Jianzhong Li, Hong Gao:Incomplete Data Classification Based on Multiple Views. APWeb (2) 2016: 239-250

Shanshan Han, Hongzhi Wang, Hong Gao, Jianzhong Li, Shenbin Huang:Fuzzy Keywords Query. APWeb (2) 2016: 251-262

Ming Sun, Hongzhi Wang, Jianzhong Li, Hong Gao, Shenbin Huang:A Chronic Disease Analysis System Based on Dirty Data Mining. APWeb (2) 2016: 552-555

Jiahong Li, Hongzhi Wang, Shengqiang Zhang, Xiangyu Gao, Ziqi Qu, Shenbin Huang:An Alarming and Prediction System for Infections Disease Based on Combined Models. APWeb (2) 2016: 583-587

Xintong Guo, Hong Gao, Hongzhi Wang. Image Clustering Based on the Human Intelligence. 2015 International Conference on Intelligent Systems and Knowledge Engineering.

Kaiqi Zhang, Donghua Yang, Hong Gao, Jianzhong Li,Hongzhi Wang, Zhipeng Cai: VMPSP: Efficient Skyline Computation Using VMP-Based Space Partitioning. DASFAA Workshops 2016: 179-193

Yitong Gao, Yan Zhang, Hongzhi Wang, Jianzhong Li, Hong Gao: A Distributed Load Balance Algorithm of MapReduce for Data Quality Detection. DASFAA Workshops 2016: 294-306

Kaiqi Zhang, Hong Gao, Hongzhi Wang, Jianzhong Li: ISSA: Efficient Skyline Computation for Incomplete Data. DASFAA Workshops 2016: 321-328

MingLiang Yue, Hong Gao, Shengfei Shi, Hongzhi Wang: Join Query Processing in Data Quality Management. DASFAA Workshops 2016: 329-342

Yanzheng Wang, Hong Gao, Shengfei Shi, Hongzhi Wang: Similarity Search on Massive Data Based on FPGA. DASFAA Workshops 2016: 343-352

Chang Lu, Hongzhi Wang, Yan Zhang, Hong Gao. Euclidean-based Entity Resolution for Evolving Data. 2015 Fifth International Conference on Instrumentation and Measurement, Computer, Communication and Control.

Ke Lin, Siyao Cheng, Yingshu Li, Jianzhong Li, Hong Gao, Hongzhi Wang: SHMDRS: A Smartphone-Based Human Motion Detection and Response System. WASA 2016: 174-185

Yang Song, Hongzhi Wang, Jianzhong Li, Hong Gao. MapReduce for Big Data Analysis: Benefits, Limitations and Extensions. ICYCSEE 2016, Part I, CCIS 623, pp. 453–457, 2016.

Qian Liu, Hongzhi Wang, and Shaoying Song. HierarSearch: Enhancing Performance of Search Engines by Mining Semantic Relationships Among Results. ICYCSEE 2016, Part II, CCIS 624, pp. 201–205, 2016.

Wei Qu, Siyao Cheng, and Hongzhi Wang. Efficient File Accessing Techniques on Hadoop Distributed File Systems. ICYCSEE 2016, Part I, CCIS 623, pp. 350–361, 2016.

Jinglun Li, Shengfei Shi, and Hongzhi Wang. Optimization Analysis of Hadoop. ICYCSEE 2016, Part I, CCIS 623, pp. 520–532, 2016.

Ming Yan, Yan Zhang, Hongzhi Wang: Tree-Based Metric Learning for Distance Computation in Data Mining. APWeb 2015: 377-388

Xiaoou Ding, Hongzhi Wang, Dan Zhang, Jianzhong Li, Hong Gao: A Fair Data Market System with Data Quality Evaluation and Repairing Recommendation. APWeb 2015: 855-858

Yue Wang, Hongzhi Wang, Chen Ye, Hong Gao. Graph Similarity Join with K-Hop Tree Indexing. ICYCSEE 2015, Communications in Computer and Information Science Volume 503, 2015, pp 38-47.

Jie Pan, Hongzhi Wang, Hong Gao, Wenxuan Zhao, ongxing Huo,Huirong Dong. Paradise Pointer : A Sightseeing Scenes Images Search Engine Based on Big Data Processing. ICYCSEE 2015, Communications in Computer and Information Science Volume 503, 2015, pp 448-452

Qiqi Shi, Hongzhi Wang, Dong Li, Xinfei Shi, Chen Ye, Hong Gao. Maximal Influence Spread for Social Network Based on MapReduce. ICYCSEE 2015, Communications in Computer and Information Science Volume 503, 2015, pp 128-136

Chen Ye, Hongzhi Wang, Keli Li, Qian Chen, Jianhua Chen, Jiangduo Song, Weidong Yuan: CrowdCleaner: A Data Cleaning System Based on Crowdsourcing. APWeb 2014: 657-661

Guangze Liu, Hongzhi Wang, ChengHui Chen, Hong Gao: TruthOrRumor: Truth Judgment from Web. APWeb 2014: 674-678

Hang Zhang, Hongzhi Wang, Jianzhong Li, Hong Gao. Neighbor-base Similarity Matching for Graphs, CloudDB 2014, 191-198.

Ye Chen, Hongzhi Wang. Capture Missing Values based on Crowdsourcing. WASA 2014 Workshop.

Yan Zhang, Hongzhi Wang. Accuracy Evaluation for Sensed Data. WASA 2014

Mingda Li, Hongzhi Wang and Ye Li. Sectional and Conditional Functional Dependencies. WASA 2014 Workshop.

Chen Ye, Hongzhi Wang. Truth discovery based on Crowdsourcing. WAIM 2014.

Xiaojie Lin, Rui Zhang, Zeyi Wen, Hongzhi Wang, Jianzhong Qi: Efficient Subgraph Matching Using GPUs. ADC 2014: 74-85

Li Jia, Hongzhi Wang, Jianzhong Li, Hong Gao: Incremental Truth Discovery for Information from Multiple Data Sources. WAIM Workshops 2013: 56-66

Huabin Feng, Hongzhi Wang, Jianzhong Li, Hong Gao: Entity Resolution on Uncertain Relations. WAIM 2013: 77-86

Lian Jin, Hongzhi Wang, Hong Gao: Imputation for Categorical Attributes with Probabilistic Reasoning. WAIM 2013: 87-98

Rui Guo, Hongzhi Wang, Kaiyu Li, Jianzhong Li, Hong Gao: CUVIM: Extracting Fresh Information from Social Network. WAIM 2013: 351-362

Hui Xie, Hongzhi Wang, Jianzhong Li, Hong Gao: A Data Cleaning Framework Based on User Feedback. WAIM 2013: 514-520

Yan Zhang, Long Yang, Hongzhi Wang: Range Query Estimation for Dirty Data Management System. WAIM 2012: 152-164

Xing Tong, Hongzhi Wang: Fgram-Tree: An Index Structure Based on Feature Grams for String Approximate Search. WAIM 2012: 241-253

Xueli Liu, Hongzhi Wang: Dynamic Graph Shortest Path Algorithm. WAIM 2012: 296-307

Cong Wang, Hongzhi Wang: Graph-Structured Data Compression Based on Frequent Subgraph Contraction. GDMM 2012: 11-18.

Bingyi Qian, Hongzhi Wang, Jianzhong Li, Hong Gao: Path-Based XML Stream Compression with XPath Query Support. XML-DM 2012: 329-339.

Xu Bian, Hongzhi Wang, Hong Gao: Schema Mapping with Quality Assurance for Data Integration. XML-DM 2011: 472-483.

Liu Yongnan, Wang Hongzhi, Gao Hong. A Fast Entity Resolution Method based on Wave of Records. CECNet 2011.

Qian Liu, Hongzhi Wang, Hong Gao, Qi Lv, Jianyu Fu. A Recommendation Method in E-commerce based on product taxonomy graph. 13th IEEE Joint International Computer Science and Information Technology Conference(JICSIT 2011).

Qing Wang, Shouxu Jiang, Hongzhi Wang, Hong Gao. AIIS: An Efficient String Index on Inconsistent Data. 2011 international conference on Computer Science and Information Engineering (CSIE2011).

Yakun Li, Hongzhi Wang, Hong Gao. Efficient Entity Resolution based on Sequence Rules. 2011 international conference on Computer Science and Information Engineering (CSIE2011).

Yue Wang, Hongzhi Wang, Yang Wang, Hong Gao. Similarity Join on XML Based on k-Generation Set Distance. XML-DM 2011: 124-135.

Fei Li, Hongzhi Wang, Liang Hao, Jianzhong Li, Hong Gao: pq-Hash: An Efficient Method for Approximate XML Joins. XML-DM 2010: 125-134

Qing Wang, Hongzhi Wang, Hong Gao, Jianzhong Li: Compression Algorithms for Structural Query Results on XML Data. XML-DM 2010: 141-145

Fei Li, Hongzhi Wang, Cheng Zhang, Liang Hao, Jianzhong Li, Hong Gao: Approximate Joins for XML Using g-String. XSym 2010: 3-17

Hongzhi Wang, Wei Wang, Jianzhong Li, Xuemin Lin, Reymond Wong: Practical Indexing XML Document for Twig Query. ASIAN 2005: 208-222.

Hongzhi Wang, Jianzhong Li, Shuguang Xiong. Efficient Join Algorithms for Integrating XML Data in Grid Environment. GCC 2005: 547-553.

Hongzhi Wang, Wei Wang, Xuemin Lin, Jianzhong Li. Subgraph Join: Efficient Processing Subgraph Queries on Graph-Structured XML Document. WAIM 2005: 68-80.

Hongzhi Wang, Jianzhong Li, Zhenying He, Hong Gao. OLAP for XML Data. CIT 2005: 233-237.

Hongzhi Wang, Wei Wang, Xuemin Lin, Jianzhong Li. Labeling Scheme and Structural Joins for Graph-Structured XML Data. APWeb 2005: 277-289.

Hongzhi Wang, Jianzhong Li, Zhenying He. Optimized Query Translation Strategy for XML Stored in Relational Database. WAIM 2004: 378-388.

Hongzhi Wang, Jianzhong Li, Jizhou Luo, Zhenying He. XCpaqs: Compression of XML Document with XPath Query Support. ITCC (1) 2004: 354-357.

Hongzhi Wang, Jianzhong Li, Zhenying He, Hong Gao. Xaggregation: Flexible Aggregation of XML Data. WAIM 2003: 104-115.

Hongzhi Wang, Jianzhong Li, Zhenying He, Jizhou Luo. Web Information Integration Based on Compressed XML. DNIS 2003: 122-137.

Hongzhi Wang, Jianzhong Li, Zhenying He. INEXP: Information Exchange Protocol for Interoperability. ICADL 2002.

Hongzhi Wang, Jianzhong Li, Zhenying He. An Effective Wrapper Architecture to Heterogeneous Data Source. AINA 2003: 565-569.

Guohua Jiang, Hongzhi Wang, Shouxu Jiang, Jianzhong Li, Hong Gao: DCUBE: CUBE on Dirty Databases. WAIM 2010: 507-512.

Mohan Li, Hongzhi Wang, Jianzhong Li, Hong Gao: Efficient Duplicate Record Detection Based on Similarity Estimation. WAIM 2010: 595-607.

Lingli Li, Hongzhi Wang, Hong Gao, Jianzhong Li: EIF: A Framework of Effective Entity Identification. WAIM 2010: 717-728.

Lingli Li, Hongzhi Wang, Jianzhong Li, Jizhou Luo. Efficient Top-k Keyword Search on XML Streams. ICYCS 2008: 1041-1046.

Xianmin Liu, Jianzhong Li, Hongzhi Wang. SAM: An Efficient Algorithm for F&B-Index Construction. APWeb/WAIM 2007: 697-708.

Hongqiang Wang, Jianzhong Li, Hongzhi Wang. Clustered Absolute Path Index for XML Document: On Efficient Processing of Twig Queries. APWeb Workshops 2006: 1-10.

Xin Zhan, Jianzhong Li, Hongzhi Wang, Zhenying He. Caching Frequent XML Query Patterns. APWeb Workshops 2006: 68-75.

Jizhou Luo, Jianzhong Li, Hongzhi Wang, Yanqiu Zhang, Kai Zhao. The Compression of Massive Offline Relations. WAIM 2004: 634-639.

Jinghua Zhu, Jianzhong Li, Jizhou Luo, Wei Zhang, Hongzhi Wang. C-kNN Query Processing in Object Tracking Sensor Networks. WASA 2008: 432-443.

Chinese Journals
王宏志,骆吉洲,李建中. 图结构XML数据子图查询的高效处理算法. 软件学报. 2009,20(9): 2436-2449

王宏志,李建中,骆吉洲. XML数据流上的高效聚集算法. 软件学报. 2008,19(8): 2032-2042.

王宏志, 李建中, 高宏. 一种非清洁数据库的数据模型. 2012,软件学报, 23(3):539-549.

王鹤澎,王宏志,李建中,高宏.不一致数据上精确决策树生成算法.软件学报,2017,28(11):2814-2824.

丁小欧,王宏志,张笑影,李建中,高宏.数据质量多种性质的关联关系研究.软件学报,2016,27(7).

王鹤澎,王宏志,李佳宁,孔欣欣,李建中,高宏. 面向新型处理器的数据密集型计算. 软件学报doi: 10.13328/j.cnki.jos.005060

李建中, 王宏志. 大数据可用性的研究进展. 软件学报,2016,27(7).

黎玲利,王宏志,高宏,李建中. XML数据流上Top-K关键字查询处理. 软件学报,2012,23(6):1561-1577.

李亚坤, 王宏志, 高宏, 李建中. 基于实体描述属性技术的XML重复对象检测方法. 计算机学报, 2011, 34(11), 2131-2141.

刘雪莉, 王宏志,李建中,高宏.基于实体的相似性连接算法. 软件学报,2015,26(6):1421-1437.

王宏志,樊文飞. 复杂数据上的实体识别技术研究.计算机学报. 2011, 34(10), 1843-1852.

王宏志,熊风, 邹开发, 刘哲敏. 教育大数据分析:方法与探索.《中国大学教学》, 2017 (5)

王宏志. 互联网金融:是机遇还是泡沫. 中国计算机学会通讯,第10卷, 第8期,50-53.

王宏志. 大数据质量管理:问题与研究进展. 科技导报2014,32(34): 78-84

陈志注,王宏志,熊风,张义策,高宏,李建中.大数据拍卖的定价策略与方法.中国科学技术大学学报,2018, 18(6): 486-494

王宏志, 梁志宇, 李建中, 高宏. 工业大数据分析综述:模型与算法. 大数据, 62-79, 2018.

梁志宇,王宏志,李建中,高宏. 制造业中的大数据分析技术应用研究综述. 机械, 45(6), 1-13, 2018.

孔欣欣,苏本昌,王宏志*,高宏,李建中,基于标签权重评分的推荐模型及算法研究,计算机学报,2015,Vol.38:在线出版号No.23

杨东华, 李宁宁, 王宏志*, 李建中, 高宏. 基于任务合并的并行大数据清洗过程优化. 计算机学报,2015,Vol.38:在线出版号No.37

张岩,唐兴,王宏志*. 劣质数据库上查询优化策略. 小型微型计算机系统, Vol.35 No. 11, 2410-2415,2014.

张岩,杨龙, 王宏志*.劣质数据库上阈值相似连接结果大小估计.计算机学报,35(10),2012.

骆吉洲,李建中,王宏志.压缩数据库中一种自适应直方图的构建.软件学报,2009,20(7):1785-1799.

王洪强,李建中,王宏志.基于F&B索引的XML查询处理算法.计算机研究与发展47(5): 866-877, 2010.

张硕, 李建中, 王宏志, 何震瀛 . 基于扩展编码的在线XML文档加载机制. 计算机研究与发展, 2004, 41 (10): 1829-1835.

霍然,王宏志,朱鎔,李建中,高宏. 基于Map-Reduce的大数据实体识别算法. 第一届全国大数据会议,计算机研究与发展(增刊)

朱乾坤,王宏志, 高宏.在线RFID多复杂事件查询处理技术.计算机科学与探索,2011, 5(09): 845-856.( NDBC2011萨师煊优秀论文奖)

缪丰羽,王宏志.图结构模糊XML文档上的模式匹配算法[J].计算机科学,2016,43(11):284-290

刘雪莉,王宏志,李建中,高宏.实体数据库中多相似连接顺序选择策略.计算机科学与探索, 6(10),2012.

张岩,杨忠胜,王宏志,高宏,李建中.基于压缩直方图的劣质数据库上相似连接结果大小估计. 小型微型计算机系统,2012(10), 2012.

李明达,王宏志,张佳程,李建中,高宏. PEIF: 基于并行机群的大数据实体识别算法. 第30届中国数据库学术会议.

丁小欧,王宏志,朱鎔,李建中,高宏. 基于数据质量评估的数据交易系统. 全国数据库会议 2014 系统演示.

魏延杰,王宏志,李言路,李建中,高宏. 微博小医生:基于新浪微博的医疗建议系统. 全国数据库会议 2014 系统演示.

李可利,王宏志,叶晨,郭欣彤,李建中,高宏. 关于数据密集型的众包清洗平台. 全国数据库会议 2014 系统演示.

于文涛,王宏志. 不完整数据上高效Skyline查询处理算法. 全国数据库会议 2014.

李佳宁,王宏志,李建中,高宏. 地理数据文本库上Top-k模糊查询技术研究. 全国数据库会议 2014.

金连,王宏志,高宏. 基于Map-Reduce的大数据缺失值填充算法. 第30届中国数据库学术会议.

叶晨,王宏志,李建中,高宏. 基于众包的电子商务数据实体分类系统. 第30届中国数据库学术会议.

张晓东,王宏志,高宏,李建中.一个针对电子商务数据的在线实体分类系统. 计算机研究与发展 增刊,2012.

佟星,王宏志,李建中,高宏.基于树结构索引的带权值字符串的 Top-k 查询算法.计算机研究与发展 增刊,2012.

周小田,王宏志,郭翔宇,胡筱,董志鑫,李建中,高宏. 基于 probase 的互联网商品信息分类与推荐系统. 2012年全国数据库会议(NDBC 2012), 系统演示.

姜国华,姜守旭,王宏志,李建中,高 宏.标签劣质的XML数据上的查询处理. 计算机科学与探索,2011,5(8): 673-685.

刘永楠,王宏志,高宏.MapReduce框架下基于字符串波形的实体识别方法. 计算机科学与探索,2011,5(8): 730-739

孟啸, 王宏志, 高宏, 李建中. bibEOS:一个高质量的社会化文献检索与管理系统. 第二十六届中国数据库学术会议, 2010,计算机科学与探索, 4(1): 54-63.

何震瀛, 李建中, 商超, 王宏志. 功能完全的XML数据查询语言X-SQL. 哈尔滨工业大学学报. 38(5): 678-681, 2006.

张航, 王宏志, 李建中, 高宏. 基于2-hop优化的子图模式匹配算法. 黑龙江大学自然科学学报, 2010, (01) :78-82. (黑龙江省计算机学会年会议优秀论文)

张春鹤, 李建中, 王宏志. 动态图结构XML数据上的查询处理算法. 第二十四届中国数据库学术会议, 2007,计算机研究与发展, 44(增刊) (10): 374-378. (第二十四届中国数据库会议优秀论文)

王宏志,李建中,何振瀛,石胜飞,李金宝. 基于XML的传感器网络数据处理. 计算机研究与发展 第40卷(增刊). 188-191, 2003.

王宏志, 李建中, 骆吉洲, 张艳秋. 海量关系数据库的压缩存储与查询策略. 计算机研究与发展增刊第40卷(增刊),337-341, 2003.

Publications by Subjects
Data Quality
Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Entity Resolution based on Subgraph Cohesion. Knowledge and Information Systems. online published, DOI: 10.1007/s10115-015-0818-7
Yue Wang, Hongzhi Wang, Liyan Zhang, Yang Wang, Jianzhong Li, Hong Gao. Extend Tree Edit Distance for Effective Object Identification. Knowledge and Information Systems. online published, DOI: 10.1007/s10115-014-0816-1
Yan Zhang, Hongzhi Wang, Hong Gao, Jianzhong Li. Efficient accuracy evaluation for multi-modal sensed data. Journal of Combinatorial Optimization. online published, DOI: 10.1007/s10878-015-9920-8
Yan Zhang, Hongzhi Wang, Zhongsheng Yang, Jianzhong Li. Relative Accuracy Evaluation. PLoS ONE 9(8): e103853. doi:10.1371/journal.pone.**.
Fangda Wang, Hongzhi Wang, Jianzhong Li, Hong Gao. Graph-based Reference Table Construction to Facilitate Entity Matching. Journal of Systems and Software, Volume 86, Issue 6, 2013, 1679–1688.
Yakun Li, Hongzhi Wang, Hong Gao, Jianzhong Li. An Efficient Entity Resolution Method for Large Relations. International Journal of Cooperative Information Systems, Vol. 22, No. 1 (2013), 1–17.
Hongzhi Wang, Xiaodong Zhang, Jianzhong Li, Hong Gao: ProductSeeker: entity-based product retrieval for e-commerce. SIGIR 2013: 1085-1086
Hongzhi Wang, Mingda Li, Yingyi Bu, Jianzhong Li, Hong Gao, Jiacheng Zhang: Cleanix: A Big Data Cleaning Parfait. CIKM 2014: 2024-2026
Lingli Li, Jianzhong Li, Hongzhi Wang, Hong Gao: Context-based entity description rule for entity resolution. CIKM 2011: 1725-1730
Hongzhi Wang, Jianzhong Li, Ran Huo, Li Jia, Lian Jin, Xueying Men, Hui Xie. HITCleaner: A Light-weight Online Data Cleaning System. Proceedings of DASFAA 2013, 481-484. Demo.
Hongzhi Wang, Xueli Liu, Jianzhong Li, Xing Tong, Long Yang, Yakun Li. EntityManager: An Entity-based Dirty Data Management System. Proceedings of DASFAA 2013. 468-471. Demo.
Ming Yan, Yan Zhang, Hongzhi Wang: Tree-Based Metric Learning for Distance Computation in Data Mining. APWeb 2015: 377-388
Xiaoou Ding, Hongzhi Wang, Dan Zhang, Jianzhong Li, Hong Gao: A Fair Data Market System with Data Quality Evaluation and Repairing Recommendation. APWeb 2015: 855-858
Chen Ye, Hongzhi Wang, Keli Li, Qian Chen, Jianhua Chen, Jiangduo Song, Weidong Yuan: CrowdCleaner: A Data Cleaning System Based on Crowdsourcing. APWeb 2014: 657-661
Guangze Liu, Hongzhi Wang, ChengHui Chen, Hong Gao: TruthOrRumor: Truth Judgment from Web. APWeb 2014: 674-678
Hang Zhang, Hongzhi Wang, Jianzhong Li, Hong Gao. Neighbor-base Similarity Matching for Graphs, CloudDB 2014, 191-198.
Ye Chen, Hongzhi Wang. Capture Missing Values based on Crowdsourcing. WASA 2014 Workshop.
Yan Zhang, Hongzhi Wang. Accuracy Evaluation for Sensed Data. WASA 2014
Mingda Li, Hongzhi Wang and Ye Li. Sectional and Conditional Functional Dependencies. WASA 2014 Workshop.
Chen Ye, Hongzhi Wang. Truth discovery based on Crowdsourcing. WAIM 2014.
Li Jia, Hongzhi Wang, Jianzhong Li, Hong Gao: Incremental Truth Discovery for Information from Multiple Data Sources. WAIM Workshops 2013: 56-66
Huabin Feng, Hongzhi Wang, Jianzhong Li, Hong Gao: Entity Resolution on Uncertain Relations. WAIM 2013: 77-86
Lian Jin, Hongzhi Wang, Hong Gao: Imputation for Categorical Attributes with Probabilistic Reasoning. WAIM 2013: 87-98
Rui Guo, Hongzhi Wang, Kaiyu Li, Jianzhong Li, Hong Gao: CUVIM: Extracting Fresh Information from Social Network. WAIM 2013: 351-362
Hui Xie, Hongzhi Wang, Jianzhong Li, Hong Gao: A Data Cleaning Framework Based on User Feedback. WAIM 2013: 514-520
Yan Zhang, Long Yang, Hongzhi Wang: Range Query Estimation for Dirty Data Management System. WAIM 2012: 152-164
Xing Tong, Hongzhi Wang: Fgram-Tree: An Index Structure Based on Feature Grams for String Approximate Search. WAIM 2012: 241-253
Liu Yongnan, Wang Hongzhi, Gao Hong. A Fast Entity Resolution Method based on Wave of Records. CECNet 2011.
Qian Liu, Hongzhi Wang, Hong Gao, Qi Lv, Jianyu Fu. A Recommendation Method in E-commerce based on product taxonomy graph. 13th IEEE Joint International Computer Science and Information Technology Conference(JICSIT 2011).
Qing Wang, Shouxu Jiang, Hongzhi Wang, Hong Gao. AIIS: An Efficient String Index on Inconsistent Data. 2011 international conference on Computer Science and Information Engineering (CSIE2011).
Yakun Li, Hongzhi Wang, Hong Gao. Efficient Entity Resolution based on Sequence Rules. 2011 international conference on Computer Science and Information Engineering (CSIE2011).
Fei Li, Hongzhi Wang, Liang Hao, Jianzhong Li, Hong Gao: pq-Hash: An Efficient Method for Approximate XML Joins. XML-DM 2010: 125-134
Fei Li, Hongzhi Wang, Cheng Zhang, Liang Hao, Jianzhong Li, Hong Gao: Approximate Joins for XML Using g-String. XSym 2010: 3-17
Guohua Jiang, Hongzhi Wang, Shouxu Jiang, Jianzhong Li, Hong Gao: DCUBE: CUBE on Dirty Databases. WAIM 2010: 507-512.
Mohan Li, Hongzhi Wang, Jianzhong Li, Hong Gao: Efficient Duplicate Record Detection Based on Similarity Estimation. WAIM 2010: 595-607.
Lingli Li, Hongzhi Wang, Hong Gao, Jianzhong Li: EIF: A Framework of Effective Entity Identification. WAIM 2010: 717-728.
王宏志, 李建中, 高宏. 一种非清洁数据库的数据模型. 2012,软件学报, 23(3):539-549.
李亚坤, 王宏志, 高宏, 李建中. 基于实体描述属性技术的XML重复对象检测方法. 计算机学报, 2011, 34(11), 2131-2141.
刘雪莉, 王宏志,李建中,高宏.基于实体的相似性连接算法. 软件学报,2015,26(6):1421-1437.
王宏志,樊文飞. 复杂数据上的实体识别技术研究.计算机学报. 2011, 34(10), 1843-1852.
王宏志. 大数据质量管理:问题与研究进展. 科技导报2014,32(34): 78-84
杨东华, 李宁宁, 王宏志*, 李建中, 高宏. 基于任务合并的并行大数据清洗过程优化. 计算机学报,2015,Vol.38:在线出版号No.37
张岩,唐兴,王宏志*. 劣质数据库上查询优化策略. 小型微型计算机系统, Vol.35 No. 11, 2410-2415,2014.
张岩,杨龙, 王宏志*.劣质数据库上阈值相似连接结果大小估计.计算机学报,35(10),2012.
霍然,王宏志,朱鎔,李建中,高宏. 基于Map-Reduce的大数据实体识别算法. 第一届全国大数据会议,计算机研究与发展(增刊)
刘雪莉,王宏志,李建中,高宏.实体数据库中多相似连接顺序选择策略.计算机科学与探索, 6(10),2012.
张岩,杨忠胜,王宏志,高宏,李建中.基于压缩直方图的劣质数据库上相似连接结果大小估计. 小型微型计算机系统,2012(10), 2012.
李明达,王宏志,张佳程,李建中,高宏. PEIF: 基于并行机群的大数据实体识别算法. 第30届中国数据库学术会议.
丁小欧,王宏志,朱鎔,李建中,高宏. 基于数据质量评估的数据交易系统. 全国数据库会议 2014 系统演示.
李可利,王宏志,叶晨,郭欣彤,李建中,高宏. 关于数据密集型的众包清洗平台. 全国数据库会议 2014 系统演示.
于文涛,王宏志. 不完整数据上高效Skyline查询处理算法. 全国数据库会议 2014.
李佳宁,王宏志,李建中,高宏. 地理数据文本库上Top-k模糊查询技术研究. 全国数据库会议 2014.
金连,王宏志,高宏. 基于Map-Reduce的大数据缺失值填充算法. 第30届中国数据库学术会议.
叶晨,王宏志,李建中,高宏. 基于众包的电子商务数据实体分类系统. 第30届中国数据库学术会议.
张晓东,王宏志,高宏,李建中.一个针对电子商务数据的在线实体分类系统. 计算机研究与发展 增刊,2012.
佟星,王宏志,李建中,高宏.基于树结构索引的带权值字符串的 Top-k 查询算法.计算机研究与发展 增刊,2012.
周小田,王宏志,郭翔宇,胡筱,董志鑫,李建中,高宏. 基于 probase 的互联网商品信息分类与推荐系统. 2012年全国数据库会议(NDBC 2012), 系统演示.
姜国华,姜守旭,王宏志,李建中,高 宏.标签劣质的XML数据上的查询处理. 计算机科学与探索,2011,5(8): 673-685.
刘永楠,王宏志,高宏.MapReduce框架下基于字符串波形的实体识别方法. 计算机科学与探索,2011,5(8): 730-739
Graph Management and Mining
Hongzhi Wang, Jianzhong Li, Jizhou Luo, Hong Gao. Hash-base subgraph query processing method for graph-structured XML documents. PVLDB 1(1): 478-489 (2008).
Zhao Sun, Hongzhi Wang, Haixun Wang, Bin Shao, Jianzhong Li. Efficient Subgraph Matching on Billion Node Graphs. PVLDB 5(5), 2012
Wenfei Fan, Jianzhong Li, Shuai Ma, Hongzhi Wang, Yinghui Wu. Graph Homomorphism Revisited for Matching Web Sites. PVLDB 3, 2010
Yue WANG, Hongzhi WANG, Jianzhong LI, Hong GAO. Efficient Graph Similarity Join for Information Integration on Graphs. Front. Comput. Sci. DOI: 10.1007/s11704-015-4505-3
Yakun Li, Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Community Detection with Additive Constrains on Large Networks. Knowledge-Based Systems, 52 (2013) 268-278.
Yue Wang, Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Subgraph Join based on Connectivity Similarity. World Wide Web 18(4): 871-887 (2015).
Hongzhi Wang, Jianzhong Li, Wei Wang, Xuemin Lin. Coding-based Join Algorithms for Structural Queries on Graph-structured XML Document. World Wide Web Journal 11(4): 153-168 (2008).
Hongzhi Wang, Jianzhong Li. GXQuery: Extending XQuery for Querying Graph-structured XML Data. Journal of Computing and Information Technology 19(2): 83-91 (2011).
Rui Guo, Hongzhi Wang, Lucheng Zhong, Jianzhong Li, Hong Gao: Harbinger: An Analyzing and Predicting System for Online Social Network Users' Behavior. DASFAA (2) 2014: 531-534
Yue Wang, Hongzhi Wang, Chen Ye, Hong Gao. Graph Similarity Join with K-Hop Tree Indexing. ICYCSEE 2015, Communications in Computer and Information Science Volume 503, 2015, pp 38-47.
Qiqi Shi, Hongzhi Wang, Dong Li, Xinfei Shi, Chen Ye, Hong Gao. Maximal Influence Spread for Social Network Based on MapReduce. ICYCSEE 2015, Communications in Computer and Information Science Volume 503, 2015, pp 128-136
Xiaojie Lin, Rui Zhang, Zeyi Wen, Hongzhi Wang, Jianzhong Qi: Efficient Subgraph Matching Using GPUs. ADC 2014: 74-85
Xueli Liu, Hongzhi Wang: Dynamic Graph Shortest Path Algorithm. WAIM 2012: 296-307
Cong Wang, Hongzhi Wang: Graph-Structured Data Compression Based on Frequent Subgraph Contraction. GDMM 2012: 11-18.
Hongzhi Wang, Jianzhong Li, Shuguang Xiong. Efficient Join Algorithms for Integrating XML Data in Grid Environment. GCC 2005: 547-553.
Hongzhi Wang, Wei Wang, Xuemin Lin, Jianzhong Li. Subgraph Join: Efficient Processing Subgraph Queries on Graph-Structured XML Document. WAIM 2005: 68-80.
Hongzhi Wang, Wei Wang, Xuemin Lin, Jianzhong Li. Labeling Scheme and Structural Joins for Graph-Structured XML Data. APWeb 2005: 277-289.
王宏志,骆吉洲,李建中. 图结构XML数据子图查询的高效处理算法. 软件学报. 2009,20(9): 2436-2449
张航, 王宏志, 李建中, 高宏. 基于2-hop优化的子图模式匹配算法. 黑龙江大学自然科学学报, 2010, (01) :78-82. (黑龙江省计算机学会年会议优秀论文)
张春鹤, 李建中, 王宏志. 动态图结构XML数据上的查询处理算法. 第二十四届中国数据库学术会议, 2007,计算机研究与发展, 44(增刊) (10): 374-378. (第二十四届中国数据库会议优秀论文)
Data-driven eHealth
Jianzhong Qi, Rui Zhang, Kotagiri Ramamohanarao, Hongzhi Wang, Zeyi Wen, Dan Wu. Indexable online time series segmentation with error bound guarantee. World Wide Web 18(2): 359-401 (2015).
朱乾坤,王宏志, 高宏.在线RFID多复杂事件查询处理技术.计算机科学与探索,2011, 5(09): 845-856.( NDBC2011萨师煊优秀论文奖)
魏延杰,王宏志,李言路,李建中,高宏. 微博小医生:基于新浪微博的医疗建议系统. 全国数据库会议 2014 系统演示.
Information Integration
Hongzhi Wang, Jianzhong Li, Jizhou Luo. Data Sources Selection for XML Data Sources. International Journal of Intelligent Information and Database Systems Vol. 2 422-445 (2008).
Hongzhi Wang, Jianzhong Li, Shuguang Xiong. Efficient join algorithms for distributed information integration based on XML. Int. J. Business Process Integration and Management, Vol. 3, No. 4, pp.271–281.
Xu Bian, Hongzhi Wang, Hong Gao: Schema Mapping with Quality Assurance for Data Integration. XML-DM 2011: 472-483.
Hongzhi Wang, Jianzhong Li, Zhenying He. INEXP: Information Exchange Protocol for Interoperability. ICADL 2002.
Hongzhi Wang, Jianzhong Li, Zhenying He. An Effective Wrapper Architecture to Heterogeneous Data Source. AINA 2003: 565-569.
XML Data Management
Fei Li, Hongzhi Wang, Jianzhong Li, Hong Gao. Approximate Joins for XML at Label Level. Information Sciences 282 (2014) 237–249
Fei Li, Hongzhi Wang, Jianzhong Li, Hong Gao. A Survey on Tree Edit Distance Lower Bound Estimation Techniques for Similarity Join on XML Data. Sigmod Record 42(4), 2013
Hongqiang Wang, Jianzhong Li, Hongzhi Wang. Clustered Chain Path Index for XML Document: Efficiently Processing Branch Queries. World Wide Web 11(1): 153-168 (2008).
Wei Wang, Hongzhi Wang, Hongjun Lu, Haifeng Jiang, Xuemin Lin, Jianzhong Li. Efficient Processing of XML Path Queries Using the Disk-based F&B Index. VLDB 2005: 145-156.
Hongzhi Wang, Jianzhong Li, Xianmin Liu, Jizhou Luo. Query Optimization for Complex Path Queries on XML Data. DASFAA 2009: 389-404.
Lingli Li, Hongzhi Wang, Jianzhong Li, Hong Gao. Efficient Algorithms for Skyline Top-K Keyword Queries on XML Streams. DASFAA 2009: 283-287.
Hongqiang Wang, Jianzhong Li, Hongzhi Wang. Clustered Chain Path Index for XML Document: Efficiently Processing Branch Queries. WISE 2006: 474-486. (Best Paper)
Bingyi Qian, Hongzhi Wang, Jianzhong Li, Hong Gao: Path-Based XML Stream Compression with XPath Query Support. XML-DM 2012: 329-339.
Yue Wang, Hongzhi Wang, Yang Wang, Hong Gao. Similarity Join on XML Based on k-Generation Set Distance. XML-DM 2011: 124-135.
Qing Wang, Hongzhi Wang, Hong Gao, Jianzhong Li: Compression Algorithms for Structural Query Results on XML Data. XML-DM 2010: 141-145
Lingli Li, Hongzhi Wang, Jianzhong Li, Jizhou Luo. Efficient Top-k Keyword Search on XML Streams. ICYCS 2008: 1041-1046.
Xianmin Liu, Jianzhong Li, Hongzhi Wang. SAM: An Efficient Algorithm for F&B-Index Construction. APWeb/WAIM 2007: 697-708.
Hongqiang Wang, Jianzhong Li, Hongzhi Wang. Clustered Absolute Path Index for XML Document: On Efficient Processing of Twig Queries. APWeb Workshops 2006: 1-10.
Xin Zhan, Jianzhong Li, Hongzhi Wang, Zhenying He. Caching Frequent XML Query Patterns. APWeb Workshops 2006: 68-75.
Hongzhi Wang, Wei Wang, Jianzhong Li, Xuemin Lin, Reymond Wong: Practical Indexing XML Document for Twig Query. ASIAN 2005: 208-222.
Hongzhi Wang, Jianzhong Li, Zhenying He. Optimized Query Translation Strategy for XML Stored in Relational Database. WAIM 2004: 378-388.
Hongzhi Wang, Jianzhong Li, Jizhou Luo, Zhenying He. XCpaqs: Compression of XML Document with XPath Query Support. ITCC (1) 2004: 354-357.
Hongzhi Wang, Jianzhong Li, Zhenying He, Hong Gao. Xaggregation: Flexible Aggregation of XML Data. WAIM 2003: 104-115.
Hongzhi Wang, Jianzhong Li, Zhenying He, Jizhou Luo. Web Information Integration Based on Compressed XML. DNIS 2003: 122-137.
Hongzhi Wang, Jianzhong Li, Zhenying He, Hong Gao. OLAP for XML Data. CIT 2005: 233-237.
王宏志,李建中,骆吉洲. XML数据流上的高效聚集算法. 软件学报. 2008,19(8): 2032-2042.
黎玲利,王宏志,高宏,李建中. XML数据流上Top-K关键字查询处理. 软件学报,2012,23(6):1561-1577.
王洪强,李建中,王宏志.基于F&B索引的XML查询处理算法.计算机研究与发展47(5): 866-877, 2010.
张硕, 李建中, 王宏志, 何震瀛 . 基于扩展编码的在线XML文档加载机制. 计算机研究与发展, 2004, 41 (10): 1829-1835.
何震瀛, 李建中, 商超, 王宏志. 功能完全的XML数据查询语言X-SQL. 哈尔滨工业大学学报. 38(5): 678-681, 2006.
王宏志,李建中,何振瀛,石胜飞,李金宝. 基于XML的传感器网络数据处理. 计算机研究与发展 第40卷(增刊). 188-191, 2003.
Miscellaneous
Xintong Guo, Hongzhi Wang, Yangqiu Song, Gao Hong: Brief survey of crowdsourcing for data mining. Expert Syst. Appl. 41(17): 7987-7994 (2014)
WANG Hong-zhi, LIU Yong-Qiang, LIU Zhan-yi, SHANG Shou-ting. Mathematical model for detection ground water pollutant. Journal of Harbin Institute of Technology 7(4), 2000.
Jie Pan, Hongzhi Wang, Hong Gao, Wenxuan Zhao, ongxing Huo, Huirong Dong. Paradise Pointer: A Sightseeing Scenes Images Search Engine Based on Big Data Processing. ICYCSEE 2015, Communications in Computer and Information Science Volume 503, 2015, pp 448-452
Jizhou Luo, Jianzhong Li, Hongzhi Wang, Yanqiu Zhang, Kai Zhao. The Compression of Massive Offline Relations. WAIM 2004: 634-639.
Jinghua Zhu, Jianzhong Li, Jizhou Luo, Wei Zhang, Hongzhi Wang. C-kNN Query Processing in Object Tracking Sensor Networks. WASA 2008: 432-443.
王宏志. 互联网金融:是机遇还是泡沫. 中国计算机学会通讯,第10卷, 第8期,50-53.
孔欣欣,苏本昌,王宏志*,高宏,李建中,基于标签权重评分的推荐模型及算法研究,计算机学报,2015,Vol.38:在线出版号No.23
骆吉洲,李建中,王宏志.压缩数据库中一种自适应直方图的构建.软件学报,2009,20(7):1785-1799.
孟啸, 王宏志, 高宏, 李建中. bibEOS:一个高质量的社会化文献检索与管理系统. 第二十六届中国数据库学术会议, 2010,计算机科学与探索, 4(1): 54-63.
王宏志, 李建中, 骆吉洲, 张艳秋. 海量关系数据库的压缩存储与查询策略. 计算机研究与发展增刊第40卷(增刊),337-341, 2003.

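Research Projects / Funding

Project name: Research on Key Techniques of Entity Resolution in Data Quality Management
Source: National Natural Science Foundation of China (NSFC), Young Scientists Fund
Start date: 2011-01-01
End date: 2013-12-01
Role: Principal investigator
Category: Government-funded (纵向) project
Status: Completed

Project name: Research on Key Techniques of Massive Data Quality Management
Source: China Postdoctoral Science Foundation, Special Funding
Start date: 2010-09-01
End date: 2013-05-01
Role: Principal investigator
Category: Commissioned (横向) project
Status: Completed

Project name: Research on Key Techniques for Quantity-Quality Integrated Management of Massive Graph Data
Source: Harbin Institute of Technology Scientific Research Innovation Fund
Start date: 2012-06-01
End date: 2014-05-01
Role: Principal investigator
Category: Commissioned (横向) project
Status: Completed

Project name: Uncertain Data Management Over Internet of Things
Source: IBM China Research Lab joint research project
Start date: 2012-01-01
End date: 2012-12-01
Role: Principal investigator
Category: Government-funded (纵向) project
Status: In progress

Project name: Research on Key Techniques of Entity-Based Query Processing in E-commerce
Source: Alibaba Young Scholars Support Program
Start date: 2010-04-01
End date: 2011-03-01
Role: Principal investigator
Category: Government-funded (纵向) project
Status: In progress

Project name: Research on Basic Theory and Key Techniques of Massive Information Usability
Source: National Basic Research Program of China (973 Program)
Start date: 2012-01-01
End date: 2016-12-01
Role: Participant
Category: Commissioned (横向) project
Status: Completed

Project name: System Platform and Applications for Extraction, Integration, Analysis and Management of Massive Web Data in Open Environments
Source: 863 Program, thematic project in the information technology field
Start date: 2012-01-01
End date: 2014-01-01
Role: Participant
Category: Commissioned (横向) project
Status: Completed

Project name: Theory and Key Techniques of Uncertain Data Management
Source: NSFC Key Program
Start date: 2010-01-01
End date: 2013-12-01
Role: Participant
Category: Government-funded (纵向) project
Status: Completed

Project name: Data Integration for Uncertain Sensor Networks
Source: NSFC, NSF-RGC project
Start date: 2009-01-01
End date: 2011-12-01
Role: Participant
Category: Government-funded (纵向) project
Status: Completed

Project name: Development and Application of PXRDB, a Pure-XML Relational Database System
Source: National 863 Program, goal-oriented project
Start date: 2009-01-01
End date: 2010-12-01
Role: Participant
Category: Government-funded (纵向) project
Status: Completed

Project name: Research on Key Techniques of Compressed XML Databases
Source: NSFC General Program
Start date: 2008-01-01
End date: 2008-12-01
Role: Participant
Category: Government-funded (纵向) project
Status: Completed

Project name: Research on Key Techniques of Query Processing in Graph Database Systems
Source: NSFC General Program
Start date: 2008-01-01
End date: 2010-12-01
Role: Participant
Category: Government-funded (纵向) project
Status: Completed

Project name: Research on Basic Software and Data Management Techniques for Sensor Network Systems
Source: NSFC Key Program
Start date: 2006-01-01
End date: 2009-12-01
Role: Participant
Category: Government-funded (纵向) project
Status: Completed

Project name: Research on Key Techniques of Compressed Database Systems
Source: NSFC General Program
Start date: 2003-01-01
End date: 2005-12-01
Role: Participant
Category: Government-funded (纵向) project
Status: Completed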



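Awards

Title: Internet Product Information Extraction, Analysis and Retrieval System
Year: 2012
Contributors: 周小田, 郭翔宇, 胡筱, 董志鑫
Award: First Prize, Outstanding Project of the Undergraduate Innovative Experiment Program

Title: E-commerce Information Retrieval and Integration System Based on Windows Clusters (Parallelization of Entity-Based Product Retrieval Data)
Year: 2012
Contributors: 张晓东, 陈敏, 陈懿诚
Award: Second place, Microsoft HPC Campus Programming Contest

Title: Entity Resolution Techniques Supporting E-commerce and Their Applications
Year: 2011
Contributors: 刘倩, 张晓东, 李飞, 王玥
Award: University Award for Outstanding Student Achievement in Science and Technology Innovation

Title: Exploration of Classification Techniques for Complex Categories in Massive E-commerce Data
Year: 2011
Contributors: 刘倩, 张晓东, 付建宇
Award: First Prize, Outstanding Project of the Undergraduate Innovative Experiment Program

Title: Research on Entity Resolution and Retrieval Algorithms for Massive Data Based on Windows Clusters
Year: 2011
Contributors: 李亚坤, 刘永楠, 刘超亚
Award: Merit Award, Microsoft HPC Campus Programming Contest

Title: High-Usability E-Marketplace (高可用性电子集市)
Year: 2010
Contributors: 刘倩, 张晓东, 付建宇
Award: Third Prize, Outstanding Project of the Undergraduate Innovative Experiment Program

Title: pureXML-Based Literature Retrieval System
Year: 2009
Contributors: 黎玲利, 孟啸, 高静
Award: Third place, "Searching for PureXML Applications" competition

Title: Efficient Integrated Parallel Cleaning System for Massive Data
Year: 2014
Contributors: 李明达, 张红阳, 杨丽霞, 李昱昕
Award: First Prize, Outstanding Project of the 2014 HIT Undergraduate Innovation and Entrepreneurship Training Program

Title: Entity-Based Social Network Data Management System
Year: 2014
Contributors: 过云燕, 张玮奇, 王佩琪, 徐竟祎
Award: Second Prize, Outstanding Project of the 2014 HIT Undergraduate Innovation and Entrepreneurship Training Program

Title: Data Market System Based on Quality Evaluation
Year: 2015
Contributors: 丁小欧, 成烈南杰, 张丹, 肖蕾
Award: First Prize, Outstanding Project of the Harbin Institute of Technology Undergraduate Innovation and Entrepreneurship Training Program

Title: Theory and Techniques of Massive Data Computing
Year: 2011
Contributors: 李建中, 樊文飞, 高宏, 王宏志
Award: First Prize, Heilongjiang Province Natural Science Award

Title: Research on XML Data Query Processing Techniques
Year: 2009
Contributors: 王宏志
Award: China Computer Federation (CCF) Outstanding Doctoral Dissertation

Title: Research on XML Data Query Processing Techniques
Year: 2009
Contributors: 王宏志
Award: Harbin Institute of Technology Outstanding Doctoral Dissertation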



Mahout-R
A Hadoop-based big data analysis system. We implement the R language on top of Hadoop. Since Mahout is a Hadoop-based big data analysis system, we first wrap Mahout to support R. Then, for the functions in R that are not supported by Mahout, we attempt to design new MapReduce algorithms and implement them on Hadoop.
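As an illustration only (this sketch is not taken from the Mahout-R code base), the idea of re-expressing an R function as a MapReduce algorithm can be shown with R's mean(). Each mapper reduces its data split to a (sum, count) pair, and the reducer merges those pairs. The names map_phase, reduce_phase and mapreduce_mean are hypothetical, and plain Python lists stand in for Hadoop input splits.

```python
from functools import reduce


def map_phase(chunk):
    """Mapper: summarize one data split as a (sum, count) pair."""
    return (sum(chunk), len(chunk))


def reduce_phase(a, b):
    """Reducer: merge two partial (sum, count) pairs into one."""
    return (a[0] + b[0], a[1] + b[1])


def mapreduce_mean(chunks):
    """Compute the mean of all values spread across several splits."""
    partials = [map_phase(c) for c in chunks]
    total, count = reduce(reduce_phase, partials, (0.0, 0))
    if count == 0:
        raise ValueError("cannot take the mean of empty data")
    return total / count


if __name__ == "__main__":
    # Three splits, as if the data were stored in three blocks.
    splits = [[1.0, 2.0, 3.0], [4.0, 5.0], [6.0]]
    print(mapreduce_mean(splits))  # 3.5
```

The (sum, count) pair is used instead of per-split means because pairs can be merged in any order and grouping, which is what lets the reduce step run independently of how the data was partitioned.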

The Bayes network function for R is coming soon.

Update (2015.4.2)

The first edition of Mahout-R.

For details, see

https://github.com/novas-meng/mahoutr#mahoutr

