Shanjiang Tang
Assistant Professor
School of Computer Science& Technology
Tianjin University
Contact55-B517, CS, Beiyang Campus, JinNan District, Tianjin, China, 300350 [map]
Tel: +**936
tashj [at] tju [dot] edu [dot] cn
http://cs.tju.edu.cn/faculty/tangshanjiang
Bio | Publications |Projects |Code |CV |Other Stuff
About MeI am an Assistant Professor in the School of Computer Science& Technology,Tianjin University. I received my Ph.D degree from Nanyang Technological University in 2015. I received the B.Eng.and M.Sc. degrees from School ofSoftware Engineering andSchool of Computer Science& Technology at TianjinUniversity in 2008 and 2011, respectively.
My general research interests primarily focus on large-scale computing systems, big data, and cloud computing, withspecial emphasis on the resource management and job scheduling for Hadoop/YARN system.Specifically, I am interested in designing new scheduling and resource allocation algorithms,analyzing their performance, and implementing them in large-scale computing systems. I am also interested in problemsat the intersection of computing systems and economics.
News04/2015, our paper, entitled “Dynamic Job Ordering and Slot Configurations for MapReduce Workloads ” has been accepted to IEEE Transactions on Services Computing.
06/2014, our paper, entitled “DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters ” has been accepted to IEEE Transactions on Cloud Computing.
03/2014, our paper, entitled “Long-Term Resource Fairness: Towards Economic Fairness on Pay-as-you-use Computing Systems ” has been accepted to ICS 2014.
03/2014, new website goes online.
PublicationsJournal ArticlesShanjiang Tang, Bu-Sung Lee, and Bingsheng He, “Dynamic Job Ordering and Slot Configurations for MapReduce Workloads,” IEEE Transactions on Services Computing, 2015.
Shanjiang Tang, Bu-Sung Lee, and Bingsheng He, “DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters,” IEEE Transactions on Cloud Computing, 2014. [Supplementary Document][slides]
Shanjiang Tang, Ce Yu, Jizhou Sun, Bu-Sung Lee, Tao Zhang, Zheng Xu, and Huabei Wu, “EasyPDP: An Efficient Parallel Dynamic Programming Runtime System for Computational Biology,” IEEE Transactions on Parallel and Distributed System, 2012. [Supplementary Document][slides]
Conference and Workshop ProceedingsShanjiang Tang, Bu-Sung Lee, and Bingsheng He, “Towards Economic Fairness for Big Data Processing in Pay-as-you-go Cloud Computing,” in CloudCom 2014 (Ph.D. Consortium), Singapore, Dec 2014. [slides]
Shanjiang Tang, Bu-Sung Lee, Bingsheng He and Haikun Liu, ‘‘Long-Term Resource Fairness: Towards Economic Fairness on Pay-as-you-use Computing Systems,’’ to be presented in the 28th International Conference on Supercomputing (ICS), Munich, Germany, June 2014. [slides]
Shanjiang Tang, Bu-Sung Lee, and Bingsheng He, ‘‘Dynamic slot allocation technique for MapReduce clusters,’’ In IEEE Cluster 2013, Indiana , USA, Sept 2013. [slides]
Shanjiang Tang, Bu-Sung Lee, and Bingsheng He, ‘‘MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads, ’’In Euro-Par 2013, Aachen, Germany, Aug 2013. [slides]
Jun Du, Ce Yu, Jizhou Sun, Chao Sun, Shanjiang Tang, and Yanlong Yin, ‘‘EasyHPS: A Multilevel Hybrid Parallel System for Dynamic Programming. ’’ in IPDPS Workshops 2013, Boston, USA, May 2013.
Shanjiang Tang, Bu-Sung Lee, and Bingsheng He, ‘‘ Speedup for Multi-Level Parallel Computing,’’ in IPDPS Workshops 2012, Shanghai, China, May 2013. [slides]
Shanjiang Tang, Ce Yu, Bu-Sung Lee, Chao Sun, and Jizhou Sun, ‘‘ Adaptive Data Refinement for Parallel Dynamic Programming Applications,’’ in IPDPS Workshops 2012, Shanghai, China, May 2012. [slides]
Technical ReportsShanjiang Tang, Bu-Sung Lee, and Bingsheng He, “Economic Fairness for Resource Sharing in Pay-as-you-use Cloud Computing.” [Technical Report-09-2014]
Shanjiang Tang, Zhaojie, Niu, Bu-Sung Lee, and Bingsheng He, “Multi-Resource Fair Allocation in Pay-as-you-go Cloud Computing.” [Technical Report-07-2014]
PatentsChao Sun, Jizhou Sun, Shanjiang Tang, et al., Multi-level parallel programming method, CN**0, Nov 2010.
Ce Yu, Shanjiang Tang, Jizhou Sun, et al.,Parallel programming model system of DAG oriented data driving type application and realization method, CN**9, May 2010.
Chao Sun, Jizhou Sun, Shanjiang Tang, et al., Visual modeling and code skeleton generating method for supporting design of multinuclear parallel program, CN**1, Nov 2010.
Ce Yu, Jizhou Sun, Zhen Xu, Huabei Wu, Shizhong Liao, Xiaojing Meng, Shanjiang Tang, et al.,MPI parallel programming system based on visual modeling and automatic skeleton code generation method, CN**5, June 2009.
ProjectsI focus on systems and algorithms for large-scale data-intensive computing. My projects include:
MRYARN: Pay-as-you-go is a popular billing model based on users' resource usage in the cloud. A user's demand is often changing over time, indicating that it is difficult to keep the high resource utilization all the time for cost efficiency. Resource sharing is an effective approach for high resource utilization. In view of the heterogeneous resource demands of workloads in the cloud, multi-resource allocation fairness is a must for resource sharing in cloud computing. MRYARN is proposed for multi-resource fair allocation on the cloud. It ensures that each user in cloud computing can at least get the amount of total resources as that under the exclusively non-sharing environment in the long term. Moreover, MRYARN can guarantee that no users can get more amount of total allocated resources over time by lying their demands. Finally, MRYARN has a mechanism to discourage users to submit cost-inefficient workloads, especially when there are some idle resources they truly do not need.(homepage)(Technical Report-07-2014)
LTYARN: Life is not fair, but with a little help, existing large-scale data processing systems (e.g., YARN, Spark, Dryad) can be, ensuring resource sharing between users. However, past work on fair sharing considered memoryless fairness, an instantaneous fair share without historical information considered. When it comes to cloud computing (i.e., pay-as-you-use computing), it fails to satisfy the service-as-you-pay fairness (i.e., the total service that each user enjoys should be proportional to her payment) from a long-term view. Long-Term Resource Fairness (LTRF) generalizes max-min fairness for this case. LTYARN implements LTRF for YARN in cloud computing. (homepage) (demo) (ICS'14) (CloudCom'14)(Technical Report-09-2014)
DynamicMR: Hadoop MRv1 uses the slot-based resource model with the static configuration of map/reduce slots. Due to the pre-configuration of distinct map slots and reduce slots which are not fungible on slave nodes, slots can be severely under-utilized, which significantly degrades the performance. Although YARN was proposed to address this problem by giving a new resource model of 'container' that either map and reduce tasks can run on, we keep the slot-based model by proposing an alternative technique called Dynamic Hadoop Slot Allocation (DHSA). It relaxes the slot allocation constraint to allow slots to be reallocated to either map or reduce tasks depending on their needs. Our experiments show that it consistently outperforms YARN by about 2% ~ 9% for multiple jobs due to the ratio control mechanism of running map/reduce tasks. Second, the speculative execution can tackle the straggler problem, which has shown to improve the performance for a single job but at the expense of the cluster efficiency. In view of this, we propose Speculative Execution Performance Balancing (SEPB) to balance the performance tradeoff between a single job and a batch of jobs. Third, delay scheduling has shown to improve the data locality but at the cost of fairness. Alternatively, we propose a technique called Slot PreScheduling that can improve the data locality but with no impact on fairness. Finally, by combining these techniques together, we form a step-by-step slot allocation system called DynamicMR that can improve the performance of Hadoop MRv1 significantly while maintaining the fairness. (homepage) (TCC'14) (Cluster'13)
MROrder: In Hadoop MRv1, different job submission orders will bring significantly varied performance results. MROrder is an automated MapReduce job ordering optimizaton prototype system. It targets at the online MapReduce workloads where MapReduce jobs arrives over time for various perfomane metrics, such as makespan, total completion time. Users just need to input some simple arguments. For example, users need to designate the job ordering performance metric( e.g., makespan, total completion time). The MROrder then starts to perform job ordering optimization automatically for online MapReduce jobs, based on user's configuration. (homepage) (Euro-Par'13) (TSC'15)
EasyPDP: To tackle the growing volume of genomic data, an efficient and easily-programmed high performance computing system is needed. EasyPDP is a parallel dynamic programming runtime system and an abstract programming model for computational biology and scientific computing applications. As a runtime system, it handles low-level thread creating, mapping, resource management, and fault tolerance issues automatically regardless of the system characteristics or scale. As an programming model, it allows users to describe applications and specify concurrency from the high level, without concerns about the complex details of parallel programming. (homepage) (demo) (TPDS'12) (IPDPS Workshop'13) (IPDPS Workshop'12)
Open SourceAlmost all of my work is open source:
The MRYARN (Multi-Resource YARN for cloud computing) fair scheduler is now available at http://sourceforge.net/projects/mryarn/.
The LTYARN fair scheduler is now available at http://sourceforge.net/projects/ltyarn/.
The DynamicMR framework can be downloaded at http://sourceforge.net/projects/dynamicmr/, and its component Dynamic Hadoop Fair Scheduler (DHFS) can be found at sourceforge.
The MROrder prototype is open at http://sourceforge.net/projects/mrorder/.
The EasyPDP framework is available at hpclab.
OthersPersonal ActivitiesI had experiences on a number of programming contests since I was a undergraduate student. I participated in the ACM International Collegiate Programming Contest in 2005 and 2006, and won 'Golden Prize' in the 31th ACM/ICPC Asia Programming Contest VS Google Cup on Programming Invitational Tournament of National College Students.I've now stopped doing contests, but I still love algorithmic and mathematical problems.
During my master study in 2009, I had internship at IBM CRL over four months during the summer holiday, where it germinates my strong interest on research.
I enjoy reading, swimming, fitness and playing badminton.
Professional ActivitiesReviewer for
TPDS: 2013, 2014
TCC: 2014, 2015
TSC: 2014, 2015
VLDB: 2014, 2015
HPDC: 2014
PPOPP: 2013
SOCC: 2013
Interesting LinksHow to Have a Bad Career in Research/Academia, by Prof. David A. Patterson
The Most Common Mistakes, by Ben Yagoda
How to Read a Paper, by Prof. S. Keshav
Write a Rapid Prototype First, by Prof. Terence Tao
Last updated on April-13, 2015.
删除或更新信息,请邮件至freekaoyan#163.com(#换成@)
天津大学计算机科学与技术学院研究生导师简介-汤善江
天津大学 免费考研网/2016-02-03
相关话题/计算机
天津大学计算机科学与技术学院研究生导师简介-张高燕
Peng ZhangHomeAboutProjects PublicationsJournal ArticlesEdited ProceedingSelected Conference PapersTeachingWelcome to Peng Zhang's HomePagePeng Zhang ...天津大学师资导师 天津大学 免费考研网 2016-02-03天津大学计算机科学与技术学院研究生导师简介-于永新
Index of /faculty/yyxNameLast modifiedSizeDescriptionParent Directory - 数字电路/30-Aug-2012 05:02- 数据结构/30-Aug-2012 05:02- 计算机接口/ 30-Aug-2012 05:01- Apa ...天津大学师资导师 天津大学 免费考研网 2016-02-03天津大学计算机科学与技术学院研究生导师简介-宋雪雁
欢迎您访问宋雪雁的个人主页学历:博士职称:副教授,硕导单位:计算机科学与技术学院EMAIL: songxy@tju.edu.cn联系电话:**办公室:第25教学楼 B-1209室研究兴趣优化及搜索策略(Optimisation & Search Methodologies); 决策支持(Decisi ...天津大学师资导师 天津大学 免费考研网 2016-02-03天津大学计算机科学与技术学院研究生导师简介-王博
Bo Wang 王 博 AssociateProfessor of Computer ScienceDepartmentof Computer Science and TechnologyTianjinUniversity, Tianjin, ChinaEmail: bo.wang. ...天津大学师资导师 天津大学 免费考研网 2016-02-03天津大学计算机科学与技术学院研究生导师简介-冯伟
Wei FENG 个人主页发表文章 科研成果 教授课程个人简介 简历 天津大学计算机科学与技术学院副教授,教育部新世纪优秀人才,天大计算机学院图形图像与媒体计算实验室主任。研究领域 媒体计算、计算机视觉、模式识别,及其产业化应用。欢迎有志从事该领域研究及开发工作的同学加入我们实验室。欢迎有志从事该领 ...天津大学师资导师 天津大学 免费考研网 2016-02-03天津大学计算机科学与技术学院研究生导师简介-吴斌
个人简介: 从事通信领域研究工作近20年,主持和参与相关科研项目10余项。在IEEE Communication Surveys and Tutorials、IEEE Transactions on Networking、IEEE/OSA Journal of Light ...天津大学师资导师 天津大学 免费考研网 2016-02-03天津大学计算机科学与技术学院研究生导师简介-唐继军
简介唐继军博士,天津大学计算机学院特聘教授,博士生导师,天津市“千人计划”入选者。1994年获得天津大学海船系船舶工程学士,1996年获得天津大学海洋工程硕士,2004年获得美国新墨西哥大学计算机科学博士学位。从2004年起任职美国南卡罗莱纳大学工程和计算学院计算机系至终身教授。主要研究领域为生物信 ...天津大学师资导师 天津大学 免费考研网 2016-02-03天津大学计算机科学与技术学院研究生导师简介-胡清华
Qinghua HuPh.D. ProfessorwithSchool of Computer Science and Technology,Tianjin UniversityWeijin Road No. 92, Nankai District, Tianjin, P. R. ChinaEm ...天津大学师资导师 天津大学 免费考研网 2016-02-03天津大学计算机科学与技术学院介绍
发展历史 2006年1月天津大学根据学校事业发展和学科建设的需要,在原天津大学电子信息工程学院计算机科学与技术系、天津大学计算机基础教学部、天津大学计算机基础教学实验中心、天津大学- ...天津大学通知公告 天津大学 考研网 2016-02-02