CS 553 - IIT-Computer Scienceiraicu/teaching/CS553-S13/CS553-flier.pdf · CS 553 Spring 2013...

1
Cloud Compung hp://www.cs.iit.edu/~iraicu/teaching/CS553-S13/ CS 553 Spring 2013 Monday/Wednesday, 11:25AM - 12:40PM, Life Sciences 121 Dr. Ioan Raicu Dr. Ioan Raicu is an assistant professor in the Department of Computer Science (CS) at Illinois Instute of Technology (IIT), as well as a guest research faculty in the Math and Computer Science Division (MCS) at Argonne Naonal Laboratory (ANL). He is also the founder (2011) and director of the Data -Intensive Distributed Systems Laboratory (DataSys) at IIT. He has received the presgious NSF CAREER award (2011 - 2015) for his innovave work on distributed file systems for exascale compung. He was a NSF/CRA Computaon Innovaon Fellow at Northwestern University in 2009 - 2010, and obtained his Ph.D. in Computer Science from University of Chicago under the guidance of Dr. Ian Foster in March 2009. He is a 3-year award winner of the GSRP Fellowship from NASA Ames Research Center. His research work and interests are in the general area of distributed systems. His work focuses on a relavely new paradigm of Many-Task Compung (MTC), which aims to bridge the gap between two predominant paradigms from distributed systems, High-Throughput Compung (HTC) and High-Performance Compung (HPC). His work has focused on defining and exploring both the theory and praccal aspects of realizing MTC across a wide range of large-scale distributed systems. He is parcularly interested in resource management in large scale distributed systems with a focus on many- task compung, data intensive compung, cloud compung, grid compung, and many-core compung. Over the past decade, he has co-authored over 50 peer reviewed arcles, book chapters, books, theses, and dissertaons, which received over 2100 citaons. His H-index is 19 , G-Index is 45, and E-Index is 37. His work has been funded by the NASA Ames Research Center, DOE Office of Advanced Scienfic Compung Research, the NSF/CRA CIFellows program, and the NSF CAREER program. He has also founded and chaired several workshops, such as ACM Workshop on Many-Task Compung on Grids and Supercomputers (MTAGS), the IEEE Int. Workshop on Data-Intensive Compung in the Clouds (DataCloud/ DataCloud-SC), and the ACM Workshop on Scienfic Cloud Compung (ScienceCloud). He is on the editorial board of the Springer Journal of Cloud Compung Advances, Systems and Applicaons (JoCCASA), as well as a guest editor for the IEEE Transacons on Parallel and Distributed Systems (TPDS), the Scienfic Programming Journal (SPJ), and the Journal of Grid Compung (JoGC). He has been leadership roles in several high profile conferences, such as HPDC, CCGrid, Grid, eScience, and ICAC. He is a member of the IEEE and ACM. More informaon can be found at hp://www.cs.iit.edu/~iraicu/, hp://datasys.cs.iit.edu/, or at hp://www.linkedin.com/in/ioanraicu. Overview Cloud Compung is “A large-scale distributed compung paradigm that is driven by economies of scale, in which a pool of abstracted, virtualized, dynamically-scalable, managed compung power, storage, plaorms, and services are delivered on demand to external customers over the Internet.” It has become a driving force for informaon technology over the past several years, and it is hinng at a future in which we won’t compute on local computers, but on centralized facilies operated by third-party compute and storage ulies. Governments, research instutes, and industry leaders are rushing to adopt Cloud Compung to solve their ever-increasing compung and storage problems arising in the Internet Age. There are three main factors contribung to the surge and interests in Cloud Compung: 1) rapid decrease in hardware cost and increase in compung power and storage capacity, and the advent of mul-core architecture and modern supercomputers consisng of hundreds of thousands of cores; 2) the exponenally growing data size in scienfic instrumentaon/simulaon and Internet publishing and archiving; and 3) the wide-spread adopon of Services Compung and Web 2.0 applicaons. This course is a tour through various topics and technologies related to Cloud Compung. We will explore soluons and learn design principles for building large network-based systems to support both compute and data intensive compung across geographically distributed infrastructure. Topics include resource management, programming models, applicaon models, system characterizaons, and implementaons. Our discussions will oſten be grounded in the context of deployed Cloud Compung systems, such as Amazon EC2 and S3, Microsoſt Azure, Google AppEngine, Eucalyptus, Nimbus, OpenStack, Google's MapReduce, Yahoo’s Hadoop, Microsoſt’s Dryad, Sphere/Sector, and many other systems. The course involves lectures, outside invited speakers, discussions of research papers, wrien assignments, and programming assignments. We will be using the textbook Distributed and Cloud Compung: Clusters, Grids, Clouds, and the Future Internet by Kai Hwang, Jack Dongarra & Geoffrey C. Fox. DETAILED COURSE TOPICS: Distributed System Models Virtualizaon Cloud Plaorm Architectures Amazon AWS Microsoſt Azure Google App Engine Google MapReduce / Yahoo Hadoop Eucalyptus, Nimbus, OpenStack Cloud Programming Grid Compung Peer-to-Peer Compung High-Performance Compung

Transcript of CS 553 - IIT-Computer Scienceiraicu/teaching/CS553-S13/CS553-flier.pdf · CS 553 Spring 2013...

Page 1: CS 553 - IIT-Computer Scienceiraicu/teaching/CS553-S13/CS553-flier.pdf · CS 553 Spring 2013 Monday/Wednesday, 11:25AM -12:40PM, Life Sciences 121 Dr. Ioan Raicu Dr. Ioan Raicu is

Cloud Computing http://www.cs.iit.edu/~iraicu/teaching/CS553-S13/

CS 553

Spring 2013 Monday/Wednesday, 11:25AM - 12:40PM, Life Sciences 121

Dr. Ioan Raicu

Dr. Ioan Raicu is an assistant professor in the Department of Computer Science (CS) at Illinois Institute of Technology (IIT), as well as a guest research faculty in the Math and Computer Science Division (MCS) at Argonne National Laboratory (ANL). He is also the founder (2011) and director of the Data-Intensive Distributed Systems Laboratory (DataSys) at IIT. He has received the prestigious NSF CAREER award (2011 - 2015) for his innovative work on distributed file systems for exascale computing. He was a NSF/CRA Computation Innovation Fellow at Northwestern University in 2009 - 2010, and obtained his Ph.D. in Computer Science from University of Chicago under the guidance of Dr. Ian Foster in March 2009. He is a 3-year award winner of the GSRP

Fellowship from NASA Ames Research Center. His research work and interests are in the general area of distributed systems. His work focuses on a relatively new paradigm of Many-Task Computing (MTC), which aims to bridge the gap between two predominant paradigms from distributed systems, High-Throughput Computing (HTC) and High-Performance Computing (HPC). His work has focused on defining and exploring both the theory and practical aspects of realizing MTC across a wide range of large-scale distributed systems. He is particularly interested in resource management in large scale distributed systems with a focus on many-task computing, data intensive computing, cloud computing, grid computing, and many-core computing. Over the past decade, he has co-authored over 50 peer reviewed articles, book chapters, books, theses, and dissertations, which received over 2100 citations. His H-index is 19 , G-Index is 45, and E-Index is 37. His work has been funded by the NASA Ames Research Center, DOE Office of Advanced Scientific Computing Research, the NSF/CRA CIFellows program, and the NSF CAREER program. He has also founded and chaired several workshops, such as ACM Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS), the IEEE Int. Workshop on Data-Intensive Computing in the Clouds (DataCloud/DataCloud-SC), and the ACM Workshop on Scientific Cloud Computing (ScienceCloud). He is on the editorial board of the Springer Journal of Cloud Computing Advances, Systems and Applications (JoCCASA), as well as a guest editor for the IEEE Transactions on Parallel and Distributed Systems (TPDS), the Scientific Programming Journal (SPJ), and the Journal of Grid Computing (JoGC). He has been leadership roles in several high profile conferences, such as HPDC, CCGrid, Grid, eScience, and ICAC. He is a member of the IEEE and ACM. More information can be found at http://www.cs.iit.edu/~iraicu/, http://datasys.cs.iit.edu/, or at http://www.linkedin.com/in/ioanraicu.

Overview Cloud Computing is “A large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualized, dynamically-scalable, managed computing power, storage, platforms, and services are delivered on demand to external customers over the Internet.” It has become a driving force for information technology over the past several years, and it is hinting at a future in which we won’t compute on local computers, but on centralized facilities operated by third-party compute and storage utilities. Governments, research institutes, and industry leaders are rushing to adopt Cloud Computing to solve their ever-increasing computing and storage problems arising in the Internet Age. There are three main factors contributing to the surge and interests in Cloud Computing: 1) rapid decrease in hardware cost and increase in computing power and storage capacity, and the advent of multi-core architecture and modern supercomputers consisting of hundreds of thousands of cores; 2) the exponentially growing data size in scientific instrumentation/simulation and Internet publishing and archiving; and 3) the wide-spread adoption of Services Computing and Web 2.0 applications. This course is a tour through various topics and technologies related to Cloud Computing. We will explore solutions and learn design principles for building large network-based systems to support both compute and data intensive computing across geographically distributed infrastructure. Topics include resource management, programming models, application models, system characterizations, and implementations. Our discussions will often be grounded in the context of deployed Cloud Computing systems, such as Amazon EC2 and S3, Microsoft Azure, Google AppEngine, Eucalyptus, Nimbus, OpenStack, Google's MapReduce, Yahoo’s Hadoop, Microsoft’s Dryad, Sphere/Sector, and many other systems. The course involves lectures, outside invited speakers, discussions of research papers, written assignments, and programming assignments. We will be using the textbook Distributed and Cloud Computing: Clusters, Grids, Clouds, and the Future Internet by Kai Hwang, Jack Dongarra & Geoffrey C. Fox.

DETAILED COURSE TOPICS: Distributed System Models Virtualization Cloud Platform Architectures

Amazon AWS Microsoft Azure Google App Engine Google MapReduce / Yahoo Hadoop Eucalyptus, Nimbus, OpenStack

Cloud Programming Grid Computing Peer-to-Peer Computing High-Performance Computing