Cluster computing

32
POOJA KHATANA BTBTI11354 Exam R.No:11394 B.Tech(IT)5 th semester Presented By: Divya Agarwal BTBTI11349 Exam R.No:11364 B.Tech(IT)5 th semester CLUSTER COMPUTING DATE: 28 November 2013

description

A short and complete overview of cluster computing.

Transcript of Cluster computing

Page 1: Cluster computing

POOJA KHATANA

BTBTI11354

Exam R.No:11394

B.Tech(IT)5th semester

Presented By:Divya AgarwalBTBTI11349Exam R.No:11364B.Tech(IT)5th semester

CLUSTER COMPUTING

DATE: 28 November 2013

Page 2: Cluster computing

ABSTRACTVery often applications need more computing power than a sequential computer can provide. One way of overcoming this limitation is to improve the operating speed of processors and other components so that they can offer the power required by computationally intensive applications. Even though this is currently possible to certain extent, future improvements are constrained by the speed of light, thermodynamic laws, and the high financial costs for processor fabrication. A viable and cost-effective alternative solution is to connect multiple processors together and coordinate their computational efforts. The resulting systems are popularly known as parallel computers, and they allow the sharing of a computational task among multiple processors.

Page 3: Cluster computing

CLUSTERS HAS ARRIVED

Page 4: Cluster computing

INTRODUCTION A computer cluster is a group of tightly coupled

computers that work together closely so that it can be viewed as a single

computer.

Clusters are commonly connected through fast local area networks.

Clusters have evolved to support applications ranging from

ecommerce, to high performance database applications.

Clusters are usually deployed to improve speed and/or reliability over that provided by a single computer, while typically being much more cost effective than single computer the of comparable speed or reliability.

Page 5: Cluster computing

CONTINUED

In cluster computing each node within a cluster is an independent system, with its own operating system, private memory, and, in some cases, its own file system. Because the processors on one node cannot directly access the memory on the other nodes, programs or software run on clusters usually employ a procedure called "message passing" to get data and execution code from one node to another.

Cluster computing can also be used as a relatively low-cost form of parallel processing for scientific and other applications that lend themselves to parallel operations.

Page 6: Cluster computing

HISTORY Customers invented clusters, as soon as they could

not fit all their work on one computer, or needed a backup.

The first commodity clustering product was ARCnet, developed by Datapoint in 1977.

The next product was VAXcluster, released by DEC in 1980’s.

Microsoft, Sun Microsystems, and other leading hardware and

software companies offer clustering packages.

Page 7: Cluster computing

ARCHITECTURE OF CLUSTER

A cluster is a type of parallel /distributed processing system ,which consists of a collection of interconnected stand-alone computers cooperatively working together a single , integrated computing resource.

A node:a single or multiprocessor system with memory, I/O facilities, &OSgenerally two or more computers (nodes) connected togetherin a single cabinet, or physically separated & connected via a LAN appear as a single system to users and applicationsprovide a cost-effective way to gain features and benefits

Page 8: Cluster computing

ARCHITECTURE OF CLUSTER

Page 9: Cluster computing

TYPE OF CLUSTERS

High Performance (HP) Clusters

Load Balancing Cluster

High Availability(HA) Clusters

Page 10: Cluster computing

HIGH PERFORMANCE CLUSTER

Start from 1994. Donald Becker of NASA assembled

this cluster. Also called Beowulf cluster. Applications like data mining,

simulations, parallel processing, weather modeling, etc.

Page 11: Cluster computing

HIGH PERFOMANCE CLUSTER

Page 12: Cluster computing

LOAD BALANCING CLUSTER PC cluster deliver load balancing

performance.

Commonly used with busy ftp(file transfer protocol) and web servers with large client base.

Large number of nodes to share load.

Page 13: Cluster computing

LOAD BALANCING CLUSTER

Page 14: Cluster computing

HIGH AVAILABILITY CLUSTER

Avoid single point of failure. This requires atleast two nodes - a

primary and a backup. Always with redundancy. Almost all load balancing cluster

are with HA capability.

Page 15: Cluster computing

HIGH AVAILABILITY CLUSTER

Page 16: Cluster computing

COMMODITY COMPONENTS OF CLUSTERThe components critical to the development of low cost clusters are:

1. Processors

2. Memory

3. Networking Components

4. Motherboards, busses, and other sub-systems

Page 17: Cluster computing

BENEFITS

Computer clusters offer a number of benefits over mainframe computers, including: 

1.Reduced Cost

2.Processing Power

3.Improved Network Technology

4.Scalability

5.Availability

Page 18: Cluster computing

BENEFITS OF COMPUTER CLUSTER

1. Reduced Cost:  The price of off-the-shelf consumer desktops has plummeted in recent years, and this drop in price has corresponded with a vast increase in their processing power and performance. The average desktop PC today is many times more powerful than the first mainframe computers. 

2. Processing Power :  The parallel processing power of a high-

performance cluster can, in many cases, prove more cost effective than a mainframe with similar power. This reduced price per unit of power enables enterprises to get a greater ROI from their IT budget.

Page 19: Cluster computing

CONTINUED

3. Improved Network Technology: Driving the development of computer clusters has been a vast improvement in the technology related to networking, along with a reduction in the price of such technology.  Computer clusters are typically connected via a single virtual local area network (VLAN), and the network treats each computer as a separate node. Information can be passed throughout these networks with very little lag, ensuring that data doesn’t bottleneck between nodes.

Page 20: Cluster computing

4. Scalability: Perhaps the greatest advantage of

computer clusters is

scalability they offer. While mainframe computers

have a fixed

processing capacity, computer clusters can be easily

expanded as

requirements change by adding additional nodes to

the network. 

5. Availability: When a mainframe computer fails, the

entire system fails.

However, if a node in a computer cluster fails, its

operations can be

simply transferred to another node within the cluster,

ensuring that

there is no interruption in service.

Page 21: Cluster computing

CHALLENGES

The cluster computing concept also poses three pressing research challenges:

A cluster should be a single computing resource and provide a single system image. This is in contrast to a distributed system where the nodes serve only as individual resources.

The supporting operating system and communication Mechanism must be efficient enough to remove the performance Bottlenecks.

Page 22: Cluster computing

CONTINUED

It must provide scalability by letting the system scale up or down. The scaled-up system should provide more functionality or better performance. The system’s total computing power should increase proportionally to the increase in resources. The main motivation for a scalable system is to provide a flexible, cost effective Information-processing tool.

Page 23: Cluster computing

ISSUES TO BE CONISDERED

Cluster NetworkingCluster SoftwareProgrammingTimingNetwork SelectionSpeed Selection

Page 24: Cluster computing

CONTINUED

Cluster networkingIf you are mixing hardware that has different networking technologies, there will be large differences in the speed with which data will be accessed and how individual nodes can communicate. If it is in your budget make sure that all of the machines you want to include in your cluster have similar networking capabilities, and if at all possible, have network adapters from the same manufacturer.

Cluster SoftwareYou will have to build versions of clustering software for each kind of system you include in your cluster.

Page 25: Cluster computing

CONTINUED

TimingThis is the most problematic aspect of cluster. Since these machines have different performance profile our code will execute at different rates on the different kinds of nodes. This can cause serious bottlenecks if a process on one node is waiting for results of a calculation on a slower node.

ProgrammingOur code will have to be written to support the lowest common denominator for data types supported by the least powerful node in our cluster. With mixed machines, the more powerful machines will have attributes that cannot be attained in the powerful machine.

Page 26: Cluster computing

CONTINUEDNetwork SelectionThere are a number of different kinds of network topologies, including buses, cubes of various degrees, and grids/meshes. These network topologies will be implemented by use of one or more network interface cards, or NICs, installed into the head-node and compute nodes of our cluster.

Speed SelectionNo matter what topology you choose for your cluster, you will want to get fastest network that your budget allows. Fortunately, the availability of high speed computers has also forced the development of high speed networking systems. Examples are : 10Mbit Ethernet, 100Mbit Ethernet, gigabit networking, channel bonding etc.

Page 27: Cluster computing

APPLICATIONS

Internet Applications Web serving / searching (Google Search

Engine) ASPs (application service providers) eMail, eChat, ePhone, eBook, eCommerce

,eBank

Business Applications E-commerce Applications (Amazon.com,

eBay.com) Database Applications (Oracle on cluster)

Page 28: Cluster computing

FUTURE OF CLUSTER

Cluster computing is no longer something which people see in science fiction movies. This system of networking between computers is not only necessary to continue to help business become more efficient, but it also helps programs run more quickly while also allowing people across the network cluster to access the same programs, files, and information. It makes sense to do this, doesn’t it? And since people like to communicate in business, it makes sense that you cluster computing is only going to become more popular as the economy rebounds and businesses grow even more.

Page 29: Cluster computing

CONCLUSION

Clusters are promisingSolve parallel processing paradox.New trends in hardware and software technologies are likely to make clusters. Clusters based supercomputers (Linux based clusters) can been everywhere !! Clusters offer incremental growth and match with funding pattern.

Page 30: Cluster computing

REFERENCES

http://academic.csuohio.edu/yuc/hpc00/lect/chapter-B1.pdfhttp://www.buyya.com/cluster/http://www.gridrepublic.org/?gclid=CLPnt4-ThbsCFWIC4godynsArghttp://www.springer.com/computer/communication+networks/journal/10586http://www.clustercomp.org/http://compnetworking.about.com/od/networkdesign/l/aa041600a.htmhttp://www.cloudbus.org/papers/http://research.amnh.org/scicomp/amnh-computer-clusteric_cluster.pdfhttp://www.buyya.com/csc433/ClusterApps.pdfhttp://www.davidchappell.com/writing/white_papers/Cluster_Computing_Today,_v1.1--Chappell.pdfhttp://www.codeproject.com/Articles/11709/Cluster-Computing

Page 31: Cluster computing
Page 32: Cluster computing

Any Questions??