Cluster computing

Post on 21-Dec-2014

632 views 3 download

Tags:

description

A short and complete overview of cluster computing.

Transcript of Cluster computing

POOJA KHATANA

BTBTI11354

Exam R.No:11394

B.Tech(IT)5th semester

Presented By:Divya AgarwalBTBTI11349Exam R.No:11364B.Tech(IT)5th semester

CLUSTER COMPUTING

DATE: 28 November 2013

ABSTRACTVery often applications need more computing power than a sequential computer can provide. One way of overcoming this limitation is to improve the operating speed of processors and other components so that they can offer the power required by computationally intensive applications. Even though this is currently possible to certain extent, future improvements are constrained by the speed of light, thermodynamic laws, and the high financial costs for processor fabrication. A viable and cost-effective alternative solution is to connect multiple processors together and coordinate their computational efforts. The resulting systems are popularly known as parallel computers, and they allow the sharing of a computational task among multiple processors.

CLUSTERS HAS ARRIVED

INTRODUCTION A computer cluster is a group of tightly coupled

computers that work together closely so that it can be viewed as a single

computer.

Clusters are commonly connected through fast local area networks.

Clusters have evolved to support applications ranging from

ecommerce, to high performance database applications.

Clusters are usually deployed to improve speed and/or reliability over that provided by a single computer, while typically being much more cost effective than single computer the of comparable speed or reliability.

CONTINUED

In cluster computing each node within a cluster is an independent system, with its own operating system, private memory, and, in some cases, its own file system. Because the processors on one node cannot directly access the memory on the other nodes, programs or software run on clusters usually employ a procedure called "message passing" to get data and execution code from one node to another.

Cluster computing can also be used as a relatively low-cost form of parallel processing for scientific and other applications that lend themselves to parallel operations.

HISTORY Customers invented clusters, as soon as they could

not fit all their work on one computer, or needed a backup.

The first commodity clustering product was ARCnet, developed by Datapoint in 1977.

The next product was VAXcluster, released by DEC in 1980’s.

Microsoft, Sun Microsystems, and other leading hardware and

software companies offer clustering packages.

ARCHITECTURE OF CLUSTER

A cluster is a type of parallel /distributed processing system ,which consists of a collection of interconnected stand-alone computers cooperatively working together a single , integrated computing resource.

A node:a single or multiprocessor system with memory, I/O facilities, &OSgenerally two or more computers (nodes) connected togetherin a single cabinet, or physically separated & connected via a LAN appear as a single system to users and applicationsprovide a cost-effective way to gain features and benefits

ARCHITECTURE OF CLUSTER

TYPE OF CLUSTERS

High Performance (HP) Clusters

Load Balancing Cluster

High Availability(HA) Clusters

HIGH PERFORMANCE CLUSTER

Start from 1994. Donald Becker of NASA assembled

this cluster. Also called Beowulf cluster. Applications like data mining,

simulations, parallel processing, weather modeling, etc.

HIGH PERFOMANCE CLUSTER

LOAD BALANCING CLUSTER PC cluster deliver load balancing

performance.

Commonly used with busy ftp(file transfer protocol) and web servers with large client base.

Large number of nodes to share load.

LOAD BALANCING CLUSTER

HIGH AVAILABILITY CLUSTER

Avoid single point of failure. This requires atleast two nodes - a

primary and a backup. Always with redundancy. Almost all load balancing cluster

are with HA capability.

HIGH AVAILABILITY CLUSTER

COMMODITY COMPONENTS OF CLUSTERThe components critical to the development of low cost clusters are:

1. Processors

2. Memory

3. Networking Components

4. Motherboards, busses, and other sub-systems

BENEFITS

Computer clusters offer a number of benefits over mainframe computers, including: 

1.Reduced Cost

2.Processing Power

3.Improved Network Technology

4.Scalability

5.Availability

BENEFITS OF COMPUTER CLUSTER

1. Reduced Cost:  The price of off-the-shelf consumer desktops has plummeted in recent years, and this drop in price has corresponded with a vast increase in their processing power and performance. The average desktop PC today is many times more powerful than the first mainframe computers. 

2. Processing Power :  The parallel processing power of a high-

performance cluster can, in many cases, prove more cost effective than a mainframe with similar power. This reduced price per unit of power enables enterprises to get a greater ROI from their IT budget.

CONTINUED

3. Improved Network Technology: Driving the development of computer clusters has been a vast improvement in the technology related to networking, along with a reduction in the price of such technology.  Computer clusters are typically connected via a single virtual local area network (VLAN), and the network treats each computer as a separate node. Information can be passed throughout these networks with very little lag, ensuring that data doesn’t bottleneck between nodes.

4. Scalability: Perhaps the greatest advantage of

computer clusters is

scalability they offer. While mainframe computers

have a fixed

processing capacity, computer clusters can be easily

expanded as

requirements change by adding additional nodes to

the network. 

5. Availability: When a mainframe computer fails, the

entire system fails.

However, if a node in a computer cluster fails, its

operations can be

simply transferred to another node within the cluster,

ensuring that

there is no interruption in service.

CHALLENGES

The cluster computing concept also poses three pressing research challenges:

A cluster should be a single computing resource and provide a single system image. This is in contrast to a distributed system where the nodes serve only as individual resources.

The supporting operating system and communication Mechanism must be efficient enough to remove the performance Bottlenecks.

CONTINUED

It must provide scalability by letting the system scale up or down. The scaled-up system should provide more functionality or better performance. The system’s total computing power should increase proportionally to the increase in resources. The main motivation for a scalable system is to provide a flexible, cost effective Information-processing tool.

ISSUES TO BE CONISDERED

Cluster NetworkingCluster SoftwareProgrammingTimingNetwork SelectionSpeed Selection

CONTINUED

Cluster networkingIf you are mixing hardware that has different networking technologies, there will be large differences in the speed with which data will be accessed and how individual nodes can communicate. If it is in your budget make sure that all of the machines you want to include in your cluster have similar networking capabilities, and if at all possible, have network adapters from the same manufacturer.

Cluster SoftwareYou will have to build versions of clustering software for each kind of system you include in your cluster.

CONTINUED

TimingThis is the most problematic aspect of cluster. Since these machines have different performance profile our code will execute at different rates on the different kinds of nodes. This can cause serious bottlenecks if a process on one node is waiting for results of a calculation on a slower node.

ProgrammingOur code will have to be written to support the lowest common denominator for data types supported by the least powerful node in our cluster. With mixed machines, the more powerful machines will have attributes that cannot be attained in the powerful machine.

CONTINUEDNetwork SelectionThere are a number of different kinds of network topologies, including buses, cubes of various degrees, and grids/meshes. These network topologies will be implemented by use of one or more network interface cards, or NICs, installed into the head-node and compute nodes of our cluster.

Speed SelectionNo matter what topology you choose for your cluster, you will want to get fastest network that your budget allows. Fortunately, the availability of high speed computers has also forced the development of high speed networking systems. Examples are : 10Mbit Ethernet, 100Mbit Ethernet, gigabit networking, channel bonding etc.

APPLICATIONS

Internet Applications Web serving / searching (Google Search

Engine) ASPs (application service providers) eMail, eChat, ePhone, eBook, eCommerce

,eBank

Business Applications E-commerce Applications (Amazon.com,

eBay.com) Database Applications (Oracle on cluster)

FUTURE OF CLUSTER

Cluster computing is no longer something which people see in science fiction movies. This system of networking between computers is not only necessary to continue to help business become more efficient, but it also helps programs run more quickly while also allowing people across the network cluster to access the same programs, files, and information. It makes sense to do this, doesn’t it? And since people like to communicate in business, it makes sense that you cluster computing is only going to become more popular as the economy rebounds and businesses grow even more.

CONCLUSION

Clusters are promisingSolve parallel processing paradox.New trends in hardware and software technologies are likely to make clusters. Clusters based supercomputers (Linux based clusters) can been everywhere !! Clusters offer incremental growth and match with funding pattern.

REFERENCES

http://academic.csuohio.edu/yuc/hpc00/lect/chapter-B1.pdfhttp://www.buyya.com/cluster/http://www.gridrepublic.org/?gclid=CLPnt4-ThbsCFWIC4godynsArghttp://www.springer.com/computer/communication+networks/journal/10586http://www.clustercomp.org/http://compnetworking.about.com/od/networkdesign/l/aa041600a.htmhttp://www.cloudbus.org/papers/http://research.amnh.org/scicomp/amnh-computer-clusteric_cluster.pdfhttp://www.buyya.com/csc433/ClusterApps.pdfhttp://www.davidchappell.com/writing/white_papers/Cluster_Computing_Today,_v1.1--Chappell.pdfhttp://www.codeproject.com/Articles/11709/Cluster-Computing

Any Questions??