Its potential for Grid and Cloud Computing Tan Tin Wee Dept of Biochemistry Yong Loo Lin School of...

Post on 17-Dec-2015

215 views 0 download

Tags:

Transcript of Its potential for Grid and Cloud Computing Tan Tin Wee Dept of Biochemistry Yong Loo Lin School of...

Desktop as a Service using Moka5+VMWare+BioSlax its potential for Grid and Cloud Computing

Tan Tin WeeDept of Biochemistry

Yong Loo Lin School of MedicineNational University of Singapore

Merging two trajectoriesLiveCD/DVD/USB/VM eScience and Grid

Computing APBioGrid : Globus Grid 1.1.6/7

– Ong Guan Sin (2002) APBioBox 1 and 2 with Sun’s

BioClusterGrid/BioBox and KOOPLite workflow integration

APBioKnoppix 1 and 2 LiveCD- Derek Kiong, Justin Choo (2003-2006)

BioSlax LiveCD- Mark de Silva and Lim Kuan Siong (2005-2009)

BioSlax LiveDVD, LiveUSB, VMplayer (2007-2009)

BioSlax Server (2008-2009)

BioSlax7.5-VMplayer-MokaFive (2009)

UDBlast-BioSlax-VMplayer-MokaFive (2009)

SINGAREN (1998) Telemanufacturing, eScience, remote

control of Synchrotron beam line (1999-2001)

NUS BioGrid and APBioGrid (2001-2002)

National Grid and LSVGC database (2004-2005)

BioManufacturing Grid (SERC Proof of Concept for Integrated Manufacturing and Services Systems (IMSS) grant) (2004-2006)- KOOPrime’s KOOPlatform- SIMTech and NTU- APSTC Sun Microsystems

TeraCampus Grid TCG@NUS (2005-2009)

1st World Wide Workflow Grid- Chris Baker (2008)

NUS BioGrid and APBioGrid 2001-2002

http://www.bic.nus.edu.sg/biogrid/http://www.startap.net/starlight/igrid2002/apbiogrid02.html

APBioBox 2002-2004

http://compaq.apbionet.org/grid/apbiobox/http://compaq.apbionet.org/grid/apbiobox/lsvgcApr04/index.shtml

National Grid Office and LS VGC

http://lsdb.bii.a-star.edu.sg/

APBioKnoppix (2004-2006)Bioinformatics in a LiveCD

http://compaq.apbionet.org/grid/apbioknoppix/

Used by 500 NUS students a year in Life Science Module LSM2104 from 2004 to 2007

Integrated Biolaboratory Manufacturing and Services System 2004-2005

A TSRP pilot project Group 9 Project 042-112-0020Integrated Manufacturing and Services Systems (IMSS)

For acceleration of specific areas of manufacturing, in particularbiomanufacturing.

www2.jp.apan.net/meetings/bangkok2005/presentation/tantinwee-APAN19-eScienceBKKJan05.ppt

1st WWWFG 2007

http://www.apbionet.org/wwwfg/programme.shtml

TeraCampusGrid TCG@NUS 2005-2009

Grid Apps on TCG@NUS Grid

http://www.nus.edu.sg/comcen/svu/services/gridapps.htm

UDBlast at TCG@NUS

http://srs1.bic.nus.edu.sg/udblast.htmlhttp://srs1.bic.nus.edu.sg/udblast_checkjob.htmlhttp://srs1.bic.nus.edu.sg/udblast_retrieve.html

http://conference.nie.edu.sg/2007/

Merged two trajectoriesLiveCD/DVD/USB/VM eScience and Grid

Computing APBioGrid : Globus Grid 1.1.6/7

– Ong Guan Sin (2002) APBioBox 1 and 2 with Sun’s

BioClusterGrid/BioBox and KOOPLite workflow integration

APBioKnoppix 1 and 2 LiveCD- Derek Kiong, Justin Choo (2003-2006)

BioSlax LiveCD- Mark de Silva and Lim Kuan Siong (2005-2009)

BioSlax LiveDVD, LiveUSB, VMplayer (2007-2009)

BioSlax Server (2008-2009)

BioSlax7.5-VMplayer-MokaFive (2009) UDBlast-BioSlax-VMplayer-MokaFive (2009)

SINGAREN (1998) Telemanufacturing, eScience, remote control of

Synchrotron beam line (1999-2001)

NUS BioGrid and APBioGrid (2001-2002) National Grid and LSVGC database (2004-2005)

BioManufacturing Grid (SERC Proof of Concept for Integrated Manufacturing and Services Systems (IMSS) grant) (2004-2006)- KOOPrime’s KOOPlatform- SIMTech and NTU- APSTC Sun Microsystems

TeraCampus Grid TCG@NUS (2005-2009) 1st World Wide Workflow Grid

- Chris Baker (2008)

Moka5+VMWare+BioSlaxOn TCG@NUS

www.BIOSLAX.com (2006-2009)

BioSlax with VMplayerBioinformatics software in a virtual machine

PackagesApplicationsProgramming languages – C, perl, python etcGUI interfacesServers – apache, mysql

No need to reboot, just install VMplayer, run BioSlax vmx file and create a new instance of a BioSlax virtual machine

Rapid and Easy Customisable Interface

Addable ModulesModules e.g.

newmodule.lzm Stored in

/mnt/sda1/modulesActivated during boot up

Compile –> dir2lzm -> copy into virtual HDD /mnt/sda1/module

Persistent Changes

changes=/mnt/sda1/changes

To change theboot up sequencefor starting any program

Add/etc/rc.d/rc.newprog

BioSlax 7.5Bootable with changesLiveCD, LiveDVD, LiveUSBBioSlax-VMplayer versionMokaFive version portable on Intel Machines

What has it/can it be used for?Desktop-as-a-ServiceUDGrid mpagent

ftp://sf01.bic.nus.edu.sg/incoming/bioslax/fixes/zz01_grid_mpagent.lzm

BioSlax with VMplayer+MokaFiveDownloadable from MokaFive serverMultiple instantiations ShoppableCacheable LivePCNo need for installationPortable on a USBRapid deployment – user initiated

BioSlax+VM+MokaFive: Desktop-as-a-Service

Desktop-as-a-Service

Grid-enabling BioSlax:UD MP agent in BioSlax

History – TCG@NUSUDBlast and UDHMMerRapid recruitment of machinesNo need for user installationPush or pull instancesEnduser joining of Grid

Joining a virtual machine to TCG@NUS

http://srs1.bic.nus.edu.sg/uddevlist.html

UD-BioSlax-VMware-Moka5How Desktops-as-a-Service can be used for

harnessing free cpu cycles quicklyEnd-users and members of the public can contribute“Contribute-to-Benefit” model – enduser can log in

to monitor their “credits” depending on how much CPU time they have contributed and how much they pay.

Creation of public participation grid computing and public recruitment of grid-enabled processors without interfering with Enduser’s machine.

Inter-Grid functionality – deploy instances of UD-BioSlax on multiple clouds

1. UD Mpagent updates require reinstallation.

2. Variation of the scheduler requires reinstallation

3. Endusers have to install the mpagent.

4. MPagent not portable

1. Moka5 can do push the updateAlternatively, BioSlax can do PXE boot

2. Endusers can run the agent whenever they want to

3. MPagent wrapped in BioSlax, played in Vmplayer, pushed by Moka5 is portable

Future WorkBioSlax LivePC pushed to cloudsEnd users can have BioSlax as a development

platform that will work exactly as the BioSlax pushed to the cloud.

Nutch-Hadoop configured on BioSlax and pushed onto Amazon EC2, for example

Bioinformatics datasets will be permanently pushed to EC2 as a common accessible database by data providers for end-users to compute