Post on 05-Jan-2016
description
Scientific Collaboration Cyberspace
TieJian Luo Ph.D.
tjluo@gucas.ac.cn
IWCG2006
Agenda
Motivation Modeling Collaboration The Challenges for e-scientists Five Pilot Projects Lessons learned
New Science Paradigms Thousand years ago:
Experimental Science - description of natural phenomena
Last few hundred years: Theoretical Science - Newton’s Laws, Maxwell’s Equations …
Last few decades: Computational Science - simulation of complex phenomena
Today: e-Science or Data-centric Science - unify theory, experiment, and simulation - using data exploration and data mining Data captured by instruments Data generated by simulations Processed by software Scientist analyzes databases/files(With thanks to Jim Gray)
2
22.
3
4
a
cG
a
a
2
22.
3
4
a
cG
a
a
Scientific Computing Research Paradigm
extract information
data
simulation
Data mining
deduce
model
idea
Scientific computing
result
Verify model( 1 ) problem model
( 2 ) experiment, data collectio
n
( 3 ) get computing resource
( 4 ) computing, verify model
Deduce natural law( 1 ) identify problem domain( 2 ) experiment, data collection ( 3 ) get computing resource( 4 ) analyze data , deduce law
The Problem for the e-Scientist
Data ingest Managing a petabyte Common schema How to organize it? How to reorganize it? How to coexist & cooperate with
others?
Data Query and Visualization tools Support/training Performance
Execute queries in a minute Batch (big) query scheduling
Experiments &Instruments
Simulationsfacts
facts
answers
questions
?Literature
Other Archives facts
facts
IWCG2006
Scientific Data Life Cycle
Data Acquisition Data Ingest Metadata Annotation Provenance
Data Storage Data Cleansing Data Mining Curation Preservation
IWCG2006
What is a Scientific Collaboration ?
Definition: two or more people work together to create or achieve the same thing. Inter-discipline guys Teamwork Agreement Common interests Divide task into several works Discuss problems, use instruments and share information Goal: create new knowledge
The Problem is how to make this things happen? Solution: Human Cooperation + Resource Share
IWCG2006
Basic Collaboration Model---G.Olson
People to People Communication Groupware Service
Access to Facilities Interaction with the Physical
World Access to Instrument online
Access to Information Digital Libraries, E-Pub Search Service
The concept back from 1989
National Collaboratories ---Applying IT for Scientific Research , NAP, 1993
IWCG2006
Current collaboration technology Electronic communication tools send messages, files, data, or documents between people and hence facil
itate the sharing of information. e-mail faxing voice mail Web publishing
Electronic conferencing tools also facilitate the sharing of information, but in a more interactive way. data conferencing — networked PCs share a common "whiteboard" that each user can modify voice conferencing — telephones allow users to interact video conferencing (and audio conferencing) — networked PCs share video or audio signals Internet forums (also known as message boards or discussion boards) — a virtual discussion platform to facilitate an
d manage online text messages chat rooms — a virtual discussion platform to facilitate and manage real-time text messages electronic meeting systems (EMS) — a conferencing system built into a room. The special purpose room will usually c
ontain a large screen projector interlinked with numerous PCs. Collaborative management tools facilitate and manage group activities.
electronic calendars (also called time management software) — schedule events and automatically notify and remind group members
project management systems — schedule, track, and chart the steps in a project as it is being completed workflow systems — collaborative management of tasks and documents within a knowledge-based business process knowledge management systems — collect, organize, manage, and share various forms of information extranet systems (sometimes also known as 'project extranets') — collect, organize, manage and share information a
ssociated with the delivery of a project (eg: the construction of a building) social software systems — organize social relations of groups
The Services to Enable Scientific Collaboration
3 Cyberspaces 10 Service functions
Virtual Team Team management
Task coordination (workflow)
Project management
Trust Metrics for collaboration
Resource and Interaction Online communication
Access to instrument
Semantic to scientific data
Information and Knowledge Publication and Search
Knowledge space
Experts community
Case Study : China State Key Labs
Area of Study Labs
Chemistry 22
Math and Physics 15
Geognosy 18
Biology 38
Information 26
Material 18
Engineering 25
Users more than 10k Large instruments more than 6k
Physical Resource and Virtual Team
people
deviceKnow how
data
SKL A
Info
people
deviceKnow how
data
SKL B
info
people
deviceKnow how
data
SKL D
Info
people
deviceKnow how
data
SKL C
Info
Virtual team C
Virtual team A
Virtual team B
Scie
ntifi
c Colla
bora
tion
Cybersp
ace
Che
Network Infrastructure
LDAPACLAuth SSL
Task
Know How Expert
Network com
Team
Instrument Acc
Pub/Search
Project Metrics
Semantic Data
Phy Math
……
InstrumentsInstruments Data ResourceData ResourceComputingComputing Instrumen
tInstrument
Archie Archie DataData Multimedia Multimedia Raw data Raw data Physical
Network
Services
Collaboration ‘game rules’
SitesResources JointManagement
Mat EngBio Geo
……Services
Domain
IWCG2006
Modeling Elements
Participant
Management
Institute
Affiliate org
E-community
{ Action Des }
{ name }
Instrument
Data Storage
Behavior
According
constraint
Info flow
Capital flow
Data Flow
Communicate
Management
Collaboration cyberspace
ObjectRelationshipEntity
Behavior
Container
Knowledge create and distribute Model
Collaboration Cyberspace
Labs
Outside labs
Buy ruleBuy rule
Agreement (1,n)(1,m)
3
Extra members
1.contribution 2.get Knowledge 3.oversee
21
1 2
(1,p)(1,q)
Members
22
ParticipantManagement Agency
Affiliate org
E-community
Instrument
Data storage
Platform
Behavior
ConstraintsInfo flowCapital flow
Data Flow
Communicate
Management
{活动描述}
{规则名称}
LegendEnsure a fair game !
SCC Web Architecture
Benefits Adaptable
More than 10 different science subjects templates
Scalability Dynamic growing VO does not affect platform performance
ExtensibilityEasy plug in the a new service to the platform
SCC web site http://co-lab.chinalab.gov.cn/
SCS
user
Org.A
Org.B Org.C Org.D Org.E
Org.F
Portal model for accessing autonomic resource Benefits:
Trust access path Single sign-on Delegate permits and Proxy Interoperate
IWCG2006
Video and audio interactive component
Benefits1. Multi node access2. Plug in SCC3. Security and efficiency
Benefits1. Multi node access2. Plug in SCC3. Security and efficiency
IWCG2006
Data Structure
Attribute DB
SpaceDB
Soil Property DB Soil Carbon DBSoil Environment
DBSoil Type Graph
Soil Carbon Type Distribute
Graph
Terrain 、 Vegetation 、 Precipita
tion
Country Graph
Province Graph
County Graph
Typical Area
Graph
1:400 1:100 1:50 1:10 1:5 1:1
Soil Carbon Recycle Data Schema
Project1 : Soil Carbon recycle mechanism database
A lot of UnitsDistribute data across mainland of ChinaMore than 1K scientists
IWCG2006
Soil Carbon Recycle collaboration cyberspace
IWCG2006
Project 2 : FACE ( Free Air CO2 Enrichment )
Find out the mechanism for the rice growing when the CO2 climate change
15 Collaboration org. , 9 domestic orgs( 3SKL ), 7 oversea country, 100 research staff.
Invest 100m RMB, only one in China 30 science topics
IWCG2006
FACE scientists Federal Agricultural Research Centre (Germany) National Institute of Agro-Environmental Sciences (Japan) North Carolin
a State University Tohoku National Agricultural Experiment Station (Japan) U.S. Water Conservation Laboratory University of Oklahoma
大气边界层物理与大气化学国家重点实验室 ( 大气物理所 ) 作物遗传与种质创新国家重点实验室 ( 吉林农业大学南京农业大学 ) 土壤与农业可持续发展国家重点实验室 ( 南京土壤研究所 ) 上海植物生理生态研究所 沈阳农业大学 沈阳应用生态研究所 扬州大学 北京教育出版社
IWCG2006
Contribution to the FACE community
• Monitor the farm site by video
• Automatic upload the raw data
• Real time display control pane by Browser
IWCG2006
FACE Project deployment
IWCG2006
FACE website
New 10 service for collaboration 2 years runtime
Old Only one function Info pub
Neutron emission facility
Remote monitor in Browser
1.Protect the staff from radiation2.Monitor the experiment process3.Inter-discipline scientists
Project3 : Neutron diffraction online experiment and data sharing
·Å ÖÃʵÑéÑùÆ·Set s am p le
ºË ·´ Ó¦ ¶Ñn u clear react o r
ÑùÆ·Ðýת ÒÇs am p le circu m gy rat e d ev ice
ÖÐ×ÓÊøB eam o f n eu t ro n
Æ×ÒÇp ed igree d ev ice
Êý ¾Ý²É ¼¯
Êý ¾Ý²É ¼¯
Êý ¾Ý²É ¼¯
Êý ¾Ý²É ¼¯
ʵÑéÔËÐмà¿Ø»ú co m p u t er fo r m o n it o r exp erim en t s t at u s
ʵ Ñé ÊÒ Lab
Ô¶³Ì ¼à¿Ø·þ Îñ Æ÷M o n it o r Serv er
Éã Ïñ Í·cam era
ʵÑé ¹ý ³Ì ¼à¿Ø»ú co m p u t er fo r m o n it o r exp erim en t p ro ces s
ʵÑé¼à¿Ø´ú Àí »úm o n it o r p ro xy
In t ern et
.......
¼à¿ØʵÑéÔËÐÐm o n it o r exp erim en t s t at u s
¼à¿ØʵÑé¹ý ³Ìm o n it o r exp erim en t p ro ces s
¼à¿ØʵÑéÔËÐÐm o n it o r exp erim en t s t at u s
Benefit 1 staff are isolated from experiment site 2 remote real time monitor the process 3 scientists online discuss
Neutron reactor remote access deployment
Data analysis software
Neutron diffraction remote control interface in Browser
IWCG2006
Project 4: Collaboration for BSL3 Labs (http://cl
b.gucas.ac.cn)
IWCG2006
IWCG2006
BSL3 setting
办公区
生物安全柜
倒置显微镜
监控视频服务器
麦克风
控制输入
摄像机带云台的摄像头
实验室 控制及演播室
交换机
C CC
操控服务器
视频服务器
2M带宽到互联网
防火墙
针眼摄像头
IWCG2006
Real time monitor BSL3 Labs of FUDAN University
IWCG2006
Project 5: Collaboration for Experiments Centers (http://cec.gucas.ac.cn)
IWCG2006
800MHz 核磁共振谱仪[experiment statues
The PHI 700 Field Emission Scanning Auger NanoprobeGet data
PHI700 对应工作站
Trio MRI 磁共振脑成像系统[ 北京磁共振脑成像中心 ]
Trio MRI 配套工作站Transfer data
IWCG2006
Implementation
IWCG2006
Large scientific instruments Centers
Instruments running statistics
IWCG2006
Lessons from SCC development
How to address the user’s application simpler? System development is less and less about coding
than about using things and gluing them together. Create a flexible enough architecture to allow for
changes. Customers will not be able to elucidate at the start
what they want; only by using the system will they be able to tell you what you should have done.
SSC 1.0 rely on MS SharePoint ; SSC 2.0 open source Future concerns should be e-community trust metrics.
IWCG2006
Acknowledge
Dec.2005--Dec.2007, China Bio-Safe Level 3 Labs Collaborative Cyberspace, Grant from MOST of China
Dec.2005--Dec.2007, China Large Scientific Instruments Collaborative Cyberspace, Grant from MOST of China
Jul.2005--Jul.2008, EU-Asia Link Programme HPC-Grid Computing Course Model, Grant from EU
IWCG2006
Thanks