2002 BIOi Final Project: A Distributed DNA Search Database System
Motivation
• Bioinformatics will be an important field in the coming century.
• DNA sequence queries and comparisons are becoming increasingly common.
• Because DNA sequences are long, each DNA query takes a long time.
Goal
• Shorten search time by building a stable and scalable platform for distributed computing.
• Provide a simple graphical user interface for DNA search users.
• Support dynamic joining of new hosts.
Architecture
[Figure: architecture diagram of the end user, coordinators, and hosts]
System Roles
• End user
• Web portal
• Coordinator
• Host
End User’s View
• Submit a correctly formatted DNA query.
• Wait while the search runs.
• Receive the result pages.
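The slides do not say what makes a query correctly formatted. As a minimal sketch, assume a query is a non-empty string over the four nucleotide letters A, C, G, and T, compared case-insensitively with surrounding whitespace ignored. The names `VALID_BASES`, `normalize_query`, and `is_valid_dna_query` are hypothetical, and Python is used here only for brevity; the project itself is built on Java.

```python
# Assumed alphabet: the slides do not define the accepted query format.
VALID_BASES = frozenset("ACGT")


def normalize_query(raw: str) -> str:
    """Strip surrounding whitespace and upper-case the query."""
    return raw.strip().upper()


def is_valid_dna_query(raw: str) -> bool:
    """Return True if the query is non-empty and uses only A, C, G, T."""
    query = normalize_query(raw)
    return len(query) > 0 and all(base in VALID_BASES for base in query)
```

A web portal could run a check like this before handing the query to a coordinator, so that malformed input is rejected without using any host time.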
Coordinator’s View
[Figure: diagram of a coordinator, other coordinators, and hosts]
• Accept an end user’s query to search a range of the database.
• Allocate a sub-range of the database to each host in its group.
• Communicate with the coordinators of other groups.
• Return its group’s result to the requesting end user.
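The allocation step can be illustrated with a short sketch. It assumes the database range is a half-open interval of integer record indices and that the coordinator splits it into contiguous, near-equal sub-ranges, one per host. The slides do not describe the actual allocation policy, so the function name `allocate_subranges` and the even split are assumptions. Python is used for brevity; the project itself uses Java RMI.

```python
from typing import Dict, List, Tuple


def allocate_subranges(
    start: int, end: int, hosts: List[str]
) -> Dict[str, Tuple[int, int]]:
    """Split the half-open range [start, end) into one contiguous
    sub-range per host, with sizes differing by at most one record."""
    if not hosts:
        raise ValueError("a group needs at least one host")
    if end < start:
        raise ValueError("end must not be smaller than start")

    base, extra = divmod(end - start, len(hosts))
    allocation: Dict[str, Tuple[int, int]] = {}
    cursor = start
    for index, host in enumerate(hosts):
        # The first `extra` hosts take one additional record each.
        size = base + (1 if index < extra else 0)
        allocation[host] = (cursor, cursor + size)
        cursor += size
    return allocation
```

For example, `allocate_subranges(0, 10, ["h1", "h2", "h3"])` gives `h1` the records 0 to 3, `h2` the records 4 to 6, and `h3` the records 7 to 9.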
Host’s View
[Figure: diagram of a coordinator and its hosts]
• Each host belongs to a group and maintains a database partition.
• For each query, a host accepts an allocation from its group coordinator and searches the assigned range of the DNA database.
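A host's share of a query might look like the sketch below. It assumes a partition maps integer record ids to sequence strings and that the assigned range is a half-open interval of record ids. Exact substring matching stands in for the project's DNA comparison, which the slides do not specify. The name `search_partition` is hypothetical, and Python is used for brevity rather than the project's Java.

```python
from typing import Dict, List, Tuple


def search_partition(
    partition: Dict[int, str], query: str, assigned: Tuple[int, int]
) -> List[Tuple[int, int]]:
    """Return (record_id, offset) for every exact occurrence of `query`
    in records whose id lies in the half-open range `assigned`."""
    if not query:
        raise ValueError("query must not be empty")

    low, high = assigned
    hits: List[Tuple[int, int]] = []
    for record_id in sorted(partition):
        if not low <= record_id < high:
            continue  # record is outside this host's allocation
        sequence = partition[record_id]
        offset = sequence.find(query)
        while offset != -1:
            hits.append((record_id, offset))
            # Advance by one so overlapping occurrences are also found.
            offset = sequence.find(query, offset + 1)
    return hits
```

Each host returns its list of hits to the group coordinator, which can concatenate the lists from all hosts into the group result.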
Single Host’s Job
1. Search one part of the database.
2. Share the alignment workload.
Communication Protocol
1. Coordinating Module
2. Job Communicating Module
3. Query Processing Module
4. Database Module
5. Status Communicating Module
Robustness
• Fault Tolerance
  – Fail-safe mechanism for handling failures of both coordinators and hosts.
  – Tolerates a fixed number of failed hosts.
• Recovery
  – Recover the previous state of crashed coordinators and hosts.
  – Jobs of failed hosts can be taken over by other hosts.
• Scalability
  – New hosts can join the system to add computing power.
  – For local area networks only.
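The takeover of a failed host's job can be sketched as follows. The sketch assumes the coordinator tracks each host's assigned ranges as half-open intervals, that a configured limit `max_failed` expresses the fixed number of tolerated failures, and that each orphaned range goes to the surviving host with the least assigned work. None of these details appear in the slides; `reassign_failed_jobs` is a hypothetical name, and Python is used for brevity instead of the project's Java.

```python
from typing import Dict, List, Set, Tuple

Range = Tuple[int, int]  # half-open interval of record indices


def reassign_failed_jobs(
    jobs: Dict[str, List[Range]], failed: Set[str], max_failed: int
) -> Dict[str, List[Range]]:
    """Return a new assignment in which every range held by a failed
    host has been moved to the least-loaded surviving host."""
    failed_hosts = [host for host in jobs if host in failed]
    if len(failed_hosts) > max_failed:
        raise RuntimeError("more hosts failed than the system tolerates")

    survivors = {h: list(r) for h, r in jobs.items() if h not in failed}
    if not survivors:
        raise RuntimeError("no surviving host can take over the jobs")

    def load(host: str) -> int:
        # Total number of records currently assigned to this survivor.
        return sum(end - start for start, end in survivors[host])

    for host in failed_hosts:
        for orphan in jobs[host]:
            # Ties are broken by host name to keep the result deterministic.
            target = min(survivors, key=lambda h: (load(h), h))
            survivors[target].append(orphan)
    return survivors
```

With jobs `{"h1": [(0, 4)], "h2": [(4, 7)], "h3": [(7, 10)]}` and `h1` failed, the range `(0, 4)` moves to `h2`, since `h2` and `h3` carry equal load and the tie is broken by name.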
Implementation
• Pure Java runtime environment
• Web site: Apache server + JSP
• Java RMI
• MySQL™
Potential Applications
• Mathematics
• Message encryption and decryption
• Distributed data mining
Implementation Phase
• Database Module
  – MySQL setup
  – DNA sequence data collection
• Job Module
  – Job assignment
• Query Module
  – DNA comparison implementation
• Status Module
  – Handle host failure exception
  – Handle coordinator failure exception
• Coordinator Module
  – Get global and group view
• Web Application Module
  – Design Web interface
  – GUI application interface
  – Real-time response interface
Project participants
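• 蔡景祥 Chin Hseung Tsai – B88502119, Information Management, 4th-year undergraduate
• 徐蔚倫 Wei Lun Hsu – B87801009, Public Health, 4th-year undergraduate
• 張均合 Chun Ho Chang – R91725026, Graduate Institute of Information Management, 1st-year master's student
• 羅文興 Wen Hsin Lo – R91725034, Graduate Institute of Information Management, 1st-year master's student
• 劉智雄 Chi Hsiung Liu – R91725040, Graduate Institute of Information Management, 1st-year master's student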
Questions?
• Thank you