2013 11-13-SoftwareSustainabilityInstitute@Manchester
-
Upload
yannick-wurm -
Category
Education
-
view
102 -
download
0
description
Transcript of 2013 11-13-SoftwareSustainabilityInstitute@Manchester
Ant genomics & Bioinformatics for emerging
model organisms
@yannick__ http://yannick.poulet.org
© Alex Wild & others
Animal biomass (Brazilian rainforest)
from Fittkau & Klinge 1973
Other insects AmphibiansReptiles
Birds
Mammals
Earthworms
Spiders
Soil fauna excluding earthworms,
ants & termites
Ants & termites
454IlluminaSolid...
454IlluminaSolid...
This changes everything.454
IlluminaSolid...
This changes everything.454
IlluminaSolid...
Any lab can sequence anything!
Major questions
Major questions
Which genes are involved in social behaviour?
Major questions
Which genes are involved in social behaviour?
Which genes make it possible for queens to live up to 30 years?
We need great tools.
Tools for genomics work on emerging model organisms
Tools for genomics work on emerging model organisms
Antgenomes.org
Tools for genomics work on emerging model organisms
Antgenomes.org
SequenceServer.com BLAST made easy
CTTTTTtGGTCGAGGTAACCCACACGTCGGGGGgTACGGTTGTATACATTTATGGGAGTAGCACGCTGACCCATAAACATTACTTAGTAGCCGTGCGACATAAAAATTGCCACTCGCGTATCTCAGGTGGTAGTAGTACTCTTAAAAAGTGTCAGCGTGAAAGTAGGCCAAATTACTGCATTTCCGTGTACACGTATCGTAACGCCTTCTCGCGATAAAAGAAGAATAAAATTTTCTCCTAAATATATATGATACCGCAAATGGATTTTGAGTTAGAGAGTAAGATGCAACGAATTGAAATTTATATAAATAATATATGATATTTTCTTATAAATCTTACTAAAtATTTTTTTTGTCAATTCTATCAGAATATTTCGAACGCACGTTTATCTGAACTGAGACGGCGGCATCAATTAGAAGTATTGTGAACGCAGTTGAAATGTATATCCATGAGAAACTCATACTAACGCAGAGCAGTGAAAACGAACACTTGACTCAGGTTTAACCGCGACGAAACGGTATCGGATAATGGACGGGACGAAAAGCGAATTCGACGGCTGGTATCGAGTTACGTGAAGATGTACCTCCGTCTTCGTCGTCATTGAAATTGACCGCGATACGCGAGGTTCACGGAGAGAGACAGAGATCGTAtCTTCCATCTTCTAAATTTCCTTGATTTCTCGTTTATACTCGCGGCAGCAGCCATTTACTTGTTCCACTTGGCGTTCTCAATTTTGCGCTTAGGGAAACAATTAAACCCAAGGATCTCGCATTACATTTTTCCTCCCGCATCTCTACATGTAGCGTATTAAAACCCCGTAGATGCCCGTTCATGCAGAACAAACAACACGGAGCAACATTCGTGCTCTATTTATTTAAGACTCTTATCCAAGGGGGATGAGTTGCAGCGCAGCGAAACTGGTCTGGTATTTATAGTATTTGGATTATGGAATAATCCCTTGCATAGCGACGGCGAAAAGGGAAGGAAAGCCAGGAATGCACGGTGTTTACCGATATCCTTCACCTCTTTTACCTCTTCGTATTCCTCTCTCTCGCGTTTGCCAtCTTTTTTTTtCTCTAAGCGATCTCATTGGTACCGCACGAGGAGCTTCAATCTTTTCTCTTATCTGGATTACTCGTACACAGAGAGTTACAATCCGCTATAAATTCCATCTTTTAAACGCGAAATTCATTCAATTTATTTGCGACCTCAATCATTTTCCCGTGAGATCGACCGAAAAATTCTTAATAAGGATTTATTCGTTTAATCGCGAGCTCTGATAATTTCATTACTCATTGTAACATAAGCGCATTGATATGAATGTACGTATACTGCAAAGTATAGAAAAaTAAATCCGCGTTAAAGTTCTAATTTTTGAAACGTTTATTATTGCTCGTAGAAAAGAAATATTTGTATCTGTGAATACATGTGTCGTAATATTTTAAAAATTAATTCTTTACCTTTATCACGAAAGCTAATTATTTACATATTTACAAATAAACAAAAATATTAGCGTACAAAAATAAGTAAGAGAAAAGAAATTAATTTTGTCTTGGACAAAACGTTTCATTAATGTCGTATTATGTAAAAAAAAATTTAATAAAATGTCAAGATTTTATTTTTGCGAAAGAATAAAAACAAGAGCTTAAGTTCATTCTTTTTGATAGATAATACAAGGTTCTATTTTTGATAAAGACTCATTTTTAGAGGGTTTTCTTTTCACATTTTCCGCGGTACGACTACAATGCATCGTCGAAATACAAAACAAAAAGAAAAGAAAAAGAAATCACAGCGCATCTCGTGCAAAAGCTGCGAACAGTCTACCGTCTACAAACATCGTCAGGAAGATTCTTTCATCTGGTCACGGGGGAGACATCGTTCGCGCAGGTATCTCTCACCGTTGCTGTATATTCATACGTATCTTCGACCTAGTCAAACAACGATTCGACCCAGCACAAAGCAGAGCGGGCGTCAGCGTAGCGTGAAAATTTAACCCGCGACTTGGCGGAGATTTGATTTTTTGGCGTCGGTCAAAAGGCAGACGTGCCCTGATATATCCGCGACGTTATTTGATGCACGGCTACCGGTACAATTGGACTGAGGTAAATAAGAAAGCAGCAACGACAAAAGACTTCCTTCTGTGTGGATGTTAGTCGTACCCTTGGGCGTTGTCTGCCGCCGTATCTCCTAAGTTGGTCGGGGAACGCGGAGTAAGGTTGGACGAATCCTAGTAGATAGCACAGCTGTATTAAATAAATAGAGAATCGCCGAGGCTTTCGACGATGTAGGAGGAAAGAACCGTACTCCTTGAAAGTGGAAAAGGTTGAAGTCAAGCTCCCCCCTCCCTCCCCTCTTCTCCCCGAGAGAACGTCGAGAGGAAGCGTCGTGGATGGAGAACAAAAGTCTGCGCGAAGAAATATCAGAAAAGTCGAACTAATATTTGAGATTACCCATCAATGCTTTTATCCGTCGTGTATCTCCGAAGAAAAAAAGATGGGTATAATTTTAAAAATACATTTGTCAGTGTCTGATATGCGCCGATCTTCTTTCCTCCATGCATTAACGCTTCGTGCGGATACGACCGGGCGTAATCTTAAAAAAAaTGATCGCCCATATACTAAACCACTGACAGCACGTACAGTTCGCCGAGAAGTTTAAGACCACTTTCAGCGCGTTAAACTTTTACACTCTAATTAGAAATTAATAAAGACTAATAGCTTTATCGTCGGTCTATTAAATCCCGTGGAGGAGGTGCTGGAAGTTAGGCAGTCGCGAAGGTGGTCGGTCGGTACACTCGGCAATAAAAGTTTTTTtCTTCGTGGGATAACGGGGACTTTAACTGCAAGAAGAAAGAAAAAAGaGGAAAGAGAGAAAATTCGTTGAAAGCGAAAGTCGAACTACGTTATTTATCTGATGGCCGACACTTCGAAGATTGTTGCGCGGTGTGTCGACGCAGCGCGCAGACGCTCACCAGACGTTTTCTTTTCATTTCTTATTTCTCCCGTCGAAAAAGGaGAGAAAAGGAGAGGATCGGGACGAAGTGAGAAAGAGTAGCGATAGAACGTCGGAAAGGGGAGAAAAAAaGAAATCAATGAGCGGAAACGTGATACGATGGGCGAAGGGCCAGGATGGGGGACAGGGTCGGCCGTGCTGTAGTTTCTCTATTTCCTTCATTAACTCTGATTCAACAGCAGCGTAACTTTTAATTGGAAGTGGTGGGGGgTCGAGTCGAGGCTCATTTCCCGGCGGACTTCCCCATACTGGAATTCAAGCCAATCCCCAAGCGTCTTGCGAGAGGTCACTTCATTCGTTTCGATATTCTCAGATTTTACTTTCATAGCCGTGCGATATAGCATTTGAATTGCTTCGAATTAGAAACGGTAGCGCCTATCtGTTTTTTTTTtAGAGATTAAAGAGATTAAAATCTATAGAAACGTAGTTTTTCGACAGATAAATGCTTCTTTACTGATGTCACATATGATACGCAAAGGGAAATAGATTATAATTAATGATTGTACAATATTTAGTGATATCCCTAACAATAATCATCACGAATGAATAAATATGAATGCATAAAGCGATGTACAATGGCACATTAATTTCCTATCCATTATAAATATAATAAATGTATAAATTGTATTAATGACGAGAACGGGAGATGCGCTGATTTAATTTAAATAAAATCACCAGAGCGTCATCTTCCAAAACGGAAAATAACGGAAGCGACAGATGCTTTCGAAAGCGTCCGTGCCTCCGTCCTTTACTCATATCCAGATAAAGGCACACGTGACGCTCGTAAGATACTGCCGGAAATAAGAGCGATAAAAAGCAAGGCGAATAGTTACACGCCGGGCGACGAGGtCCTTCGAAACGCGAAAATGACGAGCCTGCGATAAAGGACCGCCATCGCGAACGGTAACGGGAGTAACCCCGTGGAATTTAGTATGATCCCCCGCGCACGGCACATCgCGATACTATCGGTTTCTCGTTACCCAGATCGGTATATTGTTATTTCAGCAGTTCTCGGACAAGGTTCACGGACATTCGATAGCGCGAAGGGGACCGGAGCTCCTGAAGAATTCAACGTAACTGGCCGATAATCGGCCTCGTAACGGCTACCCATTTTCCGCGGCACGTCTCGGGGCACCATCATCGAAACGACCTTATTTCCATGCATCCACGTTGCACCTGCTATTGCACCGTAGCCGTCACCGAACTATATCGTCGTCGAACTTATTCGATGTACGATAATCCTTACGGAAAGCTTTAAACAGCTACCTGCATGCATACTAAAGATACGTCCTTcGCTTCTGGGATCCACCTTGTATTCGCGAGACATTGCATAATTTGTATTCGGTGTAAATATGCTATCCGCGTGGCGATCGACCACCGCGAACCGCGCGGAAACTCCACAGGCGGTGCTTGaTAACGTTCCGCGAACGCAGATAAGCAAATGCCAACTTATGGCAGCAGACACTcTACGCGACTTAACAAAATCTCGTATAAACCCGAGATTTTACATGGCAGTTTAACCAAAATAAATTAAGAGAGAAAGACGGAAGAAGAGAGACAGACGACGTGCCTTGCGTCGTCGTAAGAATACGTAAAACCGTGAAATGCTCGAAATGAAGCCGTCCGtATTTTTTGCATACGCTTATATCCGAAGGGCCGCTCTCCCCTCGTAATTGCGTGTCCTCGACAAAAATCTCCTTTCCGATTTGCCTCTTAGTGCTCGCGCATTCCGTCCGCGTACGACGCTTGCAAATCGGCTGCATCGCCGGCGTCACCGGCAGAAAAACGGAGAACTTTCACGGCGAGTCTCGCGGAGCCTCGTACGGCAAAACACCGCGACGACGCCTGCCATTTTGGGCTGGCACGGGAACATAGCAGGGGCCAGGATGACGGAGGGGTGGTGATAGGAAACCCCGGGCAAAAAACaGCGCCTGCAGAGATCCGCGACGCGGCTGTTTAATTCGGGACTTCGCCCGGGGGAGGAAAAAAAaGGGCCGAGGCCGCCGGCTCCTCCTCGCGCCGCTCATGCATTATGAATAATGGAATTTAAATATACGCGAAACAGATCCCCGCTGGGCGAATTCGGCTTAGAGAGGTGGAGTGCAGCCTGGTGCCACGGTGTCGTGTTCGCATTGTTAATTGGCCGGCCGTTtCATTTtCCGTTGCGGCGGCGTAACGTGAGAGACAGCGTAACGGGAAGCGAAACGACACGGCGCGTCGAGCGTGCACAGAGAAGTTGAATTATCGAGGGaGAAAAAGGGCGCGAGGGAGGGGCAGGACGGTCAAACACTGAGAAAGCAGGCAACATCCGCGAACCCGTAAATTAATTAGGGCCAACATAATAGCTGGGAATGGCTGGTAGTGGTGGTTCGTCTTCGTGGTCCAGCTTGCGGTTCCCGAGTGCCCTCGCAGTTCGCGCGGTCATGCCTCGGAAACTACACGATACACCTTGCGTCATCCTTACACGAGGATACCTGTTTGCGATCCCCTGAGGTATTAACGAGCGTATAAGCAGACTCAAAGTACGAACACACTGCTAGTTTCGCGTGCATGTTGCTCGTCGCCGTGCAGACGAGTTAGTACTGTCGACAAAGTAGTCGTAAAGTACGTAAGAGTCTTCCTCTAGTAGCGAGGACTAATCTCCCGGTACATTAATCTATGTATATTTTATGATATACACACCTGTATTTATCGAAAGTTTATGTTTATCTCGAAAATTAACGTTAATTTTCGAGGACGCAAAAGCTGTCGCAAGTTTaTTTAATTTTGTTTGtATTTTTTTTTCTTGCTCATTTTTATTTCGAAGTATGCAAATTGAATCTCTTGGTGGCATAACGAGAATTTTCGAAGCTTAAAGGGATCTGCGTTGGCGCGAAGAGAATGGGACTTGTTTTAAACTTTTCTTTCCAACGAACACCGTTTCCTTCATTGTAACGTAAAAATGGTGAGCTTCTGCGGCGCGATTGGTCTCATCTTTCCTCTCGTATCGTTCGTCATTTTGTCCGAGAGCGTGTAGATATTGTACCCGATAGAACGGAGACGGAATCACGCTATACGGTTCCCTCCCAAATGTTCGTTCCTCACGGGCAAAAGTACAAGTAAAAGTAGGAGGGTCCGACTTTATGACCCTACGAGGCACAATAAAGGTTGATGACTACTAAACGTAGAAGAACACGTATACAGTCCGACGTAAAGTTATTTTTCAATTCCGCGGTCCTCCGCCGCAGTCGTTTTGCCTTGACGTGGAAAAGGAAATTTCCCGGGTTCTAGTCGTCCGGTCTTCTTTtCGTTCTACTGACAATACCATAtTTTCGACATAGATCGATCTCCTTTtCTCTTtCTCTCTCTCTCTCTCTCTCTCTCTTGAAAATGAAAGCGCGGAAGAGCCGGATCGTGCAACGCACAATGCGAGGCCGCTTTGTACGAGGTACGAGGAGTCCGTCATGCGCCGTAAAACGCCGGAATATGCAATACTACTTTCGCAGCGACGGGGCTTGCACGAAGTAATATCGTGAAACGTAGAACCGTTCTTTTCATACGGATATGCGGGAGAAGTTGCTCGTCCGCCCTCCGGCGCATACACGTCGCGCGAGAGTATCGTGCATCCGAAACTCTGAGATGAAACTCTTCTGTAGTCGATATTCGTCGCGATAAATAGATAAGTCTCGGATAAGGTAGAGACATCGATAATTCCGTTACGAGAATACTCGAGAATAAGATCCAAGTGAAGTGATCACGCTCCATATGGTTTCAGATTAATCTCCAATCGGCTGACGAAGGAGGATCATCCTTTTACCGGTGGAAAATAGGGTGGATTGGCGGAGCAAGAAGGCTAAACAGAGAATAAACAGGAGCTATAGCCGAACGAGGGAGAAGGTAAGTAGATGCCTCGGAGAACGTGAGCGAAAAAAGGAAACGGGTACGAGAGAAAAAGAAACGGACGGAATAGCGGCTCGGGATTCGCATCCCAAGGAAAACCAAGGCTATACCGGGGTCTTGGATTATTCGAGGCACGACGACGATCTTCGAGTCAGCCGACTGTCTGTACCGTGAAAGTGGCACGTATCGATCGCACGGCTGGATTATCTTCCACTTCGATCTACGACGATTACTTCCGCCATCGTATATCCGGGCTTTGCACTAGCGAGGCTATTTAAAAATCTGCGCTCAGTAACTACTTATGATTTTTCCATCAGAAACGATTGTGGAGAGAAAAGAGGGAAAAAAAGAGATAGACAGCCTCTGGCTCGAAATGCTAATTTCGCAATCGAGAATTAAAATGCACTTCTTTGTATCTAAATTTTCGTAGAATTAAAATAAAATTGAATAAAGCAATGAATAAATTGAATAAACTAAAATATAGCTAAATATTTTTTCCTCTATACAAGGTGAATATAATTATCAAATATTTAAGTATGTAGATTGAATTTAAACAGCCTCGGAAGAGAAAAGAATCGGATAAACGAAATGCTTTTGCCTCTATTTTCAAGCACGTGACGAATAAAATCTAGCAAAGCTTTTCGACACAATATGTCGACGCAAATGTGGTCTATTTTGGCTAATGATTATTACCGGGAGTCCCGGGCACGGTGTGTCCGCGGCGCGATAAATTAGAGCGCGAATCGACTTCCACGGCCGTTGTAGAAGGTACTTTGGCAAACGTTATTTCTTCCTGTCTCGAAGGAAGCGCCACTCGAAAACTTGGAAAAGTTCGGCCAGCTGCACGCACCGCGATCTCGGGTCCGTTCTGGCTCGGTGGCGTGCGCAGGACGGTTGTGAGACGAGAGAAAGAGAGAAAGGAGGATAGAGCGCGAGAGGGAAAGAAGAGGGAACATCCGCGTGCGTGGTTGTATATGGCGTTAATGGCGGCGAGCATAAAGCATCCCCGcGCCTGCACGCTCGACCACGGTCGTTTACAGTGCCACTATTTATGTTGATAACTTCGGAATGGAGTCAGATACACGGTACCGAGTGCCGGCGGTTCGGCGGTGGTCCGGCAGCGGTCTGGTAGCACTTGCAAAGTATGGGAGAAGAAGGGGGATGGTGGATTCGCGGATCTTTTTGGTCGTGCAAGGAAGGCGGGTCGGTTAGGGTAGGTAGAGAGGAGACGAGCCGAGTCGAGAACAAGATTCAAGCGGAAAGTTTTGCGAGTTAATGGTGCGGAGGTAATGCCGCGGCCGCAACAGCAACAGCCATCCCCCTGCTTGTGTGTTCGCTCGCTCGCTCACTtCTTCCGTTCTCTCCTCTTCGCGCTAGTTCTCTCTCTCTCTCTCTCTCTCTCTCTGtCTTTCTTGGAATAGCCGTAGAAAAAaGGAAAAGAAAGAGAGAGAAAGAACGAGACTCCTTTCTCCCCACGAATTCTCTCCTCTCTTTAAGCACACTCTCTCTTCCTGCaccCCCCCCCcTCcTCTCTCTCTCTTTCTCTCTCTATTTTCTTGTCTTCCTCACTTGCATCATCCCTCGTCCATTTCCcTCTGGAGAACGTGCCACGTTCTCACTTCCTCGTTCGTCGCTACTTTTTTTCTTCTTTAGTTGTCTCCTCTCACACCCTCGAGACGGCCCGATCTTTTCCTCGTACAGCTCCTATCACGAACAGTCGCTAACAAGTGCATCGAATGCAAGTTGCCGACAAACTTCTTCCACCGATTTGTGCTTGTTCTCTGTGCATGCGCGGGCATGTATATCTCTATTAGGCGACATCTGCTCTCAGCTTTTTCATAACGAGGTAGCGCGGATTCGCGCTCCGAGAGACTTCACGAAGCACTTCCGATCTCGCTACAGTAGAATGCGTATTGTATTTTCTTGTCTCTCTCATTTACTCTTTTCTATCTTTCGTATCTAGCGTGAATACTCCCATGAGGAATGTAGAAACCAGATTTTGAAACGGCTTCTTCGTATCTAAATTTTCTTAACTATTTGTTCCTCAAAGTCCATTTTGACGTTATAAATTTTTATTTTTTAATGAGAATGTTTTTATGTGGAGAAGAAAAGTACAACTTTTTTCAAAATGCAATTAAATTTATACAAGACTACTAAGATAAAAAAAGATGCAAATAATAATGTTCAACTTACTAACTGCTATATTATTAAGGCCAAGTTAAATTACAGATTTATGATGTTGCAGAATATAATAGAAAAGTTTCAAGAAAAAAaCATTTTAAAGCTTAAAGAGTTTGTTTTCAAACCCATCAAATTTTTTtCTCTGGCAGTGCTGCTGCTCTGCGACGTCATTTTCAAGTCGATCAATGCAAAGTTAACAAAAGTATTTCAACTTTAAAATCATGCAGAAAGTTGGGAAAAATTTGTTTTCAATTTTACTTAAATTTTAGTTACATTTTTTCTAAAAGCTTAAAATCTCCTCTTTAAAATATGTCTTTAAAAATTGAGCTATGATTTTTCTTCACCGAGTTATCGTAATTTGAAATGGCCAATCGACGTTTTCTTTTCATCACTAGGAATAATGCCGGCGGATTTAAGCTTTCAGAAGATTCCAAGCAAAGTTGAAAACAATTTGTTTGAATTCCGCATGATCTCAAAGTTGAAGCTAAAAAAATTTCACAAGACTTTAAAAGACAAAAAGTCAATGTTGCACAATCGACTTGGAAATAACGCCACTTGGAGCAGAGAAGTATTGCCGAGAAAAAGCTTCGACCGGGTTTGAAAGCAGGATCTATAGGCTTCCAAACTTTTTTTTtGAAAAATGGAGCTTTAAGGTTACTTTTCACGAAATTGTATAAATTGTGTTTTGTTGtTTTTATCATCGATTCGAAATGATCCTTTCATTTTGTTCAAGAATGTATCATTAACTTAATGCAATGATATCTTTTAATAATTCAATACTTCTTACTAATTAGATTTAGAAAAGTTTCATGAACAATTTTAGAACGGTTATTGCTAATTATTCGCCAAAAACAGACGCCTTTCTTCGAGGAGTAACAACTCAAGCTGCATTCGCGGTTGACGGCTTTCCGCGGCGCGTCGCGGTAATATGGCTCCTATTTAATTACCCGGTCCTTTGGGAGCTTAAACCAGCCAGAGCTGGAACGGCTGCGCCGTTATTCTATCTAGACTCCTTTGCTTTCTCAAGCAGGCGCGCGATCAAACCTTCTCGCATAAAAGACAATCGCAGCTGGCAGTCGACGACGCGcGGGACAGTCGAATCATGACCCGCTCCTCTCTAGTTACCGCCGTCAGTCTGCTCTACTTCCAGACGCCGCGCGTAATCTATTCGACATTAGTTGCTTGATTGCACCGTAAAATGCGACGGCGACGTGAACGAGAACGACGACGACGACGACGACGACGACAACGACGACGACAATGACAACGACGACGACGACAAGAGTGGGTTCGTCGGTGCTCGATGGCGCCTCCGATTTCAACCGCAGCACGATGCAAGCCCATTACTATCGCCCGGAGCTAAACGGCACCCGGAGCTCGTGCCATTAAGGGAATCTAGGGTCCGATCCACCTCATTGAATTCCGTTcAATTGCGATTATGATAATGCGTGAACGATCGCCGTGGACGTGAGCTACGGaCAACgAGGGTGTTCGTTCTCGGCTCAGAGAAACGCAGCGATAAAATTATCAGTGACAGCTTCATTTTTGTTACATTTGACGTTGAAAAATTTGCGAAAGAAATGTGCATTATTCAACTACATTGACAATTGTAAATCTTACATGACCTTTTATTAATATACATATATGTAAGATATATGTTCCATCTCTAGTCTCGTTCGATAATGAAGAAATTATAGCTGCAACATAGACGCCGGATTATTCGCGGGGGAAACCGTTGATTATTTGGACTCCGTGGGCTCGGGGTCTGGTTTACTTCTTCCCTTATGTCCGGAGATAATGGACACTATTACCTTAACGAGGCCCCAGTCTCTTAGCCGGTAATTGCTTCGAGATATCCGAGAGAGCTCCGGCGTACGTTGCCGCCTGGTGTTGCAGGCAGAGAACCCaGACGGTATTATTGCCGCCGAGGCTACTCGCCCGTTTCATGGTGGTTATTGTTATGGGCCTGCGGCATTAAGATGTACACCGCACTCTCATACGGGACCCACCACCCCATATACGAGCTGCATATATACTTATGCGGGGAGGATTTCATTACGCCGCTTCATAACTGGCGGTCTTCAAAATCGCGATAAATCGGGATTGTCTTCCTTGTTAAATCAGCTCCCGTTCCCCTTCCTCATATCGCTGACGAAGCCAACGAGACGGATTAATCTGCGCGATTGAATGGCCTCTATGACGTAAGCGCACCGTTTACTGGCACGGCTCCTCGTGTTCACGTGAAACGATTTGCGTCGTAAATATTTTATTTTAACATCGCAACATCAAGACAGACGAGGATCGGCTATTGCCTCGTGATCCTAAAAGGGAGATTCTCAAGGCGGAATCGGGGTAACGCGTTCGTTGATCTCGCCAAACTACGGCATCTTGAGGACTAGTCTTAGaGGAAAAAAAGACGACGAGGAGACACGGTGAGCATTAGATGAGAAAGAGACGGCGCGGCGCGGTGCGGCGGAGCGAGACGGAAAGAGATCAAATCTGGATATCAGGATTAGGGTGGGTACGTAATCCGCAGGACGACGGGTGGTAGGAACGGTGGATCCGTCGGCAGATCTCATCGCGCGGAGGAACTCTGCGGGTACTTCGCCCGGCCAAATACGACAAGAGCAGCAGCCTAACTCGAGTAGAGCCGCCGAGATGGTTTACGGCTCGTCGCAAGTTGGTGAAATTTAAGAGCGCATTTAAAACTGGTTTGGCGCGATGCCCTGCGCCTCCCTGCAGCAGGTACTGCCGGCGAGAGATACCGCTGGGCTCCGACAGGAGTTTATCGCCCTAACATTCTTGGAAAGCTTCGGATGGATTTCCCTCCTCAACCCATTTCTTCCGGCAGCACCTCAAATCGTCGCTTCTTTCGAGCTCCCTGTGTCGTCCGTTCTTTCCCCTTGCCACGAAAATTCTTCCAACAGACTCGACACCAAATCGTCACGACGATCATCGGTCAAGATACCCGGAGAAACGTCGCGGACGATGTTACCGTCAGATCGGACGCTTTTAGCTCCGCATTGGAGCTTTTTCCTCGACCGTCTCGGAGGAGATTTAGCGGACCGACAACAAATGAAGGTCATCTTTCGCGCGAGAAAGCGTCTCACCTTGATTTCCGTCTCGTTTCTCCGGAAATTGGATATCAACCGGACGAACTGTAATGTGTACGTTAAATTGCCAATTATATATATAATGTAACAGCTGATTATCTCAAGTGTCCTAAAAACCATTATACTCTTAATTTCTGTGAAAAATGGCGAAAATAAAAAAGAAACCGATCTTAATAAAGATATTCTTCCTGATAGATGCCATGACCCACGTGGAAAACTTTTTAGTTTTGTACAGTGGTATTATACGTTATCTTCCGCTGAACGTAAGACGTGCCTATCGCGCAATTTCATCGCGACGTCGTCGTATAGCGATTATGGCTACTCCATTAAAAATGAATTTTATAAAGGCAATCTTTCCAAGCGATCGTTGTAGGAGAAAAAGGCGAAAGCCGGAGCCAAAGGGGATGAGGCCACTACCTTTGGCTGATCCACTTCGAATGATAATCACCTCTAGGAGACTCAATTTCGCCCTGCTCCGCGTCCTTACCCGTTCCTATCTTCGGAAGGTTCAACGCCGCAGCGGACTGCATCTTTCACTCCCTTCGTCACCACCGCCCTATTCCTATCGCCCTCCGCGCGCCTACCGCCCCTATATCCTTCCCTTCCTTCACtCCTAGACTATTCTGAACGACCTCTTCCCCCATTCGCCAACGCTCACTCCTAACTGATTGGAGTACCAATCAATGCGGCATTCAGGCGGCCGTGCTGAAaCTTTAGGAAATTAACTATTCACTCTCTGGAAATGGTTATTTGGAAGGCCGGAAAGGCAGTCGGGACTACGTTACGGAGAATATGAGCGAAAGGATATGGGCAGACGATGACGATAGATAAGACAGTTCAATTTTCAATTAGGCGTGGACGTAGCGCGCGTAGCTCGGCTCCTACCACGATGCTTACCACGCGTCCCATACGTTAGGAGGAATCCGAGTTATCGATTGCACCTACCTCTTCTCGCGGATCTTACCGCTGGGAAATTTCGTTCGCTTAATGAGATTATTGTACTCGCCGGAAAATCCAGAtAAATTATTCTGATAACTGCGCAACGTCGCCTTCTGGTGTTTCGCGAGCCGATAACGAGCATAGTTAACCGCAGAGAAATTATACGGTCCACATGAGAGAATATATCACTTGGCgCGGGGACGGCGGGGGGGGGGgaGGGgAGACGTACGCGCCTCCAGTCGCGCaCAAAGCCACGCGCGGAGAGAAGTATTAtCCCGGTACTCTCTTTCGTTAATGGCATCGTGGTAATCAACAAAGCCGTACGAGGTAATACGAGGTATTTACAGTTACTCCGGCGCAGCAGAGGGCACCGAGCGTCGAGCGGTGGTAGGCGCATGGGCGGATTTATGCGATGTTTACACTCGCGCAAGGCTGCTAATACGGCTTTCCGACAACAACCCCCACGCCACATCGCGACTGCTCGACTGTGCCCCGCTGTTTGAAAGTGAAACTTCTGTTTGTACGCTAGTTCCAGCTAATCCCGAATATTACGCCGGCGAATCCACCGCCGCGCGTCGGGAGAACGGAAGGCGGCAGCCCAACGCTCAGAACTCTCCGCAAGGCAGAACTGGcGCCTTGAGCGAGGCGGAGAAGTTTTCACAAAGGAGAAGCGCTTCGCCAACTGCCGTGCCGAGCAAAATACATGCGCCCTCTACGAACGTTTGTATTAGACAAGAGCCAATCTTAGCTCCAGCTGGTCCATCTTACGAGAAATCTCGTCGGTTTTGATTCGCACGTCTCGAATCGTATAATAAAAACCGTTGGCCCCAACGACAGTTGTTCGCGTTTACGTTGAGCAACGTTCGACGTGTCTAGAAAACTACAATAAACGGGAGTCGAGAAACGAGATCGTTGAGAAAGGTCTGAAAGATTTCAGCCCTACTACGCCGTGCTGGTACTCCAGTGGATATGTATATACATATACGTATATGTATATGCTCACCGAGAATCCGATCTGCAGTCTCAAACAGCTATTTACCGTAACGTCGGTGGATCGTCGGTACTTTTGTAGCCAGAAGGGAAAATGGATCGGCGGAATCGCCGAGAATGCTACTGCAGTATACAAGTCCGCCGCTGTAACTCATTCACCGATAAATTTCGGCGGGCGAAGCATCGATCACAATGTTCTATTTTGCTCATTGGTGTTTTTGCCGGCGGCATTCGATTCAATTCGAGGATGCGGTGAGTCGATATATCTCTCGAGGTGTCATGAGCCGCCCGTAATGGATGGCGGCTGTTCCCTTACCCTTACTCCGAAAGGAGAGGCCGTGCCAAGAACCTACCCGGTACCCGCAACCCTCTGCTTCTGATCCTGATTTTTGTCTGCTGTCTTCTTCTTATTCCACGCATTATTCTCGCTCTTTCCCTCGCACGGCATTCGCAAAGTGGCCAACAAAGTAGTCTACGTATCTCTCCACGCGATATTTCGTAGCACATTTTCACATTTAATCGCGAGACAATCGGAATCCAAAATATTATCCGTGATATTTAATTTACAAACGTCGTTTGCGTTACCAAAAACAAAATATTTTTTGATTAATCAAATAATATCATAAAAAATTCTTTCAATGCATTTTACCTTTGATGACAGAAAAATCGCTGCCACTCGGGGAACGATGGGATGGATTTCGCGGTAGAACAAGCAAACGCGAAAGCACGCGAAAGGAAATGCACGCGCGACTGGTCCGACAAGGATATTCATAACGTGTGAACGATCACGAGCATTCAAGCTTCGGCGAATGCTAAGACGTGCGAAAATGAGGGTTAAGAAAATCTCGAATCGGGTAGATAATCCTATAGCAGTAGTGAAAATACAATAGGCTTTAATTGCCGGACTTAATTCGTAGAAATACGATTTCGATTAAATGTAACGACTCTCGTGTTTCTCAAATAAAAATGGCACTGAATATAATTGATCACTTATTGTCACTTATCAGTTATATATTAATAAAACAAGACGTTAAAAATTTAATAAGAAAAGATGAAAAAAACAcGATGTTAAAAATTtAACCCTAAAATTAAAATTTTTGTTTATATTAATTATTTTTtATTTATTTTATATTTTTGTCACTAAATTTTGCTTGTATTAAATAAAAATATAAATTTTAATAAGGAAATACGATTTAGGCAGTACTTTCCCCTATTAAGAGGAAGTACATATAGTTTCTAAGAATATTTAGAAATATTTATGCTTTGCATTTAAATAAAATAAAAATGGCGAATTTGTTGCGCTAAATTATTAGTTCTTCAAATATTATTGCAAGAATTATAATCAGTATTGTAAAGTAAATTTTACGAGTATTTTAAGAACTGTTTATGATCTAAAAGGTTATAATTTTGTAAAACGGAGGAATAATCGTCCATCTGTAATGATGCATTTACGTCACAAGAGTGACAcTTCCCTTGAAGCTCGACAGTGCTAATACGAGTTCAAGCCTCTACCATTCTCCGAAGTGAGGATCGATAGTCTAGCATCATTGGGCTCCAACGGCTGATGCGGAGGAGTCGAAGATAGAAGGTACAGAAGGGAGAGGATAGGTGGATGGTGGACGGGGACTTTCGCATGCGGAGTACTGTCGGCTTCGGTTCGCCGAAGGATTAGCATGAATGGCCATCGTCTGCGCCAGTCGAGATAGCAACACAAGCTCTCGCCTATTATTGCTCGGAGAAATTAATCGCATCCATCGCGAATTACTCCGCGATTGCTTTTCCGAGTACAACACGTCTCGCGGTCGATGGGGCAGAGACAGCGGGGAAAGTTCGTGTGATAGGCGTCGTAGTAATTATCAAACGGAACAAAAGAGCGGGCTACCTAATTACTACTTCAATTCCCATATTTATTAATTAGCAATTTATATTATAAAAATTAAAATACTATAAATTGTATATCTTTTAATTATTAACTAATGGGAGACGTTAATAAGTTTAAAACGATGACGACATTAGGAAAAATGTTGCTTTTGTAACGTTTCTAATATTCGTCATTTGAGGAACAGTCGTTACCATTCAGTTCGTATCGAGCTGGATTTGATATAATTATGAAATAGCACACAGCACCCGCTTCCCATCAACATCCCTCCTTTTCCCGAGTCCCTCGGTCTCGGAGAAACAGAAGCGTAATTCCACTCCGAATGCAGTCATTATGGAGATTTTATCAAGGCCTGTTGCCAGCTAGCTAGGCTTCAACTTAACTCGCCCATTACGGCGTTTCCTTTGTCTGCGACTTACACATCGTCATGTCCCGCAGTTCCATTCCGAGCCGAGGGGAACACTGATGAAAGAGATCCGACGATTCGCATACGACGGTTCGATTCACAATGCCGCGCATTCACACACGCGAAAGAGCTCGCGAAGGATTCTGGGATGCATCCGAAAAATAATCACGTGGGACACGAGCATCGACAGCTTGCGCTAGCACGCACGATGGAAACCGCCTGCACAGTTGGACGGATCAGCGAGAGGCGTCGTTTTACGAGGCCGATACTTGAAACGTCGTGCTGAAGTATAAAATTCTACGGGTCATGCAACTGCCAGAGTACCGATCACGCGCATACGTGTACGTAATGCGACTTTAATGCAGCATGTATTATTCAACGATACCACCCGCCGTGCGTGTCCGTCCGACAGCTCGTATACGCGTAATTACGAGCATAGTTGTTTATTGCGAATCCGGCAGTCCGCGCAACGCCTTATCCGGCGGTTATAGACGCGCGTCTCTCAACCCTCCTCGCCAAACGATCCCGATACCTATCCGCCAACCCTCTCGTCCCTCATCTCTCGTCTCTCGTCTCTCGCCATCTCGCCTCCTCGCCTCCTCGCCTCCtCGCctCcTCGCCCCGCCTCCTCTCGCGTCGACAACCCGTGAAGAATGATCGTGGAGTTTCCCTGGTTGGACAGTGACAAATAGAGTATATTCTTGCCAGCTCGGAGAAGCGTGTTTTACGCTTTGTTTACCGTTTGTCATGTTATGCAAAGGAAAGGAATTTTTGCCGGTTGCGTATTCGGCGAATCGACCCTTATCCTACGCGCATGATATGCCTGCGGTAGTTTTACATTATCGCTCGGCTGCGGCGCGGCGCGGCGCTACACAACCATTACCACTAACGATTCGGAGCGACTCCAGGAGTAAAGAGAGAAAGGCCGTAGTCGCATTTGCGGTAAATTGCGCCCATTTACTTCGATTCGCAAACGTAATTTCTGAGATtACCTCACGCTAGTGTCCAGCGATGGTAAAAGTAAGCTCTCTGACCTTCCACGTACGATCCAATATACGATAGAACAATGCATTATTCGACGGCGCGAAATTATTCATAAGCGTCACGAATTGTCCGCAGCGAACGTGGAAACGTTAAACGTTCGGATCtAGTTTGACGAATCTAGAATCGAGACAGTCGGAACCGGACCCTCTCGACTCTCGATCCGGGTGTTTCGCGCGAGGCACGTCGCAATGTTCGGAAAATTGAGGCGAGCCGCATAATACGCATATAGTCGCGTAATAATATGCATACAGGTGCATGCGTGACGACGTGACGCAACGTGACGCCCGAGCGCGGGTCGTTAGCCGTACGGtCGTTTCGCTGGCAAGTCACGCCTAATCACCGATGAATACAACAATCGCAGCATGATGGACGATCGGCAAGCAAGCCGTCCTTTATTAGCGTCGGGTACCCGGGGGCGAGGGGCGGGTGGGAGAGCGGGTGATCATATAATTAGATTAGCGCGCGCCCATCGCGCGTCGTCATCATTTTCCGCGGTGATTTTGGACCGCGCGGGAGATCGGTAACCGCCCTGCGCCGCGCCGCGTCGCGTCGCGTTGCGGCACGGCGGTTACtGTTCTCTCGatACtcGTTGGGATTGTGCTGC
CCTTTGATGTCGCAACGCAGGACGGCACAACGTTCtCTTCCGgCgTaCGAAATCATTTtGGgACACcGTCGCGcGCGGcaCGAGaTTtAATTTgCGCCGGCAACGATCgCGC
Sequencing a genome is now cheap & easy.
CTTTTTtGGTCGAGGTAACCCACACGTCGGGGGgTACGGTTGTATACATTTATGGGAGTAGCACGCTGACCCATAAACATTACTTAGTAGCCGTGCGACATAAAAATTGCCACTCGCGTATCTCAGGTGGTAGTAGTACTCTTAAAAAGTGTCAGCGTGAAAGTAGGCCAAATTACTGCATTTCCGTGTACACGTATCGTAACGCCTTCTCGCGATAAAAGAAGAATAAAATTTTCTCCTAAATATATATGATACCGCAAATGGATTTTGAGTTAGAGAGTAAGATGCAACGAATTGAAATTTATATAAATAATATATGATATTTTCTTATAAATCTTACTAAAtATTTTTTTTGTCAATTCTATCAGAATATTTCGAACGCACGTTTATCTGAACTGAGACGGCGGCATCAATTAGAAGTATTGTGAACGCAGTTGAAATGTATATCCATGAGAAACTCATACTAACGCAGAGCAGTGAAAACGAACACTTGACTCAGGTTTAACCGCGACGAAACGGTATCGGATAATGGACGGGACGAAAAGCGAATTCGACGGCTGGTATCGAGTTACGTGAAGATGTACCTCCGTCTTCGTCGTCATTGAAATTGACCGCGATACGCGAGGTTCACGGAGAGAGACAGAGATCGTAtCTTCCATCTTCTAAATTTCCTTGATTTCTCGTTTATACTCGCGGCAGCAGCCATTTACTTGTTCCACTTGGCGTTCTCAATTTTGCGCTTAGGGAAACAATTAAACCCAAGGATCTCGCATTACATTTTTCCTCCCGCATCTCTACATGTAGCGTATTAAAACCCCGTAGATGCCCGTTCATGCAGAACAAACAACACGGAGCAACATTCGTGCTCTATTTATTTAAGACTCTTATCCAAGGGGGATGAGTTGCAGCGCAGCGAAACTGGTCTGGTATTTATAGTATTTGGATTATGGAATAATCCCTTGCATAGCGACGGCGAAAAGGGAAGGAAAGCCAGGAATGCACGGTGTTTACCGATATCCTTCACCTCTTTTACCTCTTCGTATTCCTCTCTCTCGCGTTTGCCAtCTTTTTTTTtCTCTAAGCGATCTCATTGGTACCGCACGAGGAGCTTCAATCTTTTCTCTTATCTGGATTACTCGTACACAGAGAGTTACAATCCGCTATAAATTCCATCTTTTAAACGCGAAATTCATTCAATTTATTTGCGACCTCAATCATTTTCCCGTGAGATCGACCGAAAAATTCTTAATAAGGATTTATTCGTTTAATCGCGAGCTCTGATAATTTCATTACTCATTGTAACATAAGCGCATTGATATGAATGTACGTATACTGCAAAGTATAGAAAAaTAAATCCGCGTTAAAGTTCTAATTTTTGAAACGTTTATTATTGCTCGTAGAAAAGAAATATTTGTATCTGTGAATACATGTGTCGTAATATTTTAAAAATTAATTCTTTACCTTTATCACGAAAGCTAATTATTTACATATTTACAAATAAACAAAAATATTAGCGTACAAAAATAAGTAAGAGAAAAGAAATTAATTTTGTCTTGGACAAAACGTTTCATTAATGTCGTATTATGTAAAAAAAAATTTAATAAAATGTCAAGATTTTATTTTTGCGAAAGAATAAAAACAAGAGCTTAAGTTCATTCTTTTTGATAGATAATACAAGGTTCTATTTTTGATAAAGACTCATTTTTAGAGGGTTTTCTTTTCACATTTTCCGCGGTACGACTACAATGCATCGTCGAAATACAAAACAAAAAGAAAAGAAAAAGAAATCACAGCGCATCTCGTGCAAAAGCTGCGAACAGTCTACCGTCTACAAACATCGTCAGGAAGATTCTTTCATCTGGTCACGGGGGAGACATCGTTCGCGCAGGTATCTCTCACCGTTGCTGTATATTCATACGTATCTTCGACCTAGTCAAACAACGATTCGACCCAGCACAAAGCAGAGCGGGCGTCAGCGTAGCGTGAAAATTTAACCCGCGACTTGGCGGAGATTTGATTTTTTGGCGTCGGTCAAAAGGCAGACGTGCCCTGATATATCCGCGACGTTATTTGATGCACGGCTACCGGTACAATTGGACTGAGGTAAATAAGAAAGCAGCAACGACAAAAGACTTCCTTCTGTGTGGATGTTAGTCGTACCCTTGGGCGTTGTCTGCCGCCGTATCTCCTAAGTTGGTCGGGGAACGCGGAGTAAGGTTGGACGAATCCTAGTAGATAGCACAGCTGTATTAAATAAATAGAGAATCGCCGAGGCTTTCGACGATGTAGGAGGAAAGAACCGTACTCCTTGAAAGTGGAAAAGGTTGAAGTCAAGCTCCCCCCTCCCTCCCCTCTTCTCCCCGAGAGAACGTCGAGAGGAAGCGTCGTGGATGGAGAACAAAAGTCTGCGCGAAGAAATATCAGAAAAGTCGAACTAATATTTGAGATTACCCATCAATGCTTTTATCCGTCGTGTATCTCCGAAGAAAAAAAGATGGGTATAATTTTAAAAATACATTTGTCAGTGTCTGATATGCGCCGATCTTCTTTCCTCCATGCATTAACGCTTCGTGCGGATACGACCGGGCGTAATCTTAAAAAAAaTGATCGCCCATATACTAAACCACTGACAGCACGTACAGTTCGCCGAGAAGTTTAAGACCACTTTCAGCGCGTTAAACTTTTACACTCTAATTAGAAATTAATAAAGACTAATAGCTTTATCGTCGGTCTATTAAATCCCGTGGAGGAGGTGCTGGAAGTTAGGCAGTCGCGAAGGTGGTCGGTCGGTACACTCGGCAATAAAAGTTTTTTtCTTCGTGGGATAACGGGGACTTTAACTGCAAGAAGAAAGAAAAAAGaGGAAAGAGAGAAAATTCGTTGAAAGCGAAAGTCGAACTACGTTATTTATCTGATGGCCGACACTTCGAAGATTGTTGCGCGGTGTGTCGACGCAGCGCGCAGACGCTCACCAGACGTTTTCTTTTCATTTCTTATTTCTCCCGTCGAAAAAGGaGAGAAAAGGAGAGGATCGGGACGAAGTGAGAAAGAGTAGCGATAGAACGTCGGAAAGGGGAGAAAAAAaGAAATCAATGAGCGGAAACGTGATACGATGGGCGAAGGGCCAGGATGGGGGACAGGGTCGGCCGTGCTGTAGTTTCTCTATTTCCTTCATTAACTCTGATTCAACAGCAGCGTAACTTTTAATTGGAAGTGGTGGGGGgTCGAGTCGAGGCTCATTTCCCGGCGGACTTCCCCATACTGGAATTCAAGCCAATCCCCAAGCGTCTTGCGAGAGGTCACTTCATTCGTTTCGATATTCTCAGATTTTACTTTCATAGCCGTGCGATATAGCATTTGAATTGCTTCGAATTAGAAACGGTAGCGCCTATCtGTTTTTTTTTtAGAGATTAAAGAGATTAAAATCTATAGAAACGTAGTTTTTCGACAGATAAATGCTTCTTTACTGATGTCACATATGATACGCAAAGGGAAATAGATTATAATTAATGATTGTACAATATTTAGTGATATCCCTAACAATAATCATCACGAATGAATAAATATGAATGCATAAAGCGATGTACAATGGCACATTAATTTCCTATCCATTATAAATATAATAAATGTATAAATTGTATTAATGACGAGAACGGGAGATGCGCTGATTTAATTTAAATAAAATCACCAGAGCGTCATCTTCCAAAACGGAAAATAACGGAAGCGACAGATGCTTTCGAAAGCGTCCGTGCCTCCGTCCTTTACTCATATCCAGATAAAGGCACACGTGACGCTCGTAAGATACTGCCGGAAATAAGAGCGATAAAAAGCAAGGCGAATAGTTACACGCCGGGCGACGAGGtCCTTCGAAACGCGAAAATGACGAGCCTGCGATAAAGGACCGCCATCGCGAACGGTAACGGGAGTAACCCCGTGGAATTTAGTATGATCCCCCGCGCACGGCACATCgCGATACTATCGGTTTCTCGTTACCCAGATCGGTATATTGTTATTTCAGCAGTTCTCGGACAAGGTTCACGGACATTCGATAGCGCGAAGGGGACCGGAGCTCCTGAAGAATTCAACGTAACTGGCCGATAATCGGCCTCGTAACGGCTACCCATTTTCCGCGGCACGTCTCGGGGCACCATCATCGAAACGACCTTATTTCCATGCATCCACGTTGCACCTGCTATTGCACCGTAGCCGTCACCGAACTATATCGTCGTCGAACTTATTCGATGTACGATAATCCTTACGGAAAGCTTTAAACAGCTACCTGCATGCATACTAAAGATACGTCCTTcGCTTCTGGGATCCACCTTGTATTCGCGAGACATTGCATAATTTGTATTCGGTGTAAATATGCTATCCGCGTGGCGATCGACCACCGCGAACCGCGCGGAAACTCCACAGGCGGTGCTTGaTAACGTTCCGCGAACGCAGATAAGCAAATGCCAACTTATGGCAGCAGACACTcTACGCGACTTAACAAAATCTCGTATAAACCCGAGATTTTACATGGCAGTTTAACCAAAATAAATTAAGAGAGAAAGACGGAAGAAGAGAGACAGACGACGTGCCTTGCGTCGTCGTAAGAATACGTAAAACCGTGAAATGCTCGAAATGAAGCCGTCCGtATTTTTTGCATACGCTTATATCCGAAGGGCCGCTCTCCCCTCGTAATTGCGTGTCCTCGACAAAAATCTCCTTTCCGATTTGCCTCTTAGTGCTCGCGCATTCCGTCCGCGTACGACGCTTGCAAATCGGCTGCATCGCCGGCGTCACCGGCAGAAAAACGGAGAACTTTCACGGCGAGTCTCGCGGAGCCTCGTACGGCAAAACACCGCGACGACGCCTGCCATTTTGGGCTGGCACGGGAACATAGCAGGGGCCAGGATGACGGAGGGGTGGTGATAGGAAACCCCGGGCAAAAAACaGCGCCTGCAGAGATCCGCGACGCGGCTGTTTAATTCGGGACTTCGCCCGGGGGAGGAAAAAAAaGGGCCGAGGCCGCCGGCTCCTCCTCGCGCCGCTCATGCATTATGAATAATGGAATTTAAATATACGCGAAACAGATCCCCGCTGGGCGAATTCGGCTTAGAGAGGTGGAGTGCAGCCTGGTGCCACGGTGTCGTGTTCGCATTGTTAATTGGCCGGCCGTTtCATTTtCCGTTGCGGCGGCGTAACGTGAGAGACAGCGTAACGGGAAGCGAAACGACACGGCGCGTCGAGCGTGCACAGAGAAGTTGAATTATCGAGGGaGAAAAAGGGCGCGAGGGAGGGGCAGGACGGTCAAACACTGAGAAAGCAGGCAACATCCGCGAACCCGTAAATTAATTAGGGCCAACATAATAGCTGGGAATGGCTGGTAGTGGTGGTTCGTCTTCGTGGTCCAGCTTGCGGTTCCCGAGTGCCCTCGCAGTTCGCGCGGTCATGCCTCGGAAACTACACGATACACCTTGCGTCATCCTTACACGAGGATACCTGTTTGCGATCCCCTGAGGTATTAACGAGCGTATAAGCAGACTCAAAGTACGAACACACTGCTAGTTTCGCGTGCATGTTGCTCGTCGCCGTGCAGACGAGTTAGTACTGTCGACAAAGTAGTCGTAAAGTACGTAAGAGTCTTCCTCTAGTAGCGAGGACTAATCTCCCGGTACATTAATCTATGTATATTTTATGATATACACACCTGTATTTATCGAAAGTTTATGTTTATCTCGAAAATTAACGTTAATTTTCGAGGACGCAAAAGCTGTCGCAAGTTTaTTTAATTTTGTTTGtATTTTTTTTTCTTGCTCATTTTTATTTCGAAGTATGCAAATTGAATCTCTTGGTGGCATAACGAGAATTTTCGAAGCTTAAAGGGATCTGCGTTGGCGCGAAGAGAATGGGACTTGTTTTAAACTTTTCTTTCCAACGAACACCGTTTCCTTCATTGTAACGTAAAAATGGTGAGCTTCTGCGGCGCGATTGGTCTCATCTTTCCTCTCGTATCGTTCGTCATTTTGTCCGAGAGCGTGTAGATATTGTACCCGATAGAACGGAGACGGAATCACGCTATACGGTTCCCTCCCAAATGTTCGTTCCTCACGGGCAAAAGTACAAGTAAAAGTAGGAGGGTCCGACTTTATGACCCTACGAGGCACAATAAAGGTTGATGACTACTAAACGTAGAAGAACACGTATACAGTCCGACGTAAAGTTATTTTTCAATTCCGCGGTCCTCCGCCGCAGTCGTTTTGCCTTGACGTGGAAAAGGAAATTTCCCGGGTTCTAGTCGTCCGGTCTTCTTTtCGTTCTACTGACAATACCATAtTTTCGACATAGATCGATCTCCTTTtCTCTTtCTCTCTCTCTCTCTCTCTCTCTCTTGAAAATGAAAGCGCGGAAGAGCCGGATCGTGCAACGCACAATGCGAGGCCGCTTTGTACGAGGTACGAGGAGTCCGTCATGCGCCGTAAAACGCCGGAATATGCAATACTACTTTCGCAGCGACGGGGCTTGCACGAAGTAATATCGTGAAACGTAGAACCGTTCTTTTCATACGGATATGCGGGAGAAGTTGCTCGTCCGCCCTCCGGCGCATACACGTCGCGCGAGAGTATCGTGCATCCGAAACTCTGAGATGAAACTCTTCTGTAGTCGATATTCGTCGCGATAAATAGATAAGTCTCGGATAAGGTAGAGACATCGATAATTCCGTTACGAGAATACTCGAGAATAAGATCCAAGTGAAGTGATCACGCTCCATATGGTTTCAGATTAATCTCCAATCGGCTGACGAAGGAGGATCATCCTTTTACCGGTGGAAAATAGGGTGGATTGGCGGAGCAAGAAGGCTAAACAGAGAATAAACAGGAGCTATAGCCGAACGAGGGAGAAGGTAAGTAGATGCCTCGGAGAACGTGAGCGAAAAAAGGAAACGGGTACGAGAGAAAAAGAAACGGACGGAATAGCGGCTCGGGATTCGCATCCCAAGGAAAACCAAGGCTATACCGGGGTCTTGGATTATTCGAGGCACGACGACGATCTTCGAGTCAGCCGACTGTCTGTACCGTGAAAGTGGCACGTATCGATCGCACGGCTGGATTATCTTCCACTTCGATCTACGACGATTACTTCCGCCATCGTATATCCGGGCTTTGCACTAGCGAGGCTATTTAAAAATCTGCGCTCAGTAACTACTTATGATTTTTCCATCAGAAACGATTGTGGAGAGAAAAGAGGGAAAAAAAGAGATAGACAGCCTCTGGCTCGAAATGCTAATTTCGCAATCGAGAATTAAAATGCACTTCTTTGTATCTAAATTTTCGTAGAATTAAAATAAAATTGAATAAAGCAATGAATAAATTGAATAAACTAAAATATAGCTAAATATTTTTTCCTCTATACAAGGTGAATATAATTATCAAATATTTAAGTATGTAGATTGAATTTAAACAGCCTCGGAAGAGAAAAGAATCGGATAAACGAAATGCTTTTGCCTCTATTTTCAAGCACGTGACGAATAAAATCTAGCAAAGCTTTTCGACACAATATGTCGACGCAAATGTGGTCTATTTTGGCTAATGATTATTACCGGGAGTCCCGGGCACGGTGTGTCCGCGGCGCGATAAATTAGAGCGCGAATCGACTTCCACGGCCGTTGTAGAAGGTACTTTGGCAAACGTTATTTCTTCCTGTCTCGAAGGAAGCGCCACTCGAAAACTTGGAAAAGTTCGGCCAGCTGCACGCACCGCGATCTCGGGTCCGTTCTGGCTCGGTGGCGTGCGCAGGACGGTTGTGAGACGAGAGAAAGAGAGAAAGGAGGATAGAGCGCGAGAGGGAAAGAAGAGGGAACATCCGCGTGCGTGGTTGTATATGGCGTTAATGGCGGCGAGCATAAAGCATCCCCGcGCCTGCACGCTCGACCACGGTCGTTTACAGTGCCACTATTTATGTTGATAACTTCGGAATGGAGTCAGATACACGGTACCGAGTGCCGGCGGTTCGGCGGTGGTCCGGCAGCGGTCTGGTAGCACTTGCAAAGTATGGGAGAAGAAGGGGGATGGTGGATTCGCGGATCTTTTTGGTCGTGCAAGGAAGGCGGGTCGGTTAGGGTAGGTAGAGAGGAGACGAGCCGAGTCGAGAACAAGATTCAAGCGGAAAGTTTTGCGAGTTAATGGTGCGGAGGTAATGCCGCGGCCGCAACAGCAACAGCCATCCCCCTGCTTGTGTGTTCGCTCGCTCGCTCACTtCTTCCGTTCTCTCCTCTTCGCGCTAGTTCTCTCTCTCTCTCTCTCTCTCTCTCTGtCTTTCTTGGAATAGCCGTAGAAAAAaGGAAAAGAAAGAGAGAGAAAGAACGAGACTCCTTTCTCCCCACGAATTCTCTCCTCTCTTTAAGCACACTCTCTCTTCCTGCaccCCCCCCCcTCcTCTCTCTCTCTTTCTCTCTCTATTTTCTTGTCTTCCTCACTTGCATCATCCCTCGTCCATTTCCcTCTGGAGAACGTGCCACGTTCTCACTTCCTCGTTCGTCGCTACTTTTTTTCTTCTTTAGTTGTCTCCTCTCACACCCTCGAGACGGCCCGATCTTTTCCTCGTACAGCTCCTATCACGAACAGTCGCTAACAAGTGCATCGAATGCAAGTTGCCGACAAACTTCTTCCACCGATTTGTGCTTGTTCTCTGTGCATGCGCGGGCATGTATATCTCTATTAGGCGACATCTGCTCTCAGCTTTTTCATAACGAGGTAGCGCGGATTCGCGCTCCGAGAGACTTCACGAAGCACTTCCGATCTCGCTACAGTAGAATGCGTATTGTATTTTCTTGTCTCTCTCATTTACTCTTTTCTATCTTTCGTATCTAGCGTGAATACTCCCATGAGGAATGTAGAAACCAGATTTTGAAACGGCTTCTTCGTATCTAAATTTTCTTAACTATTTGTTCCTCAAAGTCCATTTTGACGTTATAAATTTTTATTTTTTAATGAGAATGTTTTTATGTGGAGAAGAAAAGTACAACTTTTTTCAAAATGCAATTAAATTTATACAAGACTACTAAGATAAAAAAAGATGCAAATAATAATGTTCAACTTACTAACTGCTATATTATTAAGGCCAAGTTAAATTACAGATTTATGATGTTGCAGAATATAATAGAAAAGTTTCAAGAAAAAAaCATTTTAAAGCTTAAAGAGTTTGTTTTCAAACCCATCAAATTTTTTtCTCTGGCAGTGCTGCTGCTCTGCGACGTCATTTTCAAGTCGATCAATGCAAAGTTAACAAAAGTATTTCAACTTTAAAATCATGCAGAAAGTTGGGAAAAATTTGTTTTCAATTTTACTTAAATTTTAGTTACATTTTTTCTAAAAGCTTAAAATCTCCTCTTTAAAATATGTCTTTAAAAATTGAGCTATGATTTTTCTTCACCGAGTTATCGTAATTTGAAATGGCCAATCGACGTTTTCTTTTCATCACTAGGAATAATGCCGGCGGATTTAAGCTTTCAGAAGATTCCAAGCAAAGTTGAAAACAATTTGTTTGAATTCCGCATGATCTCAAAGTTGAAGCTAAAAAAATTTCACAAGACTTTAAAAGACAAAAAGTCAATGTTGCACAATCGACTTGGAAATAACGCCACTTGGAGCAGAGAAGTATTGCCGAGAAAAAGCTTCGACCGGGTTTGAAAGCAGGATCTATAGGCTTCCAAACTTTTTTTTtGAAAAATGGAGCTTTAAGGTTACTTTTCACGAAATTGTATAAATTGTGTTTTGTTGtTTTTATCATCGATTCGAAATGATCCTTTCATTTTGTTCAAGAATGTATCATTAACTTAATGCAATGATATCTTTTAATAATTCAATACTTCTTACTAATTAGATTTAGAAAAGTTTCATGAACAATTTTAGAACGGTTATTGCTAATTATTCGCCAAAAACAGACGCCTTTCTTCGAGGAGTAACAACTCAAGCTGCATTCGCGGTTGACGGCTTTCCGCGGCGCGTCGCGGTAATATGGCTCCTATTTAATTACCCGGTCCTTTGGGAGCTTAAACCAGCCAGAGCTGGAACGGCTGCGCCGTTATTCTATCTAGACTCCTTTGCTTTCTCAAGCAGGCGCGCGATCAAACCTTCTCGCATAAAAGACAATCGCAGCTGGCAGTCGACGACGCGcGGGACAGTCGAATCATGACCCGCTCCTCTCTAGTTACCGCCGTCAGTCTGCTCTACTTCCAGACGCCGCGCGTAATCTATTCGACATTAGTTGCTTGATTGCACCGTAAAATGCGACGGCGACGTGAACGAGAACGACGACGACGACGACGACGACGACAACGACGACGACAATGACAACGACGACGACGACAAGAGTGGGTTCGTCGGTGCTCGATGGCGCCTCCGATTTCAACCGCAGCACGATGCAAGCCCATTACTATCGCCCGGAGCTAAACGGCACCCGGAGCTCGTGCCATTAAGGGAATCTAGGGTCCGATCCACCTCATTGAATTCCGTTcAATTGCGATTATGATAATGCGTGAACGATCGCCGTGGACGTGAGCTACGGaCAACgAGGGTGTTCGTTCTCGGCTCAGAGAAACGCAGCGATAAAATTATCAGTGACAGCTTCATTTTTGTTACATTTGACGTTGAAAAATTTGCGAAAGAAATGTGCATTATTCAACTACATTGACAATTGTAAATCTTACATGACCTTTTATTAATATACATATATGTAAGATATATGTTCCATCTCTAGTCTCGTTCGATAATGAAGAAATTATAGCTGCAACATAGACGCCGGATTATTCGCGGGGGAAACCGTTGATTATTTGGACTCCGTGGGCTCGGGGTCTGGTTTACTTCTTCCCTTATGTCCGGAGATAATGGACACTATTACCTTAACGAGGCCCCAGTCTCTTAGCCGGTAATTGCTTCGAGATATCCGAGAGAGCTCCGGCGTACGTTGCCGCCTGGTGTTGCAGGCAGAGAACCCaGACGGTATTATTGCCGCCGAGGCTACTCGCCCGTTTCATGGTGGTTATTGTTATGGGCCTGCGGCATTAAGATGTACACCGCACTCTCATACGGGACCCACCACCCCATATACGAGCTGCATATATACTTATGCGGGGAGGATTTCATTACGCCGCTTCATAACTGGCGGTCTTCAAAATCGCGATAAATCGGGATTGTCTTCCTTGTTAAATCAGCTCCCGTTCCCCTTCCTCATATCGCTGACGAAGCCAACGAGACGGATTAATCTGCGCGATTGAATGGCCTCTATGACGTAAGCGCACCGTTTACTGGCACGGCTCCTCGTGTTCACGTGAAACGATTTGCGTCGTAAATATTTTATTTTAACATCGCAACATCAAGACAGACGAGGATCGGCTATTGCCTCGTGATCCTAAAAGGGAGATTCTCAAGGCGGAATCGGGGTAACGCGTTCGTTGATCTCGCCAAACTACGGCATCTTGAGGACTAGTCTTAGaGGAAAAAAAGACGACGAGGAGACACGGTGAGCATTAGATGAGAAAGAGACGGCGCGGCGCGGTGCGGCGGAGCGAGACGGAAAGAGATCAAATCTGGATATCAGGATTAGGGTGGGTACGTAATCCGCAGGACGACGGGTGGTAGGAACGGTGGATCCGTCGGCAGATCTCATCGCGCGGAGGAACTCTGCGGGTACTTCGCCCGGCCAAATACGACAAGAGCAGCAGCCTAACTCGAGTAGAGCCGCCGAGATGGTTTACGGCTCGTCGCAAGTTGGTGAAATTTAAGAGCGCATTTAAAACTGGTTTGGCGCGATGCCCTGCGCCTCCCTGCAGCAGGTACTGCCGGCGAGAGATACCGCTGGGCTCCGACAGGAGTTTATCGCCCTAACATTCTTGGAAAGCTTCGGATGGATTTCCCTCCTCAACCCATTTCTTCCGGCAGCACCTCAAATCGTCGCTTCTTTCGAGCTCCCTGTGTCGTCCGTTCTTTCCCCTTGCCACGAAAATTCTTCCAACAGACTCGACACCAAATCGTCACGACGATCATCGGTCAAGATACCCGGAGAAACGTCGCGGACGATGTTACCGTCAGATCGGACGCTTTTAGCTCCGCATTGGAGCTTTTTCCTCGACCGTCTCGGAGGAGATTTAGCGGACCGACAACAAATGAAGGTCATCTTTCGCGCGAGAAAGCGTCTCACCTTGATTTCCGTCTCGTTTCTCCGGAAATTGGATATCAACCGGACGAACTGTAATGTGTACGTTAAATTGCCAATTATATATATAATGTAACAGCTGATTATCTCAAGTGTCCTAAAAACCATTATACTCTTAATTTCTGTGAAAAATGGCGAAAATAAAAAAGAAACCGATCTTAATAAAGATATTCTTCCTGATAGATGCCATGACCCACGTGGAAAACTTTTTAGTTTTGTACAGTGGTATTATACGTTATCTTCCGCTGAACGTAAGACGTGCCTATCGCGCAATTTCATCGCGACGTCGTCGTATAGCGATTATGGCTACTCCATTAAAAATGAATTTTATAAAGGCAATCTTTCCAAGCGATCGTTGTAGGAGAAAAAGGCGAAAGCCGGAGCCAAAGGGGATGAGGCCACTACCTTTGGCTGATCCACTTCGAATGATAATCACCTCTAGGAGACTCAATTTCGCCCTGCTCCGCGTCCTTACCCGTTCCTATCTTCGGAAGGTTCAACGCCGCAGCGGACTGCATCTTTCACTCCCTTCGTCACCACCGCCCTATTCCTATCGCCCTCCGCGCGCCTACCGCCCCTATATCCTTCCCTTCCTTCACtCCTAGACTATTCTGAACGACCTCTTCCCCCATTCGCCAACGCTCACTCCTAACTGATTGGAGTACCAATCAATGCGGCATTCAGGCGGCCGTGCTGAAaCTTTAGGAAATTAACTATTCACTCTCTGGAAATGGTTATTTGGAAGGCCGGAAAGGCAGTCGGGACTACGTTACGGAGAATATGAGCGAAAGGATATGGGCAGACGATGACGATAGATAAGACAGTTCAATTTTCAATTAGGCGTGGACGTAGCGCGCGTAGCTCGGCTCCTACCACGATGCTTACCACGCGTCCCATACGTTAGGAGGAATCCGAGTTATCGATTGCACCTACCTCTTCTCGCGGATCTTACCGCTGGGAAATTTCGTTCGCTTAATGAGATTATTGTACTCGCCGGAAAATCCAGAtAAATTATTCTGATAACTGCGCAACGTCGCCTTCTGGTGTTTCGCGAGCCGATAACGAGCATAGTTAACCGCAGAGAAATTATACGGTCCACATGAGAGAATATATCACTTGGCgCGGGGACGGCGGGGGGGGGGgaGGGgAGACGTACGCGCCTCCAGTCGCGCaCAAAGCCACGCGCGGAGAGAAGTATTAtCCCGGTACTCTCTTTCGTTAATGGCATCGTGGTAATCAACAAAGCCGTACGAGGTAATACGAGGTATTTACAGTTACTCCGGCGCAGCAGAGGGCACCGAGCGTCGAGCGGTGGTAGGCGCATGGGCGGATTTATGCGATGTTTACACTCGCGCAAGGCTGCTAATACGGCTTTCCGACAACAACCCCCACGCCACATCGCGACTGCTCGACTGTGCCCCGCTGTTTGAAAGTGAAACTTCTGTTTGTACGCTAGTTCCAGCTAATCCCGAATATTACGCCGGCGAATCCACCGCCGCGCGTCGGGAGAACGGAAGGCGGCAGCCCAACGCTCAGAACTCTCCGCAAGGCAGAACTGGcGCCTTGAGCGAGGCGGAGAAGTTTTCACAAAGGAGAAGCGCTTCGCCAACTGCCGTGCCGAGCAAAATACATGCGCCCTCTACGAACGTTTGTATTAGACAAGAGCCAATCTTAGCTCCAGCTGGTCCATCTTACGAGAAATCTCGTCGGTTTTGATTCGCACGTCTCGAATCGTATAATAAAAACCGTTGGCCCCAACGACAGTTGTTCGCGTTTACGTTGAGCAACGTTCGACGTGTCTAGAAAACTACAATAAACGGGAGTCGAGAAACGAGATCGTTGAGAAAGGTCTGAAAGATTTCAGCCCTACTACGCCGTGCTGGTACTCCAGTGGATATGTATATACATATACGTATATGTATATGCTCACCGAGAATCCGATCTGCAGTCTCAAACAGCTATTTACCGTAACGTCGGTGGATCGTCGGTACTTTTGTAGCCAGAAGGGAAAATGGATCGGCGGAATCGCCGAGAATGCTACTGCAGTATACAAGTCCGCCGCTGTAACTCATTCACCGATAAATTTCGGCGGGCGAAGCATCGATCACAATGTTCTATTTTGCTCATTGGTGTTTTTGCCGGCGGCATTCGATTCAATTCGAGGATGCGGTGAGTCGATATATCTCTCGAGGTGTCATGAGCCGCCCGTAATGGATGGCGGCTGTTCCCTTACCCTTACTCCGAAAGGAGAGGCCGTGCCAAGAACCTACCCGGTACCCGCAACCCTCTGCTTCTGATCCTGATTTTTGTCTGCTGTCTTCTTCTTATTCCACGCATTATTCTCGCTCTTTCCCTCGCACGGCATTCGCAAAGTGGCCAACAAAGTAGTCTACGTATCTCTCCACGCGATATTTCGTAGCACATTTTCACATTTAATCGCGAGACAATCGGAATCCAAAATATTATCCGTGATATTTAATTTACAAACGTCGTTTGCGTTACCAAAAACAAAATATTTTTTGATTAATCAAATAATATCATAAAAAATTCTTTCAATGCATTTTACCTTTGATGACAGAAAAATCGCTGCCACTCGGGGAACGATGGGATGGATTTCGCGGTAGAACAAGCAAACGCGAAAGCACGCGAAAGGAAATGCACGCGCGACTGGTCCGACAAGGATATTCATAACGTGTGAACGATCACGAGCATTCAAGCTTCGGCGAATGCTAAGACGTGCGAAAATGAGGGTTAAGAAAATCTCGAATCGGGTAGATAATCCTATAGCAGTAGTGAAAATACAATAGGCTTTAATTGCCGGACTTAATTCGTAGAAATACGATTTCGATTAAATGTAACGACTCTCGTGTTTCTCAAATAAAAATGGCACTGAATATAATTGATCACTTATTGTCACTTATCAGTTATATATTAATAAAACAAGACGTTAAAAATTTAATAAGAAAAGATGAAAAAAACAcGATGTTAAAAATTtAACCCTAAAATTAAAATTTTTGTTTATATTAATTATTTTTtATTTATTTTATATTTTTGTCACTAAATTTTGCTTGTATTAAATAAAAATATAAATTTTAATAAGGAAATACGATTTAGGCAGTACTTTCCCCTATTAAGAGGAAGTACATATAGTTTCTAAGAATATTTAGAAATATTTATGCTTTGCATTTAAATAAAATAAAAATGGCGAATTTGTTGCGCTAAATTATTAGTTCTTCAAATATTATTGCAAGAATTATAATCAGTATTGTAAAGTAAATTTTACGAGTATTTTAAGAACTGTTTATGATCTAAAAGGTTATAATTTTGTAAAACGGAGGAATAATCGTCCATCTGTAATGATGCATTTACGTCACAAGAGTGACAcTTCCCTTGAAGCTCGACAGTGCTAATACGAGTTCAAGCCTCTACCATTCTCCGAAGTGAGGATCGATAGTCTAGCATCATTGGGCTCCAACGGCTGATGCGGAGGAGTCGAAGATAGAAGGTACAGAAGGGAGAGGATAGGTGGATGGTGGACGGGGACTTTCGCATGCGGAGTACTGTCGGCTTCGGTTCGCCGAAGGATTAGCATGAATGGCCATCGTCTGCGCCAGTCGAGATAGCAACACAAGCTCTCGCCTATTATTGCTCGGAGAAATTAATCGCATCCATCGCGAATTACTCCGCGATTGCTTTTCCGAGTACAACACGTCTCGCGGTCGATGGGGCAGAGACAGCGGGGAAAGTTCGTGTGATAGGCGTCGTAGTAATTATCAAACGGAACAAAAGAGCGGGCTACCTAATTACTACTTCAATTCCCATATTTATTAATTAGCAATTTATATTATAAAAATTAAAATACTATAAATTGTATATCTTTTAATTATTAACTAATGGGAGACGTTAATAAGTTTAAAACGATGACGACATTAGGAAAAATGTTGCTTTTGTAACGTTTCTAATATTCGTCATTTGAGGAACAGTCGTTACCATTCAGTTCGTATCGAGCTGGATTTGATATAATTATGAAATAGCACACAGCACCCGCTTCCCATCAACATCCCTCCTTTTCCCGAGTCCCTCGGTCTCGGAGAAACAGAAGCGTAATTCCACTCCGAATGCAGTCATTATGGAGATTTTATCAAGGCCTGTTGCCAGCTAGCTAGGCTTCAACTTAACTCGCCCATTACGGCGTTTCCTTTGTCTGCGACTTACACATCGTCATGTCCCGCAGTTCCATTCCGAGCCGAGGGGAACACTGATGAAAGAGATCCGACGATTCGCATACGACGGTTCGATTCACAATGCCGCGCATTCACACACGCGAAAGAGCTCGCGAAGGATTCTGGGATGCATCCGAAAAATAATCACGTGGGACACGAGCATCGACAGCTTGCGCTAGCACGCACGATGGAAACCGCCTGCACAGTTGGACGGATCAGCGAGAGGCGTCGTTTTACGAGGCCGATACTTGAAACGTCGTGCTGAAGTATAAAATTCTACGGGTCATGCAACTGCCAGAGTACCGATCACGCGCATACGTGTACGTAATGCGACTTTAATGCAGCATGTATTATTCAACGATACCACCCGCCGTGCGTGTCCGTCCGACAGCTCGTATACGCGTAATTACGAGCATAGTTGTTTATTGCGAATCCGGCAGTCCGCGCAACGCCTTATCCGGCGGTTATAGACGCGCGTCTCTCAACCCTCCTCGCCAAACGATCCCGATACCTATCCGCCAACCCTCTCGTCCCTCATCTCTCGTCTCTCGTCTCTCGCCATCTCGCCTCCTCGCCTCCTCGCCTCCtCGCctCcTCGCCCCGCCTCCTCTCGCGTCGACAACCCGTGAAGAATGATCGTGGAGTTTCCCTGGTTGGACAGTGACAAATAGAGTATATTCTTGCCAGCTCGGAGAAGCGTGTTTTACGCTTTGTTTACCGTTTGTCATGTTATGCAAAGGAAAGGAATTTTTGCCGGTTGCGTATTCGGCGAATCGACCCTTATCCTACGCGCATGATATGCCTGCGGTAGTTTTACATTATCGCTCGGCTGCGGCGCGGCGCGGCGCTACACAACCATTACCACTAACGATTCGGAGCGACTCCAGGAGTAAAGAGAGAAAGGCCGTAGTCGCATTTGCGGTAAATTGCGCCCATTTACTTCGATTCGCAAACGTAATTTCTGAGATtACCTCACGCTAGTGTCCAGCGATGGTAAAAGTAAGCTCTCTGACCTTCCACGTACGATCCAATATACGATAGAACAATGCATTATTCGACGGCGCGAAATTATTCATAAGCGTCACGAATTGTCCGCAGCGAACGTGGAAACGTTAAACGTTCGGATCtAGTTTGACGAATCTAGAATCGAGACAGTCGGAACCGGACCCTCTCGACTCTCGATCCGGGTGTTTCGCGCGAGGCACGTCGCAATGTTCGGAAAATTGAGGCGAGCCGCATAATACGCATATAGTCGCGTAATAATATGCATACAGGTGCATGCGTGACGACGTGACGCAACGTGACGCCCGAGCGCGGGTCGTTAGCCGTACGGtCGTTTCGCTGGCAAGTCACGCCTAATCACCGATGAATACAACAATCGCAGCATGATGGACGATCGGCAAGCAAGCCGTCCTTTATTAGCGTCGGGTACCCGGGGGCGAGGGGCGGGTGGGAGAGCGGGTGATCATATAATTAGATTAGCGCGCGCCCATCGCGCGTCGTCATCATTTTCCGCGGTGATTTTGGACCGCGCGGGAGATCGGTAACCGCCCTGCGCCGCGCCGCGTCGCGTCGCGTTGCGGCACGGCGGTTACtGTTCTCTCGatACtcGTTGGGATTGTGCTGC
CCTTTGATGTCGCAACGCAGGACGGCACAACGTTCtCTTCCGgCgTaCGAAATCATTTtGGgACACcGTCGCGcGCGGcaCGAGaTTtAATTTgCGCCGGCAACGATCgCGC
But making ituseable is hard.
Sequencing a genome is now cheap & easy.
Gene predictionDozens of software algorithms: dozens of predictions
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Gene predictionDozens of software algorithms: dozens of predictions
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Gene predictionDozens of software algorithms: dozens of predictions
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate:
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate: •missing pieces
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate: •missing pieces•extra pieces
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate: •missing pieces•extra pieces•incorrect merging
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting Ya
ndel
l & E
nce
2013
NR
G
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting
Visual inspection... and manual fixing required.
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting
Visual inspection... and manual fixing required.
1 gene = 20 minutes to 3 days
Yand
ell &
Enc
e 20
13 N
RG
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting
Visual inspection... and manual fixing required.
1 gene = 20 minutes to 3 days
15,000 genes * 20 species = impossible. Ya
ndel
l & E
nce
2013
NR
G
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
Monica Dragan https://github.com/monicadragan/GeneValidator
Monica Dragan https://github.com/monicadragan/GeneValidator
Monica Dragan https://github.com/monicadragan/GeneValidator
Gene predictionDozens of software algorithms: dozens of predictions
20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting
Visual inspection... and manual fixing required.
1 gene = 20 minutes to 3 days
15,000 genes * 20 species = impossible. Ya
ndel
l & E
nce
2013
NR
G
GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA
Evidence
Evidence
Consensus:
http://afra.sbcs.qmul.ac.ukAnurag Priyam
Crowd-sourcing the visual inspection + correction.
Begin
EĞĞĚƐ�ĐƵƌĂƟŽŶ
�ƌĞĂƚĞ�ŝŶŝƟĂů�ƚĂƐŬƐ
Being curated
Curate
Being curated
Curate
Being curated
Curate
Submit Submit Submit
�ƵƚŽͲĐŚĞĐŬ
�ŽŶĞ
/ŶĐŽŶƐŝƐƚ
ĞŶƚ͗�ĐƌĞĂ
ƚĞ�
“ƌĞǀŝĞǁ͟
�ƚĂƐŬ�
�ŽŶƐŝƐƚĞŶƚ͗�create nexƚ�ƌĞƋƵŝƌĞĚ�ƚĂƐŬ
http://afra.sbcs.qmul.ac.ukAnurag Priyam
Crowd-sourcing the visual inspection + correction.
Begin
EĞĞĚƐ�ĐƵƌĂƟŽŶ
�ƌĞĂƚĞ�ŝŶŝƟĂů�ƚĂƐŬƐ
Being curated
Curate
Being curated
Curate
Being curated
Curate
Submit Submit Submit
�ƵƚŽͲĐŚĞĐŬ
�ŽŶĞ
/ŶĐŽŶƐŝƐƚ
ĞŶƚ͗�ĐƌĞĂ
ƚĞ�
“ƌĞǀŝĞǁ͟
�ƚĂƐŬ�
�ŽŶƐŝƐƚĞŶƚ͗�create nexƚ�ƌĞƋƵŝƌĞĚ�ƚĂƐŬ
http://afra.sbcs.qmul.ac.ukAnurag Priyam
Crowd-sourcing the visual inspection + correction.
Scientific software + Facebook + Points + Badges + Redundancy
My aim is to make bioinformatics software:
My aim is to make bioinformatics software:
Shared
My aim is to make bioinformatics software:
Shared
Robust
My aim is to make bioinformatics software:
Shared
Robust
User-friendly
With the fellowship I will:
With the fellowship I will:
• Organize a local Software-Carpentry-type event.
With the fellowship I will:
• Organize a local Software-Carpentry-type event.
• Include software & reproducibility best practices on two new MSc & existing BSc.
With the fellowship I will:
• Organize a local Software-Carpentry-type event.
• Include software & reproducibility best practices on two new MSc & existing BSc.
• Promote/lobby in line with SSI’s mission (internally/talks/conf//networking/publications/web).
With the fellowship I will:
• Organize a local Software-Carpentry-type event.
• Include software & reproducibility best practices on two new MSc & existing BSc.
• Promote/lobby in line with SSI’s mission (internally/talks/conf//networking/publications/web).
• Seek funds to create great sustainable bioinformatics software.