2013 11-13-SoftwareSustainabilityInstitute@Manchester

45
Ant genomics & Bioinformatics for emerging model organisms @yannick__ http://yannick.poulet.org

description

 

Transcript of 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Page 1: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Ant genomics & Bioinformatics for emerging

model organisms

@yannick__ http://yannick.poulet.org

Page 2: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

© Alex Wild & others

Page 3: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Animal biomass (Brazilian rainforest)

from Fittkau & Klinge 1973

Other insects AmphibiansReptiles

Birds

Mammals

Earthworms

Spiders

Soil fauna excluding earthworms,

ants & termites

Ants & termites

Page 4: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

454IlluminaSolid...

Page 5: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

454IlluminaSolid...

Page 6: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

This changes everything.454

IlluminaSolid...

Page 7: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

This changes everything.454

IlluminaSolid...

Any lab can sequence anything!

Page 8: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Major questions

Page 9: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Major questions

Which genes are involved in social behaviour?

Page 10: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Major questions

Which genes are involved in social behaviour?

Which genes make it possible for queens to live up to 30 years?

Page 11: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

We need great tools.

Page 12: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Tools for genomics work on emerging model organisms

Page 13: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Tools for genomics work on emerging model organisms

Antgenomes.org

Page 14: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Tools for genomics work on emerging model organisms

Antgenomes.org

SequenceServer.com BLAST made easy

Page 15: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

CTTTTTtGGTCGAGGTAACCCACACGTCGGGGGgTACGGTTGTATACATTTATGGGAGTAGCACGCTGACCCATAAACATTACTTAGTAGCCGTGCGACATAAAAATTGCCACTCGCGTATCTCAGGTGGTAGTAGTACTCTTAAAAAGTGTCAGCGTGAAAGTAGGCCAAATTACTGCATTTCCGTGTACACGTATCGTAACGCCTTCTCGCGATAAAAGAAGAATAAAATTTTCTCCTAAATATATATGATACCGCAAATGGATTTTGAGTTAGAGAGTAAGATGCAACGAATTGAAATTTATATAAATAATATATGATATTTTCTTATAAATCTTACTAAAtATTTTTTTTGTCAATTCTATCAGAATATTTCGAACGCACGTTTATCTGAACTGAGACGGCGGCATCAATTAGAAGTATTGTGAACGCAGTTGAAATGTATATCCATGAGAAACTCATACTAACGCAGAGCAGTGAAAACGAACACTTGACTCAGGTTTAACCGCGACGAAACGGTATCGGATAATGGACGGGACGAAAAGCGAATTCGACGGCTGGTATCGAGTTACGTGAAGATGTACCTCCGTCTTCGTCGTCATTGAAATTGACCGCGATACGCGAGGTTCACGGAGAGAGACAGAGATCGTAtCTTCCATCTTCTAAATTTCCTTGATTTCTCGTTTATACTCGCGGCAGCAGCCATTTACTTGTTCCACTTGGCGTTCTCAATTTTGCGCTTAGGGAAACAATTAAACCCAAGGATCTCGCATTACATTTTTCCTCCCGCATCTCTACATGTAGCGTATTAAAACCCCGTAGATGCCCGTTCATGCAGAACAAACAACACGGAGCAACATTCGTGCTCTATTTATTTAAGACTCTTATCCAAGGGGGATGAGTTGCAGCGCAGCGAAACTGGTCTGGTATTTATAGTATTTGGATTATGGAATAATCCCTTGCATAGCGACGGCGAAAAGGGAAGGAAAGCCAGGAATGCACGGTGTTTACCGATATCCTTCACCTCTTTTACCTCTTCGTATTCCTCTCTCTCGCGTTTGCCAtCTTTTTTTTtCTCTAAGCGATCTCATTGGTACCGCACGAGGAGCTTCAATCTTTTCTCTTATCTGGATTACTCGTACACAGAGAGTTACAATCCGCTATAAATTCCATCTTTTAAACGCGAAATTCATTCAATTTATTTGCGACCTCAATCATTTTCCCGTGAGATCGACCGAAAAATTCTTAATAAGGATTTATTCGTTTAATCGCGAGCTCTGATAATTTCATTACTCATTGTAACATAAGCGCATTGATATGAATGTACGTATACTGCAAAGTATAGAAAAaTAAATCCGCGTTAAAGTTCTAATTTTTGAAACGTTTATTATTGCTCGTAGAAAAGAAATATTTGTATCTGTGAATACATGTGTCGTAATATTTTAAAAATTAATTCTTTACCTTTATCACGAAAGCTAATTATTTACATATTTACAAATAAACAAAAATATTAGCGTACAAAAATAAGTAAGAGAAAAGAAATTAATTTTGTCTTGGACAAAACGTTTCATTAATGTCGTATTATGTAAAAAAAAATTTAATAAAATGTCAAGATTTTATTTTTGCGAAAGAATAAAAACAAGAGCTTAAGTTCATTCTTTTTGATAGATAATACAAGGTTCTATTTTTGATAAAGACTCATTTTTAGAGGGTTTTCTTTTCACATTTTCCGCGGTACGACTACAATGCATCGTCGAAATACAAAACAAAAAGAAAAGAAAAAGAAATCACAGCGCATCTCGTGCAAAAGCTGCGAACAGTCTACCGTCTACAAACATCGTCAGGAAGATTCTTTCATCTGGTCACGGGGGAGACATCGTTCGCGCAGGTATCTCTCACCGTTGCTGTATATTCATACGTATCTTCGACCTAGTCAAACAACGATTCGACCCAGCACAAAGCAGAGCGGGCGTCAGCGTAGCGTGAAAATTTAACCCGCGACTTGGCGGAGATTTGATTTTTTGGCGTCGGTCAAAAGGCAGACGTGCCCTGATATATCCGCGACGTTATTTGATGCACGGCTACCGGTACAATTGGACTGAGGTAAATAAGAAAGCAGCAACGACAAAAGACTTCCTTCTGTGTGGATGTTAGTCGTACCCTTGGGCGTTGTCTGCCGCCGTATCTCCTAAGTTGGTCGGGGAACGCGGAGTAAGGTTGGACGAATCCTAGTAGATAGCACAGCTGTATTAAATAAATAGAGAATCGCCGAGGCTTTCGACGATGTAGGAGGAAAGAACCGTACTCCTTGAAAGTGGAAAAGGTTGAAGTCAAGCTCCCCCCTCCCTCCCCTCTTCTCCCCGAGAGAACGTCGAGAGGAAGCGTCGTGGATGGAGAACAAAAGTCTGCGCGAAGAAATATCAGAAAAGTCGAACTAATATTTGAGATTACCCATCAATGCTTTTATCCGTCGTGTATCTCCGAAGAAAAAAAGATGGGTATAATTTTAAAAATACATTTGTCAGTGTCTGATATGCGCCGATCTTCTTTCCTCCATGCATTAACGCTTCGTGCGGATACGACCGGGCGTAATCTTAAAAAAAaTGATCGCCCATATACTAAACCACTGACAGCACGTACAGTTCGCCGAGAAGTTTAAGACCACTTTCAGCGCGTTAAACTTTTACACTCTAATTAGAAATTAATAAAGACTAATAGCTTTATCGTCGGTCTATTAAATCCCGTGGAGGAGGTGCTGGAAGTTAGGCAGTCGCGAAGGTGGTCGGTCGGTACACTCGGCAATAAAAGTTTTTTtCTTCGTGGGATAACGGGGACTTTAACTGCAAGAAGAAAGAAAAAAGaGGAAAGAGAGAAAATTCGTTGAAAGCGAAAGTCGAACTACGTTATTTATCTGATGGCCGACACTTCGAAGATTGTTGCGCGGTGTGTCGACGCAGCGCGCAGACGCTCACCAGACGTTTTCTTTTCATTTCTTATTTCTCCCGTCGAAAAAGGaGAGAAAAGGAGAGGATCGGGACGAAGTGAGAAAGAGTAGCGATAGAACGTCGGAAAGGGGAGAAAAAAaGAAATCAATGAGCGGAAACGTGATACGATGGGCGAAGGGCCAGGATGGGGGACAGGGTCGGCCGTGCTGTAGTTTCTCTATTTCCTTCATTAACTCTGATTCAACAGCAGCGTAACTTTTAATTGGAAGTGGTGGGGGgTCGAGTCGAGGCTCATTTCCCGGCGGACTTCCCCATACTGGAATTCAAGCCAATCCCCAAGCGTCTTGCGAGAGGTCACTTCATTCGTTTCGATATTCTCAGATTTTACTTTCATAGCCGTGCGATATAGCATTTGAATTGCTTCGAATTAGAAACGGTAGCGCCTATCtGTTTTTTTTTtAGAGATTAAAGAGATTAAAATCTATAGAAACGTAGTTTTTCGACAGATAAATGCTTCTTTACTGATGTCACATATGATACGCAAAGGGAAATAGATTATAATTAATGATTGTACAATATTTAGTGATATCCCTAACAATAATCATCACGAATGAATAAATATGAATGCATAAAGCGATGTACAATGGCACATTAATTTCCTATCCATTATAAATATAATAAATGTATAAATTGTATTAATGACGAGAACGGGAGATGCGCTGATTTAATTTAAATAAAATCACCAGAGCGTCATCTTCCAAAACGGAAAATAACGGAAGCGACAGATGCTTTCGAAAGCGTCCGTGCCTCCGTCCTTTACTCATATCCAGATAAAGGCACACGTGACGCTCGTAAGATACTGCCGGAAATAAGAGCGATAAAAAGCAAGGCGAATAGTTACACGCCGGGCGACGAGGtCCTTCGAAACGCGAAAATGACGAGCCTGCGATAAAGGACCGCCATCGCGAACGGTAACGGGAGTAACCCCGTGGAATTTAGTATGATCCCCCGCGCACGGCACATCgCGATACTATCGGTTTCTCGTTACCCAGATCGGTATATTGTTATTTCAGCAGTTCTCGGACAAGGTTCACGGACATTCGATAGCGCGAAGGGGACCGGAGCTCCTGAAGAATTCAACGTAACTGGCCGATAATCGGCCTCGTAACGGCTACCCATTTTCCGCGGCACGTCTCGGGGCACCATCATCGAAACGACCTTATTTCCATGCATCCACGTTGCACCTGCTATTGCACCGTAGCCGTCACCGAACTATATCGTCGTCGAACTTATTCGATGTACGATAATCCTTACGGAAAGCTTTAAACAGCTACCTGCATGCATACTAAAGATACGTCCTTcGCTTCTGGGATCCACCTTGTATTCGCGAGACATTGCATAATTTGTATTCGGTGTAAATATGCTATCCGCGTGGCGATCGACCACCGCGAACCGCGCGGAAACTCCACAGGCGGTGCTTGaTAACGTTCCGCGAACGCAGATAAGCAAATGCCAACTTATGGCAGCAGACACTcTACGCGACTTAACAAAATCTCGTATAAACCCGAGATTTTACATGGCAGTTTAACCAAAATAAATTAAGAGAGAAAGACGGAAGAAGAGAGACAGACGACGTGCCTTGCGTCGTCGTAAGAATACGTAAAACCGTGAAATGCTCGAAATGAAGCCGTCCGtATTTTTTGCATACGCTTATATCCGAAGGGCCGCTCTCCCCTCGTAATTGCGTGTCCTCGACAAAAATCTCCTTTCCGATTTGCCTCTTAGTGCTCGCGCATTCCGTCCGCGTACGACGCTTGCAAATCGGCTGCATCGCCGGCGTCACCGGCAGAAAAACGGAGAACTTTCACGGCGAGTCTCGCGGAGCCTCGTACGGCAAAACACCGCGACGACGCCTGCCATTTTGGGCTGGCACGGGAACATAGCAGGGGCCAGGATGACGGAGGGGTGGTGATAGGAAACCCCGGGCAAAAAACaGCGCCTGCAGAGATCCGCGACGCGGCTGTTTAATTCGGGACTTCGCCCGGGGGAGGAAAAAAAaGGGCCGAGGCCGCCGGCTCCTCCTCGCGCCGCTCATGCATTATGAATAATGGAATTTAAATATACGCGAAACAGATCCCCGCTGGGCGAATTCGGCTTAGAGAGGTGGAGTGCAGCCTGGTGCCACGGTGTCGTGTTCGCATTGTTAATTGGCCGGCCGTTtCATTTtCCGTTGCGGCGGCGTAACGTGAGAGACAGCGTAACGGGAAGCGAAACGACACGGCGCGTCGAGCGTGCACAGAGAAGTTGAATTATCGAGGGaGAAAAAGGGCGCGAGGGAGGGGCAGGACGGTCAAACACTGAGAAAGCAGGCAACATCCGCGAACCCGTAAATTAATTAGGGCCAACATAATAGCTGGGAATGGCTGGTAGTGGTGGTTCGTCTTCGTGGTCCAGCTTGCGGTTCCCGAGTGCCCTCGCAGTTCGCGCGGTCATGCCTCGGAAACTACACGATACACCTTGCGTCATCCTTACACGAGGATACCTGTTTGCGATCCCCTGAGGTATTAACGAGCGTATAAGCAGACTCAAAGTACGAACACACTGCTAGTTTCGCGTGCATGTTGCTCGTCGCCGTGCAGACGAGTTAGTACTGTCGACAAAGTAGTCGTAAAGTACGTAAGAGTCTTCCTCTAGTAGCGAGGACTAATCTCCCGGTACATTAATCTATGTATATTTTATGATATACACACCTGTATTTATCGAAAGTTTATGTTTATCTCGAAAATTAACGTTAATTTTCGAGGACGCAAAAGCTGTCGCAAGTTTaTTTAATTTTGTTTGtATTTTTTTTTCTTGCTCATTTTTATTTCGAAGTATGCAAATTGAATCTCTTGGTGGCATAACGAGAATTTTCGAAGCTTAAAGGGATCTGCGTTGGCGCGAAGAGAATGGGACTTGTTTTAAACTTTTCTTTCCAACGAACACCGTTTCCTTCATTGTAACGTAAAAATGGTGAGCTTCTGCGGCGCGATTGGTCTCATCTTTCCTCTCGTATCGTTCGTCATTTTGTCCGAGAGCGTGTAGATATTGTACCCGATAGAACGGAGACGGAATCACGCTATACGGTTCCCTCCCAAATGTTCGTTCCTCACGGGCAAAAGTACAAGTAAAAGTAGGAGGGTCCGACTTTATGACCCTACGAGGCACAATAAAGGTTGATGACTACTAAACGTAGAAGAACACGTATACAGTCCGACGTAAAGTTATTTTTCAATTCCGCGGTCCTCCGCCGCAGTCGTTTTGCCTTGACGTGGAAAAGGAAATTTCCCGGGTTCTAGTCGTCCGGTCTTCTTTtCGTTCTACTGACAATACCATAtTTTCGACATAGATCGATCTCCTTTtCTCTTtCTCTCTCTCTCTCTCTCTCTCTCTTGAAAATGAAAGCGCGGAAGAGCCGGATCGTGCAACGCACAATGCGAGGCCGCTTTGTACGAGGTACGAGGAGTCCGTCATGCGCCGTAAAACGCCGGAATATGCAATACTACTTTCGCAGCGACGGGGCTTGCACGAAGTAATATCGTGAAACGTAGAACCGTTCTTTTCATACGGATATGCGGGAGAAGTTGCTCGTCCGCCCTCCGGCGCATACACGTCGCGCGAGAGTATCGTGCATCCGAAACTCTGAGATGAAACTCTTCTGTAGTCGATATTCGTCGCGATAAATAGATAAGTCTCGGATAAGGTAGAGACATCGATAATTCCGTTACGAGAATACTCGAGAATAAGATCCAAGTGAAGTGATCACGCTCCATATGGTTTCAGATTAATCTCCAATCGGCTGACGAAGGAGGATCATCCTTTTACCGGTGGAAAATAGGGTGGATTGGCGGAGCAAGAAGGCTAAACAGAGAATAAACAGGAGCTATAGCCGAACGAGGGAGAAGGTAAGTAGATGCCTCGGAGAACGTGAGCGAAAAAAGGAAACGGGTACGAGAGAAAAAGAAACGGACGGAATAGCGGCTCGGGATTCGCATCCCAAGGAAAACCAAGGCTATACCGGGGTCTTGGATTATTCGAGGCACGACGACGATCTTCGAGTCAGCCGACTGTCTGTACCGTGAAAGTGGCACGTATCGATCGCACGGCTGGATTATCTTCCACTTCGATCTACGACGATTACTTCCGCCATCGTATATCCGGGCTTTGCACTAGCGAGGCTATTTAAAAATCTGCGCTCAGTAACTACTTATGATTTTTCCATCAGAAACGATTGTGGAGAGAAAAGAGGGAAAAAAAGAGATAGACAGCCTCTGGCTCGAAATGCTAATTTCGCAATCGAGAATTAAAATGCACTTCTTTGTATCTAAATTTTCGTAGAATTAAAATAAAATTGAATAAAGCAATGAATAAATTGAATAAACTAAAATATAGCTAAATATTTTTTCCTCTATACAAGGTGAATATAATTATCAAATATTTAAGTATGTAGATTGAATTTAAACAGCCTCGGAAGAGAAAAGAATCGGATAAACGAAATGCTTTTGCCTCTATTTTCAAGCACGTGACGAATAAAATCTAGCAAAGCTTTTCGACACAATATGTCGACGCAAATGTGGTCTATTTTGGCTAATGATTATTACCGGGAGTCCCGGGCACGGTGTGTCCGCGGCGCGATAAATTAGAGCGCGAATCGACTTCCACGGCCGTTGTAGAAGGTACTTTGGCAAACGTTATTTCTTCCTGTCTCGAAGGAAGCGCCACTCGAAAACTTGGAAAAGTTCGGCCAGCTGCACGCACCGCGATCTCGGGTCCGTTCTGGCTCGGTGGCGTGCGCAGGACGGTTGTGAGACGAGAGAAAGAGAGAAAGGAGGATAGAGCGCGAGAGGGAAAGAAGAGGGAACATCCGCGTGCGTGGTTGTATATGGCGTTAATGGCGGCGAGCATAAAGCATCCCCGcGCCTGCACGCTCGACCACGGTCGTTTACAGTGCCACTATTTATGTTGATAACTTCGGAATGGAGTCAGATACACGGTACCGAGTGCCGGCGGTTCGGCGGTGGTCCGGCAGCGGTCTGGTAGCACTTGCAAAGTATGGGAGAAGAAGGGGGATGGTGGATTCGCGGATCTTTTTGGTCGTGCAAGGAAGGCGGGTCGGTTAGGGTAGGTAGAGAGGAGACGAGCCGAGTCGAGAACAAGATTCAAGCGGAAAGTTTTGCGAGTTAATGGTGCGGAGGTAATGCCGCGGCCGCAACAGCAACAGCCATCCCCCTGCTTGTGTGTTCGCTCGCTCGCTCACTtCTTCCGTTCTCTCCTCTTCGCGCTAGTTCTCTCTCTCTCTCTCTCTCTCTCTCTGtCTTTCTTGGAATAGCCGTAGAAAAAaGGAAAAGAAAGAGAGAGAAAGAACGAGACTCCTTTCTCCCCACGAATTCTCTCCTCTCTTTAAGCACACTCTCTCTTCCTGCaccCCCCCCCcTCcTCTCTCTCTCTTTCTCTCTCTATTTTCTTGTCTTCCTCACTTGCATCATCCCTCGTCCATTTCCcTCTGGAGAACGTGCCACGTTCTCACTTCCTCGTTCGTCGCTACTTTTTTTCTTCTTTAGTTGTCTCCTCTCACACCCTCGAGACGGCCCGATCTTTTCCTCGTACAGCTCCTATCACGAACAGTCGCTAACAAGTGCATCGAATGCAAGTTGCCGACAAACTTCTTCCACCGATTTGTGCTTGTTCTCTGTGCATGCGCGGGCATGTATATCTCTATTAGGCGACATCTGCTCTCAGCTTTTTCATAACGAGGTAGCGCGGATTCGCGCTCCGAGAGACTTCACGAAGCACTTCCGATCTCGCTACAGTAGAATGCGTATTGTATTTTCTTGTCTCTCTCATTTACTCTTTTCTATCTTTCGTATCTAGCGTGAATACTCCCATGAGGAATGTAGAAACCAGATTTTGAAACGGCTTCTTCGTATCTAAATTTTCTTAACTATTTGTTCCTCAAAGTCCATTTTGACGTTATAAATTTTTATTTTTTAATGAGAATGTTTTTATGTGGAGAAGAAAAGTACAACTTTTTTCAAAATGCAATTAAATTTATACAAGACTACTAAGATAAAAAAAGATGCAAATAATAATGTTCAACTTACTAACTGCTATATTATTAAGGCCAAGTTAAATTACAGATTTATGATGTTGCAGAATATAATAGAAAAGTTTCAAGAAAAAAaCATTTTAAAGCTTAAAGAGTTTGTTTTCAAACCCATCAAATTTTTTtCTCTGGCAGTGCTGCTGCTCTGCGACGTCATTTTCAAGTCGATCAATGCAAAGTTAACAAAAGTATTTCAACTTTAAAATCATGCAGAAAGTTGGGAAAAATTTGTTTTCAATTTTACTTAAATTTTAGTTACATTTTTTCTAAAAGCTTAAAATCTCCTCTTTAAAATATGTCTTTAAAAATTGAGCTATGATTTTTCTTCACCGAGTTATCGTAATTTGAAATGGCCAATCGACGTTTTCTTTTCATCACTAGGAATAATGCCGGCGGATTTAAGCTTTCAGAAGATTCCAAGCAAAGTTGAAAACAATTTGTTTGAATTCCGCATGATCTCAAAGTTGAAGCTAAAAAAATTTCACAAGACTTTAAAAGACAAAAAGTCAATGTTGCACAATCGACTTGGAAATAACGCCACTTGGAGCAGAGAAGTATTGCCGAGAAAAAGCTTCGACCGGGTTTGAAAGCAGGATCTATAGGCTTCCAAACTTTTTTTTtGAAAAATGGAGCTTTAAGGTTACTTTTCACGAAATTGTATAAATTGTGTTTTGTTGtTTTTATCATCGATTCGAAATGATCCTTTCATTTTGTTCAAGAATGTATCATTAACTTAATGCAATGATATCTTTTAATAATTCAATACTTCTTACTAATTAGATTTAGAAAAGTTTCATGAACAATTTTAGAACGGTTATTGCTAATTATTCGCCAAAAACAGACGCCTTTCTTCGAGGAGTAACAACTCAAGCTGCATTCGCGGTTGACGGCTTTCCGCGGCGCGTCGCGGTAATATGGCTCCTATTTAATTACCCGGTCCTTTGGGAGCTTAAACCAGCCAGAGCTGGAACGGCTGCGCCGTTATTCTATCTAGACTCCTTTGCTTTCTCAAGCAGGCGCGCGATCAAACCTTCTCGCATAAAAGACAATCGCAGCTGGCAGTCGACGACGCGcGGGACAGTCGAATCATGACCCGCTCCTCTCTAGTTACCGCCGTCAGTCTGCTCTACTTCCAGACGCCGCGCGTAATCTATTCGACATTAGTTGCTTGATTGCACCGTAAAATGCGACGGCGACGTGAACGAGAACGACGACGACGACGACGACGACGACAACGACGACGACAATGACAACGACGACGACGACAAGAGTGGGTTCGTCGGTGCTCGATGGCGCCTCCGATTTCAACCGCAGCACGATGCAAGCCCATTACTATCGCCCGGAGCTAAACGGCACCCGGAGCTCGTGCCATTAAGGGAATCTAGGGTCCGATCCACCTCATTGAATTCCGTTcAATTGCGATTATGATAATGCGTGAACGATCGCCGTGGACGTGAGCTACGGaCAACgAGGGTGTTCGTTCTCGGCTCAGAGAAACGCAGCGATAAAATTATCAGTGACAGCTTCATTTTTGTTACATTTGACGTTGAAAAATTTGCGAAAGAAATGTGCATTATTCAACTACATTGACAATTGTAAATCTTACATGACCTTTTATTAATATACATATATGTAAGATATATGTTCCATCTCTAGTCTCGTTCGATAATGAAGAAATTATAGCTGCAACATAGACGCCGGATTATTCGCGGGGGAAACCGTTGATTATTTGGACTCCGTGGGCTCGGGGTCTGGTTTACTTCTTCCCTTATGTCCGGAGATAATGGACACTATTACCTTAACGAGGCCCCAGTCTCTTAGCCGGTAATTGCTTCGAGATATCCGAGAGAGCTCCGGCGTACGTTGCCGCCTGGTGTTGCAGGCAGAGAACCCaGACGGTATTATTGCCGCCGAGGCTACTCGCCCGTTTCATGGTGGTTATTGTTATGGGCCTGCGGCATTAAGATGTACACCGCACTCTCATACGGGACCCACCACCCCATATACGAGCTGCATATATACTTATGCGGGGAGGATTTCATTACGCCGCTTCATAACTGGCGGTCTTCAAAATCGCGATAAATCGGGATTGTCTTCCTTGTTAAATCAGCTCCCGTTCCCCTTCCTCATATCGCTGACGAAGCCAACGAGACGGATTAATCTGCGCGATTGAATGGCCTCTATGACGTAAGCGCACCGTTTACTGGCACGGCTCCTCGTGTTCACGTGAAACGATTTGCGTCGTAAATATTTTATTTTAACATCGCAACATCAAGACAGACGAGGATCGGCTATTGCCTCGTGATCCTAAAAGGGAGATTCTCAAGGCGGAATCGGGGTAACGCGTTCGTTGATCTCGCCAAACTACGGCATCTTGAGGACTAGTCTTAGaGGAAAAAAAGACGACGAGGAGACACGGTGAGCATTAGATGAGAAAGAGACGGCGCGGCGCGGTGCGGCGGAGCGAGACGGAAAGAGATCAAATCTGGATATCAGGATTAGGGTGGGTACGTAATCCGCAGGACGACGGGTGGTAGGAACGGTGGATCCGTCGGCAGATCTCATCGCGCGGAGGAACTCTGCGGGTACTTCGCCCGGCCAAATACGACAAGAGCAGCAGCCTAACTCGAGTAGAGCCGCCGAGATGGTTTACGGCTCGTCGCAAGTTGGTGAAATTTAAGAGCGCATTTAAAACTGGTTTGGCGCGATGCCCTGCGCCTCCCTGCAGCAGGTACTGCCGGCGAGAGATACCGCTGGGCTCCGACAGGAGTTTATCGCCCTAACATTCTTGGAAAGCTTCGGATGGATTTCCCTCCTCAACCCATTTCTTCCGGCAGCACCTCAAATCGTCGCTTCTTTCGAGCTCCCTGTGTCGTCCGTTCTTTCCCCTTGCCACGAAAATTCTTCCAACAGACTCGACACCAAATCGTCACGACGATCATCGGTCAAGATACCCGGAGAAACGTCGCGGACGATGTTACCGTCAGATCGGACGCTTTTAGCTCCGCATTGGAGCTTTTTCCTCGACCGTCTCGGAGGAGATTTAGCGGACCGACAACAAATGAAGGTCATCTTTCGCGCGAGAAAGCGTCTCACCTTGATTTCCGTCTCGTTTCTCCGGAAATTGGATATCAACCGGACGAACTGTAATGTGTACGTTAAATTGCCAATTATATATATAATGTAACAGCTGATTATCTCAAGTGTCCTAAAAACCATTATACTCTTAATTTCTGTGAAAAATGGCGAAAATAAAAAAGAAACCGATCTTAATAAAGATATTCTTCCTGATAGATGCCATGACCCACGTGGAAAACTTTTTAGTTTTGTACAGTGGTATTATACGTTATCTTCCGCTGAACGTAAGACGTGCCTATCGCGCAATTTCATCGCGACGTCGTCGTATAGCGATTATGGCTACTCCATTAAAAATGAATTTTATAAAGGCAATCTTTCCAAGCGATCGTTGTAGGAGAAAAAGGCGAAAGCCGGAGCCAAAGGGGATGAGGCCACTACCTTTGGCTGATCCACTTCGAATGATAATCACCTCTAGGAGACTCAATTTCGCCCTGCTCCGCGTCCTTACCCGTTCCTATCTTCGGAAGGTTCAACGCCGCAGCGGACTGCATCTTTCACTCCCTTCGTCACCACCGCCCTATTCCTATCGCCCTCCGCGCGCCTACCGCCCCTATATCCTTCCCTTCCTTCACtCCTAGACTATTCTGAACGACCTCTTCCCCCATTCGCCAACGCTCACTCCTAACTGATTGGAGTACCAATCAATGCGGCATTCAGGCGGCCGTGCTGAAaCTTTAGGAAATTAACTATTCACTCTCTGGAAATGGTTATTTGGAAGGCCGGAAAGGCAGTCGGGACTACGTTACGGAGAATATGAGCGAAAGGATATGGGCAGACGATGACGATAGATAAGACAGTTCAATTTTCAATTAGGCGTGGACGTAGCGCGCGTAGCTCGGCTCCTACCACGATGCTTACCACGCGTCCCATACGTTAGGAGGAATCCGAGTTATCGATTGCACCTACCTCTTCTCGCGGATCTTACCGCTGGGAAATTTCGTTCGCTTAATGAGATTATTGTACTCGCCGGAAAATCCAGAtAAATTATTCTGATAACTGCGCAACGTCGCCTTCTGGTGTTTCGCGAGCCGATAACGAGCATAGTTAACCGCAGAGAAATTATACGGTCCACATGAGAGAATATATCACTTGGCgCGGGGACGGCGGGGGGGGGGgaGGGgAGACGTACGCGCCTCCAGTCGCGCaCAAAGCCACGCGCGGAGAGAAGTATTAtCCCGGTACTCTCTTTCGTTAATGGCATCGTGGTAATCAACAAAGCCGTACGAGGTAATACGAGGTATTTACAGTTACTCCGGCGCAGCAGAGGGCACCGAGCGTCGAGCGGTGGTAGGCGCATGGGCGGATTTATGCGATGTTTACACTCGCGCAAGGCTGCTAATACGGCTTTCCGACAACAACCCCCACGCCACATCGCGACTGCTCGACTGTGCCCCGCTGTTTGAAAGTGAAACTTCTGTTTGTACGCTAGTTCCAGCTAATCCCGAATATTACGCCGGCGAATCCACCGCCGCGCGTCGGGAGAACGGAAGGCGGCAGCCCAACGCTCAGAACTCTCCGCAAGGCAGAACTGGcGCCTTGAGCGAGGCGGAGAAGTTTTCACAAAGGAGAAGCGCTTCGCCAACTGCCGTGCCGAGCAAAATACATGCGCCCTCTACGAACGTTTGTATTAGACAAGAGCCAATCTTAGCTCCAGCTGGTCCATCTTACGAGAAATCTCGTCGGTTTTGATTCGCACGTCTCGAATCGTATAATAAAAACCGTTGGCCCCAACGACAGTTGTTCGCGTTTACGTTGAGCAACGTTCGACGTGTCTAGAAAACTACAATAAACGGGAGTCGAGAAACGAGATCGTTGAGAAAGGTCTGAAAGATTTCAGCCCTACTACGCCGTGCTGGTACTCCAGTGGATATGTATATACATATACGTATATGTATATGCTCACCGAGAATCCGATCTGCAGTCTCAAACAGCTATTTACCGTAACGTCGGTGGATCGTCGGTACTTTTGTAGCCAGAAGGGAAAATGGATCGGCGGAATCGCCGAGAATGCTACTGCAGTATACAAGTCCGCCGCTGTAACTCATTCACCGATAAATTTCGGCGGGCGAAGCATCGATCACAATGTTCTATTTTGCTCATTGGTGTTTTTGCCGGCGGCATTCGATTCAATTCGAGGATGCGGTGAGTCGATATATCTCTCGAGGTGTCATGAGCCGCCCGTAATGGATGGCGGCTGTTCCCTTACCCTTACTCCGAAAGGAGAGGCCGTGCCAAGAACCTACCCGGTACCCGCAACCCTCTGCTTCTGATCCTGATTTTTGTCTGCTGTCTTCTTCTTATTCCACGCATTATTCTCGCTCTTTCCCTCGCACGGCATTCGCAAAGTGGCCAACAAAGTAGTCTACGTATCTCTCCACGCGATATTTCGTAGCACATTTTCACATTTAATCGCGAGACAATCGGAATCCAAAATATTATCCGTGATATTTAATTTACAAACGTCGTTTGCGTTACCAAAAACAAAATATTTTTTGATTAATCAAATAATATCATAAAAAATTCTTTCAATGCATTTTACCTTTGATGACAGAAAAATCGCTGCCACTCGGGGAACGATGGGATGGATTTCGCGGTAGAACAAGCAAACGCGAAAGCACGCGAAAGGAAATGCACGCGCGACTGGTCCGACAAGGATATTCATAACGTGTGAACGATCACGAGCATTCAAGCTTCGGCGAATGCTAAGACGTGCGAAAATGAGGGTTAAGAAAATCTCGAATCGGGTAGATAATCCTATAGCAGTAGTGAAAATACAATAGGCTTTAATTGCCGGACTTAATTCGTAGAAATACGATTTCGATTAAATGTAACGACTCTCGTGTTTCTCAAATAAAAATGGCACTGAATATAATTGATCACTTATTGTCACTTATCAGTTATATATTAATAAAACAAGACGTTAAAAATTTAATAAGAAAAGATGAAAAAAACAcGATGTTAAAAATTtAACCCTAAAATTAAAATTTTTGTTTATATTAATTATTTTTtATTTATTTTATATTTTTGTCACTAAATTTTGCTTGTATTAAATAAAAATATAAATTTTAATAAGGAAATACGATTTAGGCAGTACTTTCCCCTATTAAGAGGAAGTACATATAGTTTCTAAGAATATTTAGAAATATTTATGCTTTGCATTTAAATAAAATAAAAATGGCGAATTTGTTGCGCTAAATTATTAGTTCTTCAAATATTATTGCAAGAATTATAATCAGTATTGTAAAGTAAATTTTACGAGTATTTTAAGAACTGTTTATGATCTAAAAGGTTATAATTTTGTAAAACGGAGGAATAATCGTCCATCTGTAATGATGCATTTACGTCACAAGAGTGACAcTTCCCTTGAAGCTCGACAGTGCTAATACGAGTTCAAGCCTCTACCATTCTCCGAAGTGAGGATCGATAGTCTAGCATCATTGGGCTCCAACGGCTGATGCGGAGGAGTCGAAGATAGAAGGTACAGAAGGGAGAGGATAGGTGGATGGTGGACGGGGACTTTCGCATGCGGAGTACTGTCGGCTTCGGTTCGCCGAAGGATTAGCATGAATGGCCATCGTCTGCGCCAGTCGAGATAGCAACACAAGCTCTCGCCTATTATTGCTCGGAGAAATTAATCGCATCCATCGCGAATTACTCCGCGATTGCTTTTCCGAGTACAACACGTCTCGCGGTCGATGGGGCAGAGACAGCGGGGAAAGTTCGTGTGATAGGCGTCGTAGTAATTATCAAACGGAACAAAAGAGCGGGCTACCTAATTACTACTTCAATTCCCATATTTATTAATTAGCAATTTATATTATAAAAATTAAAATACTATAAATTGTATATCTTTTAATTATTAACTAATGGGAGACGTTAATAAGTTTAAAACGATGACGACATTAGGAAAAATGTTGCTTTTGTAACGTTTCTAATATTCGTCATTTGAGGAACAGTCGTTACCATTCAGTTCGTATCGAGCTGGATTTGATATAATTATGAAATAGCACACAGCACCCGCTTCCCATCAACATCCCTCCTTTTCCCGAGTCCCTCGGTCTCGGAGAAACAGAAGCGTAATTCCACTCCGAATGCAGTCATTATGGAGATTTTATCAAGGCCTGTTGCCAGCTAGCTAGGCTTCAACTTAACTCGCCCATTACGGCGTTTCCTTTGTCTGCGACTTACACATCGTCATGTCCCGCAGTTCCATTCCGAGCCGAGGGGAACACTGATGAAAGAGATCCGACGATTCGCATACGACGGTTCGATTCACAATGCCGCGCATTCACACACGCGAAAGAGCTCGCGAAGGATTCTGGGATGCATCCGAAAAATAATCACGTGGGACACGAGCATCGACAGCTTGCGCTAGCACGCACGATGGAAACCGCCTGCACAGTTGGACGGATCAGCGAGAGGCGTCGTTTTACGAGGCCGATACTTGAAACGTCGTGCTGAAGTATAAAATTCTACGGGTCATGCAACTGCCAGAGTACCGATCACGCGCATACGTGTACGTAATGCGACTTTAATGCAGCATGTATTATTCAACGATACCACCCGCCGTGCGTGTCCGTCCGACAGCTCGTATACGCGTAATTACGAGCATAGTTGTTTATTGCGAATCCGGCAGTCCGCGCAACGCCTTATCCGGCGGTTATAGACGCGCGTCTCTCAACCCTCCTCGCCAAACGATCCCGATACCTATCCGCCAACCCTCTCGTCCCTCATCTCTCGTCTCTCGTCTCTCGCCATCTCGCCTCCTCGCCTCCTCGCCTCCtCGCctCcTCGCCCCGCCTCCTCTCGCGTCGACAACCCGTGAAGAATGATCGTGGAGTTTCCCTGGTTGGACAGTGACAAATAGAGTATATTCTTGCCAGCTCGGAGAAGCGTGTTTTACGCTTTGTTTACCGTTTGTCATGTTATGCAAAGGAAAGGAATTTTTGCCGGTTGCGTATTCGGCGAATCGACCCTTATCCTACGCGCATGATATGCCTGCGGTAGTTTTACATTATCGCTCGGCTGCGGCGCGGCGCGGCGCTACACAACCATTACCACTAACGATTCGGAGCGACTCCAGGAGTAAAGAGAGAAAGGCCGTAGTCGCATTTGCGGTAAATTGCGCCCATTTACTTCGATTCGCAAACGTAATTTCTGAGATtACCTCACGCTAGTGTCCAGCGATGGTAAAAGTAAGCTCTCTGACCTTCCACGTACGATCCAATATACGATAGAACAATGCATTATTCGACGGCGCGAAATTATTCATAAGCGTCACGAATTGTCCGCAGCGAACGTGGAAACGTTAAACGTTCGGATCtAGTTTGACGAATCTAGAATCGAGACAGTCGGAACCGGACCCTCTCGACTCTCGATCCGGGTGTTTCGCGCGAGGCACGTCGCAATGTTCGGAAAATTGAGGCGAGCCGCATAATACGCATATAGTCGCGTAATAATATGCATACAGGTGCATGCGTGACGACGTGACGCAACGTGACGCCCGAGCGCGGGTCGTTAGCCGTACGGtCGTTTCGCTGGCAAGTCACGCCTAATCACCGATGAATACAACAATCGCAGCATGATGGACGATCGGCAAGCAAGCCGTCCTTTATTAGCGTCGGGTACCCGGGGGCGAGGGGCGGGTGGGAGAGCGGGTGATCATATAATTAGATTAGCGCGCGCCCATCGCGCGTCGTCATCATTTTCCGCGGTGATTTTGGACCGCGCGGGAGATCGGTAACCGCCCTGCGCCGCGCCGCGTCGCGTCGCGTTGCGGCACGGCGGTTACtGTTCTCTCGatACtcGTTGGGATTGTGCTGC

CCTTTGATGTCGCAACGCAGGACGGCACAACGTTCtCTTCCGgCgTaCGAAATCATTTtGGgACACcGTCGCGcGCGGcaCGAGaTTtAATTTgCGCCGGCAACGATCgCGC

Sequencing a genome is now cheap & easy.

Page 16: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

CTTTTTtGGTCGAGGTAACCCACACGTCGGGGGgTACGGTTGTATACATTTATGGGAGTAGCACGCTGACCCATAAACATTACTTAGTAGCCGTGCGACATAAAAATTGCCACTCGCGTATCTCAGGTGGTAGTAGTACTCTTAAAAAGTGTCAGCGTGAAAGTAGGCCAAATTACTGCATTTCCGTGTACACGTATCGTAACGCCTTCTCGCGATAAAAGAAGAATAAAATTTTCTCCTAAATATATATGATACCGCAAATGGATTTTGAGTTAGAGAGTAAGATGCAACGAATTGAAATTTATATAAATAATATATGATATTTTCTTATAAATCTTACTAAAtATTTTTTTTGTCAATTCTATCAGAATATTTCGAACGCACGTTTATCTGAACTGAGACGGCGGCATCAATTAGAAGTATTGTGAACGCAGTTGAAATGTATATCCATGAGAAACTCATACTAACGCAGAGCAGTGAAAACGAACACTTGACTCAGGTTTAACCGCGACGAAACGGTATCGGATAATGGACGGGACGAAAAGCGAATTCGACGGCTGGTATCGAGTTACGTGAAGATGTACCTCCGTCTTCGTCGTCATTGAAATTGACCGCGATACGCGAGGTTCACGGAGAGAGACAGAGATCGTAtCTTCCATCTTCTAAATTTCCTTGATTTCTCGTTTATACTCGCGGCAGCAGCCATTTACTTGTTCCACTTGGCGTTCTCAATTTTGCGCTTAGGGAAACAATTAAACCCAAGGATCTCGCATTACATTTTTCCTCCCGCATCTCTACATGTAGCGTATTAAAACCCCGTAGATGCCCGTTCATGCAGAACAAACAACACGGAGCAACATTCGTGCTCTATTTATTTAAGACTCTTATCCAAGGGGGATGAGTTGCAGCGCAGCGAAACTGGTCTGGTATTTATAGTATTTGGATTATGGAATAATCCCTTGCATAGCGACGGCGAAAAGGGAAGGAAAGCCAGGAATGCACGGTGTTTACCGATATCCTTCACCTCTTTTACCTCTTCGTATTCCTCTCTCTCGCGTTTGCCAtCTTTTTTTTtCTCTAAGCGATCTCATTGGTACCGCACGAGGAGCTTCAATCTTTTCTCTTATCTGGATTACTCGTACACAGAGAGTTACAATCCGCTATAAATTCCATCTTTTAAACGCGAAATTCATTCAATTTATTTGCGACCTCAATCATTTTCCCGTGAGATCGACCGAAAAATTCTTAATAAGGATTTATTCGTTTAATCGCGAGCTCTGATAATTTCATTACTCATTGTAACATAAGCGCATTGATATGAATGTACGTATACTGCAAAGTATAGAAAAaTAAATCCGCGTTAAAGTTCTAATTTTTGAAACGTTTATTATTGCTCGTAGAAAAGAAATATTTGTATCTGTGAATACATGTGTCGTAATATTTTAAAAATTAATTCTTTACCTTTATCACGAAAGCTAATTATTTACATATTTACAAATAAACAAAAATATTAGCGTACAAAAATAAGTAAGAGAAAAGAAATTAATTTTGTCTTGGACAAAACGTTTCATTAATGTCGTATTATGTAAAAAAAAATTTAATAAAATGTCAAGATTTTATTTTTGCGAAAGAATAAAAACAAGAGCTTAAGTTCATTCTTTTTGATAGATAATACAAGGTTCTATTTTTGATAAAGACTCATTTTTAGAGGGTTTTCTTTTCACATTTTCCGCGGTACGACTACAATGCATCGTCGAAATACAAAACAAAAAGAAAAGAAAAAGAAATCACAGCGCATCTCGTGCAAAAGCTGCGAACAGTCTACCGTCTACAAACATCGTCAGGAAGATTCTTTCATCTGGTCACGGGGGAGACATCGTTCGCGCAGGTATCTCTCACCGTTGCTGTATATTCATACGTATCTTCGACCTAGTCAAACAACGATTCGACCCAGCACAAAGCAGAGCGGGCGTCAGCGTAGCGTGAAAATTTAACCCGCGACTTGGCGGAGATTTGATTTTTTGGCGTCGGTCAAAAGGCAGACGTGCCCTGATATATCCGCGACGTTATTTGATGCACGGCTACCGGTACAATTGGACTGAGGTAAATAAGAAAGCAGCAACGACAAAAGACTTCCTTCTGTGTGGATGTTAGTCGTACCCTTGGGCGTTGTCTGCCGCCGTATCTCCTAAGTTGGTCGGGGAACGCGGAGTAAGGTTGGACGAATCCTAGTAGATAGCACAGCTGTATTAAATAAATAGAGAATCGCCGAGGCTTTCGACGATGTAGGAGGAAAGAACCGTACTCCTTGAAAGTGGAAAAGGTTGAAGTCAAGCTCCCCCCTCCCTCCCCTCTTCTCCCCGAGAGAACGTCGAGAGGAAGCGTCGTGGATGGAGAACAAAAGTCTGCGCGAAGAAATATCAGAAAAGTCGAACTAATATTTGAGATTACCCATCAATGCTTTTATCCGTCGTGTATCTCCGAAGAAAAAAAGATGGGTATAATTTTAAAAATACATTTGTCAGTGTCTGATATGCGCCGATCTTCTTTCCTCCATGCATTAACGCTTCGTGCGGATACGACCGGGCGTAATCTTAAAAAAAaTGATCGCCCATATACTAAACCACTGACAGCACGTACAGTTCGCCGAGAAGTTTAAGACCACTTTCAGCGCGTTAAACTTTTACACTCTAATTAGAAATTAATAAAGACTAATAGCTTTATCGTCGGTCTATTAAATCCCGTGGAGGAGGTGCTGGAAGTTAGGCAGTCGCGAAGGTGGTCGGTCGGTACACTCGGCAATAAAAGTTTTTTtCTTCGTGGGATAACGGGGACTTTAACTGCAAGAAGAAAGAAAAAAGaGGAAAGAGAGAAAATTCGTTGAAAGCGAAAGTCGAACTACGTTATTTATCTGATGGCCGACACTTCGAAGATTGTTGCGCGGTGTGTCGACGCAGCGCGCAGACGCTCACCAGACGTTTTCTTTTCATTTCTTATTTCTCCCGTCGAAAAAGGaGAGAAAAGGAGAGGATCGGGACGAAGTGAGAAAGAGTAGCGATAGAACGTCGGAAAGGGGAGAAAAAAaGAAATCAATGAGCGGAAACGTGATACGATGGGCGAAGGGCCAGGATGGGGGACAGGGTCGGCCGTGCTGTAGTTTCTCTATTTCCTTCATTAACTCTGATTCAACAGCAGCGTAACTTTTAATTGGAAGTGGTGGGGGgTCGAGTCGAGGCTCATTTCCCGGCGGACTTCCCCATACTGGAATTCAAGCCAATCCCCAAGCGTCTTGCGAGAGGTCACTTCATTCGTTTCGATATTCTCAGATTTTACTTTCATAGCCGTGCGATATAGCATTTGAATTGCTTCGAATTAGAAACGGTAGCGCCTATCtGTTTTTTTTTtAGAGATTAAAGAGATTAAAATCTATAGAAACGTAGTTTTTCGACAGATAAATGCTTCTTTACTGATGTCACATATGATACGCAAAGGGAAATAGATTATAATTAATGATTGTACAATATTTAGTGATATCCCTAACAATAATCATCACGAATGAATAAATATGAATGCATAAAGCGATGTACAATGGCACATTAATTTCCTATCCATTATAAATATAATAAATGTATAAATTGTATTAATGACGAGAACGGGAGATGCGCTGATTTAATTTAAATAAAATCACCAGAGCGTCATCTTCCAAAACGGAAAATAACGGAAGCGACAGATGCTTTCGAAAGCGTCCGTGCCTCCGTCCTTTACTCATATCCAGATAAAGGCACACGTGACGCTCGTAAGATACTGCCGGAAATAAGAGCGATAAAAAGCAAGGCGAATAGTTACACGCCGGGCGACGAGGtCCTTCGAAACGCGAAAATGACGAGCCTGCGATAAAGGACCGCCATCGCGAACGGTAACGGGAGTAACCCCGTGGAATTTAGTATGATCCCCCGCGCACGGCACATCgCGATACTATCGGTTTCTCGTTACCCAGATCGGTATATTGTTATTTCAGCAGTTCTCGGACAAGGTTCACGGACATTCGATAGCGCGAAGGGGACCGGAGCTCCTGAAGAATTCAACGTAACTGGCCGATAATCGGCCTCGTAACGGCTACCCATTTTCCGCGGCACGTCTCGGGGCACCATCATCGAAACGACCTTATTTCCATGCATCCACGTTGCACCTGCTATTGCACCGTAGCCGTCACCGAACTATATCGTCGTCGAACTTATTCGATGTACGATAATCCTTACGGAAAGCTTTAAACAGCTACCTGCATGCATACTAAAGATACGTCCTTcGCTTCTGGGATCCACCTTGTATTCGCGAGACATTGCATAATTTGTATTCGGTGTAAATATGCTATCCGCGTGGCGATCGACCACCGCGAACCGCGCGGAAACTCCACAGGCGGTGCTTGaTAACGTTCCGCGAACGCAGATAAGCAAATGCCAACTTATGGCAGCAGACACTcTACGCGACTTAACAAAATCTCGTATAAACCCGAGATTTTACATGGCAGTTTAACCAAAATAAATTAAGAGAGAAAGACGGAAGAAGAGAGACAGACGACGTGCCTTGCGTCGTCGTAAGAATACGTAAAACCGTGAAATGCTCGAAATGAAGCCGTCCGtATTTTTTGCATACGCTTATATCCGAAGGGCCGCTCTCCCCTCGTAATTGCGTGTCCTCGACAAAAATCTCCTTTCCGATTTGCCTCTTAGTGCTCGCGCATTCCGTCCGCGTACGACGCTTGCAAATCGGCTGCATCGCCGGCGTCACCGGCAGAAAAACGGAGAACTTTCACGGCGAGTCTCGCGGAGCCTCGTACGGCAAAACACCGCGACGACGCCTGCCATTTTGGGCTGGCACGGGAACATAGCAGGGGCCAGGATGACGGAGGGGTGGTGATAGGAAACCCCGGGCAAAAAACaGCGCCTGCAGAGATCCGCGACGCGGCTGTTTAATTCGGGACTTCGCCCGGGGGAGGAAAAAAAaGGGCCGAGGCCGCCGGCTCCTCCTCGCGCCGCTCATGCATTATGAATAATGGAATTTAAATATACGCGAAACAGATCCCCGCTGGGCGAATTCGGCTTAGAGAGGTGGAGTGCAGCCTGGTGCCACGGTGTCGTGTTCGCATTGTTAATTGGCCGGCCGTTtCATTTtCCGTTGCGGCGGCGTAACGTGAGAGACAGCGTAACGGGAAGCGAAACGACACGGCGCGTCGAGCGTGCACAGAGAAGTTGAATTATCGAGGGaGAAAAAGGGCGCGAGGGAGGGGCAGGACGGTCAAACACTGAGAAAGCAGGCAACATCCGCGAACCCGTAAATTAATTAGGGCCAACATAATAGCTGGGAATGGCTGGTAGTGGTGGTTCGTCTTCGTGGTCCAGCTTGCGGTTCCCGAGTGCCCTCGCAGTTCGCGCGGTCATGCCTCGGAAACTACACGATACACCTTGCGTCATCCTTACACGAGGATACCTGTTTGCGATCCCCTGAGGTATTAACGAGCGTATAAGCAGACTCAAAGTACGAACACACTGCTAGTTTCGCGTGCATGTTGCTCGTCGCCGTGCAGACGAGTTAGTACTGTCGACAAAGTAGTCGTAAAGTACGTAAGAGTCTTCCTCTAGTAGCGAGGACTAATCTCCCGGTACATTAATCTATGTATATTTTATGATATACACACCTGTATTTATCGAAAGTTTATGTTTATCTCGAAAATTAACGTTAATTTTCGAGGACGCAAAAGCTGTCGCAAGTTTaTTTAATTTTGTTTGtATTTTTTTTTCTTGCTCATTTTTATTTCGAAGTATGCAAATTGAATCTCTTGGTGGCATAACGAGAATTTTCGAAGCTTAAAGGGATCTGCGTTGGCGCGAAGAGAATGGGACTTGTTTTAAACTTTTCTTTCCAACGAACACCGTTTCCTTCATTGTAACGTAAAAATGGTGAGCTTCTGCGGCGCGATTGGTCTCATCTTTCCTCTCGTATCGTTCGTCATTTTGTCCGAGAGCGTGTAGATATTGTACCCGATAGAACGGAGACGGAATCACGCTATACGGTTCCCTCCCAAATGTTCGTTCCTCACGGGCAAAAGTACAAGTAAAAGTAGGAGGGTCCGACTTTATGACCCTACGAGGCACAATAAAGGTTGATGACTACTAAACGTAGAAGAACACGTATACAGTCCGACGTAAAGTTATTTTTCAATTCCGCGGTCCTCCGCCGCAGTCGTTTTGCCTTGACGTGGAAAAGGAAATTTCCCGGGTTCTAGTCGTCCGGTCTTCTTTtCGTTCTACTGACAATACCATAtTTTCGACATAGATCGATCTCCTTTtCTCTTtCTCTCTCTCTCTCTCTCTCTCTCTTGAAAATGAAAGCGCGGAAGAGCCGGATCGTGCAACGCACAATGCGAGGCCGCTTTGTACGAGGTACGAGGAGTCCGTCATGCGCCGTAAAACGCCGGAATATGCAATACTACTTTCGCAGCGACGGGGCTTGCACGAAGTAATATCGTGAAACGTAGAACCGTTCTTTTCATACGGATATGCGGGAGAAGTTGCTCGTCCGCCCTCCGGCGCATACACGTCGCGCGAGAGTATCGTGCATCCGAAACTCTGAGATGAAACTCTTCTGTAGTCGATATTCGTCGCGATAAATAGATAAGTCTCGGATAAGGTAGAGACATCGATAATTCCGTTACGAGAATACTCGAGAATAAGATCCAAGTGAAGTGATCACGCTCCATATGGTTTCAGATTAATCTCCAATCGGCTGACGAAGGAGGATCATCCTTTTACCGGTGGAAAATAGGGTGGATTGGCGGAGCAAGAAGGCTAAACAGAGAATAAACAGGAGCTATAGCCGAACGAGGGAGAAGGTAAGTAGATGCCTCGGAGAACGTGAGCGAAAAAAGGAAACGGGTACGAGAGAAAAAGAAACGGACGGAATAGCGGCTCGGGATTCGCATCCCAAGGAAAACCAAGGCTATACCGGGGTCTTGGATTATTCGAGGCACGACGACGATCTTCGAGTCAGCCGACTGTCTGTACCGTGAAAGTGGCACGTATCGATCGCACGGCTGGATTATCTTCCACTTCGATCTACGACGATTACTTCCGCCATCGTATATCCGGGCTTTGCACTAGCGAGGCTATTTAAAAATCTGCGCTCAGTAACTACTTATGATTTTTCCATCAGAAACGATTGTGGAGAGAAAAGAGGGAAAAAAAGAGATAGACAGCCTCTGGCTCGAAATGCTAATTTCGCAATCGAGAATTAAAATGCACTTCTTTGTATCTAAATTTTCGTAGAATTAAAATAAAATTGAATAAAGCAATGAATAAATTGAATAAACTAAAATATAGCTAAATATTTTTTCCTCTATACAAGGTGAATATAATTATCAAATATTTAAGTATGTAGATTGAATTTAAACAGCCTCGGAAGAGAAAAGAATCGGATAAACGAAATGCTTTTGCCTCTATTTTCAAGCACGTGACGAATAAAATCTAGCAAAGCTTTTCGACACAATATGTCGACGCAAATGTGGTCTATTTTGGCTAATGATTATTACCGGGAGTCCCGGGCACGGTGTGTCCGCGGCGCGATAAATTAGAGCGCGAATCGACTTCCACGGCCGTTGTAGAAGGTACTTTGGCAAACGTTATTTCTTCCTGTCTCGAAGGAAGCGCCACTCGAAAACTTGGAAAAGTTCGGCCAGCTGCACGCACCGCGATCTCGGGTCCGTTCTGGCTCGGTGGCGTGCGCAGGACGGTTGTGAGACGAGAGAAAGAGAGAAAGGAGGATAGAGCGCGAGAGGGAAAGAAGAGGGAACATCCGCGTGCGTGGTTGTATATGGCGTTAATGGCGGCGAGCATAAAGCATCCCCGcGCCTGCACGCTCGACCACGGTCGTTTACAGTGCCACTATTTATGTTGATAACTTCGGAATGGAGTCAGATACACGGTACCGAGTGCCGGCGGTTCGGCGGTGGTCCGGCAGCGGTCTGGTAGCACTTGCAAAGTATGGGAGAAGAAGGGGGATGGTGGATTCGCGGATCTTTTTGGTCGTGCAAGGAAGGCGGGTCGGTTAGGGTAGGTAGAGAGGAGACGAGCCGAGTCGAGAACAAGATTCAAGCGGAAAGTTTTGCGAGTTAATGGTGCGGAGGTAATGCCGCGGCCGCAACAGCAACAGCCATCCCCCTGCTTGTGTGTTCGCTCGCTCGCTCACTtCTTCCGTTCTCTCCTCTTCGCGCTAGTTCTCTCTCTCTCTCTCTCTCTCTCTCTGtCTTTCTTGGAATAGCCGTAGAAAAAaGGAAAAGAAAGAGAGAGAAAGAACGAGACTCCTTTCTCCCCACGAATTCTCTCCTCTCTTTAAGCACACTCTCTCTTCCTGCaccCCCCCCCcTCcTCTCTCTCTCTTTCTCTCTCTATTTTCTTGTCTTCCTCACTTGCATCATCCCTCGTCCATTTCCcTCTGGAGAACGTGCCACGTTCTCACTTCCTCGTTCGTCGCTACTTTTTTTCTTCTTTAGTTGTCTCCTCTCACACCCTCGAGACGGCCCGATCTTTTCCTCGTACAGCTCCTATCACGAACAGTCGCTAACAAGTGCATCGAATGCAAGTTGCCGACAAACTTCTTCCACCGATTTGTGCTTGTTCTCTGTGCATGCGCGGGCATGTATATCTCTATTAGGCGACATCTGCTCTCAGCTTTTTCATAACGAGGTAGCGCGGATTCGCGCTCCGAGAGACTTCACGAAGCACTTCCGATCTCGCTACAGTAGAATGCGTATTGTATTTTCTTGTCTCTCTCATTTACTCTTTTCTATCTTTCGTATCTAGCGTGAATACTCCCATGAGGAATGTAGAAACCAGATTTTGAAACGGCTTCTTCGTATCTAAATTTTCTTAACTATTTGTTCCTCAAAGTCCATTTTGACGTTATAAATTTTTATTTTTTAATGAGAATGTTTTTATGTGGAGAAGAAAAGTACAACTTTTTTCAAAATGCAATTAAATTTATACAAGACTACTAAGATAAAAAAAGATGCAAATAATAATGTTCAACTTACTAACTGCTATATTATTAAGGCCAAGTTAAATTACAGATTTATGATGTTGCAGAATATAATAGAAAAGTTTCAAGAAAAAAaCATTTTAAAGCTTAAAGAGTTTGTTTTCAAACCCATCAAATTTTTTtCTCTGGCAGTGCTGCTGCTCTGCGACGTCATTTTCAAGTCGATCAATGCAAAGTTAACAAAAGTATTTCAACTTTAAAATCATGCAGAAAGTTGGGAAAAATTTGTTTTCAATTTTACTTAAATTTTAGTTACATTTTTTCTAAAAGCTTAAAATCTCCTCTTTAAAATATGTCTTTAAAAATTGAGCTATGATTTTTCTTCACCGAGTTATCGTAATTTGAAATGGCCAATCGACGTTTTCTTTTCATCACTAGGAATAATGCCGGCGGATTTAAGCTTTCAGAAGATTCCAAGCAAAGTTGAAAACAATTTGTTTGAATTCCGCATGATCTCAAAGTTGAAGCTAAAAAAATTTCACAAGACTTTAAAAGACAAAAAGTCAATGTTGCACAATCGACTTGGAAATAACGCCACTTGGAGCAGAGAAGTATTGCCGAGAAAAAGCTTCGACCGGGTTTGAAAGCAGGATCTATAGGCTTCCAAACTTTTTTTTtGAAAAATGGAGCTTTAAGGTTACTTTTCACGAAATTGTATAAATTGTGTTTTGTTGtTTTTATCATCGATTCGAAATGATCCTTTCATTTTGTTCAAGAATGTATCATTAACTTAATGCAATGATATCTTTTAATAATTCAATACTTCTTACTAATTAGATTTAGAAAAGTTTCATGAACAATTTTAGAACGGTTATTGCTAATTATTCGCCAAAAACAGACGCCTTTCTTCGAGGAGTAACAACTCAAGCTGCATTCGCGGTTGACGGCTTTCCGCGGCGCGTCGCGGTAATATGGCTCCTATTTAATTACCCGGTCCTTTGGGAGCTTAAACCAGCCAGAGCTGGAACGGCTGCGCCGTTATTCTATCTAGACTCCTTTGCTTTCTCAAGCAGGCGCGCGATCAAACCTTCTCGCATAAAAGACAATCGCAGCTGGCAGTCGACGACGCGcGGGACAGTCGAATCATGACCCGCTCCTCTCTAGTTACCGCCGTCAGTCTGCTCTACTTCCAGACGCCGCGCGTAATCTATTCGACATTAGTTGCTTGATTGCACCGTAAAATGCGACGGCGACGTGAACGAGAACGACGACGACGACGACGACGACGACAACGACGACGACAATGACAACGACGACGACGACAAGAGTGGGTTCGTCGGTGCTCGATGGCGCCTCCGATTTCAACCGCAGCACGATGCAAGCCCATTACTATCGCCCGGAGCTAAACGGCACCCGGAGCTCGTGCCATTAAGGGAATCTAGGGTCCGATCCACCTCATTGAATTCCGTTcAATTGCGATTATGATAATGCGTGAACGATCGCCGTGGACGTGAGCTACGGaCAACgAGGGTGTTCGTTCTCGGCTCAGAGAAACGCAGCGATAAAATTATCAGTGACAGCTTCATTTTTGTTACATTTGACGTTGAAAAATTTGCGAAAGAAATGTGCATTATTCAACTACATTGACAATTGTAAATCTTACATGACCTTTTATTAATATACATATATGTAAGATATATGTTCCATCTCTAGTCTCGTTCGATAATGAAGAAATTATAGCTGCAACATAGACGCCGGATTATTCGCGGGGGAAACCGTTGATTATTTGGACTCCGTGGGCTCGGGGTCTGGTTTACTTCTTCCCTTATGTCCGGAGATAATGGACACTATTACCTTAACGAGGCCCCAGTCTCTTAGCCGGTAATTGCTTCGAGATATCCGAGAGAGCTCCGGCGTACGTTGCCGCCTGGTGTTGCAGGCAGAGAACCCaGACGGTATTATTGCCGCCGAGGCTACTCGCCCGTTTCATGGTGGTTATTGTTATGGGCCTGCGGCATTAAGATGTACACCGCACTCTCATACGGGACCCACCACCCCATATACGAGCTGCATATATACTTATGCGGGGAGGATTTCATTACGCCGCTTCATAACTGGCGGTCTTCAAAATCGCGATAAATCGGGATTGTCTTCCTTGTTAAATCAGCTCCCGTTCCCCTTCCTCATATCGCTGACGAAGCCAACGAGACGGATTAATCTGCGCGATTGAATGGCCTCTATGACGTAAGCGCACCGTTTACTGGCACGGCTCCTCGTGTTCACGTGAAACGATTTGCGTCGTAAATATTTTATTTTAACATCGCAACATCAAGACAGACGAGGATCGGCTATTGCCTCGTGATCCTAAAAGGGAGATTCTCAAGGCGGAATCGGGGTAACGCGTTCGTTGATCTCGCCAAACTACGGCATCTTGAGGACTAGTCTTAGaGGAAAAAAAGACGACGAGGAGACACGGTGAGCATTAGATGAGAAAGAGACGGCGCGGCGCGGTGCGGCGGAGCGAGACGGAAAGAGATCAAATCTGGATATCAGGATTAGGGTGGGTACGTAATCCGCAGGACGACGGGTGGTAGGAACGGTGGATCCGTCGGCAGATCTCATCGCGCGGAGGAACTCTGCGGGTACTTCGCCCGGCCAAATACGACAAGAGCAGCAGCCTAACTCGAGTAGAGCCGCCGAGATGGTTTACGGCTCGTCGCAAGTTGGTGAAATTTAAGAGCGCATTTAAAACTGGTTTGGCGCGATGCCCTGCGCCTCCCTGCAGCAGGTACTGCCGGCGAGAGATACCGCTGGGCTCCGACAGGAGTTTATCGCCCTAACATTCTTGGAAAGCTTCGGATGGATTTCCCTCCTCAACCCATTTCTTCCGGCAGCACCTCAAATCGTCGCTTCTTTCGAGCTCCCTGTGTCGTCCGTTCTTTCCCCTTGCCACGAAAATTCTTCCAACAGACTCGACACCAAATCGTCACGACGATCATCGGTCAAGATACCCGGAGAAACGTCGCGGACGATGTTACCGTCAGATCGGACGCTTTTAGCTCCGCATTGGAGCTTTTTCCTCGACCGTCTCGGAGGAGATTTAGCGGACCGACAACAAATGAAGGTCATCTTTCGCGCGAGAAAGCGTCTCACCTTGATTTCCGTCTCGTTTCTCCGGAAATTGGATATCAACCGGACGAACTGTAATGTGTACGTTAAATTGCCAATTATATATATAATGTAACAGCTGATTATCTCAAGTGTCCTAAAAACCATTATACTCTTAATTTCTGTGAAAAATGGCGAAAATAAAAAAGAAACCGATCTTAATAAAGATATTCTTCCTGATAGATGCCATGACCCACGTGGAAAACTTTTTAGTTTTGTACAGTGGTATTATACGTTATCTTCCGCTGAACGTAAGACGTGCCTATCGCGCAATTTCATCGCGACGTCGTCGTATAGCGATTATGGCTACTCCATTAAAAATGAATTTTATAAAGGCAATCTTTCCAAGCGATCGTTGTAGGAGAAAAAGGCGAAAGCCGGAGCCAAAGGGGATGAGGCCACTACCTTTGGCTGATCCACTTCGAATGATAATCACCTCTAGGAGACTCAATTTCGCCCTGCTCCGCGTCCTTACCCGTTCCTATCTTCGGAAGGTTCAACGCCGCAGCGGACTGCATCTTTCACTCCCTTCGTCACCACCGCCCTATTCCTATCGCCCTCCGCGCGCCTACCGCCCCTATATCCTTCCCTTCCTTCACtCCTAGACTATTCTGAACGACCTCTTCCCCCATTCGCCAACGCTCACTCCTAACTGATTGGAGTACCAATCAATGCGGCATTCAGGCGGCCGTGCTGAAaCTTTAGGAAATTAACTATTCACTCTCTGGAAATGGTTATTTGGAAGGCCGGAAAGGCAGTCGGGACTACGTTACGGAGAATATGAGCGAAAGGATATGGGCAGACGATGACGATAGATAAGACAGTTCAATTTTCAATTAGGCGTGGACGTAGCGCGCGTAGCTCGGCTCCTACCACGATGCTTACCACGCGTCCCATACGTTAGGAGGAATCCGAGTTATCGATTGCACCTACCTCTTCTCGCGGATCTTACCGCTGGGAAATTTCGTTCGCTTAATGAGATTATTGTACTCGCCGGAAAATCCAGAtAAATTATTCTGATAACTGCGCAACGTCGCCTTCTGGTGTTTCGCGAGCCGATAACGAGCATAGTTAACCGCAGAGAAATTATACGGTCCACATGAGAGAATATATCACTTGGCgCGGGGACGGCGGGGGGGGGGgaGGGgAGACGTACGCGCCTCCAGTCGCGCaCAAAGCCACGCGCGGAGAGAAGTATTAtCCCGGTACTCTCTTTCGTTAATGGCATCGTGGTAATCAACAAAGCCGTACGAGGTAATACGAGGTATTTACAGTTACTCCGGCGCAGCAGAGGGCACCGAGCGTCGAGCGGTGGTAGGCGCATGGGCGGATTTATGCGATGTTTACACTCGCGCAAGGCTGCTAATACGGCTTTCCGACAACAACCCCCACGCCACATCGCGACTGCTCGACTGTGCCCCGCTGTTTGAAAGTGAAACTTCTGTTTGTACGCTAGTTCCAGCTAATCCCGAATATTACGCCGGCGAATCCACCGCCGCGCGTCGGGAGAACGGAAGGCGGCAGCCCAACGCTCAGAACTCTCCGCAAGGCAGAACTGGcGCCTTGAGCGAGGCGGAGAAGTTTTCACAAAGGAGAAGCGCTTCGCCAACTGCCGTGCCGAGCAAAATACATGCGCCCTCTACGAACGTTTGTATTAGACAAGAGCCAATCTTAGCTCCAGCTGGTCCATCTTACGAGAAATCTCGTCGGTTTTGATTCGCACGTCTCGAATCGTATAATAAAAACCGTTGGCCCCAACGACAGTTGTTCGCGTTTACGTTGAGCAACGTTCGACGTGTCTAGAAAACTACAATAAACGGGAGTCGAGAAACGAGATCGTTGAGAAAGGTCTGAAAGATTTCAGCCCTACTACGCCGTGCTGGTACTCCAGTGGATATGTATATACATATACGTATATGTATATGCTCACCGAGAATCCGATCTGCAGTCTCAAACAGCTATTTACCGTAACGTCGGTGGATCGTCGGTACTTTTGTAGCCAGAAGGGAAAATGGATCGGCGGAATCGCCGAGAATGCTACTGCAGTATACAAGTCCGCCGCTGTAACTCATTCACCGATAAATTTCGGCGGGCGAAGCATCGATCACAATGTTCTATTTTGCTCATTGGTGTTTTTGCCGGCGGCATTCGATTCAATTCGAGGATGCGGTGAGTCGATATATCTCTCGAGGTGTCATGAGCCGCCCGTAATGGATGGCGGCTGTTCCCTTACCCTTACTCCGAAAGGAGAGGCCGTGCCAAGAACCTACCCGGTACCCGCAACCCTCTGCTTCTGATCCTGATTTTTGTCTGCTGTCTTCTTCTTATTCCACGCATTATTCTCGCTCTTTCCCTCGCACGGCATTCGCAAAGTGGCCAACAAAGTAGTCTACGTATCTCTCCACGCGATATTTCGTAGCACATTTTCACATTTAATCGCGAGACAATCGGAATCCAAAATATTATCCGTGATATTTAATTTACAAACGTCGTTTGCGTTACCAAAAACAAAATATTTTTTGATTAATCAAATAATATCATAAAAAATTCTTTCAATGCATTTTACCTTTGATGACAGAAAAATCGCTGCCACTCGGGGAACGATGGGATGGATTTCGCGGTAGAACAAGCAAACGCGAAAGCACGCGAAAGGAAATGCACGCGCGACTGGTCCGACAAGGATATTCATAACGTGTGAACGATCACGAGCATTCAAGCTTCGGCGAATGCTAAGACGTGCGAAAATGAGGGTTAAGAAAATCTCGAATCGGGTAGATAATCCTATAGCAGTAGTGAAAATACAATAGGCTTTAATTGCCGGACTTAATTCGTAGAAATACGATTTCGATTAAATGTAACGACTCTCGTGTTTCTCAAATAAAAATGGCACTGAATATAATTGATCACTTATTGTCACTTATCAGTTATATATTAATAAAACAAGACGTTAAAAATTTAATAAGAAAAGATGAAAAAAACAcGATGTTAAAAATTtAACCCTAAAATTAAAATTTTTGTTTATATTAATTATTTTTtATTTATTTTATATTTTTGTCACTAAATTTTGCTTGTATTAAATAAAAATATAAATTTTAATAAGGAAATACGATTTAGGCAGTACTTTCCCCTATTAAGAGGAAGTACATATAGTTTCTAAGAATATTTAGAAATATTTATGCTTTGCATTTAAATAAAATAAAAATGGCGAATTTGTTGCGCTAAATTATTAGTTCTTCAAATATTATTGCAAGAATTATAATCAGTATTGTAAAGTAAATTTTACGAGTATTTTAAGAACTGTTTATGATCTAAAAGGTTATAATTTTGTAAAACGGAGGAATAATCGTCCATCTGTAATGATGCATTTACGTCACAAGAGTGACAcTTCCCTTGAAGCTCGACAGTGCTAATACGAGTTCAAGCCTCTACCATTCTCCGAAGTGAGGATCGATAGTCTAGCATCATTGGGCTCCAACGGCTGATGCGGAGGAGTCGAAGATAGAAGGTACAGAAGGGAGAGGATAGGTGGATGGTGGACGGGGACTTTCGCATGCGGAGTACTGTCGGCTTCGGTTCGCCGAAGGATTAGCATGAATGGCCATCGTCTGCGCCAGTCGAGATAGCAACACAAGCTCTCGCCTATTATTGCTCGGAGAAATTAATCGCATCCATCGCGAATTACTCCGCGATTGCTTTTCCGAGTACAACACGTCTCGCGGTCGATGGGGCAGAGACAGCGGGGAAAGTTCGTGTGATAGGCGTCGTAGTAATTATCAAACGGAACAAAAGAGCGGGCTACCTAATTACTACTTCAATTCCCATATTTATTAATTAGCAATTTATATTATAAAAATTAAAATACTATAAATTGTATATCTTTTAATTATTAACTAATGGGAGACGTTAATAAGTTTAAAACGATGACGACATTAGGAAAAATGTTGCTTTTGTAACGTTTCTAATATTCGTCATTTGAGGAACAGTCGTTACCATTCAGTTCGTATCGAGCTGGATTTGATATAATTATGAAATAGCACACAGCACCCGCTTCCCATCAACATCCCTCCTTTTCCCGAGTCCCTCGGTCTCGGAGAAACAGAAGCGTAATTCCACTCCGAATGCAGTCATTATGGAGATTTTATCAAGGCCTGTTGCCAGCTAGCTAGGCTTCAACTTAACTCGCCCATTACGGCGTTTCCTTTGTCTGCGACTTACACATCGTCATGTCCCGCAGTTCCATTCCGAGCCGAGGGGAACACTGATGAAAGAGATCCGACGATTCGCATACGACGGTTCGATTCACAATGCCGCGCATTCACACACGCGAAAGAGCTCGCGAAGGATTCTGGGATGCATCCGAAAAATAATCACGTGGGACACGAGCATCGACAGCTTGCGCTAGCACGCACGATGGAAACCGCCTGCACAGTTGGACGGATCAGCGAGAGGCGTCGTTTTACGAGGCCGATACTTGAAACGTCGTGCTGAAGTATAAAATTCTACGGGTCATGCAACTGCCAGAGTACCGATCACGCGCATACGTGTACGTAATGCGACTTTAATGCAGCATGTATTATTCAACGATACCACCCGCCGTGCGTGTCCGTCCGACAGCTCGTATACGCGTAATTACGAGCATAGTTGTTTATTGCGAATCCGGCAGTCCGCGCAACGCCTTATCCGGCGGTTATAGACGCGCGTCTCTCAACCCTCCTCGCCAAACGATCCCGATACCTATCCGCCAACCCTCTCGTCCCTCATCTCTCGTCTCTCGTCTCTCGCCATCTCGCCTCCTCGCCTCCTCGCCTCCtCGCctCcTCGCCCCGCCTCCTCTCGCGTCGACAACCCGTGAAGAATGATCGTGGAGTTTCCCTGGTTGGACAGTGACAAATAGAGTATATTCTTGCCAGCTCGGAGAAGCGTGTTTTACGCTTTGTTTACCGTTTGTCATGTTATGCAAAGGAAAGGAATTTTTGCCGGTTGCGTATTCGGCGAATCGACCCTTATCCTACGCGCATGATATGCCTGCGGTAGTTTTACATTATCGCTCGGCTGCGGCGCGGCGCGGCGCTACACAACCATTACCACTAACGATTCGGAGCGACTCCAGGAGTAAAGAGAGAAAGGCCGTAGTCGCATTTGCGGTAAATTGCGCCCATTTACTTCGATTCGCAAACGTAATTTCTGAGATtACCTCACGCTAGTGTCCAGCGATGGTAAAAGTAAGCTCTCTGACCTTCCACGTACGATCCAATATACGATAGAACAATGCATTATTCGACGGCGCGAAATTATTCATAAGCGTCACGAATTGTCCGCAGCGAACGTGGAAACGTTAAACGTTCGGATCtAGTTTGACGAATCTAGAATCGAGACAGTCGGAACCGGACCCTCTCGACTCTCGATCCGGGTGTTTCGCGCGAGGCACGTCGCAATGTTCGGAAAATTGAGGCGAGCCGCATAATACGCATATAGTCGCGTAATAATATGCATACAGGTGCATGCGTGACGACGTGACGCAACGTGACGCCCGAGCGCGGGTCGTTAGCCGTACGGtCGTTTCGCTGGCAAGTCACGCCTAATCACCGATGAATACAACAATCGCAGCATGATGGACGATCGGCAAGCAAGCCGTCCTTTATTAGCGTCGGGTACCCGGGGGCGAGGGGCGGGTGGGAGAGCGGGTGATCATATAATTAGATTAGCGCGCGCCCATCGCGCGTCGTCATCATTTTCCGCGGTGATTTTGGACCGCGCGGGAGATCGGTAACCGCCCTGCGCCGCGCCGCGTCGCGTCGCGTTGCGGCACGGCGGTTACtGTTCTCTCGatACtcGTTGGGATTGTGCTGC

CCTTTGATGTCGCAACGCAGGACGGCACAACGTTCtCTTCCGgCgTaCGAAATCATTTtGGgACACcGTCGCGcGCGGcaCGAGaTTtAATTTgCGCCGGCAACGATCgCGC

But making ituseable is hard.

Sequencing a genome is now cheap & easy.

Page 17: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Page 18: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Page 19: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 20: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate:

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 21: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate: •missing pieces

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 22: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate: •missing pieces•extra pieces

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 23: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate: •missing pieces•extra pieces•incorrect merging

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 24: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting Ya

ndel

l & E

nce

2013

NR

G

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 25: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting

Visual inspection... and manual fixing required.

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 26: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting

Visual inspection... and manual fixing required.

1 gene = 20 minutes to 3 days

Yand

ell &

Enc

e 20

13 N

RG

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 27: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting

Visual inspection... and manual fixing required.

1 gene = 20 minutes to 3 days

15,000 genes * 20 species = impossible. Ya

ndel

l & E

nce

2013

NR

G

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 28: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Monica Dragan https://github.com/monicadragan/GeneValidator

Page 29: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Monica Dragan https://github.com/monicadragan/GeneValidator

Page 30: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Monica Dragan https://github.com/monicadragan/GeneValidator

Page 31: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Gene predictionDozens of software algorithms: dozens of predictions

20% failure rate: •missing pieces•extra pieces•incorrect merging•incorrect splitting

Visual inspection... and manual fixing required.

1 gene = 20 minutes to 3 days

15,000 genes * 20 species = impossible. Ya

ndel

l & E

nce

2013

NR

G

GTCTACAATGCGATTGTAAAATAGCACGAgAGGTGCATATGATGAACGACTATGTTCCACAACCACAGCTCATATATAACATGATTTtGTTTGCCGAATTCATACACGCATTACAACACACATTGAATTCAATAATAATATCAAATTCACATTCAAAGCTTTCAAGTTAGACAAAAGTTTTAATGCCGTTTTtACCTGTTTTtGAAAAGGTAATTTTCTTTAGATATATACAGTTTGTAATaTTAGGTATTTTATAAACAGTGTGTATATTTCTTACAATATAAAAGACACAATTGCAAACTAGCATGATTGTAAACAATTGCTAAACGGATCAATATAAATTAAAATTGTAATATTAAGTATCAAACCGATAATTTTTATTTATTGTTCATTGTTTGTTCTTTATTTTGTTATTTGTAAATAATGAAA

Evidence

Evidence

Consensus:

Page 32: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

http://afra.sbcs.qmul.ac.ukAnurag Priyam

Crowd-sourcing the visual inspection + correction.

Page 33: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Begin

EĞĞĚƐ�ĐƵƌĂƟŽŶ

�ƌĞĂƚĞ�ŝŶŝƟĂů�ƚĂƐŬƐ

Being curated

Curate

Being curated

Curate

Being curated

Curate

Submit Submit Submit

�ƵƚŽͲĐŚĞĐŬ

�ŽŶĞ

/ŶĐŽŶƐŝƐƚ

ĞŶƚ͗�ĐƌĞĂ

ƚĞ�

“ƌĞǀŝĞǁ͟

�ƚĂƐŬ�

�ŽŶƐŝƐƚĞŶƚ͗�create nexƚ�ƌĞƋƵŝƌĞĚ�ƚĂƐŬ

http://afra.sbcs.qmul.ac.ukAnurag Priyam

Crowd-sourcing the visual inspection + correction.

Page 34: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

Begin

EĞĞĚƐ�ĐƵƌĂƟŽŶ

�ƌĞĂƚĞ�ŝŶŝƟĂů�ƚĂƐŬƐ

Being curated

Curate

Being curated

Curate

Being curated

Curate

Submit Submit Submit

�ƵƚŽͲĐŚĞĐŬ

�ŽŶĞ

/ŶĐŽŶƐŝƐƚ

ĞŶƚ͗�ĐƌĞĂ

ƚĞ�

“ƌĞǀŝĞǁ͟

�ƚĂƐŬ�

�ŽŶƐŝƐƚĞŶƚ͗�create nexƚ�ƌĞƋƵŝƌĞĚ�ƚĂƐŬ

http://afra.sbcs.qmul.ac.ukAnurag Priyam

Crowd-sourcing the visual inspection + correction.

Scientific software + Facebook + Points + Badges + Redundancy

Page 35: 2013 11-13-SoftwareSustainabilityInstitute@Manchester
Page 36: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

My aim is to make bioinformatics software:

Page 37: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

My aim is to make bioinformatics software:

Shared

Page 38: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

My aim is to make bioinformatics software:

Shared

Robust

Page 39: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

My aim is to make bioinformatics software:

Shared

Robust

User-friendly

Page 40: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

With the fellowship I will:

Page 41: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

With the fellowship I will:

• Organize a local Software-Carpentry-type event.

Page 42: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

With the fellowship I will:

• Organize a local Software-Carpentry-type event.

• Include software & reproducibility best practices on two new MSc & existing BSc.

Page 43: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

With the fellowship I will:

• Organize a local Software-Carpentry-type event.

• Include software & reproducibility best practices on two new MSc & existing BSc.

• Promote/lobby in line with SSI’s mission (internally/talks/conf//networking/publications/web).

Page 44: 2013 11-13-SoftwareSustainabilityInstitute@Manchester

With the fellowship I will:

• Organize a local Software-Carpentry-type event.

• Include software & reproducibility best practices on two new MSc & existing BSc.

• Promote/lobby in line with SSI’s mission (internally/talks/conf//networking/publications/web).

• Seek funds to create great sustainable bioinformatics software.