Omics era

82
OMICS ERA Dr. Hetalkumar Panchal Associate Professor Gujarat Agricultural Biotechnology Institute (GABI) Navsari Agriculture University, Athwa Farm, Surat – 395007 [email protected] 29 th Refresher Course :Bio-Sciences and Bio-Enginering (ID) (02/06/2014 to 2/06/2014) UGC-Academic Staff College, Sardar Patel University, Vallbh Vidyangar -38120, Dist. Anand, (Gujarat)

description

This Presentation gives brief idea of different omics technology resources.

Transcript of Omics era

Page 1: Omics era

OMICS ERA

Dr. Hetalkumar PanchalAssociate Professor

Gujarat Agricultural Biotechnology Institute (GABI)Navsari Agriculture University,Athwa Farm, Surat – 395007

[email protected]

29th Refresher Course :Bio-Sciences and Bio-Enginering (ID) (02/06/2014 to 2/06/2014)

UGC-Academic Staff College, Sardar Patel University,

Vallbh Vidyangar -38120, Dist. Anand, (Gujarat)

Page 2: Omics era

• OMICS– The term ‘‘omic’’ is derived from the Latin suffix

‘‘ome’’ meaning mass or many. Thus, OMICS involve a mass (large number) of measurements per endpoint. (Jackson et al., 2006)

• Integration of OMICS data– Efficient integration of data from different

OMICS can greatly facilitate the discovery of true causes and states of disease, mostly done by softwares (Andrew et al., 2006).

What is ‘omics’?

Page 3: Omics era

What is ‘omics’?• In biological context , suffix –omics is used to

refer to the study of large sets of biological molecules (Smith et al., 2005)

• The realization that DNA is not alone regulate complex biological processes (as a result of HGP, 2001), triggered the rapid development of several fields in molecular biology that together are described with the term OMICS.

• The OMICS field ranges from – Genomics (focused on the genome) – Proteomics (focused on large sets of proteins, the

proteome) – Metabolomics (focused on large sets of small

molecules, the metabolome).

Page 4: Omics era

4

TYPES OF OMICS

Genomics Computational genomics Epigenomics Functional genomics Immunomics Metagenomics Pathogenomics Regenomics Personal genomics Proteomics Psychogenomics

Page 5: Omics era

GENOMICS

• The field of genomics has been divided into 3 major categories.– Genotyping (focused on the genome sequence),

• The physiological function of genes and the elucidation of the role of specific genes in disease susceptibility (Syvanen, 2001)

– Transcriptomics (focused on genomic expression)• The abundance of specific mRNA transcripts in a biological

sample is a reflection of the expression levels of the corresponding genes (Manning et al., 2007)

– Epigenomics (focused on epigenetic regulation of genome expression)

• Study of epigenetic processes (expression activities not involving DNA) on a large (ultimately genome-wide) scale (Feinberg, 2007)

Page 6: Omics era

GENOTYPING

• Goal– Identification of the physiological function of genes– Role of specific genes in disease susceptibility (syvanen et al., 2001)

• Common Parameter used– Among different variations (insertions, deletions, SNPs, etc.), single

nucleotide polymorphisms (SNPs) are the most commonly investigated (Sachidanandam et al., 2001) and can be used as markers for diseases.

– Tag SNPs (informative subset of SNPs) and fine mapping are further used to identify true cause of phenotype (patil et al., 2001).

• Application– Identification of genes associated with disease

• Recent improvement in genotyping– Array-based genotyping techniques, allowing the simultaneous

assessment (up to 1 million SNPs) per assay, leads to the genotyping of entire genome known as genome-wide association studies (GWAS) Jelly et al., 2010)

Page 7: Omics era

TRANSCRIPTOMICS

• Gene expression profiling – The identification and characterization of the mixture of mRNA that is

present in a specific sample. • Principle

– The abundance of specific mRNA transcripts in a biological sample is a reflection of the expression levels of the corresponding genes

(Manning et al., 2007).• Application

– To associate differences in mRNA mixtures originating from different groups of individuals to phenotypic differences between the groups

(Nachtomy et al., 2007). • Challenge

– The transcriptome in contrast to the genome is highly variable over time, between cell types and environmental changes (Celis et al., 2000).

Page 8: Omics era

EPIGENOMICS

• Epigenetic processes – Mechanisms other than changes in DNA sequence that cause

effect in gene transcription and gene silencing30-32. – Number of mechanisms of epigenomics but is mainly based on two

mechanisms, DNA methylation and histone modification28 33-39. – Recently RNAi has acquired considerable attention31 40 41.

• Goal– The focus of epigenomics is to study epigenetic processes on a

large (ultimately genome-wide) scale to assess the effect on disease28 29.

• Association with disease– Hypermethylation of CpG islands located in promoter regions of

genes is related to gene silencing. 28 36. Altered gene silencing plays a causal role in human disease31 34 37 38 42.

– Histone proteins are involved in the structural packaging of DNA in the chromatin complex. Post translational histone modifications such as acetylation and methylation are believed to regulate chromatin structure and therefore gene expression34 37

Page 9: Omics era

PROTEOMICS• Proteomics provides insights into the role proteins in biological systems.

The proteome consists of all proteins present in specific cell types or tissue and highly variable over time, between cell types and will change in response to changes in its environment, a major challenge (Fliser et al., 2007).

• The overall function of cells can be described by the proteins (intra- and inter-cellular )and the abundance of these proteins (Sellers et al., 2003)

• Although all proteins are directly correlated to mRNA (transcriptome) , post translational modifications (PTM) and environmental interactions impede to predict from gene expression analysis alone (Hanash et al., 2008)

• Tools for proteomics– Mainly two different approaches that are based on detection by

• mass spectrometry (MS) and • protein microarrays using capturing agents such as antibodies.

• Major focuses– the identification of proteins and proteins interacting in protein-complexes– Then the quantification of the protein abundance. The abundance of a specific

protein is related to its role in cell function (Fliser et al., 2007)

Page 10: Omics era

METABOLOMICS

• The metabolome consists of small molecules (e.g. lipids or vitamins) that are also known as metabolites (Claudino et al., 2007).

• Metabolites are involved in the energy transmission in cells (metabolism) by interacting with other biological molecules following metabolic pathways.

• Metabolic phenotypes are the by-products of interactions between genetic, environmental, lifestyle and other factors (Holmes et al., 2008).

• The metabolome is highly variable and time dependent, and it consists of a wide range of chemical structures.

• An important challenge of metabolomics is to acquire qualitative and quantitative information with preturbance of environment (Jelly et al., 2010)

Page 11: Omics era

11

METABOLITES, METABOLOME & METABONOMICS

Metabolites are the intermediates and products of metabolism. Within the context of metabolomics, a metabolite is usually defined as any molecule less than 1 kDa in size.

Metabolome refers to the complete set of small-molecule metabolites (such as metabolic intermediates, hormones and other signaling molecules, and secondary metabolites) to be found within a biological sample.

The word was coined in analogy with transcriptomics and proteomics; like the transcriptome and the proteome, the metabolome is dynamic, i.e. changing from second to second.

Metabonomics is defined as "the quantitative measurement of the dynamic multiparametric metabolic response of living systems to pathophysiological stimuli or genetic modification".

Page 12: Omics era

12

METAGENOMICS,

Metagenomics is the study of metagenomes, genetic material recovered directly from environmental samples. The broad field may also be referred to as environmental genomics, ecogenomics or community genomics.

Page 13: Omics era

13

COMPUTATIONAL GENOMICS

Computational genomics (often referred to as Computational Genetics) refers to the use of computational and statistical analysis to decipher biology from genome sequences and related data,[1] including both DNA and RNA sequence as well as other "post-genomic" data (i.e. experimental data obtained with technologies that require the genome sequence, such as genomic DNA microarrays). These, in combination with computational and statistical approaches to understanding the function of the genes and statistical association analysis, this field is also often referred to as Computational and Statistical Genetics/genomics.

Page 14: Omics era

14

EPIGENETICS

Genomic modifications that alter gene expression that cannot be attributed to modification of the primary DNA sequence and that are heritable mitotically and meiotically are classified as epigenetic modifications. DNA methylation and histone modification are among the best characterized epigenetic processes

Page 15: Omics era

15

FUNCTIONAL GENOMICS

Functional genomics is a field of molecular biology that attempts to make use of the vast wealth of data produced by genomic projects (such as genome sequencing projects) to describe gene (and protein) functions and interactions. Unlike genomics, functional genomics focuses on the dynamic aspects such as gene transcription, translation, and protein–protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or structures. Functional genomics attempts to answer questions about the function of DNA at the levels of genes, RNA transcripts, and protein products. A key characteristic of functional genomics studies is their genome-wide approach to these questions, generally involving high-throughput methods rather than a more traditional “gene-by-gene” approach.

Page 16: Omics era

16

IMMUNOMICS

Immunomics is the study of immune system regulation and response to pathogens using genome-wide approaches. With the rise of genomic and proteomic technologies, scientists have been able to visualize biological networks and infer interrelationships between genes and/or proteins; recently, these technologies have been used to help better understand how the immune system functions and how it is regulated.

Page 17: Omics era

17

PATHOGENOMICS

Pathogen infections are among the leading causes of infirmity and mortality among humans and other animals in the world.[1] Until recently, it has been difficult to compile information to understand the generation of pathogen virulence factors as well as pathogen behaviour in a host environment. The study of Pathogenomics attempts to utilize genomic and metagenomics data gathered from high through-put technologies (e.g. sequencing or DNA microarrays), to understand microbe diversity and interaction as well as host-microbe interactions involved in disease states. The bulk of pathogenomics research concerns itself with pathogens that affect human health; however, studies also exist for plant and animal infecting microbes.

Page 18: Omics era

18

REGENOMICS

Regenomics represents the merger of two fields of scientific endeavor: Regenerative medicine[1] and genomics.[2][3][4] New technologies to reprogram aged somatic cells back to pluripotency and to restore telomere length are currently used in research in regenerative medicine,[5] though FDA-approved cellular therapies using reprogrammed cells are currently not available in the United States.[6] The culture and banking of somatic cells also allows the parallel sequencing of their nuclear DNA to provide individuals with potentially valuable information for guiding them in lifestyle choices, but also one day, potentially in preventative strategies where cell types are made in advance for high risk categories of disease, i.e. preparing cardiac progenitor cells for individuals at high risk for heart disease.

Page 19: Omics era

19

PERSONAL GENOMICS

Personal genomics is the branch of genomics concerned with the sequencing and analysis of the genome of an individual. The genotyping stage employs different techniques, including single-nucleotide polymorphism (SNP) analysis chips (typically 0.02% of the genome), or partial or full genome sequencing. Once the genotypes are known, the individual's genotype can be compared with the published literature to determine likelihood of trait expression and disease risk.

Use of personal genomics in predictive and precision medicine[edit]

Predictive medicine is the use of the information produced by personal genomics techniques when deciding what medical treatments are appropriate for a particular individual. Precision medicine is focused on "a new taxonomy of human disease based on molecular biology“.

Page 20: Omics era

APPLICATION OF DIFFERENT OMICS

Page 21: Omics era
Page 22: Omics era

Stress-responsive transcription Factors DataBase (STIFDB)

Database for  Annotation, Visualization and Integrated Discovery (DAVID ) 

Page 23: Omics era

http://caps.ncbs.res.in/stifdb2/

Page 24: Omics era

http://david.abcc.ncifcrf.gov/

Page 25: Omics era
Page 26: Omics era

'OMICS' DATA REPOSITORIES

Page 27: Omics era

27

I. SEQUENCE SIMILARITY SEARCH Find a protein sequence: text search Based on Pair-Wise Comparisons

BLOSUM scoring matrix PAM scoring matrix

Dynamic Programming Algorithms Global Similarity: Needleman-Wunsch

(GAP/BestFit) Local Similarity: Smith-Waterman (SSEARCH)

Heuristic Algorithms (Sequence Database Searching) FASTA: Based on K-Tuples (2-Amino Acid) BLAST: Triples of Conserved Amino Acids Gapped-BLAST: Allow Gaps in Segment Pairs

(NREF) PHI-BLAST: Pattern-Hit Initiated Search (NCBI) PSI-BLAST: Iterative Search (NCBI)

Page 28: Omics era

28

SEQUENCE SEARCH BY TEXT OR UNIQUE IDEntrez (http://www.ncbi.nlm.nih.gov/Entrez/)

(http://pir.georgetown.edu/pirwww/search/textsearch.html)

Page 29: Omics era

29

PAIR-WISE COMPARISONS

Scoring matrix Global and local

Similarity: Dynamic Programming(Needleman-Wunsch,Smith-Waterman)

(http://www.ebi.ac.uk/emboss/align/)

Page 30: Omics era

30

FASTA SEARCH

(http://www.ebi.ac.uk/fasta33/)

(http://pir.georgetown.edu/pirwww/search/fasta.html)

Page 31: Omics era

31

GAPPED-BLAST SEARCH(http://pir.georgetown.edu/pirwww/search/pirnref.shtml)

(http://www.ncbi.nlm.nih.gov/BLAST/)

Page 32: Omics era

A BLAST Result

Page 33: Omics era

THE DIFFERENT VERSIONS OF BLAST

Page 34: Omics era

34

PSI-BLAST ITERATIVE SEARCH

(http://www.ncbi.nlm.nih.gov/BLAST/)

Page 35: Omics era

35

PSI-BLAST

Page 36: Omics era

36

II. FAMILY CLASSIFICATION METHODS

Multiple Sequence Alignment and Phylogenetic Analysis ClustalW Multiple Sequence Alignment Alignment Editor & Phylogenetic Trees

Searches Based on Family Information PROSITE Pattern Search Motif and Profile Search Hidden Markov Model (HMMs)

Page 37: Omics era

37

MULTIPLE SEQUENCE ALIGNMENT ClustalW (http://pir.georgetown.edu/pirwww/search/multaln.html)

Page 38: Omics era

38

ALIGNMENT EDITOR (JALVIEW)(http://www.ebi.ac.uk/clustalw/)

Page 39: Omics era

39

ALIGNMENT EDITOR (GENEDOC)(http://www.psc.edu/biomed/genedoc/)

Page 40: Omics era

40

PHYLOGENETIC ANALYSIS

Tree Programs: (http://evolution. genetics.washington.edu/phylip.html) Tree Searches: (

http://pauling. mbu.iisc.ernet.in/~pali/index.html)

Page 41: Omics era

41

PHYLOGENETIC TREES (IGFBP SUPERFAMILY)

(Radial Tree)

(Phylogram)

Page 42: Omics era

42

PROSITE PATTERN SEARCH(http://pir.georgetown.edu/pirwww/search/patmatch.html)

Page 43: Omics era

43

PROFILE SEARCH(http://bmerc-www.bu.edu/bioinformatics/profile_request.html)

Page 44: Omics era

44

HIDDEN MARKOV MODEL SEARCH (http://www.sanger.ac.uk/Software/Pfam/search.shtml)

(http://smart.embl-heidelberg.de)

Page 45: Omics era

45

III. STRUCTURAL PREDICTION METHODS

Signal Peptide: SIGFIND, SignalP Transmembrane Helix: TMHMM, TMAP 2D Prediction (a-helix, b-sheet, Coiled-coils): PHD,

JPred 3D Modeling: Homology Modeling (Modeller,

SWISS-MODEL), Threading, Ab-initio Prediction

Page 47: Omics era

47

PROTEIN PREDICTION SERVER

(http://www.cbs.dtu.dk/services/)

Page 48: Omics era

48

SIGNAL PEPTIDE PREDICTION(http://www.stepc.gr/~synaptic/sigfind.html)

(http://www.cbs.dtu.dk/services/SignalP-2.0)

Page 49: Omics era

49

TRANSMEMBRANE HELIX

(http://www.cbs.dtu.dk/services/TMHMM/)

Page 50: Omics era

50

PROTEIN STRUCTURE PREDICTION(http://cmgm.stanford.edu/WWW/www_predict.html)

(http://restools.sdsc.edu/biotools/biotools9.html)

Page 52: Omics era

52

3D-MODELLING(http://www.salilab.org/modeller/modeller.html)

(http://www.expasy.ch/swissmod/SWISS-MODEL.html)

Page 53: Omics era

53

IV. PROTEIN FAMILY DATABASES Whole Proteins

PIR: Superfamilies and Families COG (Clusters of Orthologous Groups) of Complete

Genomes ProtoNet: Automated Hierarchical Classification of Proteins

Protein Domains

Pfam: Alignments and HMM Models of Protein Domains SMART: Protein Domain Families

Protein Motifs

PROSITE: Protein Patterns and Profiles BLOCKS: Protein Sequence Motifs and Alignments PRINTS: Protein Sequence Motifs and Signatures

Integrated Family Databases

iProClass: Superfamilies/Families, Domains, Motifs, Rich Links

InterPro: Integrate Pfam, PRINTS, PROSITES, ProDom, SMART

Page 54: Omics era

54

PROTEIN CLUSTERING(http://www.ncbi.nlm.nih.gov/COG/)

Page 55: Omics era

55

PROTEIN DOMAINSPfam (http://www.sanger.ac.uk/Software/Pfam/)

SMART (http:// smart.embl-heid elberg.de/smart/ show_motifs.pl)

Page 56: Omics era

56

PROTEIN MOTIFS PROSITE is a database of protein families and domains. It consists of biologically significant sites, patterns and profiles. (http://www.expasy.ch/prosite/)

Page 57: Omics era

57

INTEGRATED FAMILY CLASSIFICATION InterPro: An integrated resource unifying PROSITE, PRINTS,

ProDom, Pfam, SMART, and TIGRFAMs, PIRSF. (http://www.ebi.ac.uk/interpro/search.html)

Page 58: Omics era

58

V. DATABASES OF PROTEIN FUNCTIONS

Metabolic Pathways, Enzymes, and Compounds Enzyme Classification: Classification and Nomenclature of Enzyme-Catalysed Reactions

(EC-IUBMB) KEGG (Kyoto Encyclopedia of Genes and Genomes): Metabolic Pathways LIGAND (at KEGG): Chemical Compounds, Reactions and Enzymes EcoCyc: Encyclopedia of E. coli Genes and Metabolism MetaCyc: Metabolic Encyclopedia (Metabolic Pathways) WIT: Functional Curation and Metabolic Models BRENDA: Enzyme Database UM-BBD: Microbial Biocatalytic Reactions and Biodegradation Pathways Klotho: Collection and Categorization of Biological Compounds

Cellular Regulation and Gene Networks EpoDB: Genes Expressed during Human Erythropoiesis BIND: Descriptions of interactions, molecular complexes and

pathways DIP: Catalogs experimentally determined interactions

between proteins RegulonDB: Escherichia coli Pathways and Regulation

Page 59: Omics era

59

KEGG METABOLIC & REGULATORY PATHWAYS

(http://www.genome.ad.jp/dbget-bin/show_pathway?hsa00590+874)

KEGG is a suite of databases and associated software, integrating our current knowledge

on molecular interaction networks, the information of genes and proteins, and of chemical compounds and reactions. (http://www.genome.ad.jp/kegg/kegg2.html)

Page 60: Omics era

60

BIOCYC (ECOCYC/METACYC METABOLIC PATHWAYS) The BioCyc Knowledge Library is a collection of Pathway/Genome

Databases (http://biocyc.org/)

Page 61: Omics era

61

PROTEIN-PROTEIN INTERACTIONS: DIP(http://dip.doe-mbi.ucla.edu/)

Page 62: Omics era

62

PROTEIN-PROTEIN INTERACTION: BIND(http://www.bind.ca/)

Page 63: Omics era

63

BIOCARTA CELLULAR PATHWAYS(http://www.biocarta.com/index.asp)

Page 64: Omics era

64

VI. DATABASES OF PROTEIN STRUCTURES

Protein Structure and Classification PDB: Structure Determined by X-ray Crystallography and NMR CATH: Hierarchical Classification of Protein Domain Structures SCOP: Familial and Structural Protein Relationships FSSP: Protein Fold Family Database

Protein Sequence-Structure Relationship PIR-NRL3D: Protein Sequence-Structure Database PIR-RESID: Protein Structure/Post-Translational Modifications HSSP: Families and Alignments of Structurally-Conserved

Regions

Page 65: Omics era

65

PDB STRUCTURE DATA(http://www.rcsb.org/pdb/)

Page 66: Omics era

66

PDBSUM:

Summary and Analysis (http://www.biochem.ucl.ac.uk/bsm/pdbsum)

Page 67: Omics era

67

PDB: EXPERIMENTAL 3D STRUCTURE REPOSITORY

(http://www.rcsb.org/pdb/)

Rat gamma-crystallin, chain A, B.

Can you do a text search at PIR to find this?

Page 68: Omics era

68

PDBSUM:Summary and Analysis (http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/)

Search 3-D structure summary

2-D structure

Page 69: Omics era

69

PROTEIN STRUCTURAL CLASSIFICATION

CATH: Hierarchical domain classification of protein structures (http://www.biochem. ucl.ac.uk/bsm/cath_new/)

Page 71: Omics era

71

PROTEIN STRUCTURAL CLASSIFICATION

(http://scop.mrc-lmb. cam.ac.uk/scop/)

The SCOP database aims to provide a detailed and comprehensive description of the structural and evolutionary relationships between all proteins whose structure is known, including all entries in the PDB.

Page 72: Omics era

72

VII. PROTEOMIC RESOURCES

GELBANK (http://gelbank.anl.gov): 2D-gel patterns from completed genomes; SWISS-2DPAGE (http://www.expasy.org/ch2d/)PEP: Predictions for Entire Proteomes: (http://cubic.bioc.columbia.edu/ pep/): Summarized analyses of protein sequences Proteome BioKnowledge Library: (http://www.proteome.com): Detailed information on human, mouse and rat proteomesProteome Analysis Database (http://www.ebi.ac.uk/proteome/): Online application of InterPro and CluSTr for the functional classification of proteins in whole genomesExpression Profiling databases: GNF (http://expression.gnf.org/cgi-bin/index.cgi, human and mouse transcriptome), SMD (http://genome-www5.stanford.edu/MicroArray/SMD/, Stanford microarray data analysis), EBI Microarray Informatics (http://www.ebi.ac.uk/microarray/ index.html , managing, storing and analyzing microarray data)

Page 73: Omics era

73

2D-GEL IMAGE DATABASES (2)(http://us.expasy.org/ch2d/2d-index.html)

(http://us.expasy.org/cgi-bin/nice2dpage.pl?P06493)

Page 74: Omics era

74

VIII. PROTEOME ANALYSIS(http://www.ebi.ac.uk/proteome)

Page 75: Omics era

75

EXPRESSION PROFILING Human and Mouse Transcriptome

(http://expression.gnf.org/cgi-bin/index.cgi)

(http://genome-www. stanford.edu/serum/)

Page 76: Omics era

76

OMICS TOOLS(http://omictools.com/)

Page 77: Omics era

77

OMICS TOOLS METAGENOMICS ANALYSIS

Page 78: Omics era

78

OMICS TOOLS MASS SPECTROMETRY ANALYSIS

Page 79: Omics era

79

OMICS TOOLS MASS SPECTROMETRY ANALYSIS

Page 80: Omics era

80

OMICS TOOLS COMMON TOOLS

Page 81: Omics era

Q & AQ U E S T I O N S

&A N S W E R S

Page 82: Omics era