NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

Post on 12-Jan-2016

NCBI FieldGuide

NCBI Molecular Biology Resources

January 2008

Using Entrez

WWW Access

Entrez & BLAST

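Entrez: Database Integration

[Diagram of linked Entrez databases: Gene, HomoloGene, PubMed abstracts, Nucleotide sequences, Protein sequences, 3-D Structure]

Neighbors within each database:

• PubMed abstracts: word weight
• Nucleotide sequences: BLAST (Related Sequences)
• Protein sequences: BLAST (Related Sequences, BLink, Domains)
• 3-D Structure: VAST (Related Structures)

Hard Link: connections between databases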

The Links Menu: Access to Neighbors and Links

SNP

GEO

Gene

PubMed

Protein

The Links Menu: Access to Neighbors and Links

Neighbors: BLAST Link (pre-computed BLAST)

Neighbors: pre-computed CDD search

The Links Menu: Access to Neighbors and Links

Neighbors

Hard Links

Database Searching with Entrez

• Using limits and field restriction to find human MutL homolog
• Linking and neighboring with MutL
• Mapping SNPs onto structure

Global NCBI (Entrez) Search

colon cancer

Global Entrez Search Results

Nucleotide Sequences

Nucleotide database now three parts

• EST: expressed sequence tags
• GSS: genome survey sequences
• CoreNucleotide: everything else

Core Nucleotide Results with Gene Preview

New! Gene Preview: more relevant results

New! Taxonomy Filters

Advanced Search Options

Tabs

Taxonomy filter

More Precise Nucleotides Search

colon cancer[Title] AND nonpolyposis[Title] AND human[Organism] AND biomol_mrna[Properties] AND srcdb_refseq[Properties]
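The query on this slide is a set of field-restricted terms joined with AND. A minimal sketch of assembling such a query programmatically is given here; the helper names `tag` and `all_of` are hypothetical and are not part of any NCBI tool.

```python
def tag(term, field):
    """Attach an Entrez field tag to a term, e.g. human -> human[Organism]."""
    return f"{term}[{field}]"


def all_of(*clauses):
    """Join clauses so that every one of them must match."""
    return " AND ".join(clauses)


# Rebuild the query shown on the slide from its individual restrictions.
query = all_of(
    tag("colon cancer", "Title"),
    tag("nonpolyposis", "Title"),
    tag("human", "Organism"),
    tag("biomol_mrna", "Properties"),
    tag("srcdb_refseq", "Properties"),
)
print(query)
```

The printed string can be pasted into the Entrez Nucleotide search box unchanged.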

Useful Field Restrictions

[Title]: definition line in GenBank / GenPept format, shown in Summary format

glyceraldehyde 3 phosphate dehydrogenase[Title]

[Organism]: NCBI’s taxonomy, the organizing system for the molecular databases

mouse[organism]; green plants[organism]; Streptomyces coelicolor[organism]

[Properties]: molecule type, location, database source

biomol_mrna[properties]; biomol_genomic[properties]; gene_in_mitochondrion[properties]; srcdb_pdb[properties]

[Filter]: subsets of data, Entrez links

all[filter]; nucleotide_mapview[filter]; nucleotide_omim[filter]

Entrez Tip: Start Searches in Gene

HomoloGene

Entrez Protein

Gene

UniGene

Other Entrez DBs

BLink

HomoloGene: Gene Neighbors

Gene Results

nonpolyposis colon cancer AND human[Organism]

Precise Results

MLH1[Gene Name] AND Human[Organism]

NCBI Taxonomy
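The slides run this search in the Entrez web interface. As a hedged illustration, the same term could also be sent to NCBI's E-utilities ESearch service; the sketch below only builds the request URL and does not contact NCBI. The function name `esearch_url` and the `retmax` default are assumptions made for this example.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Base address of the E-utilities ESearch service.
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def esearch_url(db, term, retmax=20):
    """Build an ESearch URL for a database and an Entrez query string."""
    params = {"db": db, "term": term, "retmax": retmax}
    # urlencode percent-encodes the brackets and spaces in the term.
    return f"{ESEARCH}?{urlencode(params)}"


url = esearch_url("gene", "MLH1[Gene Name] AND Human[Organism]")
print(url)

# Decoding the query string recovers the original term.
decoded = parse_qs(urlsplit(url).query)
print(decoded["term"][0])
```

Building the URL separately from fetching it keeps the example runnable offline and makes the encoded query easy to inspect.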

Organism Field: NCBI’s Taxonomy

All molecular databases

MLH1 Gene Record

MLH1 Gene Record: Interactions and GO

MLH1 Sequences

MLH1 Gene Record: Sequences

MLH1: Sequence Links

Gene Table: Genomic Sequences

Map Viewer: All Sequences

Customizable

NCBI Assembly

EST Hits

Gene Annotations

Models

Transcripts

Download data and sequences

MLH1 Homologs

Synteny: Mammalian Genomes

Albumin Gene Family

Finding Homologs: HomoloGene

Gene

HomoloGene Cluster

Gene Links

Protein Links

HomoloGene Downloader

Protein

mRNA

Genomic

Finding Homologs 2: BLink

Gene

Protein

BLink: BLAST Link (Best Hits)

Redundant Proteins

First 200 only

BLAST

Tomato homolog

Finding Polymorphisms

Gene

GeneView: Variations in Human MLH1

ATPase domain

MLH1 Structure Model and Mapping Polymorphisms

Related Structures: structure model

Sequence Similar Structures

Conserved Domain

Link to Structure

Link to Alignment

E. coli MutL Structure

Cn3D viewer

Conserved Domains

Alignment Based Model: Mapping Polymorphisms

Mg2+ binding site

Ile - Val
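An alignment-based model places a polymorphism on a known structure by following the alignment column from the query protein to the structure's sequence. The sketch below shows one way that position mapping could be done on a gapped pairwise alignment. The two short sequences are invented toy data, not real MLH1 or MutL sequence, and `map_position` is a hypothetical helper.

```python
def map_position(aligned_query, aligned_subject, query_pos):
    """Map a 1-based query residue position to the aligned subject position.

    Both inputs are rows of the same pairwise alignment, with '-' for gaps.
    Returns None if the residue faces a gap or the position is out of range.
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    q_index = 0  # residues seen so far in the query
    s_index = 0  # residues seen so far in the subject
    for q_char, s_char in zip(aligned_query, aligned_subject):
        if q_char != "-":
            q_index += 1
        if s_char != "-":
            s_index += 1
        if q_char != "-" and q_index == query_pos:
            return None if s_char == "-" else s_index
    return None


# Toy alignment (assumed data): query row on top, structure row below.
query_row = "MK-ILVAG"
subject_row = "MRTI-VSG"

print(map_position(query_row, subject_row, 3))  # query I3 pairs with subject I4
print(map_position(query_row, subject_row, 4))  # query L4 faces a gap
```

A variant position that maps to None has no counterpart in the structure, so it cannot be displayed on the model.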

Better Model: Conserved Domain

Gene

Protein

Related Structures

Better Model: Conserved Domain

Mg2+ binding site

Ile – Val, Position 32