NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

41
NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez

Transcript of NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

Page 1: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

NCBI Molecular Biology Resources

January 2008

Using Entrez

Page 2: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eWWWAccess

Entrez&BLAST

Page 3: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Gene

Homologene

Entrez: Database Integration

PubMed abstracts

Nucleotide sequences

Protein sequences

3-D Structure

3 -D Structure

Word weight

VAST

BLASTBLAST

Hard LinkNeighborsRelated Sequences

NeighborsRelated SequencesBLinkDomains

NeighborsRelated Structures

Page 4: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eThe Links Menu: Access to Neighbors and

LinksSNPSNP

GEOGEO

GeneGene

PubMedPubMed

ProteinProtein

Page 5: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eThe Links Menu: Access to Neighbors and

LinksNeighbors: BLAST Linkpre-computed BLASTNeighbors: BLAST Linkpre-computed BLAST

Neighbors:pre-computed CDD searchNeighbors:pre-computed CDD search

Page 6: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eThe Links Menu: Access to Neighbors and

Links

NeighborsNeighbors

Hard LinksHard Links

Page 7: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Database Searching with Entrez

Using limits and field restriction to find human MutL homologLinking and neighboring with MutLMapping SNPs onto structure

Page 8: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Global NCBI (Entrez) Search

colon cancercolon cancer

Page 9: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Global Entrez Search Results

Page 10: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Nucleotide Sequences

Nucleotide database now three parts

•EST expressed sequence tags•GSS genome survey sequences•CoreNucleotide everything else

Page 11: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Core Nucleotide Results with Gene Preview

New! Gene Previewmore relevant resultsNew! Gene Previewmore relevant results

New! Taxonomy FiltersNew! Taxonomy Filters

Page 12: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Advanced Search OptionsTabsTabs

Taxonomy filterTaxonomy filter

Page 13: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eMore Precise Nucleotides

Search

colon cancer[Title] AND nonpolyposis[Title] AND human[Organism] AND biomol_mrna[Properties] AND srcdb_refseq[Properties]colon cancer[Title] AND nonpolyposis[Title] AND human[Organism] AND biomol_mrna[Properties] AND srcdb_refseq[Properties]

Page 14: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Useful Field Restrictions[Title]: Definition line in GenBank / GenPept format shown in Summary format

glyceraldehyde 3 phosphate dehydrogenase[Title]

[Organism]: NCBI’s taxonomy. Organizing system for molecular databases

mouse[organism]; green plants[organism]; Streptomyces coelicolor[organism]

[Properties]: molecule type, location, database source

biomol_mrna[properties]; biomol_genomic[properties]; gene_in_mitochondrion[properties]; srcdb pdb[properties]

[Filter]: subsets of data, Entrez links

all[filter]; nucleotide mapview[filter]; nucleotide omim[filter]

Page 15: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eEntrez Tip: Start Searches in

Gene

HomoloGene

Entrez Protein

Gene

UniGene

Other Entrez DBs

BLink

Homologene:Gene Neighbors

Page 16: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Gene Results nonpolyposis colon cancer AND human[Organism]nonpolyposis colon cancer AND human[Organism]

Page 17: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Precise Results

MLH1[Gene Name] AND Human[Organism]MLH1[Gene Name] AND Human[Organism]

NCBI TaxonomyNCBI Taxonomy

Page 18: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Organism Field: NCBI’s Taxonomy

All molecular databasesAll molecular databases

Page 19: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

MLH1 Gene Record

Page 20: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eMLH1 Gene Record: Interactions and

GO

Page 21: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

MLH1 Sequences

Page 22: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

MLH1 Gene Record: Sequences

Page 23: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

MLH1: Sequence Links

Page 24: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Gene Table: Genomic Sequences

Page 25: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Map Viewer: All SequencesCustomizableCustomizable

NCBI Assembly

EST Hits

Gene Annotations

Models

Transcripts

Download data and sequencesDownload data and sequences

Page 26: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

MLH1 Homologs

Page 27: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eSynteny: Mammalian Genomes

Albumin Gene FamilyAlbumin Gene Family

Page 28: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Finding Homologs: HomoloGene

GeneGene

Page 29: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

HomoloGene Cluster

Gene LinksGene LinksProtein LinksProtein Links

Page 30: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

HomoloGene Downloader

ProtienmRNA

Genomic

ProtienmRNA

Genomic

Page 31: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Finding Homologs 2: BLink

GeneGene

ProteinProtein

Page 32: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

BLink: BLAST Link (Best Hits)

Redundant ProteinsRedundant Proteins

First 200 onlyFirst 200 only

BLASTBLAST

Tomato homologTomato homolog

Page 33: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Finding Polymorphisms

GeneGene

Page 34: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

eGeneView: Variations Human

MLH1

ATPase domain

Page 35: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

MLH1 Structure Model and Mapping Polymorphisms

Page 36: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Related Structures: structure model

Page 37: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Sequence Similar Structures

ConservedDomain

ConservedDomain

Link to StructureLink to StructureLink to AlignmentLink to Alignment

Page 38: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

E. coli MutL Structure

Cn3D viewerCn3D viewer

Conserved DomainsConserved Domains

Page 39: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Alignment Based Model: Mapping Polymorphisms

Mg2+ binding siteMg2+ binding site

Ile - ValIle - Val

Page 40: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Better Model: Conserved Domain

GeneGene

ProteinProtein

Related StructuresRelated Structures

Page 41: NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.

NC

BI

Fie

ldG

uid

e

Better Model: Conserved Domain

Mg2+ binding siteMg2+ binding site

Ile – ValPosition 32Ile – ValPosition 32