An Introduction to Bioinformatics
description
Transcript of An Introduction to Bioinformatics
![Page 1: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/1.jpg)
An Introduction to Bioinformatics
Molecular Biology Databases
![Page 2: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/2.jpg)
AIMS
OBJECTIVES
To introduce the major databases- nucleotide- protein
To explain how to search the appropriate databases
To explain how to retrieve information from databases
Choose appropriate databases for information retrieval
Use of Boolean operators to search databases
Retrieve nucleotide and protein sequence files
![Page 3: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/3.jpg)
Introduction
• Hundreds!
• Databases of databases!
• Acronym rich!
• Subcomponents• organisms• structure• metabolism…….
• Searched• text, sequences
![Page 4: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/4.jpg)
Historically
• 1960s •Mary Dayhoff - Protein Sequences
(Eck, R. V., and M. O. Dayhoff. 1966. Atlas of Protein Sequence and Structure 1966.
National Biomedical Research Foundation, Silver Spring, Maryland.)
• 1980s - explosion in DNA sequences• EMBL (European Molecular Biology Laboratory)• NIH (National Institute of Health) Genbank• DDBJ (DNA database of Japan)
• 1988• agreed on international collaboration
![Page 5: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/5.jpg)
![Page 6: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/6.jpg)
• Experimentally determined nucleotide sequence,• Inferred protein sequence
– EMBL, GenBank, DDBJ nucleotides– GenPept– PIR Protein Identification Resource proteins– SWISS-PROT
• Which to choose?
Primary Databases
}
![Page 7: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/7.jpg)
Composite Databases
SWISS-PROT + PIR+ GenPept +
SWISS-PROT, Swissnew, Trembl, Tremblnew, Genbank, PIR, Wormpep and PDB
![Page 8: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/8.jpg)
Secondary Databases
• Analytical results of primary databases
• Searching for related patterns
– Prosite– Pfam More on these later
![Page 9: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/9.jpg)
Sub-Databases
• EST - Expressed Sequence Tags
• STS - Sequence Tagged Sites
• SNP - Single Nucleotide Polymorphisms
• OMIM - Online Medelian Inheritance in Man
![Page 10: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/10.jpg)
Searching and Retrieval
• Entrez - National Center for Biotechnology Information
• SRS - European Bioinformatics Institute
• DBGET - Japan’s GenomeNet.
Capable of retrieving specific nucleotide or protein sequence.Provide links to additional related information.
![Page 11: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/11.jpg)
Entrez
![Page 12: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/12.jpg)
Entrez Tutorial
• Search for penicillin-binding genes• Search for Mycobacterium tuberculosis• Combine the searches• Scan the output
Q/ Are there any genes that code for penicillin binding in the Mycobacterium genome?
Example of a text based search to identify genes that have already been annotated.
![Page 13: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/13.jpg)
![Page 14: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/14.jpg)
![Page 15: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/15.jpg)
![Page 16: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/16.jpg)
![Page 17: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/17.jpg)
#1 AND #2
![Page 18: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/18.jpg)
![Page 19: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/19.jpg)
![Page 20: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/20.jpg)
![Page 21: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/21.jpg)
SRS guide
![Page 22: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/22.jpg)
Searching the Databases
• Subject
• Accession Numbers
• Author
e.g. AF208262
![Page 23: An Introduction to Bioinformatics](https://reader035.fdocuments.in/reader035/viewer/2022062314/56814413550346895db0b296/html5/thumbnails/23.jpg)
Boolean Operators
AND will locate all records containing both the words e.g. human AND protease
OR will locate all records containing either word not necessarily both e.g. human OR protease)
NOT will locate records containing one word, but NOT the other word e.g. human NOT protease