Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not...
-
Upload
diana-clarke -
Category
Documents
-
view
215 -
download
0
Transcript of Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not...
![Page 1: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/1.jpg)
Doug RaifordLesson 3
![Page 2: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/2.jpg)
More and more sequence data is being generated every day
Useless if not made available to other researchers
![Page 3: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/3.jpg)
Not just sequence dataMany other biological
experiments Expression NMR Mass Spec Protein X-ray crystallography
![Page 4: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/4.jpg)
With the data comes scientific journal articles
![Page 5: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/5.jpg)
Search tools Find similar genes in other
organism Find articles Find
Implemented algorithms Alignment Sequence assembly Protein structure prediction
![Page 6: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/6.jpg)
National Center for Biotechnology Information (NCBI) GenBank
(accessed through NCBI)▪ Sponsored by
National Institute of Health (NIH)
RefSeq▪ Derived from
GenBank, curated, non-redundant
![Page 7: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/7.jpg)
European Molecular Biology Laboratory (EMBL)
DNA Data Bank of Japan (DDBJ)
![Page 8: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/8.jpg)
Protein Data Bank (PDB) PDB files: standardized format for
viewersProtein Information Resource (PIR)
![Page 9: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/9.jpg)
Will revisit laterCan actually perform scientific
analysis Color by charge Hydrophobicity Render surface
![Page 10: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/10.jpg)
Entrez Global Query Cross-Database Search System Single source for searching publications,
sequences, proteins,diseases, etc.
Whole Genome DB Genomic
Expression Omnibux (GEO)
Online Mendelian Inheritance in Man(OMIM)
PubMed Map of site
![Page 11: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/11.jpg)
Practical Extraction and Report Language Expansion came later
Really good at string manipulation DNA and proteins
represented as strings Scripting language Almost all Unix and Linux
systems come with it installed
Free download and install for windows
![Page 12: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/12.jpg)
Make a computer do what we want it to do
Program in a language Machine language▪ Low level—1’s and 0’s
High level programming language▪ C/C++▪ Java▪ Compiled into machine
language Very high level
languages▪ Scripting▪ Interpreted
Perl lives herePerl lives here
![Page 13: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/13.jpg)
Display something to the screenSyntax and punctuationStore something in a variableCommenting the codeSome easy string manipulation
print “Hello World\n”;
![Page 14: Doug Raiford Lesson 3. More and more sequence data is being generated every day Useless if not made available to other researchers.](https://reader035.fdocuments.in/reader035/viewer/2022062518/56649ea35503460f94ba80a3/html5/thumbnails/14.jpg)