Algorithms for Data Science

Anil Maheshwari

anil@scs.carleton.ca
School of Computer Science
Carleton University

1

Outline

Topics

Required Background

Blended Teaching

Course Evaluation

Help Me

2

Topics

Course Title

Algorithms for Data Science

Input (a binary string) ⇒ Algorithm (?) ⇒ Output (a binary string); see the sketch after this slide.

3
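The bit strings on this slide did not survive the transcript cleanly, so as a stand-in here is a minimal Python sketch of the Input ⇒ Algorithm ⇒ Output idea: it treats the input as an ASCII-encoded bit string and decodes it. The bit string below is a made-up example, not the one from the slide.

```python
# Minimal sketch: one "algorithm" that turns a stream of bits into readable text.
# The bit string is a made-up example (ASCII encoding of "data"), not the slide's input.
bits = "01100100011000010111010001100001"

# Split the stream into 8-bit chunks and map each chunk to its ASCII character.
chars = [chr(int(bits[i:i + 8], 2)) for i in range(0, len(bits), 8)]
print("".join(chars))  # prints "data"
```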

Topics

1. Probability Basics: Linearity of Expectation, Concentration Bounds, Balls and Bins, Hashing.

2. Data Streaming: Count-Min Sketch (CMS), Estimating Frequency Moments, What is Trending? (a toy sketch follows this slide)

3. Locality-Sensitive Hashing

4. Geometric Approximation (Nearest Neighbors, Core Sets)

5. Dimensionality Reduction

6. Online Algorithms: Bipartite Matching, Adwords

7. Regret Minimization (Multiplicative Weight Algorithm)

8. Graph Partitioning

9. Linear Algebra: Eigenvalues + SVD

10. Recommendation Systems

11. Randomized Linear Algebra

12. . . .

4
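Topic 2 mentions the Count-Min Sketch (CMS). As a preview, here is a minimal illustrative sketch, not the course's reference implementation: a small table of hashed counters whose estimates can overcount (because of collisions) but never undercount. The width, depth, and salted use of Python's built-in hash are arbitrary choices made for illustration.

```python
import random

class CountMinSketch:
    """Minimal Count-Min Sketch: approximate frequency counts for a data stream.

    Estimates never underestimate the true count; taking the minimum over the
    rows keeps the overestimate small with high probability.
    """

    def __init__(self, width=2000, depth=5, seed=42):
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]
        # One salt per row; Python's built-in hash plus a salt stands in for a
        # proper pairwise-independent hash family in this toy version.
        rng = random.Random(seed)
        self.salts = [rng.getrandbits(32) for _ in range(depth)]

    def _index(self, row, item):
        return hash((self.salts[row], item)) % self.width

    def add(self, item, count=1):
        for row in range(self.depth):
            self.table[row][self._index(row, item)] += count

    def estimate(self, item):
        # Each row may overcount due to collisions; the minimum is the best estimate.
        return min(self.table[row][self._index(row, item)] for row in range(self.depth))


# Tiny usage example on a synthetic stream.
cms = CountMinSketch()
stream = ["cat"] * 1000 + ["dog"] * 10 + ["fox"]
for item in stream:
    cms.add(item)
print(cms.estimate("cat"), cms.estimate("dog"), cms.estimate("fox"))
```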

Our Focus

Warnings: First Offering - Online - Plans May Derail!
Office may become inaccessible.

We will try to understand

• Algorithmic Techniques

• Key Ideas

• Learn a bit of Math

• High-level details

5

Required Background

Background

• Basic Data Structures: Lists, Stacks, BST, Hashing, Searching and Sorting

See Pat Morin’s https://opendatastructures.org/

• Basic Probability Theory: Expectation, Variance, Concentration Bounds

Michiel Smid’s COMP 2804 Notes http://cglab.ca/~michiel/DiscreteStructures/

Joe Blitzstein https://projects.iq.harvard.edu/stat110/youtube

• Linear Algebra: Eigenvalues and Eigenvectors, Null Space, Positive Definite Matrices, QΛQ^T, SVD (a brief reminder follows this slide)

Gilbert Strang

https://ocw.mit.edu/courses/mathematics/18-06-linear-algebra-spring-2010/video-lectures/

• Algorithmic Techniques: Pseudocode, Correctness, Analysis (Recurrences + O, Ω notation), Analysis of (randomized) Quicksort. Algorithms for Connectivity, Strong Connectivity, MST, SSSP, Graph Traversal. Techniques: D&C, Greedy, Dynamic Programming.

6
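For the linear algebra background listed above, here is a brief reminder of the two factorizations named on the slide. These are standard facts, written in LaTeX notation purely for reference, not course-specific material.

```latex
% Spectral decomposition: any real symmetric matrix A factors as
%   A = Q \Lambda Q^{T},
% where Q is orthogonal (its columns are orthonormal eigenvectors of A)
% and \Lambda is diagonal, carrying the eigenvalues of A.
%
% Singular value decomposition: any real m-by-n matrix A factors as
%   A = U \Sigma V^{T},
% where U and V are orthogonal and \Sigma is diagonal with the
% nonnegative singular values of A.
\[
  A = Q \Lambda Q^{T} \ (A \text{ symmetric}), \qquad
  A = U \Sigma V^{T} \ (A \in \mathbb{R}^{m \times n}).
\]
```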

Blended Teaching

Online Course

• Lectures (with recordings) on Tuesdays & Fridays, starting at 06:30 AM EST.

• Timing may change when Daylight Saving Time begins in March.

• E-mail: anil@scs.carleton.ca

• Post queries during the lectures using the Zoom chat.

7

References

Useful References

1. Foundations of Data Science by Blum, Hopcroft and Kannan. https://www.cs.cornell.edu/jeh/book.pdf

2. Mining of Massive Data Sets http://www.mmds.org/

3. Notes on Algorithm Design on my web page: http://people.scs.carleton.ca/~maheshwa/

4. Numerous research articles.

5. Pointers to useful Videos.

8

Course Evaluation

Evaluation Parameters

Principles

• Open-ended course by design

• Contents evolve

• Enjoyable learning experience

• Use the Zoom chat effectively

Evaluation Components

• Assignments

• Programming Assignments

• Tests

• On-the-spot quizzes!

9

Help Me

PLEASE, PLEASE, PLEASE, PLEASE

1. Post your questions on Zoom

2. Post useful links/references/ideas

3. Take initiative to learn

4. E-mail now if you want to learn some specific algorithmic technique(s)

5. Seek Help, Have Fun & Smile

10

Season - SUMMER

11

Season - FALL

12

Season - WINTER

13