Numenta ACM Data Min - PowerPoint Presentation
Transcript of Numenta ACM Data Min - PowerPoint Presentation
![Page 1: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/1.jpg)
Copyright © 2009 Numenta
Hierarchical Temporal Memory
Subutai [email protected]
Vice President, EngineeringNumenta
![Page 2: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/2.jpg)
Introduction to Numenta
What can we learn from Neuroscience?
How can we incorporate these ideas into
Algorithms?
How can we incorporate these ideas into
Applications?
Agenda
![Page 3: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/3.jpg)
Numenta Snapshot
•Creating a new computing technology, Hierarchical Temporal Memory, based on the structure and function of the neocortex
•16 employees
—Founded in 2005 by Jeff Hawkins, Donna Dubinsky and Dileep George
•For-profit company with very long term roadmap and “patient capital”
—Focus on core technology
—Currently developing our third generation of algorithms
—Very selective corporate partnerships and application development
![Page 4: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/4.jpg)
Numenta Timeline
2002 Redwood Neuroscience Institute, Jeff Hawkins
2004 On Intelligence, Hawkins and Blakeslee
Described theory of Hierarchical Temporal Memory (HTM)
2005 Mathematical formalism (Dileep George)
2005 Numenta founded to build new computing
platform based on HTM
2007 Released NuPIC software platform
2008 First HTM Workshop (>200 attendees)
2009 Vision toolkit Beta release
2010 Prediction toolkit release
![Page 5: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/5.jpg)
Demo: An Easy Visual Task
Goal: output the name of the object in the image
cell phone
cow
rubber duck
sailboat
![Page 6: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/6.jpg)
Why Isn’t This Easy For Computers?
Huge variations in images, even within a single category
It is impossible to write down a set of rules or transformations that cover all possibilities
![Page 7: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/7.jpg)
Vision4 - Four Category Object Recognition Demo
![Page 8: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/8.jpg)
Introduction to Numenta
What can we learn from Neuroscience?
How can we incorporate these ideas into
Algorithms?
How can we incorporate these ideas into
Applications?
Agenda
![Page 9: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/9.jpg)
No Universal Learning Machine
No Free Lunch Theorem“no learning algorithm has an inherent superiority over other learning algorithms for all problems.”
(Wolpert, 1995)
Universal Learning Machine Specific Learning Machine
Machine with assumptions that match the structure of the world
x
![Page 10: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/10.jpg)
• Many different regions performing specialized functions
• Local structure is similar across regions
The Neocortex
![Page 11: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/11.jpg)
Common Cortical Algorithm
![Page 12: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/12.jpg)
Cortical Hierarchy
Sensory data(retina)
Sensory data(skin)
• Representations are distributed hierarchically
• Connections are bidirectional – significant feedback projections
• Each region exposed to constantly changing sensory patterns and is constantly predicting future patterns
From: Felleman and Van Essen
![Page 13: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/13.jpg)
Introduction to Numenta
What can we learn from Neuroscience?
How can we incorporate these ideas into
Algorithms?
How can we incorporate these ideas into
Applications?
Agenda
![Page 14: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/14.jpg)
Hierarchical Temporal Memory (HTM)
•Network of learning nodes
•All nodes do same thing
— Learns common spatial patterns
— Learns common sequences (groups patterns with common cause)
•Create a hierarchical, spatio-temporal model of data
—Probability of sequences passed up
—Predicted spatial patterns passed down
•Bayesian methods resolve ambiguity
Common spatial patterns
Common sequences
High level causes
Low level causes
![Page 15: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/15.jpg)
First Order Markov GraphHTM Nodes Learn Static Patterns
Memorizes static patterns, “coincidences”
HTM Node
Stable, sparse vectors
[Input vector]
![Page 16: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/16.jpg)
First Order Markov GraphHTM Nodes Learn Temporal Sequences
Memorizes static patterns, “coincidences”
HTM Node
Models frequency of transitions between patterns
Variable order Markov Chains, “groups”
[Input vectors]
![Page 17: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/17.jpg)
First Order Markov GraphHTM Nodes Output Probability Over Sequences
HTM Node [P(g1), P(g2), … ]
[…], […], […], …
![Page 18: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/18.jpg)
HTM Nodes Are Connected In Hierarchies
![Page 19: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/19.jpg)
Hierarchies Allow Contextual Prediction
![Page 20: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/20.jpg)
Summary: Hierarchical Temporal Memory
•Network of learning nodes
•All nodes do same thing
— Learns common spatial patterns
— Learns common sequences (groups patterns with common cause)
•Creates hierarchical model of data
—Sequence names passed up
—Predicted spatial patterns passed down
•Bayesian methods resolve ambiguity
Common spatial patterns
Common sequences
High level causes
Low level causes
![Page 21: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/21.jpg)
Introduction to Numenta
What can we learn from Neuroscience?
How can we incorporate these ideas into
Algorithms?
How can we incorporate these ideas into
Applications?
Agenda
![Page 22: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/22.jpg)
Web Analytics
•Analyze temporal patterns in a very high traffic news website (Forbes.com)
•Question: Can HTM’s model temporal statistics and predict topics and pages of interest to users?
![Page 23: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/23.jpg)
Which Topic Is The User Interested In Next?
•177 total topics
•Random prediction gives 0.56% accuracy
?
?
Time
![Page 24: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/24.jpg)
Training Paradigm
HTM trained using 100,000 user sequences
Temporal pooler builds up a variable order sequence model
![Page 25: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/25.jpg)
Prediction Based On Page View Statistics
•Could predict using no temporal context, based just on popularity of different topics (“0’th order” prediction)
•This is what most sites do today
•Leads to 23% accuracy
?
?
Time
???
![Page 26: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/26.jpg)
First Order Prediction
•Can do better if we use transition probabilities from each page
•Improves accuracy from 23% to 28%
?
?
Time
??
![Page 27: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/27.jpg)
Variable Order Prediction
•“Variable order prediction” – how much temporal context you need is determined based on individual sequences
•Accuracy jumps to 45%
?
?
Time
![Page 28: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/28.jpg)
Summary: Predicting News Topics
Prediction
Accuracy
Random chance 0.56 %
Page views prediction 23 %
1st order prediction 28 %
Variable order prediction
45 %
![Page 29: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/29.jpg)
Summary: Predicting News Topics
Prediction
Accuracy
Accuracy Predicting Top-5
Pages
Random chance 0.56 % 3.16 %
Page views prediction 23 % 46 %
1st order prediction 28 % 58 %
Variable order prediction
45 % 69 %
HTMs potentially represent a powerful mechanism for predicting and analyzing web traffic patterns
![Page 30: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/30.jpg)
Potential Applications In Web Analytics
• Increase length of site visits
—Predict pages that are directly relevant to each user
• Increase revenue
—Predict ad-clicks based on current user’s immediate history
•Display interesting traffic patterns through a website
—What are most common sequences?
•Display changes in traffic patterns
—How are sequence models changing from day to day?
![Page 31: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/31.jpg)
Video Analysis: People Tracking
Person
![Page 32: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/32.jpg)
Example Videos – Persons
Occlusions Non-ideal lighting
Groups/overlapping peopleSmall, non-upright
![Page 33: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/33.jpg)
Non-Persons – Potential False Positives
Cars/Vehicles Balloons
Animals Trees/foliage/pool sweeper
![Page 34: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/34.jpg)
People Tracking Demo
![Page 35: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/35.jpg)
Applications In Biomedical Imaging
•Numerous pattern recognition tasks in biomedical imaging
![Page 36: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/36.jpg)
Pattern Detection In Digital Pathology
Glands Not glands
Task: detect patterns in biopsy slides indicative of cancer
Malformed glands -> could be prostate cancer
![Page 37: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/37.jpg)
Early Results Were Promising
Glands Not glands
•We trained a network to discriminate glands from other structures
•Test set accuracy was around 95%
![Page 38: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/38.jpg)
HTM For Biomedical Imaging
•HTM performing quite well in gland detection as well as some other tasks
•There could be applications in other areas of Biomedical Imaging
—Radiology
—Electron microscopy
—….
•Key differentiator:
—General purpose pattern recognition algorithm
—Most existing work involves coding very specific algorithms to specific patterns
![Page 39: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/39.jpg)
Applications Areas
•Web analytics
•Biomedical Imaging
•Video Analysis
•Credit card fraud
•Automotive
•Gaming
•Drug discovery
•Business modeling
•Healthcare
![Page 40: Numenta ACM Data Min - PowerPoint Presentation](https://reader035.fdocuments.in/reader035/viewer/2022062703/554e86e5b4c90573338b4796/html5/thumbnails/40.jpg)
Working With Numenta On HTMs
• NuPIC, Numenta Platform For Intelligent Computing, available free for research on numenta.com
• Support through an active forum
• Contains implementation of our second generation of algorithms
• Vision Toolkit Beta, free for research
• Easy to use GUI for creating vision applications
• Includes hosted inference and a web services API
• Internships available for students!
• Send email to [email protected]