Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.
-
Upload
ambrose-dawson -
Category
Documents
-
view
226 -
download
0
description
Transcript of Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.
![Page 1: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/1.jpg)
Progress Report - Year 2
Extensions of the PhD Symposium Presentation
Daniel McEnnis
![Page 2: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/2.jpg)
Overview
Accomplishments Data set acquisition and cleaning Theoretical achievements Graph-RAT improvements
![Page 3: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/3.jpg)
Current Data
40’s Jazz Recordings 2000 annotated recordings from 80 CDs Covers nearly all 40’s popular music
LastFM by Song Retrieves tag and user info by song Data cleaning on user playcounts needed
![Page 4: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/4.jpg)
Planned Data Set Acquisition
Explored DBTunes XML version of myspace.
Linking with LastFM data designed but not yet written.
Provides per-artist audio data for all recent artists.
![Page 5: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/5.jpg)
Theoretical Achievements
Algorithm Literature ReviewTheortical Computer Science journal
submissionNZCSRSC conference submissionRecommendation Tasks and Evaluation
Metrics
![Page 6: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/6.jpg)
Algorithm Literature
Systematic exploration of theoretical computer science and discrete mathematics.
Discovered 1973 SIAM paper for maximal clique algorithm.
Maximal clique algorithm is most efficient discovered
![Page 7: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/7.jpg)
Journal Submission
Submitted Graph Triples Census algorithm. Proof of correctness Proof of Time complexity Proof of Space Complexity
Rediscovery of 2001 algorithm in Social Networks
Most efficient implementation known
![Page 8: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/8.jpg)
NZCSRSC
Poster at the conferenceWritten as a short users guide
![Page 9: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/9.jpg)
Evaluation Exploration
Incorporating cross-validation into relational data.
9 types of music recommendation Personalized versus generic Open query versus targeted query Dynamic versus static data New music versus all music
![Page 10: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/10.jpg)
Personalized Radio
Open query with personalized presentation
Static data vs dynamic dataNew items prediction vs predict
anything
![Page 11: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/11.jpg)
Targeted Search
Not personalizedSimilarity queriesAutomatically generating targeted lists
for a browsing hierarchyNew music vs all musicStatic vs dynamic data
![Page 12: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/12.jpg)
Personalized Tag Radio
Create a personalized play list matching a given query
New music vs all musicStatic vs dynamic data
![Page 13: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/13.jpg)
Excluded Types
‘Top 40’ predictionRendered obsolete by other types
![Page 14: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/14.jpg)
Cross-Validation in Graphs
Actor removal Only form currently used All links to a particular actor are removed
Link removal Selected links from ground truth are
removed Algorithm evaluated on reproducing
missing links
![Page 15: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/15.jpg)
Graph-RAT Improvements
Release of 0.4.4 Finalized Graph-RAT as a relational
programming language Added propositional algorithms
Release of 0.5.0 New Query Subsystem Usability enhancements Space complexity improvements
![Page 16: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/16.jpg)
Aggregators
8 algorithms with 9 helper functionsCover each form of propositionalizationCover mappings between links and
propertiesCore primitives for Graph-RAT as a
programming language.
![Page 17: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/17.jpg)
Similarity
2 new similarity algorithms1 new distance metric
![Page 18: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/18.jpg)
Query Subsystem
28 primitives for searching in a graph 10 graph primitives 7 actor primitives 7 link primitives 4 property primitives
Functional - composition to build queries
![Page 19: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/19.jpg)
Performance Specs
Queries can return collections or iterators.
Collections Implemented as references into graphs Linear in number of references
Iterators Ordered sequences of objects Constant in space complexity (excluding
Graph ID and AllGraphs)
![Page 20: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/20.jpg)
Usability Enhancements
Properties and MetadataInterface enhancementsDynamic Loading of ClassesXML scripting support
![Page 21: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/21.jpg)
Properties and Metadata
Properties description Encapsulates all parameter code Utilizes Graph-RAT Property objects Comparison to JavaBeans
New Metadata Model Parameter model update Input/Output descriptors update
![Page 22: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/22.jpg)
Interface Updates
Arrays->Lists graph, link, actor, and property objects
Iterators All graph operations support iterators
![Page 23: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/23.jpg)
Dynamic Loading
Classes loaded from file at runtime.Loading controlled by call to loader
objectAutomatic registering with relevant
factoriesAll factories updated to support dynamic
loading Extend Abstract Factory
![Page 24: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/24.jpg)
XML Scripting support
SAX parser support for all components excepting crawling and parsing
Implemented using the Builder pattern
![Page 25: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/25.jpg)
Core Improvements
2 cross-validation algorithms~20 algorithm with space complexity
improvementsIterators for all graph primitivesMacros for separation of graph data by
cross-validation property.
![Page 26: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/26.jpg)
Additional algorithms
2 new similarity algorithms
1 new distance metric added
Obsolete algorithms removed
![Page 27: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/27.jpg)
LastFM crawler updates
LastFM upgraded its web-services, removing the old version
New version will link to the semantic web
~20 parsers completedStill under construction
![Page 28: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/28.jpg)
Planned Future Work
Contingent on arrival of computerTesting of existing codeCross-Validation SchedulerCompletion of LastFM ParserDBTunes (from semantic web) parserExperiments!Write Thesis!
![Page 29: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/29.jpg)
Unplanned Future Work
Full semantic web crawlerIncorporating GData protocolsDatabase backendColt-Matrix-Over-Graph adapterDatabase-backed Weka instance
![Page 30: Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.](https://reader033.fdocuments.in/reader033/viewer/2022051301/5a4d1aed7f8b9ab05997be67/html5/thumbnails/30.jpg)
Beyond the Horizon
Support for Prolog primitivesMulti-database graph supportSemantic Web graph utilizing the proxy
patternSupport for dynamic updates and
dynamic data