Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative,...
-
Upload
linette-morrison -
Category
Documents
-
view
218 -
download
0
Transcript of Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative,...
Markup and Validation Agents in Vijjana – A Pragmatic model for Self-Organizing, Collaborative, Domain-Centric Knowledge Networks
S. Devalapalli, R. Reddy, L. Wang, S. ReddySIPLab, Department of Computer Science & Electrical EngineeringWest Virginia University,
Morgantown, WV 26506, USA
Presentation Outline
Motivation Vijjana Architecture Keyword Extraction Vijjana Browser Extensions Markup and Validation Agents Conclusion and Future Work
Motivation-Knowledge acquisition Process People gain knowledge
through thousands of ways
People accumulate knowledge in a systematic way
People form their own knowledge network
A large knowledge network are formed by collaboration
Workstations
Agent
Context AwareWorld Aware
Selft Aware
Motivation-Methodology in Knowledge acquisition Management Science on
Knowledge organization Machine facilitates people
to gather knowledge Collaborative channel is
needed in communication Knowledge network
publication
Vijjana
Defined as a Pragmatic model for Collaborative, Self-Organizing, Domain-Centric Knowledge Networks
A Semantic web A portal for collaboration A discussion forum And much more!
The Vijjana Model
Vijjana-X = {J, T, R| dA, oA, cA, vA, sA, rA}; where X = the domain name, J= the collection of JAN’s in the Vijjana-X, T = the Taxonomy OR pattern set used for classification of
JAN’s, R= the domain specific relations; dA = the discovery agent which finds relevant JAN’s, oA = the organizing agent which interlinks the JAN’s based
on R, cA = the consistency/completeness agent, vA = the visualization agent, sA = the search agent, rA = the rating agent.
Vijjana Architecture- A standard way to exploit the knowledge Find Organize Update Maintain Consistency and Completeness Distill Tools for Visualization Present on demand contextually relevant
knowledge!
Vijjana Client Interface Architecture
Vijjana Architecture-Knowledge Representation Semantic Networks Logic Frames
graph views of the Vijjana-Computer Science network
How useful is Vijjana
No unproductive browsing sessions anymore Search by concept, not by keywords Semantic visualization Social networking Receive alerts on topics of interest Combine resources on the Web and a user’s local
machine to form a “User JAN Space” Integrate and share “JAN Spaces” among users
Vijjana Network
Vijjana Markup Agents--Web Interface Prototype Browse JAN’s with in the user interest. Comment on the JAN, for discussion . Rate the JAN, to get best & useful content. Visualization of Taxonomy, for addition of
JAN’s manually. Visualization of Knowledge Domains for easy
navigation and User friendly search.
Keyword Extraction
Effectively summarize long documents Provide a context to the document Very valuable in web advertising Vijjana tags JANs with keywords describing them Examples of keyword assignment and usage include
youtube, Gmail, search engines etc
Keyword extraction in Vijjana
KEA algorithm used Simple and effective algorithm based on the
Bayesian model Domain specific keyword extraction Less overhead in training needed Available at http://www.nzdl.org/kea
Vijjana Browser Extensions
Firefox browser required Extensions are provided as toolbar buttons and
menus Extensions must be downloaded and installed on the
user’s browser Current extensions provide navigation to Vijjana
homepage, the Markup feature and Validation of JANs in the database
Vijjana Extensions in Firefox
Markup Process
Part of building up the database Similar to, but more involved than bookmarking Process of adding meta-data to a JAN Pages added to the database simply by clicking the
“Markup” button in the browser extension Invokes the Organizing Agent which adds a JAN to
the database
Markup Example
Markup Example (contd.)
Markup Process
Validation Agent
There may be hundreds or thousands of JANs in a user’s space
JANs are usually URLs or documents that might relocate or cease to exist
JANs must be validated (manually or automatically) The visualization must reflect most recent state of the
JAN
Validation Agent
Validation can be time and memory intensive task Time taken to validate is proportional to the number
of JANs in a user’s space Best carried out as an overnight operation
Validation Process Flow
Validation Confirmation
Database view before Validate process
Database view after Validation
Conclusion
A Firefox browser extension with options to navigate to Vijjana Homepage, Markup the current page and validate the user’s JANs has been developed
The KEA model was trained using a set of 24 documents pertaining to various technical domains and the results have been good
Future Work
A heuristic based key extraction algorithm called VKE is under evaluating
Automated periodic JAN “Validator” which runs as clients work instead of server work to manage load balance.
A series security mechanism should be applied for protecting privacy issue.
References
1. Vijjana – A Pragmatic model for Collaborative, Self-Organizing, Domain-Centric Knowledge Networks - Reddy, Dr. Ramana. Morgantown : IKE08, 2008.
2. KEA: Practical Automatic Key phrase extraction - Witten I.H., Paynter G.W., Frank
3. The KEA project - http://www.nzdl.org/Kea/
4. Mozilla Developer Center – Building an extension, http://developer.mozilla.org/en/docs/Building_an_Extension
5. Twine – Radar Networks Inc., http://www.twine.com/
6. Del.icio.us social bookmarking - http://del.icio.us/
Questions?