Digital Library Service Integration Senior Projects
description
Transcript of Digital Library Service Integration Senior Projects
Digital Library Service IntegrationSenior Projects
Professors Bieber, Im and Wu
Information Systems Department
College of Computing Sciences
New Jersey Institute of Technology
http://is.njit.edu/dlsi
For more senior project information:
http://is.njit.edu/dlsi/dlsi-sr-projects-s2003.doc
DL
= Distance Learning
= Digital Library
DLSI: DL Service Integration
Why Participate in DLSI?
• Real-world project– very helpful for students and teachers worldwide
• High visibility for you and NJIT
• Gain research experience and work with research teams
• Learning XML, XLS, and other skills
• Support: DLSI project leader and bi-weekly DLSI project meetings
Outline• Motivation
– supporting learning communities
• DLSI Architecture
• Senior Projects
Motivation:Community Knowledge Resides in...• documents
(published papers, reports, photos, videos, lesson plans, syllabi, etc.)
• discussions• decisions• conceptual models
• formal educational modules
• workflows/processes• people’s expertise
• links/relationships among all these
• Digital Library: Multimedia Document Services
Service Modules
Repositories
Doc
Doc
Integrationlinking related
documents
DLSI Architecture
for manipulating and maintaining data
for storing data
Service Modules
Repositories
Doc
Doc
Disc
Disc
• Digital Library: Multimedia Document Services
• Asynchronous Discussion Tools(Groupware)
IntegrationDiscussinga document
DLSI Architecture
for manipulating and maintaining data
for storing data
Service Modules
Repositories
Doc
Doc
Disc
Disc
H/ M
H/M
• Digital Library: Multimedia Document Services • Asynchronous Discussion Tools/Groupware
• Hypermedia Services(tours, annotation, linking)
IntegrationAnnotating a
discussion
Tours of documents and
discussion comments
DLSI Architecture
Service Modules
Repositories
Doc
Doc
Disc
Disc
H/ M
H/M
Proc
Proc
• Digital Library: Multimedia Document Services • Asynchronous Discussion Tools
• Hypermedia Services (tours, annotations, links)
• Processes/Workflows
IntegrationAnnotating and
discussing a community
process
DLSI Architecture
Service Modules
Repositories
Doc
Doc
Disc
Disc
H/ M
H/M
Proc
Proc
Dec'n
Dec'n
CKS
CKS
...
...
• Digital Library: Multimedia Document Services • Asynchronous Discussion Tools • Hypermedia Services
• Processes/Workflows • Decision Analysis Support
• Conceptual Knowledge Structures
• Others...
DLSI Architecture
Here are some examples. The agricultural system is verycomplex. It consists of farmers in interaction with the envi-ronment (weather, soil, pests), the economy and society.One problem currently receiving a lot of attention isIntegrated Pest Management.
The problem unfortunately is being addressed separatelyby experts in Plant Pathology, Entomology, Agronomy, Bot-any, and Soil Science, with no real common ground (and lit-tle common understanding) to base our decisions on.Meetings and workshops tasked with integration have takenplace, but these various subdisciplines could not seem toform an integrated viewpoint. No real methodology existsto discuss and analyze the systems, which eachsubdiscipline has developed. In the end, all IPM decisionsend up unintegrated and therefore only partially effective. Asystematic approach that would enable the different partici-pants to discuss, and determine all the interrelationships,which would help researchers link their practices and derivetheir effects on each others area and the environment,would greatly help the IPM research and lead to better deci-sion for the environment and society as a whole.
Thus, developing a systems methodology for understandingthe relationships, cause and effects, and repercussions inthe agricultural field would be very helpful. The research inthis incubation proposal would make an important contribu-tion towards this goal.
i
Deeply Understanding Complexity{document}
View Peer Review Comments{JESSE Peer Review service}
Enter your own Peer Review Comment{JESSE Peer Review service}
Search for similar/related documents{Core Search service}
Other collections with this document{DLSI Collection Registry}
Create a new comment on document{Core Annotation service}
Add document to current Guided Tour{DLSI Guided Tour service}
Start your own link from this document{DLSI Link service}
Plant Pathology {concept}
Ask an expert about this concept{in the Virtual Reference Desk}
Relevant NASA Experiments in Space{ National Space Science Data Center}
Search for this concept{Core Search service}
View Comments on this concept{Core Annotation service}
Create a new comment on this concept{Core Annotation service}
Guided Tours concerning this concept{DLSI Guided Tour service}
Start your own link from this concept{DLSI Link service}
Start a discussion on this concept{DLSI discussion environment}
Related journal articles{in JESSE}
DLSI:Integration through
Linking
DLSI is Based on theDynamic Hypermedia Engine
• Automatically adds link anchors, links and other “hypermedia” services to applications:
• comments
• guided tours
• structural search (based on links and relationships instead of keywords)
• others...
• See separate presentation
Shipping I nformation
http : / /www.nj i t .edu/purchasing /shipping-info rmat ion.htm
251 Requisition Header - Shipping and Text STRATEGIC SUPPLIES INTERN'L 71 UNION AVE Screen: Vend: V0000304390 PR: R010294 Inv: Line:
Deliver-to Address FOB: Name: MICHAEL BIEBER Rte: Org: NJIT, CIS DEPARTMENT Delivery Service: UPS Addr: 323 ML KING BLVD Delivery Date: 03-12-2000
City: NEWARK St: NJ Zip: 07102 Country: USA Phone: 973 596 2681 Ext: Requisition Codes: 58 128 PUX ZY2 Requisition Text:
Document Notes: NSample Screen from Purchasing System:all text with no links...
Shipping I nformation
http : / /www.nj i t .edu/purchasing /shipping-info rmat ion.htm
251 Requisition Header - Shipping and Text STRATEGIC SUPPLIES INTERN'L 71 UNION AVE Screen: Vend: V0000304390 PR: R010294 Inv: Line:
Deliver-to Address FOB: Name: MICHAEL BIEBER Rte: Org: NJIT, CIS DEPARTMENT Delivery Service: UPS Addr: 323 ML KING BLVD Delivery Date: 03-12-2000
City: NEWARK St: NJ Zip: 07102 Country: USA Phone: 973 596 2681 Ext: Requisition Codes: 58 128 PUX ZY2 Requisition Text:
Document Notes: NDHE automatically generates anchorsas the screen is created
Shipping I nformation
http : / /www.nj i t .edu/purchasing /shipping-info rmat ion.htm
251 Requisition Header - Shipping and Text STRATEGIC SUPPLIES INTERN'L 71 UNION AVE Screen: Vend: V0000304390 PR: R010294 Inv: Line:
Deliver-to Address FOB: Name: MICHAEL BIEBER Rte: Org: NJIT, CIS DEPARTMENT Delivery Service: UPS Addr: 323 ML KING BLVD Delivery Date: 03-12-2000
City: NEWARK St: NJ Zip: 07102 Country: USA Phone: 973 596 2681 Ext: Requisition Codes: 58 128 PUX ZY2 Requisition Text:
Document Notes: NDHE produces a list of possible linkswhen the user chooses an anchor
V0000304390
Vendor Details
Vendor Reliability
Vendor Agreements
Other Possible Vendors
Your Purchasing History
All Screens with this Vendor
Dynamic Hypermedia Engine
• Links generated based on application structure, not search or lexical analysis
– You cannot do a search on the display text “$127,322.12” to find related information…
– But you can find relationships for the element Sales[1997]
$85,101.99$127,322.12
1997 Expenses1997 Sales
Link Mapping Rules
...
Relationship Manager Rule Base…Vendor - Vendor IS - Vendor Details - {commands} Vendor - Vendor IS - Vendor Reliability - {commands}Vendor - Vendor IS - Vendor Agreements - {commands}Vendor - Purchasing Data Warehouse
- Who else uses vendor - {commands}Vendor - Purchasing IS
- Your Purchasing History - {commands}Vendor - CASE Workbench
- All screens with this vendor - {commands}…
Shipping I nformation
http : / /www.nj i t .edu/purchasing /shipping-info rmat ion.htm
251 Requisition Header - Shipping and Text STRATEGIC SUPPLIES INTERN'L 71 UNION AVE Screen: Vend: V0000304390 PR: R010294 Inv: Line:
Deliver-to Address FOB: Name: MICHAEL BIEBER Rte: Org: NJIT, CIS DEPARTMENT Delivery Service: UPS Addr: 323 ML KING BLVD Delivery Date: 03-12-2000
City: NEWARK St: NJ Zip: 07102 Country: USA Phone: 973 596 2681 Ext: Requisition Codes: 58 128 PUX ZY2 Requisition Text:
Document Notes: N
V0000304390
Vendor Details
Vendor Reliability
Vendor Agreements
Who Else Uses Vendor
Your Purchasing History
All Screens with this Vendor
DHE generates anchors and links from theRelationship Management Rule Base
Relationship Manager Rule Base...Vendor - Vendor IS - Vendor Details - {commands}Vendor - Vendor IS - Vendor Reliability - {commands}Vendor - Vendor IS - Vendor Agreements - {commands}Vendor - Purchasing Data Warehouse - Who Else Uses Vendor - {commands}Vendor - Purchasing IS - Your Purchasing History - {commands}Vendor - CASE Workbench - All Screens with this Vendor - {commands}...
Here are some examples. The agricultural system is verycomplex. It consists of farmers in interaction with the envi-ronment (weather, soil, pests), the economy and society.One problem currently receiving a lot of attention isIntegrated Pest Management.
The problem unfortunately is being addressed separatelyby experts in Plant Pathology, Entomology, Agronomy, Bot-any, and Soil Science, with no real common ground (and lit-tle common understanding) to base our decisions on.Meetings and workshops tasked with integration have takenplace, but these various subdisciplines could not seem toform an integrated viewpoint. No real methodology existsto discuss and analyze the systems, which eachsubdiscipline has developed. In the end, all IPM decisionsend up unintegrated and therefore only partially effective. Asystematic approach that would enable the different partici-pants to discuss, and determine all the interrelationships,which would help researchers link their practices and derivetheir effects on each others area and the environment,would greatly help the IPM research and lead to better deci-sion for the environment and society as a whole.
Thus, developing a systems methodology for understandingthe relationships, cause and effects, and repercussions inthe agricultural field would be very helpful. The research inthis incubation proposal would make an important contribu-tion towards this goal.
i
Deeply Understanding Complexity{document}
View Peer Review Comments{JESSE Peer Review service}
Enter your own Peer Review Comment{JESSE Peer Review service}
Search for similar/related documents{Core Search service}
Other collections with this document{DLSI Collection Registry}
Create a new comment on document{Core Annotation service}
Add document to current Guided Tour{DLSI Guided Tour service}
Start your own link from this document{DLSI Link service}
Plant Pathology {concept}
Ask an expert about this concept{in the Virtual Reference Desk}
Relevant NASA Experiments in Space{ National Space Science Data Center}
Search for this concept{Core Search service}
View Comments on this concept{Core Annotation service}
Create a new comment on this concept{Core Annotation service}
Guided Tours concerning this concept{DLSI Guided Tour service}
Start your own link from this concept{DLSI Link service}
Start a discussion on this concept{DLSI discussion environment}
Related journal articles{in JESSE}
DLSI:Integration through
Linking
Prototype
Prototype
Benefits of Integrationfor a system (collection/service)
• Users: direct access to related systems– enlarges a system’s feature set
• DLSI leads users to a system– systems gain wider use
• Users become aware of other systems– systems gain wider awareness
• Direct access to a system’s features– streamlined access (bypassing menus)
Finding Links
• Structural links (as with DHE)– when we know the object type
• Lexical analysis (Professor Wu)
– NJIT Noun Phrase Extractor– NJIT Ontology Developer
Filtering & Rank Ordering Links
• Collaborative Filtering (Professor Im)
– customize the link set for each user
• based on:– user-direct evaluations (ratings)– indirect evaluation (clickstream data)
• using a Collaborative Filtering Engine
AVC Collection WIKI ServiceCore Search &
Discovery Service
User`s WebBrowser
Core SearchWrapper
Collection (i) Service (j)
Digital Library Service Integration Manager
AVC CollectionWrapper
CollectionWrapper (i)
WIKI ServiceWrapper
ServiceWrapper (j)
How to Integrate
(1) Develop a Wrapper – Parse all display screens to identify the
“elements of interest” that DLSI will make into link anchors.
• Parse each kind of display screen
• Parse based on the standard template/layout or metadata provided
– Also call the lexical analysis routines to identify key phrases for you
How to Integrate, cont.
(2) Develop Linking Rules
– specify the “structural relationships” for recognized object types within the system being integrated.
• e.g., author, address, concept, spacecraft, measurement
– one rule per object type (class) per link
– all linking rules are merged by DLSI, so rules for other systems apply automatically to your system
• (e.g., annotations, discussions, related documents)
How to Integrate, cont.
(3) Initiate Communications:
– Several possible ways, depending on the application.
Outline• Motivation
– supporting learning communities
• DLSI Architecture
• Senior Projects– Project 1: AVC and AskNSDL/VRD– Project 2: Metis Workflow and JESSE– Project 3: NASA’s NSSDC– Project 4: Lexical Analysis and User Preferences– Project 5: myKnowledge
NSDL(National Science Digital Library)• Sponsored by the National Science
Foundation (NSF)
• Purpose: to provide educational resources in an integrated environment to students and teachers (kindergarten-graduate school)
• URL: http://www.nsdl.org/
• Flash Presentation: http://about.nsdl.org/flash
DLSI & NSDL
• DLSI is providing the integration for all of the NSDL system!
• Senior Projects will– be the first integration prototypes– provide necessary internal features
Project 1a Atmospheric Visualization Collection
• Provides visualization tools and images of weather data from Atmospheric Radiation Measurement (ARM) program
• ARM: the largest global change research program supported by the U.S. Dept. of Energy
• based at the Argonne National Laboratory• URL: http://www.nsdl.arm.gov/visualization.shtml
Automated Links (AVC)
• From concepts found in the glossary and from instruments:– link to the glossary definition– link to lesson plans containing it– link to an instrument's page– link to ARM publications containing the
keyword/instrument
Automated Links, cont. (AVC)
• From any relevant object to the appropriate data display page
• From any relevant object to the appropriate internal data file (for AVC internal developers)
Automated Links, cont. (AVC)
and, of course:
• Links to related objects, teaching notes and documents in other systems
• Links for additional services such as discussion, comments, guided tours, etc.
• Based at the University of Syracuse
• See separate presentation
Project 1b Ask-NSDL & Virtual Reference Desk
Automated Links(Ask-NSDL & VRD)
• Lexical analysis to find key phrases recognized in glossaries
• Links to other questions/answers for a key phrase
• Direct links to all relevant functions for experts, administrators (e.g., show all answers this expert made)
Automated Links, cont.
and, of course:
• Links to related objects, teaching notes and documents in other systems
• Links for additional services such as discussion, comments, guided tours, etc.
• Same kind of links for the Virtual Reference Desk! (http://www.vrd.org/)
Project 1b Ask-NSDL & Virtual Reference Desk
Project 2aMetis Workflow Engine
• based at the University of Colorado at Boulder
• Workflow: the process to get something done – involves triggering events from/in different systems
– some steps are automated (e.g., send email), others require people to do something
• See separate presentation
Automated Linking (Metis)
• Linking workflow definition tools to internal Metis documentation
• Linking workflow specifications and Metis displays with the systems involved in the workflow
Automated Links, cont. (Metis)
and, of course:
• Links to related objects, teaching notes and documents in other systems
• Links for additional services such as discussion, comments, guided tours, etc.
Project 2bJESSE/Picture of the Day
• Journal of Earth System Science Education
• Based at the Universities Space Research Association
• See external presentation
• URL: http://jesse.usra.edu/testing/
• Also, Earth Science Picture of the Day
• URL: http://epod.usra.edu
Automated Links (JESSE/POTD)
• Links among related pictures and articles
and, of course:
• Links to related objects, teaching notes and documents in other systems
• Links for additional services such as discussion, comments, guided tours, etc.
Project 3: NASA’sNational Space Science Data Center• based at the Goddard Space Flight Center
• URL: http://nssdc.gsfc.nasa.gov/
• preliminary starting demo connecting NSSDC with the University of Arizona Document Summarizer (see next page)
Prototype
Automated Linking (NSSDC)
• Links among related space missions, experiments, astronauts & scientists, and definitions/explanations of key phrase/concepts
• Links utilizing other NASA systems
Automated Links, cont. (NSSDC)
and, of course:
• Links to related objects, teaching notes and documents in other systems
• Links for additional services such as discussion, comments, guided tours, etc.
Project 4a (internal): Linking through Lexical Analysis
Finding Links
¥ Structural links (as with DHE)Ğwhen we know the object type
¥ Lexical analysis (Professor Wu)
ĞNJIT Noun Phrase Extractor
ĞNJIT Ontology Developer
• See external presentation
Project 4b (internal): User Preference Module
• UPM maintains database of user preferences for any module integrating with DLSI.
– communicates with other modules through DLSI’s existing message passing protocol
• UPM will communicate with users
– to gather preferences
– to get information from the user about his or her current task, so we can customize the links generated task and preferences
• We have a fairly complete set of requirements already prepared.
Project 5: myKnowledge
• Independent application; integrated with DLSI for all NSDL users
• Users maintain a knowledge base of ideas and references
• It has several fields where users can make notes, record references and mark characteristics (metadata) about a concept or DL “resource” (document, article or Web page)
myKnowledge
• myKnowledge information will be stored within a MySQL database.
• We have a fairly complete set of requirements already prepared.
• We need help determining the best way of presenting the information.
Automated Links (myKnowledge)
• Links to the actual resource on the Web
and, of course:
• Links to related objects, teaching notes and documents in other systems
• Links for additional services such as discussion, comments, guided tours, etc.
Automated Links, cont. (myKnowledge)
and, of course:
• Links to related objects, teaching notes and documents in other systems
• Links for additional services such as discussion, comments, guided tours, etc.
Why Participate in DLSI?
• Real-world project– very helpful for students and teachers worldwide
• High visibility for you and NJIT
• Gain research experience and work with research teams
• Learning XML, XLS, and other skills
• Support: DLSI project leader and bi-weekly DLSI project meetings
Digital Library Service IntegrationSenior Projects
Professors Bieber, Im and Wu
Information Systems Department
College of Computing Sciences
New Jersey Institute of Technology
http://is.njit.edu/dlsi
For more senior project information:
http://is.njit.edu/dlsi/dlsi-sr-projects-s2003.doc