A Quick Overview of the SEAD DataNet · PDF fileA Quick Overview of the SEAD DataNet project...
Transcript of A Quick Overview of the SEAD DataNet · PDF fileA Quick Overview of the SEAD DataNet project...
AQuickOverviewoftheSEADDataNetproject
JimMyers([email protected]),5thNaDonalDataServiceConsorDumWorkshop
Cooperative agreement #OCI0940824
SEAD: Sustainable Environment -Actionable Data
• AnNSFDataNetprojectstartedinOctober,2011
• AninternaDonalresourceforsustainabilityscience
• Aprovideroflight-weightDataServicesbasedonnoveltechnicalandbusinessapproaches:– SupporDngthelong-tailofresearch– EnablingacDveandsocialcuraDon– Providingintegratedlifecyclesupportfor
data
h"p://sead-data.net/
MargaretHedstrom,PIPraveenKumar,co-PIJimMyers,co-PIBethPlale,co-PI
Why do we need “Data Patriotism”?
Thefutureneedsyourdata!
“Andso,myfellowAmericans,asknotwhatyourdatacandoforyou,askwhatyoucandoforyourdata”
Onlyyoucanpreventdataloss!
Imagine Having to Ask: “Why Acquire Data, Analyze It, and Draw Conclusions?”
• Fundingagenciesurgingresearcherstoacquiredata!?!
• PublishersmandaDngthatauthorsanalyzetheirdata?!!
• AnoutcryforscienDststoputmoreeffortintodrawconclusionstohelpdriveprogressandeconomicgrowth!?!
?
SEAD Overarching Concepts
• Self-service• Simplebasicdefaults,richcustomizaDon,branding,andautomaDonopDons
• Leverageincremental,informalacDveusetocapturedataandmetadatafromfirstsources
• Providedata-related(metadata-driven)servicestoacDveproducers,curators,andusersofdata
• SimplifyandautomatecuraDonandpreservaDonprocessesusingcapturedinformaDonandcontext
• LeverageexisDnginsDtuDonalrepositorytechnologiesandorganizaDonstoprovidelong-termstorage
IncreaseValue,LowerCosts,IncreaseImmediacy
Consider SEAD: An NSF DataNet Partner
Whereyourprojectcanrequestasecure,brandedProjectSpaceinthecloud…
Consider SEAD: An NSF DataNet Partner
Drag-and-dropyourdata,orimportfromotherwebsites
Consider SEAD: An NSF DataNet Partner
Usethemtosearch,coordinatewithcolleagues
AddTags,FormalTerms(configurable),comments…
Consider SEAD: An NSF DataNet Partner
• CreaterelaDonships
Citedin
HasCorrecDonUsesCalibraDon
GeneratedUsing UsesProcedure
hcp://myproj.org/wiki/p7
Orletyoursogwarewritethem!-ResiulAPI-Library
Consider SEAD: An NSF DataNet Partner
FindaRepositoryAndclicktopublish!– Cloud/FilestorageforlargecollecDons(#filesortotalsize),
– InsDtuDonalRepositoriesformoderatecollecDons
IUSDA
Consider SEAD: An NSF DataNet Partner
• PublicaDonwithSEADmeans:– PersistentDataID(i.e.DOI)– MulDpleRepositoryopDons,includinglightweight,standards-basedpackageforlong-termstorage(BagIT,OAI-ORE,JSON-LD)
– RegistraDonwithDataOneCatalog
– Branded“PublishedData”pagetolinkinyourwebsite
hcp://dx.doi.org/10.5072/FK2FF3PK7W
SEAD Interacts with:
• Projects&theirwebsites• AuthenDcaDonservices(Google,ORCID,local,…)• ResearcherProfileServices(ORCID,(VIVO),…)• DataSources(TerraPop,NEON,‘any’,…)• DataProcessors(BrownDog,Geoserver,image/video
players,…)• Repositories(Dspace,Fedora,Cloud,openICPSR,…)• DiscoveryServices(DataOne,DataCite,…)• ApplicaDons/Services(R,ECubeGeosemanDcs,VIC/DFC,…)
--withoutdeepagreementonarchitectural/modeldetails--withmechanismstohelpinteroperability/synthesis
SEAD as NDS Infrastructure:
SEADProjectSpaces
Sharing,CuraDon,PublicaDon,Reuse
WebApp/RESTAPIData/MD
BrownDogSvcs(RabbitMQ)EarthCube
GeosemanDcsServices
SEADPublishingMatchmaking,Publishing,CI
EcosystemIntegraDon(Profiles,papers,catalogs,events,provenance,…)
RESTAPI
ComputaDonalEnvironments
Tools
Apps
Browser
Long-termRepositories
NDS/ReferencePublisherDOILandingPages
Webapp
PublishingAgentApp
FileSystem/CloudStore
DomainCyberinfrastructures
Profiles/Pubs
ProfileSources,SSO
Catalogs(DataOne,…)
Third-PartyRepositories
SEAD as a Data Source:
• IniDalCommuniDes:– SustainabilityResearch(Ecological,Social)– Centerstogradstudentsandcountyparkmanagers
– 2-3Mfiles,2TB+,20+groups• LongerTerm:
– Open–longtailofresearchprojectsneeding• hosteddataservices• coreshare/curate/publishcapabiliDesforcustomCI
– ObservaDonal,Experimental,Modeling,AnalyDcsDataacrossdisciplines
– RichpublicaDons(e.g.forcollaboraDon,integraDon,reproducibility)
SEAD as an NDS Consortium Member
• Interestedin– CloudDeployment(Labs?)– InteroperableDataPublicaDons(ORE,BagIT,+)(Labs?)
– ArchitectureandBusinessModelIssues(NDS?Ecube,RDA?)
– SupporDngNaDonalDataPublicaDonNeeds(NDS,Share?)
– IntegraDngtoimplementtheNDSvisionacrosssystems(Labs,Share,NDS,RDA,?)