{"id":857,"date":"2011-06-08T13:17:39","date_gmt":"2011-06-08T13:17:39","guid":{"rendered":"http:\/\/notebooks.dataone.org\/lod4dataone\/?page_id=12"},"modified":"2013-05-15T15:37:09","modified_gmt":"2013-05-15T15:37:09","slug":"mentorplan","status":"publish","type":"post","link":"https:\/\/notebooks.dataone.org\/linked-data\/mentorplan\/","title":{"rendered":"Mentor Plan"},"content":{"rendered":"
Intern:<\/strong> A\u00edda G\u00e1ndara: a doctoral student from The Department of Computer Science<\/a> at The University of Texas at El Paso and a research student at Cyber-ShARE<\/a>.<\/p>\n Primary Mentor:<\/strong> Hilmar Lapp: from The National Evolutionary Synthesis Center (NESCent)<\/a>.<\/p>\n The Linked Open Data DataONE Summer Internship Mentor Plan below was conceived as a tentative plan for the duration of the internship.\u00a0 We expect this plan to change as the prototype evolves and community ideas are incorporated. This plan will be updated weekly. An original version of this plan can be found on Google Docs<\/a>.<\/p>\n Preliminary research will establish the infrastructure for the automation process, including the programming platform, initial browsing technology, demo site, and the development libraries and tools that will be used to implement the prototype. Overall, this plan focuses on \u00a0:<\/p>\n Preliminary Research Update<\/strong> Project Activities:<\/strong> Focus on selecting data and RDF structure for DataONE dataset repositories (KNB, Dryad, ORNL-DAAC)<\/p>\n Development opportunities for the intern:<\/strong> Identifying data knowledge contacts and learning how to apply RDF to scientific data in the DataONE repositories. Project Activities:<\/strong> Focus on extracting data and generating RDF<\/p>\n Development opportunities for the intern:<\/strong> Understand how to\u00a0extract data from different scientific source repositories, as well as the challenges in making scientific data browsable. Project Activities:<\/strong> Evaluate research effort and data with DataONE community<\/p>\n Development opportunities for the intern:<\/strong> Understand the RDF that is and should be available for scientific data. 
Project Activities:<\/strong> Focus on reconciliation of data with authoritative sources.<\/p>\n Development opportunities for the intern:<\/strong> Learning how to integrate RDF datasets with common authoritative sources. Project Activities: Integrating DataONE data with browser knowledge<\/strong><\/p>\n Development opportunities for the intern:<\/strong> Learning how to integrate RDF data with other\/source data and how to identify and choose valid\/useful data sources and vocabularies. Project Activities:<\/strong> Focus on Use Case 2 – How to search for DataONE data from other data.<\/p>\n Development opportunities for the intern:<\/strong> Learning how to integrate RDF data with other\/source data and how to identify and choose valid\/useful data sources. Project Activities:<\/strong> Focus on the bigger cloud.<\/p>\n Development opportunities for the intern:<\/strong> Learning how to integrate RDF datasets on a bigger cloud and identifying useful cloud features for integrating RDF data. Project Activities:<\/strong> Focus on the bigger cloud.<\/p>\n Development opportunities for the intern:<\/strong> Building a SPARQL query interface and queries into DataONE. Project Activities:<\/strong> Access DataONE data from an RDF Mashup with DBpedia or Data.gov data<\/p>\n Development opportunities for the intern:<\/strong> Alignment of research with DataONE and LOD community. Project Activities:<\/strong> Documentation and project completion.<\/p>\n Preliminary Research<\/strong><\/h2>\n
\n
\nJava seems to be the most appropriate language for writing the prototype. URIBurner seems to be the best candidate for loading the RDF and browsing the data. I will host the browser on my research website, mainly because I have the permissions needed to set it up for demoing the results. A link will be placed on the lod4dataone DataONE notebook to make the prototype easy to access. In addition to Java, Jena<\/a> will be used to build the RDF, along with whatever libraries are needed to access the three repositories. For now, this seems to be only the Metacat libraries used to access data on the KNB repository. For Dryad I will use the OAI-PMH Web services and the METS page to obtain data. For ORNL-DAAC it looks like I will have to browse pages from their FTP data repository. The LOD4DataONE<\/a> GitHub repository will be used to store all software created for this project.<\/p>\nWeek 1 (Jun 6th – Jun 10th)<\/strong><\/h2>\n
\n
\nExpected Outcomes:<\/strong> An initial list of datasets (at least 3 from each repository), vocabularies, and use cases that will be included in the prototype.
\nComplete?:<\/strong> Yes<\/p>\nWeek 2 (Jun 13th – Jun 17th)<\/strong><\/h2>\n
\n
\nExpected Outcomes: <\/strong>Extracted data accessible in RDF and browsable via the web. ==> Update: Data was extracted but is not yet browsable; browsers are not as straightforward in the RDF world.
\nComplete?:<\/strong> Partially ==> Update: All data can be browsed using the default OpenLink Data Explorer (ODE) add-on, but not using the RDF I created. I created RDF data only for Dryad, and ODE does not load it correctly.<\/p>\nWeek 3 (Jun 20th – Jun 24th)<\/strong><\/h2>\n
\n
\nExpected Outcomes:<\/strong> Identify next implementation steps for data reconciliation.
\nComplete?:<\/strong> Mostly ==> Update: Feedback was not as interactive as expected. Many questions remain, e.g., how to link data, which RDF vocabularies to use, and how to leverage RDF browsers.<\/p>\nWeek 4 (Jun 27th – Jul 1st)<\/strong><\/h2>\n
\n
\nExpected Outcomes:<\/strong> Data linked with outside sources ==> Update: Able to link, but the data is not always visible in useful views.
\nComplete?:<\/strong> Partially ==> Update: Data is being extracted and linked to internal and external sources, e.g., FOAF or hasPart type records, but the viewers are not showing it. I can run RDF queries, but that is not enough; I need to understand the RDF browsers that show maps, calendars, and timelines in order to demonstrate the usefulness of the structured data.<\/p>\nWeek 5 (Jul 4th – Jul 8th)<\/strong><\/h2>\n
\n
\nExpected Outcomes:<\/strong> Data linked to other sources of data.
\nComplete?:<\/strong> Partially ==> still need to complete demo.<\/p>\nWeek 6 (Jul 11th – Jul 15th) \u2013 Midterm evaluations.<\/strong><\/h2>\n
\n
\nDevelopment opportunities for the intern:<\/strong> Learning how to integrate RDF datasets on a bigger cloud.
\nExpected Outcomes:<\/strong> Identify next implementation steps for cloud integration.
\nComplete?:<\/strong> in progress<\/p>\nWeek 7 (Jul 25th – Jul 29th)<\/strong><\/h2>\n
\n
\nExpected Outcomes:<\/strong> DataONE data accessible from a bigger cloud, e.g., Linked Open Data Cloud.
\nComplete?:<\/strong> In progress ==> Will focus on queries for information scientists and queries across DataONE datasets. ORNL-DAAC input did not show much promise for additional data. I will try the ideas that were sent, but the data I am already grabbing seems good for demonstration and search purposes.<\/p>\nWeek 8 (Aug 1st – Aug 5th)<\/strong><\/h2>\n
\n
\nExpected Outcomes:<\/strong> Retrieve RDF about DataONE data.
\nComplete?:<\/strong> Almost ==> Cleaning up some RDF issues for all the data.<\/p>\nWeek 9 (Aug 8th – Aug 12th)<\/strong><\/h2>\n
\n
\nExpected Outcomes:<\/strong> Steps to close research.
\nComplete?:<\/strong> Mostly. Hilmar and I will be discussing the close of research which will include the final steps.\u00a0 I will be finishing up the webpage with the queries.<\/p>\nWeek 10 (Aug 15th – Aug 19th)<\/strong><\/h2>\n
\nExpected Outcomes:<\/strong> Final: Demo, Documentation & Code
\nComplete?:<\/strong> Yes. Will collect lessons learned and present them at the final meeting and in a possible publication.<\/li>\n","protected":false},"excerpt":{"rendered":"