Prospective and Retrospective Provenance Queries: Week 4 – YesWorkflow Data Model

Hi everyone, I’m Linh Hoang, from project 3. This week, my co-intern and I focused on creating a YesWorkflow Data Model with our own vocabulary. Previously, we used ProvONE Data Model (which is an extension of the standard Provenance Data Model recommended by W3C) to represent items of YesWorkflow. However, Continue reading Prospective and Retrospective Provenance Queries: Week 4 – YesWorkflow Data Model

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 3

My goals for week 3 were to collect download logs from a SOLR index, parse those logs into tokens, populate a database with the log information, and relate the download events to the search events by connecting them in time and by remote host address.  I was able to accomplish Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 3

Markdown-based Semantic Annotation of Workflow Scripts: Week 1

My name is Xiaoliang Jiang, a second-year graduate student in Library and Information Science from University of Illinois at Urbana Champaign. I am the DataONE summer intern for the project 1: Markdown-based Semantic Annotation of Workflow Scripts, which focuses on an extension to the RStudio environment enabling data analysts to Continue reading Markdown-based Semantic Annotation of Workflow Scripts: Week 1

What is DataONE and how do we attract our audience? Week 1

Hello Folks, Hope you are all enjoying the start of our internships.  My name is Megan Mach and I’m the intern working on “DataONE Messaging: Creating Marketing for DataONE Stakeholder Communities.” I have never done semi-live journaling, but I’m going to give it a stab in this weekly notebook. This Continue reading What is DataONE and how do we attract our audience? Week 1

Prospective and Retrospective Provenance Queries: Week 3 – SPARQL Recursive Queries

Hi everyone, It’s Linh Hoang from Project 3. This week, I spent most of my time to read papers and also ran some experiments to explore SPARQL recursive queries capability. The objective is to be able to run standard recursive queries in SPARQL. We found “property path” is a way Continue reading Prospective and Retrospective Provenance Queries: Week 3 – SPARQL Recursive Queries

Week-3-Update

This is the third week of my internship. I felt much comfortable to work with the professors and my co-partner. We basically made progress in schedule. For this week, given the elementary model structure in RDF/Turtle file for simulate_data_collection example of YesWorkflow, I further improved the RDF/Turtle data model by Continue reading Week-3-Update

Improving Search Capabilities Through Controlled Vocabularies: Week 1

My name is Elizabeth Olson, and I am the DataONE Summer Intern on the project, “Improving DataONE’s Search Capabilities Through Controlled Vocabularies.” This project focuses on improving researchers search capabilities within the DataONE data repository. In order to improve the recall and search precision of the data objects available through Continue reading Improving Search Capabilities Through Controlled Vocabularies: Week 1

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 2

For the second week of my project, my original goals were to collect download logs, parse the log events into tokens, and populate a database with the download information.  After our weekly internship call, my mentors and I decided to change things up a little bit. The purpose of building a Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 2

Prospective and Retrospective Provenance Queries: Week 2 – SPARQL Provenance Query

Hi all, It’s Linh Hoang from Project 3. This week, I am working on exploring SPARQL querying capabilities in YesWorkflow. The objective is to testing how well SPARQL can query YW outputs, which are represented in RDF format. First, I completed installing Virtuoso on my machine and wrote a manual of Continue reading Prospective and Retrospective Provenance Queries: Week 2 – SPARQL Provenance Query