Week 4: ECSO knowledge representation and carbon cycling incorporation

This week we had two goals 1) to further define our knowledge representation structure by going through the ECSO ontology and determining which custom annotations can be systematically replaced with standard SKOS elements and 2) to improve the thematic contents of interest within the ontology related to carbon cycling. For Continue reading Week 4: ECSO knowledge representation and carbon cycling incorporation

Prospective and Retrospective Provenance Queries: Week 5 – YW Data Model & SPARQL queries (cont.)

Hello everyone, It’s Linh Hoang, the intern from project 3. This week, my co-intern and I continue to focus on revising our YesWorkflow Data Model with our own vocabulary. Besides that, we also created an UML diagram to represent components of the model and how they connects to each other. The diagram is Continue reading Prospective and Retrospective Provenance Queries: Week 5 – YW Data Model & SPARQL queries (cont.)

Week-5-Update

During the middle week of my intern, I continued working on improving the YesWorkflow Conceptual Model vocabulary and structure. As suggested by my primary mentor, I utilized Markdown format combined with HTML tags to create the model vocabulary documentation table to make it machine-readable. I also created another table for Continue reading Week-5-Update

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 4

My goal for week four was to do some exploratory data analysis (EDA), now that the data are all transformed into a system that makes them easy to query. I produced some preliminary results and figures describing the search and download events captured by the logs. I’ll go through a Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 4

Meeting with Stakeholders – DataONE Messaging Week 2

Hello Folks, This week in the Messaging Internship I spent my time working on a one-liner (i.e., broad audience statement) for describing DataONE, clarifying internal jargon, and interviews with some of the DataONE stakeholders. The challenge in developing a one-liner to describe a company is clearly going to be in Continue reading Meeting with Stakeholders – DataONE Messaging Week 2

Week-4-Update

During this week, I further improved the RDF model for YesWorkflow model facts. Especially, I introduced three self-defined associations (yw:isAssociatedWith, yw:filePath, yw:portType) and one class (yw:Data). These namespaces can help me to distinguish input ports, output ports and parameters, and to connect data with URI templates to corresponding ports. To Continue reading Week-4-Update

Prospective and Retrospective Provenance Queries: Week 4 – YesWorkflow Data Model

Hi everyone, I’m Linh Hoang, from project 3. This week, my co-intern and I focused on creating a YesWorkflow Data Model with our own vocabulary. Previously, we used ProvONE Data Model (which is an extension of the standard Provenance Data Model recommended by W3C) to represent items of YesWorkflow. However, Continue reading Prospective and Retrospective Provenance Queries: Week 4 – YesWorkflow Data Model

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 3

My goals for week 3 were to collect download logs from a SOLR index, parse those logs into tokens, populate a database with the log information, and relate the download events to the search events by connecting them in time and by remote host address.  I was able to accomplish Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 3