Exploration of Search Logs, Metadata Quality and Data Discovery: Week 9

The last goal of my DataONE summer internship was to try to determine whether or not metadata quality is related to data downloads. In other words, does higher quality metadata increase the likelihood that a dataset will be downloaded? The basic approach to answering the question is to gather two Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 9

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 8

One of the goals of this internship is to use the Metadata Quality API reporting feature to score metadata records within the DataONE system and then attempt to determine whether that metadata score has any relationship with whether or not (or how many times) a dataset described by that metdata Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 8

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 7

For the seventh week of my internship, I took up a few spatial questions that I discussed with my mentor group, as well as looking into the temporal component of DataONE search. Last week, I looked at searches in DataONE that are spatially explicit: searches that specify a collection of Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 7

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 6

For week six of my internship, I’ve been diving into the spatial search history of DataONE. The DataONE search interface includes a map that allows users to restrict search results to a spatial area on the map. We were curious about what areas of the Earth are the most common Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 6

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 5

For week five of my internship, my goals were to continue with exploratory data analysis to develop some additional figures and refine some of the exiting ones. Also, we had originally conceived of producing ‘session graphs’ that could illustrate the events of a session in a graphical way, but given what Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 5

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 4

My goal for week four was to do some exploratory data analysis (EDA), now that the data are all transformed into a system that makes them easy to query. I produced some preliminary results and figures describing the search and download events captured by the logs. I’ll go through a Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 4

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 3

My goals for week 3 were to collect download logs from a SOLR index, parse those logs into tokens, populate a database with the log information, and relate the download events to the search events by connecting them in time and by remote host address.  I was able to accomplish Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 3

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 2

For the second week of my project, my original goals were to collect download logs, parse the log events into tokens, and populate a database with the download information.  After our weekly internship call, my mentors and I decided to change things up a little bit. The purpose of building a Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 2

Exploration of Search Logs, Metadata Quality and Data Discovery: Week 1

My name is Ed Flathers, and I’m the DataONE Summer Intern on the project, “Exploration of Search Logs, Metadata Quality and Data Discovery.”  This project is largely focused on data mining and analysis of the DataONE search logs, download logs, and quality reports; many of my products will be program Continue reading Exploration of Search Logs, Metadata Quality and Data Discovery: Week 1

Welcome to the 2017 Summer Internship Open Notebooks

We are excited to begin work with our 2017 cohort of summer interns across a range of projects.  Information about the projects can be found on our internship description page. Our interns will start recording their activities, experiences and results in this space starting May 2017.