{"id":2363,"date":"2014-07-14T05:36:19","date_gmt":"2014-07-14T05:36:19","guid":{"rendered":"https:\/\/notebooks.dataone.org\/?p=2363"},"modified":"2014-07-14T05:36:19","modified_gmt":"2014-07-14T05:36:19","slug":"documenting-data-quality-mechanisms-from-citizen-science-project-websites-2-week-7","status":"publish","type":"post","link":"https:\/\/notebooks.dataone.org\/citsci-data\/documenting-data-quality-mechanisms-from-citizen-science-project-websites-2-week-7\/","title":{"rendered":"Documenting data quality mechanisms from citizen science project websites (2) \u2013 Week 7"},"content":{"rendered":"
This week I continue\u00a0documenting data quality mechanisms by visiting and trying the first randomly chosen 100 citizen science projects\u2018 websites in the citizen science project list I compiled earlier. The initial findings include first, in terms of project discipline, it is similar to what we found among the papers searched from\u00a0Scopus (see my post for week 3). More than half of the projects (N = 52) are related to biology and biodiversity conservancy, ten projects focus on water quality monitoring, and the rest of projects are about astronomy,\u00a0chemistry, language, archaeology, or are platforms for helping building citizen science projects.\u00a0I did not find a project focusing on the domain\u00a0of geography. However, many projects related to biodiversity includes geographic information as one important part of their data.<\/span><\/span><\/p>\n Second, in terms of data quality mechanism, it seems that there are domain specific preferences of data quality mechanism for water quality monitoring projects.\u00a0All ten projects about water quality adopt the mechanism of participant training, and the mechanism of quality assurance plan is found only among the water quality monitoring project. This phenomenon partially\u00a0answers\u00a0my question, or proves my guesses\u00a0mentioned in the end of my week 4 blog post. However, for projects in other disciplines, I did not observe any obvious data quality mechanism preferences so far.<\/p>\n Third, when trying those project websites by myself, I observed\u00a0that the level of technologies adopted by different projects varies a lot. For example, for data submission, it varies from purely offline submission, to email submission, to simply online submission form, to advanced online submission with smart filter. We can see that there is undoubtedly a development trend that computer technologies will be more and more adopted to assure and control data quality.<\/p>\n Fourth, as there is water quality project specifically\u00a0emphasizing the difference between data collection (i.e., collect water sample in offline environment) and data entry (i.e., digitize water quality sample and input the data into database), and pointing out the importance of maintaining quality of the database by paying special attention on limiting data entry to “a very few volunteers” (http:\/\/www.brodheadwatershed.org\/SWVolunteerpage.html<\/a>), I was inspired to rethink the relationship between “collect” and “assure” in data life cycle, under the circumstance that human as sensor and data collector. Should data entry belong to “collect” or “assure”?<\/p>\n This week, we also start to think about one\/multiple potential paper(s), based on the very first draft written by our mentor, we will mainly work on this draft next week, while I will continue documenting data quality mechanisms from\u00a0citizen science project websites.<\/p>\n","protected":false},"excerpt":{"rendered":" This week I continue\u00a0documenting data quality mechanisms by visiting and trying the first randomly chosen 100 citizen science projects\u2018 websites in the citizen science project list I compiled earlier. The initial findings include first, in terms of project discipline, it is similar to what we found among the papers searched Continue reading Documenting data quality mechanisms from citizen science project websites (2) \u2013 Week 7<\/span>