Text Processing Methods for Data Extraction (PDF to HTML conversion)

Tried for Mac: VeryPDF “PDF to Any Converter.” Did not like it. PDF to HTML was not good. PDF to Excel was OK, but one complaint is that the documents are placed into a new folder. Might be useful: http://sourceforge.net/projects/pdftohtml/. From the first freeware/trialware software I tried, I’m definitely dog-earing …

DataONE Community Engagement via Twitter

DataONE has been around since 2010. It’s an NSF project, so it’s continually evaluated for performance. One metric could be reach and engagement on social media, as a measure of awareness about DataONE. Since I’ve looked at open science sentiment analysis before, I volunteered to poke around a bit on …

Designing a Meta-Analysis for Data Sharing via Open Science Networks

I’m behind in consulting the literature for references to figshare. From participating in the 2013 Walter E. Dean Environmental Information Management Institute, I am aware of a trend in research toward using scholarly databases to conduct a “meta-analysis.” The book about meta-analysis we referenced for the course in …

Survey of Data Management Early Adopters

In support of the sociocultural working group, I am now assisting with an inquiry into the phenomenon of sharing data online via publishing and archival services. In particular, I’m looking at the user community surrounding FigShare. Over the summer I took a course in Scholarly Publishing. The class examined a …