Open source python for bibtex transformations

This project has used custom Python code at various steps to manipulate data formats, do some automated annotation parsing, and perform random subsampling. The Python source is all available as open source gists on GitHub: wos_txt_to_bibtex.py, for converting ISI Web of Science tab-delimited exports into BibTeX format; annotate_bibtex.py, for annotating … Continue reading Open source python for bibtex transformations
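To give a flavor of the kind of conversion and subsampling described above, here is a minimal sketch. It is not the code from the gists. It assumes the export is tab-delimited with a header row of standard Web of Science field tags (AU, TI, SO, PY, VL, BP, EP, DI) and that authors are separated by semicolons. The function names, the `wosN` citation keys, and the sample record are all made up for illustration.

```python
import csv
import io
import random

# Assumed mapping from WoS field tags (column headers) to BibTeX fields.
FIELD_MAP = {"TI": "title", "SO": "journal", "PY": "year", "VL": "volume", "DI": "doi"}

# Made-up record, only for illustration.
SAMPLE = (
    "AU\tTI\tSO\tPY\tVL\tBP\tEP\tDI\n"
    "Smith, J; Doe, A\tAn example title\tJournal of Examples\t2010\t12\t100\t110\t10.0000/example\n"
)


def record_to_bibtex(row, key):
    """Render one exported row (a dict keyed by WoS field tag) as an @article entry."""
    fields = {}
    # Assumption: authors are separated by semicolons in the AU column.
    authors = [a.strip() for a in (row.get("AU") or "").split(";") if a.strip()]
    if authors:
        fields["author"] = " and ".join(authors)
    for tag, name in FIELD_MAP.items():
        value = (row.get(tag) or "").strip()
        if value:
            fields[name] = value
    first = (row.get("BP") or "").strip()
    last = (row.get("EP") or "").strip()
    if first:
        fields["pages"] = f"{first}--{last}" if last else first
    body = ",\n".join(f"  {name} = {{{value}}}" for name, value in fields.items())
    return f"@article{{{key},\n{body}\n}}"


def wos_to_bibtex(text):
    """Convert a whole tab-delimited export (as a string) to a list of BibTeX entries."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    # Hypothetical citation keys: wos1, wos2, ...
    return [record_to_bibtex(row, f"wos{i}") for i, row in enumerate(reader, start=1)]


def subsample(entries, k, seed=0):
    """Draw a reproducible random subsample of at most k entries."""
    rng = random.Random(seed)
    return rng.sample(entries, min(k, len(entries)))


if __name__ == "__main__":
    print("\n\n".join(wos_to_bibtex(SAMPLE)))
```

Fixing the seed makes the subsample repeatable, which helps when the same subset needs to be revisited later. Real exports may need extra handling (for example a byte-order mark at the start of the file) that this sketch leaves out.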

June 30, 2011 – Initial Table and Graphs

The initial table and graphs were created using the results from the Google Scholar search and the subset of 150 articles analyzed from the WoS citations. Because the results combine a full search (GS) with a partial search (WoS) without extrapolation, they should not be taken as conclusive but … Continue reading June 30, 2011 – Initial Table and Graphs

June 30, 2011 – ASIS&T poster proposal to-do list

Today was spent in a mad dash putting together the poster proposal for the ASIS&T conference, which is due at midnight tomorrow. Here is our to-do list, as copied from the Google Chat between Heather and me: Here’s what it needs: – an abstract – a shorter intro. Total length … Continue reading June 30, 2011 – ASIS&T poster proposal to-do list

June 23, 2011 – Pangaea Data Collection

Today I was able to start the data collection/tracking for the Web of Science citations of the data collection article. As a reminder, we previously collected all of the citations for articles that have cited the data collection article for each of the datasets in all repositories. Heather completed a … Continue reading June 23, 2011 – Pangaea Data Collection

June 21, 2011 – Tracking Protein Data Bank

Today was spent tracking the Protein Data Bank (PDB) datasets. As Heather had predicted, there were quite a few more hits than for the previous data repositories (670 citations). The search terms used and search results can be seen in this Google Spreadsheet. As Heather predicted such a large amount of … Continue reading June 21, 2011 – Tracking Protein Data Bank