This project has used custom Python code at several steps to convert between data formats, parse annotations automatically, and draw random subsamples.
The Python source is all available as open-source gists on GitHub:
- wos_txt_to_bibtex.py: converts ISI Web of Science tab-delimited exports into BibTeX format.
- annotate_bibtex.py: annotates BibTeX files using article info pulled from exports of our spreadsheets on Google Scholar. Also used to export citation counts and citation groups.
- sample_bibtex.py: performs stratified sampling of BibTeX records.
- bibtex_to_csv.py: exports BibTeX info to CSV for running statistics.
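
As an illustration of the stratified sampling step, here is a minimal sketch in Python. It is not the code from sample_bibtex.py. It assumes each BibTeX record has already been parsed into a dictionary, and that records are grouped by a `year` field. The function name `stratified_sample`, the `per_stratum` parameter, and the example records are all hypothetical.

```python
import random
from collections import defaultdict


def stratified_sample(records, key, per_stratum, seed=0):
    """Draw up to `per_stratum` records at random from each group defined by `key`.

    `records` is a list of dicts (an assumed in-memory form of BibTeX entries).
    """
    rng = random.Random(seed)  # a fixed seed keeps the subsample reproducible

    # Group the records into strata by the value of the chosen field.
    strata = defaultdict(list)
    for record in records:
        strata[record.get(key, "unknown")].append(record)

    # Sample each stratum independently, without replacement.
    sample = []
    for name in sorted(strata):
        group = strata[name]
        k = min(per_stratum, len(group))  # small strata are taken whole
        sample.extend(rng.sample(group, k))
    return sample


# Hypothetical records standing in for parsed BibTeX entries.
records = [
    {"ID": "a1", "year": "2009"},
    {"ID": "a2", "year": "2009"},
    {"ID": "a3", "year": "2009"},
    {"ID": "b1", "year": "2010"},
    {"ID": "c1", "year": "2011"},
    {"ID": "c2", "year": "2011"},
]

picked = stratified_sample(records, key="year", per_stratum=2)
print(len(picked))  # 5: two from 2009, one from 2010, two from 2011
```

Sampling each stratum separately guarantees that every group is represented in the subsample, which a single random draw over all records would not.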