{"id":1953,"date":"2014-01-31T17:43:18","date_gmt":"2014-01-31T17:43:18","guid":{"rendered":"https:\/\/notebooks.dataone.org\/?p=1953"},"modified":"2014-01-31T17:43:18","modified_gmt":"2014-01-31T17:43:18","slug":"dataone-community-engagement-via-twitter","status":"publish","type":"post","link":"https:\/\/notebooks.dataone.org\/data-science\/dataone-community-engagement-via-twitter\/","title":{"rendered":"DataONE Community Engagement via Twitter"},"content":{"rendered":"
DataONE has been around since 2010.<\/p>\n
It’s an NSF project, so it’s continually evaluated for performance.<\/p>\n
One metric could be reach and engagement on social media, as a measure of awareness about DataONE.<\/p>\n
Since I’ve looked at open science sentiment analysis<\/a> \u00a0before, I volunteered to poke around a bit on Topsy (www.topsy.com<\/a>) and see what I could find.<\/p>\n First, a search of tweets for DataONE, by month.<\/p>\n There have been 31 tweets with the phrase “DataONE” in the past 30 days.<\/p>\n http:\/\/topsy.com\/s?q=DataONE&window=m&sort=date<\/p>\n However, this includes various unrelated products named “DataONE”, including\u00a0a software vendor based in Southeast Asia and a broadband package.<\/p>\n Therefore, let’s take a look at tweets that mention DataONE’s Twitter handle:<\/p>\n @DataONEorg<\/a><\/p>\n Interestingly, for the time period December 31 to January 30, there have actually been 81 tweets mentioning @DataONEorg.<\/p>\n http:\/\/topsy.com\/analytics?q1=%40DataONEorg&via=Topsy<\/a><\/p>\n Repeating this search for “All-time” yields the following URL:<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date<\/a><\/p>\n I assume there is a limit for the free version.<\/p>\n I navigated to the last page:<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=90<\/a><\/p>\n The results are sorted by newest. \u00a0The oldest tweet available is dated two months ago.<\/p>\n I changed the “offset” key to 200.<\/p>\n The oldest tweet available is dated three months ago.<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=200<\/a><\/p>\n I changed the “offset” key to 900. The oldest tweet available is dated a year ago.<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=900<\/a><\/p>\n I changed the “offset” key to 999. \u00a0No data loaded.<\/p>\n I changed the “offset” key to 1,000. 
No data was retrieved.<\/p>\n I re-loaded\u00a0http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=900<\/a><\/p>\n The oldest tweet was from Carly Strasser<\/a>:<\/p>\n https:\/\/twitter.com\/carlystrasser\/status\/245276554333151232<\/a><\/p>\n It was dated September 10, 2012.<\/p>\n There are 10 tweets per page. \u00a0The pages are accessible in multiples of ten.<\/p>\n That is, working backwards from 900, the previous page would be 890.<\/p>\n \u00a0Prev<\/a><\/p>\n This explains why 999 did not work.<\/p>\n Let’s try again with 990.<\/p>\n Success!<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=990<\/a><\/p>\n The oldest tweet is from 2 years ago, by\u00a0Geoff Barker @geoffmuse<\/a><\/p>\n It is dated July 29, 2012.<\/p>\n Using 1000 as a key failed. What happens if we use 1010?<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=1010<\/a><\/p>\n No tweets found.<\/p>\n Therefore it appears the farthest back in time we can go with Topsy’s (free) analytics is July 29, 2012.<\/p>\n We know that DataONE is older than 2012.<\/p>\n How long has the @DataONEorg Twitter account been available?<\/p>\n Looking at the analytics service from “twtrland” we can see metrics.<\/p>\n http:\/\/twtrland.com\/profile\/DataONEorg<\/a><\/p>\n @DataONEorg has been on Twitter since Thursday, November 18, 2010.<\/p>\n For data available 3 months ago:<\/p>\n There are 202 followers<\/p>\n 52% are female<\/p>\n 73% are from the United States.<\/p>\n It’s possible to sort by number of followers.<\/p>\n http:\/\/twtrland.com\/profile\/DataONEorg\/followers<\/a><\/p>\n “We analyze all the content people share and how their audience reacts to it, to find their influential skills”<\/p>\n The skills reported are as follows:<\/p>\n Science, Research, Universities, Scientists, Community, Biologists, Biology, Management, Genomics, Librarians, Library, Publishing, Technology, Bioinformatics, Digital, Education, 
Ecology, Professors, Writers, Open Access.<\/p>\n There are 84 replies per 100 tweets<\/p>\n There are 28 retweets per 100 tweets<\/p>\n 37% of tweets are links<\/p>\n There are 0.01 tweets per day<\/p>\n The demographics information is interesting:<\/p>\n Some of the advanced analytics are available only by using the “pro version”. There is a free trial option; however, this is something only Community Engagement & Outreach coordinator Amber Budden would have access to. \u00a0I’ll let her know in case she wants to try it out.<\/p>\n At any rate, the point of looking at twtrland was to pinpoint when DataONEorg went live.<\/p>\n I now have a date of<\/p>\n Thursday, November 18, 2010.<\/p>\n I can go back to Topsy and try a range of dates to overcome the problem I encountered with being unable to seek tweets past offset 990, the tweet dated\u00a0July 29, 2012.<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&offset=10&mintime=1288605638&maxtime=1341136815<\/a><\/p>\n November 1, 2010 to July 1, 2012<\/p>\n Now I want to try November 1, 2010 to November 1, 2011<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&mintime=1288612824&maxtime=1320148851<\/a><\/p>\n The numbers here (1288612824) and (1320148851) look like Unix timestamps (seconds elapsed since January 1, 1970 UTC), which would correspond to midday UTC on November 1, 2010 and November 1, 2011.<\/p>\n The “most recent” tweet appears to be:<\/p>\n October 31, 2011.<\/p>\n https:\/\/twitter.com\/mcdonald\/status\/131111854327070720<\/a><\/p>\n The oldest tweet appears to be:<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&offset=150&mintime=1288612824&maxtime=1320148851<\/a><\/p>\n Beyond 150.<\/p>\n Let’s try an offset key of 300<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&offset=300&mintime=1288612824&maxtime=1320148851<\/a><\/p>\n No tweets found.<\/p>\n Let’s try an offset key of 220.<\/p>\n No tweets found.<\/p>\n Let’s try an offset key of 200.<\/p>\n No tweets 
found.<\/p>\n 180.<\/p>\n None<\/p>\n 170.<\/p>\n Three tweets found.<\/p>\n http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&offset=170&mintime=1288612824&maxtime=1320148851<\/a><\/p>\n The very first tweet (that was not spam) in Topsy mentioning @DataONEorg was from Heather Piwowar.<\/p>\n https:\/\/twitter.com\/DataONEorg\/status\/47691665087016960<\/a><\/p>\n So, working backwards from an offset of 170 (17 full pages at 10 tweets per page, plus the three tweets on the last page), there were at least 173 mentions in the 365-day period between November 1, 2010 and November 1, 2011.<\/p>\n I say “at least” because in the case of the “first tweet,” four other users “re-tweeted” the same tweet, and that was not reflected in the numbers from Topsy.<\/p>\n So, this appears to be a method for poring over the history of tweets.<\/p>\n How to systematically extract that data is another topic, which I will address in a separate post.<\/p>\n <\/p>\n <\/p>\n <\/p>\n <\/p>\n <\/p>\n <\/p>\n <\/p>\n <\/p>\n <\/p>\n <\/p>\n <\/p>\n","protected":false},"excerpt":{"rendered":" DataONE has been around since 2010. It’s an NSF project so it’s continually evaluated for performance. One metric could be reach and engagement on social media, as a measure of awareness about DataONE. Since I’ve looked at open science sentiment analysis \u00a0before, I volunteered to poke around a bit on Continue reading DataONE Community Engagement via Twitter<\/span>
\n<\/a><\/div>\n
\n<\/a><\/div>\n
\n<\/a><\/div>\n