{"id":1953,"date":"2014-01-31T17:43:18","date_gmt":"2014-01-31T17:43:18","guid":{"rendered":"https:\/\/notebooks.dataone.org\/?p=1953"},"modified":"2014-01-31T17:43:18","modified_gmt":"2014-01-31T17:43:18","slug":"dataone-community-engagement-via-twitter","status":"publish","type":"post","link":"https:\/\/notebooks.dataone.org\/data-science\/dataone-community-engagement-via-twitter\/","title":{"rendered":"DataONE Community Engagement via Twitter"},"content":{"rendered":"

DataONE has been around since 2010.<\/p>\n

It’s an NSF project so it’s continually evaluated for performance.<\/p>\n

One metric could be reach and engagement on social media, as a measure of awareness about DataONE.<\/p>\n

Since I’ve looked at open science sentiment analysis<\/a> before, I volunteered to poke around a bit on Topsy (www.topsy.com<\/a>) and see what I could find.<\/p>\n

First, a search of tweets for DataONE, by month.<\/p>\n

There have been 31 tweets with the phrase “DataONE” in the past 30 days.<\/p>\n

http:\/\/topsy.com\/s?q=DataONE&window=m&sort=date<\/p>\n

However, this count includes other products named “DataONE”, including\u00a0a software vendor based in Southeast Asia and a broadband package.<\/p>\n

Therefore, let’s take a look at tweets that mention DataONE’s Twitter handle:<\/p>\n

@DataONEorg<\/a><\/p>\n

Interestingly, for the period December 31 to January 30, there were actually 81 tweets mentioning @DataONEorg.<\/p>\n

http:\/\/topsy.com\/analytics?q1=%40DataONEorg&via=Topsy<\/a><\/p>\n

Repeating this search for “All-time” yields the following URL:<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date<\/a><\/p>\n

I assume there is a limit for the free version.<\/p>\n

I navigated to the last page:<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=90<\/a><\/p>\n

The results are sorted by newest. \u00a0The oldest tweet available is dated two months ago.<\/p>\n

I changed the “offset” key to 200.<\/p>\n

The oldest tweet available is dated three months ago.<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=200<\/a><\/p>\n

I changed the “offset” key to 900. The oldest tweet available is a year ago.<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=900<\/a><\/p>\n

I changed the “offset” key to 999. \u00a0No data loaded.<\/p>\n

I changed the “offset” key to 1,000. No data was retrieved.<\/p>\n

I re-loaded the results at offset 900:\u00a0http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=900<\/a><\/p>\n

The oldest tweet was from Carly Strasser<\/a>:<\/p>\n

https:\/\/twitter.com\/carlystrasser\/status\/245276554333151232<\/a><\/p>\n

It was dated September 10, 2012.<\/p>\n

There are 10 tweets per page. \u00a0The pages are accessible in multiples of ten.<\/p>\n

That is, working backwards from 900, previous would be 890.<\/p>\n

\u00a0Prev<\/a><\/p>\n

This explains why 999 did not work.<\/p>\n
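
The paging pattern above can be sketched in a few lines of Python. This is only an illustration of how the offsets step in tens; the function name, the rule of rejecting offsets that are not multiples of ten, and the assumption that offset=0 means the first page are mine, not anything documented by Topsy.<\/p>\n

```python
from urllib.parse import urlencode

PAGE_SIZE = 10  # the post observes 10 tweets per results page


def page_urls(query, last_offset, base='http://topsy.com/s'):
    """Build one search URL per results page: offsets 0, 10, ..., last_offset.

    Hypothetical helper; it only builds URLs and does not fetch anything.
    """
    if last_offset % PAGE_SIZE != 0:
        # Mirrors the observation that 999 loads nothing while 990 works.
        raise ValueError('offset must be a multiple of %d' % PAGE_SIZE)
    urls = []
    for offset in range(0, last_offset + PAGE_SIZE, PAGE_SIZE):
        params = {'q': query, 'window': 'a', 'type': 'tweet',
                  'sort': 'date', 'offset': offset}
        urls.append(base + '?' + urlencode(params))
    return urls


urls = page_urls('@DataONEorg', 990)
print(len(urls))
print(urls[-1])
```

Stepping from 0 to 990 in tens gives 100 page URLs, and urlencode takes care of writing “@” as %40.<\/p>\n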

Let’s try again with 990.<\/p>\n

Success!<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=990<\/a><\/p>\n

The oldest tweet is from 2 years ago, by\u00a0Geoff Barker @geoffmuse<\/a><\/p>\n

It is dated July 29, 2012.<\/p>\n

Using 1000 as a key failed. What happens if we use 1010?<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&window=a&type=tweet&sort=date&offset=1010<\/a><\/p>\n

No tweets found.<\/p>\n

Therefore it appears the farthest back in time we can go with Topsy’s (free) analytics is July 29, 2012.<\/p>\n

We know that DataONE is older than 2012.<\/p>\n

How long has the @DataONEorg Twitter account been active?<\/p>\n

The analytics service “twtrland” reports some metrics for the account.<\/p>\n

http:\/\/twtrland.com\/profile\/DataONEorg<\/a><\/p>\n

@DataONEorg has been on Twitter since Thursday, November 18, 2010.<\/p>\n

For data available 3 months ago:<\/p>\n

There are 202 followers<\/p>\n

52% are female<\/p>\n

73% are from the United States.<\/p>\n

It’s possible to sort by number of followers.<\/p>\n

http:\/\/twtrland.com\/profile\/DataONEorg\/followers<\/a><\/p>\n

“We analyze all the content people share and how their audience reacts to it, to find their influential skills”<\/p>\n

The skills reported are as follows:<\/p>\n

Science, Research, Universities, Scientists, Community, Biologists, Biology, Management, Genomics, Librarians, Library, Publishing, Technology, Bioinformatics, Digital, Education, Ecology, Professors, Writers, Open Access.<\/p>\n

There are 84 replies per 100 tweets<\/p>\n

There are 28 retweets per 100 tweets<\/p>\n

37% of tweets are links<\/p>\n

There are 0.01 tweets per day<\/p>\n

The demographics information is interesting:<\/p>\n

Followers Demographics<\/p>\n

16 Countries: United States 72.4%, United Kingdom 7.6%, Canada 3.8%<\/p>\n

69 Cities: Washington 4.2%, Manchester 3.1%, San Francisco 3.1%<\/p>\n

Followers\u00a0Skills: Research 10.2%, Science 10.1%, Community 7.3%<\/p>\n

Some of the advanced analytics are available only in the “pro version”. There is a free trial option, but only Community Engagement & Outreach coordinator Amber Budden would have access to it. I’ll let her know in case she wants to try it out.<\/p>\n

At any rate, the point of looking at twtrland was to pinpoint when @DataONEorg went live.<\/p>\n

I now have a date: Thursday, November 18, 2010.<\/p>\n

I can go back to Topsy and try a range of dates to overcome the problem I encountered of being unable to page past offset 990, where the oldest tweet was dated July 29, 2012.<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&offset=10&mintime=1288605638&maxtime=1341136815<\/a><\/p>\n

This URL covers November 1, 2010 to July 1, 2012.<\/p>\n

Now I want to try November 1, 2010 to November 1, 2011:<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&mintime=1288612824&maxtime=1320148851<\/a><\/p>\n

I wish I could make sense of the numbers here (1288612824) and (1320148851) but for now I just need to accept on faith that they correspond to November 1, 2010 and November 1, 2011.<\/p>\n
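
For what it is worth, the two numbers look like Unix timestamps, that is, seconds counted from 1970-01-01 00:00:00 UTC. A minimal Python sketch, assuming that reading is right, can decode them and also go the other way to build a date-range URL. The helper names are hypothetical, and the choice of midnight UTC for the range boundaries is my assumption; the URLs above use slightly different times of day.<\/p>\n

```python
from datetime import datetime, timezone
from urllib.parse import urlencode


def to_utc(ts):
    """Interpret ts as seconds since 1970-01-01 00:00:00 UTC."""
    return datetime.fromtimestamp(ts, tz=timezone.utc)


def to_epoch(year, month, day):
    """Seconds since the epoch for midnight UTC on the given date."""
    return int(datetime(year, month, day, tzinfo=timezone.utc).timestamp())


def range_url(query, start, end, base='http://topsy.com/s'):
    """Build a search URL limited to [start, end]; dates are (y, m, d) tuples."""
    params = {'q': query, 'type': 'tweet', 'sort': 'date',
              'mintime': to_epoch(*start), 'maxtime': to_epoch(*end)}
    return base + '?' + urlencode(params)


# Decode the two values from the URL above.
for ts in (1288612824, 1320148851):
    print(ts, to_utc(ts).isoformat())
# 1288612824 2010-11-01T12:00:24+00:00
# 1320148851 2011-11-01T12:00:51+00:00

# Encode a date range of my own choosing.
print(range_url('@DataONEorg', (2010, 11, 1), (2011, 11, 1)))
```

Both values land around midday UTC on November 1, 2010 and November 1, 2011, which agrees with the dates chosen for the search.<\/p>\n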

The “most recent” tweet appears to be:<\/p>\n

October 31, 2011.<\/p>\n

https:\/\/twitter.com\/mcdonald\/status\/131111854327070720<\/a><\/p>\n

The oldest tweet appears to lie beyond an offset of 150:<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&offset=150&mintime=1288612824&maxtime=1320148851<\/a><\/p>\n

Let’s try an offset key of 300:<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&offset=300&mintime=1288612824&maxtime=1320148851<\/a><\/p>\n

No tweets found.<\/p>\n

Let’s try an offset key of 220.<\/p>\n

No tweets found.<\/p>\n

Let’s try an offset key of 200.<\/p>\n

No tweets found.<\/p>\n

180.<\/p>\n

None<\/p>\n

170.<\/p>\n

Three tweets found.<\/p>\n

http:\/\/topsy.com\/s?q=%40DataONEorg&type=tweet&sort=date&offset=170&mintime=1288612824&maxtime=1320148851<\/a><\/p>\n
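
The trial and error above (300, 220, 200, 180, 170) amounts to searching for the last offset that still returns tweets. A rough sketch of doing that as a binary search follows. It assumes that once a page is empty every later page is empty too, and has_results is a hypothetical stand-in for loading a page and checking it by hand; here it is simulated rather than fetched.<\/p>\n

```python
PAGE_SIZE = 10  # offsets move in steps of ten


def last_nonempty_offset(has_results, upper):
    """Largest multiple of PAGE_SIZE in [0, upper] where has_results(offset) is true.

    Assumes empty pages are never followed by non-empty ones.
    Returns None when even offset 0 is empty.
    """
    lo, hi = 0, upper // PAGE_SIZE  # search over page indices
    best = None
    while lo <= hi:
        mid = (lo + hi) // 2
        if has_results(mid * PAGE_SIZE):
            best = mid * PAGE_SIZE  # this page has tweets; look later
            lo = mid + 1
        else:
            hi = mid - 1            # empty page; look earlier
    return best


def simulated_has_results(offset):
    """Simulation only: pages up to offset 170 have tweets, later ones do not."""
    return offset <= 170


print(last_nonempty_offset(simulated_has_results, 300))  # 170
```

With an upper bound of 300 this checks five offsets (150, 230, 190, 170, 180), about the same number of tries as the manual search.<\/p>\n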

The very first tweet (that was not spam) in Topsy mentioning @DataONEorg was from Heather Piwowar.<\/p>\n

https:\/\/twitter.com\/DataONEorg\/status\/47691665087016960<\/a><\/p>\n

So, working backwards: the offset advances by 10 for each page of 10 tweets, and the last page with results, at offset 170, held three tweets. For the 365-day period between November 1, 2010 and November 1, 2011, there were therefore at least 173 mentions.<\/p>\n

I say “at least” because in the case of the “first tweet,” 4 other users “re-tweeted” the same tweet, and that was not reflected in the numbers from Topsy.<\/p>\n

So, this appears to be a method for poring over the history of tweets.<\/p>\n

How to systematically extract that data is another topic, which I will address in a separate post.<\/p>\n

 <\/p>\n","protected":false},"excerpt":{"rendered":"

DataONE has been around since 2010. It’s an NSF project so it’s continually evaluated for performance. One metric could be reach and engagement on social media, as a measure of awareness about DataONE. Since I’ve looked at open science sentiment analysis \u00a0before, I volunteered to poke around a bit on Continue reading DataONE Community Engagement via Twitter<\/span>→<\/span><\/a><\/p>\n","protected":false},"author":35,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[12],"tags":[23,204,56,227,192],"_links":{"self":[{"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/posts\/1953"}],"collection":[{"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/users\/35"}],"replies":[{"embeddable":true,"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/comments?post=1953"}],"version-history":[{"count":3,"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/posts\/1953\/revisions"}],"predecessor-version":[{"id":1956,"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/posts\/1953\/revisions\/1956"}],"wp:attachment":[{"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/media?parent=1953"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/categories?post=1953"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/notebooks.dataone.org\/wp-json\/wp\/v2\/tags?post=1953"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}