We're going to analyze hashtag behavior on Twitter.
Our analysis will focus on the #blogchat community, a Sunday evening social media discussion hosted by Mack Collier on Twitter. Every Sunday night, this community discusses various topics of interest to their community. Folks communicate via the #blogchat hashtag, so that everybody can follow what folks are saying about each other.
I collected data from about eighteen weeks of #blogchat events. After analyzing the behavior of the community, I reduced the dataset to five weeks of behavior. I used four weeks as the segmentation period, with one week as a prediction period. Later in the analysis period, we'll review eight weeks of data, using four weeks to predict the next four weeks.
Allow me to explain the variables I am tracking in my dataset.
The first variable is called "statement". Here's what a statement looks like:
- MineThatData: I am really looking forward to #blogchat tonight!
The second variable is called "re-tweet". This is a huge form of social currency. The person issuing the re-tweet is giving another individual credit for saying something clever, or is trying to gain attention in some way.
- MineThatData: RT @mackcollier Bloggers really do a nice job of sharing interesting topics #blogchat.
- MineThatData: And they have unique opinions. RT @mackcollier Bloggers really do a nice job of sharing interesting topics #blogchat.
- MineThatData: @mackcollier Don't you think that bloggers could do a better job of being objective? #blogchat
- MineThatData: We covered this topic on my blog last month: http://bit.ly/dk928d #blogchat
The first set of variables describe the actions a user might partake in.
The next two variables are very important. These two variables account for feedback from other users in the community.
The seventh variable is called "RT". Each time a user is "re-tweeted", I tally one for the user in the "RT" column. In my earlier example, @minethatdata gets a value of "1" in the "re-tweet" variable. @mackcollier gets a value of "1" in the "RT" variable, because his statement is re-tweeted.
- MineThatData: RT @mackcollier Bloggers really do a nice job of sharing interesting topics #blogchat.
- MineThatData: @mackcollier Don't you think that bloggers could do a better job of being objective? #blogchat
Let's assume that a user, called @user, issued the following tweets.
- @user: @person But don't you think that brands should be "all-in" in Social Media? #blogchat
- @user: If big brands don't join the conversation, they're finished. #blogchat
- @user: What do you think are the three most important things a large brand should do first? #blogchat.
- @user: @person This link is very helpful. http://bit.ly/dkeo229kf #blogchat.
- Statement = 2.
- Re-Tweet = 0.
- Amplify = 0.
- Converse = 2.
- Link = 2.
- Tweets = 4.
- RT = 0.
- ANSW = 0.
And when you have data formatted in this manner, you can make magic happen! Tomorrow, we begin to explore the magic behind the #blogchat community.
Hi Kevin,
ReplyDeleteAre you assigning your variables in an automated way or through some manual graft?
Keen to read more, looking forward to part 2
Old school, by hand, with good 'ole fashioned programming code.
ReplyDelete