Past two years, at the beginning of the year, I did a post where I have tried to sum up year results and find some interesting facts/stats from tweets:
This year was really awesome, a lot of things happened:
- Real cooperation between Visual F# Team and F# open source community
- Legal entity for F# Software Foundation
- New F# logo and branded new site.
- Awesome F# Advent Calendar at the end of the year😉
- and many more…
There are so many things occurred, so I need your help to find everything that happened or changed this year.
Especially for this, I prepared a data set with tweets starting from Jan 1, 2013 that is available here, and ask you to help me analyze it.
How to start:
- Download the data set & unzip an archive.
- Download latest version of Fsharp.Data.
- Copy-paste following code snippet:
#r @"..\packages\FSharp.Data.2.1.1\lib\net40\FSharp.Data.dll" open FSharp.Data type Tweets = CsvProvider<"fsharp_2013-2014.csv"> let tweets = Tweets.GetSample() //TODO: Your awesome analytics here
My sample analysis:
As an example of what you can do with data, I prepared a calculation of people activity that shows who had more tweets this year.
tweets.Rows |> Seq.filter (fun x -> x.CreatedDate.Year = 2014) |> Seq.groupBy (fun x -> x.FromUserScreenName) |> Seq.map (fun (group, items) -> (group, Seq.length items)) |> Seq.sortBy (fun (_, cnt) -> -cnt) |> Seq.take 100 |> Seq.iter (fun (group, cnt) -> printfn "%s:%d" group cnt)
When you execute these rows you will get a statistical output in FSI with user names and number of tweets. You can copy this output to wordle.net and play with settings to visualize it in the nice way:
Please help me to observe data set and share your results with me on Twitter (@sergey-tihon). I will include your plots/charts in the end of this post. Thank you!