Accidentally in Code

PyCon AU: Exploring Science on Twitter with IPython Notebook and Python Pandas

Brenda gave a great talk at PyCon AU about using IPython and Pandas for her research. Slightly rough notes below.

She has a dataset of 12 million tweets containing the word “science”: about a year’s worth of data, after filtering out non-English tweets and spam.

She uses UTC to cut down on timezone problems, although some remain. Most of the date-related problems come from tools that expect the month to come first.
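To illustrate the month-first problem, here is a minimal sketch in pandas. The sample strings, their format, and the variable names are my own assumptions rather than anything shown in the talk. It parses the same text both ways and keeps the result in UTC:

```python
import pandas as pd

# Hypothetical sample: day-first date strings, not data from the talk.
raw = pd.Series(["03/04/2014 23:30:00", "04/04/2014 00:15:00"])

# Reading the strings as month/day gives March and April dates...
month_first = pd.to_datetime(raw, format="%m/%d/%Y %H:%M:%S", utc=True)

# ...while reading them as day/month gives two dates in April.
day_first = pd.to_datetime(raw, format="%d/%m/%Y %H:%M:%S", utc=True)

print(month_first.dt.month.tolist())
print(day_first.dt.month.tolist())
```

Passing an explicit `format` means the parser never has to guess the field order, and `utc=True` makes the resulting timestamps timezone-aware in UTC.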

She found more tweets about science mid-week than at weekends, which matches wider patterns of Twitter use found in other research.
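A day-of-week count like this can be sketched in pandas as follows. The handful of timestamps and the column names `created_at` and `weekday` are made up for illustration, and this is not necessarily how Brenda did it:

```python
import pandas as pd

# Hypothetical timestamps standing in for tweet creation times (UTC).
tweets = pd.DataFrame({
    "created_at": pd.to_datetime([
        "2014-08-04 09:00:00",  # Monday
        "2014-08-05 12:30:00",  # Tuesday
        "2014-08-06 08:15:00",  # Wednesday
        "2014-08-06 17:45:00",  # Wednesday
        "2014-08-09 10:00:00",  # Saturday
    ], utc=True)
})

# Label each tweet with its weekday name, then count tweets per day.
tweets["weekday"] = tweets["created_at"].dt.day_name()
per_day = tweets.groupby("weekday").size()

print(per_day)
```

On a real dataset the same `groupby` would run over millions of rows, and the counts could be plotted directly in the notebook.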

IPython features:

IP[y] : Notebook

Pandas

Overall

I’m pretty fascinated by the results of this research, which we didn’t see much of as the talk was about the technical setup. I feel like this would have been incredibly handy when I was doing my own research, though, and it was good to chat to Brenda at our women’s breakfast and compare notes on other tools like Processing, Prefuse, etc.
