Here’s a post which combines my favourite bits of writing a blog – fairly mathematical, not too simple or difficult to implement, mostly based around pictures, not covered in my undergraduate education, and pretty damn useful in my job. Excited? You should be.
The last time I looked at house prices it went pretty well, and I ended up winning a data science competition. There I was only dealing with a million or so records, and a relatively small 120 MB dataset. Then I found out it was possible to download 3.7GB of property sale records for all of England and Wales since 1995, so let’s have another go. Continue reading
I’ve been a little busy recently for blog posts, what with a pesky thesis to finish and all, but such an occasion as this deserves the highest honour possible – making lots of plots. That’s right, the hallowed HTML files under this domain name have been yanked from a poor WordPress server somewhere over 1 million times now, almost coinciding with the first anniversary of the blog. I think it’s a useful event to prompt some details for those of you interested in the business of blogging – who knows, perhaps you’ll start one of your own?
Over the last few years I’ve been working towards a PhD in the realm of experimental physics. This, naturally, requires experiments to occur. As good scientists we (ideally) record all the details of the experiments, which makes for a tasty dataset after a few years. Let’s have a look at some details.
What does this title mean? What’s with the recent bus obsession? Is this post even about buses, or are they just being carelessly shoehorned into every post title from here on out? Excellent questions, thanks for asking. Allow me to explain.
As the old saying goes, you wait ages for a bus and then two come along at once (or more!). Is this true though? My own anecdotal evidence would suggest yes, every single bloody time. However, we love data and maths in this blog almost as much as we hate waiting for the bus, so let’s have a more thorough look at the issue.
A while back I heard about the Summer Data Challenge, hosted by my university. A number of datasets are available to download and analyse, and I chose to look at data on the selling price of properties in London over the past 5 years. I’ve assembled a number of plots here in order to examine this slightly cumbersome dataset more easily.