Batch Geocoding with R and Google maps
I’ve recently wanted to geocode a large number of addresses (think circa 60k) in Ireland as part of a visualisation of the Irish property market. Geocoding can be simply achieved in R using the...
View ArticleData Science Videos from Dublin WebSummit 2013
The Web Summit, Europes largest technology-industry conference was held in Dublin this week. An annual event since 2010, the Web Summit attracted over ten thousand visitors from over 90 countries. The...
View ArticleCSV Data Extraction Tool for ROS bag files for Python, R, Matlab
So you’ve been using ROS to record data from a robot that you use? And you have the data in a rosbag file? And you’ve spent a while googling to find out how to extract images, data, imu readings, gps...
View ArticleOnline Learning Curriculum for Data Scientists
“Is there any online reading or courses I can do to get into data analysis?” At my workplace, I get asked the question above. The question is usually posed by people typically with a finance...
View ArticleSelf-Organising Maps for Customer Segmentation using R
Self-Organising Maps (SOMs) are an unsupervised data visualisation technique that can be used to visualise high-dimensional data sets in lower (typically 2) dimensional representations. In this post,...
View ArticleScraping Dublin City Bikes Data Using Python
FAST TRACK: There is some python code that allows you to scrape bike availability from bike schemes at the bottom of this post… SLOW TRACK: As a recent aside, I was interested in collecting Dublin...
View ArticleAsynchronous updates to a webpage with Flask and Socket.io
This post is about creating Python Flask web pages that can be asynchronously updated by your Python Flask application at any point without any user interaction. We’ll be using Python Flask, and the...
View ArticleUsing Python Threading and Returning Multiple Results (Tutorial)
I recently had an issue with a long running web process that I needed to substantially speed up due to timeouts. The delay arose because the system needed to fetch data from a number of URLs. The total...
View ArticleSummarising, Aggregating, and Grouping data in Python Pandas
I’ve recently started using Python’s excellent Pandas library as a data analysis tool, and, while finding the transition from R’s excellent data.table library frustrating at times, I’m finding my way...
View ArticleFixing Office 2016 installation for Mac – error code 0xD0000006
This is a very quick post to help some people out on installation problems with Office for Mac 2016. On excited installation of Excel 2016 on my Macbook, the following error threatened to ruin the day:...
View ArticleAmazon Elastic Beanstalk – Logging to Logentries from Python Application
[Short version] The S3 ingestion script for Amazon applications provided by Logentries will not work for the gzip compressed log files produced by the Elastic Beanstalk log rotation system. A slightly...
View ArticleAnalysis of Weather data using Pandas, Python, and Seaborn
The most recent post on this site was an analysis of how often people cycling to work actually get rained on in different cities around the world. You can check it out here. The analysis was completed...
View ArticleHow wet is a cycling commute in Ireland? Pretty wet… if you live in Galway.
How often do you get wet cycling to work? Cycling in Ireland is taking off. The DublinBikes scheme is a massive success with over 10 million journeys, there’s large increases in people cycling in Irish...
View ArticleThe ggthemr package – Theme and colour your ggplot figures
Theming ggplot figure output The default colour themes in ggplot2 are beautiful. Your figures look great, the colours match, and you have the characteristic “R” look and feel. The author of ggplot2,...
View ArticleFixing Office 2016 installation for Mac – error code 0xD0000006
This is a very quick post to help some people out on installation problems with Office for Mac 2016. On excited installation of Excel 2016 on my Macbook, the following error threatened to ruin the day:...
View ArticleAmazon Elastic Beanstalk – Logging to Logentries from Python Application
[Short version] The S3 ingestion script for Amazon applications provided by Logentries will not work for the gzip compressed log files produced by the Elastic Beanstalk log rotation system. A slightly...
View ArticleAnalysis of Weather data using Pandas, Python, and Seaborn
The most recent post on this site was an analysis of how often people cycling to work actually get rained on in different cities around the world. You can check it out here. The analysis was completed...
View ArticleHow wet is a cycling commute in Ireland? Pretty dry!… if you don’t live in...
How often do you get wet cycling to work? Cycling in Ireland is taking off. The DublinBikes scheme is a massive success with over 10 million journeys, there’s large increases in people cycling in Irish...
View ArticleThe ggthemr package – Theme and colour your ggplot figures
Theming ggplot figure output The default colour themes in ggplot2 are beautiful. Your figures look great, the colours match, and you have the characteristic “R” look and feel. The author of ggplot2,...
View ArticleSelecting DataFrame rows and columns using iloc, loc, and ix in Pandas
There are multiple ways to select and index rows and columns from Pandas DataFrames. I find tutorials online focusing on advanced selections of row and column choices a little complex for my...
View Article