This is a rather niche post, but the method can likely be adapted for other use cases. In the lab we have many different cell lines stored in liquid nitrogen. The arrangement is: Ideally, retrieving the correct vial from the cell store requires a map. We have a FileMaker Pro database of all of […]
Tag: ggplot
Running Around: 2023 running dataviz in R
2023 has been a great year in running for me. Previous running round-ups are here (2022, 2021). My two main goals for 2023 were to run 3000 km and also to run 50 HM-or-more distance runs. I managed both with a couple of weeks left. I also bagged new PBs for 5K, 10K and half […]
Step By Step: recreating a volcano plot in R
We have an analysis routine for proteomics data written for IgorPro. One output is a volcano plot. These plots show the fold change in one sample compared to another and plot that against a p-value to estimate how reproducible any changes observed are. This post is not about that software, but on the topic of […]
Pledging My Time VI: scraping and analysis of race results in R
I’ve posted in the past about analysing race results in R (most recently here). I ran the 2023 MK Marathon and wanted to have a look at the finishing times. The days of race results being made available as a csv or xls for easy analysis seem to be behind us. Instead they tend to […]
Running Around: 2022 running dataviz in R
2022 was my best year for running to date. In 2021, my goal was to run 2021 km. For 2022, I wanted to see if I could run 2500 km and also to run 50 HM-or-more distance runs. I managed both and ended the year on a total of 2734 km. I also bagged two […]
Twitcher II: tweet frequency and top tweets
Another post looking at Twitter data in R. It follows this one and this one. I wanted to look again at my tweeting frequency over the 12 years on Twitter, but this time do it in a calendar view. Something like a GitHub commit calendar would be perfect. I have used a library for this […]
Any Time At All: tweet frequency around the clock
Please consider this a “supplementary analysis” to my previous post looking at the frequency of tweets from my personal account over the last 12 years. I was curious about what times I was active on Twitter (measured by when I tweeted). Others might be interested in a solution to look at this in R. The […]
Twitcher: tweet frequency over the years
At the time of writing, I have essentially left Twitter. It was a fun ride and without going into what’s happening there now, this is a good opportunity to look at my 12 years on the platform. Early in November, I downloaded my data and locked my Twitter account. This gave me all the data […]
Get Miles: using treemap to visualise running distances
By 30th September 2022, I had clocked up over 2000 km of running for the year. This milestone was a good opportunity to look at how I got to this point. The code is shown below. First, we can make a histogram to look at the distance of runs. From this type of […]
Pledging My Time V: analysing race results in R
It’s been a while since I posted a breakdown of half marathon times. The last time seems to have been 2018. I decided to give my old code a clean-up and quickly crunched the numbers from the 2022 Kenilworth Half Marathon. First, the results: Briefly, the code below reads in a csv file of race […]