The excitement of Formula 1 racing lies in the intense competition between drivers and teams as they navigate the twists and turns of each race to claim the coveted championship title. In recent years, data-driven predictions have become increasingly popular in sports, and F1 is no exception. Inspired by FiveThirtyEight’s comprehensive soccer predictions, we set […]
Kaggle Playground Series – Tidymodels
Hello readers, we are entering another Kaggle playground competition, so get your Yorkshire tea ready and enjoy the process of joining. This month the competition I entered is this one: https://www.kaggle.com/competitions/playground-series-s3e7 It looks like the data is cancellations from hotels and spoiler alert – I had a lot of fun with this dataset. EDA First, I […]
Cricket Weighted Batting Average in R
Hello, I hope you have your Yorkshire tea ready as today I am going to be exploring weighted averages using R. I used the code above to generate the table of the top 15 players by batting average in the 2022 county championship. Now the whole point of this blog is to devise a weighted […]
Kaggle January Playground Series – Tidymodels
Hello, hope you have your Yorkshire tea ready. This is going to be a new series on the blog in which each month I am going to be tackling Kaggle’s monthly playground series. Find the link to January’s below, feel free: https://www.kaggle.com/competitions/playground-series-s3e1 So let’s get started. EDA Above is the structure of the training dataset. […]
Predicting Twenty 20 Cricket Result with Tidy Models
Hello, hope you have your Yorkshire tea to hand and are sitting comfortably, ready to read today’s blog. In it I am going to be doing some machine learning with tidymodels to predict the outcome of some Twenty20 cricket matches. I am using the data from Cricsheet as used in this blog and using the win […]
F1 Drivers Rated – Version 2
Hello, so a year and a half ago I created a new metric for measuring F1 drivers’ performance, based around their performance in the race and their expected performance in the race (see blog here: F1 Drivers Rated). Since then my laptop BSOD’ed and, me being useless, I never committed the code to […]
F1 2020 – Season So Far and Why Racing Point’s Method of Designing the Car is Controversial
Hello readers, today I’m going to do a little exploration of the data from the F1 2020 season so far, looking at a number of questions about the season. First of all looking at qualifying and why a lot of teams are annoyed by (t)Racing Point and the strategy they have used to […]
Pole Position Prediction – A tidymodels Example
Hello readers, in today’s blog I will be looking at predicting the Formula 1 grid using the tidymodels collection of R packages. The idea is to use data from the practice sessions on a Friday to give an idea of what the grid is expected to be for the race on Sunday, before qualifying on Saturday. […]
Twenty20 Win Probability Added
Hello readers, welcome to today’s blog. I am going to implement a win probability added model for Twenty20 cricket. Now this is nothing new: a quick Google search turns up many sources for it. CricViz is possibly the most famous version, which you may have seen on the app. The idea is the model […]
Cricket Moneyball, pt 2
Hello readers, today we have part 2 of this cricket moneyball series. If you missed the first one, it’s here: https://wordpress.com/view/theparttimeanalyst.com In it I looked at calculating the Pythagorean win percentage for each team in the IPL and then moving that forward to calculating how many extra runs are needed to win one extra […]