Undersampling Will Change the Base Rates of Your Model's Predictions 2020-11-23 TLDR: In classification problems, under and over sampling1 techniques shift the distribution of predicted probabilities towards the minority class. If your problem requires accurate probabilities you … ggplot dplyr purrr tidyr knitr modelr yardstick
Feature Engineering with Sliding Windows and Lagged Inputs 2020-10-12 The new rsample::sliding_*() functions bring the windowing approaches used in slider to the sampling procedures used in the tidymodels framework1. These functions make evaluation of models with … httr jsonlite dplyr lubridate rsample slider devtools recipes parsnip workflows tune purrr tidyr forcats ggplot broom
Short Examples of Best Practices When Writing Functions That Call dplyr Verbs 2020-06-25 dplyr, the foundational tidyverse package, makes a trade-off between being easy to code in interactively at the expense of being more difficult to create functions with. The source of the trade-off is … dplyr
Use Flipbooks to Explain Your Code and Thought Process 2020-06-24 Using the pipe operator (%>%) is one of my favorite things about coding in R and the tidyverse. However when it was first shown to me, I couldn’t understand what the #rstats nut describing it was … dplyr tidyr purrr ggplot ggbeeswarm animatrixr emo rlang fs pagedown magick here pdftools officer flair flipbookr
Tidy Pairwise Operations 2020-06-03 In May of 2021 I co-wrote pwiser a package for doing pairwise operations in {dplyr} that provides a much smoother approach than the one I build-up to in this post. […] Say you want to map an … AmesHousing dplyr corrr tidyr stringr purrr forcats ggplot devtools weights
animatrixr & Visualizing Matrix Transformations pt. 2 2020-02-24 This post is a continuation on my post from last week on Visualizing Matrix Transformations with gganimate. Both posts are largely inspired by Grant Sanderson’s beautiful video series The Essence of … devtools dplyr animatrixr