Odds Are You're Using Probabilities to Describe Event Outcomes 2023-11-03 We grow up learning proportions, percentages, risks, probabilities. You encounter them when a teacher gives a grade on a test or a doctor describes the risk of an illness. On the other hand, we rarely … dplyr ggplot2 DiagrammeR
Converting Between Currencies Using priceR 2022-06-16 In this post I’ll walk through an example of how to convert between currencies. A challenge is that the conversion rate is constantly changing. If you have historical data you’ll want the conversion … priceR dplyr tidyr purrr lubridate devtools
Pulling Twitter Engagements Using the v2 API as Well as rtweet 2022-04-11 This is a follow-up to a short post I wrote on R Access to Twitter’s v2 API. In this post I’ll walk through a few more examples of pulling data from twitter using a mix of Twitter’s v2 API as well as … rjson httr jsonlite dplyr purrr lubridate rtweet tidyr glue rstudioapi fs readr tidyverse
Network Visualizations of Code Collections (funspotr part 3) 2022-03-17 In previous posts and threads I’ve alluded to the potential utility of visualizing the relationships between parsed functions/packages and files as a network plot. […] It can be helpful to … dplyr funspotr readr
Identifying R Functions & Packages in Github Gists (funspotr part 2) 2022-02-07 This post is part two in a series of posts introducing funspotr. See also: […] This post shows how funspotr can also be applied to parse gists: […] By functions or packages … dplyr purrr stringr funspotr readr DT fs rstudioapi
Identifying R Functions & Packages Used in GitHub Repos (funspotr part 1) 2022-01-18 TLDR: funspotr provides helpers for spotting the functions and packages in R and Rmarkdown files and associated github repositories. See Examples for catalogues of the functions/packages used in posts … dplyr funspotr yaml purrr fs readr here
Macros in the Shell: Integrating That Spreadsheet From Finance Into a Data Pipeline 2021-05-10 There is many a data science meme degrading excel: (Google Sheets seems to have escaped most of the memes here.) While I no longer use it regularly for the purposes of analysis, I will always have a … dplyr digest mvtnorm purrr readr glue readxl here
Quantile Regression Forests for Prediction Intervals 2021-04-21 In this post I will build prediction intervals using quantile regression, more specifically, quantile regression forests. This is my third post on prediction intervals. Prior posts: […] This … workflows ggplot yardstick gt forcats scales pander
Simulating Prediction Intervals 2021-04-05 Part 1 of my series of posts on building prediction intervals used data held-out from model training to evaluate the characteristics of prediction intervals. In this post I will use hold-out data to … workflows devtools gt ggplot forcats scales pander
Understanding Prediction Intervals 2021-03-18 Prediction intervals provide a measure of uncertainty for predictions on individual observations. This post… […] This is the first of three posts on prediction intervals (Part 2 employs … AmesHousing dplyr rsample recipes gt parsnip workflows ggplot yardstick stringr tidyr forcats scales pander
Weighting Confusion Matrices by Outcomes and Observations 2020-12-08 Weighting in predictive modeling may take multiple forms and occur at different steps in the model building process. […] The focus of this post is on the last stage1. I will describe two types … ggplot dplyr rsample parsnip probably yardstick devtools purrr knitr tidyr
Influencing Distributions with Tiered Incentives 2020-11-02 In this post I will use incentives for sales representatives in pricing to provide examples of factors to consider when attempting to influence an existing distribution. For instance, if you have a … ggplot dplyr purrr forcats
Feature Engineering with Sliding Windows and Lagged Inputs 2020-10-12 The new rsample::sliding_*() functions bring the windowing approaches used in slider to the sampling procedures used in the tidymodels framework1. These functions make evaluation of models with … httr jsonlite dplyr lubridate rsample slider devtools recipes parsnip workflows tune purrr tidyr forcats ggplot broom