Introduction
Assuming you already have some background with the more common types of joins (inner, left, right, and outer), adding semi and anti joins can prove incredibly useful, saving you what might otherwise take multiple steps. In this post, I’ll be focusing on just semi-joins; with that said, there is a lot of overlap …
Tag Archives: Dplyr
Tired of Nested ifelse in Dplyr?
Using Mutate to Feature Engineer a New Categorical
Among the most helpful functions from dplyr is mutate; it allows you to create new variables, typically by layering some logic on top of the other variables in your dataset.
Quick Example
Let’s say that you’re analyzing user data and you want to categorize users according to …