Wrangling Unruly Data: The Bane of Every Data Science Team

2020-05-05 Carl Howe
There's an old saying (at least old in data scientist years) that goes, "90% of data science is data wrangling." This rings particularly true for data science leaders, who watch their data scientists spend days painstakingly picking apart ossified corporate datasets or arcane Excel spreadsheets. Does data science really have to be this hard? And why can't they just delegate the job to someone else?

Data Is More Than Just Numbers

The reason that data wrangling is so difficult is that data is more than text and numbers.

Building tidy tools workshop

2019-03-08 Roger Oberg
Join RStudio Chief Data Scientist Hadley Wickham for his popular "Building tidy tools" workshop in Sydney, Australia! If you missed the sold-out course at rstudio::conf 2019, now is your chance. Register here: https://www.rstudio.com/workshops/building-tidy-tools/ You should take this class if you have some experience programming in R and you want to learn how to tackle larger-scale problems. You'll get the most out of it if you're already familiar with the basics of functions (i.

Applied Machine Learning Workshop

2018-05-15 Roger Oberg
Join Max Kuhn of RStudio for his popular Applied Machine Learning Workshop in Washington, D.C.! If you missed his sold-out course at rstudio::conf 2018, now is your chance. Register here: https://www.rstudio.com/workshops/applied-machine-learning/ This two-day course will provide an overview of using R for supervised learning. The session will step through the process of building, visualizing, testing, and comparing models that are focused on prediction. The goal of the course is to provide a thorough workflow in R that can be used with many different regression or classification techniques.