MLDay18: Random Forests and Gradient Boosting Machines in R

Slides for Machine Learning Day ’18. This talk provides an overview of decision trees, random forests, and gradient boosting machines, as well as some of their implementations in the R programming language.

Launch slides

Abstract

Good modeling tools should apply to both classification and regression, achieve state-of-the-art accuracy, scale well to large data sets, and handle missing values effectively. Ideally, they should also automatically discover which variables are important, how they interact, and whether there are any novel cases or outliers. In this presentation, we discuss two such modeling tools: random forests and gradient boosting machines. The talk covers a brief background of both methodologies (including decision trees) and several implementations of each in the R software environment for statistical computing, along with the pros and cons of each implementation.
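As a rough illustration of what fitting these two kinds of models in R can look like, here is a minimal sketch. It assumes the CRAN packages `randomForest` and `gbm` and the built-in `iris` data; the abstract does not name specific packages or data sets, so these choices, along with the tuning values, are illustrative only and may differ from what the slides use.

```r
# Assumption: randomForest and gbm are installed from CRAN; the talk may
# cover other implementations as well.
library(randomForest)
library(gbm)

set.seed(101)  # both methods rely on random subsampling

# Random forest for classification: predict species from the four measurements.
# importance = TRUE also computes permutation-based variable importance.
rf_fit <- randomForest(Species ~ ., data = iris, ntree = 500, importance = TRUE)
var_imp <- importance(rf_fit)  # one row per predictor
print(rf_fit)                  # includes the out-of-bag error estimate

# Gradient boosting machine for regression: predict sepal length.
# The tuning values below are illustrative, not recommendations.
gbm_fit <- gbm(
  Sepal.Length ~ .,
  data = iris,
  distribution = "gaussian",  # squared-error loss
  n.trees = 500,
  interaction.depth = 2,
  shrinkage = 0.05,
  cv.folds = 5
)

# Choose the number of trees by cross-validation, then summarize the
# relative influence of each predictor at that iteration.
best_iter <- gbm.perf(gbm_fit, method = "cv", plot.it = FALSE)
rel_inf <- summary(gbm_fit, n.trees = best_iter, plotit = FALSE)
print(rel_inf)
```

The forest is fit to a classification task and the boosted model to a regression task to reflect the point above that both methods apply to either setting. Both also report a variable importance measure, which corresponds to the goal of discovering which variables matter.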