Statistical Learning and Data Mining

Author

phonchi

Published

September 5, 2022

Preface

This is the companion book for the course Statistical Learning and Data Mining, offered by the Department of Applied Mathematics, National Sun Yat-sen University. Statistical learning refers to a set of tools for modeling and understanding complex datasets. It is a recently developed area of statistics that blends with parallel developments in computer science, in particular machine learning. We cover how to use machine learning techniques and statistics with the goal of statistical inference: drawing conclusions from the data at hand. The book covers many methods, such as regression, classification, regression trees, boosting, support vector machines, clustering, and dimension reduction.

The book is based on several well-known books and resources, including:

If you would like to review the basics of Python, you may refer to