Review of scikit-learn cookbook

scikit-learn Cookbook” by Trent Hauck is a recent cookbook with 50 recipes about the popular Python machine learning package scikit-learn. The book has 5 chapters and 195 pages:

  1. Premodel Workflow – data acquisition, preprocessing and data cleaning.
  2. Working with Linear Models – linear regression, ridge regression and logistic regression.
  3. Building Models with Distance Metrics – Kmeans clustering and Gaussian mixture models.
  4. Classifying Data with scikit-learn – decision trees, SVM and more.
  5. Postmodel Workflow – cross validation, grid search and model evaluation.

Although written in a cookbook format, the book is ordered by workflow steps for easier lookup. “scikit-learn Cookbook” does a good job of describing the most popular algorithms found in scikit-learn. With a few exceptions the cookbook uses mostly generated data and datasets from scikit-learn. On the one hand this makes it easy to reproduce results and we get to practice generating data with scikit-learn. On the other hand it might have been more fun to use real data that is not part of the scikit-learn distribution. The author often uses the Python shell for demonstrations. In the ebook I read, it was a bit hard to distinguish between the code in the shell and the related output (due to identical formatting).

Disclaimer: Fahad S. from Packt Publishing sent me a review copy.

By the author of NumPy Beginner's Guide, NumPy Cookbook and Instant Pygame. If you enjoyed this post, please consider leaving a comment or subscribing to the RSS feed to have future articles delivered to your feed reader.
This entry was posted in Uncategorized. Bookmark the permalink.