This skill set is in high demand, as machine learning algorithms now power the majority of trading on Wall Street and the product recommendations at big companies like Amazon, Spotify, and Netflix.
This course will begin with linear and logistic regression—the most time-tested and reliable tools for approaching a machine learning problem. The course will then progress to algorithms with a very different theoretical basis, such as k-nearest neighbors, decision trees, and random forest. This will bring important statistical concepts to the forefront, such as bias, variance, and overfitting. You’ll also learn how to measure the accuracy of your models, as well as gain tips for choosing effective features and algorithms.
The course will focus on the practical skills needed to solve real-world problems with machine learning. The mathematical foundations for each machine learning algorithm will be explained visually, but there will not be a formal mathematics component. Entering students are expected to be comfortable writing Python programs, as well as using the NumPy and Pandas libraries.
1. Course Kick‑off & Python Refresher
2. Data Visualization & Simple Linear Regression
3. Binary Classification & Logistic Regression
4. k‑Nearest Neighbors & the Iris Dataset
5. Ensemble Methods & Neural Networks
This course does require students to be comfortable with Python and its data science libraries (NumPy and Pandas). If a student has not worked in Python before, we require them to enroll in our Python for Data Science Bootcamp before taking this course.