Welcome to this course, Machine Learning for Accounting with Python. I'm Linden Lu and I'm the instructor of this course. As indicated by the course title, this course is about machine learning. You will learn various machine learning algorithms and how to apply them to business-related dataset with Python. It is very important to always have the big picture in mind so that you're not overwhelmed by the algorithms or the Python coding details. So let's first look at how data analysis process. This is one of our most popular Data Analysis Framework, CRISP-DM or cross industry standard process for data mining. I'm not going to explain this framework in detail, since you should have already seen this in the course Accounting Data Analytics with Python, which is the prerequisite for this course. Accounting Data Analytics with Python covers the second and the third steps in the framework; data understanding and data preparation. This course covers the next two steps; modeling and a model evaluation. Module 1 of this course introduces machine learning in general. You will also learn data preprocessing, which is an important part of modeling. Module 2 and 3 introduce six machine learning algorithms. A machine learning algorithm is a sequence of rules and instructions to solve a class of problems. Once an algorithm is implemented with a programming language, which is Python in this course, it becomes a model. Module 2 and 3 explain the algorithms and also demonstrate how to construct and train the machine learning models with Python code. Module 4 introduces model evaluation metrics. The purpose of model evaluation is to improve the models. So following model evaluation, in module 5, we introduce model optimization, which includes feature selection and model selection. Feature selection helps you select most relative features from a dataset, and model selection determines the best hyperparameters for a machine learning model with the help of cross-validation. Module 6 introduces a special application of machine learning; text analysis, which extracts insights from text documents like SEC filings and customer reviews. All machine learning algorithms introduced in the first six modules are simple by their learning, where the models are trained with data who's true outcome or label is available. A trained model develops a map between the income and the outcome of the data, then predicts unseen data with this map. Module 7 introduces an unsupervised learning clustering, which is designed to extract structures from datasets without true outcome or label. One unsupervised learning example is customer segmentation, which divides customers into different groups based on third and characteristics like gender, age, and purchase pattern. We will introduce two clustering algorithms, K-means in DBSCAN. Module 8 introduces a special kind of data; time series data, which is a dataset indexed by time. Hourly weather data and daily stock quote data are two examples of time series data. When we're introduced basic date and time manipulation with Python, we will also briefly introduce some basic time series analysis techniques. This course runs on the same platform as the one used in a prerequisite. Both lesson materials and assignments are in Jupyter Notebooks. We provide you a Jupyter Notebook server on Coursera, but it's highly recommended that you install the server on your own machine. The easiest way to have your own notebook server, it's through Anaconda. Now, let's get back to the big picture. After you finish this course, you will be able to perform the entire data analysis process on business-related datasets with Python. In this process, I personally think that the most important steps are business understanding and data understanding, because they provide the foundation of a solid analysis. Data preparation is the most tedious and time-consuming step and most coding efforts are dedicated to this step. The two steps covered by this course; modeling and model evaluation, are the fun part. In these two steps, the Python code is relatively simple since it follows some standard steps, but you can extract exciting insights from the dataset. So I hope you'll enjoy this course.