Description
This course is an introduction to Machine Learning and Optimization with Python. The course starts defining mathematical modelling and how it can be used in real business cases. Examples of supervised and unsupervised learning are provided with a presentation of the main methodologies to train performant models. Then the course completes the picture with a deep dive into time series analysis and optimization problems. The course is designed to be interactive and practical. The student will learn by doing, by solving real business problems.
Topics include
- The present and future of Data Science
- Technologies and integration with Python
- Concepts of AI, machine learning, and statistical learning and the role of Python
- Supervised learning: Linear models, Bayesian classifiers, Random forests, Xgboost
- Unsupervised learning: K-means, hierarchical clustering, dimensionality reduction algorithms
- Introduction to Scikit-learn
- Model evaluation and improvement: cross-validation, k-folds, grid search, key metrics
- Chaining algorithms and creating pipelines
- Introduction to the Statsmodels library
- Integration of Python with enterprise frameworks and tools
- Time series analysis
- Time series modeling: ARMA, ARIMA, SARIMA
- Time series forecasting
- Time series classification
- Time series clustering
- Fourier Transform and signal analysis
- Illustrative examples of other approaches to using time series data
- Optimization algorithms and libraries
- Local Vs Global optimization
- Linear programming
- Quadratic programming
- Nonlinear optimization
What you will be able to do
- Understand the main concepts of Data Science
- Use the main libraries for Data Science in Python
- Understand the main algorithms for supervised and unsupervised learning
- Evaluate and improve models
- Create pipelines for chaining algorithms
- Use the Statsmodels library
- Integrate Python with enterprise frameworks and tools
- Analyze time series data
- Model time series data
- Solve optimization problems with or without constraints with the most efficient methods
Duration
3 days
Prerequisites
Ability to use Python for data analysis. A mathematical background is recommended.
Audience
This is an intermediate course for professionals and researchers who need to solve business problems with data.