Professional Project Management with Python

Professional Project Management
Professional Project Management with Python

Description

This course is designed to introduce you to the world of professional project management with Python. Real life projects are complex and require a structured approach to be successful. This course will teach you how to organize your project, develop it in a team, version code and data at the same time, and keep it tidy and reproducible to go in production smoothly and produce value for the business. The course is designed to oil and optimize your current Data Science practices and team.

Topics include

  1. Introduction to a professional project management
  2. Professional project organization for code structuring, commenting, and documentation.
  3. Git for version control, collaboration, and effective code management.
  4. Git-Flow methodology for a structured development process.
  5. Web API to retrieve data from the internet and to exploit public datasets.
  6. Scientific and reproducible Reports (Using LaTeX formulas)
  7. Interactive Reports with Jupyter notebook
  8. Scientific presentations with Jupyter notebook
  9. Use Docker to create applications and to deploy them
  10. Introduction to FastAPI for creating APIs
  11. Model Deployment
  12. Automate the build and deployment process with Gitlab CI/CD
  13. Git basics and Git flow
  14. issues & merge requests - Gitlab
  15. Package registry - Gitlab
  16. Container registry - Gitlab
  17. Sprint planning - Gitlab
  18. DVC: Data Version Control to track data changes and to reproduce experiments
  19. Reproducible model training
  20. Store big and common datasets on GDrive or S3 storages with DVC
  21. DVC pipelines, metrics, experiments to automate the data process
  22. Reproducible data process with DVC: version code and data at the same time (for data analysis, ML model training, …).
  23. Poetry scripts and new CLI commands
  24. studio.iterative.ai
  25. Confluence and Collaborative documentations
  26. Git/Jira Boards

What you will be able to do

  1. Organize your project in a professional way
  2. Cooperate with your team in an efficient way
  3. Version code and data at the same time
  4. Create the right presentation or report for every situation
  5. Use Gitlab for issues, merge requests, package registry, container registry, and sprint planning
  6. Use DVC to track data changes and to reproduce experiments
  7. Create and automate the build of reports
  8. Store big and common datasets on GDrive or S3 storages with DVC
  9. Automate the data process with DVC pipelines, metrics, and experiments

Duration

3 days

Prerequisites

Ability to use Python for data analysis.

Audience

This is an advanced course for professionals and team leaders who need to find the best way to manage data science projects. It is also suggested for those who has experience with IT and programming and want to improve their skills in data science.