Course Info
Python is a popular programming language known for its ease of use and its broad support for a variety of tasks. This advanced course equips data analysts to handle data on a larger scale and to produce interactive dashboards that communicate results effectively to business stakeholders.
What Will I Learn From This Course?
Utilize NumPy and Dask to manipulate large data sets.
Generate visually compelling interactive dashboards using Plotly.
Prerequisites
Python for Data Science (Beginner)
Technical Requirements
The class will be conducted using Jupyter interactive notebooks. Participants are required to install Anaconda Python v3.8 before attending this course. We recommend a machine running macOS, Unix, or Windows with at least 4GB RAM.
Course Outline for This Programme
Numerical Python (NumPy)
– NumPy arrays
– Broadcasting rules
– Working with image data
We begin by taking a deeper look at Numerical Python (NumPy), a Python package that is frequently used to handle data in production systems and forms the basis for Pandas data frames. Participants will learn the basics of NumPy’s behavior, including its broadcasting rules and how to use them to their advantage. We also show how, in specific cases, working with NumPy arrays instead of regular Pandas data frames can speed up calculations.
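As a rough illustration of the broadcasting rules mentioned above, the sketch below multiplies a table of unit sales by a column of prices without writing a loop. The data and the names `units`, `prices`, and `quarter_weights` are invented for this example and are not part of the course material.

```python
import numpy as np

# Hypothetical data: 3 products (rows) x 4 quarters (columns) of unit sales.
units = np.array([
    [10, 12, 9, 14],
    [5, 7, 6, 8],
    [20, 18, 22, 25],
])

# One price per product, shaped (3, 1) so that it lines up with the rows.
prices = np.array([[2.0], [10.0], [0.5]])

# Broadcasting stretches the (3, 1) prices across the 4 columns,
# giving a (3, 4) revenue table without an explicit Python loop.
revenue = units * prices

# A 1-D array of length 4 is matched against the last axis (the quarters).
quarter_weights = np.array([0.25, 0.25, 0.25, 0.25])
weighted_revenue = revenue * quarter_weights

print(revenue.shape)
print(weighted_revenue.sum(axis=1))
```

The key rule on display is that shapes are compared from the last axis backwards, and an axis of length 1 (or a missing axis) is stretched to match the other operand.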
Working with large data sets
– In-memory vs on-disk
– Dask data frames
In this section we introduce Dask, a Python package that uses lazy evaluation to handle data frames that are too large to fit in memory. Dask provides a Pandas-like interface for manipulating large data sets without the need for distributed computing platforms such as Spark.
Dashboarding
– Plotly
In this section, participants learn to build interactive dashboards from their data with the aid of a Python package called Plotly.
Practice Project
– Live coding exercises on a large dataset
In the final section of this course, participants will get their hands dirty by working on a large dataset to simulate a real-life data workflow and put their Python knowledge to the test.
Background – Microsoft .NET Framework
James has more than 10 years of software development experience working in a software house. He has developed multiple solutions on different platforms, including Web, Desktop, and Server, and has been building solutions with the Microsoft .NET Framework since 2002. He has also developed and implemented Document Management Solutions with document routing, Construction Site Safety Processing Solutions for Government Agencies, and Human Resource Management Assessment Solutions for Balanced Scorecard and Recruitment Assessment.
Background – Microsoft SQL Server
James has substantial experience using Microsoft SQL Server version 6.5 for RDBMS solution development and deployment. He has also developed multiple Business Intelligence solutions using Microsoft SQL Server BI components since 2002. Microsoft SQL Server is his first choice for any solution development, such as the Digital Bill Conversion for Telco Corporate Users Solution and the Human Resource Management Assessment Solution on BI for Performance Analysis.
Background – Microsoft Azure
James has trained and consulted on Azure solutions for 5 years and follows Azure technology closely. Since 2019, he has been one of the trainers appointed to deliver the Azure training content for the Microsoft Azure Training project in APEC. He has helped deliver the content to Microsoft clients such as AXA (Hong Kong), P&G (Philippines), and SAP SA (China), and has provided numerous training sessions to T System (Malaysia). He also trains and consults on Azure Solution Architect, Azure Data Science Solution, and Azure Service Development.