Module 3: Transformation
Data Preparation and Transformation
Objective:
- Teach participants data cleaning and preprocessing techniques in Tableau
- Guide participants in joining and blending data from multiple sources
- Provide hands-on experience in working with calculated fields and parameters
- Help participants handle missing data and outliers effectively
3.1 Data Cleaning and Preprocessing
- Understanding the importance of data cleaning and data quality
- Identifying and handling missing values, duplicates, and outliers
- Techniques for data filtering, sorting, and aggregation
Exercise 3.1: Cleaning and Preprocessing Data
- Participants are given a dataset with various data quality issues
- Participants use Tableau to clean and preprocess the data by applying appropriate techniques.
3.2 Joining and Blending Data
- Different types of data joins: inner, left, right, and full outer joins
- Joining tables based on common keys or relationships
- Blending data from multiple sources using data blending techniques
Exercise 3.2: Joining and Blending Data
- Participants are provided with two or more datasets to join or blend.
- Participants use Tableau to perform the necessary joins or blending to combine the datasets.
3.3 Calculated Fields and Parameters
- Creating calculated fields using basic mathematical operations and functions
- Using logical expressions and conditional statements in calculated fields
- Introduction to Tableau parameters and their role in dynamic analysis
Exercise 3.3: Creating Calculated Fields and Parameters
- Participants are given specific analysis requirements.
- Participants use Tableau to create calculated fields and parameters to meet the analysis needs.
3.4 Handling Missing Data and Outliers
- Techniques for handling missing data: imputation, filtering, or excluding
- Identifying and addressing outliers in data visualizations
- Utilizing Tableau's features for missing data and outlier management
Exercise 3.4: Handling Missing Data and Outliers
- Participants work with a dataset that contains missing values and outliers.
- Participants use Tableau to implement appropriate strategies for handling missing data and outliers.
3.5 Data Aggregation and Transformation
- Aggregating data using dimensions and measures
- Transforming data using pivot, split, and custom transformations
- Grouping data and creating hierarchies for better analysis
Exercise 3.5: Data Aggregation and Transformation
- Participants manipulate the provided dataset using Tableau's data aggregation and transformation features.
- Participants aggregate data, perform transformations, and create hierarchies as instructed.