SlideShare una empresa de Scribd logo
1 de 31
Tableau Prep
Tableau prep
• Tableau Prep Builder is a tool in the Tableau product suite designed to
make preparing your data easy and intuitive. Use Tableau Prep Builder
to combine, shape, and clean your data for analysis in Tableau.
• Start by connecting to your data from a variety of files, servers, or
Tableau extracts. Connect to and combine data from multiple data
sources. Drag and drop or double-click to bring your tables into the
flow pane, and then add flow steps where you can then use familiar
operations such as filter, split, rename, pivot, join, union and more to
clean and shape your data.
• Each step in the process is represented visually in a flow chart that you
create and control. Tableau Prep tracks each operation so that you can
check your work and make changes at any point in the flow.
• When you are finished with your flow, run it to apply the operations to
the entire data set.
Using tableau prep
• Go to the > Connections + -> Text File (Any type of file).
• Choose the file Orders_South 2015 (for the given example) -> open
• It will be loaded into the tableau as shown in the next slide.
• The first option is connections that shows the file that we have
loaded into the tableau prep.
• Below that, there is a table option that shows the tables available in
the file that we have uploaded.
• Then in the top right (white area with a flow chart) is known as flow
area that shows how the data flow is going on.
• Below flow area different options are there such as Input, connection,
text options, field separator, etc.
• Table like structure in the bottom right is the information about the
dataset that we have loaded such as column names, datatype, sample
data, etc.
• Check boxes are given in that table where we can uncheck any box for
the column that we don’t want for our analysis.
• Type specifies the type of data stored in the table.
• Field name is the name of the column that we can change according
to our requirement. Eg: Sales to sale for our data
• Original Field name is the name of the column in the actual dataset
that will remain same.
• Changes represent if any changes made into the data.
• Preview is some sample data given.
• There is one another option i.e., Filter values with the help of which
we can apply filter to the data.
• When we will click on the filter values option, we will get one window
(shown in the next slide) where we need to mention the logic or
condition based on which we want to filter out our data.
• Eg:- We only want to display the data of First-Class Ship Mode
• Add a formula [Ship Mode]='first class’ into the window and click on
apply.
• We can also remove/edit the filter applied to our data by right clicking
• Under input, there are certain options like settings, multiple files,
data sample, changes.
• Multiple files is the option with the help of which we can join multiple
files.
• Data sample is the option that help us to choose how many rows we
want to use.
• Changes is the tab that shows the changes that we made to our data.
• We can even add new files to the existing area by dragging from the
folder to the flow area.
• In the orders_south table, there are the two columns order date and ship
date. But in orders_central there are 6 columns that are representing order
day, month, year and ship day, month and year. So, this is the problem.
• In the orders_south, region column is there. But in orders_central, region
column is not there. This is also the problem.
• In the orders_central table, order date, ship date and regions, all these
columns are available. But when scrolling down through the columns, there
are duplicate columns with a prefix Right. To remove these duplicate
columns, just check the check box in front of them. And the value given for
the state is AZ, not the full name just like in other tables.
• In the orders_East, there is a prefix (USD) with the values of Sales
column. But in other tables no such prefix is there. So, we need to
remove all these problems from our dataset as a part of cleaning.
Cleaning using
Tableau Prep
• Go to the orders_central
table in the flow area, click on
the + and choose clean step.
• We will get the output as
shown in the picture that is
showing the data
distribution. Eg: Most of the
orders are from standard
class and many more.
• In the orders_central table,
no region column is there. So,
this is the first step of our
cleaning process.
• There are 3 dots (…) available
after rename fields option.
Click on it to add a new field.
• Then give Region as a field
name and write “Central” in
the formula section and click
on Apply and Save.
• A new column has been
added to our data now with a
name Region and value
Central as shown in the
picture on the next slide.
Removal of 1st problem in Orders_Central
• Second issue with the orders_central is missing order date and ship
date column. For this again we will add a new calculated field.
• Enter the details as Order Date as a field name and formula as
MAKEDATE([Order Year],[Order Month],[Order Day]) and click on
apply and save.
• New field with a name Order Date has been added into the data as
shown in the picture on the next slide.
• Similarly add a new field ship date with a formula MAKEDATE([Ship
Year],[Ship Month],[Ship Day]).
Removal of 2nd problem in Orders_Central
• As we have now ship date
and order date, so we do not
need separate fields such as
order month, year etc. To
remove them, just go that
field and click on 3 dots and
then remove.
• Apply the same procedure to
remove ship date, month and
year from the dataset.
• All the changes that we are
making is not in the original
dataset.
• For ship date and order date,
we are getting summary
information not the detailed
information. For this click on
the 3 dots and choose detail.
• There is one another issue in the orders_central table that there is a
discount column which is of type abc and one value is None. But if no
discount is there it must be 0. So, to do this, double click on none and
type 0. And to change the datatype, click on abc and choose number.
• We can see all the changes that we have made on the next slide.
• If we want, we can also assign a new name to the 1st clean step by
just double clicking clean 1 and enter a new name as “Fixing date and
discount”.
Removal of 1st
Problem in
Orders_East
• The problem in this table is
there is USD written with the
sales value. So, for this go to
the sales field and click on 3
dots and then choose clean
and then remove letters. It
will automatically remove
USD from the sales.
• After that convert the
datatype to the decimal
number.
• Now orders_east is fine.
Removal of 1st
problem in
Orders_West
• Problem in this table is the
state name. So to do this, go
to state column -> click on 3
dots -> group values ->
manual selection.
• Then enter the state names
one by one and press enter.
• After changing all the state
names, click on done.
Union of two files
• To perform the union of west and east dataset, drag the cleaning box
of west on to the cleaning step of east. New step will be added that
represents the union.
• Similarly, drag cleaning step of central and drop it on the union made
by east and west.
• At last, drag south data and drop it on the union box created earlier.
• The structure created is represented on the next slide.
• As shown in the picture, in the left side there is mismatched fields
column that is representing all the mismatched columns from the
union of 4 tables.
• Click on the checkbox of show only mismatched fields and we will
only get those fields.
• Drag Discounts field on the Discount to merge these two and drag
Product to Product Name.
• There are no mismatched fields left as shown in the picture on the
next slide.
• Now add new file, go to + -> Excel file -> Return Reasons_New.

Más contenido relacionado

La actualidad más candente

Tableau desktop & server
Tableau desktop & serverTableau desktop & server
Tableau desktop & server
Chris Raby
 
Tableau presentation
Tableau presentationTableau presentation
Tableau presentation
kt166212
 

La actualidad más candente (20)

Tableau desktop & server
Tableau desktop & serverTableau desktop & server
Tableau desktop & server
 
Tableau
TableauTableau
Tableau
 
Tableau slideshare
Tableau slideshareTableau slideshare
Tableau slideshare
 
Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2
 
Tableau Software - Business Analytics and Data Visualization
Tableau Software - Business Analytics and Data VisualizationTableau Software - Business Analytics and Data Visualization
Tableau Software - Business Analytics and Data Visualization
 
How to Improve Data Analysis Through Visualization in Tableau
How to Improve Data Analysis Through Visualization in TableauHow to Improve Data Analysis Through Visualization in Tableau
How to Improve Data Analysis Through Visualization in Tableau
 
Data Visualization with Tableau - by Knowledgebee Trainings
Data Visualization with Tableau - by Knowledgebee TrainingsData Visualization with Tableau - by Knowledgebee Trainings
Data Visualization with Tableau - by Knowledgebee Trainings
 
My tableau
My tableauMy tableau
My tableau
 
Tableau Visual Guidebook
Tableau Visual GuidebookTableau Visual Guidebook
Tableau Visual Guidebook
 
Tableau online training
Tableau online trainingTableau online training
Tableau online training
 
Tableau interview questions www.bigclasses.com
Tableau interview questions www.bigclasses.comTableau interview questions www.bigclasses.com
Tableau interview questions www.bigclasses.com
 
Tableau Suite Analysis
Tableau Suite Analysis Tableau Suite Analysis
Tableau Suite Analysis
 
Data Visualization With Tableau | Edureka
Data Visualization With Tableau | EdurekaData Visualization With Tableau | Edureka
Data Visualization With Tableau | Edureka
 
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
 
Tableau
TableauTableau
Tableau
 
Tableau
TableauTableau
Tableau
 
Tableau presentation
Tableau presentationTableau presentation
Tableau presentation
 
Power BI for Developers
Power BI for DevelopersPower BI for Developers
Power BI for Developers
 
Power BI Overview
Power BI OverviewPower BI Overview
Power BI Overview
 
Visualization using Tableau
Visualization using TableauVisualization using Tableau
Visualization using Tableau
 

Similar a Tableau Prep.pptx

Basic Computer skill-P4 Excel.pptx
Basic Computer skill-P4 Excel.pptxBasic Computer skill-P4 Excel.pptx
Basic Computer skill-P4 Excel.pptx
Mastewal5
 
Introduction to Excel
Introduction to ExcelIntroduction to Excel
Introduction to Excel
Najma Alam
 
Excel creating pivot table
Excel creating pivot tableExcel creating pivot table
Excel creating pivot table
samikshaa sinha
 

Similar a Tableau Prep.pptx (20)

Pivots and Slicers_v5
Pivots and Slicers_v5Pivots and Slicers_v5
Pivots and Slicers_v5
 
Microsoft access exercises
Microsoft access exercisesMicrosoft access exercises
Microsoft access exercises
 
Chapter 7 -DescriptiveStatistics and Pivot Table
Chapter 7 -DescriptiveStatistics and Pivot TableChapter 7 -DescriptiveStatistics and Pivot Table
Chapter 7 -DescriptiveStatistics and Pivot Table
 
Lecture 4-Prepare data-Clean, transform, and load data in Power BI.pptx
Lecture 4-Prepare data-Clean, transform, and load data in Power BI.pptxLecture 4-Prepare data-Clean, transform, and load data in Power BI.pptx
Lecture 4-Prepare data-Clean, transform, and load data in Power BI.pptx
 
IS100 Week 8
IS100 Week 8IS100 Week 8
IS100 Week 8
 
Getting started with Tableau
Getting started with TableauGetting started with Tableau
Getting started with Tableau
 
Microsoft Excel Tutorial
Microsoft Excel TutorialMicrosoft Excel Tutorial
Microsoft Excel Tutorial
 
CREATING A DATASET FROM EXCEL IN POWER BI REPORT BUILDER
CREATING A DATASET FROM EXCEL IN POWER BI REPORT BUILDERCREATING A DATASET FROM EXCEL IN POWER BI REPORT BUILDER
CREATING A DATASET FROM EXCEL IN POWER BI REPORT BUILDER
 
Basic Computer skill-P4 Excel.pptx
Basic Computer skill-P4 Excel.pptxBasic Computer skill-P4 Excel.pptx
Basic Computer skill-P4 Excel.pptx
 
kiromax.ppt
kiromax.pptkiromax.ppt
kiromax.ppt
 
Advanced Filter Concepts in MS-Excel
Advanced Filter Concepts in MS-ExcelAdvanced Filter Concepts in MS-Excel
Advanced Filter Concepts in MS-Excel
 
Introduction to Excel
Introduction to ExcelIntroduction to Excel
Introduction to Excel
 
excell.pdf
excell.pdfexcell.pdf
excell.pdf
 
Working with Google Sheet - Portfolio.pptx
Working with Google Sheet - Portfolio.pptxWorking with Google Sheet - Portfolio.pptx
Working with Google Sheet - Portfolio.pptx
 
Libre Office Calc Lesson 2: Formatting and Charts
Libre Office Calc Lesson 2: Formatting and ChartsLibre Office Calc Lesson 2: Formatting and Charts
Libre Office Calc Lesson 2: Formatting and Charts
 
Excel 2007 for Retail
Excel 2007 for RetailExcel 2007 for Retail
Excel 2007 for Retail
 
Excel notes by satish kumar avunoori
Excel notes by satish kumar avunooriExcel notes by satish kumar avunoori
Excel notes by satish kumar avunoori
 
Office excel tips and tricks 201101
Office excel tips and tricks 201101Office excel tips and tricks 201101
Office excel tips and tricks 201101
 
Print18
Print18Print18
Print18
 
Excel creating pivot table
Excel creating pivot tableExcel creating pivot table
Excel creating pivot table
 

Más de Venneladonthireddy1 (10)

02 Hadoop.pptx HADOOP VENNELA DONTHIREDDY
02 Hadoop.pptx HADOOP VENNELA DONTHIREDDY02 Hadoop.pptx HADOOP VENNELA DONTHIREDDY
02 Hadoop.pptx HADOOP VENNELA DONTHIREDDY
 
CG Lecture 1.pptx GRAPHIS VENNELA DONTHIREDDY
CG Lecture 1.pptx GRAPHIS VENNELA DONTHIREDDYCG Lecture 1.pptx GRAPHIS VENNELA DONTHIREDDY
CG Lecture 1.pptx GRAPHIS VENNELA DONTHIREDDY
 
CG Lecture0.pptx
CG Lecture0.pptxCG Lecture0.pptx
CG Lecture0.pptx
 
Supervised Learning-classification Part-3.ppt
Supervised Learning-classification Part-3.pptSupervised Learning-classification Part-3.ppt
Supervised Learning-classification Part-3.ppt
 
unit3.ppt
unit3.pptunit3.ppt
unit3.ppt
 
EOD Continued.pdf
EOD Continued.pdfEOD Continued.pdf
EOD Continued.pdf
 
lecture 1234.ppt
lecture 1234.pptlecture 1234.ppt
lecture 1234.ppt
 
VLAN _SLAN and VSAN.pptx
VLAN _SLAN and VSAN.pptxVLAN _SLAN and VSAN.pptx
VLAN _SLAN and VSAN.pptx
 
fashion.pptx
fashion.pptxfashion.pptx
fashion.pptx
 
Lecture12_16717_Lecture1.ppt
Lecture12_16717_Lecture1.pptLecture12_16717_Lecture1.ppt
Lecture12_16717_Lecture1.ppt
 

Último

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
mphochane1998
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
AldoGarca30
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
HenryBriggs2
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
pritamlangde
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
 

Último (20)

A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
Learn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic MarksLearn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic Marks
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 

Tableau Prep.pptx

  • 2. Tableau prep • Tableau Prep Builder is a tool in the Tableau product suite designed to make preparing your data easy and intuitive. Use Tableau Prep Builder to combine, shape, and clean your data for analysis in Tableau. • Start by connecting to your data from a variety of files, servers, or Tableau extracts. Connect to and combine data from multiple data sources. Drag and drop or double-click to bring your tables into the flow pane, and then add flow steps where you can then use familiar operations such as filter, split, rename, pivot, join, union and more to clean and shape your data.
  • 3. • Each step in the process is represented visually in a flow chart that you create and control. Tableau Prep tracks each operation so that you can check your work and make changes at any point in the flow. • When you are finished with your flow, run it to apply the operations to the entire data set.
  • 5. • Go to the > Connections + -> Text File (Any type of file). • Choose the file Orders_South 2015 (for the given example) -> open • It will be loaded into the tableau as shown in the next slide. • The first option is connections that shows the file that we have loaded into the tableau prep. • Below that, there is a table option that shows the tables available in the file that we have uploaded. • Then in the top right (white area with a flow chart) is known as flow area that shows how the data flow is going on.
  • 6.
  • 7. • Below flow area different options are there such as Input, connection, text options, field separator, etc. • Table like structure in the bottom right is the information about the dataset that we have loaded such as column names, datatype, sample data, etc. • Check boxes are given in that table where we can uncheck any box for the column that we don’t want for our analysis. • Type specifies the type of data stored in the table. • Field name is the name of the column that we can change according to our requirement. Eg: Sales to sale for our data
  • 8. • Original Field name is the name of the column in the actual dataset that will remain same. • Changes represent if any changes made into the data. • Preview is some sample data given. • There is one another option i.e., Filter values with the help of which we can apply filter to the data. • When we will click on the filter values option, we will get one window (shown in the next slide) where we need to mention the logic or condition based on which we want to filter out our data. • Eg:- We only want to display the data of First-Class Ship Mode
  • 9.
  • 10. • Add a formula [Ship Mode]='first class’ into the window and click on apply. • We can also remove/edit the filter applied to our data by right clicking
  • 11. • Under input, there are certain options like settings, multiple files, data sample, changes. • Multiple files is the option with the help of which we can join multiple files. • Data sample is the option that help us to choose how many rows we want to use. • Changes is the tab that shows the changes that we made to our data.
  • 12. • We can even add new files to the existing area by dragging from the folder to the flow area.
  • 13. • In the orders_south table, there are the two columns order date and ship date. But in orders_central there are 6 columns that are representing order day, month, year and ship day, month and year. So, this is the problem. • In the orders_south, region column is there. But in orders_central, region column is not there. This is also the problem. • In the orders_central table, order date, ship date and regions, all these columns are available. But when scrolling down through the columns, there are duplicate columns with a prefix Right. To remove these duplicate columns, just check the check box in front of them. And the value given for the state is AZ, not the full name just like in other tables.
  • 14. • In the orders_East, there is a prefix (USD) with the values of Sales column. But in other tables no such prefix is there. So, we need to remove all these problems from our dataset as a part of cleaning.
  • 15. Cleaning using Tableau Prep • Go to the orders_central table in the flow area, click on the + and choose clean step. • We will get the output as shown in the picture that is showing the data distribution. Eg: Most of the orders are from standard class and many more. • In the orders_central table, no region column is there. So, this is the first step of our cleaning process.
  • 16. • There are 3 dots (…) available after rename fields option. Click on it to add a new field. • Then give Region as a field name and write “Central” in the formula section and click on Apply and Save. • A new column has been added to our data now with a name Region and value Central as shown in the picture on the next slide.
  • 17. Removal of 1st problem in Orders_Central
  • 18. • Second issue with the orders_central is missing order date and ship date column. For this again we will add a new calculated field. • Enter the details as Order Date as a field name and formula as MAKEDATE([Order Year],[Order Month],[Order Day]) and click on apply and save. • New field with a name Order Date has been added into the data as shown in the picture on the next slide. • Similarly add a new field ship date with a formula MAKEDATE([Ship Year],[Ship Month],[Ship Day]).
  • 19. Removal of 2nd problem in Orders_Central
  • 20. • As we have now ship date and order date, so we do not need separate fields such as order month, year etc. To remove them, just go that field and click on 3 dots and then remove. • Apply the same procedure to remove ship date, month and year from the dataset. • All the changes that we are making is not in the original dataset.
  • 21. • For ship date and order date, we are getting summary information not the detailed information. For this click on the 3 dots and choose detail.
  • 22. • There is one another issue in the orders_central table that there is a discount column which is of type abc and one value is None. But if no discount is there it must be 0. So, to do this, double click on none and type 0. And to change the datatype, click on abc and choose number. • We can see all the changes that we have made on the next slide. • If we want, we can also assign a new name to the 1st clean step by just double clicking clean 1 and enter a new name as “Fixing date and discount”.
  • 23.
  • 24. Removal of 1st Problem in Orders_East • The problem in this table is there is USD written with the sales value. So, for this go to the sales field and click on 3 dots and then choose clean and then remove letters. It will automatically remove USD from the sales. • After that convert the datatype to the decimal number. • Now orders_east is fine.
  • 25. Removal of 1st problem in Orders_West • Problem in this table is the state name. So to do this, go to state column -> click on 3 dots -> group values -> manual selection. • Then enter the state names one by one and press enter. • After changing all the state names, click on done.
  • 26.
  • 27. Union of two files • To perform the union of west and east dataset, drag the cleaning box of west on to the cleaning step of east. New step will be added that represents the union. • Similarly, drag cleaning step of central and drop it on the union made by east and west. • At last, drag south data and drop it on the union box created earlier. • The structure created is represented on the next slide. • As shown in the picture, in the left side there is mismatched fields column that is representing all the mismatched columns from the union of 4 tables.
  • 28.
  • 29. • Click on the checkbox of show only mismatched fields and we will only get those fields. • Drag Discounts field on the Discount to merge these two and drag Product to Product Name. • There are no mismatched fields left as shown in the picture on the next slide.
  • 30.
  • 31. • Now add new file, go to + -> Excel file -> Return Reasons_New.