2. Dr.Girija Narasimhan 2
It is an interactive data visualization software.
This tool is used for effective data analysis and data visualization in the
industry.
It has a drag and drop interface and this feature helps it to perform tasks
easily and very fast.
Tableau
4. Tableau Desktop
Tableau Online
Tableau Server
Tableau Reader
Tableau Public Free
Dr.Girija Narasimhan 4
Tableau Version
Tableau Desktop is now free for all full-time students. ...
Through Tableau for Students, you can get a 1-year license
of Tableau Desktop for free! Others 14-day free trial.
Tableau Reader is a free tool which allows you to view
the workbooks and visualizations created using Tableau
Desktop or Tableau Public.
Tableau Online is your analytics platform fully hosted in the cloud. Publish
dashboards and share your discoveries with anyone. Invite colleagues or
customers to explore hidden opportunities with interactive visualizations
and accurate data. All easily accessible from a browser or on the go with
our mobile apps. 14-days free trial
14-day free trial.
5. Dr.Girija Narasimhan 5
Tableau Desktop Tableau Public
Pricing
Personal Edition - $35 per month Professional
Edition - $70 per month
Free and Open Source
Data Source Connection
Connect to any data source files including
databases, web applications and more
Connect to Excel and text files
Publish/Save
Can be saved as a local file and can be
published in Tableau Server and Tableau
Online
Can be published only in Tableau Public
profile
OS Windows and Mac Windows and Mac
Security Data and Visualizations are secured
Data and visualizations are not secured as it
is available in public
Usage Details Professional and Enterprise level Personal level
Data Extract
Data can be extracted from various data
sources and stored as Tableau extract file
Data cannot be obtained from different data
sources as it is limited to connect only excel
and text files.
Tableau Public (Free) vs. Tableau Desktop (Commercial)
8. Dr.Girija Narasimhan 8
•The architecture of Tableau Server is designed to connect different data sources securely.
•Data Server is the first layer in the architecture. It helps Tableau to connect data in various heterogeneous
environments.
•Data connector is the second layer. It helps to connect to various databases using its ODBC connector.
•Tableau can connect the real time live data by connecting the database directly. It can also extract a local copy
of data through its in-memory data store for faster processing.
•The components such as Application Server, VizQL Server, and data server act as the third layer.
•The application server is used for authentications and authorizations.
•VizQL is used to convert the SQL query into visualizations.
•Data Server is a centralized data management system used in the architecture.
•A gateway is used to distribute the processes into different components.
•The fourth layer of the architecture are the clients such as Tableau Desktop, web and Mobile.
Tableau Architecture & Server Components
9. Dr.Girija Narasimhan 9
Tableau Server is designed in a way to connect many data tiers. It can connect clients from desktop, mobile, and
web. Tableau Desktop is a robust data visualization tool. It is highly available and secure.
It can run on both virtual and physical machines. It is a multi-user, multi-process and multi-threaded system.
The various layers used in the Tableau server are given in the following architecture diagram
10. Dr.Girija Narasimhan 10
1.Data Server
The primary component of Tableau Architecture is the Data sources it can connect to it. Tableau can
connect to multiple data sources. These data sources can be on-premise or remotely located. It can
connect to a database, excel file, and a web application all at the same time. Tableau can connect data
from heterogeneous environments. It can blend the data from multiple data sources. It can also make the
relationship between various types of data sources.
11. Dr.Girija Narasimhan 11
2.Data Connectors
The Data Connectors provide an interface to connect external data sources to Tableau Data Server.
Tableau has in-built ODBC/SQL connector. This ODBC Connector can connect to any databases without using
their native connector. Tableau has an option to select both live and extract data. Based on the usage, one can
be easily switched between extracted and live data.
•Live Connection or Real time data: Tableau can connect to real time data by linking to the external database
directly. It uses the infrastructure of existing database system by sending dynamic MDX (Multidimensional
Expressions) and SQL statements. This feature can link to the live data with Tableau rather than importing the
data. It makes good the investment done by an organization on a fast and optimized database system. In many
enterprises, the size of the database is huge and is updated periodically. In those cases, Tableau works as a
front-end visualization tool by connecting to the live data.
•Extracted or In-memory data: Tableau has an option to extract the data from external data sources. We can
make a local copy in the form of tableau extract file. It can extract millions of records in Tableau data engine with
a single click. Tableau's data engine uses storage such as RAM, ROM and cache memory to store and process
data. Using filters, Tableau can extract few records from a huge dataset. This improves the performance,
especially while working on massive datasets. Extracted or in-memory data allows the users to visualize the
data offline, without connecting to the data source.
12. Dr.Girija Narasimhan 12
3.Components of Tableau Server
The different components present in a Tableau server are:
•Application Server
•VizQL Server
•Data Server
A) Application Server:
The application server is used to provide the authentications and authorizations. It handles the
administration and permission for web and mobile interfaces. It assures security by recording each session
id on Tableau Server. The administrator can configure the default timeout of the session in the server.
B) VizQL Server:
VizQL server is used to convert the queries from the data source into visualizations. Once the client
request is forwarded to VizQL process, it sends the query directly to data source and retrieves information
in the form of images. This image or visualization is presented to the user. Tableau server creates a cache
of visualization to reduce the load time. The cache can be shared across many users who have the
permission to view the visualization.
C) Data Server:
Data server is used to manage and store the data from external data sources. It is a central data
management system. It provides metadata management, data security, data storage, data connection and
driver requirements. It stores the relevant details of data set such as metadata, calculated fields, sets,
groups, and parameters. The data source could extract data as well make live connections to external data
sources.
13. Dr.Girija Narasimhan 13
4.Gateway
The gateway channelizes the requests from users to Tableau components.
When the client makes a request, it is forwarded to external load balancer for processing. The gateway works as
a distributor of processes to various components.
In case of absence of external load balancer, gateway also works as a load balancer. For single server
configuration, one primary server or gateway manages all the processes. For multiple server configurations, one
physical system works as primary server while others are used as worker servers. Only one machine can be
used as a primary server in Tableau Server environment.
14. Dr.Girija Narasimhan 14
Tableau Desktop Tableau Desktop is a business analytics tool. It
helps to create, view and publish dashboards in
Tableau Server. Users can access various data
sources and build visualizations in Tableau
Desktop.
Mobile The dashboards from the server can be
interactively visualized using mobile browsers and
applications. The browser and application can be
used to view and edit the contents in the
workbook.
Web Web browsers such as Google Chrome, Safari,
Firefox and internet explorer support the Tableau
server. The contents and visualizations in the
dashboard can be edited through these web
browsers.
Clients
The dashboards and visualizations in Tableau server can be viewed and edited using different clients. The
Clients are Tableau Desktop, web browser and mobile applications.
Clients Environment
15. Dimension vs. Measure
Dimension is a field that can be considered an
independent variable.
By default, Tableau treats any field containing
qualitative, categorical information as
a dimension. (Slice and dice)
The measure is the number-dependent
variable.
Measures contain quantitative values. By
default, Measures can be aggregated, drag
a measure into the view.
Rows or Columns shelves in the view
calculate the “Sum of Sales” for every “State in India”.
The values of Sales fields (Measures) are dependent on the State, So State is an independent field and Sales is
a dependent field. Dr.Girija Narasimhan 15
16. Dr.Girija Narasimhan 16
Find total sales in each category for each region.
South, west,
central, east
Furniture, office
Supplier, technology
17. Dr.Girija Narasimhan 17
Measure names and Measure values:
Measure names and Measure values are the two fields created in Tableau by default. These fields are
created when a data set is imported into Tableau. You can go to a Data pane of the worksheet and view the
field
Measure Names:
A measure name consists of all the names of the measure
present in a data set. It always presents at the end of
dimension list. It consists of discrete values of all measure
names.
Measure Values:
All the measure values present in a data set are kept
together in the field called measure values. It always
presents at the end of measures list. It consists of
continuous values of all measures.
18. Data fields are made from the columns in your data source.
Each field is automatically assigned a data type (such as
integer, string, date),
Role: Discrete Dimension or Continuous Measure (more
common), or Continuous Dimension or Discrete Measure (less
common).
Data field roles and types
Dr.Girija Narasimhan 18
19. ContinuousDiscrete
In the Data window, green icons indicate continuous fields
and add axes when dragged to the view
Discrete data such as hierarchies and
members that cannot be aggregated.
Discrete values are independent of
each other and they create
breakdowns.
In the Data window, blue icons indicate
discrete and add headers when dragged to the
view
Examples of dimensions include
customer names, Row Id, state and
customer segments.
These numbers can take on any value or they do
not fall into distinct categories. Continuous values
create a continuous range.
These are usually variables such as Unit Price, Profit, or Order
Quantity, Gross Profit, Shipping Cost or Inventory.
Roles
Dr.Girija Narasimhan 19
22. Quantity field has been set
to Discrete. It creates horizontal
headers instead of an axis
Quantity field is set to Continuous, it creates a horizontal axis
Dr.Girija Narasimhan 22
23. Product Category Bill amount
There is no limit to the bill amount a customer can purchase- continues
where no matter how many products are purchased, they will each fall into specific
categories. Each order in your transaction database could potentially have a different
quantity, but there will be a finite number of product categories- discrete.
Identify continuous and Discrete
Dr.Girija Narasimhan 23
26. There are three basic steps involved in creating any Tableau data analysis report.
These three steps are −
•Connect to a data source
•Choose dimensions and measures − This involves selecting the required columns from the
source data for analysis.
•Apply visualization technique − This involves applying required visualization methods, such
as a specific chart or graph type to the data being analyzed.
Tableau - Get Started
Dr.Girija Narasimhan 26
28. Connect to a data source − It involves locating the data and using an appropriate type
of connection to read the data.
Dr.Girija Narasimhan 28
29. Then navigate to the file “Sample – Superstore.xls” as mentioned
above. The excel file has three sheets named Orders, People and
Returns. Choose Orders.
Dr.Girija Narasimhan 29
37. Dr.Girija Narasimhan 37
• Data present in the data source can be sorted based on the user requirement.
• It can be sorted using data source order, ascending, descending, ascending per table and
descending per table.
• Once the data is connected to Tableau, data sorting can be done using the "Sort Fields" option.
• The option is present in the "Data Source" tab.
Data Sorting
48. Dr.Girija Narasimhan 48
Menu Bar:
It consists of menu options such as File, Data, Worksheet, Dashboard, Story, Analysis, Map, Format, Server, and
Windows. The options in the menu bar include features such as file saving, data source connection, file export,
table calculation options, and design features for creating a worksheet, dashboard, and storyboard.
Toolbar Icon:
Toolbar icon present below the menu bar can be used to edit the workbook using different features such as undo,
redo, save, new data source, slideshow and so on.
Dimension Shelf:
The dimensions present in the data source can be viewed in the dimension shelf.
Measure Shelf:
The measures present in the data source can be viewed on the measure shelf.
The user-defined sets and parameters can be viewed in the sets and parameter shelf. It can also be used to edit the existing
sets and parameters.
Page Shelf:
Page shelf can be used to view the visualization in video format by keeping the relevant filter on the page shelf.
Filter Shelf:
The filters that can control the visualization can be placed on the filter shelf, and the required dimensions or measures can be
filtered in.
Marks Card:
Marks card can be used to design the visualization. The data components of the visualization such as color, size, shape, path,
label, and tooltip used in the visualizations can be modified in the marks card.
Sets and Parameters Shelf:
49. Dr.Girija Narasimhan 49
Worksheet:
The worksheet is the place where the actual visualization can be viewed in the workbook. The design and functionalities of
the visual can be viewed in the worksheet.
Tableau Repository:
Tableau repository is used to store all the files related to tableau desktop. It includes various folders such as Bookmarks,,
Mapsources, web datasources, local, bin, patterns.
51. Dr.Girija Narasimhan 51
Data Source:
The addition of new data source of modification of existing
data source can be done using the 'Data Source' tab present
at the bottom of the Tableau Desktop Window.
Current Sheet:
Current Sheet can be viewed with the name of the sheet. All
the sheets, dashboards and story board present in the
workbook can be viewed here.
New Sheet:
The new sheet icon present in the tab can be used to create a
new worksheet in the Tableau Workbook.
New Dashboard:
The new dashboard icon present in the tab can be used to
create a new dashboard in the Tableau Workbook.
New Storyboard:
The new storyboard icon present in the tab can be used to
create new storyboard in the Tableau Workbook.
54. •Dimensions are the descriptive data while measures are numeric data
•Choose dimensions and measures − This involves selecting the required columns from the source data for
analysis.
Choose Category and Region as the dimensions and Sales as the measure.
Drag and drop them as shown in the following screenshot. The result shows the total sales in each category for
each region.
Click sheet1
Dr.Girija Narasimhan 54
57. By default – it will select Bar chart
Dr.Girija Narasimhan 57
58. Drag and drop add another
dimension segment and click color
Dr.Girija Narasimhan 58
Using marks card we can easily reduce a large volume of data to a simple,
comprehensible visualization.
Marks card provides control over how the data is displayed in the view.
if the inner fields are a measure and a dimension, the
default mark type is a bar.
Segment Values are homeoffice,
corporate, Consumer
65. Bar Chart with Color Range
The longer bars get darker shades and the smaller bars get the lighter shades
Dr.Girija Narasimhan 65
drag the profit field to the color
palette under the Marks Pane
Apply colors to the bars
based on their ranges
Based on Ranges colors
are from dark blue shade
to light shade
69. Use a separate bar for each segment
Dr.Girija Narasimhan 69
Drag
segment
from
Marks to
column
70. From the Measures pane, drag segment Values to Rows.
Dr.Girija Narasimhan 70
71. A waterfall chart is a form of data visualization that helps in understanding the cumulative effect of sequentially
introduced positive or negative values.
The waterfall chart is also known as a flying bricks chart or Mario chart
Type of column chart
It is used to show how an initial value is increased/decreased by a series of intermediate values, to a final value.
Find the variation of Sales for each Sub-Category of Products
Hint: cumulative of Sales -> sum(sales) in columns
Rows -> sub-category
Apply the chart with color
Dr.Girija Narasimhan 71
waterfall chart
72. Dr.Girija Narasimhan
Step 1 : Select Ascending or Descending
Step 3 : select Gantt Bar
Step 4 :drag sales in color
Step 2:Click this arrow
73. Histogram
Histograms are based on area, not height of bars
Histogram. It is similar to a Bar Chart .
Histogram from a continuous variable, first need to split the data into intervals, called bins. In the example
age has been split into bins, with each bin representing a 10-year period starting at 20 years.
Dr.Girija Narasimhan 73
75. Using the Sample-superstore, Find the quantities of sales for different regions.
Select the Histogram chart.
Column : Quantity (bin)
Rows: CNT (Quantity)
Marks Card : Region (color)
: ctrl & drag CNT(Quantity) in Label
Dr.Girija Narasimhan 75
76. Dr.Girija Narasimhan 76
Step 1: Drag Quantity to Columns.
Step 2:Click Show Me on
the toolbar, then select
the histogram chart type.
79. Filtering is the process of removing certain values or range of values from a result set.
Tableau - Basic Filters
There are three types of basic filters available in Tableau. They are as follows −
•Filter Dimensions are the filters applied on the dimension fields.
•Filter Measures are the filters applied on the measure fields.
•Filter Dates are the filters applied on the date fields.
Dr.Girija Narasimhan 79
80. Dr.Girija Narasimhan 80
Drag Region
dimension then
filter pop up will
appear.
Select only two
regions
Filter General
Column : sum(sales)
Row : sub-category
90. Dr.Girija Narasimhan 90
Select only sub-categories
whose sum of sales is > 50
Step 1 Step 2
step 3
If “Null” appears then selected incorrect
aggregated function, if it gives value means
it is perfect
91. Dr.Girija Narasimhan 91
Sum sales in columns
indicates the bar
Include sum
(quantity)
Tooltip you can check
the quantity
124. Dr.Girija Narasimhan 124
Only highlighted portion having the
label remaining portion no labels.
Select the highlighted using mouse.
Like hand will appear you can
select it
129. Dr.Girija Narasimhan 129
A relational Database/Excel file consists of multiple Tables/sheets.
These multiple tables/sheets can be connected to each other in Tableau.
This connection is established by 'Join' or 'Union' feature present in Tableau.
The relationship between data in two or more tables needs to be specified while joining tables.
Join
130. Dr.Girija Narasimhan 130
Inner Join Left Join Right Join Outer Join
It joins all the
common records
between two tables
or sheets. The
joining condition
can be given based
on the primary key.
One or more joining
conditions can be
specified to join the
tables. Many tables
can be joined
together in Tableau
for visualization.
A left join is used to
join all the records
from a left table and
common records
from the right table.
One or more join
conditions can be
specified to left join
two different tables.
A right join is used
to join all the
records from a right
table and common
records from left
table. Based on the
requirements, one
or more joining
condition can be
set.
An Outer join is
used to join all the
records from both
left and right table.
Joining conditions
can be set to join
common records.
Tableau can "join" tables. It can join up to 32 tables in a data source. While joining, the relationship between
two or more tables can be specified. The tables present in the data source can be related to each other using
the joins such as inner join, left join, right join and outer join. The functionalities of Join types are explained in
the table below,
Various Join
139. Dr.Girija Narasimhan 139
Each book has one author.
One book can have many ratings and many editions.
Reviews are given for the book, not the edition, so the same rating can be counted against multiple editions.
This means there is effectively a many-to-many relationship between ratings and editions