This module covers the Tableau interface, including how to install Tableau, navigate the interface, and understand key features. We will make some basic charts to get familiar with the software.
Outcomes:
Links:
Before class, install Tableau on your computer:
Be careful how you save your file! Normal Tableau files don’t include the data. You must save as a packaged workbook for any submitted work in this class.
We share our data with Tableau Public. This makes our information and visualizations public. Do not use Tableau Public for any sensitive or private data.
We start by connecting to a data source. Tableau can connect to many different data sources, including databases, spreadsheets, and flat files. For this class, we will primarily use Excel and CSV files. Tableau can also connect to databases, but that is outside the scope of this class.
Data sources:
Once you connect to a data source, Tableau will show you a preview of the data. You will generally need to do some basic data cleaning before you can start visualizing the data. Common tasks include:
Each field can hold different types of data.
Some common data types include:
Text are strings of characters (like names, categories)Integers are number without a decimal point valueFloats/doubles include information past the decimal pointMoney is a type in some databases. It is an integer that records the number of pennies, which solves some problems with doubles/floatsBoolean are True or False valuesDates are a special type that records dates. Tableau can parse many different date formats, but they must have a day, month, and year. You can also hold time.Geographical types include country, state, county, city, and zip code. Tableau can use these to create maps.One of the most critical things to understand in Tableau is the difference between dimensions and measures.
Dimensions are fields that are used to slice and dice data. They are typically discrete values, such as categories or names. Dimensions are used to group data and create headers in tables.
Discrete are blue pills are discrete categories (like id number, neighborhood)Continuous are green pills are continuous ranges (like room price)Measures are fields that are used to aggregate data. They are typically numeric values, such as sales or quantities. Measures are used to calculate totals, averages, and other statistics.
Mode is the most common itemMean is sum / countMedian is middle item when sorted large/smallCountsummaxminWe commonly use discrete measures and continuous dimensions. However, this is not always the case. You can change a dimension to a measure by right-clicking on the field and selecting “Convert to Measure”. Similarly, you can change a measure to a dimension by right-clicking on the field and selecting “Convert to Dimension”.
To create a table in Tableau, drag and drop fields from the Data pane to the Rows and Columns shelves. You can also use the Show Me panel to quickly create a table.
A table is not chart unless you also use pre-attentive attributes to map data to visual features. For example, you can use color to represent different categories or size to represent different values.
Source: https://www.reddit.com/r/dataisbeautiful/comments/1qaq7kz/a_quarter_century_of_television_oc/ by gammafission00