0️⃣Smart Plot

❓What It Can Do for You

Smart Plot is HEARTCOUNT's key feature: it makes visualization tasks that would normally require a lot of code quick and easy. To visualize the data the way you want to examine it, simply select which variable goes on the x-axis and which goes on the y-axis, then choose the type of visualization you wish to use.

Smart Plot provides a variety of visualizations, depending on the types of the variables you select:

| Data Type | Available Visualization |
| --- | --- |
| Between numeric and other numeric | Scatterplot, Trend Line (regression line), Heat Scatter |
| Between categorical and numeric | Bar (average or sum), Stacked Bar, Stacked Area, 95% confidence interval, Boxplot |
| Between categorical and categorical | Ratio Bar Chart, Stacked Count Bar |
| Between time series and numeric | Time Series Line Chart, Stacked Area, Trend Line, Forecast |

📃How to Use

- Basics

Smart Plot consists of four key sections. You can easily create a suitable visualization of your dataset using these sections without writing a single line of code.

| Area | What's It For |
| --- | --- |
| 1. Main Area | This is where the plot is displayed. You can interact with plot elements, such as data points in a scatterplot, to investigate the dataset further and answer your analytic questions. |
| 2. Side Menu | This is where you configure Smart Plot's parameters, such as which variables control the colors or sizes of data points, or how to filter the data. Also, you could choose |
| 3. Variable Selection | This is where you choose the variables for a plot that suits your analytic purpose. As when creating a data visualization in a code-heavy setting (R/Python), you must choose which variables to place on the x and y axes and which to use for subgrouping or faceting. |
| 4. Visualization Type | This is where you choose the type of visualization used to plot the data. Given the variables on the x and y axes, the Visualization Type tab offers several options suited to visualizing your data. |

- Types of Visualizations

This section describes the visualizations available for each combination of x-axis and y-axis variables.

When you put numeric variables on both axes, Smart Plot displays a simple scatterplot. A Pearson correlation coefficient is also shown as basic information about the two variables.

See in Detail

There are two available types of additional visualizations for the scatterplot.
- You may choose to display a trend line, which is a regression line. It shows the linear relationship between the x and y variables.
  - To the left of the trend line icon is a number. It is the Pearson correlation coefficient, which shows how strongly the two variables are linearly correlated.
    Pearson correlation coefficients range from −1 to +1, inclusive.
    If it equals zero, the variables are not linearly correlated at all.
    If it equals plus one, they are perfectly positively correlated.
    If it equals minus one, they are perfectly negatively correlated.
  - If you drag to select data points of interest, the trend line shows the linear relationship of only the selected data.
- The other available option is a heatmap (also known as a heat scatter).
  - This shows the areas where data points cluster most densely.
  - The gradient color scale changes according to the color settings at the top right of HEARTCOUNT.
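
For readers who want to see what the number next to the trend line icon represents, here is a minimal sketch in plain Python that computes a Pearson correlation coefficient and a least-squares regression line. The function name `pearson_and_trend_line` and the sample data are hypothetical, and this is only an illustration of the standard formulas, not a description of how HEARTCOUNT computes them internally.

```python
from math import sqrt


def pearson_and_trend_line(xs, ys):
    """Return (r, slope, intercept) for paired numeric samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally sized samples with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and of cross-products
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        raise ValueError("a constant variable has no defined correlation")
    r = sxy / sqrt(sxx * syy)              # Pearson coefficient, between -1 and +1
    slope = sxy / sxx                      # least-squares slope of y on x
    intercept = mean_y - slope * mean_x    # the line passes through the means
    return r, slope, intercept


# Hypothetical data lying exactly on the line y = 2x + 1
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
r, slope, intercept = pearson_and_trend_line(xs, ys)
```

Passing only a subset of the points to the same function mirrors what happens when you drag to select data points: the coefficient and the line are recomputed from the selection alone.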

There are two major ways to visualize the data when you put a categorical variable on the x-axis and a numeric variable on the y-axis. One is to visualize individual data points, and the other is to create subgroups based on the categorical variable on the x-axis.

Individual Data Point Visualization

Scatterplot : This is equivalent to the scatterplot for two numeric variables, but the data points are aligned by the categories on the x-axis.

95% Confidence Interval : This shows the 95% confidence interval for each group of the categorical variable on the x-axis. A box placed over a group's data points represents the confidence interval of that group's sample mean. When you click the box, a long horizontal band appears that you can use to compare it against the other groups' confidence interval boxes. You can think of this as eyeballing a two-sample t-test.
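
To make the idea of a confidence interval box concrete, the sketch below computes a 95% confidence interval for the mean of each group and checks whether two intervals overlap. It assumes a normal approximation (a z critical value from the standard library's `NormalDist`); the exact method HEARTCOUNT uses is not stated here, and a t-based interval would be somewhat wider for small samples. The helper names and the data are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def mean_confidence_interval(values, level=0.95):
    """Return (low, high) for the sample mean, using a normal approximation."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least 2 values to estimate spread")
    z = NormalDist().inv_cdf(0.5 + level / 2)    # about 1.96 for a 95% level
    half_width = z * stdev(values) / sqrt(n)     # z times the standard error
    center = mean(values)
    return center - half_width, center + half_width


def intervals_overlap(a, b):
    """True if the closed intervals a and b share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


# Hypothetical groups of a categorical variable
groups = {
    "A": [10, 12, 11, 13, 12, 11],
    "B": [20, 22, 19, 21, 23, 20],
}
intervals = {name: mean_confidence_interval(vals) for name, vals in groups.items()}
overlap = intervals_overlap(intervals["A"], intervals["B"])
```

Non-overlapping intervals suggest that the group means differ, but this is only a rough visual check and not a substitute for a formal two-sample test.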

Distribution : This is one of the two ways to visualize how data points are distributed along the y-axis for each group of the categorical variable on the x-axis. The length of each line represents the number of data points lying on that line.

Heatmap : Like Distribution, Heatmap represents the distribution of data points, but with gradient colors. Note that you can change the color scale with the color scale icon at the top right of HEARTCOUNT.
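
Both Distribution and Heatmap rest on counting how many data points fall at each level of the y-axis within a group. The sketch below shows one simple way to produce such counts by binning values into fixed-width intervals. The bin width, the function name `binned_counts`, and the data are hypothetical, and HEARTCOUNT may bin values differently.

```python
from collections import Counter
from math import floor


def binned_counts(values, bin_width):
    """Count values per bin; each key is the lower edge of its bin."""
    if bin_width <= 0:
        raise ValueError("bin_width must be positive")
    return Counter(floor(v / bin_width) * bin_width for v in values)


# Hypothetical y-values for two groups of a categorical x-axis variable
groups = {
    "A": [1.2, 1.7, 2.1, 2.4, 2.9],
    "B": [0.3, 0.8, 3.5],
}
distribution = {name: binned_counts(vals, 1.0) for name, vals in groups.items()}
```

Each count could then be drawn as the length of a line (Distribution) or mapped to a color intensity (Heatmap).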

Grouped Data Point Visualization

You may create subgroups based on the categorical variable on the x-axis by using the subgroup tab. Smart Plot will then provide a set of options for grouped data point visualization.

There are five metrics you may choose to put on the y-axis.
- Average of a numeric variable
- Sum of a numeric variable
- Record count: how many records are in each group
- Global ratio: the proportion of the whole dataset that each group accounts for
- Local ratio: the proportion of each group that each subgroup accounts for. This is useful when you divide each group into subgroups.
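
The five metrics above can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than HEARTCOUNT's implementation: the field names (`region`, `channel`, `sales`) and the function `grouped_metrics` are hypothetical, and the global ratio of a subgroup cell is taken to be its share of all records.

```python
from collections import defaultdict


def grouped_metrics(records, group_key, subgroup_key, value_key):
    """Compute the five metrics for every (group, subgroup) combination."""
    total = len(records)
    group_counts = defaultdict(int)   # records per group
    cells = defaultdict(list)         # values per (group, subgroup)
    for rec in records:
        group_counts[rec[group_key]] += 1
        cells[(rec[group_key], rec[subgroup_key])].append(rec[value_key])

    metrics = {}
    for (group, subgroup), values in cells.items():
        count = len(values)
        metrics[(group, subgroup)] = {
            "average": sum(values) / count,
            "sum": sum(values),
            "record_count": count,
            "global_ratio": count / total,               # share of the whole dataset
            "local_ratio": count / group_counts[group],  # share within its own group
        }
    return metrics


# Hypothetical records
records = [
    {"region": "East", "channel": "Online", "sales": 100},
    {"region": "East", "channel": "Online", "sales": 200},
    {"region": "East", "channel": "Store", "sales": 300},
    {"region": "West", "channel": "Online", "sales": 400},
]
metrics = grouped_metrics(records, "region", "channel", "sales")
```

Within any one group the local ratios sum to 1, while the global ratios sum to 1 across the entire dataset.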

Here is the list of available visualization options.
- Bar chart : This is a simple bar chart where bar heights are proportional to the metric chosen for the y-axis.
  - You may use another variable to create subgroups within each group. Each bar on the x-axis will then be divided into multiple subgroup bars.
- Stacked Bar Chart : Another way to use the subgroup feature is with a stacked bar chart. Each bar on the x-axis will be divided into multiple subgroup bars, and the subgroup bars will be stacked so that the composition of the original bar can be better understood.
- Stacked Area Chart : A stacked area chart is similar to the stacked bar chart, except that the subgroup bars are connected so that you can better see how the value of each subgroup varies across the x-axis variable. With this chart, the local ratio of subgroups within each category is easy to calculate.
- Line Chart : This is a traditional line chart where seasonal patterns and trends can be explored if your dataset contains a temporal (time, date, etc.) variable.

- Additional Features

Faceting

Smart Plot offers a facet feature. It allows you to divide a single plot into multiple charts based on a facet variable in order to better understand the relationship between the x and y variables. You can also use every other feature in Smart Plot within each facet plot.

Categorical variables with fewer than 11 groups can currently be used as a facet variable.
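
Conceptually, faceting splits the records by the value of the facet variable and draws one chart per value. The sketch below shows that split together with the group limit mentioned above. The function name `split_into_facets`, the field names, and the error behavior are hypothetical and are not taken from HEARTCOUNT.

```python
from collections import defaultdict

MAX_FACET_GROUPS = 10  # "fewer than 11 groups", per the text above


def split_into_facets(records, facet_key, max_groups=MAX_FACET_GROUPS):
    """Group records by facet value; each group would become its own chart."""
    facets = defaultdict(list)
    for rec in records:
        facets[rec[facet_key]].append(rec)
    if len(facets) > max_groups:
        raise ValueError(
            f"{facet_key!r} has {len(facets)} groups; at most {max_groups} are allowed"
        )
    return dict(facets)


# Hypothetical records faceted by a categorical variable
records = [
    {"gender": "F", "age": 31, "salary": 5200},
    {"gender": "M", "age": 45, "salary": 6100},
    {"gender": "F", "age": 28, "salary": 4800},
]
facets = split_into_facets(records, "gender")
```

Each list in `facets` can then be plotted with the same x and y variables, which is what makes the charts directly comparable.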

