Data Exploring

Data exploring focuses on understanding data quality, patterns, distributions, and relationships.

There are several data exploration techniques. The most common ones are:

  • Heading data (looking at the first row)
  • Creating histograms to visualize distributions
  • Producing scatter plots to asses the relationships between variables
  • Conduct dimensional reductions to simplify the dataset
  • Cluster the data to identify groups and patterns

This usually brings dashboards, graphs, and other visual representations of data.

The objective of this phase is not to understand every minor detail, but to get a good sense of the trends to make a proper model.

To visualize data, we use tools like Power BI, Tableau and Python libraries like matplotlib.

Exploratory Data Analysis

Exploratory Data Analysis (EDA) is an iterative process combining analysis and visualization to understand data.