Exploratory Data Analysis

The exploratory data analysis helps to understand all the variables, their formats and examples, and contextualize them within the business and through time. This step also allows the team to validate that the dataset selected for analysis meets the established objective, as well as other data quality requirements. The exploratory data analysis should contain univariate and multivariate analysis: 

  • Visualizations and summary statistics that describe the data and allow to find patterns and/or to detect outliers that may exist within it (e.g., Histograms, Box plots, bar and line charts, …) 
  • Correlation plots between variables and other analyses that allow for the identification of trends and the detection of anomalous behaviour (e.g., Heat maps, Scatter plots, Cross-tabulation, …)