Unlocking potential by illuminating the path to data literacy. 

  • Data7 Exploratory Data Analysis in R Workshop Series

    Exploratory data analysis (EDA) is crucial for determining the validity of data, yet it is often performed too late or not at all. The R programming language, typically used through the RStudio IDE, is a widely used open source platform for data analysis and visualization thanks to the variety of packages available. The dlookr package is introduced as a tool for conducting preliminary EDA to diagnose issues with imported data sets, including outliers and missing data, and to produce summary statistical reports.

  • Data7 Exploratory Data Analysis in Python Materials

    Exploratory data analysis (EDA) is important throughout the data pipeline. I use the Python programming language with the pandas and pandas-profiling libraries to conduct preliminary EDA and diagnose major issues with an imported data set, such as outliers and missing data.
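
As a rough illustration of the kind of preliminary checks described above, the following minimal sketch uses only pandas. The toy data, the column names, and the helper functions `summarize_missing` and `flag_outliers_iqr` are hypothetical and are not taken from the workshop materials. The 1.5 × IQR rule is one common outlier heuristic, assumed here for illustration.

```python
import pandas as pd


def summarize_missing(df: pd.DataFrame) -> pd.DataFrame:
    """Return the count and percentage of missing values per column."""
    missing = df.isna().sum()
    return pd.DataFrame({
        "missing_count": missing,
        "missing_pct": (missing / len(df) * 100).round(1),
    })


def flag_outliers_iqr(series: pd.Series, k: float = 1.5) -> pd.Series:
    """Return a boolean mask of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, q3 = series.quantile(0.25), series.quantile(0.75)
    iqr = q3 - q1
    # Comparisons against NaN evaluate to False, so missing values are not flagged.
    return (series < q1 - k * iqr) | (series > q3 + k * iqr)


# Hypothetical toy data standing in for an imported data set.
df = pd.DataFrame({
    "height_cm": [170.0, 165.0, None, 180.0, 175.0, 999.0],
    "group": ["a", "b", "a", None, "b", "a"],
})

missing_report = summarize_missing(df)
outlier_mask = flag_outliers_iqr(df["height_cm"])

print(df.describe(include="all"))  # summary statistics for every column
print(missing_report)              # missing values per column
print(df[outlier_mask])            # rows flagged as outliers in height_cm
```

A profiling library such as pandas-profiling can produce a fuller report covering similar checks. The hand-written version here is only meant to show what those checks look for.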
