Unlocking potential by illuminating the path to data literacy.
Data7 Exploratory Data Analysis in R Workshop Series
Exploratory data analysis (EDA) is crucial for determining the validity of data, yet it is often performed too late or not at all. The R programming language, typically used through the RStudio IDE, is a widely used open source platform for data analysis and visualization, due in part to the variety of packages available. The dlookr package is introduced as a tool for conducting preliminary EDA to diagnose issues with imported data sets, including outliers and missing data, and to produce summary statistical reports.
Data7 Exploratory Data Analysis in Python Materials
Exploratory data analysis (EDA) is important throughout the data pipeline. I utilize the Python programming language and the pandas and pandas-profiling libraries to conduct preliminary EDA to diagnose any major issues with an imported data set, such as data outliers and missing data.
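As a rough illustration of the kind of preliminary check described above, the sketch below uses plain pandas to summarize missing data and flag outliers. It is not taken from the workshop materials: the helper names `diagnose` and `iqr_outliers`, the toy data set, and the choice of the 1.5 × IQR rule for outliers are all assumptions made for this example.

```python
import pandas as pd


def diagnose(df: pd.DataFrame) -> pd.DataFrame:
    """Per-column summary: dtype, missing count, missing percent, unique count."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "missing": df.isna().sum(),
        "missing_pct": df.isna().mean() * 100,
        "unique": df.nunique(),  # NaN values are not counted as a category
    })


def iqr_outliers(series: pd.Series, k: float = 1.5) -> pd.Series:
    """Return values lying more than k * IQR outside the quartiles (assumed rule)."""
    q1, q3 = series.quantile([0.25, 0.75])
    iqr = q3 - q1
    mask = (series < q1 - k * iqr) | (series > q3 + k * iqr)
    return series[mask]


# Hypothetical toy data with one missing value per column and one implausible height.
df = pd.DataFrame({
    "height_cm": [150.0, 160.0, 165.0, 170.0, 172.0, None, 400.0],
    "group": ["a", "b", "a", "b", None, "a", "b"],
})

report = diagnose(df)
outliers = iqr_outliers(df["height_cm"])

print(report)
print(outliers)
```

Running a check like this immediately after import makes it easier to spot problems before any modeling begins. A profiling library can generate a fuller report, but the same questions (what is missing, what looks implausible) apply.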