Introduction to Exploratory Data Analysis

pedal sepal

Nowadays, the world is powered by data. Whether you are innovating for the future or working to improve the decision-making process, data lies at the heart of it all. Making sense of data, the job of Data Scientist, is “The top-demanded job of the 21st century”. One of the must-have skills for a data scientist is exploratory data analysis (EDA). It is the process of performing initial investigations on data with the help of summary statistics and graphical representations. The project is designed to help students develop basic EDA skills such that given new data, they can discover patterns, spot anomalies and test hypotheses. We will focus on developing practical skills. Starting from small, tailored, curated “Iris” data sets. Students will work as a group to apply the learning to perform EDA on a famous dataset -Titanic dataset. Finally, they will be given some datasets (e.g. some overlake community data set, sports) from different sources to tell their own stories. Learn More Download this PDF


Special Considerations

Participants must be willing to follow COVID guidelines.  Participants should only attend illness free. If a participant should test positive for COVID during the project, the student will be isolated, and the student’s family will be required to pick them up at their current location.  The family will bear all the costs associated with the student pick up. 

By selecting the above project, you and your family are accepting the financial responsibility for the cost of the projects.  This includes any unrecoverable expenses incurred if Overlake, or any federal, state or local government forces cancelation of the project, or if you or your family decide not to attend, or if your student becomes symptomatic and has to be isolated. 

Project Week (2022-23)

Mar 27, 2023 - Mar 31, 2023


Chrissy Wakeling
Assistant Director of Experiential Education