What is data science? How can my students and I engage in it? Why is it essential in scientific disciplines? How does it contribute to solving societal problems? How it can become an important part of the science or math curriculum that I teach?
In the summer of 2020, the Data Clubs project hosted a virtual workshop for high school teachers to help them answer these questions. The workshop introduced teachers, especially those whose students are doing research projects, to data science with a 15-hour workshop consisting of several modules that were implemented online.
Teachers worked with multiple data sets curated by the project on topics ranging from climate change to COVID-19. Teachers examined these datasets, asked and iterated their own questions of the data, and explored patterns—much in the same ways that their students will do.
The modules emphasized solving problems involving datasets and data tools and engaging with key data science concepts, including:
- the ubiquity of data and the potential for learning from data;
- the complexities of measurement and how to look at data through these complexities;
- the case/attribute structure of data;
- describing distributions and distributional shape;
- looking at relationships within a dataset by comparing distributions and exploring correlations between attributes; and
- determining which questions can and cannot be asked of a dataset.
Teachers used data as they examined rates, charted changes over time, made comparisons between groups, and learned how multiple attributes can contribute to a particular finding. They also learned to use CODAP, an online tool for working with critical data science ideas, and had the opportunity to import, clean, and curate their own data sets.
Finally, teachers learned how to assess their students’ competency with data and figure out what is essential for their students to understand.