In spring 2024, IRDL Online hosted a 3-day workshop by The Carpentries. Taught over Zoom by Data Carpentry instructors, the social sciences workshop covered data management and analysis for social science research, including best practices for data organization in spreadsheets, reproducible data cleaning with OpenRefine, and data analysis and visualization in R. Learn more about the workshop by reading the summary or exploring the schedule below.
Day 1: February 7, 2024
9 a.m.–2 p.m. Pacific / 10 a.m.–3 p.m. Mountain / 11 a.m.–4 p.m. Central / 12–5 p.m. Eastern
Helpers: Jessica Serrao, Savannah Kelly, Sarah Christensen, Courtney Block, Simon Robins
Before starting | Pre-workshop survey |
---|---|
Session 1 | Data Organization in Spreadsheets (Marion Walton) |
00:00 - 00:15 | Introductions |
00:15 - 00:30 | Introduction & Formatting Data Tables in Spreadsheets |
00:30 - 00:50 | Formatting Problems |
00:50 - 01:00 | Short break |
01:00 - 01:20 | Dates as Data |
01:20 - 01:40 | Quality Assurance |
01:40 - 02:00 | Exporting Data |
02:00 - 03:00 | Long break |
Session 2 | OpenRefine for Social Science Data I (Lyrric Jackson) |
03:00 - 03:15 | Introduction |
03:15 - 03:50 | Working with OpenRefine |
03:50 - 04:00 | Short break |
04:00 - 04:30 | Filtering and Sorting with OpenRefine |
04:30 - 05:00 | Examining Numbers in OpenRefine |
Day 2: February 8, 2024
9 a.m.–2 p.m. Pacific / 10 a.m.–3 p.m. Mountain / 11 a.m.–4 p.m. Central / 12–5 p.m. Eastern
Helpers: Jessica Serrao, Sarah Christensen, Courtney Block, Simon Robins
Session 1 | OpenRefine for Social Science Data II (Lyrric Jackson) |
---|---|
00:00 - 00:45 | Using Scripts |
00:45 - 01:00 | Short break |
01:00 - 01:30 | Exporting and Saving Data from OpenRefine |
01:30 - 02:00 | Other Resources in OpenRefine |
02:00 - 03:00 | Long break |
Session 2 | R for Social Scientists I (Jia Qi Beh) |
03:00 - 03:45 | Before we start |
03:45 - 04:00 | Short break |
04:00 - 04:30 | Introduction to R |
04:30 - 05:00 | Starting with Data |
Day 3: February 9, 2024
9 a.m.–2 p.m. Pacific / 10 a.m.–3 p.m. Mountain / 11 a.m.–4 p.m. Central / 12–5 p.m. Eastern
Session 1 | R for Social Scientists II (Lyrric Jackson) |
---|---|
00:00 - 00:45 | Data Wrangling with dplyr |
00:45 - 01:00 | Short break |
01:00 - 02:00 | Data Wrangling with tidyr |
02:00 - 03:00 | Long break |
Session 2 | R for Social Scientists II (Jia Qi Beh) |
03:00 - 03:50 | Data Visualisation with ggplot2 |
03:50 - 04:00 | Short break |
04:00 - 04:20 | Getting started with R Markdown |
04:20 - 04:50 | Getting started with Processing JSON data |
04:50 - 05:00 | Final wrap up and Post survey |
This workshop was hosted by IRDL and our partner, SCELC.