-
Notifications
You must be signed in to change notification settings - Fork 12
/
Copy pathindex.qmd
152 lines (109 loc) · 9.9 KB
/
index.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
---
title: Designing the data science classroom
---
------------------------------------------------------------------------
🗓️ July 25 and 26, 2022
⏰ 09:00 - 17:00
🏨 Maryland 3
✍️ [Click here to register](https://www.rstudio.com/conference/2022/workshops/teach-ds/)
------------------------------------------------------------------------
# Overview
```{=html}
<script src="https://fast.wistia.com/embed/medias/o8orjrabxc.jsonp" async></script><script src="https://fast.wistia.com/assets/external/E-v1.js" async></script><div class="wistia_responsive_padding" style="padding:56.25% 0 0 0;position:relative;"><div class="wistia_responsive_wrapper" style="height:100%;left:0;position:absolute;top:0;width:100%;"><div class="wistia_embed wistia_async_o8orjrabxc videoFoam=true" style="height:100%;position:relative;width:100%"><div class="wistia_swatch" style="height:100%;left:0;opacity:0;overflow:hidden;position:absolute;top:0;transition:opacity 200ms;width:100%;"><img src="https://fast.wistia.com/embed/medias/o8orjrabxc/swatch" style="filter:blur(5px);height:100%;object-fit:contain;width:100%;" alt="" aria-hidden="true" onload="this.parentNode.style.opacity=1;" /></div></div></div></div>
```
<br>
There has been significant innovation in introductory statistics and data science courses to equip students with the statistical, computing, and communication skills needed for modern data analysis.
Innovating subsequent courses is also important, so students can continue developing these skills beyond the first course.
In this session, we'll present a modern approach to teaching undergraduate regression analysis, the second statistics course for many students.
We'll share strategies for using real-world data sets and examples, teaching modern computing skills, and incorporating non-technical skills such as writing and effective collaboration as part of the course.
We'll share example activities and assignments, along with a demo of the computing toolkit using the R tidymodels package, Quarto for reproducible reports, and Git and GitHub for version control and collaboration.
The activities and demo will be hands-on; attendees will also have the opportunity to exchange ideas and ask questions throughout the session.
Success in data science and statistics is dependent on the development of both analytical and computational skills, and the demand for educators who are proficient at teaching both these skills is growing.
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum.
In addition to gaining technical knowledge, participants will engage in discussion around the decisions that go into developing a data science curriculum and choosing workflows and infrastructure that best support the curriculum and allow for scalability.
Workshop attendees will work through several exercises from existing courses and get first-hand experience with doing and teaching data science with the tidyverse and tidymodels packages.
Attendees will also get experience using relevant tool-chains and techniques, including running a course on RStudio Cloud, literate programming with Quarto, and workflows for collaboration, version control, and automated feedback with Git/GitHub.
We will also discuss best practices for configuring and deploying classroom infrastructures to support these tools.
This workshop is aimed primarily at participants teaching data science in an academic setting in semester-long courses, however much of the information and tooling we introduce is applicable for shorter teaching experiences like workshops and bootcamps as well.
Basic knowledge of R is assumed and familiarity with the tidyverse and Git is preferred.
## Learning objectives
Curriculum, workflow, and infrastructure design for teaching data science with R and RStudio.
## Is this course for me?
If your answer to any of the following questions is "yes", then this is the right workshop for you.
- Do you want to learn / discuss curriculum, pedagogy, and computing infrastructure design for teaching data science with R and RStudio?
- Are you interested in setting up your class in RStudio Cloud?
- Do you want to integrate version control with git into your teaching and learn about tools and best practices for running your course on GitHub?
## Prework
In this workshop you will wear two hats: the educator and the learner.
At times we will be demoing workflows for instructors (whom I assume are familiar with R, RStudio, and the tidyverse and have taught with or are interested in teaching with RStudio Cloud, Git, and GitHub) and at other times you will be working through the student view (on RStudio Cloud, assuming you're not using your local setup).
### Watch
For the latter, you can mostly assume that you're a student in an intro data science course and this is the first day of class (i.e. there's no expectation of prep).
However we'd also like to give you a taste of how the remainder of the semester goes, which often entails students watching background videos to prepare.
To prepare for the first module, please watch the following video.
::: {style="text-align: center;"}
```{=html}
<iframe width="560" height="315" align="middle" src="https://www.youtube.com/embed/fQ4t7p6ZXDg" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
```
::: {style="align: center;"}
🖥️ [Slides](https://bit.ly/let-eat-cake-cc)
:::
:::
### Set up
For the former, you'll might need to prepare a bit more.
If you already are a Git and GitHub user, have a personal token saved, and can push to and pull from GitHub without having to type your password in each time, you're good to go.
If not, please complete the following steps before the workshop[^1].
If you need help with any of the steps, please ask on [GitHub Discussions](https://github.com/rstudio-conf-2022/teach-ds/discussions).
[^1]: These steps will direct you to relevant chapters from [*Happy Git with R*](https://happygitwithr.com/) by Jenny Bryan et.
al.
It's such a great and comprehensive resource that I have not felt the need to replicate the information elsewhere.
- [Register a free GitHub account](https://happygitwithr.com/github-acct.html#github-acct)
- [Install or update R and RStudio](https://happygitwithr.com/install-r-rstudio.html#install-r-rstudio)
- [Install Git](https://happygitwithr.com/install-git.html#install-git)
- [Introduce yourself to Git](https://happygitwithr.com/hello-git.html#hello-git)
- [Personal access token for HTTPS](https://happygitwithr.com/https-pat.html)
- [Connect to GitHub](https://happygitwithr.com/push-pull-github.html)
- [Connect RStudio to Git and GitHub](https://happygitwithr.com/rstudio-git-github.html)
- [Detect Git from RStudio](https://happygitwithr.com/rstudio-see-git.html)
### Install
If you choose to use your local R and RStudio install for the workshop, you should also install the following packages:
- From CRAN:
```{r}
#| eval: false
install.packages("devtools")
install.packages("DT")
install.packages("fivethirtyeight")
install.packages("gapminder")
install.packages("ghclass")
install.packages("learnr")
install.packages("lubridate")
install.packages("nycflights13")
install.packages("openintro")
install.packages("rvest")
install.packages("shiny")
install.packages("sortable")
install.packages("tidyverse")
install.packages("tidymodels")
install.packages("unvotes")
```
- From GitHub:
```{r}
#| eval: false
devtools::install_github("rundel/learnrhash")
```
## RStudio Cloud
You can join the RStudio Cloud workspace for this workshop at [**rstd.io/teach-ds-conf22-cloud**](https://rstd.io/teach-ds-conf22-cloud).
# Instructors
{style="float:right;padding: 0 0 0 10px;" fig-alt="Headshot of Dr. Mine Çetinkaya-Rundel" width="200"}
[**Dr. Mine Çetinkaya-Rundel**](http://mine-cr.com/) (she/her) is Professor of the Practice at Duke University and Developer Educator at RStudio.
Mine's work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education as well as pedagogical approaches for enhancing retention of women and under-represented minorities in STEM.
Mine works on integrating computation into the undergraduate statistics curriculum, using reproducible research methodologies and analysis of real and complex datasets.
Mine works on the OpenIntro project, whose mission is to make educational products that are free, transparent, and lower barriers to education.
As part of this project she co-authored four open-source introductory statistics textbooks.
She is also the creator and maintainer of datasciencebox.org and she teaches the popular Statistics with R MOOC on Coursera.
Mine is a Fellow of the ASA and Elected Member of the ISI as well as the winner of the 2021 Robert V. Hogg Award for For Excellence in Teaching Introductory Statistics.
{style="float:right;padding: 0 0 0 10px;" fig-alt="Headshot of Dr. Maria Tackett" width="200"}
[**Dr. Maria Tackett**](https://mariatackett.net) is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University.
Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One.
Her current work focuses on understanding how active learning strategies can be used to promote engagement and student motivation in undergraduate statistics courses.
She also studies how classroom practices in introductory math and statistics courses impact students' sense of community, self-efficacy, and learning outcomes.
Maria is an RStudio certified trainer and is actively involved in the R and statistics education communities.