Skip to content

Commit

Permalink
update fyi boxes
Browse files Browse the repository at this point in the history
  • Loading branch information
KatherineCox committed Sep 1, 2022
1 parent 3af062b commit bd68aca
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion 05-student-guide.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@ This process continues, with each PC capturing a decreasing amount of variation,
The first PCs will capture most heterogeneity (and thus most of the signal) in our high-dimensional dataset by definition, while the variation in later PCs will mostly represent noise. Thus, we can use the first PCs to both examine structure in our dataset as well as for downstream analyses to reduce computational work and improve the fit of our statistical models.


::: {.fyi}
::: {.dictionary}
GOING DEEPER: The math behind PCA

PCA is a method of matrix factorization. Essentially, the PCA approach allows a researcher to break down (or decompose) a large data matrix into smaller constituent parts (similar to the way we can break the number 30 into the factors 5 and 6). Each smaller matrix can be summarized by a vector, called the eigenvector. This vector can be resized; how much it is resized can be summarized by a number called the eigenvalue. You can also think of the eigenvalue as the variance of the eigenvector.
Expand Down

0 comments on commit bd68aca

Please sign in to comment.