Appendix C β Technical resources for intermediate to advanced data scientists
C.1 R programming
C.1.1 Tidyverse
|  | tidyverse | Collection of R packages designed for data science | 
| Link | Recommendation | 
|---|---|
| Introduction to the Tidyverse (JHU Data Science Labβs lectures) | β β β β β | 
| Importing data in the Tidyverse (JHU Data Science Labβs lectures) | β β β β β | 
| Wranging data in the Tidyverse | |
| Visualizing data in the Tidyverse | |
| Modeling Data in the Tidyverse | 
| Link | |
|---|---|
| Advanced R Programming | |
| Building R packages | |
| Building Data Visualization Tools | |
| Advanced R and solutions | 
C.1.2 Spatial analyses
| Link | Recommendation | 
|---|---|
| afrimapr - mapping data in Africa | 
C.1.3 Literature reviews
| metaverse | Collection of R packages designed for data science | |
| litsearchr | Excellent R package for supporting evidence synthesis generation | 
There are very well documented) vignettes/tutorials
- https://luketudge.github.io/litsearchr-tutorial/litsearchr_tutorial.html
- https://elizagrames.github.io/litsearchr/litsearchr_vignette.html
- https://elizagrames.github.io/litsearchr/introduction_vignette_v010.html
C.1.4 Multilanguage programming
| Link | Language | Recommendation | 
|---|---|---|
| Harvard Data Science workshops materials | R, Python, Stata | 
C.1.5 Machine learning
| MOOC | Language | Recommendation | 
|---|---|---|
| Machine learning in Python with scikit-learn | Python | β β β β β | 
| Machine learning for healthcare | 
Supervised Machine Learning for Text Analysis in R
https://ocw.mit.edu/courses/6-0002-introduction-to-computational-thinking-and-data-science-fall-2016/
https://ocw.mit.edu/courses/6-438-algorithms-for-inference-fall-2014/
C.1.6 Comprehensive lectures
| Link | Tools | Recommendation | Pricing | 
|---|---|---|---|
| Principles, Statistical and Computational Tools for Reproducible Data Science | R and Rstudio Python, Git, and GitHub | 
 | |
| Collaborative Data Science for Healthcare | Experience with R, Python and/or SQL is required | 
 | |
| https://www.coursera.org/learn/data-public-health | |||
C.2 Data collection workflows
| 
 | ruODK | |
| sitrep | ||
| REDCap+R | ||
| REDCapTidieR | 
C.2.1 Python programming
C.3 Books and websites
C.3.1 Git and GitHub
| Link | Recommendation | 
|---|---|
| GitHub Guides | |
| Happy Git and GitHub for the useR | 
C.3.2 Reproducible research
- The Turing way (1) β β β β β
| Link | Language | Recommendation | 
|---|---|---|
| R for applied epidemiology and public health (2) | R | β β β β β | 
| R for Epidemiology | R | |
| R4epis | R | |
| R for Data Science | R | β β β β β | 
| Quarto for scientists | R | |
| Tidyverse Skills for Data Science (jhudatascience.org) | R | 
C.4 Communities
Conferences
https://www.r-project.org/conferences/ in general in June-July Fees for Tanzanian participants around 6 USD / person
C.5 Statistics
https://rmisstastic.netlify.app/
| grf | 
C.6 Quarto
Styling PDF documents with Quarto extensions - April 2024
Resource from Nicola Rennie
 
      