Computing for the Social Sciences
Computing for the Social Sciences
Home
FAQ
Schedule of Topics
Homework
Setup
Notes
Light
Dark
Automatic
Overview
This section contains lecture notes and exercises for the course.
Relational data: a quick review
Relational data is multiple tables of data that when combined together answer research questions. Relations define the important element, not just the individual tables. Relations are defined between a pair of tables, or potentially complex structures can be built up with more than 2 tables.
Last updated on Nov 15, 2021
datawrangle
Saving the source and blank slates
library(tidyverse) library(here) set.seed(1234) theme_set(theme_minimal()) Save source, not the workspace Your workspace in R is the current working environment. It includes any libraries you have loaded using library(), any user-defined objects (e.
Last updated on May 25, 2021
project-management
Scraping web pages
library(tidyverse) library(rvest) library(lubridate) theme_set(theme_minimal()) Run the code below in your console to download this exercise as a set of R scripts. usethis::use_course("uc-cfss/getting-data-from-the-web-scraping") What if data is present on a website, but isn’t provided in an API at all?
Last updated on Jul 20, 2021
webdata
Selecting optimal color palettes
library(tidyverse) library(sf) library(tidycensus) library(RColorBrewer) library(patchwork) # useful on MacOS to speed up rendering of geom_sf() objects if (!identical(getOption("bitmapType"), "cairo") && isTRUE(capabilities()[["cairo"]])) { options(bitmapType = "cairo") } options(digits = 3) set.seed(1234) theme_set(theme_minimal()) Selection of your color palette is perhaps the most important decision to make when drawing a choropleth.
Last updated on May 25, 2021
dataviz
,
geospatial
Simplifying lists
library(tidyverse) library(httr) library(repurrrsive) set.seed(123) theme_set(theme_minimal()) Run the code below in your console to download this exercise as a set of R scripts. usethis::use_course("uc-cfss/getting-data-from-the-web-api-access") Not all lists are easily coerced into data frames by simply calling content() %>% as_tibble().
Last updated on Nov 15, 2021
webdata
Supervised classification with text data
library(tidyverse) library(tidymodels) library(tidytext) set.seed(1234) theme_set(theme_minimal()) A common task in social science involves hand-labeling sets of documents for specific variables (e.g. manual coding). In previous years, this required hiring a set of research assistants and training them to read and evaluate text by hand.
Last updated on Jan 13, 2022
text
The basics of statistical learning
Statistical models attempt to summarize relationships between variables by reducing the dimensionality of the data. For example, here we have some simulated data on sales of Shamwow in 200 different markets.
Last updated on Jan 5, 2022
stat-learn
The grammar of graphics
This page is a summary of A Layered Grammar of Graphics by Hadley Wickham. I strongly encourage you to read the original article in conjunction with this summary. library(tidyverse) library(knitr) library(palmerpenguins) Google defines a grammar as “the whole system and structure of a language or of languages in general, usually taken as consisting of syntax and morphology (including inflections) and sometimes also phonology and semantics”.
Last updated on Jan 6, 2022
dataviz
Tidy data
library(tidyverse) Most data analysts and statisticians analyze data in a spreadsheet or tabular format. This is not the only way to store information,^[Computer scientists and web developers frequently make use of a range of other data types to store information.
Last updated on May 25, 2021
datawrangle
Topic modeling
library(tidyverse) library(tidymodels) library(tidytext) library(textrecipes) library(topicmodels) library(here) library(rjson) library(tm) library(tictoc) set.seed(1234) theme_set(theme_minimal()) Typically when we search for information online, there are two primary methods: Keywords - use a search engine and type in words that relate to whatever it is we want to find Links - use the networked structure of the web to travel from page to page.
Last updated on Sep 1, 2021
text
«
»
Cite
×