Proportion of Women on the Streets

Load library

library(dplyr)

Load data

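# githubdir is assumed to be defined elsewhere (e.g., in .Rprofile)
# and to point to the directory that holds the women-count repository
setwd(githubdir)
setwd("women-count/")
mturk <- read.csv("data/batch_2808915_batch_results.csv")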

Range of date and time

range(mturk$Input.exif_date)
## [1] "2016-11-12" "2017-01-11"
range(mturk$Input.exif_time)
## [1] "10:09:03" "19:00:04"

Total number of unique locations

length(unique(mturk$Input.id))
## [1] 196

Total number of photos

length(unique(mturk$Input.file_id))
## [1] 1958

Average estimate per photo (averaging the three ratings)

avg_est <-
  mturk %>%
  group_by(Input.file_id) %>%
  summarize(
    total_n = mean(as.numeric(Answer.total_people), na.rm = TRUE),
    total_men = mean(as.numeric(Answer.total_men), na.rm = TRUE)
  )
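
As a rough illustration of what the per-photo averaging does, here is a minimal sketch on made-up data. The `toy` data frame and its values are hypothetical; only the column names mirror the MTurk results file. It assumes the answer columns may arrive as text with missing entries, which is why they are converted with `as.numeric` and averaged with `na.rm = TRUE`.

```r
library(dplyr)

# Hypothetical toy data: two photos, three ratings each (values are made up).
toy <- data.frame(
  Input.file_id = c("a", "a", "a", "b", "b", "b"),
  Answer.total_people = c("10", "12", "11", "4", NA, "6"),
  Answer.total_men = c("8", "9", "10", "3", NA, "5"),
  stringsAsFactors = FALSE
)

# Average the ratings within each photo, ignoring missing ratings.
toy_est <- toy %>%
  group_by(Input.file_id) %>%
  summarize(
    total_n = mean(as.numeric(Answer.total_people), na.rm = TRUE),
    total_men = mean(as.numeric(Answer.total_men), na.rm = TRUE)
  )

print(toy_est)
```

Photo "b" has one missing rating, so its averages are taken over the two remaining ratings rather than three.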

Total people and total men

totals <- colSums(avg_est[, 2:3])

Proportion of men

totals[2]/totals[1]
## total_men 
## 0.8146494

Average proportion of men across photos

avg_prop <- avg_est$total_men / avg_est$total_n

# Average the per-photo proportions; drop NAs and values of 30 or more
mean(avg_prop[avg_prop < 30], na.rm = TRUE)
## [1] 0.8666389
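
The two summaries above answer slightly different questions, which is one plausible reason the numbers differ. The sketch below uses a hypothetical `toy_est` data frame with made-up values to contrast the pooled proportion (total men over total people) with the mean of per-photo proportions. The last line assumes every counted person was coded as either a man or a woman, which is an assumption about the rating task rather than something the data confirm.

```r
# Hypothetical per-photo averages (made-up values).
toy_est <- data.frame(
  total_n = c(20, 2, 10),
  total_men = c(10, 2, 8)
)

# Pooled proportion: total men over total people, so busy photos weigh more.
pooled <- sum(toy_est$total_men) / sum(toy_est$total_n)

# Mean of per-photo proportions: every photo weighs the same.
per_photo <- mean(toy_est$total_men / toy_est$total_n, na.rm = TRUE)

# Under the two-category assumption, the share of women is the complement.
pooled_women <- 1 - pooled

print(c(pooled = pooled, per_photo = per_photo, pooled_women = pooled_women))
```

In this toy case the sparsely populated photo with only men pulls the per-photo mean above the pooled proportion.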