Describe political science/panel-series data problems.
Introduce psData, our first attempt at a solution.
Thoughts for future framework.
Get ideas for going forward/avoiding effort duplication.
| Country | Year | Var1 | Var2 | Var3 | Var4 | Var5 | Var6 | Var7 | Var8 |
|---|---|---|---|---|---|---|---|---|---|
| Angola | 2000 |  |  |  |  |  |  |  |  |
| Angola | 2001 |  |  |  |  |  |  |  |  |
| Brazil | 2000 |  |  |  |  |  |  |  |  |
| Brazil | 2001 |  |  |  |  |  |  |  |  |
| Cambodia | 2000 |  |  |  |  |  |  |  |  |
| Cambodia | 2001 |  |  |  |  |  |  |  |  |
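The country-year layout in the table can be sketched in base R as an empty panel skeleton. This is only an illustration: the object name `skeleton` is hypothetical, and the countries and years are simply the ones shown in the table.

```r
# Build one row per country-year combination.
# expand.grid() varies its first argument fastest, so putting year first
# yields rows ordered by country, then year.
skeleton <- expand.grid(
  year = 2000:2001,
  country = c("Angola", "Brazil", "Cambodia"),
  stringsAsFactors = FALSE
)

# Reorder columns to match the Country | Year layout of the table
skeleton <- skeleton[, c("country", "year")]
```

Variables can then be attached to this skeleton one at a time, keyed on country and year.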
Many political scientists are creating/making publicly available panel-series data sets!
Data tied to particular studies.
Data posted and maintained haphazardly.
Variety of panel/series identifiers, even for regularly used panels (especially countries).
Some variables suggested by the literature (e.g. winset) are composed of other variables, but aren’t regularly updated.
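To make the identifier problem concrete, here is a minimal base R sketch that maps inconsistent country names onto a shared ISO 2-letter code. The lookup table and the `study_a` data frame are hypothetical and invented for illustration; they are not part of psData, and a dedicated package such as countrycode could stand in for the hand-written lookup.

```r
# Hypothetical lookup table: country name variants -> ISO 2-letter code
lookup <- data.frame(
  name  = c("Angola", "Brazil", "Brasil", "Cambodia", "Kampuchea"),
  iso2c = c("AO", "BR", "BR", "KH", "KH"),
  stringsAsFactors = FALSE
)

# Toy data set with inconsistent country names (not real data)
study_a <- data.frame(
  country = c("Brasil", "Kampuchea", "Angola"),
  year    = c(2000, 2000, 2001),
  stringsAsFactors = FALSE
)

# match() gives the lookup row for each name, or NA if the name is unknown
study_a$iso2c <- lookup$iso2c[match(study_a$country, lookup$name)]

# Collect names that could not be standardized instead of dropping them silently
unmatched <- study_a$country[is.na(study_a$iso2c)]
```

Once every data set carries the same identifier column, data sets from different studies can be combined without name-by-name cleanup.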
R package psData
Master build on CRAN
DpiGet
PolityGet
RRCrisisGet
WinsetCreator
Creator functions source getter functions for raw data.
# Load package
library(psData)
# Download/transform polity2 variable
PolityData <- PolityGet(vars = "polity2")
head(PolityData)
## iso2c country year polity2
## AF Afghanistan 1800 -6
## AF Afghanistan 1801 -6
## AF Afghanistan 1802 -6
## AF Afghanistan 1803 -6
## AF Afghanistan 1804 -6
## AF Afghanistan 1805 -6
# Download/transform Reinhart and Rogoff (2010) crisis data
RRData <- RRCrisisGet()
head(RRData)[1:5]
## iso2c country year RR_Independence RR_CurrencyCrisis
## AO Angola 1800 0 0
## AO Angola 1801 0 0
## AO Angola 1802 0 0
## AO Angola 1803 0 0
## AO Angola 1804 0 0
## AO Angola 1805 0 0
# Create winset and selector variables
WinsetData <- WinsetCreator()
head(WinsetData)
## iso2c country year W ModS
## AF Afghanistan 1975 0.25 0
## AF Afghanistan 1976 0.25 0
## AF Afghanistan 1977 0.25 0
## AF Afghanistan 1989 0.50 0
## AF Afghanistan 1990 0.50 0
## AF Afghanistan 1991 0.50 0
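Because the outputs above share `iso2c` and `year` columns, they can be combined with a standard merge. The sketch below uses small toy data frames rather than the real downloads, so it runs without a network connection; `polity_toy` and `winset_toy` are hypothetical names and their values are invented.

```r
# Toy stand-ins for two getter/creator outputs (values are invented)
polity_toy <- data.frame(
  iso2c   = c("AF", "AF", "AO"),
  year    = c(1975, 1976, 1975),
  polity2 = c(-7, -7, -7),
  stringsAsFactors = FALSE
)
winset_toy <- data.frame(
  iso2c = c("AF", "AF", "BR"),
  year  = c(1975, 1976, 1975),
  W     = c(0.25, 0.25, 0.50),
  stringsAsFactors = FALSE
)

# Inner join: keep only country-years present in both data sets
merged <- merge(polity_toy, winset_toy, by = c("iso2c", "year"))

# Full join: keep every country-year, filling gaps with NA
merged_all <- merge(polity_toy, winset_toy, by = c("iso2c", "year"), all = TRUE)
```

The full join makes coverage gaps visible as `NA` values, which is often preferable to losing unmatched country-years without notice.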
Development at rOpenGov.
Ideas for avoiding effort duplication with others building open data frameworks?
Twitter: @chrisgandrud
GitHub: rOpenGov/psData