Scrape ISU salary data by year from the web and turn it into a tidy dataframe

See CyChecks

sal_df(limit = 1000, offset = 0, fiscal_year = 2007, token = NULL)

Arguments

limit

The number of data entries (rows in the dataframe) you'd like to receive (default = 1000)

offset

Where you'd like the data entries to start pulling from (default = 0). If using the default of 0, then the dataset will start pulling from the beginning of alphabet. This argument is useful to change when using the function multiple times.

fiscal_year

The fiscal year the data are taken from. Limited to 2007-2018. Can only enter one fiscal year at a time.

token

An API token. Only necessary for large amounts of datascraping. Generated from this website

Value

A dataframe with salary information, position, and gender for Iowa State University employees in a given fiscal year.

Details

An API (or APP) token isn't necessary for scraping data, but it will help speed up the data grabbing process and will allow users to get nearly unlimited data.

Examples

sal_df()
#> # A tibble: 1,000 x 8 #> base_salary_date fiscal_year gender name place_of_reside~ position #> <dttm> <dbl> <chr> <chr> <chr> <chr> #> 1 2007-07-01 00:00:00 2007 M ABAS~ STORY POSTDOC~ #> 2 2007-07-01 00:00:00 2007 F ABAT~ STORY CASUAL ~ #> 3 2007-07-01 00:00:00 2007 M ABBA~ COOK SYS SUP~ #> 4 2007-07-01 00:00:00 2007 M ABBA~ OAKLAND CASUAL ~ #> 5 2007-07-01 00:00:00 2007 M ABBE~ STORY GRAD AS~ #> 6 2007-07-01 00:00:00 2007 M ABBE~ STORY CASUAL ~ #> 7 2007-07-01 00:00:00 2007 F ABBO~ STORY COMM SP~ #> 8 2007-07-01 00:00:00 2007 F ABBO~ BOONE ASST MG~ #> 9 2007-07-01 00:00:00 2007 M ABBO~ STORY PROF #> 10 2007-07-01 00:00:00 2007 F ABBO~ STORY CASUAL ~ #> # ... with 990 more rows, and 2 more variables: total_salary_paid <dbl>, #> # travel_subsistence <dbl>
x <- sal_df(limit = 10, fiscal_year = 2015)