Published June 27, 2023
| Version v1
Dataset
Open
Data from "Measuring data rot: an analysis of the continued availability of shared data from a single university"
Description
Data files from the article "Measuring Data Rot" by Kristin Briney.
This research looked at supplemental data links from publications in CaltechAUTHORS and tested them for their availability on the web using web scraping and hand testing in the Chrome browser.
Data in the tables:
- Table1_LinkType.csv
- Table2_URLwebsites.csv
- Table3_DOIwebsites.csv
- Table4_UnavailableByType.csv
- Table5_UnavailableURLs.csv
- Table6_UnavailableDOIs.csv
Data in the figures:
- Figure1_LinksByYear.csv
- Figure2_UnavailableByYear.csv
Data from the project:
- DataRot.csv
- Overall dataset supporting this research, with variables defined in the data dictionary. This data contains all of the links tested, listing results of the webscraping but not results of the hand testing.
- DataRot_dataDictionary.csv
- Data dictionary defining variable names and values for DataRot.csv
- DataRot_handTested.csv
- Subset of supplemental data links from DataRot.csv that were hand tested and the results of the hand testing ("browser_test = TRUE" means the data was available, "browser_test = FALSE" means the data was not available, and "browser_test = LOGIN" means the webpage asked for a login to see the data).
- DataRot_missingData.csv
- Subset of DataRot_handTested.csv with fewer variables. This dataset only includes supplemental data links for data that was not available.
Files
Figure1_LinksByYear.csv
Files
(624.5 kB)
| Name | Size | Actions |
|---|---|---|
|
md5:2ec65c86a2ff088a47c6092177309461
|
234 Bytes | Preview Download |
|
md5:d6babd9567e930b32fb701d241b418d5
|
525.5 kB | Preview Download |
|
md5:d7fe1d4f67cd49738a6c54fc62fb13e4
|
56.2 kB | Preview Download |
|
md5:6ce0255022f3490c7d0e4544b3c8afa2
|
1.6 kB | Preview Download |
|
md5:1c5349e25d7e6537e20e26def3293780
|
850 Bytes | Preview Download |
|
md5:cba7bd78ca6c056878b8f2b8687a5c23
|
272 Bytes | Preview Download |
|
md5:e2ffbbb80918f6232fcfd78aabcc9532
|
1.7 kB | Preview Download |
|
md5:a4d59507a62758a8406f0cc5821d603d
|
188 Bytes | Preview Download |
|
md5:642fead2e0740fa0b62d9ecc1395ed5a
|
2.2 kB | Preview Download |
|
md5:a636bc6ef81953a9cf4f226df03b378e
|
22.8 kB | Preview Download |
|
md5:33dffd1b06bf9a518c6e708cb12ddeaf
|
1.2 kB | Preview Download |
|
md5:323d6c95b1cd988c56a0537a3deb63cc
|
289 Bytes | Preview Download |
|
md5:b7d6554f49d94c02413423358bdd889a
|
11.5 kB | Preview Download |
Additional details
- Collected
-
2023-05-19Web scraping and hand testing done on May 18 and 19, 2023