Published September 2, 2020 | Version 1.2.0
Software Open

caltechlibrary/eprints2archives: Version 1.2.0 – Coverage improvements and use of HTTP/2

Michael Hucka1
  • 1. ROR icon California Institute of Technology

Description

Changes in this release: * In addition to the record pages, `eprints2archives` now also harvests general URLs from the server, including the top-level URL and `/view` and 2 levels of pages underneath it. However, if a subset of records is requested, only gets those particular `/view/X/N.html` pages rather than all pages under `/view/X/`. * Internal changes allow it to use protocol HTTP/2, which was necessary to communicate with Archive.Today (because it appears to have stopped accepting save requests unless HTTP2 is used). * Now tries to add `https://` or `http://` if the user forgets to provide it, and also removes `/eprint` and adds `/rest` if needed. This makes it possible for the user to just provide a host name and `eprints2archives` will figure out the rest. * Minor improvements to some of the run-time status messages. * More progress bars! * Improvements to debug logging. * Improvements to README.md. * Internal code refactoring.

Files

caltechlibrary_eprints2archives-1.2.0.zip
Files (195.3 kB)
Name Size
md5:4c03cd708f2ce83e0eb68364dfda9bd3
195.3 kB Preview Download

Other

Send records from an EPrints server to the Internet Archive and other web archives

Additional details

Created:
September 8, 2022
Modified:
November 28, 2022