
dataset is a command line tool demonstrating the dataset package for managing JSON documents stored on disk or in cloud storage. A dataset is organized around collections. Each collection contains JSON documents and their attachments. In addition to the JSON documents, dataset maintains metadata for managing the documents and any data frames defined for the collection.
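
To make the collection model concrete, here is a minimal Python sketch of the idea: a collection holds JSON documents addressed by key, and each document can carry attachments. Everything in it is an assumption made for illustration. The `ToyCollection` class, its method names, and the on-disk layout (`document.json`, an `attachments` folder) are hypothetical and are not dataset's actual API or storage format.

```python
import json
from pathlib import Path


class ToyCollection:
    """Hypothetical stand-in for a collection: JSON documents stored by key.

    This is NOT dataset's real layout or API, only a model of the concept.
    """

    def __init__(self, root):
        # The collection is just a directory; create it if it is missing.
        self.root = Path(root)
        self.root.mkdir(parents=True, exist_ok=True)

    def _doc_dir(self, key):
        # Each document gets its own folder named after its key (assumed layout).
        return self.root / key

    def create(self, key, document):
        # Store the JSON document under its key.
        doc_dir = self._doc_dir(key)
        doc_dir.mkdir(parents=True, exist_ok=True)
        (doc_dir / "document.json").write_text(json.dumps(document, indent=2))

    def read(self, key):
        # Load the JSON document back as a Python object.
        return json.loads((self._doc_dir(key) / "document.json").read_text())

    def attach(self, key, filename, data):
        # Attachments live beside the document they belong to.
        att_dir = self._doc_dir(key) / "attachments"
        att_dir.mkdir(parents=True, exist_ok=True)
        (att_dir / filename).write_bytes(data)

    def attachments(self, key):
        # List attachment file names for one document, sorted for stable output.
        att_dir = self._doc_dir(key) / "attachments"
        return sorted(p.name for p in att_dir.iterdir()) if att_dir.is_dir() else []

    def keys(self):
        # Every sub-directory of the collection is treated as one document key.
        return sorted(p.name for p in self.root.iterdir() if p.is_dir())
```

With this sketch, `ToyCollection("demo.ds").create("freda", {"name": "Freda"})` writes one document, and `attach("freda", "notes.txt", b"...")` stores a file alongside it. The real tool also tracks collection-level metadata and data frames, which this sketch leaves out.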