Docs
Public data dump
We are committed to providing a data dump service for the community but, in order to ensure that DOAJ's data is used for the benefit of the entire community and under the terms of the licenses that accompany the data, access to the dumps is granted on a case-by-case basis. If you would like access, please email our Help Desk stating the following information: - your name - the group, organisation or company you represent or that wants to use the metadata - your group/organisation/company address, including country - whether you want access to the journal or article metadata, or both - what you want to use the metadata for. Include as much information as possible.
An exportable version of the journal metadata is also available (CSV format).
Structure
The data dumps are structured as follows:
- When you unzip/untar the file, you will have a single directory of the form
doaj_zx[type]_data_[date generated]
. - Inside that directory, you will find a list of files with names of the form
[type]_batch_[number].json
.- For example,
journal_batch_3.json
orarticle_batch_27.json
.
- For example,
- Each file contains up to 100,000 records and is UTF-8 encoded. All files should contain the same number of records, apart from the last one, which may have fewer.
- The structure of each file is as a JSON list:
[ { ... first record ... }, { ... second record ... }, { ... third record ...}, ... etc ... ]
- Records are not explicitly ordered and the order is not guaranteed to remain consistent across data dumps produced on different days.