Invenio RDM allows OAI-PMH metadata to be managed in “sets” for selective harvesting.
To view the OAI-PMH sets that currently exist within the repository, an Administrative user clicks “OAI-PMH” in the left-hand sidebar of the Administrative panel, under the “Site Management” header.
A new OAI-PMH set is automatically created for each collection as part of the collection creation process, which means each collection has its own, unique OAI-PMH endpoint that can be harvested by anyone interested only in that collection’s contents.
Each OAI-PMH set is defined by a search query.
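From a harvester's side, the sets a repository exposes can also be discovered with the standard OAI-PMH ListSets request. The sketch below only shows how such a response could be read. The sample XML, the set specs, and the set names in it are hypothetical, not taken from a real repository.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the OAI-PMH 2.0 specification.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def parse_list_sets(xml_text):
    """Return (setSpec, setName) pairs from an OAI-PMH ListSets response."""
    root = ET.fromstring(xml_text)
    pairs = []
    for set_el in root.findall("./oai:ListSets/oai:set", OAI_NS):
        spec = set_el.findtext("oai:setSpec", default="", namespaces=OAI_NS)
        name = set_el.findtext("oai:setName", default="", namespaces=OAI_NS)
        pairs.append((spec.strip(), name.strip()))
    return pairs


# Hypothetical response body, trimmed to the elements used above.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListSets>
    <set><setSpec>community-example</setSpec><setName>Example Collection</setName></set>
    <set><setSpec>theses</setSpec><setName>Theses</setName></set>
  </ListSets>
</OAI-PMH>
"""

for spec, name in parse_list_sets(SAMPLE_RESPONSE):
    print(f"{spec}: {name}")
```

A real harvester would fetch the response over HTTP and follow any resumption token the server returns. Both steps are left out here to keep the sketch self-contained.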
Creating New OAI-PMH Sets
New OAI-PMH sets can be created by clicking the green “Create” button above and to the right of the list of all existing sets.
The Administrative user will be presented with a form requesting the following information:
Set name (a short human-readable string naming the set, required)
Set spec (an identifier for the set, which cannot be edited after the set is created, required)
Search query (defines the scope of content to be harvested from the OAI-PMH set endpoint, required). Note: search queries can be complex text strings that reference metadata fields unfamiliar to most users. Your Repository Account Manager is happy to help you generate the string needed to achieve your desired results.
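As an illustration of the three required fields, the sketch below models a set as a small Python record and checks it before submission. The class name, the messages, and the sample query are hypothetical. The set spec pattern follows the setSpec grammar in the OAI-PMH 2.0 schema, and it is an assumption that the form enforces the same rule.

```python
import re
from dataclasses import dataclass

# setSpec grammar from the OAI-PMH 2.0 schema: colon-separated segments
# made of letters, digits and a few unreserved punctuation characters.
SET_SPEC_RE = re.compile(r"^[A-Za-z0-9\-_.!~*'()]+(:[A-Za-z0-9\-_.!~*'()]+)*$")


@dataclass
class OaiSetDraft:
    """Hypothetical stand-in for the three fields on the Create form."""
    name: str
    spec: str
    search_query: str

    def problems(self):
        """Return a list of human-readable issues; empty means the draft looks valid."""
        issues = []
        if not self.name.strip():
            issues.append("Set name is required.")
        if not self.spec.strip():
            issues.append("Set spec is required.")
        elif not SET_SPEC_RE.match(self.spec):
            issues.append("Set spec contains characters OAI-PMH does not allow.")
        if not self.search_query.strip():
            issues.append("Search query is required.")
        return issues


# Illustrative values only; the query field name is an assumption.
draft = OaiSetDraft("Datasets", "datasets", "metadata.resource_type.id:dataset")
print(draft.problems())
```

Because a set spec cannot be changed after creation, it is worth checking it carefully before saving.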
Editing OAI-PMH Sets
System-generated sets cannot be edited. This is rarely an issue, but if you want a system-generated set to output in a format other than OAI Dublin Core, you must create a new OAI-PMH set that uses the same search query with a different set spec.
Editing an OAI-PMH Set Name or Search Query
To edit the set name or search query of a human-generated OAI-PMH set, an Administrative user navigates to the desired set in the list of all existing sets and clicks the gray “Edit” button on the far right side of the item’s row. The basic editor form for the OAI-PMH set is then shown. Set specs cannot be edited under any circumstances.
After making any desired changes, the user clicks the blue “Save” button.
OAI-PMH Set Output Formats
By default, OAI-PMH sets have their output format set to OAI Dublin Core, but the system also generates a feed for each set in numerous other formats, including:
marcxml
dcat
marc21
datacite
oai_datacite
datacite64
oai_datacite64
These separate output feeds may be consumed from specific URL endpoints.
To obtain the specific format-based URL endpoint for an OAI-PMH set created by an Administrative user, the user should locate the set in the complete list of sets and click the linked “set spec” in the set spec column. This will take the user to the OAI-PMH Details screen.
On the set’s details screen, the “Format” dropdown menu can be found to the far right of the “Links” header.
Each format-based URL endpoint for an OAI-PMH set contains the id of the format within it (after “metadataPrefix=”). The URLs follow this pattern:
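As a minimal sketch, the per-format URLs for a set can be assembled from the standard OAI-PMH request parameters (verb, metadataPrefix, set). The base URL and set spec below are hypothetical, `oai_dc` is the standard prefix for OAI Dublin Core, and the parameter order is an assumption. The link shown on the set’s details screen is the authoritative value.

```python
from urllib.parse import urlencode

# Format ids listed above, plus oai_dc for the default OAI Dublin Core output.
METADATA_PREFIXES = [
    "oai_dc", "marcxml", "dcat", "marc21",
    "datacite", "oai_datacite", "datacite64", "oai_datacite64",
]


def set_feed_urls(base_url, set_spec, prefixes=METADATA_PREFIXES):
    """Map each metadata prefix to a ListRecords URL for the given set."""
    urls = {}
    for prefix in prefixes:
        query = urlencode({
            "verb": "ListRecords",
            "metadataPrefix": prefix,
            "set": set_spec,
        })
        urls[prefix] = f"{base_url}?{query}"
    return urls


# Hypothetical repository endpoint and set spec, for illustration only.
for prefix, url in set_feed_urls("https://repository.example.org/oai2d", "theses").items():
    print(prefix, url)
```

Switching formats changes only the value after “metadataPrefix=”, so a harvester can keep one base URL and set spec and vary the prefix.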