Esploro XML

Importing Research Assets

Esploro repository managers can add research assets in bulk using a Research Assets import profile. You can manage Esploro import profiles on the Import Profiles page (Research > Import Assets > Manage Research Import Profiles). For more information on running an Import Profile see Adding Research Assets in Bulk.

This article refers only to loading research assets using the Esploro XML format. The profile field “Originating system” need to be set to “Esploro” and the profile field “Physical source format” to “XML”.

Input File

The input file containing the assets must be in XML format and must be compatible with the Esploro Records schema.

The schema contains various comments that describe the fields, their formats, and relevance.

Upload of files can be done either manually when running the import profile or from an FTP connection using the connection details defined in your system. For more information on managing FTP connection, see Configuring S/FTP Connections.

Asset Matching

The asset matching mechanism determines whether each asset record exists in the repository. Existing assets are disregarded, while non-existing assets are inserted as new records.

The match is done on a combination of fields:

  • DOI + Brief Title (first 25 characters of the title)
  • PMID + Brief Title
  • ISBN + Brief Title
  • Is Part of ISSN + Brief Title + Is Part of Volume + Is Part of Issue + Is Part of Start page
  • Is Part of ISBN + Brief Title + Is Part of Volume
  • Is Part of ISBN + Brief Title + Is Part of Volume
  • Is Part of Title + Brief Title + Is Part of Volume + Is Part of Issue + Is Part of Start page
  • Is Part of Title + Brief Title + Year + Is Part of Volume + Is Part of Issue
  • Is Part of Title + Brief Title + Is Part of Start page + Year

 

Asset Validations and Report

Various validations, according to the xsd schema, are executed on each record.

There are two validation types: Reject and Warning. If a record fails on a reject validation, the entire record is discarded. If a record fails on a warning validation, the record is inserted.

The following are the reject validations:

  1. Resource type is mandatory
  2. Title is mandatory
  3. Creator is mandatory for ETDs
  4. Degree grantor is mandatory for ETDs
  5. Degree name is mandatory for ETDs
  6. Degree affiliation is mandatory for ETDs
  7. Date is required for publications

 

A list of all validation failures of a given import is kept in the execution report of the import. For more information see Viewing the Research Import Profile Reports.

Importing the Research Asset Files

After executing the Research Assets Import Profile, the final step is to import the actual files for each asset based on the file URL loaded with the research asset previously. For more information see Importing Research Asset Files in Bulk.