2007/01/15

Automatically importing Pubmed records in DSpace

Two programs are already developped:

  1. An ingestion module which can import an XML file for which an XSLT crosswalk has been configured: this XML file can contain multiple bibliographic records which are then all loaded automatically.
  2. An XSLT crosswalk from Pubmed XML format to an DSpace "overloaded DC" format.

This XSLT uses a Java function for Medline date conversion (we did not succeed to use JavaScript up to now): the corresponding compiled java class must therefore be included in the CLASSPATH.

When the development will be stabilised and documented, we will be happy to contribute it to DSpace community. Meanwhile, you can follow the links for an "as is" copy.

NLM Entrez is offering a collection of tools to automatically retrieve PubMed information.

A sample file is provided by NLM for testing purposes. The import result can be viewed on line.

No comments: