How to convert XML data to a linked collection of PSML documents
|Skills required||XML, XSLT|
|Time required (minutes)||30|
PSML documents are often linked together using cross-references (XRefs). This tutorial will demonstrate how use XSLT to convert two XML files generated by Wikipedia to PSML documents linked together with XRefs. The PSML will then be uploaded to PageSeeder for checking the results.
To complete this tutorial requires:
- Software to read and write zip files (e.g. 7-Zip or on Windows right-click Extract all/Send to Compressed).
- Access to a PageSeeder server with at least a contributor role on the tutorial group.
All the necessary files for this tutorial are on Github .
Run the XSLT code
- The easiest way to do this is to follow the instructions in How to run XSLT from Eclipse but if the Eclipse IDE is not available follow How to run XSLT from the command line. When doing this use the following
xrefs- the name of the data folder.
wikipediafilms.xml– the input XML film data.
films-bios.xsl– the XSLT code.
wikipediabios.xml– the input XML bio data. This must be in the same folder as the XSLT code.
It should create files in the output folder according to the following naming pattern:
bios/bio-1.psml bios/bio-2.psml bios/bio-3.psml etc.... films/film-1.psml films/film-2.psml films/film-3.psml etc....
Before continuing, open some files in a text editor to check the content. Also to understand the XSLT conversion code or PSML markup, review
Adding images to the collection requires storing the image files in a paths relative to the text. For example, the following:
Copy the film images from Github to these folder.
Package and upload the PSML
The final step requires moving the data from a local file system into PageSeeder. Do this via the following steps:
- 'Zip' the
biosfolders into a single zip archive.
- Upload the archive to the PageSeeder group and select the unzip icon (see image below).
- After unzipping the archive, simply continue through the upload.