Skip to main content

 Glossary

A-Z glossary of PageSeeder concepts

Backup

Requires backing up both the file system and the database.

File system backup

The following is a summary of folders to back up:

Folder (Linux)ContentsBackup instruction
/opt/pageseeder/documents The source of all non-PSML documentsRequired simultaneously with database backup
/opt/pageseeder/WEB-INF/config A PageSeeder configuration including properties and templatesRequired
/opt/pageseeder/WEB-INF/state Indexes, logs, caches and temporary filesRecommended (publish can be excluded)

Documents

The documents folder holds the source files for most documents and must be regularly backed up. To ensure consistency, the backup must be performed simultaneously with the database backup, ensuring that database references to the files remain accurate.

As the documents folder can contain a large number of files and grows over time, it's important to factor this into your capacity planning and management strategy.

If the database and documents folder are not backed up simultaneously, some documents might become dereferenced or reported as missing by PageSeeder.

Config

The config folder must be backed up as it contains all files specific to your PageSeeder installation.

Although they do not change frequently, we recommend backing them up alongside other backups.

State

Files in the state folder are highly volatile and grow over time. All files in this folder are generated by PageSeeder and are not critical for disaster recovery.

While most files can be regenerated, the state folder contains important files that are essential for PageSeeder’s functionality and performance:

  • Indexes: These enable searching and related functions. If lost, PageSeeder administrators need to rebuild them using the administration interface. For installations with many documents or comments, the indexing process can take several hours, delaying full functionality.

  • Caches: These files are automatically regenerated as the system is used. However, their loss can impact performance and degrade the user experience. PageSeeder must be stopped to backup the ehcache files which include the PSML documents cache.

  • Logs: While not necessary for PageSeeder to function, logs might need to be retained for regulatory compliance. Therefore, they generally need be backed up as part of your retention strategy.

The publish folder contains only temporary files and can be safely excluded from backups.

Created on , last edited on