Size in bytes of PageSeeder documents.
PageSeeder does not index or record the file size of PSML documents.
However, when uploading documents, although there are no hard and fast rules for the ideal file size, following the recommendations below ensures that documents are smoothly processable in different contexts.
PSML documents are XML documents, so the general guidelines for XML apply. PageSeeder relies on various types of XML processors and the PSML might also be processed externally, so it is important to bear in mind the following:
- Content with no or few line breaks might be difficult to handle by processors that break down the code per line, this is mostly true for text editors.
- Indentation and comments increase the file size but have little real impact in terms of processing.
- Markup density has a greater impact on processing, especially when the content is processed as a tree structure, in other words, it is easier to process content that is mostly plain text.
PSML file size
As a rule of thumb, for typical PSML content with regular line breaks, minimal indentation and average markup density.
- Any PSML file that is larger than 1Mb is likely to cause problems at some point.
- Keeping a PSML document under 100K or even 10K is a good idea.
The size of individual fragments also matters, especially if editing or diffing is involved.
- The smallest fragment is a paragraph.
- Fragment content should fit within a full window height of a desktop.
Diffing won’t work on large fragments as PageSeeder imposes a hard limit on these.
Non-PSML documents are stored as binary documents and are only processed by the PageSeeder indexer.
Large Word or PDF documents (over 100Mb) might be slow to index.