Size in bytes of PageSeeder documents.
PageSeeder does not index or record the file size of PSML documents.
However, although there are no hard and fast rules for the ideal file size, following the recommendations below when uploading documents helps ensure that they can be processed smoothly in different contexts.
PSML documents are XML documents, so the general guidelines for XML apply. PageSeeder relies on various types of XML processors, and the PSML may also be processed externally, so it is important to bear the following in mind:
- content with few or no line breaks may be difficult for processors that handle code line by line, which is mostly true of text editors;
- indentation and comments increase the file size but have little real impact on processing;
- markup density has a greater impact on processing, especially when the content is processed as a tree structure; in other words, content that is mostly plain text is easier to process.
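The three factors above can be estimated for a given document. The following is a minimal sketch, not a PageSeeder tool: the function name `psml_stats` and the definition of the markup ratio (the share of characters that are not text content) are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def psml_stats(xml_text: str) -> dict:
    """Return rough size indicators for an XML (for example PSML) string."""
    root = ET.fromstring(xml_text)
    # Number of elements in the tree, including the root
    elements = sum(1 for _ in root.iter())
    # Characters of text content, ignoring surrounding whitespace such as indentation
    text_chars = sum(len(t.strip()) for t in root.itertext())
    total_chars = len(xml_text)
    # Very long lines are what line-based tools such as text editors struggle with
    longest_line = max((len(line) for line in xml_text.splitlines()), default=0)
    return {
        "elements": elements,
        "text_chars": text_chars,
        "total_chars": total_chars,
        # Assumed definition: share of characters that are markup or whitespace
        "markup_ratio": 1 - text_chars / total_chars if total_chars else 0.0,
        "longest_line": longest_line,
    }
```

A ratio close to 1 suggests markup-heavy content, and a `longest_line` value in the tens of thousands suggests that the content has too few line breaks for comfortable editing.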
PSML file size
As a rule of thumb, for typical PSML content with regular line breaks, minimal indentation and average markup density:
- any PSML file that is larger than 1 MB is likely to cause problems at some point;
- keeping a PSML document under 100 KB or even 10 KB is a good idea.
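These thresholds can be checked before uploading. The sketch below is only an illustration: the names `classify_psml_size` and `check_file` are hypothetical, and treating 1 KB as 1024 bytes is an assumption, since the guideline does not say which convention it uses.

```python
import os

# Thresholds taken from the rule of thumb; the 1024-byte unit is an assumption
WARN_BYTES = 100 * 1024    # documents are best kept under this size
MAX_BYTES = 1024 * 1024    # larger files are likely to cause problems


def classify_psml_size(size_bytes: int) -> str:
    """Classify a PSML file size against the rule-of-thumb thresholds."""
    if size_bytes > MAX_BYTES:
        return "too large"
    if size_bytes > WARN_BYTES:
        return "large"
    return "ok"


def check_file(path: str) -> str:
    """Classify the file at the given path by its size on disk."""
    return classify_psml_size(os.path.getsize(path))
```

For example, `check_file("manual.psml")` (a hypothetical file name) would return `"large"` for a 500 KB document, which is a hint to consider splitting it.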
The size of individual fragments also matters, especially if editing or diffing is involved:
- the smallest fragment is a paragraph;
- the content of a fragment should fit within one full window height on a desktop screen.
Diffing will not work on large fragments, as PageSeeder imposes a hard limit on these.
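To spot fragments that may be too large, their serialized size can be measured. This is a rough sketch under stated assumptions: it treats any element whose name ends in `fragment` and that has an `id` attribute as a fragment, and `limit_bytes` is a caller-supplied value, because the actual limit applied by PageSeeder is not given here.

```python
import xml.etree.ElementTree as ET


def fragment_sizes(xml_text: str) -> dict:
    """Map each fragment ID to the UTF-8 size in bytes of its serialized XML."""
    root = ET.fromstring(xml_text)
    sizes = {}
    for el in root.iter():
        # Assumption: fragment elements have names ending in "fragment"
        if el.tag.endswith("fragment"):
            # Temporarily drop the tail so trailing whitespace is not counted
            tail, el.tail = el.tail, None
            serialized = ET.tostring(el, encoding="unicode")
            el.tail = tail
            sizes[el.get("id", "?")] = len(serialized.encode("utf-8"))
    return sizes


def oversized(xml_text: str, limit_bytes: int) -> list:
    """Return the IDs of fragments larger than limit_bytes (a hypothetical limit)."""
    return [fid for fid, size in fragment_sizes(xml_text).items() if size > limit_bytes]
```

Fragments reported by `oversized` are candidates for splitting into smaller fragments.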
Non-PSML documents are stored as binary documents and are only processed by the PageSeeder indexer.
Large Word or PDF documents (over 100 MB) may be slow to index.