File size

Size in bytes of PageSeeder documents.

PSML documents

PageSeeder does not index or record the file size of PSML documents.

However, although there are no hard and fast rules for the ideal file size when uploading documents, following the recommendations below helps ensure that your documents can be processed smoothly in different contexts.

XML processors

PSML documents are XML documents, so the general guidelines for XML apply. PageSeeder relies on various types of XML processors, and your PSML may also be processed externally, so it is important to bear in mind the following:

  • content with few or no line breaks may be difficult to handle for processors that work line by line; this is mostly true of text editors;
  • indentation and comments increase the file size but have little real impact on processing;
  • markup density has a greater impact on processing, especially when the content is processed as a tree structure; in other words, content that is mostly plain text is easier to process.
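
The points above can be checked roughly before upload. The following is a minimal sketch in Python, not a PageSeeder tool: the function name `xml_stats` is hypothetical, and "markup density" is assumed here to mean the share of characters that are not text content, which is only one possible definition.

```python
import xml.etree.ElementTree as ET


def xml_stats(xml_text: str) -> dict:
    """Return rough size indicators for an XML string.

    Assumption: markup density = 1 - (text characters / total characters).
    Indentation whitespace between elements counts as text here.
    """
    root = ET.fromstring(xml_text)
    # Characters that belong to text nodes anywhere in the tree.
    text_chars = sum(len(text) for text in root.itertext())
    total_chars = len(xml_text)
    lines = xml_text.splitlines() or [""]
    return {
        "total_chars": total_chars,
        # Very long lines are what line-based tools struggle with.
        "longest_line": max(len(line) for line in lines),
        "markup_density": 1 - text_chars / total_chars,
    }
```

A very long `longest_line` suggests adding line breaks, and a `markup_density` close to 1 suggests the document is mostly markup and may be costlier to process as a tree.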

PSML file size

As a rule of thumb, for typical PSML content with regular line breaks, minimal indentation and average markup density:

  • any PSML file that is larger than 1 MB is likely to cause problems at some point;
  • keeping PSML documents under 100 KB, or even 10 KB, is a good idea.
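
These thresholds could be applied to local files before upload. The sketch below is illustrative only: the function names and labels are hypothetical, and it assumes binary units (1 KB = 1024 bytes), which the guideline does not specify.

```python
import os

# Thresholds taken from the rule of thumb above; binary units are an assumption.
WARN_BYTES = 100 * 1024      # recommended upper bound for a PSML document
LIMIT_BYTES = 1024 * 1024    # beyond this, problems are likely at some point


def classify_psml_size(size_bytes: int) -> str:
    """Classify a PSML file size against the rule-of-thumb thresholds."""
    if size_bytes > LIMIT_BYTES:
        return "too large"
    if size_bytes > WARN_BYTES:
        return "large"
    return "ok"


def check_file(path: str) -> str:
    """Classify a file on disk by its size in bytes."""
    return classify_psml_size(os.path.getsize(path))
```

For example, `check_file("manual.psml")` (a hypothetical file name) returns one of the three labels, which could be used to flag documents worth splitting.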

The size of individual fragments also matters, especially if editing or diffing is involved:

  • the smallest fragment is a paragraph;
  • the content of a fragment should fit within one full window height on a desktop screen.

Diffing will not work on large fragments, as PageSeeder imposes a hard limit on these.
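
To find fragments that may be too large, their serialized sizes can be listed. This is a rough sketch under stated assumptions: the function name `fragment_sizes` is hypothetical, fragments are assumed to be elements whose name ends in `fragment` and that carry an `id` attribute, and no threshold is applied because the hard limit is not stated here.

```python
import xml.etree.ElementTree as ET


def fragment_sizes(psml_text: str) -> list[tuple[str, int]]:
    """Return (fragment id, approximate size in bytes), largest first.

    Assumption: fragments are elements whose tag ends with "fragment".
    Sizes are approximate, since serialization may differ from the source
    and includes any whitespace that follows the element.
    """
    root = ET.fromstring(psml_text)
    sizes = []
    for element in root.iter():
        if element.tag.endswith("fragment"):
            serialized = ET.tostring(element, encoding="utf-8")
            sizes.append((element.get("id", "?"), len(serialized)))
    # Largest fragments first, so likely problem cases appear at the top.
    return sorted(sizes, key=lambda item: item[1], reverse=True)
```

The fragments at the top of the returned list are the first candidates for splitting into smaller fragments.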

Other documents

Non-PSML documents are stored as binary documents and are only processed by the PageSeeder indexer.

Large Word or PDF documents (over 100 MB) may be slow to index.

