Skip to main content

 Index

Indexing reference

Index fields

A summary table of all the index fields in PageSeeder and items for which they are available.

A more detailed definition of each field can be found in Index field definitions.

For more explanations on the abbreviations used in this document see Legend & explanations.

Setting the group property indexXFields to true and then re-indexing the group adds all document metadata fields extracted by Tika to the index with the prefix x-(also known as X-Fields).

Core fields

Folder iconWord iconPDF iconHTML iconXML iconPSML iconImage iconComment iconTask iconURL iconFile icon
psidSI
pstitleST
pstitle-sortI
psdescriptionST
pstypeSI
pssubtypeSI
psmediatypeSI
pscreateddateSI?
psmodifieddateSI
pscontentZT
pswordcountSI
pslabelSI*
psgroupnameSI*
psgroupidSI*

Media specific

Folder iconWord iconPDF iconHTML iconXML iconPSML iconImage iconComment iconTask iconURL iconFile icon
psfilenameST
psfolderSI
psancestorI+
pssizeSIN?
psdocidSC?
psownedSI
psdocumenttypeSI
pseditableSI
psfragmenttypeSI*
psfragmentlabelSI*
psblocklabelnameSI*
psinlinelabelnameSI*
pspropertynameSI*
psmetadatanameSI*
psproperty-[name]SI*
psmetadata-[name]SI*
psinline-[name]SI*
pspara-[name]ZT*
psblock-[name]ZT*
pslastediteddateSI?
pslatestversionSI
pslatestversiondateSI?
psversionidSI?
psstatusSI
psstatuschangeddateSI?
psprioritySI
psassignedtoSI
psassignedtoidSIL?
psduedateSI?
pscomment-statusSI
pscomment-prioritySI
pscomment-assignedtoSI
pscomment-assignedtoidSIL?
pscomment-duedateSI?
psdiscussionidSIL
pstaskdefinitionidSIL
psauthorSI?
psauthoridSIL?
pscontextexternalSI
pscontextfragmentSI
pscontexturiidSIL
pswidthSIN?
psheightSIN?
pspixelcountIL?
psgraphicsrcSI*
psimagesrcSI*
pshostSI
psurlSI
pspagecountSN
psxrefcountSN
psreversexrefcountSN
psimagelabelSI*
pslinklabelSI*
psxreflabelSI*
psreversexreflabelSI*
psxreftypeSI*
psreversexreftypeSI*
psmedia-createddateSI?
psmedia-modifieddateSI?
pspublicationidSI*
pspublicationrootSI?
psrootpublicationidSI?
psprefixcontentZT*
psgroupcountSN
psmetadatacheckedSI?
pssourceSI
psurltypeSI
psurlmediatypeSI?

Legend

For brevity, the documents use icons to denote the different kind of indexable items.

Media types

Iconinput media typeDescription
Folder iconFolders
word_icon.svgapplication/wordWord document
PDF iconapplication/pdfPDF document
HTML icontext/htmlHTML document
XML iconapplication/xml
text/xml
*/*+xml
XML document
PSML iconapplication/vnd.pageseeder.psml+xmlPSML document
URL iconapplication/vnd.pageseeder.url+xmlURL (external URI)
Other file icon*/* (any other)Any other type of document
Image iconimage/*Image (gif, bmp, png or jpeg)
Task iconTask
Comment iconComment

The input media type is the MIME type received by the index engine. It is slightly different from the psmediatype index field.

Field types 

Lucene field types legend
SStoredThe field value is stored in the index in full and is available in search results
ZCompressedThe value is stored in the index in a compressed format
IIndexedThe value is indexed as it is and is case sensitive
CCase insensitive
The value is indexed as lower case and searches are case insensitive
TTokenizedThe value is split into tokens which is not case sensitive
NNumberThe value is stored as a integer value
LLongThe value is stored as a long value
?OptionalWhen applicable for the mediatype, the field might not always appear in the index
*Zero-or-moreWhen applicable for the mediatype, the field might appear in the index multiple times
+One-or-moreWhen applicable for the mediatype, the field appears in the index at least once

If a field is not tokenized, it must be searched using its exact value including correct case (for example, “smith” or “john smith” wont match “John Smith”).

Created on , last edited on