Manuals - eCATALOGsolutions

5.9.22.5. *.prj (Project files)

5.9.22.5.2. Full text search: Indexing of PDF and other documents

Documents available in catalogs can be indexed and included into the full-text index.

For this, columns containing the PDF and other documents have to be stated in the key VARSEARCHINDEXDOCUMENT (either in the dir.prj of catalog or in the respective prj files).

VARSEARCHINDEXDOCUMENTVARIABLES=<List of columns to index>

To index a document project, the VARSEARCHINDEXDOCUMENT key must be set to "YES".

VARSEARCHINDEXDOCUMENT=YES

In order for image content inside PDF documents to be read, the text recognition software "Tesseract" has to be installed and in the config file, the installation path has to be stated.

$CADENAS_SETUP/partsol.cfg

[INDEX:OCR]
TesseractPath=
TesseractDataPath=

Furthermore there are two optional settings:

DPI=600
ImageFormat=

Prev	Up	Next
5.9.22.5.1. Full text search: Add keywords to project file	Home	5.10. Adjust 2D derivations