Skip to main content

Pre-validators

ContentWhitelistPreValidator

Accept documents if either the filename or content contain ANY of the given patterns.

type (required)

Type: string

Must be set to ContentWhitelistPreValidator.

patterns (required)

Type: list[string]

Patterns to search for.

FilenameBanlistPreValidator

Ban documents based on their filename or extension. A document is excluded if it matches ANY of the patterns.

type (required)

Type: string

Must be set to FilenameBanlistPreValidator.

banlist_extensions

Type: list[string]

Patterns to match extensions to exclude. The extension does not include the starting '.'.

banlist_filenames

Type: list[string]

Patterns to match filenames to exclude.

include_default_banlist_extensions

Type: boolean

Include a predefined list of extensions to ban.