Add support for pattern replace filter in normalizers (#96588)

This change adds support for `pattern_replace` token filters use in custom normalizers. 

Closes #83005
This commit is contained in:
Marantidis Kiriakos 2023-06-10 01:32:39 +03:00 committed by GitHub
parent 4df6911ec7
commit a8cf4d6006
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23
5 changed files with 90 additions and 2 deletions

View file

@ -9,7 +9,7 @@ allowed, but not a stemming filter, which needs to look at the keyword as a
whole. The current list of filters that can be used in a normalizer is
following: `arabic_normalization`, `asciifolding`, `bengali_normalization`,
`cjk_width`, `decimal_digit`, `elision`, `german_normalization`,
`hindi_normalization`, `indic_normalization`, `lowercase`,
`hindi_normalization`, `indic_normalization`, `lowercase`, `pattern_replace`,
`persian_normalization`, `scandinavian_folding`, `serbian_normalization`,
`sorani_normalization`, `uppercase`.