350 commits

Author SHA1 Message Date
Nicole Albee
2e97844a79
Clarify filters can be used while creating a normalizer. (#103826) (#106234) 2024-03-12 09:43:02 -04:00
Nik Everett
17f5cc87bd
Backport doc fixes to 7.17 (#84722)
* Update painless-reindex-context.asciidoc (#84444) (#84712)

ctx['op'] should be set to 'noop', not 'none' when specifying no
operation.

Elasticsearch error when using 'none':

```json
{
  "error" : {
    "root_cause" : [
      {
        "type" : "illegal_argument_exception",
        "reason" : "Operation type [none] not allowed, only [noop, index, delete] are allowed"
      }
    ],
    "type" : "illegal_argument_exception",
    "reason" : "Operation type [none] not allowed, only [noop, index, delete] are allowed"
  },
  "status" : 400
}
```
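
A minimal sketch of a reindex request body that follows the fix above. The index names, the `skip` field, and the `build_reindex_body` helper are hypothetical; only the `noop` value and the list of allowed operations come from the error message.

```python
import json

# Operation types accepted by the reindex script context,
# as listed in the error message above.
ALLOWED_OPS = {"noop", "index", "delete"}


def build_reindex_body(source_index: str, dest_index: str, op: str) -> dict:
    """Build a _reindex request body whose Painless script sets ctx['op'].

    Hypothetical helper for illustration; it rejects invalid operation
    types locally instead of waiting for Elasticsearch to return a 400.
    """
    if op not in ALLOWED_OPS:
        raise ValueError(
            f"Operation type [{op}] not allowed, "
            "only [noop, index, delete] are allowed"
        )
    return {
        "source": {"index": source_index},
        "dest": {"index": dest_index},
        "script": {
            "lang": "painless",
            # `skip` is an assumed document field, used only as an example.
            "source": f"if (ctx._source.skip == true) {{ ctx['op'] = '{op}' }}",
        },
    }


body = build_reindex_body("my-source", "my-dest", "noop")
print(json.dumps(body, indent=2))
```

Passing `"none"` to the helper raises a `ValueError`, mirroring the server-side rejection shown above.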

Co-authored-by: jalvar08 <jeovanny.alvarez@gmail.com>

* [DOCS] Update install instructions for Debian/Ubuntu (#84645) (#84714)

The use of `apt-key` is deprecated and will no longer be available after
Debian 11 and Ubuntu 22.04. This updates the installation instructions
for Debian-based distributions.

Closes #84644

Co-authored-by: er0k <er0k@users.noreply.github.com>

* Fix some typos in plugins & reference docs (#84667) (#84717)

This pull request removes a few instances of duplicate words or
punctuation and erroneous spelling from the docs.

Co-authored-by: Abele Mălan <6689720+AbeleMM@users.noreply.github.com>
2022-03-07 14:04:12 -05:00
James Rodewig
53ed187d63
[DOCS] Fix typos (#83895) (#83975)
Co-authored-by: Tobias Stadler <ts.stadler@gmx.de>
2022-02-15 13:05:01 -05:00
James Rodewig
0813bda90e
[DOCS] Remove Hunspell dictionaries location config (#82704) (#82956)
Users can no longer set the location for Hunspell dictionaries. The `<config-dir>/hunspell` directory is always used, regardless of configuration.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
(cherry picked from commit 1a4fd34129)

Co-authored-by: Jan Jíša <jenda.jisa@gmail.com>
2022-01-24 11:17:34 -05:00
nexusalf
e58a8ec9f7 [Docs] Update edgengram-tokenizer.asciidoc (#79577)
The original example of "snapped" does not apply to this section since it is talking about edge ngrams.
The change replaces the term with "approximate" as a valid example.
2021-10-26 13:04:41 +02:00
Adam Locke
2b52aa558e
Fix a typo in the first letter of a user query (#76394) (#76451)
Co-authored-by: Arseni Prokharchyk <2657789+arsen91@users.noreply.github.com>
2021-08-12 14:29:03 -04:00
James Rodewig
e0ee21070d [DOCS] Fix formatting 2021-05-04 12:29:36 -04:00
James Rodewig
c757f9e4e7
[DOCS] Fix double spaces (#71082) (#71120) 2021-03-31 11:43:34 -04:00
James Rodewig
e97377ca52
[DOCS] Note you can omit type for custom analyzers (#70754) (#70760)
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2021-03-23 13:43:40 -04:00
Adam Locke
ab6eba4662
[DOCS] [7.x] Focus scripting docs on Painless (#69748) (#70588)
* Initial changes for scripting.

* Shorten script examples.

* Expanding types docs.

* Updating types.

* Fixing broken cross-link.

* Fixing map error.

* Incorporating review feedback.

* Fixing broken table.

* Adding more info about reference types.

* Fixing broken path.

* Adding more info and examples for def type.

* Adding more info on operators.

* Incorporating review feedback.

* Adding notconsole for example.

* Removing comments in example.

* More review feedback.

* Editorial changes.

* Incorporating more reviewer feedback.

* Rewrites based on review feedback.

* Adding new sections for storing scripts and shortening scripts.

* Adding redirect for stored scripts.

* Adding DELETE for stored script plus link.

* Adding section for updating docs with scripts.

* Incorporating final feedback from reviews.

* Tightening up a few areas.

* Minor change around other languages.

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2021-03-18 17:33:44 -04:00
James Rodewig
3162990374
[DOCS] Fix nori tokenizer link (#70564) (#70569) 2021-03-18 11:25:40 -04:00
James Rodewig
f1d7e54705
[DOCS] Fix Lucene's stop words links (#70405) (#70475)
Co-authored-by: Fabien Caylus <fabien.caylus@gmail.com>
2021-03-16 17:44:42 -04:00
James Rodewig
8e92000b97
[DOCS] Fix case sensitivity for elision token filter (#69873) (#69878) 2021-03-03 09:27:02 -05:00
James Rodewig
b55249507e
[DOCS] Fix typos for duplicate words (#69125) (#69132) 2021-02-17 11:16:58 -05:00
James Rodewig
dfe113b68d
[DOCS] Expand simple query string query's multi-position token section (#68753) (#68784) 2021-02-09 17:12:39 -05:00
James Rodewig
c7f7bdde89 [DOCS] Fix typo 2021-02-03 10:46:13 -05:00
James Rodewig
3872f08b33
[DOCS] Clean up index template xrefs (#67264) (#67272) 2021-01-11 12:58:00 -05:00
James Rodewig
c4bfa1afad
[DOCS] Fix typo (#65912) (#65913)
Co-authored-by: Toast <mrtoastcheng@gmail.com>
2020-12-05 10:20:56 -05:00
James Rodewig
f1ede024c2
[DOCS] Fix whitespace in pattern replace token filter docs (#64345) (#64349) 2020-10-29 10:41:44 -04:00
James Rodewig
aea83909d9
[DOCS] Fix case for 'Boolean' (#64299) (#64341) 2020-10-29 10:04:20 -04:00
James Rodewig
36213acd80
[DOCS] Fix pattern replace token filter intro (#64189) (#64217)
Removes an incorrect statement about anchoring regex patterns on tokens.

Co-authored-by: Elasticsearch addict <amitmbm87@gmail.com>
2020-10-27 10:35:23 -04:00
James Rodewig
26a157da7b
[DOCS] Update snowball links (#63351) (#63355) 2020-10-06 16:21:37 -04:00
James Rodewig
3e548592b6
[DOCS] Update link to Snowball documentation (#63305) (#63348)
The current link points to an obsolete site, which is no longer maintained.

Co-authored-by: Stefan Walter <67258699+rd-stefan-walter@users.noreply.github.com>
2020-10-06 13:41:06 -04:00
James Rodewig
e179b89085
[DOCS] Clarify that v2.0+ hyphenation files aren't supported (#60579) (#63073)
Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>

Co-authored-by: jgkirschbaum <juergen.kirschbaum@gmail.com>
2020-09-30 09:28:44 -04:00
James Rodewig
60876a0e32
[DOCS] Replace Wikipedia links with attribute (#61171) (#61209) 2020-08-17 11:27:04 -04:00
James Rodewig
d0810cca19
[DOCS] Add xref to multiplexer token filter docs (#60431) (#61168)
Co-authored-by: paiboon auengkongkatong <paiboon15721@gmail.com>
2020-08-14 15:05:07 -04:00
James Rodewig
5885f6ae66
[DOCS] Add missing lang values to snowball token filter (#60489) (#60692) 2020-08-04 17:46:03 -04:00
James Rodewig
5a2c6f0d4f
[DOCS] http -> https, remove outdated plugin docs (#60380) (#60545)
The plugin discovery documentation contained information about installing
Elasticsearch 2.0 and an Oracle JDK, both of which are no
longer valid.

Because the instructions used cleartext HTTP to install
packages, this commit also replaces HTTP links with HTTPS where possible.

In addition a few community links have been removed, as they do not seem
to exist anymore.

Co-authored-by: Alexander Reelsen <alexander@reelsen.net>
2020-07-31 16:16:31 -04:00
James Rodewig
aba785cb6e
[DOCS] Update my-index examples (#60132) (#60248)
Changes the following example index names to `my-index-000001` for consistency:

* `my-index`
* `my_index`
* `myindex`
2020-07-27 15:58:26 -04:00
James Rodewig
988e8c8fc6
[DOCS] Swap [float] for [discrete] (#60134)
Changes instances of `[float]` in our docs to `[discrete]`.

Asciidoctor prefers the `[discrete]` tag for floating headings:
https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks
2020-07-23 12:42:33 -04:00
James Rodewig
b302b09b85
[DOCS] Reformat snippets to use two-space indents (#59973) (#59994) 2020-07-21 15:49:58 -04:00
malpani
0555fef799 Support ignore_keywords flag for word delimiter graph token filter (#59563)
This commit lets the word delimiter token filters skip processing of
tokens tagged as keywords, using the `ignore_keywords` flag that Lucene's
WordDelimiterGraphFilter already exposes.
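
A minimal sketch of index settings that use the flag, written as a Python dict so it can be serialized to a request body. The filter and analyzer names (`protect_terms`, `my_word_delimiter`, `my_analyzer`) and the `wi-fi` keyword are hypothetical; only `ignore_keywords` comes from this commit. A `keyword_marker` filter is assumed to run first so that some tokens are tagged as keywords.

```python
import json

# Hypothetical names throughout; `ignore_keywords` is the flag added here.
settings = {
    "settings": {
        "analysis": {
            "filter": {
                # Tags the listed terms as keywords.
                "protect_terms": {
                    "type": "keyword_marker",
                    "keywords": ["wi-fi"],
                },
                # Skips tokens that an earlier filter tagged as keywords.
                "my_word_delimiter": {
                    "type": "word_delimiter_graph",
                    "ignore_keywords": True,
                },
            },
            "analyzer": {
                "my_analyzer": {
                    "type": "custom",
                    "tokenizer": "whitespace",
                    # Order matters: mark keywords before splitting.
                    "filter": ["protect_terms", "my_word_delimiter"],
                }
            },
        }
    }
}

print(json.dumps(settings, indent=2))
```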

Fix for #59491
2020-07-21 16:11:55 +01:00
James Rodewig
82a8d9aa0c
[DOCS] Fix keyword marker docs (#59834) (#59863)
Co-authored-by: Rui Almeida <ruial@outlook.com>
2020-07-20 09:27:42 -04:00
James Rodewig
da85a40e7e
[DOCS] Reformat predicate_token_filter tokenfilter (#57705) (#59714) 2020-07-16 13:35:09 -04:00
James Rodewig
6ed356ffc3
[DOCS] Replace datatype with data type (#58972) (#59184) 2020-07-07 14:59:35 -04:00
James Rodewig
6436792aac
[DOCS] Fix headings for simple analyzer docs (#58910) (#58918) 2020-07-02 09:52:05 -04:00
James Rodewig
28717d1e02
[DOCS] Fix analyzer page titles (#58362) (#58603)
Changes the titles for analyzer pages to sentence case.

Also changes the 'Pattern character filter' page title to sentence case.
2020-06-26 10:17:01 -04:00
James Rodewig
ab29162ab3
[DOCS] Fix tokenizer page titles (#58361) (#58598)
Changes the titles for tokenizer pages to sentence case.

Also moves the 'Path hierarchy tokenizer examples' page within the
'Path hierarchy tokenizer' page and adds a related redirect.
2020-06-26 09:24:41 -04:00
James Rodewig
c36df27730
[DOCS] Reformat pattern_replace token filter (#57699) (#57995)
Changes:

* Rewrites description and adds Lucene link
* Adds analyze example
* Adds parameter definitions
* Adds custom analyzer example
2020-06-11 12:19:38 -04:00
James Rodewig
24a50eb3af
[DOCS] Reformat mapping charfilter (#57818) (#57885)
Changes:

* Adds title abbreviation
* Adds Lucene link to description
* Adds standard headings
* Simplifies analyze example
* Simplifies analyzer example and adds contextual text
2020-06-09 12:43:01 -04:00
James Rodewig
63e962c99a [DOCS] Fix typo in html_strip char filter docs 2020-06-08 10:37:35 -04:00
James Rodewig
2121eb528c
[DOCS] Reformat html_strip charfilter (#57764) (#57810)
Changes:

* Converts title to sentence case
* Adds a title abbreviation
* Adds Lucene link to description
* Reformat sections
2020-06-08 08:44:43 -04:00
Tomasz Elendt
a7c36c8af5 Support multiple tokens on LHS in stemmer_override rules (#56113) (#56484)
This commit adds support for rules with multiple tokens on the LHS, also
known as "contraction rules", to the stemmer override token
filter. Contraction rules are handy for translating multiple
inflected words into the same root form. One side effect of this change is
that it brings the stemmer override rule format closer to the synonym rule
format, which makes it easier to translate one into the other.
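
To illustrate what a contraction rule expresses, here is a rough sketch of a rule parser. It is not the Elasticsearch parser: `parse_override_rule` is a hypothetical helper, and the comma-separated LHS syntax is an assumption modeled on the synonym rule format mentioned above.

```python
def parse_override_rule(rule: str) -> dict[str, str]:
    """Parse 'tok1, tok2 => stem' into {tok1: stem, tok2: stem}.

    Hypothetical, deliberately strict parser: it rejects rules without
    exactly one '=>' and rules containing empty tokens.
    """
    if rule.count("=>") != 1:
        raise ValueError(f"expected exactly one '=>' in rule: {rule!r}")
    lhs, rhs = (side.strip() for side in rule.split("=>"))
    tokens = [token.strip() for token in lhs.split(",")]
    if not rhs or any(not token for token in tokens):
        raise ValueError(f"empty token in rule: {rule!r}")
    # Every LHS token maps to the same root form.
    return {token: rhs for token in tokens}


print(parse_override_rule("running, runs => run"))
# {'running': 'run', 'runs': 'run'}
```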

This change also makes the stemmer override rule parser stricter, so
that it should catch more errors that were previously accepted.

Closes #56113
2020-05-29 22:34:31 +02:00
James Rodewig
a0ca0325fe
[DOCS] Reformat min_hash token filter docs (#57181) (#57246)
Changes:

* Rewrites description and adds a Lucene link
* Reformats the configurable parameters as a definition list
* Changes the `Theory` heading to `Using the min_hash token filter for
  similarity search`
* Adds some additional detail to the analyzer example
2020-05-27 15:08:23 -04:00
James Rodewig
a2de43d468
[DOCS] Reformat shingle token filter (#57040)
Changes:

* Rewrites description and adds Lucene link
* Adds analyze example
* Rewrites parameter documentation
* Updates custom analyzer and filter examples
* Adds anchor to `index.max_shingle_diff` index-level setting
2020-05-21 13:56:54 -04:00
James Rodewig
5cb34d9a6e
[DOCS] Reformat hunspell token filter (#56955)
Changes:

* Rewrites description and adds Lucene link
* Adds analyze example
* Rewrites parameter documentation
* Updates custom analyzer example
* Rewrites related setting documentation
2020-05-20 14:47:53 -04:00
Andrei Balici
19a336e8d3 Add max_token_length setting to the CharGroupTokenizer (#56860)
Adds `max_token_length` option to the CharGroupTokenizer.
Updates documentation as well to reflect the changes.
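
A rough Python sketch of how such an option could behave. This is not the Lucene or Elasticsearch implementation: it assumes that over-long tokens are split at `max_token_length` intervals, as the standard tokenizer does, and the default of 255 is likewise an assumption. The function name and the sample text are hypothetical.

```python
def char_group_tokenize(text, split_chars, max_token_length=255):
    """Split `text` on any character in `split_chars`.

    Assumption: a token that reaches `max_token_length` characters is
    emitted immediately and a new token starts with the next character.
    """
    if max_token_length < 1:
        raise ValueError("max_token_length must be positive")

    tokens = []
    current = []

    def flush():
        # Emit the pending token, if any, and start a new one.
        if current:
            tokens.append("".join(current))
            current.clear()

    for ch in text:
        if ch in split_chars:
            flush()
        else:
            current.append(ch)
            if len(current) == max_token_length:
                flush()
    flush()
    return tokens


print(char_group_tokenize("quick-brown fox", {"-", " "}, max_token_length=4))
# ['quic', 'k', 'brow', 'n', 'fox']
```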

Closes #56676
2020-05-20 14:28:40 +02:00
James Rodewig
342e713e2a
[DOCS] Fix fingerprint token filter's analyzer example (#56811) (#56943)
Co-authored-by: Abhilash Bolla <2282894+ivssh@users.noreply.github.com>
2020-05-19 09:30:00 -04:00
James Rodewig
4faf5a7916
[DOCS] Reformat porter_stem token filter (#56053)
Makes the following changes to the `porter_stem` token filter docs:

* Rewrites description and adds a Lucene link
* Adds detailed analyze example
* Adds an analyzer example
2020-05-04 10:39:17 -04:00
markharwood
e197b6c45b
Analysis enhancement - add preserve_original setting in ngram-token-filter (#55432) (#56100)
Authored-by: Amit Khandelwal <amitmbm87@gmail.com>
2020-05-04 11:31:28 +01:00