mirror of
https://github.com/elastic/elasticsearch.git
synced 2025-04-25 15:47:23 -04:00
* [DOCS] Documentation for the stable plugin API * Removed references to rivers * Add link to Cloud docs for managing plugins * Add caveat about needing to update plugins * Remove reference to site plugins * Wording and clarifications * Fix test * Add link to text analysis docs * Text analysis API dependencies * Remove reference to REST endpoints and fix list * Move plugin descriptor file to its own page * Typos * Review feedback * Delete unused properties file * Changed into * Changed 'elasticsearchVersion' into 'pluginApiVersion' * Swap 'The analysis plugin API' and 'Plugin file structure' sections * Update docs/plugins/authors.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-non-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-non-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-non-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/example-text-analysis-plugin.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/plugin-descriptor-file.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/plugin-script.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-non-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Update docs/plugins/development/creating-non-text-analysis-plugins.asciidoc Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> * Rewording * Add modulename and extended.plugins descriptions for descriptor file * Add link to existing plugins in Github * Review feedback * Use 'stable' and 'classic' plugin naming * Fix capitalization * Review feedback --------- Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com> Co-authored-by: William Brafford <william.brafford@elastic.co>
69 lines
2 KiB
Text
69 lines
2 KiB
Text
[[analysis]]
|
|
== Analysis plugins
|
|
|
|
Analysis plugins extend Elasticsearch by adding new analyzers, tokenizers,
|
|
token filters, or character filters to Elasticsearch.
|
|
|
|
[discrete]
|
|
==== Core analysis plugins
|
|
|
|
The core analysis plugins are:
|
|
|
|
<<analysis-icu,ICU>>::
|
|
|
|
Adds extended Unicode support using the http://site.icu-project.org/[ICU]
|
|
libraries, including better analysis of Asian languages, Unicode
|
|
normalization, Unicode-aware case folding, collation support, and
|
|
transliteration.
|
|
|
|
<<analysis-kuromoji,Kuromoji>>::
|
|
|
|
Advanced analysis of Japanese using the https://www.atilika.org/[Kuromoji analyzer].
|
|
|
|
<<analysis-nori,Nori>>::
|
|
|
|
Morphological analysis of Korean using the Lucene Nori analyzer.
|
|
|
|
<<analysis-phonetic,Phonetic>>::
|
|
|
|
Analyzes tokens into their phonetic equivalent using Soundex, Metaphone,
|
|
Caverphone, and other codecs.
|
|
|
|
<<analysis-smartcn,SmartCN>>::
|
|
|
|
An analyzer for Chinese or mixed Chinese-English text. This analyzer uses
|
|
probabilistic knowledge to find the optimal word segmentation for Simplified
|
|
Chinese text. The text is first broken into sentences, then each sentence is
|
|
segmented into words.
|
|
|
|
<<analysis-stempel,Stempel>>::
|
|
|
|
Provides high quality stemming for Polish.
|
|
|
|
<<analysis-ukrainian,Ukrainian>>::
|
|
|
|
Provides stemming for Ukrainian.
|
|
|
|
[discrete]
|
|
==== Community contributed analysis plugins
|
|
|
|
A number of analysis plugins have been contributed by our community:
|
|
|
|
* https://github.com/medcl/elasticsearch-analysis-ik[IK Analysis Plugin] (by Medcl)
|
|
* https://github.com/medcl/elasticsearch-analysis-pinyin[Pinyin Analysis Plugin] (by Medcl)
|
|
* https://github.com/duydo/elasticsearch-analysis-vietnamese[Vietnamese Analysis Plugin] (by Duy Do)
|
|
* https://github.com/medcl/elasticsearch-analysis-stconvert[STConvert Analysis Plugin] (by Medcl)
|
|
|
|
include::analysis-icu.asciidoc[]
|
|
|
|
include::analysis-kuromoji.asciidoc[]
|
|
|
|
include::analysis-nori.asciidoc[]
|
|
|
|
include::analysis-phonetic.asciidoc[]
|
|
|
|
include::analysis-smartcn.asciidoc[]
|
|
|
|
include::analysis-stempel.asciidoc[]
|
|
|
|
include::analysis-ukrainian.asciidoc[]
|