--- navigation_title: "Thai" mapped_pages: - https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-thai-tokenizer.html --- # Thai tokenizer [analysis-thai-tokenizer] The `thai` tokenizer segments Thai text into words, using the Thai segmentation algorithm included with Java. Text in other languages in general will be treated the same as the [`standard` tokenizer](/reference/text-analysis/analysis-standard-tokenizer.md). ::::{warning} This tokenizer may not be supported by all JREs. It is known to work with Sun/Oracle and OpenJDK. If your application needs to be fully portable, consider using the [ICU Tokenizer](/reference/elasticsearch-plugins/analysis-icu-tokenizer.md) instead. :::: ## Example output [_example_output_17] ```console POST _analyze { "tokenizer": "thai", "text": "การที่ได้ต้องแสดงว่างานดี" } ``` The above sentence would produce the following terms: ```text [ การ, ที่, ได้, ต้อง, แสดง, ว่า, งาน, ดี ] ``` ## Configuration [_configuration_20] The `thai` tokenizer is not configurable.