Analysis updated 2026-06-24
Add Chinese search support to an existing Elasticsearch deployment
Build a Chinese-language product search backed by OpenSearch
Extend the tokenizer with a domain-specific vocabulary file
Hot-reload an evolving keyword list without restarting the cluster
| infinilabs/analysis-ik | justauth/justauth | openzipkin/zipkin | |
|---|---|---|---|
| Stars | 17,448 | 17,444 | 17,431 |
| Language | Java | Java | Java |
| Setup difficulty | moderate | easy | easy |
| Complexity | 3/5 | 2/5 | 3/5 |
| Audience | developer | developer | ops devops |
Figures from each repo's GitHub metadata at analysis time.
Plugin version must match the exact Elasticsearch or OpenSearch version you are running.
analysis-ik is a plugin that adds Chinese-language text analysis to Elasticsearch and OpenSearch, two popular search engines used to build fast, scalable search features in applications. The core problem it solves is that Chinese text has no spaces between words, making it hard for search engines to know where one word ends and another begins. This plugin integrates the IK analyzer (a Chinese text tokenizer, meaning a tool that splits text into meaningful units called tokens) so that search queries and indexed content are broken down correctly. The plugin provides two tokenizer modes. "ik_max_word" performs the finest-grained split, generating all possible word combinations from a phrase, useful when you want to match any possible way a query might overlap with the content. "ik_smart" performs a coarser, more minimal split, useful for phrase-level queries. You configure which mode to use per field in your Elasticsearch index mapping. Custom dictionaries are supported: you can add your own vocabulary files (lists of words, one per line, in UTF-8 encoding) to extend the default dictionary. The plugin also supports hot-reloading dictionaries from a remote URL, meaning you can update the word list without restarting the search engine, as long as the server serving the file returns standard HTTP caching headers. Installation is done via the Elasticsearch or OpenSearch plugin CLI with a single command. The plugin is written in Java, licensed under Apache 2.0, and maintained by INFINI Labs.
Elasticsearch and OpenSearch plugin that adds Chinese-language tokenization via the IK analyzer, with custom and hot-reloadable dictionaries.
Mainly Java. The stack also includes Java, Elasticsearch, OpenSearch.
Apache 2.0 lets you use, modify, and redistribute the code commercially, as long as you keep the license notice and state your changes.
Setup difficulty is rated moderate, with roughly 30min to a first successful run.
Mainly developer.
This repo across BitVibe Labs
Verify against the repo before relying on details.