ElasticSearch IK Tokenizer Quick Start

1. Install the IK tokenizer

1. Allocate a pseudo terminal

My Elasticsearch runs in Docker, so the first step is to open a shell with a pseudo terminal inside the container. From there you can work with the container's contents just as if you had logged into a server.
docker exec -it <container ID> /bin/bash

2. Install the plugin using elasticsearch-plugin

Run cd plugins to enter the plugins folder, then execute the following command. Replace 7.12.0 (it appears twice in the URL) with your own Elasticsearch version number.

../bin/elasticsearch-plugin install https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v7.12.0/elasticsearch-analysis-ik-7.12.0.zip
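
Because the version number appears twice in the download URL, it is easy to change one and forget the other. A minimal sketch of a helper that builds the URL from a single version argument is shown here; the function name ik_plugin_url is made up for illustration, and the URL pattern is simply the one used in the command above.

```shell
# Build the IK release URL for a given Elasticsearch version.
# ik_plugin_url is a hypothetical helper name, not part of any tool.
ik_plugin_url() {
  version="$1"
  printf 'https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v%s/elasticsearch-analysis-ik-%s.zip\n' \
    "$version" "$version"
}

# Example usage inside the container, from the plugins folder:
#   ../bin/elasticsearch-plugin install "$(ik_plugin_url 7.12.0)"
```

Whether a release exists for a particular version still has to be checked on the project's releases page.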

3. Restart the elasticsearch container

docker restart <container ID>

4. Frequently Asked Questions

(1) java.lang.IllegalStateException

When I ran elasticsearch-plugin install, it reported "Failed installing" together with a java.lang.IllegalStateException.

On checking, I found that my plugins/ directory already contained a folder named ik.
On the off chance that this was the cause, I deleted the ik folder and ran the command again:

../bin/elasticsearch-plugin install https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v7.12.0/elasticsearch-analysis-ik-7.12.0.zip

This time the installation succeeded.
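
The cleanup step above can be sketched as a small shell function. This is only an illustration of what I did by hand: the name remove_stale_plugin is hypothetical, and the folder name (ik in my case) and the plugins path depend on your own installation.

```shell
# Remove a leftover plugin folder before reinstalling.
# remove_stale_plugin is a hypothetical helper; pass your own plugins path and folder name.
remove_stale_plugin() {
  plugins_dir="$1"
  name="$2"
  # Refuse to run with empty arguments so the plugins folder itself is never deleted.
  if [ -z "$plugins_dir" ] || [ -z "$name" ]; then
    echo "usage: remove_stale_plugin <plugins dir> <folder name>" >&2
    return 1
  fi
  if [ -d "$plugins_dir/$name" ]; then
    rm -rf -- "$plugins_dir/$name"
    echo "removed $plugins_dir/$name"
  else
    echo "nothing to remove"
  fi
}

# Example usage from the plugins folder inside the container:
#   remove_stale_plugin . ik
```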

2. How IK tokenization works

(Placeholder; I will fill this in once I have had time to read the source code.)

3. ik_smart and ik_max_word

GET _analyze?pretty
{
  "analyzer": "ik_smart",
  "text": "Unlike me, I only feel sorry for pigeons"
}

GET _analyze?pretty
{
  "analyzer": "ik_max_word",
  "text": "Unlike me, I only feel sorry for pigeons"
}
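
If you are not using a console that accepts the GET form above, the same requests can be sent with curl. The sketch below only builds the request body; ik_analyze_body is a hypothetical helper name, and the address http://localhost:9200 in the usage comment is an assumption about where your node listens. The text is inserted as-is, so it must not contain double quotes or backslashes.

```shell
# Print an _analyze request body for the given analyzer and text.
# ik_analyze_body is a hypothetical helper; the text is not JSON-escaped.
ik_analyze_body() {
  printf '{ "analyzer": "%s", "text": "%s" }\n' "$1" "$2"
}

# Example usage (host and port are assumptions):
#   curl -X GET "http://localhost:9200/_analyze?pretty" \
#     -H 'Content-Type: application/json' \
#     -d "$(ik_analyze_body ik_max_word 'Unlike me, I only feel sorry for pigeons')"
```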

Conclusion

analyzer is the tokenizer applied when the index is built; use ik_max_word for indexing.
search_analyzer is the tokenizer applied when searching; use ik_smart for queries.
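
As a sketch of how the conclusion translates into a mapping, the function below prints an index-creation body for a single text field that is indexed with ik_max_word and searched with ik_smart. The helper name ik_mapping_body, the field name content, the index name my_index, and the address http://localhost:9200 are all assumptions for illustration.

```shell
# Print an index-creation body that indexes with ik_max_word and searches with ik_smart.
# ik_mapping_body and the field name passed in are hypothetical.
ik_mapping_body() {
  field="$1"
  cat <<EOF
{
  "mappings": {
    "properties": {
      "$field": {
        "type": "text",
        "analyzer": "ik_max_word",
        "search_analyzer": "ik_smart"
      }
    }
  }
}
EOF
}

# Example usage (index name, host and port are assumptions):
#   curl -X PUT "http://localhost:9200/my_index" \
#     -H 'Content-Type: application/json' \
#     -d "$(ik_mapping_body content)"
```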
