Tag: Elasticsearch

22 articles

Big Data 191 - Elasticsearch Cluster Planning & Tuning: Node Roles, Shards, Replicas, Write and Search Checklist

Master / Data / Coordinating node responsibilities and production role isolation strategies, capacity planning calculations.

Big Data 189 - Nginx JSON Logs to ELK: ZK + Kafka + Elasticsearch 7.3.0 + Kibana 7.3.0

Configure Nginx logformat json to output structured accesslog (containing @timestamp, requesttime, status, requesturi, ua and other fields).

Filebeat → Kafka → Logstash → Elasticsearch Practice

Filebeat collects Nginx access.log to Kafka, and Logstash consumes, parses embedded JSON by field conditions, enriches metadata, and writes structured logs to Elasticsear...

Big Data 188 - Logstash Output Plugin Practice

Output is the final stage of Logstash pipeline, responsible for outputting processed data to target system.

Big Data 183 - Elasticsearch Concurrency Conflicts & Optimistic Lock

Elasticsearch concurrency conflicts (inventory deduction read-modify-write) breakdown write overwrite cause, and gives engineering solution using ES optimistic.

Big Data 184 - Elasticsearch Doc Values Mechanism Detailed

Disk columnar data structure generated at indexing time, optimized for sorting, aggregation and script values

Big Data 181 - Elasticsearch Segment Merge & Disk Directory Breakdown

Explains why refresh causes small segment increase, how segment merge merges small segments into large ones in background and cleans deleted documents.

Big Data 182 - Elasticsearch Inverted Index Underlying Breakdown

Article details core data structure of Elasticsearch inverted index: Terms Dictionary, Posting List, FST (Finite State Transducer) and SkipList how accelerate.

Big Data 179 - Elasticsearch Inverted Index and Read/Write Process

This article deeply analyzes Elasticsearch's inverted index principle based on Lucene, and document read/write flow.

Big Data 180 - Elasticsearch Near Real-Time Search: Segment, Refresh and Flush

Article details core mechanism of Elasticsearch near real-time search, including Lucene Segment, Memory Buffer, File System Cache, Refresh, Flush and Translog.

Elasticsearch Aggregation Practice: Metrics Aggregations & Bucket Aggregations

Covers complete practice of Metrics Aggregations and Bucket Aggregations, applicable to common Elasticsearch 7.x / 8.x versions in 2025.

Big Data 178 - Elasticsearch 7.3 Java Practice: Index and Document CRUD

This article details the complete flow for index and document CRUD operations using Elasticsearch 7.3.0 and RestHighLevelClient.

Big Data 175 - Elasticsearch Term Queries and Bool Combination Practice

This article demonstrates Elasticsearch term-level queries including term, terms, range, exists, prefix, regexp, fuzzy, ids queries, and bool compound queries.

Big Data 176 - Elasticsearch Filter DSL Practice: Filter Queries, Pagination and Highlighting

This article details practical usage of Elasticsearch Filter DSL, covering filter query, sort pagination, highlight display and batch operations.

Big Data 173 - Elasticsearch Mapping and Document CRUD Practice

After creating an index, need to set field constraints, called field mapping (mapping).

Elasticsearch Query DSL Practice: match/match_phrase/query_string/multi_match

In-depth explanation of core Query DSL usage in Elasticsearch 7.3, focusing on differences and pitfalls of match, matchphrase, querystring.

Big Data 171 - Elasticsearch-Head and Kibana 7.3.0 Practice

Introduction to Elasticsearch-Head plugin and Kibana 7.3.0 installation and connectivity points, covering Chrome extension quick access.

Elasticsearch Index Operations & IK Analyzer Practice: 7.3/8.x

This article explains Elasticsearch index CRUD operations and IK analyzer config, covering versions 7.3.0 and 8.15.0.

Big Data 169 - Elasticsearch Getting Started: Index/Document CRUD & Minimum Search Examples

Elasticsearch (ES 7.x/8.x) minimum examples for index creation, document CRUD, query by ID, and _search, with response samples and screenshots to quickly run through the...

Big Data 170 - Elasticsearch 7.3.0 Three-Node Cluster Practice

Elasticsearch 7.3.0 three-node cluster deployment practice tutorial, covering directory creation and permission settings.

Big Data 167 - ELK Elastic Stack Practice: Architecture, Indexing and Troubleshooting

Article introduces core capabilities and common practices of Elasticsearch 8.x, Logstash 8.x, Kibana 8.

Elasticsearch Single Machine Cloud Server Deployment & Operations

Elasticsearch is a distributed full-text search engine, supports single-node mode and cluster mode deployment. Generally, small companies can use Single-Node Mode for the...