Elasticsearch

Internal Structure Of Elasticsearch Java High-Level REST Client

This article explores the implementation of Elasticsearch Java High-Level REST Client (HLRC) by analyzing the structure of the client, the dependencies, the error handling mechanism, serialization, its observability, and more.

23 Jan 2022

The Decision System For Shard Allocation in Elasticsearch

Curious about how does a decision system work? This article explains the deciders for shard allocation in Elasticsearch by going through their responsibility, structure, decision making, lifecycle, testing, and more.

27 Dec 2021

Slow Query Logs In Elasticsearch

Better understand the slow queries in Elasticsearch by using the slow logs. In this article, we discuss the log structure, the thresholds, and some potential root causes.

27 Nov 2021

Elasticsearch Snapshot Plugins

Elasticsearch snapshot repository plugins for AWS, GCP, and Azure

01 Nov 2021

Elasticsearch Snapshot APIs

This article summarizes the list of APIs for "Snapshot and Restore" in Elasticsearch, which allows you to perform operations easily and navigate to official documentation if you need more detail.

25 Oct 2021

Internal Structure Of Snapshot Repository

This article takes you to the Elasticsearch snapshot repository to explore its internal structure and understand the contents and uses of different files.

04 Sep 2021

Elasticsearch Settings

This article describes the different types of settings of Elasticsearch: cluster settings, index settings, node settings, the difference between static and dynamic settings, and some commonly used curl commands.

21 Aug 2021

How to prevent data loss in Elasticsearch?

Six practical solutions to improve data reliability: improve number of replicas, snapshot and restore, RAID, MQ, etc.

31 Jul 2021

Fix Corrupted Index in Elasticsearch

How to fix corrupted index in Elasticsearch cluster without snapshot?

14 Jul 2021

DVF: Real Estate Analysis For Île-de-France in 2020

This article studies the real estate market of Île-de-France in 2020 by exploring and visualizing the dataset DVF using Kibana. We will discuss the global landscape, the impact of COVID-19, the situation in different departments, and more.

16 Apr 2021

DVF: Aggregations

How to write and execute metric and bucket aggregations in Elasticsearch for dataset: Demandes de valeurs foncières (DVF) for data analytics. Also, how to execute aggregations that contain sub-aggregations.

12 Apr 2021

Elasticsearch: Generate Configuration With Python Jinja 2

This article explains how to generate the configuration for Elasticsearch using Python templating engine Jinja 2 by going through a basic use-case. It also explains features about Jinja2, testing, and more.

11 Apr 2021

Disk Watermarks In Elasticsearch

Understanding different disk watermarks: low, high, and flood-stage in Elasticsearch. The symptom when the cluster reaches these values and how to mitigate or avoid the issues. Also, how to better operate your clusters using these watermarks.

10 Apr 2021

DVF: Snapshot And Restore

Part 4: How to create a snapshot for index "transactions" of DVF and restore it to an Elasticsearch cluster.

10 Jan 2021

DVF: Storage Optimization

Part 3: How to optmize storage of a given index by 40% using force-merge.

25 Dec 2020

DVF: Indexing Optimization

Part 2: Optimize the indexing process using bulk index requests and multi-threading.

17 Dec 2020

DVF: Indexing New Documents

Part 1: Indexing new documents into Elasticsearch using French government's open data "Demande de valeurs foncières (DVF)".

16 Dec 2020

18 Allocation Deciders in Elasticsearch

This article explains the 18 allocation deciders in Elasticsearch: when they decide to allow, deny, or throttle the shard allocation under different circumstances. Also, a complete list of messages for unassigned shards.

27 Sep 2020

Elasticsearch: Common Index Exceptions

Explain the common Elasticsearch exceptions occurred when indexing new documents, with sample messages, analysis, suggestions and external resources for further investigation.

13 Sep 2020

GC in Elasticsearch

Basic information about garbage collection (GC) in Elasticsearch, including the default garbage collector used, JVM options, GC logging, and more.

30 Aug 2020

Discovery in Elasticsearch

How does discovery work in Elasticsearch? This article explains different mechanisms of discovery, the key settings, fault detection, related logs, and more.

22 Aug 2020

Wrap Elasticsearch Response Into CompletableFuture

Wrap Elasticsearch client response into CompletableFuture in Java for Elasticsearch transport client or Java high level REST client.

26 Jul 2020

Testing Elasticsearch With Docker And Java High Level REST Client

Testing Elasticsearch with docker and Java High Level REST Client

05 Apr 2020

Elasticsearch: cat nodes API

Use Elasticsearch cat nodes API to understand the state of your Elasticsearch: id, name, IP address, CPU, load, memory, and much more.

07 Mar 2020

Elasticsearch: Scroll API in Java

Elasticsearch Scroll API sample written in Java, useful for retrieving large numbers of results (or even all results) from a single search request.

19 Jan 2020

Indexing New Data in Elasticsearch

Initializing Elasticsearch client, sending a single index request or a bulk index request, index response, different content types, refresh policy etc.

02 Dec 2019

Testing Elasticsearch with ESSingleNodeTestCase

Writing unit tests for Elasticsearch using Elasticsearch Single Node Test Case (ESSingleNodeTestCase), a derived class of ESTestCase which simplifies the testing set up for you.

24 Nov 2019