Tuples in ClickHouse

Tuples are collections of items with heterogeneous data types. ClickHouse tuples are used with IN operators and in Lambda functions. The tuples can not be empty and need at least one element in them. Tuples […]

Ingesting Data from a Kafka topic

Image Source – Pexels   Apache Kafka is a distributed event streaming platform developed by Apache Software Foundation. Visit the official page before proceeding for a detailed introduction to the basics of Kafka. The installation instructions are […]

Arrays in ClickHouse

Arrays are collections of items comprising similar data types. ClickHouse supports arrays as its column data type. The maximum allowed size of a ClickHouse array is 1 million. An array can hold elements entirely of […]

OPTIMIZE statement

Image Courtesy – Pexels – Vlad Chetan  The data is stored in multiple parts in MergeTree family of engines and data parts are merged asynchronously in the background. Refer to our article on Mergetree storage and […]

ReplacingMergetree Engine in ClickHouse

Image Source – Pexels Multiple heavy-weight table engines and functionalities of ClickHouse are built on top of the MergeTree engine. The MergeTree engine supports PRIMARY KEY expression, but it is not the same as the primary […]

ClickHouse July 2022 Release – Version 22.7

ClickHouse version 22.7 (July 2022) was unveiled on 21st July 2022. This release has around 25 new features, 19 performance improvement changes,  40+ other improvements and 50+ bug fixes. Here is the official list of […]

Running ClickHouse using Docker – Part 1

ClickHouse has a Client-Server model and running the ClickHouse server using docker should be fairly straightforward. Official docker images from the ClickHouse team are available in the docker hub. This article is intended to be […]

1 2 3