ChistaDATA Inc.

Enterprise-class 24*7 ClickHouse Consultative Support and Managed Services

  • ChistaDATA
    • Columnar Stores vs. ROW-Based Databases
    • Vectorized Query
    • High Performance Analytics
    • Digital Transformation
    • Data Warehousing
  • ChistaDATA Server
    • Real-Time Analytics
      • Hadoop to ClickHouse
      • Amazon RedShift to ClickHouse
    • Data Archiving
    • ClickHouse Unveiled
    • ClickHouse Consulting
      • ClickHouse Performance Audit
        • Pre- Engagement Questionnaire
    • Online Ticketing System
  • Support
    • Data Analytics
    • Data Warehousing
    • ChistaDATA Analytics Support
    • Gen AI
    • Online Ticketing System
  • Managed Services
    • Data Strategy
    • Why engage ChistaDATA?
    • ClickHouse Managed Services
    • ClickHouse Performance Tuning
    • DBaaS Optimization
    • Data SRE
    • Online Ticketing System
  • Data Science
  • ChistaDATA Fabric
    • Data Archiving
    • ChistaDATA ColumnStore
  • Blog
    • Shiv Iyer Talks
    • ChistaDATA Blog
  • Careers
  • Contact
  • Twitter
  • Facebook
  • LinkedIn
    • Shiv Iyer
  • GitHub
    • @ShivIyer
  • Medium
HomeAuthorsShiv Iyer

Articles by Shiv Iyer

About Shiv Iyer
Open Source Database Systems Engineer with a deep understanding of Optimizer Internals, Performance Engineering, Scalability and Data SRE. Shiv currently is the Founder, Investor, Board Member and CEO of multiple Database Systems Infrastructure Operations companies in the Transaction Processing Computing and ColumnStores ecosystem. He is also a frequent speaker in open source software conferences globally.
Website Facebook Twitter LinkedIn
Application Domain Index for Nested Data Structures
ChistaDATA

Optimizing Revenue for FinTech Platforms Using K-Means Algorithms in Google BigQuery

Shiv Iyer

Harnessing the sophisticated K-Means algorithm within Google BigQuery’s comprehensive analytics ecosystem presents a transformative opportunity to substantially enhance revenue optimization strategies for FinTech platforms. This advanced machine learning technique demonstrates remarkable proficiency in categorizing diverse […]

Machine Learning in ClickHouse
ClickHouse

Understanding ClickHouse MergeTree: Data Organization, Merging, Replication, and Mutations Explained

Shiv Iyer

Understanding ClickHouse MergeTree: Data Organization, Merging, Replication, and Mutations Explained ClickHouse is renowned for its high-performance analytics and its ability to efficiently handle massive amounts of data. At the core of ClickHouse’s data storage and […]

Tuning Linux for ClickHouse Performance
ClickHouse Performance

Why Delta Updates Are Not Recommended in OLAP Databases: A Performance and Efficiency Perspective

Shiv Iyer

Why Delta Updates Are Not Recommended in OLAP Databases: A Performance and Efficiency Perspective Delta Updates are not recommended in OLAP (Online Analytical Processing) databases due to the fundamental design and architecture of these systems, […]

ClickHouse Search: Manticore Full Text Search with Plain Index
ClickHouse Security

Mastering User Management in ClickHouse: A Complete Guide to Authentication, Authorization, and Future Security Enhancements

Shiv Iyer

User Management in ClickHouse: A Comprehensive Guide Introduction User management is a critical aspect of any analytical application, as it ensures secure access to data while maintaining flexibility for various users. In ClickHouse, user management […]

ClickHouse Performance

Integrating Parquet File Ingestion into ClickHouse Using Kafka: A Step-by-Step Guide

Shiv Iyer

Unlock the Power of Data: Seamlessly Integrate Parquet File Ingestion into ClickHouse with Kafka – Your Ultimate Step-by-Step Guide to Optimized Performance! To ingest Parquet files into ClickHouse using Kafka, you can follow a structured […]

ClickHouse Data Compression Techniques for Time-series Datasets
ClickHouse

Optimizing Non-SARGable Predicates in ClickHouse for Improved Query Performance

Shiv Iyer

Non-SARGable (Search ARGument ABLE) predicates are conditions in SQL queries that prevent the database engine from using indexes efficiently, leading to full table scans and degraded query performance. Implementing and handling Non-SARGable predicates in ClickHouse […]

Tuning ClickHouse for High-Velocity Data Ingestion in Distributed Tables
ClickHouse Performance

Implementing Tiered Storage in ClickHouse: Leveraging S3 for Efficient Data Archival and Compliance

Shiv Iyer

Using tiered storage like S3 for archiving data in ClickHouse is a common strategy for handling large volumes of data efficiently, particularly for compliance purposes where data must be retained but is queried infrequently. Here […]

When to Avoid Indexing in ClickHouse for Optimal Performance
ClickHouse Security

Implementing Custom Access Policies in ClickHouse: A Comprehensive Guide

Shiv Iyer

Implementing access policies in ClickHouse similar to SQL Server’s Purview Access Policies requires combining ClickHouse’s built-in access control mechanisms with additional scripting and possibly external tools. ClickHouse does not have a direct equivalent to SQL […]

ClickHouse Backup & DR

Implementing an Oracle RMAN-Like Backup and Recovery Toolkit for ClickHouse

Shiv Iyer

Implementing a comprehensive backup and recovery toolkit for ClickHouse is essential to ensure data integrity, consistency, and reliability, forming the core of ClickHouse Data Reliability Engineering. While ClickHouse lacks a built-in tool as comprehensive as […]

ClickHouse Redo Operations for Data Reliability
ClickHouse Performance

Efficient Strategies for Purging Data in ClickHouse: Real-Life Use Cases and Detailed Implementation

Shiv Iyer

Efficiently purging data from ClickHouse is crucial for maintaining performance and managing storage costs, especially when dealing with large, real-life datasets. Here are some detailed strategies, complete with real-life data sets and use cases: 1. […]

Posts pagination

« 1 … 3 4 5 … 27 »

ChistaDATA is committed to open source software and building high performance ColumnStores

In the spirit of freedom, independence and innovation. ChistaDATA Corporation is not affiliated with ClickHouse Corporation 

Tell us how we can help!

Loading

Search ChistaDATA Website

★READ THIS WARNING★

* Everything changes over time – Our blogs/posts and comments changes over time, That’s how it should be! Whatever we comment from ChistaDATA Inc. Teams (including Shiv Iyer) and other stakeholders or guest bloggers posted here are never permanent, These things worked for us. But, there is no guarantee they will work for you too, When using the recommendations from ChistaDATA or MinervaDB or MinervaSQL or any other online resources / Google,  You must test the advice before applying them to your production systems, and always invest for a robust Database DR solution, Thank you for understanding. 

Recent Posts from ChistaDATA

  • Avoiding ClickHouse Fan Traps : A Technical Guide for High-Performance Analytics
  • Open Source Data Warehousing and Analytics
  • Implementing Data Level Security on ClickHouse: Complete Technical Guide
  • ClickHouse ReplacingMergeTree Explained
  • Building Fast Data Loops in ClickHouse®

☎ TOLL FREE PHONE (24*7)

(844)395-5717

🚩 ChistaDATA Inc. FAX

+1 (209) 314-2364

CORPORATE ADDRESS: HOUSTON

ChistaDATA Inc.,
1321 Upland Dr. PMB 19322, Houston,
TX, 77043, US

CORPORATE ADDRESS: CALIFORNIA

ChistaDATA Inc.
440 N BARRANCA AVE #9718 COVINA,
CA 91723

CORPORATE ADDRESS: NEW CASTLE, DELAWARE

ChistaDATA Inc.,
256 Chapman Road STE 105-4,
Newark, New Castle 19702,
Delaware

CORPORATE ADDRESS: DELAWARE

ChistaDATA Inc.,
PO Box 2093 PHILADELPHIA PIKE #3339
CLAYMONT, DE 19703

HOW CAN WE HELP?

We are committed to building Optimal, Scalable, Highly Available, Reliable, Fault-Tolerant and Secured Database Infrastructure Operations for WebScale to our customers globally

PostgreSQL is a registered trademark of the PostgreSQL Community Association. ClickHouse is a registered trademark of ClickHouse, Inc. MongoDB is a registered trademark of MongoDB, Inc. Couchbase is a registered trademark of Couchbase, Inc. Redis is a registered trademark of Redis Ltd. Apache Cassandra is a registered trademark of the Apache Software Foundation. Milvus is a registered trademark of Zilliz. MinIO is a registered trademark of MinIO, Inc. Amazon Redshift and Amazon Aurora are registered trademarks of Amazon.com, Inc. Google Cloud is a registered trademark of Google LLC. Snowflake is a registered trademark of Snowflake Inc. Databricks is a registered trademark of Databricks, Inc. MySQL and InnoDB are registered trademarks of Oracle Corporation. Oracle is a registered trademark of Oracle Corporation. MariaDB is a trademark of MariaDB Corporation Ab. All other trademarks are property of their respective owners. Other product or company names mentioned may be trademarks or trade names of their respective owner. Copyrights © 2010-2025. All Rights Reserved by ChistaDATA®.