Hmm, I think this is best controlled via CH Proxy and extra infrastructure. I have raised this in the 2021 roadmap discussion, with a chart of the architecture (using a temporary ClickHouse cluster in k8s).
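The proxy-level split suggested above can be pictured with a small sketch. This is plain Python, not CH Proxy's actual configuration or API, and the node addresses are made-up assumptions; it only shows the routing rule: inserts go to write nodes, everything else to read replicas.

```python
# Sketch of read/write splitting at a proxy layer.
# Node addresses and the routing rule are illustrative assumptions,
# not CH Proxy's real configuration format.

WRITE_NODES = ["ch-write-1:9000", "ch-write-2:9000"]               # hypothetical
READ_NODES = ["ch-read-1:9000", "ch-read-2:9000", "ch-read-3:9000"]  # hypothetical

def route(query: str, seq: int) -> str:
    """Send INSERTs to write nodes and everything else to read replicas,
    round-robining within each pool by request sequence number."""
    stmt = query.lstrip().split(None, 1)[0].upper() if query.strip() else ""
    pool = WRITE_NODES if stmt == "INSERT" else READ_NODES
    return pool[seq % len(pool)]
```

With this kind of routing in front of the cluster, query nodes never see insert traffic, which is the stability property the proposal is after.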
As far as I know, some ClickHouse users leverage the MergeTree engine's library or a temporary ClickHouse cluster (in k8s) to generate merge-tree files and move them directly to the local table paths corresponding to the specific distributed table. For example, the QQ Music use case is a proven one (unfortunately, the article is written in Chinese). If users send a large number of records (belonging to many shards) to a single write node, the write node needs to reduce the number of small files and improve the merge speed.

Do you agree with my proposal, or do you have any suggestions?
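The "reduce the number of small files" step above can be sketched in plain Python (this is not ClickHouse's actual part format or merge algorithm, just the idea): a write node collects many small incoming batches and pre-merges them into one sorted part per shard, so each shard receives one large file instead of many tiny ones.

```python
# Illustrative sketch: pre-merge many small insert batches into one
# sorted "part" per shard before shipping. Sharding by key modulo is an
# assumption for the example, not ClickHouse's sharding function.

from collections import defaultdict

NUM_SHARDS = 4  # assumption for the example

def shard_of(key: int) -> int:
    return key % NUM_SHARDS

def premerge(batches):
    """batches: list of lists of (key, value) rows from many small inserts.
    Returns one sorted part per shard -- one merge instead of one tiny
    part per incoming batch."""
    parts = defaultdict(list)
    for batch in batches:
        for key, value in batch:
            parts[shard_of(key)].append((key, value))
    return {shard: sorted(rows) for shard, rows in parts.items()}
```

The design point is that merging is amortized on the write node, so read nodes only ever attach already-merged data.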
At present, when users insert a large amount of records into a MergeTree distributed table, the insertion takes up a large amount of computing resources, which leads to performance decline or even query failures. As a DBA, I want to ensure that ClickHouse's performance is good and stable.

As far as I know, some ClickHouse users/developers leverage the MergeTree engine's library and an external compute engine (such as Spark) to generate MergeTree files, then move them directly to the local table paths corresponding to the specific distributed table. However, this way depends on extra development to reinvent the wheel (and requires a good understanding of ClickHouse internals), and it is not user friendly.

If ClickHouse could support a built-in read-write separation function, all insert operations would be carried out on the write nodes, which would compact/merge the input records; the merged files would then be distributed to the corresponding read nodes by shard. This would reduce the impact of inserts on query nodes and ensure stable query performance.

From my understanding, using a dedicated replica to implement the separation/splitting of reads and writes does work, but this approach is not flexible and may lead to wasted resources. For example, if I create a ClickHouse cluster with 10 shards and 2 replicas per shard (on 20 physical nodes), I will have 10 ClickHouse nodes dedicated to writes. So my question is: does the community have a plan to support a built-in read-write separation function?
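The dedicated-replica layout described above (10 shards, 2 replicas per shard, 20 nodes, one replica per shard reserved for writes) can be sketched as follows. The host names and the "replica 0 writes" convention are assumptions for illustration, not a ClickHouse configuration.

```python
# Sketch of the topology from the question: 10 shards x 2 replicas on
# 20 nodes, with replica 0 of each shard reserved for inserts.
# Host names are illustrative assumptions.

NUM_SHARDS = 10
REPLICAS_PER_SHARD = 2

def build_cluster():
    """Map each shard to its list of replica hosts."""
    return {
        shard: [f"node-{shard * REPLICAS_PER_SHARD + r}"
                for r in range(REPLICAS_PER_SHARD)]
        for shard in range(NUM_SHARDS)
    }

def write_nodes(cluster):
    """With dedicated read-write splitting, one replica per shard
    (here replica 0) handles all inserts -- 10 of the 20 nodes."""
    return [replicas[0] for replicas in cluster.values()]
```

This makes the resource-waste concern concrete: half of the cluster's nodes are pinned to the write role regardless of the actual insert load.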