Large number of tables

vincepell · May 9, 2023, 6:28pm

I have a use case where the customer requires data to be stored in separate tables. Each table is dynamically created and eventually deleted after a week or so. As a result, there are typically ~20,000 tables in the system at any given time. I know that this is not good database design, but, for various reasons, it cannot be changed. My question is: if CrateDB is clustered and the data is not sharded or partitioned, will it spread the tables across the cluster rather than putting all of them on a single node? Each table is not that big, ~100,000 rows.

hernanc · May 10, 2023, 7:16am

Hi,
Different tables will be on different shards, so CrateDB will spread the tables across the cluster,
but please review the "Sharding and partitioning guide for time-series data" tutorial on the CrateDB Community site.
In particular, we recommend a maximum of 1000 shards per node, so 20,000 tables, each with at least one shard, would need at least 20 nodes.
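
To make that arithmetic explicit, here is a rough sketch in Python. It assumes every table is created with a single shard (for example with `CLUSTERED INTO 1 SHARDS`) and that replica shards count towards the per-node limit in the same way as primaries; the function name and parameters are made up for illustration and are not part of CrateDB.

```python
import math


def min_nodes(tables, shards_per_table=1, replicas=0, max_shards_per_node=1000):
    """Estimate the minimum node count for a given number of tables.

    Assumption: each replica adds one more full copy of every shard,
    and all copies count towards max_shards_per_node.
    """
    total_shards = tables * shards_per_table * (1 + replicas)
    return math.ceil(total_shards / max_shards_per_node)


print(min_nodes(20_000))              # 20
print(min_nodes(20_000, replicas=1))  # 40
```

If tables are created without an explicit shard count, the number of shards per table may be higher than one, and the estimate grows accordingly.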
Two other approaches you may want to consider:

- If the tables can be grouped in some way, you may want to have multiple CrateDB clusters instead of a single one with so many tables.
- Perhaps you could combine the data into fewer tables but present it as if it were in separate "tables" using views?
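
As a rough illustration of the views idea, the Python sketch below only builds SQL strings; it does not talk to a cluster. It assumes a single shared table called `measurements` with a `dataset_id` column that identifies which logical "table" a row belongs to. Both names, and the helper functions, are hypothetical and would need to match your actual schema.

```python
import re

# Hypothetical names: one shared table plus a column that identifies
# the logical "table" each row belongs to.
SHARED_TABLE = "measurements"
KEY_COLUMN = "dataset_id"

_IDENTIFIER = re.compile(r"[a-z_][a-z0-9_]*")


def quote_ident(name):
    """Double-quote an identifier, rejecting anything unexpected."""
    if not _IDENTIFIER.fullmatch(name):
        raise ValueError(f"unsafe identifier: {name!r}")
    return f'"{name}"'


def quote_literal(value):
    """Single-quote a string literal, doubling embedded quotes."""
    return "'" + value.replace("'", "''") + "'"


def create_view_sql(view_name, dataset_id):
    """SQL for a view exposing one logical table's rows."""
    return (
        f"CREATE VIEW {quote_ident(view_name)} AS "
        f"SELECT * FROM {quote_ident(SHARED_TABLE)} "
        f"WHERE {quote_ident(KEY_COLUMN)} = {quote_literal(dataset_id)}"
    )


def retire_sql(view_name, dataset_id):
    """SQL to run once a logical table expires: drop the view, then its rows."""
    return [
        f"DROP VIEW {quote_ident(view_name)}",
        f"DELETE FROM {quote_ident(SHARED_TABLE)} "
        f"WHERE {quote_ident(KEY_COLUMN)} = {quote_literal(dataset_id)}",
    ]


print(create_view_sql("batch_0001", "0001"))
```

With this layout the weekly cleanup becomes a `DROP VIEW` plus a `DELETE` on the shared table, and the shard count depends on how the shared table is sharded rather than on the number of logical tables. Writes would go to the shared table directly rather than through the views.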
