Andy Smith
02/14/2024, 2:59 PMScout
02/14/2024, 2:59 PMStaticPartitionsDefinition
, you would simply list the fixed number of partition keys you want to have. However, if you are using a partitioning scheme that could potentially generate a large number of partitions, such as a DailyPartitionsDefinition
over a long time range, you would need to implement logic to restrict the range of dates being considered. This could be done by specifying a start and end date that bounds the partitions to the desired number. The documentation provides guidance on best practices for partitioning, including avoiding over-partitioning, which can negatively impact performance. It suggests that a good rule of thumb is to limit the number of partitions to 10,000 or less for query performance and manageability. If you are looking to programmatically limit the number of partitions during a backfill operation, you could use the BackfillPolicy
to control how many partitions are backfilled per run. For instance, using BackfillPolicy.multi_run(max_partitions_per_run=1)
would execute the backfill in multiple runs, each run backfilling a single partition. Remember that the specific implementation details will depend on your use case and the partitioning scheme you are using. If you need further assistance with a particular partitioning scheme or backfill operation, please provide more context so that we can offer more targeted advice.