Write partitioning properties#
use-preferred-write-partitioning#
Type:
booleanDefault value:
true
Enable preferred write partitioning. When set to true and more than the
minimum number of partitions, set in preferred-write-partitioning-min-number-of-partitions,
are written, each partition is written by a separate writer. As a result, for some connectors such as the
Hive connector, only a single new file is written per partition, instead of
multiple files. Partition writer assignments are distributed across worker
nodes for parallel processing.
preferred-write-partitioning-min-number-of-partitions#
Type:
integerDefault value:
50
The minimum number of written partitions that is required to use connector
preferred write partitioning. If the number of partitions cannot be
estimated from the statistics, then preferred write partitioning is not used.
If the threshold value is 1 then preferred write partitioning is always
used.