23.74. Release 0.139
Dynamic Split Concurrency
The number of running leaf splits per query is now dynamically adjusted to improve
overall cluster throughput. task.initial-splits-per-node
can be used to set
the initial number of splits, and task.split-concurrency-adjustment-interval
can be used to change how frequently adjustments happen. The session properties
initial_splits_per_node
and split_concurrency_adjustment_interval
can
also be used.
General Changes
- Fix planning bug that causes some joins to not be redistributed when
distributed-joins-enabled
is true. - Fix rare leak of stage objects and tasks for queries using
LIMIT
. - Add experimental
task.join-concurrency
config which can be used to increase concurrency for the probe side of joins.
Hive Changes
- Remove cursor-based readers for ORC and DWRF file formats, as they have been replaced by page-based readers.
- Fix creating tables on S3 with CREATE TABLE AS.