Release 0.183#
General#
Fix planning failure for queries that use
GROUPINGand contain aggregation expressions that require implicit coercions.Fix planning failure for queries that contains a non-equi left join that is semantically equivalent to an inner join.
Fix issue where a query may have a reported memory that is higher than actual usage when an aggregation is followed by other non-trivial work in the same stage. This can lead to failures due to query memory limit, or lower cluster throughput due to perceived insufficient memory.
Fix query failure for
CHARfunctionstrim(),rtrim(), andsubstr()when the return value would have trailing spaces underVARCHARsemantics.Fix formatting in
EXPLAIN ANALYZEoutput.Improve error message when a query contains an unsupported form of correlated subquery.
Improve performance of
CAST(json_parse(...) AS ...).Add
map_from_entries()andmap_entries()functions.Change spilling for aggregations to only occur when the cluster runs out of memory.
Remove the
experimental.operator-memory-limit-before-spillconfig property and theoperator_memory_limit_before_spillsession property.Allow configuring the amount of memory that can be used for merging spilled aggregation data from disk using the
experimental.aggregation-operator-unspill-memory-limitconfig property or theaggregation_operator_unspill_memory_limitsession property.
Web UI#
Add output rows, output size, written rows and written size to query detail page.
Hive#
Work around ORC-222 which results in invalid summary statistics in ORC or DWRF files when the input data contains invalid string data. Previously, this would usually cause the query to fail, but in rare cases it could cause wrong results by incorrectly skipping data based on the invalid statistics.
Fix issue where reported memory is lower than actual usage for table columns containing string values read from ORC or DWRF files. This can lead to high GC overhead or out-of-memory crash.
Improve error message for small ORC files that are completely corrupt or not actually ORC.
Add predicate pushdown for the hidden column
"$path".
TPCH#
Add column statistics for schemas
tinyandsf1.
TPCDS#
Add column statistics for schemas
tinyandsf1.
SPI#
Map columns or values represented with
ArrayBlockandInterleavedBlockare no longer supported. They must be represented asMapBlockorSingleMapBlock.Extend column statistics with minimal and maximal value.
Replace
nullsCountwithnullsFractionin column statistics.