Release 0.207#
General#
Fix a planning issue for queries where correlated references were used in
VALUES.Remove support for legacy
JOIN ... USINGbehavior.Change behavior for unnesting an array of
rowtype to produce multiple columns.Deprecate the
reorder_joinssession property and thereorder-joinsconfiguration property. They are replaced by thejoin_reordering_strategysession property and theoptimizer.join-reordering-strategyconfiguration property.NONEmaintains the order of the joins as written and is equivalent toreorder_joins=false.ELIMINATE_CROSS_JOINSwill eliminate any unnecessary cross joins from the plan and is equivalent toreorder_joins=true.AUTOMATICwill use the new cost-based optimizer to select the best join order. To simplify migration, setting thereorder_joinssession property overrides the new session and configuration properties.Deprecate the
distributed_joinssession property and thedistributed-joins-enabledconfiguration property. They are replaced by thejoin_distribution_typesession property and thejoin-distribution-typeconfiguration property.PARTITIONEDturns on hash partitioned joins and is equivalent todistributed_joins-enabled=true.BROADCASTchanges the join strategy to broadcast and is equivalent todistributed_joins-enabled=false.AUTOMATICwill use the new cost-based optimizer to select the best join strategy. If no statistics are available,AUTOMATICis the same asREPARTITIONED. To simplify migration, setting thedistributed_joinssession property overrides the new session and configuration properties.Add support for column properties.
Add
optimizer.max-reordered-joinsconfiguration property to set the maximum number of joins that can be reordered at once using cost-based join reordering.Add support for
chartype toapprox_distinct().
Security#
Fail on startup when configuration for file based system access control is invalid.
Add support for securing communication between cluster nodes with Kerberos authentication.
Web UI#
Add peak total (user + system) memory to query details UI.
Hive connector#
Fix handling of
VARCHAR(length)type in the optimized Parquet reader. Previously, predicate pushdown failed withMismatched Domain types: varchar(length) vs varchar.Fail on startup when configuration for file based access control is invalid.
Add support for HDFS wire encryption.
Allow ORC files to have struct columns with missing fields. This allows the table schema to be changed without rewriting the ORC files.
Change collector for columns statistics to only consider a sample of partitions. The sample size can be changed by setting the
hive.partition-statistics-sample-sizeproperty.
Memory connector#
Add support for dropping schemas.
SPI#
Remove deprecated table/view-level access control methods.
Change predicate in constraint for accessing table layout to be optional.
Change schema name in
ConnectorMetadatato be optional rather than nullable.