Apache DataFusion Ballista 0.6.0 Changelog#
Breaking changes:
Register tables in BallistaContext using TableProviders instead of Dataframe #1028 (rdettai)
Make TableProvider.scan() and PhysicalPlanner::create_physical_plan() async #1013 (rdettai)
FilePartition and PartitionedFile for scanning flexibility #932 [sql] (yjshen)
Implemented enhancements:
Update datafusion-cli to support Ballista, or implement new ballista-cli #886
Prepare Ballista crates for publishing #509
Add support for
create table asvia MemTable #1243 [sql] (Dandandan)Multiple files per partitions for CSV Avro Json #1138 (rdettai)
Implement INTERSECT & INTERSECT DISTINCT #1135 [sql] (xudong963)
add digest(utf8, method) function and refactor all current hash digest functions #1090 (Jimexist)
[crypto] add
blake3algorithm todigestfunction #1086 (Jimexist)remove hard coded partition count in ballista logicalplan deserialization #1044 (xudong963)
Implement Display for Expr, improve operator display #971 [sql] (matthewmturner)
ObjectStore API to read from remote storage systems #950 (yjshen)
fixes #933 replace placeholder fmt_as fr ExecutionPlan impls #939 (tiphaineruy)
Add BaselineMetrics, Timestamp metrics, add for
CoalescePartitionsExec, rename output_time -> elapsed_compute #909 (alamb)[Ballista] Add executor last seen info to the ui #895 (msathis)
Fixed bugs:
Test execution_plans::shuffle_writer::tests::test Fail #1040
Integration test fails to build docker images #918
Ballista: Remove hard-coded concurrency from logical plan serde code #708
How can I make ballista distributed compute work? #327
Fix compilation for ballista in stand-alone mode #1008 (Igosuki)
Documentation updates:
implement
approx_distinctfunction using HyperLogLog #1087 (Jimexist)[DataFusion] - Add show and show_limit function for DataFrame #923 (francis-du)
update docs related to protoc and optional syntax #902 (Jimexist)
Performance improvements:
Closed issues:
InList expr with NULL literals do not work #1190
update the homepage README to include values,
approx_distinct, etc. #1171[Python]: Inconsistencies with Python package name #1011
Wanting to contribute to project where to start? #983
delete redundant code #973
How to build DataFusion python wheel #853
Produce a design for a metrics framework #21
Merged pull requests:
For older versions, see apache/arrow/CHANGELOG.md