Apache DataFusion Ballista 0.10.0 Changelog#

Full Changelog

Implemented enhancements:

  • Add user guide section on prometheus metrics #507

  • Don’t throw error when job path not exist in remove_job_data #502

  • Fix clippy warning #494

  • Use job_data_clean_up_interval_seconds == 0 to indicate executor_cleanup_enable #488

  • Add a config for tracing log rolling policy for both scheduler and executor #486

  • Set up repo where we can push benchmark results #473

  • Make the delayed time interval for cleanup job data in both scheduler and executor configurable #469

  • Add some validation for the remove_job_data grpc service #467

  • Add ability to build docker images using release-lto profile #463

  • Suggest users download (rather than build) the FlightSQL JDBC Driver #460

  • Clean up legacy job shuffle data #459

  • Add grpc service for the scheduler to make it able to be triggered by client explicitly #458

  • Replace Mutex<HashMap> by using DashMap #448

  • Refine log level #446

  • Upgrade to DataFusion 14.0.0 #445

  • Add a feature for hdfs3 #419

  • Add optional flag which advertises host for Arrow Flight SQL #418

  • Partitioning reasoning in DataFusion and Ballista #284

  • Stop wasting time in CI on MIRI runs #283

  • Publish Docker images as part of each release #236

  • Cleanup job/stage status from TaskManager and clean up shuffle data after a period after JobFinished #185

Fixed bugs:

  • build broken: configure_me_codegen retroactively reserved bind_host #519

  • Return empty results for SQLs with order by #451

  • ballista scheduler is not taken inline parameters into account #443

  • [FlightSQL] Cannot connect with Tableau Desktop #428

  • Benchmark q15 fails #372

  • Incorrect documentation for building Ballista on Linux when using docker-compose #362

  • Scheduler silently replaces ParquetExec with EmptyExec if data path is not correctly mounted in container #353

  • SQL with order by limit returns nothing #334

Documentation updates:

Merged pull requests: