Apache DataFusion Ballista documentation


Python Client

  • Quick Start
    • Connecting to a Cluster
    • Configuration
    • SQL
    • DataFrame API
    • User Defined Functions
  • Querying S3 Data
    • Prerequisites
    • Environment Variables
    • Registering an S3 Object Store
    • Creating External Tables
    • Running Queries
    • Complete Example
    • Configuring S3 via SQL
    • Kubernetes Deployment
    • Troubleshooting
  • Jupyter Notebooks
    • Basic Usage
    • Converting Results
    • Example Workflow
    • Running a Local Cluster in a Notebook


Apache DataFusion Ballista, Arrow Ballista, Apache, the Apache feather logo, and the Apache DataFusion Ballista project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.