Skip to main content
Ctrl+K
Apache DataFusion in Python Apache DataFusion in Python
  • User Guide
  • Contributor Guide
  • API Reference
  • Links
  • GitHub
  • Rust API docs (docs.rs)
  • User Guide
  • Contributor Guide
  • API Reference
  • Links
  • GitHub
  • Rust API docs (docs.rs)

Section Navigation

  • User Guide
    • Introduction
    • Concepts
    • Data Sources
    • DataFrames
      • DataFrame Rendering
      • Execution Metrics
    • Common Operations
      • Registering Views
      • Basic Operations
      • Column Selections
      • Expressions
      • Joins
      • Functions
      • Spark-Compatible Functions
      • Aggregation
      • Window Functions
      • User-Defined Functions
    • IO
      • Arrow
      • Avro
      • CSV
      • JSON
      • Parquet
      • Custom Table Provider
    • Configuration
    • Distributing work
    • SQL
    • Upgrade Guides
    • Using AI Coding Assistants
  • Contributor Guide
    • Introduction
    • Python Extensions
  • API Reference
    • datafusion
      • datafusion.catalog
      • datafusion.context
      • datafusion.dataframe
      • datafusion.dataframe_formatter
      • datafusion.expr
      • datafusion.functions
        • datafusion.functions.spark
      • datafusion.input
        • datafusion.input.base
        • datafusion.input.location
      • datafusion.io
      • datafusion.ipc
      • datafusion.object_store
      • datafusion.options
      • datafusion.plan
      • datafusion.record_batch
      • datafusion.substrait
      • datafusion.unparser
      • datafusion.user_defined
  • Links
    • GitHub and Issue Tracker
    • Rust API Docs
    • Code of Conduct
    • Examples
  • User Guide
  • Common Operations

Common Operations#

The contents of this section are designed to guide a new user through how to use DataFusion.

  • Registering Views
  • Basic Operations
  • Column Selections
  • Expressions
    • Column
    • Literal
    • Boolean
    • Arrays
    • Lambda functions
    • Testing membership in a list
    • Conditional expressions
    • Structs
    • Functions
  • Joins
    • Inner Join
    • Left Join
    • Full Join
    • Left Semi Join
    • Left Anti Join
    • Duplicate Keys
    • Disambiguating Columns with DataFrame.col()
  • Functions
    • Mathematical
    • Conditional
    • Temporal
    • String
    • Casting
    • Other
  • Handling Missing Values
    • fill_null
  • Spark-Compatible Functions
    • Why a Separate Namespace?
    • DataFrame API
    • SQL
    • Function Reference
  • Aggregation
    • Setting Parameters
    • Grouping Sets
    • Aggregate Functions
    • User-Defined Aggregate Functions
  • Window Functions
    • Setting Parameters
    • Aggregate Functions
    • Available Functions
    • User-Defined Window Functions
  • User-Defined Functions
    • Scalar Functions
    • Aggregate Functions
    • Window Functions
    • Table Functions

previous

Execution Metrics

next

Registering Views

Apache Arrow DataFusion, Arrow DataFusion, Apache, the Apache feather logo, and the Apache Arrow DataFusion project logo

are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.