PySpark Pipelines#

materialized_view([query_function, name, ...])

(Return a) decorator to define a materialized view in the pipeline and mark a function as the materialized view's query function.

table([query_function, name, comment, ...])

(Return a) decorator to define a table in the pipeline and mark a function as the table's query function.

temporary_view([query_function, name, ...])

(Return a) decorator to define a view in the pipeline and mark a function as the view's query function.

create_streaming_table(name, *[, comment, ...])

Creates a table that can be targeted by append flows.

append_flow(*, target[, name, spark_conf])

Return a decorator on a query function to define a flow in a pipeline.

create_sink(name, format[, options])

Creates a sink that can be targeted by streaming flows, providing a generic destination for flows to send data external to the pipeline.