Input/Output#
Data Generator#
|
Create a DataFrame with some range of numbers. |
Spark Metastore Table#
|
Read a Spark table and return a DataFrame. |
|
Write the DataFrame into a Spark table. |
Delta Lake#
|
Read a Delta Lake table on some file system and return a DataFrame. |
|
Write the DataFrame out as a Delta Lake table. |
Parquet#
|
Load a parquet object from the file path, returning a DataFrame. |
|
Write the DataFrame out as a Parquet file or directory. |
ORC#
|
Load an ORC object from the file path, returning a DataFrame. |
|
Write a DataFrame to the ORC format. |
Generic Spark I/O#
|
Load a DataFrame from a Spark data source. |
|
Write the DataFrame out to a Spark data source. |
Flat File / CSV#
|
Read CSV (comma-separated) file into DataFrame or Series. |
|
Write object to a comma-separated values (csv) file. |
Clipboard#
|
Read text from clipboard and pass to read_csv. |
|
Copy object to the system clipboard. |
Excel#
|
Read an Excel file into a pandas-on-Spark DataFrame or Series. |
|
Write object to an Excel sheet. |
JSON#
|
Normalize semi-structured JSON data into a flat table. |
|
Convert a JSON string to DataFrame. |
|
Convert the object to a JSON string. |
HTML#
|
Read HTML tables into a |
|
Render a DataFrame as an HTML table. |
SQL#
|
Read SQL database table into a DataFrame. |
|
Read SQL query into a DataFrame. |
|
Read SQL query or database table into a DataFrame. |