Azure Blob File System (ABFS) is an important data store for big data users. This doc discusses the configuration details and use cases of Gluten with ABFS. To use an ABFS account as your data source, add the ABFS configuration listed below to your spark-defaults.conf. If you would like to authenticate with ABFS using additional auth mechanisms, please reach out using the ‘Issues’ tab.
Working with ABFS
Configuring ABFS Access Token
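To configure access to your storage account, replace <storage-account> with the name of your storage account and XXXXXXXXX with its access key in the following property: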
spark.hadoop.fs.azure.account.key.<storage-account>.dfs.core.windows.net XXXXXXXXX
Other authentication methods are not yet supported.
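As a minimal sketch of how the property name is assembled, the snippet below builds the spark-defaults.conf line for a given account. The helper name `abfs_account_key_conf` and the account name `mystorageaccount` are hypothetical and used only for illustration; the property pattern is the one shown above.

```python
def abfs_account_key_conf(storage_account: str, access_key: str) -> str:
    """Return one spark-defaults.conf line for an ABFS storage account.

    Hypothetical helper for illustration; not part of Gluten or Spark.
    """
    # Reject empty names and names with whitespace, which would corrupt
    # the whitespace-separated "key value" format of spark-defaults.conf.
    if not storage_account or any(c.isspace() for c in storage_account):
        raise ValueError("storage account name must be non-empty and contain no whitespace")
    prop = f"spark.hadoop.fs.azure.account.key.{storage_account}.dfs.core.windows.net"
    return f"{prop} {access_key}"


if __name__ == "__main__":
    # Placeholder values; substitute your own account name and key.
    print(abfs_account_key_conf("mystorageaccount", "XXXXXXXXX"))
```

Because the account name is part of the property key, it may be possible to list one such line per storage account, although this doc does not state whether multiple accounts are supported.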
Local Caching support
Velox supports a local cache when reading data from ABFS. Please refer to the Velox Local Cache section for more detailed configuration.