ConfiguredAssetDBFSDataConnector

class great_expectations.datasource.data_connector.ConfiguredAssetDBFSDataConnector(name: str, datasource_name: str, base_directory: str, assets: dict, execution_engine: great_expectations.execution_engine.execution_engine.ExecutionEngine, default_regex: Optional[dict] = None, glob_directive: str = '**/*', sorters: Optional[list] = None, batch_spec_passthrough: Optional[dict] = None, id: Optional[str] = None)

Extension of ConfiguredAssetFilesystemDataConnector used to connect to the Databricks File System (DBFS).

As a Configured Asset Data Connector, it requires every Data Asset it can connect to be listed explicitly. This gives fine-grained control over which Data Assets may be accessed, at the cost of more setup.

Parameters:
  • name (str) – required name for DataConnector

  • datasource_name (str) – required name for datasource

  • base_directory (str) – required base directory in which the DataConnector looks for files

  • assets (dict) – dict of asset configuration (required for ConfiguredAssetDataConnector). These can each have their own regex and sorters

  • execution_engine (ExecutionEngine) – Reference to ExecutionEngine

  • default_regex (dict) – optional regex configuration for filtering data_references

  • glob_directive (str) – glob for selecting files in directory (defaults to **/*)

  • sorters (list) – optional list of sorters for sorting data_references

  • batch_spec_passthrough (dict) – dictionary with keys that will be added directly to batch_spec
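
To make the parameters above more concrete, here is a minimal sketch of what a configuration for this connector might look like, written as a plain Python dict. The top-level keys mirror the constructor parameters; the path, asset names, and file names are hypothetical, and the "pattern"/"group_names" shape of the regex configuration is an assumption about the commonly used layout rather than something stated on this page. The match_groups helper is not part of Great Expectations; it only illustrates how a regex with named groups could filter data references.

```python
import re

# Hypothetical configuration; top-level keys mirror the constructor parameters.
data_connector_config = {
    "class_name": "ConfiguredAssetDBFSDataConnector",
    "base_directory": "/dbfs/example/data",  # hypothetical DBFS path
    "glob_directive": "**/*",  # the default shown in the signature
    "default_regex": {
        # Assumed shape: a pattern plus one name per capture group.
        "pattern": r"(.+)_(\d{4})-(\d{2})\.csv",
        "group_names": ["data_asset_name", "year", "month"],
    },
    # Every Data Asset must be listed explicitly; each may override the regex.
    "assets": {
        "taxi_trips": {},
        "weather": {"pattern": r"weather_(\d{4})\.csv", "group_names": ["year"]},
    },
}


def match_groups(file_name, regex_config):
    """Illustrative stand-in: map capture groups to their configured names."""
    match = re.fullmatch(regex_config["pattern"], file_name)
    if match is None:
        return None  # the file name is filtered out
    return dict(zip(regex_config["group_names"], match.groups()))


default_regex = data_connector_config["default_regex"]
matched = match_groups("taxi_trips_2023-01.csv", default_regex)
unmatched = match_groups("README.md", default_regex)
```

In this sketch a file name that does not fit the pattern yields None, which corresponds to the data reference being filtered out.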

get_available_data_asset_names() → List[str]

Return the list of asset names known by this DataConnector.

Returns:

A list of available Data Asset names
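
Because every Data Asset is configured explicitly, the returned names would plausibly be the keys of the assets dict. The sketch below illustrates that idea with a hypothetical stand-in function and made-up asset names; it is an assumption about the behavior, not the library's implementation.

```python
from typing import Dict, List


def available_data_asset_names(assets: Dict[str, dict]) -> List[str]:
    """Hypothetical stand-in: return the names under which assets were configured."""
    return list(assets.keys())


# Made-up asset configuration for illustration only.
assets = {"taxi_trips": {}, "weather": {}}
names = available_data_asset_names(assets)
```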