Skip to main content

ConfiguredAssetS3DataConnector

class great_expectations.datasource.data_connector.ConfiguredAssetS3DataConnector(name: str, datasource_name: str, bucket: str, assets: dict, execution_engine: Optional[great_expectations.execution_engine.execution_engine.ExecutionEngine] = None, default_regex: Optional[dict] = None, sorters: Optional[list] = None, prefix: str = '', delimiter: str = '/', max_keys: int = 1000, boto3_options: Optional[dict] = None, batch_spec_passthrough: Optional[dict] = None, id: Optional[str] = None)#

Extension of ConfiguredAssetFilePathDataConnector used to connect to S3.

Being a Configured Asset Data Connector, it requires an explicit list of each Data Asset it can connect to. While this allows for fine-grained control over which Data Assets may be accessed, it requires more setup.

Parameters:
  • name (str) – required name for DataConnector

  • datasource_name (str) – required name for datasource

  • bucket (str) – bucket for S3

  • assets (dict) – dict of asset configuration (required for ConfiguredAssetDataConnector)

  • execution_engine (ExecutionEngine) – optional reference to ExecutionEngine

  • default_regex (dict) – optional regex configuration for filtering data_references

  • sorters (list) – optional list of sorters for sorting data_references

  • prefix (str) – S3 prefix

  • delimiter (str) – S3 delimiter

  • max_keys (int) – S3 max_keys (default is 1000)

  • boto3_options (dict) – optional boto3 options

  • batch_spec_passthrough (dict) – dictionary with keys that will be added directly to batch_spec

get_available_data_asset_names() List[str]#

Return the list of asset names known by this DataConnector.

Returns:

A list of available names