In VM-based environments, you must configure a parameter in Profiler
Scheduler in your instance to profile table data in non-default
buckets.
-
In Cloudera Data Catalog, make not of your environment
name in the Search menu.
-
Go
-
Search for your environment, then switch to the Data hubs
tab.
-
Open you Cloudera Data Hub by clicking its name.
-
Open the CM URL under Cloudera Manager Info.
-
In Cloudera Manager go to .
-
Search for the term Profiler Scheduler Spark conf.
The Profiler Scheduler Spark conf configuration
snippet appears.
-
Add
spark.yarn.access.hadoopFileSystems=s3a://default-bucket,s3a://bucket-1,s3a://bucket-2
to Profiler Scheduler Spark conf to enable profiling for
bucket-1 and bucket-2 non-default buckets.