Profiling table data in non-default buckets

In VM-based environments, you must set a Profiler Scheduler parameter in your instance before you can profile table data in non-default buckets.

  1. In Cloudera Data Catalog, make a note of your environment name in the Search menu.
  2. Go to Cloudera Management Console > Environments.
  3. Search for your environment, then switch to the Data hubs tab.
  4. Open your Cloudera Data Hub by clicking its name.
  5. Open the CM URL under Cloudera Manager Info.
  6. In Cloudera Manager, go to Configuration > Configuration Search.
  7. Search for the term Profiler Scheduler Spark conf.
    The Profiler Scheduler Spark conf configuration snippet appears.
  8. Add spark.yarn.access.hadoopFileSystems=s3a://default-bucket,s3a://bucket-1,s3a://bucket-2 to Profiler Scheduler Spark conf to enable profiling for the non-default buckets bucket-1 and bucket-2.
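
The value in the last step is a comma-separated list of s3a:// URIs, and it is easy to mistype by hand when many buckets are involved. The following is a minimal sketch of how such a line could be assembled, assuming plain bucket names as input. The helper name build_access_conf is hypothetical and is not part of Cloudera Data Catalog or Cloudera Manager; the bucket names are the placeholders used in the step above.

```python
def build_access_conf(default_bucket, extra_buckets):
    """Build the spark.yarn.access.hadoopFileSystems line for a set of buckets.

    Hypothetical helper for illustration only. Bucket names may be given
    with or without the s3a:// prefix; duplicates are dropped while the
    original order (default bucket first) is preserved.
    """
    uris = []
    for name in [default_bucket, *extra_buckets]:
        uri = name if name.startswith("s3a://") else f"s3a://{name}"
        if uri not in uris:
            uris.append(uri)
    return "spark.yarn.access.hadoopFileSystems=" + ",".join(uris)


# Placeholder bucket names taken from the example in the procedure.
print(build_access_conf("default-bucket", ["bucket-1", "bucket-2"]))
```

The printed line can then be pasted into the Profiler Scheduler Spark conf snippet in Cloudera Manager.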