Install, configure, and maintain Cloudera Data Platform (CDP) or Cloudera Distribution of Hadoop (CDH) clusters in production and non-production environments
Monitor cluster health, performance, and resource utilization to ensure high availability and reliability
Manage and support Hadoop ecosystem components (HDFS, YARN, Hive, Impala, Spark, HBase, Kafka, Oozie, Zookeeper, Ranger, Atlas)
Perform cluster upgrades, patches, and migrations with minimal downtime
Implement and maintain security controls using Kerberos, Ranger, and LDAP/AD integration
Manage user access, roles, and privileges across the Cloudera ecosystem
Automate routine administrative tasks using Python, Shell scripting, or Ansible
Troubleshoot cluster-related issues, performance bottlenecks, and job failures
Support backup and disaster recovery planning for Hadoop clusters
Collaborate with Data Engineers, Developers, and Architects to optimize data pipelines and workflows
Generate reports on cluster usage, job performance, and capacity planning
Ensure adherence to data governance and compliance using tools like Atlas and Ranger
Proactively suggest improvements in architecture and operations and stay updated with Cloudera/CDP advancements
Requirements
Cloudera Certified Administrator for Apache Hadoop (CCAH) or equivalent certification
Expertise with Cloudera/HDP/CDP distributions and enterprise big data platforms