Job Description:Execute weekly server rebuilds (– nodes) with zero data loss and minimalperformance impact
- Perform Hadoop-level pre/post validations: cluster health, HDFS usage, replication, skew, and logs
- Coordinate with Data Center Ops and Unix Admins for hardware and OS-level tasks
- Reconfigure and reintegrate rebuilt nodes into the cluster
- Provide weekday and rotational weekend support across BDH and BDH clusters
Required Skills: Stronghands-on experience with Hadoop ecosystem (HDFS, YARN, MapReduce, HBase)
- Proficient in log analysis, volume/block checks, and skew troubleshooting
- Familiarity with open-source Hadoop distributions and production change controls
- Excellent communication and cross-team coordination skills
- Ability to work independently in a fast-paced, complex environment