OBS! Ansökningsperioden för denna annonsen har
passerat.
Arbetsbeskrivning
About the role:
• Responsible for developing extraction scripts in Hive, Spark
• Responsible for tuning for performance the PIG scripts on HDFS
• Identify and fix data quality issues and find functional defects in data lake implementation
Desired Competences:
• Telecom Domain Knowledge - 1+year
• Knowledge of SQL and query optimization
• Working experience on HDFS
• Analyzes data issue reported by various internal and external users/teams and proposes the best solution to rectify the same
• Develop and test scripts for sourcing data from one database to another
• Oozie, Zookeeper utilities knowledge is a must Must-Have
• Good in working in an onsite/offshore model
• Knowledge on any ETL Tool like Talend is desirable
• SQL tuning and query optimization is essential
• Identify Quality gaps and provide recommendations to internal teams to
• cover the gap Script the data standardization, cleansing programs and perform necessary
• unit test Strong Data Quality and Problem solving skills is essential
• Good communication and interpersonal skills
Must have skills:
• Exposure to DevOps/ CICD and Agile/Scrum is a big plus
• Strong experience in BigData with knowledge on Telecom Set domain.
• Strong DWH concepts, SQL tuning and exposure to Data Quality improvement techniques
• ETL Knowledge is essential; Talend is preferred
• Pig, Hive, Spark, Cloudera exposure is a MUST
• 3+ years experience form work in Hadoop
Merits:
• Exposure to DevOps/ CICD and Agile/Scrum is a big plus