· 3-5 years’ implementation experience using open-source software stacks such as Hive, Python, Spark, and Scala
· 3-5 years’ experience coding in Python against any NoSQL database and/or Hive
· Strong data analysis skills (2-3 years working with data quality, data consolidation and data wrangling projects)
· Strong working experience in developing data manipulation code, including data extraction, data quality, and data structure (data relationships), and in loading the results into a structured database
· End-to-end experience in data lineage and in writing SQL, Python, and Scala code to source data from multiple systems, consolidate it on a single on-prem and cloud platform, and present it in an entity-relationship model
· Hands-on experience with data manipulation, statistical, prediction, and data analysis libraries, packages, and toolkits in open-source technologies including Scala, Spark, Python, or R
· Working experience in cloud-based data lake/analytics implementation using GCP and related cloud integration technologies
· Ability to work as an independent contributor and business knowledge of the banking industry will be an added advantage
Bachelors
B.E
Apache Spark
IT-Software - Software Services