Analytical Skills, PySpark, Data Science, Bash Scripting, SQL, Python, Hadoop, HDFS, Communication Skills, Agile Methodology, Java, NoSQL, Scala, DevOps, Big Data
Candidate Profile
Experience Range
5+ years of experience in data engineering, data science or related fields, with demonstrated experience building and optimizing data pipelines, architectures and data sets.
Education Qualification
Bachelor’s or Master’s Degree in Computer Science, Computer Engineering or a related discipline is required.
Essential Skills
5-7 years of solid Python working experience
Experience with Big Data tools: Hadoop, Spark, Kafka, etc.
Experience with relational SQL and NoSQL databases, including Postgres and Cassandra
Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
Experience with cloud services: EC2, EMR, RDS, Redshift, Azure, Google, etc.
Additional Skill Sets And Expectations
Create and maintain optimal data pipelines and architectures.
Assemble large, complex data sets that meet functional/non-functional business requirements.
Identify, design and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.