BE / B.Tech / M.Tech / MCA / MSc (IT) or equivalent
Job Description
Work with state-of-the-art data frameworks and technologies like Dataflow(Apache Beam), Dataproc(Apache Spark & Hadoop), Apache Kafka, Google PubSub, Apache Airflow, and others.
You will work cross-functionally with various teams, creating solutions that deal with large volumes of data.
You will work with the team to set and maintain standards and development practices.
You will be a keen advocate of quality and continuous improvement.
You will modernize the current data systems to develop Cloud-enabled Data and Analytics solutions
Drive the development of cloud-based data lake, hybrid data warehouses & business intelligence platforms
Improve upon the data ingestion models, ETL jobs, and alerts to maintain data integrity and data availability
Build Data Pipelines to ingest structured and Unstructured Data.
Gain hands-on experience with new data platforms and programming languages
Analyze and provide data-supported recommendations to improve product performance and customer acquisition
Design, Build and Support resilient production-grade applications and web services
Who you are:
4+ years of work experience in Software Engineering and development.
Very strong understanding of Python & pandas library.Good understanding of Scala, R, and other related languages
Experience with data transformation & data analytics in both batch & streaming mode using cloud-native technologies.
Strong experience with the big data technologies like Hadoop, Spark, BigQuery, DataProc, Dataflow
Strong analytical and communication skills.
Experience working with large, disconnected, and/or unstructured datasets.
Experience building and optimizing data pipelines, architectures, and data sets using cloud-native technologies.
Hands-on experience with any cloud tech like GCP/AWS is a plus.