About Rajesh Gupta
About me
(Registered since 06/07/2025)
Principal Data Engineer with 15+ years of experience building scalable, cloud-native data platforms and distributed data pipelines for real-time and batch processing. Hands-on expertise with Hadoop, Spark, PySpark, Hive, Pig, YARN, NoSQL (HBase/MongoDB), Kafka, Airflow, Docker, Kubernetes, and Jenkins across AWS and GCP environments. Experienced in orchestrating complex data workflows, ensuring data reliability, and enabling downstream analytics and machine learning systems. Strong background in leading engineering teams and delivering production-grade data infrastructure.