About me (Registered since 24/12/2024)

Proficient in optimizing data storage and retrieval using file formats such as CSV
and Parquet.
Collaborated effectively with DevOps teams and provided production support to
resolve platform issues promptly.
Demonstrated ability in engaging with clients to understand requirements and
deliver solutions effectively.
Hands-on experience with AWS services including EMR, EC2, S3, and Athena.
Expertise in PySpark and Spark frameworks for large-scale data processing.
Proficient in MySQL database management and optimization.
Proficient in monitoring, upgrading, and installing clusters to ensure optimal
performance and reliability.

IT Skills

Education

  • 2018 - 2021
    Fergusson college,Pune

    MSc Tech (masters)

    India

    I have done masters in industrial mathematics with computer applications. This course involves a strong mathematics+statistics background with computer application subjects.

Key Skills and Competencies

Programming Languages: Python, PHP, MySQL, SQL, Bash Big Data Tools & Frameworks: Apache Spark, PySpark, Hadoop Cloud Technologies: AWS S3, AWS Glue, AWS EMR, AWS Athena, AWS RDS, AWS CloudFormation, AWS Redshift Database Management: MySQL, PostgreSQL, SQL Server Version Control: Git, GitLab Monitoring & Scheduling Tools: Apache Airflow, Jenkins, New Relic Other Tools: Postman, IntelliJ IDEA, phpStorm, DataGrip, JIRA

Work Experience

  • 2021 - Present
    Zimetrics Technologies Pvt.Ltd

    Data Engineer

    ROLES AND RESPONSIBILITIES
    Investigate assigned issues and create detailed plans for how to solve them. Discuss
    these plans with the team to ensure everyone is on the same page.
    Before handing over tasks for broader testing, perform thorough testing on your own to
    catch any issues early on.
    Always strive to provide efficient and accurate solutions to assigned tasks, aiming to
    prevent any problems in the development phase.
    Take responsibility for maintaining high-quality data from its raw form all the way through
    to the final reporting stage.
    Maintain strict adherence to data security measures to protect sensitive information.
    Use Jenkins pipeline to automate the deployment of code branches for testing purposes,
    ensuring smoother and more efficient testing processes.
    Regularly use Apache Airflow to schedule and manage the execution of data processing
    jobs.
    Work with raw data stored in AWS S3 buckets during development, applying necessary
    transformations and optimizations to improve efficiency.
    Organize data into structured formats within Athena tables to facilitate easier access and
    analysis.
    Manage optimized reporting data in MySQL databases, ensuring that it is readily
    available for use in various reporting interfaces.
    Keep thorough documentation of all engineering processes, workflows, and best
    practices related to data management, ensuring transparency and consistency across
    the team.
    Maintain open and effective communication with team members throughout all stages of
    projects to foster collaboration and ensure alignment on goals and tasks.

Languages

English
Intermediate

Certifications & Licences

  • 2022,2023

    Appreciation Award

    This award is given by my current organization for appreciation of my work. This award usually given to the employees after every 6 months.