You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Sumit KumarSK

Sumit Kumar

Data Engineer

$250/day
Dubai City, AE
8-15 years

Average response time: 1 hour

About Sumit

  • English

    Native or bilingual

  • Hindi

    Native or bilingual

Can work on-site
Dubai City (up to 50km)

Experience

  • Affine Analytics - Client(Cholamandalam MS General Insurance)
    Data Consultant
    October 2024 - March 2025 (5 months)
    • Developed Airflow DAGs to automate SQL DDL operations for multiple data load phases, improving efficiency by reducing manual updates and saving 4 developer hours per week.
    • Spearheaded the design of dimension tables, enabling real-time analytics and reporting on claims data, thereby reducing claims processing time by two days. Optimized redshift storage by column encoding.
    • Prepared a code for data ingestion from Flat Files, Oracle, and Redshift into the Bronze layer, followed by loading into the Silver layer for historical processing in the data warehouse using the medallion architecture.
    • Established linting tools across multiple python repo's, resulting in a 27% reduction in developer debugging time by enforcing consistent code structure and enabling automated pre-commit code correction.
  • Affine Analytics - Client(Expedia Group)
    Data Consultant
    January 2023 - October 2024 (1 year and 9 months)
    • Reengineered ETL data pipeline for Expedia recommendation email campaigns, improving data quality. Resolved diamond user tickets related to shopper campaigns data quality issues.
    • Slashed expenses by 47% ($51,300/Month) with data cleanup, lifecycle rules, job restructuring, trimming cluster sizes, dropping from $1,35,000/month to $83,700/month. Dropped the s3 storage to 2.3 PB from 3.3 PB.
    • Improved code, identified bottlenecks, and refined tasks. Achieved a 45% runtime reduction, bringing pipeline duration down from 11 hours to 6 hours, through code optimization, task parallelism in REST calls using multithreading.
    • Implemented the 62 source tables migration, shaping new columns for extensive data coverage, while ensuring integration, quality and availability for downstream teams and models by using spark framework.
  • Nokia(Payroll – Vertexplus pvt ltd)
    Data Engineer
    July 2020 - December 2022 (2 years and 5 months)
    • Led the design, implementation, and management of a high-performance Elasticsearch infrastructure (700M+ records) to serve data via REST APIs (Scalatra, Django) from Elasticsearch and S3 data lakes.
    • Collaborated with data scientists to integrate ML prediction models into the data engineering pipeline, eliminating the need for separate pipelines and streamlining end-to-end processing.
    • Improved data system performance and efficiency by optimizing PostgreSQL queries (reducing response times from minutes to seconds) and migrating Spark 2 to 3 on Kubernetes with Helm charts.

Recommendations

Be the first to recommend Sumit

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Lightbend Scala Language
    Lightbend Scala Language
  • Quiklabs GCP Learn to Earn Challenge – Essentials, Data, Security, Architecture
    Quiklabs GCP Learn to Earn Challenge – Essentials, Data, Security, Architecture

Categories