Available for new opportunities

Hi there, I'm

Isha Shaw

I'm a

Software Developer at Publicis Sapient, passionate about building scalable data platforms, distributed pipelines, and cloud-native data solutions. I specialize in transforming complex datasets into reliable, high-performance systems that power analytics and business decisions.

2+ Years Experience
5+ Projects Built
10+ Technologies
Scroll
Isha Shaw
🏒
Currently at
Publicis Sapient
⚑
Role
SDE

Crafting digital
experiences that matter

I'm Isha Shaw, a Software Developer working as an SDE at Publicis Sapient. I love building clean, performant, and user-friendly applications. With a background in Computer Science, I bring both technical depth and creative thinking to every project.

When I’m not building distributed systems or data pipelines, you’ll usually find me writing technical articles, exploring emerging technologies, contributing to open-source projects, or enjoying a good cup of chai β˜•

πŸ’Ό
Software Developer SDE @ Publicis Sapient
πŸŽ“
B.Tech CSE Indian Institute of Information Technology Kalyani
πŸ“
Location Gurgaon, India
πŸš€
Open To Software Developer, Data Engineer
Download Resume

Work Experience

My professional journey so far β€” building real-world solutions at scale.

Software Development Engineer
Publicis Sapient
2024 – Present
  • Engineered configuration-driven data pipelines using Apache Beam, Dataflow & BigQuery, processing 5L+ daily records across 8+ extract types with a metadata-driven schema validation framework β€” cutting manual validation effort by 60% and new extract onboarding time by 40%.
  • Orchestrated end-to-end workflows via Apache Airflow, dynamically validating 10+ daily CSV files from Cloud Storage and triggering parameterized Dataflow jobs through XCom-based task communication β€” achieving a 99% production job success rate with modular, fault-isolated pipelines.
  • Implemented environment-specific configuration management with Helm charts and automated CI/CD deployments via Jenkins & Harness, reducing manual deployment effort by 50% across 3 environments.
Python Google Dataflow Kubernetes Apache Airflow Docker Jenkins Harness Apache Beam GCP
Trainee Engineer
Publicis Sapient
2023 – 2024
  • Developed a credit card application platform using Java, REST APIs, Docker, and Kubernetes, contributing to backend service development and containerized deployments.
  • Worked with containerization and orchestration tools including Docker and Kubernetes to deploy and manage microservice-based applications in development environments.
JAVA Kubernetes React Docker Jenkins REST APIs Kafka GCP

Skills & Technologies

A curated toolkit of languages, frameworks, and tools I use to bring ideas to life.

πŸ’»
Languages
Python Java SQL C++
πŸ”„
Data Engineering
Apache Beam PySpark Apache Spark BigQuery Kafka ETL/ELT Distributed Systems
☁️
Cloud & DevOps
GCP Cloud Storage Cloud Run Compute Engine Kubernetes Helm Terraform
πŸš€
CI/CD & Version Control
Jenkins Harness Git GitHub
πŸ› οΈ
Tools & Technologies
Linux REST APIs JIRA Postman

Featured Projects

Distributed Spark Streaming Lakehouse Platform Add project screenshot Full Stack
Distributed Spark Streaming Lakehouse Platform
A complete local distributed data engineering platform built using: Apache Spark Cluster, Apache Kafka, Spark Structured Streaming Delta Lake, MinIO (S3-Compatible Object Storage), Jupyter Notebook and Docker Compose This project simulates a modern cloud-style streaming lakehouse architecture locally.
Python Spark Kafka Delta Lake Docker
RAKSHA AI / ML
RAKSHA
A virtual self-defense trainer using real-time human pose analysis from webcam feeds. Processed 50+ hours of video data continuously, achieving 50% accuracy in assessing user performance and providing live feedback.
Python Flask Deep Learning Pose Estimation
SANGEET Django
SANGEET
A Music Box application built with Python Django. Features a keyword-based search engine that improves song discoverability and enhances the overall user experience.
Python Django SQL

Technical Articles

Deep dives, practical guides, and real-world case studies on distributed systems, cloud-native engineering, and modern data platforms.

What Are Parquet Files? How They Work and Why They’re Faster Than CSV
A technical article on Parquet file format, its columnar storage design, and how it optimizes performance for big data processing compared to traditional CSV files.
LangChain vs LangGraph: A Beginner’s Guide with Examples
An introductory comparison of LangChain and LangGraph frameworks for building LLM applications, covering their core features, use cases, and practical examples to help beginners choose the right tool for their AI projects.
How an API Request Flows from Client to Server (And Back) β€” A Beginner-Friendly Guide
A beginner-friendly article explaining the end-to-end flow of an API request, covering key components like DNS resolution, TCP/IP communication, load balancers, web servers, application servers, databases, and how they work together to process and respond to client requests.

Get In Touch

Have a project in mind, an opportunity, or just want to say hey? My inbox is always open!

πŸ“§
πŸ“
Location
Gurgaon, India
πŸ’Ό
Currently
SDE @ Publicis Sapient

Find me on