Data Engineer · DataOps · Cloud

Sandesh V H

Building data pipelines and the cloud platforms they run on. Two-plus years of shipping production systems with cost-conscious design.

2+ years experience · AWS Certified · Data · Cloud · Platform
01 About

A software engineer at the intersection of data and infrastructure.

I work end-to-end across the data and platform stack — streaming and batch ingestion, transformation, dimensional modeling, and the IaC, CI/CD, and observability that keep everything running in production.

I lean toward data engineering, but I'm comfortable across DataOps, DevOps, cloud architecture, and platform work. Most of what I ship is on AWS, with a track record of cost-conscious design, junior mentorship, and translating fuzzy stakeholder needs into systems that hold up.

"Perfect and useful as a reference doc." — client, on a data model workbook I delivered.
02 What I Do

Capabilities across the data and cloud stack.

Data Engineering

  • Streaming pipelines: Kafka Connect → S3 → Aurora, with Lambda for near real-time loads
  • Batch ETL/ELT into Redshift with AWS Glue, Apache Airflow, dbt, and Spark
  • Dimensional / EDW modeling — DIM/FACT design and analytical query remodeling
  • End-to-end data lineage and ODS-to-warehouse mapping

Cloud Architecture & DevOps

  • AWS architecture: VPC isolation, least-privilege IAM, cost-optimized service selection
  • Infrastructure as Code with Terraform, reproducible across environments
  • CI/CD with Bitbucket Pipelines, Azure DevOps, and GitHub Actions
  • Container release flow: Docker → ECR → ECS, automated end-to-end
  • Observability via CloudWatch + Slack/email alerting; security via WAF, KMS, Secrets Manager, SSM

Knowledge & Stakeholder Management

  • Delivered a Cloud ETL training program for new hires while remaining active on production work
  • Ran a SQL bootcamp for interns; led data-management training for the sales team
  • Host bi-weekly client calls and own project administration end-to-end
  • Junior engineer mentorship and cross-team collaboration
03 Tech Stack

Tools I reach for.

AWS
S3 · Redshift · RDS Aurora · Glue · MWAA · MSK Connect · EC2 · ECS · Lambda · VPC · IAM · KMS · Secrets Manager · SSM Parameter Store · WAF · CloudWatch · EventBridge · SNS
Data Engineering
Apache Airflow · Kafka Connect · dbt · Spark · AWS Glue · Data Modeling · ML Studio
DevOps & IaC
Terraform · Azure DevOps · Bitbucket Pipelines · GitHub Actions · Docker
Languages
Python · SQL
Databases
PostgreSQL · SQL Server · AWS Redshift · RDS Aurora
Visualization
Tableau
Low-Code
Microsoft Power Automate · Pentaho Data Integration
04 Impact & Highlights

Where my work has moved the needle.

$300 / month

Saved by running dbt Core inside MWAA, which eliminated the need for a separate dbt Cloud subscription.

64% cost cut

Deployed Kafka Connect on EC2 Auto Scaling instead of the managed alternative; client called it out as a standout result.

$3,000 / year

Saved on connector subscription costs by adopting OneDrive thumbnail generation as a free image-resize path.

"Perfect"

Client's verdict on the EDW data model workbook — adopted as a reference document for the team.

3 training programs

Cloud ETL training for new engineers, SQL bootcamp for interns, and data-management training for the sales team.

PDF rendering fix

Resolved long-standing image and table page-break issues in a legacy client system, while keeping output file size minimal.

Client ownership

Host bi-weekly client calls and own project administration, including payments, scoping, and stakeholder alignment.

Mentorship

Mentored junior engineers and led cross-team collaboration between Data Engineering and DevOps to unblock delivery.

05 Certifications

Verified credentials.

AWS

AWS Certified Cloud Practitioner

Amazon Web Services · CLF-C02

AWS

AWS Generative AI Applications Professional Certificate

Amazon Web Services · Coursera

06 Publications & Writing

Things I've published.

07 Education

Academic foundation.

M.Tech

Environmental Engineering

National Institute of Technology Karnataka (NITK) Surathkal

B.Tech (Honours)

Civil Engineering

College of Engineering Trivandrum (CET)

08 Contact

Let's build something.

The best place to reach me is LinkedIn — I'm always up for a conversation about data, cloud platforms, or interesting infrastructure problems.

Connect on LinkedIn