Sarthak Zende - Data Engineer

Sarthak Zende

Data Engineer | Python, SQL, AWS | MS CS @ SUNY Binghamton

MS Computer Science graduate at SUNY Binghamton specializing in building scalable data pipelines and ETL/ELT solutions. Experienced in processing millions of records using AWS, Python, and modern data engineering tools. Architected serverless pipelines achieving 90% efficiency gains and 99.9% data accuracy. AWS Certified Cloud Practitioner actively seeking Data Engineering opportunities starting Aug 2025.

Tools & Technologies

Programming & Query Languages

PythonPython
SQLSQL
C++C++
CC

Cloud Platforms

AWSAWS
AzureAzure
AWS LambdaLambda
AWS S3S3
AWS GlueGlue
AWS AthenaAthena

Data Engineering & Analytics

Apache SparkApache Spark
SnowflakeSnowflake
Apache AirflowApache Airflow
dbtdbt
TableauTableau

Databases & Development Tools

PostgreSQLPostgreSQL
MongoDBMongoDB
DockerDocker
GitHubGitHub
LinuxLinux

Experience

Data Operations - Student Assistant

TAPS - Binghamton University

Vestal, NY

Sept 2023 - Present

  • Automated data collection pipeline using Python and Selenium to process parking occupancy datasets, reducing manual entry time from 30 minutes to 3 minutes per batch (90% efficiency gain) across 15+ daily cycles
  • Engineered Excel-to-Google Forms data integration system handling 500+ daily parking records for 40+ parking lots, eliminating repetitive manual processes and improving real-time parking app data accuracy by 85%
  • Managed data collection operations, implementing quality protocols that improved app accuracy for 17,000+ users

Software Engineer Intern

Microsoft - Future Ready Talent

Remote, Pune

Dec 2021 - June 2022

  • Architected information system using Python, SQL, Azure QnA Maker and Azure Bot Services, designing optimized data models and integration patterns for 40% faster query processing
  • Built scalable Azure Knowledge Base by designing and implementing data integration connectors to CDC database, ensuring continuous and reliable data synchronization
  • Developed automated data pipeline on Azure Cloud integrating 200+ verified sources, implementing robust data quality checks to maintain 99.9% information accuracy

Data Analytics Intern

KPMG - Forage Virtual Internship

Virtual/Remote

Nov 2021 - Dec 2021

  • Developed a robust data quality framework using Python and Pandas to audit 50,000+ records across accuracy, completeness, and consistency, improving data reliability by 15%
  • Engineered an automated data transformation pipeline using NumPy for preprocessing missing values, normalizing columns, and flagging anomalies via validation checks
  • Delivered actionable customer segmentation reports through Tableau dashboards, improving targeting strategies and stakeholder decision-making

Projects

Education

Master of Science in Computer Science

State University of New York at Binghamton

Binghamton, NY, USA

August 2023 - Dec 2025

Bachelor of Engineering in Computer Engineering

Savitribai Phule Pune University

Pune, IN

August 2019 - May 2023

Certifications

AWS Certified Cloud Practitioner

Verify Credential

Microsoft Certified: Azure Data Fundamentals

Verify Credential

Get In Touch

Feel free to reach out for collaborations or just a friendly hello!

Open to new opportunities • Available for relocation
szende@binghamton.edu