About

Motivated and detail-oriented Data Engineer with a passion for transforming data into actionable insights. Strong foundation in data engineering principles and eager to contribute to the development of scalable and efficient data solutions. Proficient in programming languages and familiar with various data technologies, I am committed to continuously learning and enhancing my skills to support data-driven decision-making.

  • Birthday: 27 April 1996
  • Phone: +1 619-955-9806
  • City: San Diego, CA
  • Email: kharva.hitensh11@gmail.com

Interests

Data Engineering

Data Warehousing

Data Modeling

Scripting/Automation

Software Engineering

Visualization

Algorithms

ETL

Education

MS in Computer Science

August 2021 - May 2023
Relevant Coursework
  • Big Data Tools & Methods
  • Algorithm Analysis and Design
  • Scientific Database Technique
  • Machine Learning

B.E. in Electronics and Telecommunications

August 2014 - May 2018
Relevant Coursework
  • Data Structures & Algorithms
  • Object Oriented Programming
  • Operating Systems
  • Mathematics

Online Certification

Algorithms Specialization

IBM Data Engineering

Experience

Wind River Systems

June 2023 - Present
Data Developer, San Diego, CA

Tech Stack: Python, JavaScript, SQL, Looker, Snowflake, dbt, Github, AWS, CI/CD, Fivetran

  • Seamlessly channeled SurveyMonkey & Hootsuite data via Fivetran connectors into Snowflake, and created staging, intermediate and marts dbt SQL models to culminate it in Looker, providing intuitive analysis to People’s Branding Team.
  • Pioneered the integration of pre-commit hooks from dbt-checkpoint within our CI/CD framework, driving a 25% reduction in data model review time and a 30% increase in documentation accuracy, while ensuring over 90% compliance with naming conventions and enabling automated testing of critical data transformations in SQL models.
  • Collaborated with the data team to integrate Autoscaling Spectacles within the dbt pipeline by setting up data testing scenarios and to establish automated data validation for LookML models.
  • Implemented a CI pipeline in Python that automates the identification of changed dbt files affecting Looker tiles, retrieves queries for each dashboard tile in specific folders, and runs relevant tiles based on downstream model changes resulting in increased accuracy and decreased the risk of errors

Wind River Systems

May 2022 - March 2023
Data Developer Intern, San Diego, CA

Tech Stack: Python , JavaScript, SQL, Looker, Snowflake, dbt, Github, AWS, CI/CD

  • Designed and built data processing python models that successfully scanned, tagged, and masked PII fields in Snowflake schema, improving data privacy and security for data governance.
  • ISpearheaded the implementation of an autoscaling, YAML linting in CI/CD ecosystem in dbt to optimize resource allocation and streamline development workflow, resulting in consistency and correctness in configurations for faster deployments and reducing development time by 40%.
  • Scheduled and automated cron processes leveraging python scripts to report consolidated Looker dashboards, providing stakeholders with real-time insights, and enabling data-driven decisions.
  • Created accurate and consistent models in dbt using SQL to build scalable and maintainable ELT pipelines.
  • Developed data compare tool using Python and SQL to automate data comparison between development and production environments, improving data quality and efficiency in data management.

Bloomstack Technology

November 2020 - May 2021
Backend Python Engineer, Mumbai, India

Tech Stack: Python, MySQL, Looker, PostgreSQL, RESTFul API

  • Developed a data pipeline using Python and SQL to process semi-structured data from external RESTFul API, processing over 200 data points per day, and resulting in improved data management and enabling more effective analysis and reporting.
  • Established meaningful metrics to qualify social media marketing value generated and created Dashboards that automatically refreshed these metrics daily, saving 2 hours per day of manual work.
  • Designed and implemented scalable and secure backend systems using Python and Django on ERPNext using the open-source app-development framework Frappe.

Larsen and Toubro Infotech (LTIMindtree)

July 2018 - October 2020
Software Engineer, Pune, India

Tech Stack: Python, JavaScript, MySQL, MongoDb, Django, RESTFul API

  • Successfully developed and delivered web and mobile banking products and services for 9 countries, meeting all project requirements through the Agile software development life cycle and exceeding client expectations.
  • Automated testing & reporting processes in Python to analyze 30+ functional areas, increasing team efficiency by 20%.
  • Performed thorough analysis, unit testing, and integration testing with other applications of database objects and SQL statements before deployment to production servers.
  • Developed and delivered complex software applications through the Agile software development life cycle, including analyzing client requirements, planning, coding, testing, and implementation

Projects

F1 Data Analysis

F1 Data Analysis

Reddit Cloud Batch ELT

Stack Overflow Analysis

MotoGP Race Analytics

E-Commerce Data

Skills

Languages and Databases

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone upload.wikimedia.org vectorlogo.zone vectorlogo.zone

Cloud Technologies

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Tools

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Contact

My Address

San Diego, CA

Social Profiles

Email

hkharva3283@gmail.com

kharva.hitensh11@gmail.com

Contact

+1 619-955-9806