I've worked at both ends of the data stack: building production-grade ELT pipelines in Snowflake for a client project (Mercedes-Benz USA & Canada) at Infosys, and owning the full BI function at Troy Consultancy, where I built dashboards used in daily decision-making.
My current focus is AI-assisted data development with Snowflake Cortex: Cortex Code for pipeline development, and Cortex Analyst for natural language querying through YAML-based semantic models.
- Previously at Infosys (Mercedes-Benz USA & Canada) and Troy Consultancy
- Snowflake Certified: Data Engineering Professional (Feb 2026)
- Hands-on with Snowflake Cortex Code & Cortex Analyst
- Currently building AI-powered Snowflake data platforms
Core: Snowflake & Data Engineering
Cloud & Orchestration
Databases & Querying
BI & Programming
Concepts & Methods
End-to-end Snowflake data platform integrating AI-assisted development and semantic analytics.
Implemented Medallion Architecture (Bronze → Silver → Gold) to process 13+ source files into an analytical Star Schema with SCD Type 2. Used Cortex Code for AI-assisted SQL generation and pipeline development, and built a Cortex Analyst semantic model enabling natural language querying on 86K+ sales records without writing SQL.
Enterprise-style data warehouse on SQL Server with full Bronze → Silver → Gold implementation.
CRM and ERP source integration, stored procedure-based ETL, star schema modeling, and a Sales Data Mart with dim_customers, dim_products, and fact_sales.
Production-style Snowflake pipeline modeled on a food delivery platform.
Covers initial and delta loads, CDC using Streams, SCD Type 2 dimensions, a star schema fact table at order-item granularity, data governance with Tags and Masking Policies, and full automation via Stored Procedures and Tasks.
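
As a rough illustration of the SCD Type 2 pattern mentioned above (not the project's actual Snowflake code), here is a minimal Python sketch. The `DimRow` fields, the `apply_scd2` helper, and the sample customers are all hypothetical; the real pipeline would express the same logic in SQL driven by Streams and Tasks.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class DimRow:
    customer_id: int          # business key (hypothetical)
    city: str                 # tracked attribute (hypothetical)
    valid_from: date
    valid_to: Optional[date]  # None means "still current"
    is_current: bool


def apply_scd2(dim: list[DimRow], changes: dict[int, str], load_date: date) -> list[DimRow]:
    """Apply changed attribute values to a dimension while keeping history."""
    current = {row.customer_id: row for row in dim if row.is_current}
    for customer_id, new_city in changes.items():
        existing = current.get(customer_id)
        if existing is not None and existing.city == new_city:
            continue  # value unchanged, so no new version is needed
        if existing is not None:
            # Close out the old version instead of overwriting it.
            existing.valid_to = load_date
            existing.is_current = False
        # Insert the new version (or the first version for a new key).
        dim.append(DimRow(customer_id, new_city, load_date, None, True))
    return dim


dim = [DimRow(1, "Austin", date(2024, 1, 1), None, True)]
dim = apply_scd2(dim, {1: "Dallas", 2: "Toronto"}, date(2024, 6, 1))
```

After the call, customer 1 has two rows (the closed "Austin" version and the current "Dallas" version) and customer 2 has a single current row, which is the history-preserving behavior that distinguishes Type 2 from a plain overwrite.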
Enterprise-scale retail analytics for a 5M+ customer ecommerce company across 15 countries.
Built on Snowflake with ADLS as external stage, ingesting CSV, JSON, and Parquet. Implements Bronze → Silver → Gold layers, CDC with Streams, data quality pipelines, and Gold layer views for sales performance, customer segmentation, and product analytics.
| Project | Focus |
|---|---|
| Snowflake Streams & CDC | INSERT / UPDATE / DELETE tracking using Streams with AWS S3 |
| Snowflake Snowpipe: Automated Ingestion | End-to-end Snowpipe setup, configuration, and event-based triggering |
| Snowflake Semi-Structured Data Handling | Querying nested JSON using VARIANT and FLATTEN |
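
To give a feel for what flattening nested JSON means in the last row above, here is a small Python sketch of the same idea: one output row per element of a nested array, with the parent key repeated on each row. It only mimics the shape of the result, not Snowflake's `VARIANT` or `FLATTEN` themselves, and the order document and the `flatten_items` helper are made up for illustration.

```python
import json

# Hypothetical nested document, similar in shape to a JSON value in a VARIANT column.
raw = '{"order_id": 101, "items": [{"sku": "A1", "qty": 2}, {"sku": "B7", "qty": 1}]}'


def flatten_items(doc: dict) -> list[dict]:
    """Emit one row per element of the nested "items" array."""
    return [
        {"order_id": doc["order_id"], "index": i, "sku": item["sku"], "qty": item["qty"]}
        for i, item in enumerate(doc.get("items", []))
    ]


rows = flatten_items(json.loads(raw))
for row in rows:
    print(row)
```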
| Project | Focus |
|---|---|
| SQL Data Cleaning | Nulls, duplicates, standardization, type corrections on real-world data |
| MLB Analysis | Window functions, aggregations, and performance insights on MLB data |
| Restaurant Order Analysis | Menu and order data analysis for pricing trends and spending patterns |
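
The window-function work listed above follows a pattern that can be sketched with Python's built-in `sqlite3` module (this assumes the bundled SQLite is 3.25 or newer, which is when window functions were added). The `salaries` table and its rows are invented placeholders, not data from the MLB project.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE salaries (team TEXT, player TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO salaries VALUES (?, ?, ?)",
    [("A", "p1", 300), ("A", "p2", 500), ("B", "p3", 400), ("B", "p4", 200)],
)

# Rank players within each team and attach the team total to every row,
# without collapsing rows the way GROUP BY would.
ranked = conn.execute(
    """
    SELECT team, player, salary,
           RANK() OVER (PARTITION BY team ORDER BY salary DESC) AS team_rank,
           SUM(salary) OVER (PARTITION BY team) AS team_total
    FROM salaries
    ORDER BY team, team_rank
    """
).fetchall()

for row in ranked:
    print(row)
```

The design point is that `PARTITION BY` keeps every player row while still exposing per-team aggregates, which is what makes rankings and share-of-total calculations possible in a single query.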
| Project | Focus |
|---|---|
| Airbnb Dataset Cleaning | Missing values, outliers, type conversions, column normalization |
| Amazon Dataset Cleaning | Product data preprocessing structured for analytics or ML |
| Project | Focus |
|---|---|
| HR Data Analytics Report | Headcount, attrition, departmental performance, workforce KPIs |
| Personality Survey Report | Trait distributions and behavioral patterns from survey data |