Available for US roles · Senior level · Remote / San Diego

Rodolfo Mendivil Data & AI - ML Engineer

8+ years building production data systems across AWS · Azure · GCP.
Expert in Databricks, Snowflake, real-time pipelines,
Healthcare AI and end-to-end MLOps. Worked with Aeromexico, Coca-Cola, Thomson Reuters.

⚡ View Live Demos → Get In Touch

Years Experience

15+

Enterprise Projects

98%

Uptime Delivered

pipeline_config.py — rodolfo@mendivil-data

# Senior Data & AI - ML Engineer

engineer = {

name: "Rodolfo Mendivil",

location: "San Diego, CA",

experience: 8, # years

platforms: [

"Databricks", "Snowflake",

"AWS", "Azure", "GCP"

certifications: 5,

open_to_work: True,

}

status = pipeline.run()

About Me

Building systems that scale

My name is Rodolfo Mendivil, and I bring over 8 years of experience in Data Analytics, Business Intelligence, Data Engineering, and Data Science. I specialize in building and optimizing production-grade data pipelines and architectures at scale.

I've delivered solutions for Fortune 500 companies including Aeromexico, Coca-Cola LATAM, and Tupperware, building real-time streaming pipelines, ML platforms, and cloud data warehouses that process terabytes daily.

My focus is on Databricks and Snowflake ecosystems — designing Delta Lake architectures, building MLOps pipelines on MLflow, and engineering cost-optimized Snowflake deployments that save companies thousands in compute credits.

Based in San Diego, CA. Open to senior Data Engineer and ML Engineer roles across the US (remote or hybrid).

Certifications

AWS Solutions Architect

Amazon Web Services

AWS

Azure Data Engineer

Microsoft

DP-203

Databricks Associate

Databricks

Spark

Snowflake SnowPro Core

Snowflake

Core

Google Professional Data Engineer

Google Cloud

GCP

Platform Expertise

Databricks & Snowflake

Databricks

Unified Analytics Platform · Delta Lake · MLflow

🧱

Unity Catalog — governance, data lineage, fine-grained access control
Delta Live Tables — declarative ETL with auto-scaling + SLA monitoring
MLflow + Feature Store — end-to-end model lifecycle, experiment tracking, serving
Databricks Workflows — multi-task job orchestration, CI/CD integration
Spark Structured Streaming — real-time pipelines, Kafka integration, watermarks
Photon Engine — vectorized query execution, 10x cost/performance gains

PySpark / Scala95%

Delta Lake Architecture92%

MLflow / MLOps88%

Databricks SQL90%

Delta Lake MLflow PySpark Unity Catalog DLT Photon Autoloader Workflows

Used In Production

Aeromexico revenue analytics pipeline — Delta Lake medallion architecture processing 500K+ flight records/day with real-time revenue aggregations.

Snowflake

Cloud Data Platform · Data Sharing · Snowpark

❄️

Multi-cluster Warehouses — auto-scaling, workload isolation, query acceleration
Snowpark Python/Java — ML pipelines and transformations without data movement
Data Sharing & Marketplace — live data exchange across orgs, zero-copy cloning
Dynamic Data Masking — column-level security, row access policies
Cost Governance — resource monitors, warehouse credit optimization, query profiling
Stream & Task — CDC pipelines, micro-batch processing, alerting

Snowflake SQL / Architecture93%

Snowpark (Python)85%

Cost Optimization91%

Data Governance88%

Snowpark SnowPro Core dbt + Snowflake Data Sharing Streams & Tasks Zero-Copy Clone Query Profiler

Published Guides

4 in-depth Snowflake articles on architecture, query optimization, resource monitors, and schema change management — available on dev.to.

Projects

Enterprise work

Aeromexico

BI Groups & Connections Revenue Analysis

Delta Lake medallion architecture processing 500K+ flight records/day. Databricks + Spark for real-time revenue aggregations. Impressive time-zone logic for worldwide flight arrivals. 40% faster reporting cycle.

Databricks Delta Lake Amazon S3 AWS Glue Redshift Power BI

Coca-Cola LATAM

DI BI Dashboards & Reports

End-to-end Azure data platform with ADF orchestration, Synapse Analytics for structured processing, and Power BI for executive dashboards. DAX calculations covering 12+ LATAM markets.

Azure Data Factory Synapse Analytics Azure Data Lake Power BI DAX Azure DevOps

Tupperware

Order Fulfillment ML Project

Azure Databricks ML pipeline for order fulfillment optimization. Trained and deployed models via Azure Machine Learning. Batch pipeline with ADF orchestration across multi-source file ingestion.

Azure Databricks Azure ML ADF Azure Data Lake Synapse Power BI

INTER Insurance

Oracle → Azure Data Migration

High-stakes Oracle-to-Azure migration for insurance company with sensitive data. Achieved 98% reduction in downtime. Azure DMS + ADF orchestration with automated validation and rollback.

Azure DMS ADF Oracle Data Pump Azure Blob Synapse PowerShell

✦ Personal Project 🩺 Healthcare AI 🤖 LLMOps + MLOps ⚡ Live Demo Available

05 · Personal · Healthcare AI

🩸 GlucoFlow — CGM Intelligence Platform

            Kafka → Databricks Feature Store → LSTM+LightGBM ensemble → MLflow → Snowflake · OhioT1DM public dataset
          

Real-time glucose forecasting platform that fuses three data streams: CGM sensor readings (every 5 minutes), exercise history with pharmacodynamic decay (aerobic vs anaerobic, intensity, recency), and medication pharmacokinetics (insulin on board curves, GLP-1, Metformin active states).

The system predicts glucose 30–120 minutes ahead using a personalized LSTM + LightGBM ensemble registered in Databricks Feature Store with 47 engineered features per patient. An LLM-powered recommendation engine generates natural-language guidance grounded in patient-specific patterns — always prompting doctor validation per FDA SaMD guidelines.

Built to target San Diego healthtech companies: Dexcom, Abbott, ResMed, Scripps Health — every architectural decision mirrors their production stack.

🎯

93% forecast confidence 30-min ahead glucose prediction · 5 clinical scenarios

⚡

142ms p95 pipeline latency Kafka ingest → feature compute → model inference

🧬

47 engineered features CGM velocity + exercise decay + medication IOB curves

🩺

FDA-aware architecture Rule engine + LLM guidance + Snowflake audit trail

📊

OhioT1DM public dataset CGM + insulin + meal + exercise · 12 T1D patients

Stack: Databricks MLflow Delta Lake Snowflake LightGBM LSTM SHAP Kafka Spark Streaming PySpark Feature Store FastAPI OhioT1DM

Services

What I build

🔧

Data Pipeline Development

ETL/ELT design, automated ingestion from any source, real-time Kafka streaming, data cleaning and transformation, workflow scheduling with Airflow or Databricks Workflows.

🏗️

Data Warehouse Architecture

Snowflake and Databricks lakehouse design, Delta Lake medallion architecture, schema optimization, cloud DW on Redshift, BigQuery, and Synapse Analytics.

🤖

MLOps & ML Engineering

End-to-end ML platforms on Databricks MLflow: feature engineering, model training, experiment tracking, deployment to REST APIs, monitoring and retraining pipelines.

☁️

Cloud Data Engineering

Multi-cloud architecture on AWS, Azure, and GCP. Cloud migrations, serverless data processing, container orchestration with Docker and Kubernetes, cost optimization.

🛡️

Data Quality & Governance

Data quality frameworks, lineage tracking, Unity Catalog governance, master data management, GDPR/CCPA compliance pipelines, automated anomaly detection.

📊

Analytics & BI Solutions

Power BI and Tableau dashboards, self-service analytics, predictive modeling, executive reporting automation, KPI frameworks and real-time operational metrics.