Available for US roles · Senior level · Remote / San Diego

Rodolfo Mendivil Data & ML Engineer

8+ years building production data systems across AWS · Azure · GCP.
Expert in Databricks, Snowflake, real-time pipelines,
Healthcare AI and end-to-end MLOps. Worked with Aeromexico, Coca-Cola, Thomson Reuters.

8+
Years Experience
15+
Enterprise Projects
98%
Uptime Delivered
pipeline_config.py — rodolfo@mendivil-data
# Senior Data & ML Engineer

engineer = {
  name: "Rodolfo Mendivil",
  location: "San Diego, CA",
  experience: 8, # years
  platforms: [
    "Databricks", "Snowflake",
    "AWS", "Azure", "GCP"
  ],
  certifications: 5,
  open_to_work: True,
}

status = pipeline.run()
Databricks
Snowflake
AWS
Azure
GCP
Apache Spark
Kafka
Airflow
MLflow
Delta Lake
dbt
GraphQL
Power BI
Tableau
Databricks
Snowflake
AWS
Azure
GCP
Apache Spark
Kafka
Airflow
MLflow
Delta Lake
dbt
GraphQL
Power BI
Tableau
Building systems that scale

My name is Rodolfo Mendivil, and I bring over 8 years of experience in Data Analytics, Business Intelligence, Data Engineering, and Data Science. I specialize in building and optimizing production-grade data pipelines and architectures at scale.

I've delivered solutions for Fortune 500 companies including Aeromexico, Coca-Cola LATAM, and Tupperware, building real-time streaming pipelines, ML platforms, and cloud data warehouses that process terabytes daily.

My focus is on Databricks and Snowflake ecosystems — designing Delta Lake architectures, building MLOps pipelines on MLflow, and engineering cost-optimized Snowflake deployments that save companies thousands in compute credits.

Based in San Diego, CA. Open to senior Data Engineer and ML Engineer roles across the US (remote or hybrid).

Certifications
AWS Solutions Architect
Amazon Web Services
AWS
Azure Data Engineer
Microsoft
DP-203
Databricks Associate
Databricks
Spark
Snowflake SnowPro Core
Snowflake
Core
Google Professional Data Engineer
Google Cloud
GCP
Databricks & Snowflake
Databricks
Unified Analytics Platform · Delta Lake · MLflow
🧱
  • Unity Catalog — governance, data lineage, fine-grained access control
  • Delta Live Tables — declarative ETL with auto-scaling + SLA monitoring
  • MLflow + Feature Store — end-to-end model lifecycle, experiment tracking, serving
  • Databricks Workflows — multi-task job orchestration, CI/CD integration
  • Spark Structured Streaming — real-time pipelines, Kafka integration, watermarks
  • Photon Engine — vectorized query execution, 10x cost/performance gains
PySpark / Scala95%
Delta Lake Architecture92%
MLflow / MLOps88%
Databricks SQL90%
Delta Lake MLflow PySpark Unity Catalog DLT Photon Autoloader Workflows
Used In Production
Aeromexico revenue analytics pipeline — Delta Lake medallion architecture processing 500K+ flight records/day with real-time revenue aggregations.
Snowflake
Cloud Data Platform · Data Sharing · Snowpark
❄️
  • Multi-cluster Warehouses — auto-scaling, workload isolation, query acceleration
  • Snowpark Python/Java — ML pipelines and transformations without data movement
  • Data Sharing & Marketplace — live data exchange across orgs, zero-copy cloning
  • Dynamic Data Masking — column-level security, row access policies
  • Cost Governance — resource monitors, warehouse credit optimization, query profiling
  • Stream & Task — CDC pipelines, micro-batch processing, alerting
Snowflake SQL / Architecture93%
Snowpark (Python)85%
Cost Optimization91%
Data Governance88%
Snowpark SnowPro Core dbt + Snowflake Data Sharing Streams & Tasks Zero-Copy Clone Query Profiler
Published Guides
4 in-depth Snowflake articles on architecture, query optimization, resource monitors, and schema change management — available on dev.to.
See the pipelines run
03
📡
Pipeline Monitor
Live DataOps dashboard showing Airflow/Databricks job statuses, SLA health, data freshness, row counts and anomaly alerts — refreshing every 30s.
Airflow DataOps Observability
04
Streaming ETL Visualizer
Animated Kafka → Spark Structured Streaming → Delta Lake pipeline. Watch records flow in real-time with lag metrics and schema evolution events.
Kafka Spark Delta Lake
05
🧬
Feature Store Explorer
Browse a mock Databricks Feature Store catalog. Filter by entity, inspect lineage, freshness and sample data. Generate training datasets with one click.
Databricks MLOps Feature Store
Enterprise work
01
Aeromexico
BI Groups & Connections Revenue Analysis
Delta Lake medallion architecture processing 500K+ flight records/day. Databricks + Spark for real-time revenue aggregations. Impressive time-zone logic for worldwide flight arrivals. 40% faster reporting cycle.
Databricks Delta Lake Amazon S3 AWS Glue Redshift Power BI
02
Coca-Cola LATAM
DI BI Dashboards & Reports
End-to-end Azure data platform with ADF orchestration, Synapse Analytics for structured processing, and Power BI for executive dashboards. DAX calculations covering 12+ LATAM markets.
Azure Data Factory Synapse Analytics Azure Data Lake Power BI DAX Azure DevOps
03
Tupperware
Order Fulfillment ML Project
Azure Databricks ML pipeline for order fulfillment optimization. Trained and deployed models via Azure Machine Learning. Batch pipeline with ADF orchestration across multi-source file ingestion.
Azure Databricks Azure ML ADF Azure Data Lake Synapse Power BI
04
INTER Insurance
Oracle → Azure Data Migration
High-stakes Oracle-to-Azure migration for insurance company with sensitive data. Achieved 98% reduction in downtime. Azure DMS + ADF orchestration with automated validation and rollback.
Azure DMS ADF Oracle Data Pump Azure Blob Synapse PowerShell
What I build
🔧
Data Pipeline Development
ETL/ELT design, automated ingestion from any source, real-time Kafka streaming, data cleaning and transformation, workflow scheduling with Airflow or Databricks Workflows.
🏗️
Data Warehouse Architecture
Snowflake and Databricks lakehouse design, Delta Lake medallion architecture, schema optimization, cloud DW on Redshift, BigQuery, and Synapse Analytics.
🤖
MLOps & ML Engineering
End-to-end ML platforms on Databricks MLflow: feature engineering, model training, experiment tracking, deployment to REST APIs, monitoring and retraining pipelines.
☁️
Cloud Data Engineering
Multi-cloud architecture on AWS, Azure, and GCP. Cloud migrations, serverless data processing, container orchestration with Docker and Kubernetes, cost optimization.
🛡️
Data Quality & Governance
Data quality frameworks, lineage tracking, Unity Catalog governance, master data management, GDPR/CCPA compliance pipelines, automated anomaly detection.
📊
Analytics & BI Solutions
Power BI and Tableau dashboards, self-service analytics, predictive modeling, executive reporting automation, KPI frameworks and real-time operational metrics.
What clients say
"
★★★★★

Rodolfo Mendívil is an outstanding Data Engineer who played a pivotal role in our data transformation projects at Thomson Reuters. His attention to detail, problem-solving skills, and ability to handle complex datasets were truly impressive. He consistently delivered high-quality work under tight deadlines and was a pleasure to collaborate with.

JG
José Alejandro Gutiérrez
Thomson Reuters · Senior Manager
Let's work together

Open to senior Data Engineer and ML Engineer roles across the US. Remote, hybrid, or San Diego area. Available for contract and full-time opportunities.

📱
US: +1 (619) 400-3376
📍
421 Broadway Suite 5015, San Diego CA 92101
🩸 GlucoFlow — CGM Intelligence Platform Kafka · Databricks · MLflow · Snowflake · LLM Recommendations
Clinical demo · all recommendations require doctor validation
🏙️ Real-Time Rent Predictor — Tijuana & San Diego Kafka · XGBoost · Databricks · MLflow · Delta Lake
48K+ listings · MXN & USD · live exchange rate
❄️ Snowflake Query Optimizer — AI-Powered SQL Rewriter Cluster Keys · Partition Pruning · CTE Refactor · Credit Cost Estimator
SnowPro Certified · 4 real SQL scenarios