Software/ML/Data Engineer
Generalist Software Engineer specialized in Machine Learning Products, Data Platforms, and Python development.
Passionate about Software Development, Distributed Computing, Big Data, and Serverless architectures.
SecDevOps philosophy advocate.
Happy to solve challenging problems with modern technologies.
Updated: Oct 4, 2024
👨‍💻 Experience
Clarity AI
Clarity AI aims to contribute to a more socially efficient allocation of capital by providing decision makers with reliable and comprehensive tools to understand and optimize social and environmental impact, leveraging scientific research and the latest technologies.
Tech Stack
- Languages and Frameworks → Python, FastAPI, LangChain, Pandas, Dask, Spark, Yarn.
- Deployment and automation → Docker, Kubernetes, Airflow, AWS, EMR, GitLab CI.
- Databases and Storage → MongoDB, PostgreSQL, S3, Redshift.
- Cloud Managed Services → AWS EventBridge, SQS, QuickSight.
Lead Machine Learning Engineer - Data Science
January 2023 - Present
- Led technical teams of Data Scientists and Engineers (4-7 people) with 3 direct reports.
- Designed ML inference and training systems and data stores to manage over 15 ML estimation models, increasing delivery frequency from biannual to weekly and reducing operational costs by 75%.
- Launched new LLM-based products (assistants, chatbots, and agents) to engage web clients directly.
Senior Machine Learning Engineer - Data Science
June 2021 - January 2023
- Reduced ML development time from months to days with improved traceability and reproducibility through model tracking, versioning, monitoring, and custom tooling based on MLflow.
- Designed, developed, and deployed large-scale ML inference pipelines incorporating human-in-the-loop for NLP using AWS EMR, SageMaker, and Airflow.
Engineer II - Data Engineer
January 2020 - June 2021
- Designed and implemented data collection and automatic data validation systems based on Event Sourcing Architecture, scaling the system from fewer than 10 users to a hundred and improving new data delivery speed from biannual to daily.
- Designed and built the company’s Data Lake, data versioning, and lineage systems.
- Developed data pipelines in Python using Pandas and Dask, deployed over Airflow and Kubernetes.
BBVA Next Technologies
BBVA Next Technologies is a company inside the BBVA Group that works to transform the world through technology.
I worked with the Global Data Security Team, first developing and hardening different parts of the company's big data platform, and later as a lead developer on the Security Insights platform.
Tech Stack
- Languages and Frameworks → Python, Go, Hadoop.
- Deployment and automation → Terraform, Ansible, CloudFormation, Jenkins, DC/OS, Nomad, Docker.
- Databases and Storage → MongoDB, PostgreSQL, S3, Elasticsearch.
- Cloud Managed Services → AWS Kinesis, AWS Lambda, AWS Glue, AWS API Gateway.
Software Engineer - Security Insights
September 2019 - January 2020
- Developed a Hybrid Cloud solution to monitor security alarms across the BBVA cloud platform.
- Implemented data pipelines, ETL processes, and streaming services to ingest millions of daily events.
- Deployed services using cloud and serverless technologies (AWS Kinesis, Lambda, API Gateway, Glue, Aurora).
Software Engineer - Data Security
January 2018 - September 2019
- Managed security components of the BBVA Big Data platform based on Hadoop, Spark, DC/OS, and deployed over OpenStack (Hadoop Encryption, Hadoop KMS, Apache Ranger, OpenLDAP, Kerberos, Vault).
- Developed, deployed, and maintained core services to ensure data security requirements were met.
BBVA Labs
Developed a service to automate hyperparameter optimization for Machine Learning algorithms. Find the open source project here.
Software Engineer Intern - Innovation & Technology
November 2016 - June 2017
- Applied evolutionary algorithms to hyperparameter optimization.
- Designed REST APIs using Python, Flask, and MongoDB, deployed with Docker and Kubernetes (OpenShift).
📚Education
Georgia Institute of Technology
Master of Science - Computational Analytics
August 2019 - July 2021
GPA: 3.91/4.0
Course list
- ISYE 6501: Introduction to Analytics Modeling
- CSE 6040: Computing for Data Analytics
- MGT 8803: Business Fundamentals for Analytics
- CSE 6242: Data and Visual Analytics
- MGT 6203: Data Analytics in Business
- ISYE 6740: Computational Data Analysis
- CSE 6250: Big Data for Health Care
- ISYE 6414: Regression Analysis
- CS 7646: Machine Learning for Trading
- ISYE 6669: Deterministic Optimization
University Rankings
- #8 Best Graduate Engineering School
- #8 Best Computer Science Graduate Program
- #4 Most Innovative Schools
Universidad Politécnica de Madrid
Bachelor Degree in Software Engineering
September 2013 - July 2017
GPA: 8.53/10. Class Rank: 2
Unofficial Transcripts
- Honor Distinction in Graduation Project
- Award for best academic record in freshman year
Certifications
TOEFL Score: 107
Languages
- 🇬🇧→ C1
- 🇪🇸→ Native
Independent Courses
- Machine Learning | Stanford by Coursera
- Scalable Microservices with Kubernetes | Udacity
- Python: Basic and Advanced Training | Talentum Startups
- Node.js + Express + MongoDB | KeepCoding
Volunteering & Other Experience
Sports Club Claudina Thévenet
Ski Camp Counselor ⛷
2013 - 2018
I collaborated with the "Sports Club Claudina Thévenet" as a ski counselor. I took care of the campers and led them down the slopes. I helped organize the activities and decided the routes on the slopes.
Colores Contra la Violencia
Event Coordinator 🏃‍♀️🏃‍♂️
2014 - 2017
I participated in the non-profit event “Colores Contra la Violencia” (Colors against Violence). I made all the preparations for the event, coordinated the volunteers and solved problems during the event.
Casa Hogar Maria Inmaculada
Volunteer School Assistant 🏫
2011 - 2013
Volunteered providing school support for children at the foster home "Casa Hogar Maria Inmaculada" in Madrid.