Summary
Overview
Work History
Education
Skills
Portfolio
Hobbies and Interests
Timeline
Generic

ALEXANDER FEBRES

Summary

Software Engineer specializing in backend systems, large-scale data pipelines, and applied machine learning. Experienced in building distributed web-scraping frameworks, ETL workflows, and event-driven architectures on AWS. Skilled in designing REST APIs, orchestrating data workflows with Airflow/Metaflow, and integrating ML features such as embeddings, NLP, and semantic search into production systems. Background includes developing migration pipelines, compliance engines, LLM-powered enrichments, and high-volume data services for enterprise products. Currently completing a Master’s in Artificial Intelligence and Machine Learning (UAX, Spain). Strong focus on reliability, system design, and delivering clean data, insights, and automation for real-world applications.

Overview

7
7
years of professional experience

Work History

Senior Software Engineer

Yogi
10.2024 - Current
  • Designed and built Yogi, a modular web scraping framework for extracting product reviews, PDP data, and SERP results from major retailers.
  • Implemented Playwright-based scrapers with proxy rotation, session management, fingerprinting, and stealth techniques to bypass advanced anti-bot systems.
  • Developed production-grade review scrapers integrating with Apify, enabling large-scale distributed crawling and automated retries, throttling, and dataset exports.
  • Created scrapers compatible with multiple retailers using standardized schemas for reviews, PDP metadata, and SERP product catalogs to support downstream ML pipelines.
  • Engineered flexible pipelines capable of handling GraphQL endpoints, REST APIs, hidden internal APIs, browser automation, and JSON/XHR reverse engineering.
  • Built an intelligent scraper agent that inspects network requests and autonomously identifies the best source of product and review data (PDP, SERP, reviews endpoints).
  • Implemented CI/CD, monitoring, and testing to ensure scraper reliability, uptime, and automatic recovery during layout or endpoint changes.
  • Designed and implemented NLP pipelines for processing large-scale product reviews, including sentiment analysis, keyword extraction, and aspect-level insights, enabling structured understanding of customer feedback across retailers.
  • Built ML-driven PDP insights workflows that analyze product descriptions, features, and review sentiment to generate actionable product intelligence and optimization recommendations.
  • Integrated LLM-based and classical ML models into data pipelines to enrich scraped PDP and review data with sentiment scores, semantic embeddings, and insight summaries for downstream analytics and decision-making.

Software Engineer

Full Stack Labs
06.2023 - 11.2025
  • Implemented Apache Airflow pipelines to support the merger of the Prolaera and LCV learning management systems, enabling reliable large-scale data migration.
  • Led key ETL workflows to extract, transform, and load learning data objects from Prolaera into LCV while ensuring schema consistency and data integrity.
  • Contributed to the design and development of a public-facing API used by enterprise customers to automate compliance workflows and retrieve jurisdiction-specific requirements.
  • Improved the compliance engine by refactoring legacy logic and introducing a strategy-pattern architecture to handle complex rules such as prorations, carry-over credits, and jurisdiction-specific reporting periods.
  • Developed and maintained LMS features across Django and React applications, including exports, reporting views, and administrative tools.
  • Collaborated with cross-functional teams (product, data, QA) to optimize migration processes and ensure a seamless integration of systems.
  • Utilized Django, React, TypeScript, and Airflow to deliver stable, production-ready workflows and backend services.

Software Engineer

GAP - Growth Acceleration Partners
05.2022 - 04.2024
  • Collaborated with the data science team to build Metaflow and Prefect workflows for modeling building interventions aimed at reducing carbon emissions.
  • Integrated Comstock energy models to predict emissions for commercial buildings.
  • Developed workflows leveraging CBECS datasets to generate energy-efficiency benchmarks and support sustainability analytics.
  • Designed and implemented a secure, customer-facing API for uploading and consuming electricity usage data.
  • Built and deployed multiple ETL integrations to ingest, validate, transform, and load data from heterogeneous sources into internal data lakes.
  • Implemented a robust REST API used as the primary interface for interacting with the data engineering team’s data lake services.
  • Established coding guidelines and best practices to maintain code quality and consistency across the team.
  • Supported developers in resolving technical issues and contributed to tooling improvements across the platform.

Back-end Developer

Smartory
Arequipa
01.2021 - 05.2022
  • Played a key role in the development of a management platform for 'Petroperu' Company.
  • Played a critical role in developing a Smart tracking system for 'LogistiChange', a leading logistics company.
  • Effectively managed individual priorities and deadlines.

Programming Teacher / Full-stack Developer

Silabuz
Lima
03.2019 - 01.2021
  • Conducted specialized training for the International Labour Organization (OIT) course, contributing to the development of technical skills among participants.
  • As a Full-Stack Developer and Instructor, played a pivotal role in training and mentoring aspiring developers at a bootcamp hosted by Cargamos Company(Mexico).
  • Produced effective materials for various courses.
  • Had the opportunity to teach various courses tailored to students with diverse levels of expertise.

Web Developer

Freelancer
Arequipa
01.2019 - 10.2019
  • Designed and developed stock management systems for a couple of local restaurants.

Education

Bachelor of Science - Software Engineering

Santiago Mariño
Venezuela
10.2018

Master of Science - AI&ML

Universidad Alfonso X el Sabio
Madrid

Skills

    Languages: Python, JavaScript/TypeScript, SQL

    Backend & APIs: Django, DRF, FastAPI, Flask, REST API development, Authentication & RBAC, Schema modeling

    Web Scraping: Playwright, Apify, Anti-bot evasion (proxies, fingerprints, stealth), Reverse engineering APIs (REST, GraphQL, XHR), Distributed scraping frameworks

    Data Engineering & ETL: Apache Airflow, Prefect, Metaflow, Event-driven pipelines (S3, SQS, EventBridge), Bronze–Silver–Gold modeling, Data normalization & ingestion

    Cloud and DevOps: AWS (ECS Fargate, Step Functions, Lambda, S3, RDS, VPC, IAM), Docker, Terraform, CI/CD with GitHub Actions

    Databases: PostgreSQL, Relational modeling, Data lakes & S3 storage

Portfolio

Aether Data Platform (4-Repo Architecture)
End-to-end data & ML platform for collecting, processing, enriching, and serving product and review data.
Built using modular services:

Hermes – Distributed scraping engine (Playwright, proxy rotation, anti-bot).
Athanor – ETL orchestration (Bronze → Silver pipelines, S3, SQS, Step Functions).
Intel – ML enrichment layer (embeddings, NLP, signals, pgvector).
Aurum – FastAPI insights service (REST endpoints, semantic search).

Repos: https://github.com/Aether-Data-Platform
Demo video available upon request.

Hobbies and Interests

  • Technology
  • Martial Arts
  • Music

Timeline

Senior Software Engineer

Yogi
10.2024 - Current

Software Engineer

Full Stack Labs
06.2023 - 11.2025

Software Engineer

GAP - Growth Acceleration Partners
05.2022 - 04.2024

Back-end Developer

Smartory
01.2021 - 05.2022

Programming Teacher / Full-stack Developer

Silabuz
03.2019 - 01.2021

Web Developer

Freelancer
01.2019 - 10.2019

Bachelor of Science - Software Engineering

Santiago Mariño

Master of Science - AI&ML

Universidad Alfonso X el Sabio
ALEXANDER FEBRES