..

about

24 year old data engineer based in Bradford, UK — open to relocation or remote work.


skills

languages: Python, SQL

cloud & infrastructure: AWS (S3, Lambda, CloudWatch, Secrets Manager, EventBridge, RDS), Terraform, Docker

databases: PostgreSQL

libraries & frameworks: pandas, boto3, pg8000, psycopg2, moto, dbt

testing & devops: pytest, GitHub Actions, Git, TDD, agile

monitoring: Grafana


projects

ToteSys ETL pipeline — aug 2024 – sept 2024

  • collaborated across a six-person agile team, contributing to sprint planning, daily stand-ups, code reviews, and final project presentation
  • built a fully automated ETL pipeline across three Lambda functions: incremental extraction from a live PostgreSQL database using timestamp-based change detection, transformation into a Parquet-based OLAP star schema (7 dimension tables and 3 fact tables) using pandas and pyarrow, and loading into an AWS RDS data warehouse via SQLAlchemy
  • provisioned all AWS infrastructure using Terraform with remote state in S3, including separate Lambda dependency layers, resource tagging, and IAM roles
  • implemented CI/CD with GitHub Actions for automated Terraform deployment on merge to main, backed by a 1,214-line pytest test suite using moto for AWS service mocking
  • stack: Python, SQL, AWS (S3, Lambda, CloudWatch, Secrets Manager, EventBridge, RDS), Terraform, PostgreSQL, pandas, pg8000, boto3, moto, GitHub Actions

GDPR obfuscator — jan 2025 – mar 2025

  • versioned Python library for GDPR-compliant PII detection and obfuscation in AWS S3 CSV files
  • exposed as both a reusable library and CLI tool with a 515-line pytest test suite with moto-based S3 mocking
  • includes Black formatting, Safety dependency scanning, and DeepSource for test coverage reporting

FuelNearMe — apr 2026 – present

  • CLI tool fetching data from the government’s Fuel Finder REST API to find the cheapest nearby fuel stations by address using haversine distance calculations and bounding box pre-filtering
  • planned: integrate AWS Lambda to cache price data; enable historical price tracking
  • stack: Python, pandas, numpy, geopy, requests, uv

this site — aug 2023 – present

  • portfolio site built with Hugo and deployed on Cloudflare Pages via automated CI/CD
  • stack: Hugo, GitHub Actions

experience

IT systems administrator — Mortimer Site Services Ltd (oct 2022 – present)

  • deploy and manage Microsoft 365 and Exchange Online infrastructure
  • orchestrated website modernisation and migration to a new VPS, resolving a long-standing SSL issue
  • migrated company data from on-premises file storage to Dropbox

operations associate — GoPuff, Leeds (jan 2026 – present)

  • maintained consistent employment and continued independent technical development alongside exposure to real-time inventory systems in a high-volume operational environment

freelance data engineer — self-employed via Tech Returners/Northcoders (jan 2025 – mar 2025)

  • delivered a GDPR-compliant PII obfuscation tool on commission against a defined client brief
  • packaged as a standalone Python library with a full pytest test suite, deployed against CSV data in AWS S3

education

Northcoders — bootcamp certificate in data engineering (june 2024 – sept 2024)

  • 13-week intensive programme covering Python, SQL, PostgreSQL, AWS, Terraform, and data pipeline architecture

the University of Manchester — DipHE in biochemistry (sept 2021 – apr 2024)

  • completed two years of a BSc, awarded the DipHE qualification on exit
  • applied Python for data sanitisation and transformation; conducted statistical analysis (t-tests, ANOVA, regression) on experimental datasets
© 2026 alex schofield - theme: nostyleplease