The Machinist

Applied Machine Learning Engineer • End-to-end ML pipelines • Predictive systems • Decision automation • Risk modeling

About

I build machine learning systems and agentic AI pipelines that turn raw data into decisions, tools, and automated workflows. My focus is end-to-end: data engineering, feature design, model development, and production automation, with a particular interest in predictive systems, risk modeling, and operational intelligence.

Right now I'm working through a structured 5-phase agentic AI curriculum. Phase 1 was raw tool-calling loops. Phase 2 was persistent knowledge retrieval with hybrid RAG. Phase 3 (in progress) is self-correcting execution agents with LangGraph. Phases 4 and 5 cover multi-agent orchestration and production deployment with FastAPI and LangFuse. Each phase ships a real working system.

I'm drawn to systems that catch failures before they happen, get smarter from context, and hold up in production — predictive maintenance, autonomous research pipelines, agents that can debug and recover themselves.

Skills

Machine Learning & AI

  • Predictive Modeling & Risk Systems
  • Time Series Analysis

Agentic AI Systems

  • LLM Tool Calling & Agent Loops
  • RAG — Hybrid Retrieval & Reranking
  • LangGraph State Machines
  • Claim Verification & Grounding

Data & Engineering

  • Python, SQL, PostgreSQL
  • ETL Pipeline Design
  • Feature Engineering
  • Statistical Analysis

Infrastructure & Delivery

  • GitHub Actions & CI Automation
  • Streamlit & FastAPI
  • Netlify Deployment
  • Structured Run Logging & Cost Tracking

Learning Log

Knowledge Agent with Hybrid RAG — Project Deep Dive

Phase 2 upgrade of the research agent into a persistent knowledge system with hybrid retrieval (dense + BM25), cross-encoder reranking, and claim verification. This architecture moves from one-shot web lookup to reusable memory, improving routing reliability and grounded answer quality.

CLI Research Agent — Project Deep Dive

A language model that drives a multi-step research workflow autonomously. No frameworks, just direct API calls, a messages array, and a loop. Built to understand how agents actually work at the mechanical level.

Supervised Learning Models — Concept Exploration

A breakdown of the model families you reach for most in production —> linear models, tree-based ensembles, and gradient boosting. Deep dives on each.

Data Preprocessing — Concept Exploration

A systematic series on preparing data for machine learning —> feature scaling, encoding, missing data, and outlier treatment.

California Housing Price Prediction — ML Deep Dive

A hands-on ML learning series covering data loading, EDA, visualization, feature engineering, and stratified sampling — concept by concept.

Spaceship Titanic SQL Case Study

SQL-powered analysis uncovering how CryoSleep dictated passenger outcomes, debunking "planet" and "deck" myths, and revealing one true spatial anomaly.

Contact

Open to full-time opportunities and collaborative projects.