Skip to main content

Recruit Reveal - Low-Latency Model Serving

Machine learning model deployment on Databricks with Python SQL UDFs for real-time NFL draft predictions.

Role: ML Engineer / Data Engineer2024
XGBoost Multi-class
Model Type
QB/RB/WR position prediction
Reproducible
Feature Engineering
Z-scores, OOF encoding, missing flags
MLflow
Model Tracking
@production alias deployment
SQL UDFs
Serving
JSON probability responses

Technology Stack

DatabricksPySparkSQL WarehouseUnity CatalogPythonXGBoostMLflowNode.jsExpressNext.js

Problem

NFL draft analysts need real-time player evaluation tools that can predict draft position and success probability across different positions. Traditional scouting relies on subjective evaluation, while data-driven approaches often lack the speed and accessibility needed for live draft analysis.

Architecture

Built an end-to-end ML pipeline on Databricks with real-time serving capabilities:

Raw Data → Feature Engineering → XGBoost Training → MLflow Registry → SQL UDFs → REST API → Next.js UI

Results & Impact

Model Performance

  • Cross-validation accuracy: 78% across all position predictions
  • Position-specific performance: QB (82%), RB (76%), WR (74%)
  • Sub-second latency: <200ms response times for prediction requests