System Design Problem

Design an ML Feature Store

Commonly Asked By: Uber, Feast, Tecton, Airbnb, Google

  • Register feature definitions (name, type, entity, description, owner, SLA)
  • Ingest features from batch (Spark/Hive) and streaming (Flink/Kafka) sources
  • Serve features at low latency for online inference (< 5ms p99)
  • Serve features in batch for model training (point-in-time correct joins)
  • Feature versioning: track schema changes, backward compatibility
  • Feature sharing: discover and reuse features across teams
  • Point-in-time correctness: training rows use only feature values available at the label's event time (no data leakage)
  • Feature monitoring: drift detection, freshness alerts
  • Feature lineage: trace from raw data source → transformation → feature
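
To make a few of the requirements above concrete, here is a minimal in-memory sketch of feature registration, ingestion, online lookup, and a point-in-time ("as of") lookup. It is illustrative only and assumes a single process with integer timestamps. The names `FeatureDefinition`, `InMemoryFeatureStore`, `register`, `ingest`, `get_online`, and `get_as_of` are hypothetical and do not come from any particular feature store product.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class FeatureDefinition:
    """Hypothetical registry entry mirroring the metadata listed above."""
    name: str
    dtype: str
    entity: str
    description: str
    owner: str
    freshness_sla_seconds: int


class InMemoryFeatureStore:
    """Toy store: one sorted timeline of values per (feature, entity id)."""

    def __init__(self) -> None:
        self._definitions: Dict[str, FeatureDefinition] = {}
        # Parallel lists keyed by (feature name, entity id), sorted by timestamp.
        self._timestamps: Dict[Tuple[str, str], List[int]] = {}
        self._values: Dict[Tuple[str, str], List[float]] = {}

    def register(self, definition: FeatureDefinition) -> None:
        if definition.name in self._definitions:
            raise ValueError(f"feature already registered: {definition.name}")
        self._definitions[definition.name] = definition

    def ingest(self, feature: str, entity_id: str, timestamp: int, value: float) -> None:
        if feature not in self._definitions:
            raise KeyError(f"unknown feature: {feature}")
        key = (feature, entity_id)
        timestamps = self._timestamps.setdefault(key, [])
        values = self._values.setdefault(key, [])
        # Insert in timestamp order so late-arriving rows land in the right place.
        index = bisect_right(timestamps, timestamp)
        timestamps.insert(index, timestamp)
        values.insert(index, value)

    def get_online(self, feature: str, entity_id: str) -> Optional[float]:
        """Latest known value, as an online inference path would read it."""
        values = self._values.get((feature, entity_id))
        return values[-1] if values else None

    def get_as_of(self, feature: str, entity_id: str, event_time: int) -> Optional[float]:
        """Most recent value with timestamp <= event_time, or None if none exists.

        Values written after event_time are never returned, which is what
        prevents future data from leaking into a training row.
        """
        key = (feature, entity_id)
        timestamps = self._timestamps.get(key, [])
        index = bisect_right(timestamps, event_time)
        return self._values[key][index - 1] if index > 0 else None


store = InMemoryFeatureStore()
store.register(FeatureDefinition(
    name="trips_7d",
    dtype="float",
    entity="driver",
    description="Trips completed in the trailing 7 days",
    owner="marketplace-team",
    freshness_sla_seconds=3600,
))
store.ingest("trips_7d", "driver_1", timestamp=100, value=5.0)
store.ingest("trips_7d", "driver_1", timestamp=200, value=9.0)

training_value = store.get_as_of("trips_7d", "driver_1", event_time=150)
serving_value = store.get_online("trips_7d", "driver_1")
```

In this sketch a label observed at time 150 is joined with the value written at time 100, while the online path returns the value written at time 200. A real design would likely split these paths across a low-latency key-value store for serving and an offline columnar store for training joins; the single dictionary here is only a stand-in for both.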