stack

https://www.reddit.com/r/mlops/comments/1ephb1p/whats_your_mlops_stack/

https://ml-ops.org/content/state-of-mlops

https://docs.databricks.com/en/machine-learning/mlops/mlops-stacks.html

https://github.com/kelvins/awesome-mlops

usingdbt +feast +teradata

This article explains how feast is used: simplifying and standardizing the retrieval of a training dataset for cross project management. https://developers.teradata.com/quickstarts/manage-data/getting-started-dbt-feast-teradata-pipeline/

interested compatible FOSS stack components

typetool
data sourcesparkkafkapulsarapiwarehousepostgresql
data source transform orchestrationdbtsqlmesh
data source/model catalogdatahubamundsen-ioOpenMetadata
feature store/share/documentationfeast
model build/trainsklearn,spark,xgboostray
model code devmlflow
model registrymlflow
model servekubeflow
model orchestrationairflow,argoflow
IaCTerraform
CICDgithubgitlab workflowjenkins
data reportingapache-supersetmetabaseredashplotly-dash
service monitorprometheusgrafana

lineage visibility:

datahub supports sourcing lineage from the following:

  • majority of data sources listed above
  • feature storefeast
  • model registrymlflow
  • model orchestrationairflow

integration with dominating stack