🥼Evaluation, explainability, experiment:​
-
Business knowledge induced feature selection​
-
Visualize training metrics to test overfit​
-
Metric/Model that balance precision recall​
-
Add model calibration to pipeline Sklearn Pipeline Notes​
-
Split data by time group instead​
-
More and Better plots​
🧹Code clean up + feature:​
-
Incorporate post grid search feature selection into the pipeline. Sklearn Pipeline Notes​
-
MLFlow​ auto experiment and output
-
feature source lineage​
-
Top “K” based on the data sample ratio instead.​
-
Unsupervised cluster labeling​
⚡Runtime Performance:​
-
Polars ​
-
multi metrics param search using Optuna ​
-
Combine near miss, random sampling,​
-
Auto EDA with Sweetviz to detect data drift​