clustering and outlier detection

  • Local outlier factor
  • Isolation Forest
    • argued to be truly nonparametric (no manual definition of distance)
    • still requires a size parameter for the splits, e.g. the subsample size per tree (akin to choosing a cluster size)
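
A minimal sketch of the Isolation Forest idea in plain Python, to make the two points above concrete: the trees use only random axis-aligned cuts (no distance function), but a size parameter still has to be chosen. The names `fit_forest`, `anomaly_score`, `subsample_size` and `n_trees` are illustrative assumptions, not any library's API, and the toy data is synthetic.

```python
import math
import random


def average_path_length(n):
    """Expected path length of an unsuccessful binary-search-tree lookup over n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    harmonic = math.log(n - 1) + 0.5772156649  # H(n-1) approximated with Euler's constant
    return 2.0 * harmonic - 2.0 * (n - 1) / n


def build_tree(points, depth, max_depth, rng):
    """Isolate points with random axis-aligned cuts; no distance metric is involved."""
    if depth >= max_depth or len(points) <= 1:
        return {"size": len(points)}
    dim = rng.randrange(len(points[0]))
    low = min(p[dim] for p in points)
    high = max(p[dim] for p in points)
    if low == high:  # all values identical on this axis, cannot cut
        return {"size": len(points)}
    cut = rng.uniform(low, high)
    return {
        "dim": dim,
        "cut": cut,
        "left": build_tree([p for p in points if p[dim] < cut], depth + 1, max_depth, rng),
        "right": build_tree([p for p in points if p[dim] >= cut], depth + 1, max_depth, rng),
    }


def path_length(tree, x, depth=0):
    """Number of cuts needed to reach a leaf, plus a correction for unsplit leaves."""
    if "dim" not in tree:
        return depth + average_path_length(tree["size"])
    child = tree["left"] if x[tree["dim"]] < tree["cut"] else tree["right"]
    return path_length(child, x, depth + 1)


def fit_forest(data, n_trees=100, subsample_size=64, seed=0):
    """Build n_trees trees, each on a random subsample (the size parameter to choose)."""
    rng = random.Random(seed)
    subsample_size = min(subsample_size, len(data))
    if subsample_size < 2:
        raise ValueError("need at least 2 points per subsample")
    max_depth = math.ceil(math.log2(subsample_size))
    trees = [
        build_tree(rng.sample(data, subsample_size), 0, max_depth, rng)
        for _ in range(n_trees)
    ]
    return trees, subsample_size


def anomaly_score(forest, x):
    """Score in (0, 1]; values closer to 1 mean x is isolated after fewer cuts."""
    trees, subsample_size = forest
    mean_depth = sum(path_length(t, x) for t in trees) / len(trees)
    return 2.0 ** (-mean_depth / average_path_length(subsample_size))


# Toy data (assumption): one Gaussian blob plus a single far-away point.
data_rng = random.Random(42)
data = [(data_rng.gauss(0, 1), data_rng.gauss(0, 1)) for _ in range(200)] + [(10.0, 10.0)]
forest = fit_forest(data, n_trees=100, subsample_size=64)
inlier_score = anomaly_score(forest, (0.0, 0.0))
outlier_score = anomaly_score(forest, (10.0, 10.0))
print(inlier_score, outlier_score)
```

The far-away point should receive the higher score, because fewer random cuts are needed to isolate it. Changing `subsample_size` changes the tree depth limit and the score normalization, which is the tuning decision the note above refers to.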

dimensionality reduction

  • UMAP
  • autoencoder
  • LLM embeddings