I've open-sourced the script I use to find related blog posts (and to describe
why they're similar).
It works on any set of Markdown articles, so it should fit into any SSG workflow.
It uses embeddings to calculate the similarities, and GPT-4 to add the descriptive text.
A quick search didn't turn up either of these; the closest was this paper, which uses JSON-LD to record a vector reduced to 2 dimensions with t-SNE: https://hajirajabeen.github.io/publications/Metadata_for_Eme... (Şenay Kafkas et al., "Metadata standards for the FAIR sharing of vector embeddings in Biomedicine").
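
For anyone curious what the embedding-similarity step looks like, here is a minimal sketch, not the actual script. It assumes each article already has an embedding vector (in practice from an embedding model; the vectors and file names below are made up for illustration) and ranks the other articles by cosine similarity. The function names `cosine_similarity` and `related_posts` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def related_posts(slug, embeddings, top_k=3):
    """Return up to top_k (other_slug, score) pairs, most similar first."""
    target = embeddings[slug]
    scored = [
        (other, cosine_similarity(target, vec))
        for other, vec in embeddings.items()
        if other != slug  # never recommend the article to itself
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy 3-dimensional vectors standing in for real embeddings (assumption).
toy_embeddings = {
    "rust-intro.md": [0.9, 0.1, 0.0],
    "rust-async.md": [0.8, 0.2, 0.1],
    "sourdough.md": [0.0, 0.1, 0.9],
}

if __name__ == "__main__":
    for other, score in related_posts("rust-intro.md", toy_embeddings):
        print(f"{other}: {score:.3f}")
```

The descriptive text would then come from a separate step that passes each pair of related articles to the language model; that part is left out here because it depends on the API client used.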