this post was submitted on 17 Nov 2025
Data is Beautiful
Probably the only interesting part of that study to me is how they measure "erratic", which uses a measure they call "novelty". It's in appendix A1.
There is a decent primer on the sentence transformer (all-MiniLM-L6-v2) here:
https://medium.com/@rahultiwari065/unlocking-the-power-of-sentence-embeddings-with-all-minilm-l6-v2-7d6589a5f0aa
I'm not sure of a great primer on PCA. Roughly, it finds the dominant directions of a set of vectors, meaning the directions along which the vectors vary the most.
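In lieu of a primer, here's a rough sketch of what PCA does, using plain NumPy. The function name `pca_directions` and the toy data are made up for illustration; this is not the study's code, just the textbook approach of centering the vectors and taking an SVD.

```python
import numpy as np

def pca_directions(vectors, n_components=2):
    """Return the top principal directions, the projected data, and variances."""
    X = np.asarray(vectors, dtype=float)
    centered = X - X.mean(axis=0)  # PCA works on mean-centered data
    # Rows of vt are orthonormal directions, ordered by variance captured
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    projected = centered @ components.T
    explained_variance = singular_values[:n_components] ** 2 / (len(X) - 1)
    return components, projected, explained_variance

rng = np.random.default_rng(0)
# Toy stand-in for embeddings: points spread mostly along the (1, 1, 0) diagonal
t = rng.normal(size=(200, 1))
points = t * np.array([1.0, 1.0, 0.0]) + 0.05 * rng.normal(size=(200, 3))

components, projected, variance = pca_directions(points, n_components=2)
print(components[0])  # roughly +/-(0.707, 0.707, 0), the dominant direction
print(variance)       # first value is much larger than the second
```

The sign of each direction is arbitrary, so only its orientation matters. With real sentence embeddings the input would be a few hundred dimensions wide rather than three, but the steps are the same.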
With that novelty measurement, the erraticness seems to come from averaging over a seven-day window and then measuring Euclidean distance.
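Here's a minimal sketch of how I read that, and it is very much a guess: I'm assuming each day has one vector, and that a day's novelty is its Euclidean distance from the mean of the previous seven days. The appendix may define the window or the comparison differently, and the `novelty` function and toy data are mine, not theirs.

```python
import numpy as np

def novelty(daily_vectors, window=7):
    """Distance of each day's vector from the mean of the preceding window.

    Assumption: trailing window that excludes the current day. The first
    `window` days have no full history, so they are left as NaN.
    """
    X = np.asarray(daily_vectors, dtype=float)
    scores = np.full(len(X), np.nan)
    for t in range(window, len(X)):
        baseline = X[t - window:t].mean(axis=0)  # average of previous days
        scores[t] = np.linalg.norm(X[t] - baseline)  # Euclidean distance
    return scores

# Toy data: ten days of identical vectors, then the same with one outlier day
steady = np.zeros((10, 2))
jumpy = steady.copy()
jumpy[8] = [3.0, 4.0]

steady_scores = novelty(steady)
jumpy_scores = novelty(jumpy)
print(steady_scores[7:])  # all zeros: nothing ever changes
print(jumpy_scores[7:])   # spike of 5.0 on day 8, smaller echo on day 9
```

Note the echo on day 9: the outlier has entered the baseline window, so the following ordinary day also looks slightly novel. That is a side effect of this particular reading, not necessarily of the study's measure.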
I did have a pint just before reading and writing this, so there are probably some mistakes here.