Bear in mind that "any two high-dimensional vectors are almost always orthogonal...

frizkie · on Feb 17, 2025

Is this better rephrased as “any two vectors in a high-dimensional space are almost always functionally orthogonal”?

I have mostly a laypersons understanding of this idea but I would assume that it would be false to say that they are typically _entirely_ orthogonal?

aithrowawaycomm · on Feb 17, 2025

Yes, one more precise way to phrase this is that the expected value of the dot product between two random vectors chosen from a vector space tends towards 0 as the dimension tends to infinity (I think the scaling is 1/sqrt(dimension)). But the probability of drawing two truly orthogonal vectors at random (over the reals) is zero - the dot product will be very small but nonzero.

That said, for sparse high dimensional datasets, which aren't proper vector spaces, the probability of being truly orthogonal can be quite high - e.g. if half your vectors have totally disjoint support from the other half then the probability is at least 50-50.

Note that ML/LLM practioners use "approximate orthogonality" anyway.

viraptor · on Feb 17, 2025

https://softwaredoug.com/blog/2022/12/26/surpries-at-hi-dime... it's both much more likely to be actually orthogonal and almost always very close to orthogonal.

GeneralMayhem · on Feb 18, 2025

That link doesn't contradict the person you're replying to. Actual orthogonality still has a probability of zero, just as the equator of a sphere has zero surface area, because it's a one-dimensional line (even if it is in some sense "bigger" than the Arctic circle).

If you're picking a random point on the (idealized) Earth, the probability of it being exactly on the equator is zero, unless you're willing to add some tolerance for "close enough" in order to give the line some width. Whether that tolerance is +/- one degree of arc, or one mile, or one inch, or one angstrom, you're technically including vectors that aren't perfectly orthogonal to the pole as "successes". That idea does generalize into higher dimensions; the only part that doesn't is the shape of the rest of the sphere (the spinning-top image is actually quite handy).

esafak · on Feb 17, 2025

The visualization is useless. IF the 2D embeddings were any good they might be useful to R1's developers but still not to end users. What am I supposed to with it?

higuidebot · on Feb 17, 2025

No need to do anything in particular! Perhaps interesting to observe

TaurenHunter · on Feb 17, 2025

So the trick is to pick the dimensions that are relevant and discard the rest when calculating the distance.

dehrmann · on Feb 17, 2025

Alternatively, in a high-dimension space, everyone sits in their own corner.