I'm looking for an answer that scales, but for my specific purpose, I have a 48th dimension vector. This could be represented as an array of 48 integers all between 0 and 255.
I have a large dictionary of these vectors, approximately 25 thousand of them.
I need to be able to take a vector that may or may not be in my database, and quickly find which vector from the database is closest. By closest, I mean in terms of traditional distance formula.
My code will end up in python but this is more a general question.
Brute Force is too slow. I need a near dictionary speed lookup. Anyone have an idea?