Vector similarity search using Redis Stack
Finding similar (or nearest) vectors is a popular task in ML workloads. One of the popular applications currently is querying LLMs enriched by a local knowledge base using the RAG approach. Redis is a popular and effective solution to work with different types of data, including vectors.
Installing Redis Stack
While standard Redis installation doesn’t support vectors and requires installing extensions, Redis Stack is an advanced pack that comes with vector support. To install Redis Stack visit installation instructions or in the case of Ubuntu run the following:
curl -fsSL https://packages.redis.io/gpg | sudo gpg --dearmor -o /usr/share/keyrings/redis-archive-keyring.gpg
sudo chmod 644 /usr/share/keyrings/redis-archive-keyring.gpg
echo "deb [signed-by=/usr/share/keyrings/redis-archive-keyring.gpg] https://packages.redis.io/deb $(lsb_release -cs) main" | sudo tee /etc/apt/sources.list.d/redis.list
sudo apt-get update
sudo apt-get install redis-stack-server
... (will install Redis Stack)
https://packages.redis.io/deb
— redis official debian/ubuntu sourceinstall redis-stack-server
— installs Redis Stack server
Storing vectors
We have multiple ways of storing vectors in Redis. First - is by using Redis hashes. In this case, we can pass vector as some specific key of hash together with other keys:
import numpy as np
from redis import Redis
= Redis(host="localhost", port=6379)
r 'doc_1', mapping={
r.hset('title': 'Some doc',
'vector': np.random.rand(8).astype(np.float32).tobytes()
})print(np.frombuffer(r.hget('doc_1', 'vector'), dtype=np.float32))
[0.4681091 0.5844017 0.21458998 0.15927918 0.29837957 0.13542081
0.9707261 0.6643426 ]
import numpy
— we’ll use Numpy to operate with vectorsr = Redis
— connect to Redis (Stack) serverr.hset
— set hash value (key/value pairs)mapping=
— allows setting multiple keys values at oncenp.random.rand(8)
— generate random vector of 8 dimensionstobytes()
— saving vectors in Redis requires converting vectors to bytes
Another way of storing vectors is to use JSON documents in a similar manner.
Creating vector index
Since vector similarity search requires a lot of computation work (calculating the distance between the query vector and all vectors in or db), Redis provides vector indexes to make things fast. To create a vector index, we should specify vector dimensions (since we can only search for similar vectors of the same dimension) and distance metric (L2 or Cosine based on your case):
from redis import Redis
from redis.commands.search.field import VectorField
from redis.commands.search.indexDefinition import IndexDefinition, IndexType
= Redis(host="localhost", port=6379)
r
= (
schema "vector",
VectorField("HNSW", {
"TYPE": "FLOAT32",
"DIM": 8,
"DISTANCE_METRIC": "L2",
}
),
)
'my_index').create_index(
r.ft(=schema,
fields=IndexDefinition(prefix=['obj:'], index_type=IndexType.HASH)
definition
)
print('index created')
Index created
VectorField("vector"
— should contain the name of our hash field with vector,HNSW
— the type of vector index, HNSW is one of the ways to optimize search across large amounts of vectors,FLOAT32
— the type of vector coordinates data,"DIM": 8
— number of dimensions of vectors,DISTANCE_METRIC
— we’ve chosenL2
(Euclidean distance),my_index
— the name of our index,obj:
— prefix of hash keys to index (basically, the first part of the hash key which is the same for all our objects with vectors).
We can make sure the index was created successfully with the following command:
redis-cli FT.INFO my_index
1) index_name
2) my_index
3) index_options
4) (empty array)
5) index_definition
...
Now we can populate and check our vectors search.
Searching for similar vectors
To search for the nearest (similar) vectors, we should build a search query with the KNN
expression:
from redis import Redis
from redis.commands.search.query import Query
= Redis(host="localhost", port=6379)
r
= (
query "*=>[KNN 5 @vector $vec as dist]")
Query("dist")
.sort_by("id", "dist")
.return_fields(2)
.dialect(
)
= { "vec": np.random.rand(8).astype(np.float32).tobytes() }
query_params = r.ft('my_index').search(query, query_params).docs
found print(found)
[Document {'id': 'obj:80079', 'payload': None, 'dist': '0.0622428953648'}, Document {'id': 'obj:29389', 'payload': None, 'dist': '0.0633160546422'}, Document {'id': 'obj:78626', 'payload': None, 'dist': '0.078750140965'}, Document {'id': 'obj:69396', 'payload': None, 'dist': '0.0949668586254'}, Document {'id': 'obj:37440', 'payload': None, 'dist': '0.1012461707'}]
KNN 5 @vector
— will search for 5 nearest neighbors to thevector
field of the hashes$vec
— name of the query vector parameter,sort_by("dist")
— sort results by distance,return_fields
— list of fields to return with each item in results,dialect(2)
— setting 2nd dialect is required to do vector search,np.random.rand(8)
— generate random 8-dimensional vector to find similar vectors to,my_index
— the name of the index to search in (created earlier).
Related reading
Published a year ago in #data about #vector search, #redis, #knn, #hnsw and #rag by Denys GolotiukEdit this article on Github