What happened?
Below is a schema definition that reproduces the problem:
schema = Schema()
schema.create_index(
    config=SparseVectorIndexConfig(
        key=K.DOCUMENT,  # this line is wrong! the named param is `source_key`
        embedding_function=ChromaBm25EmbeddingFunction(),
        bm25=True,
    ),
    key=SPARSE_KEY,
)
When SparseVectorIndexConfig is constructed without a source_key value, the index silently fails to operate. In the example above, if a collection is created with this schema and documents are ingested, the BM25 index will not be present on the documents, and the user receives no error indicating that the config was incorrect.
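To make the expected behavior concrete, here is a minimal sketch of a config that fails loudly instead of silently. It is a hypothetical stand-in written with a plain dataclass, not Chroma's actual SparseVectorIndexConfig. The class name `StrictSparseConfig`, the helper `try_config`, the placeholder `"document"` value, and the rule that `source_key` is required whenever an embedding function is set are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class StrictSparseConfig:
    """Hypothetical stand-in for illustration; not Chroma's SparseVectorIndexConfig."""

    source_key: Optional[str] = None
    embedding_function: Any = None
    bm25: bool = False

    def __post_init__(self) -> None:
        # Assumed rule: an embedding function with no source field to read from
        # cannot build an index, so reject the config rather than accept it.
        if self.embedding_function is not None and self.source_key is None:
            raise ValueError("source_key is required when embedding_function is set")


def try_config(**kwargs: Any) -> str:
    """Return 'ok', or the name of the exception raised while building the config."""
    try:
        StrictSparseConfig(**kwargs)
        return "ok"
    except (TypeError, ValueError) as exc:
        return type(exc).__name__


# The misnamed parameter from the report is rejected as an unknown keyword.
print(try_config(key="document", embedding_function=object(), bm25=True))
# Omitting source_key entirely is rejected by the explicit check.
print(try_config(embedding_function=object(), bm25=True))
# The correctly named parameter is accepted.
print(try_config(source_key="document", embedding_function=object(), bm25=True))
```

With either kind of check in place, the misnamed `key=` argument would surface as an error at schema definition time rather than as a missing index after ingestion.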
Versions
chroma 1.2.1
Relevant log output