[BUG] Make memberlist use ips for routing (#3405)
## Description of changes

*Summarize the changes made by this PR.*

- Improvements & Bug fixes
  - The memberlist uses k8s DNS-based routing for the stateful set. However, this can result in delays when the routing updates. This change propagates the ip in the memberlist, which is much faster.
  - This change is backwards compatible:
    - CRD fields are optional by default, so not setting the ip is fine if the CRD is updated
    - CRD fields can be set in the code without the CRD being updated
    - The Python frontend will use the ip if present; otherwise it falls back to the id
    - The Go code will read the ip as "" anywhere it expects it to be set if the old version of the CR is used
- New functionality
  - ...

## Test plan

*How are these changes tested?*

Added a test which adds data to a collection, kills the query service pods, and waits for them to be ready, simulating a rollout of sorts. It then issues another query and makes sure the query succeeds with updated routing. Before this change, this test failed.

- [x] Tests pass locally with `pytest` for python, `yarn test` for js, `cargo test` for rust

## Documentation Changes

None
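The fallback rule described for the Python frontend (use the ip if present, otherwise the id) could be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the `Member` dataclass, its field names, and `routing_target` are hypothetical names chosen for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Member:
    """Hypothetical memberlist entry; the field names are assumptions."""

    id: str
    ip: Optional[str] = None


def routing_target(member: Member) -> str:
    """Return the address to route to for a member.

    A missing ip (None) or an empty one ("" as read from an old CR)
    falls back to the member id.
    """
    return member.ip if member.ip else member.id
```

Treating `None` and `""` the same way keeps the frontend working whether the field is absent from the CR or was read as an empty string.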
Showing 12 changed files with 167 additions and 38 deletions.
@@ -0,0 +1,74 @@

```python
from typing import Sequence
from chromadb.test.conftest import (
    reset,
    skip_if_not_cluster,
)
from chromadb.api import ClientAPI
from kubernetes import client as k8s_client, config
import time


@skip_if_not_cluster()
def test_reroute(
    client: ClientAPI,
) -> None:
    reset(client)
    collection = client.create_collection(
        name="test",
        metadata={"hnsw:construction_ef": 128, "hnsw:search_ef": 128, "hnsw:M": 128},
    )

    ids = [str(i) for i in range(10)]
    embeddings: list[Sequence[float]] = [
        [float(i), float(i), float(i)] for i in range(10)
    ]
    collection.add(ids=ids, embeddings=embeddings)
    collection.query(query_embeddings=[embeddings[0]])

    # Restart the query service using k8s api, in order to trigger a reroute
    # of the query service
    config.load_kube_config()
    v1 = k8s_client.CoreV1Api()
    # Find all pods with the label "app=query"
    res = v1.list_namespaced_pod("chroma", label_selector="app=query-service")
    assert len(res.items) > 0
    items = res.items
    seen_ids = set()

    # Restart all the pods by deleting them
    for item in items:
        seen_ids.add(item.metadata.uid)
        name = item.metadata.name
        namespace = item.metadata.namespace
        v1.delete_namespaced_pod(name, namespace)

    # Wait until we have len(seen_ids) pods running with new UIDs
    timeout_secs = 10
    start_time = time.time()
    while True:
        res = v1.list_namespaced_pod("chroma", label_selector="app=query-service")
        items = res.items
        new_ids = set([item.metadata.uid for item in items])
        if len(new_ids) == len(seen_ids) and len(new_ids.intersection(seen_ids)) == 0:
            break
        if time.time() - start_time > timeout_secs:
            assert False, "Timed out waiting for new pods to start"
        time.sleep(1)

    # Wait for the query service to be ready, or timeout
    while True:
        res = v1.list_namespaced_pod("chroma", label_selector="app=query-service")
        items = res.items
        ready = True
        for item in items:
            if item.status.phase != "Running":
                ready = False
                break
        if ready:
            break
        if time.time() - start_time > timeout_secs:
            assert False, "Timed out waiting for new pods to be ready"
        time.sleep(1)

    time.sleep(1)
    collection.query(query_embeddings=[embeddings[0]])
```
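The two wait loops in the test share one polling pattern and one 10-second budget measured from the same `start_time`. As a possible refactoring (a sketch only, not part of this commit), the pattern could live in a small helper that gives each wait its own deadline. `wait_until` and `is_pod_ready` are hypothetical names. The readiness check assumes the pod object exposes `status.conditions` entries with `type` and `status` fields, as the Kubernetes Python client's pod model does, and it inspects the `Ready` condition rather than only the `Running` phase that the test checks.

```python
import time
from types import SimpleNamespace
from typing import Any, Callable


def wait_until(
    predicate: Callable[[], bool],
    timeout_secs: float = 10.0,
    interval_secs: float = 1.0,
    message: str = "Timed out waiting for condition",
) -> None:
    """Poll `predicate` until it returns True or the deadline passes."""
    # monotonic() is unaffected by wall-clock adjustments during the wait
    deadline = time.monotonic() + timeout_secs
    while True:
        if predicate():
            return
        if time.monotonic() >= deadline:
            raise TimeoutError(message)
        time.sleep(interval_secs)


def is_pod_ready(pod: Any) -> bool:
    """Return True if the pod reports a Ready condition with status "True".

    Assumes `pod.status.conditions` is a list (or None) of objects with
    `type` and `status` attributes.
    """
    conditions = pod.status.conditions or []
    return any(c.type == "Ready" and c.status == "True" for c in conditions)


# Stand-in object for demonstration only; a real test would pass the pods
# returned by the Kubernetes API instead.
fake_pod = SimpleNamespace(
    status=SimpleNamespace(
        phase="Running",
        conditions=[SimpleNamespace(type="Ready", status="True")],
    )
)
wait_until(
    lambda: is_pod_ready(fake_pod),
    timeout_secs=10.0,
    message="Timed out waiting for new pods to be ready",
)
```

Raising `TimeoutError` instead of `assert False` keeps the failure visible even when Python runs with assertions disabled; whether that matters for this test suite is a judgment call.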