-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
One node can see the other, but not reciprocal #603
Comments
How are they networked? Is this over WiFi? |
Yessir - wifi. |
I have the same issue as you, however I used thunderbolt5 to connect them. Here is how I addressed this issue:
|
Appreaciate the suggestion. Unfortunately, no change. The thunderbolt bridge is connected, and different IPS. Same behavior - the macbook can see the mini, but the mini, can't see the macbook. |
I have the exact same issue. The first run after a fresh install on both hosts appeared to work Okay, as both hosts downloaded the model, but then I had to leave and take one host with me, when I came back no number of restarting both node's exo will make it work-- one host sees both, the other only sees one. |
Can you try running on the latest main. I made some fixes to blocking code which would cause nodes to appear unhealthy. |
I was using the latest, as of yesterday. Today I just rebooted the host that only showed the other host and it all worked-- able to download and run a 70B model on 2 32GB macbooks without issue! |
A follow up here, as the connection issues persist. When doing a query on node1 I see the query activity on node2 as well, even though it's still only showing 1 node connected. I did a pull today on both nodes, so they are running latest code. I had also noticed that on node2 (the unhealthy node) it sometimes crashes with a GPU timeout error: Node2 is a M1 Pro, node1 is a M2 Pro. |
I have a Macbook Pro (M1 Max, 32GB) and a Mac Mini (M4, 16GB). Both running Exo with no errors in the console.
The Macbook sees the mini in the cluster (2 nodes, both listed) but the mini can't see the macbook (1 node, only itself listed).
To make things more intersting - the macbook tiny chat doesn't work, but the mini does, Macbook shows the start of the prompt response generation in the terminal, then hangs:
<lbegin......
Cutting Knowledge...
Today Date...
The text was updated successfully, but these errors were encountered: