[Bug]: Index Node CrashLoopBackOff #40241
Comments
Hey @roy-akash! 👋 I'm here to help you with any bugs, questions, or contributions you have in mind. Let's squash those bugs together!
To fix the Index Node CrashLoopBackOff issue in Milvus version 2.5.3, which is caused by a panic on an empty slice, see the following code:
    func (it *indexBuildTask) PreExecute(ctx context.Context) error {
        // If no data paths were supplied, rebuild them from the data IDs.
        if len(it.req.DataPaths) == 0 {
            for _, id := range it.req.GetDataIds() {
                path := metautil.BuildInsertLogPath(it.req.GetStorageConfig().RootPath, it.req.GetCollectionID(), it.req.GetPartitionID(), it.req.GetSegmentID(), it.req.GetField().GetFieldID(), id)
                it.req.DataPaths = append(it.req.DataPaths, path)
            }
        }
        // Additional logic...
    }
Ensure that this logic is correctly implemented to prevent the panic caused by accessing an empty slice.
/assign @xiaocai2333
@xiaocai2333 please let me know if you need any more information. I want to recover our Milvus instance. Should I go ahead and drop this collection as a mitigation?
If you have a way to delete a single segment, just delete segment 452973157558666152. You can refer to Birdwatcher for the method to delete an individual segment.
[2025/02/27 10:41:01.652 +00:00] [INFO] [indexnode/indexnode_service.go:380] ["receive stats job"] [traceID=b52a0b2da86be6e002264e7fb531c53e] [clusterID=milvusdefaultv2] [TaskID=456147103870884550] [jobType=JobTypeStatsJob] [collectionID=452973157558657785] [partitionID=452973157558657794] [segmentID=452973157558666152] [targetSegmentID=456147103870884549] [subJobType=Sort] [startLogID=456301132344294350] [endLogID=456301132344299251]
By the way, version 2.5.5 has been released and you can upgrade to it directly.
@xiaocai2333 sure, we can try upgrading as a long-term goal.
When I run show segment for this segment ID, I get:
When I try to remove it, I get:
I've gone ahead and dropped the collection, and things seem to be improving now. However, I am very interested in how a segment with 0 rows can be in a flushed state.
The segment with 0 rows will be directly marked as not dropped after v2.4.x. |
So there should be a case added here for backward compatibility, right?
Is there an existing issue for this?
Environment
Current Behavior
There is a specific task ID, "456147103870884550", related to stats_log for collection ID "452973157558657785". When this task is scheduled on an index node, it causes a panic and the index node crashes.
From my debugging, I can trace this panic to this part of the code: when it checks here, req.GetInsertLogs() seems to be empty, so accessing req.GetInsertLogs()[0] panics and the index node crashes.
Expected Behavior
No response
Steps To Reproduce
Milvus Log
milvus-log.tar.gz
Anything else?
How do I fix this?