Fail to fine-tune FlowNet2 #152
Comments
Hi,
Thank you for your reply. Normally there is no risk of this type.
For two days I have tried many of the things suggested on forums: I recompiled all of FlowNet2, modified the .bashrc, and modified Makefile.config, all without success. If anybody has an idea, I'm more than interested.
Changing the gcc version is pointless, since CUDA 8.0.61 allows gcc 5.4.
Thread 6 "caffe" received signal SIGSEGV, Segmentation fault.
Thread 6 (Thread 0x7fffbef4f700 (LWP 18797)):
Thread 5 (Thread 0x7fffc5003700 (LWP 18795)):
Thread 4 (Thread 0x7fffc5804700 (LWP 18794)):
Thread 3 (Thread 0x7fffc6005700 (LWP 18793)):
Thread 2 (Thread 0x7fffc75f2700 (LWP 18791)):
Thread 1 (Thread 0x7ffff7f6db00 (LWP 18787)):
Your backtrace indicates that you are using CuDNN version 7. We've only ever used version 5. I know that it's relatively easy to make the code compatible with version 6, but I never tried 7.
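For anyone who wants to confirm which CuDNN version their headers belong to before rebuilding, here is a minimal sketch. It assumes the installed `cudnn.h` contains `#define CUDNN_MAJOR`, `CUDNN_MINOR` and `CUDNN_PATCHLEVEL` lines, as the CuDNN 5 to 7 headers do. The function names and the sample text are made up for illustration and are not part of FlowNet2 or Caffe.

```python
import re

# Matches lines such as "#define CUDNN_MAJOR 7" in a cuDNN header.
_DEFINE = re.compile(
    r"^\s*#\s*define\s+CUDNN_(MAJOR|MINOR|PATCHLEVEL)\s+(\d+)", re.MULTILINE
)


def parse_cudnn_version(header_text):
    """Return (major, minor, patch) parsed from header text, or None if incomplete."""
    found = {name: int(value) for name, value in _DEFINE.findall(header_text)}
    if set(found) != {"MAJOR", "MINOR", "PATCHLEVEL"}:
        return None
    return found["MAJOR"], found["MINOR"], found["PATCHLEVEL"]


def is_tested_version(version, tested_major=5):
    """True only if the major version equals the one the maintainers report using."""
    return version is not None and version[0] == tested_major


# Hypothetical header excerpt, used here instead of reading a real file.
sample = """
#define CUDNN_MAJOR 7
#define CUDNN_MINOR 0
#define CUDNN_PATCHLEVEL 5
"""

version = parse_cudnn_version(sample)
print(version, "tested" if is_tested_version(version) else "untested")
```

To use it on a real install, pass the contents of your own header, for example `parse_cudnn_version(open(path).read())`, where `path` depends on where CuDNN was installed on your machine.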
Thank you for your reply. I will try downgrading my CuDNN version. It's very strange: I can build and run FlowNet2 without problems, but I can't train or fine-tune.
Hm, that's strange, but it really might be a problem with CuDNN. But it might be worth asking the people in #92 whether they actually used training, or just testing 😉
Thank you for all your help @nikolausmayer. My setup: Ubuntu 16.04, 980Ti, CUDA 8.0.61, CuDNN 5.1, gcc 5.4, Python 3.5.
Thread 6 "caffe" received signal SIGSEGV, Segmentation fault.
Thread 6 (Thread 0x7fffc9f02700 (LWP 10153)):
Thread 5 (Thread 0x7fffcbf47700 (LWP 10152)):
Thread 4 (Thread 0x7fffcc748700 (LWP 10151)):
Thread 3 (Thread 0x7fffccf49700 (LWP 10150)):
Thread 2 (Thread 0x7fffce536700 (LWP 10148)):
Thread 1 (Thread 0x7ffff7f6db00 (LWP 10144)):
OK, I found why I had an error! Thank you @nikolausmayer for your help.
Nice job. I guess it would be good if the converters or data layers checked for this... 🙂
Hi,
I'm trying to fine-tune FlowNet2 on my own dataset. I converted my database to LMDB format and modified FlowNet2_train.prototxt to fit my problem.
When I started training, I ran into a "Segmentation Fault" during CustomDataLayerPrefetch, and I don't know where the error comes from.
Any suggestions?