-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
decoder::stateless::h264::vaapi::tests::test_25fps_block
regression on Intel
#44
Comments
Hi! I decided to take a look at this one. First I followed the suggestion to diff the libva traces between 39e3d00 and 66bf1d2. I wasn't able to find any difference between regular traces except different So I modified the va_trace.c to dump buffer contents during This led me to believe that it is potentially an issue in buffer management. I decided to try to delete buffers more proactively, eg. after diff --git a/src/picture.rs b/src/picture.rs
index ea9a8d9..6e882e8 100644
--- a/src/picture.rs
+++ b/src/picture.rs
@@ -220,10 +220,15 @@ impl<T> Picture<PictureEnd, T> {
let res = self.surface().sync();
match res {
- Ok(()) => Ok(Picture {
- inner: self.inner,
- phantom: PhantomData,
- }),
+ Ok(()) => {
+ let mut inner = self.inner;
+ inner.buffers.clear();
+
+ Ok(Picture {
+ inner,
+ phantom: PhantomData,
+ })
+ }
Err(e) => Err((e, self)),
}
} |
Some time ago, @ndufresne found out that if slices aren't submitted as a slice array, only the last slice is registered. This is just how the Intel driver seems to work. Back then, the capability of submitting slices as an array was added to cros-libva and then to AV1 decoder in cros-codecs, but the other decoders were left untouched. |
Thanks for the tip @dwlsalmeida! In such a case I wonder why it would work before 39e3d00. I hope it's not that Intel driver would work correctly if In either way I think this should be the next step. But because of 39e3d00 it might require holding slices or their corresponding VA structures ( Just FYI I tried also changing |
@bgrzesik you can have owned Nalus if you rework them to use Cow instead. See the AV1 parser for inspiration. The purpose of having cow was precisely to make it easy to have owned versions of these if needed, I just did not convert all the parsers to it. |
The h265 vaapi backend in particular can benefit from this since it needs to hold on to multiple slices before submitting. This is currently done in a hackish way. This also has the added benefit of removing the lifetime parameter from Nalu, which will simplify a lot of the signatures. |
With this commit the chromeos#44 is fixed. It is achieved by submitting multiple slices of one picture using a single slice data buffer and all slice parameters using a single buffer with multiple elements.
With this commit the chromeos#44 is fixed. It is achieved by submitting multiple slices of one picture using a single slice data buffer and all slice parameters using a single buffer with multiple elements.
This is a regression introduced by 39e3d00:
The
sync()
call onsubmit_picture
when in blocking mode causes an internal decoder error on the VAAPI side. This doesn't happen in non-blocking mode, and also doesn't happen prior to commit 39e3d00. The AMD driver also seems to be unaffected.Guess the first thing to do would be to set
LIBVA_TRACE
and check whether the sequence of calls to libva before and after this CL is different - it should not, but obviously something has changed.The text was updated successfully, but these errors were encountered: