-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Capacity Error with readXenium()
Function and Issues Combining Samples Using cbind()
#48
Comments
Hi, could you please include Issue 1:
Issue 2:
|
Actually I have encountered the arrow error before. I haven't implemented it yet but I can try modifying the code to split the transcript spots and write them to multiple smaller GeoParquet files and then using DuckDB to concatenate the files. |
Seurat style full join is very problematic in that genes present in sample 1 but not sample 2 are NA's in sample 2, but Seurat fills in 0, which is not an appropriate stand in for NA. |
Hi @alikhuseynov and @lambdamoses , Thank you for the quick reply. Regarding the issue with joining samples, I am now planning to perform quality control (QC) prior to merging and remove the remaining probes from the gene panel. I believe this will allow cbind() to work correctly. We are using XOA version 3.0.0.15. Below is the traceback of the error encountered in Issue 1:
I had the error for 5 out of the 13 samples that I tried to read in. |
Thanks, so it is multimodal Xenium data and probably those 5 samples have very large transcript (tx) files. |
We might not need to use DuckDB after all. I just found out that the |
that's great! splitting using |
Hello,
Thank you for the package! I am encountering a couple of issues and would appreciate your guidance.
Issue 1: Capacity Error with
readXenium()
I am working with multiple Xenium samples and can successfully read most of them using the
readXenium()
function. However, I receive the following error message for a few samples:Error: Capacity error: array cannot contain more than 2147483646 bytes, have 2157274215.
It seems that this error is related to the Arrow package used to read 'parquet' files. Is there a way to resolve this issue or a workaround that you would recommend?
Issue 2: Combining Samples with
cbind()
I intended to use
cbind()
to combine multiple samples into a singleSpatialFeatureExperiment
. Unfortunately, I encountered an error because my samples have different numbers of rows (e.g., differing number of control probes, antisense probes, etc.). The error is as follows:Error in FUN(X[[i]], ...) : column(s) 'ID' in ‘mcols’ are duplicated and the data do not match
The error disappears when I use
cbind()
after subsetting only the genes on the Xenium gene panel (i.e., excluding the negative control probes, antisense, etc.). Is there a way to combine the samples without needing to subset them first?Thank you for the help!
The text was updated successfully, but these errors were encountered: