-
Notifications
You must be signed in to change notification settings - Fork 61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for SPSS multiple response categories #25
Comments
Sorry, no idea what MRVs are. Can you please provide an example SPSS file and explain what it is and what information are you trying to retrieve? |
The official name is "Multiple Response sets". URL: https://www.ibm.com/support/knowledgecenter/en/SSLVMB_23.0.0/spss/base/multiple_response_intro.html If the SPSS documentation is not enough I can provide an example SPSS file. |
Yes, a sample file will be needed. In addition a description of the contents in plain text, because the job is to guess where in the binary file is the content you are looking for. This will require changes to the Readstat C library. I can file an issue over there or you can do it yourself if you prefer. There is no guarantee that they will do it, nor timelines either. It will help a lot if there is somewhere a description of how are these fields represented in the binary file. If you could find that would be great, because otherwise it will be very difficult to implement. An example of such specification is here or here but these doesn't seem to explain the feature you are requesting (can you see them?) Also other libraries in python or other languages that can do the job could also be useful to look at. |
Closed due to lack of example files. |
@ofajardo, sorry for the ping, but I assume you're not receiving comments on closed issues. Could this be reopened? The specs for this record in the SPSS file are actually part of the spec you linked: https://www.gnu.org/software/pspp/pspp-dev/html_node/Multiple-Response-Sets-Records.html Essentially what it does is specify the relationship between multiple questions that should be interpreted as a single question with multiple values instead. Attached is an example file that contains 2 sets, one multiple category and multiple dichotomy. For details you could check docs here (https://www.gnu.org/software/pspp/manual/html_node/MRSETS.html#MRSETS), but for implementation that is not relevant. Let me know if I can be of further assistance, I'm not familiar with python at all, but have spent many hours hating the binary file format that is SPSS SAV... |
hi there, Haven't look into it in detail yet, but this will require that Readstat (the C library behind pyreadstat) implements this. Could you therefore open an issue there? (I am sure they will appreciate your insights into the binary file format). Once it is implemented in Readstat I will be able to bring it into pyreadstat. |
I've opened a new PR to address this #259 . In accordance with our team at Crunch.io and Evan Miller. We'll also open a PR on readstat, so this won't be immediately available. The idea is to rebase the ☝️ pr once readstat changes get shipped. |
This feature will be hugely appreciated. |
This has now been implemented, as documented in this PR: #271 |
Support to retrieve (meta) data on MRVs / multiple answer question groupings would be great.
The text was updated successfully, but these errors were encountered: