subsetting strategy #249

Open
janrito opened this issue Dec 20, 2022 · 3 comments

janrito commented Dec 20, 2022

The documentation on subsets is a bit sparse. Is there a way to create more complex subsetting strategies?

For example, tables often depend on each other (products, orders, customers). Taking a percentage of products on its own might leave a lot of empty orders, or a lot of products that were never bought.

What if I wanted:

  • a subset of users based on a random sample
  • a subset of those users' orders, maybe based on a specific time period (the last three months)
  • a subset of products, maybe all the ones in the orders above, plus a few random extra ones
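
To make the request concrete, here is roughly what I would otherwise do by hand. This is only a sketch in plain Python with `sqlite3`, not this project's API. The table and column names (`users`, `orders`, `order_items`, `products`, `created_at`), the function name `subset_ids`, and its parameters are all assumptions made up for illustration.

```python
import random
import sqlite3


def subset_ids(conn, user_fraction=0.1, since="2022-09-20", extra_products=5, seed=0):
    """Pick user, order and product ids that stay consistent with each other.

    Hypothetical schema: users(id), products(id),
    orders(id, user_id, created_at as ISO date text),
    order_items(order_id, product_id).
    """
    rng = random.Random(seed)

    # 1. Random sample of users.
    all_users = [row[0] for row in conn.execute("SELECT id FROM users ORDER BY id")]
    k = min(len(all_users), max(1, round(len(all_users) * user_fraction)))
    user_ids = set(rng.sample(all_users, k))

    # 2. Orders placed by those users on or after the cutoff date.
    recent = conn.execute(
        "SELECT id, user_id FROM orders WHERE created_at >= ?", (since,)
    )
    order_ids = {oid for oid, uid in recent if uid in user_ids}

    # 3. Every product in those orders, plus a few random extras.
    items = conn.execute("SELECT order_id, product_id FROM order_items")
    product_ids = {pid for oid, pid in items if oid in order_ids}
    others = [
        row[0]
        for row in conn.execute("SELECT id FROM products ORDER BY id")
        if row[0] not in product_ids
    ]
    product_ids |= set(rng.sample(others, min(extra_products, len(others))))

    return user_ids, order_ids, product_ids
```

The point is that each step is driven by the result of the previous one, so no order ends up without its user and no order item ends up without its product. Being able to express something like this as a subset strategy is what I am asking about.
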
@mbrueggemann

+1

@kr-sanjeev-singh

+1

nickygb commented Nov 23, 2024

+1
