Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Are different database systems going to be supported data sources? #1048

Open
Smurphy000 opened this issue Sep 24, 2021 · 4 comments
Open

Are different database systems going to be supported data sources? #1048

Smurphy000 opened this issue Sep 24, 2021 · 4 comments
Labels
enhancement New feature or request

Comments

@Smurphy000
Copy link
Contributor

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Are there plans to add a way to support connect/reading from a database (Postgres, MySQL, etc)?

Describe the solution you'd like
I am currently thinking in terms of Spark, where I have the ability to specify a format where that format could also be a package like com.microsoft.sqlserver.jdbc.spark which connects to the database via jdbc

If this is already supported in some way, where can I find this?

@Smurphy000 Smurphy000 added the enhancement New feature or request label Sep 24, 2021
@Igosuki
Copy link
Contributor

Igosuki commented Sep 25, 2021

Hi, disclaimer not a maintainer. No it's not currently supported. I definitely see a use for it to join database tables with columnar data, although I find it is subject to the multiple-writer database anti-pattern, so you must have strong test suites to make sure changing the db (which is often owned by an API) won't affect the ETL process.
One would have to create an experimental crate, implement a generic TableProvider for RDBMs systems, validate the SQL statements against the db, and adapt arrow to whatever data structure is used for sql statements.
Interesting projects that come to mind for this are tokio-rs/rdbc and sqlx, there is also the arrow jdbc adapter and ongoing tasks like https://issues.apache.org/jira/browse/ARROW-7744

@jorgecarleitao
Copy link
Member

fyi, https://github.com/sfu-db/connector-x already supports arrow, so it is "only" a matter of gluing it together.

@Igosuki
Copy link
Contributor

Igosuki commented Sep 25, 2021

@jorgecarleitao didn't know about this, so just need to implement a table provider and add the sources !

@houqp
Copy link
Member

houqp commented Sep 25, 2021

Yes, I would recommend implementing a table provider plugin using connector-x as a self-contained crate :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants