Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Spark data source for lance #541

Open
Tracked by #2079
LiWeiJie opened this issue Feb 9, 2023 · 9 comments
Open
Tracked by #2079

Add Spark data source for lance #541

LiWeiJie opened this issue Feb 9, 2023 · 9 comments
Labels
enhancement New feature or request java

Comments

@LiWeiJie
Copy link
Contributor

LiWeiJie commented Feb 9, 2023

No description provided.

@LiWeiJie
Copy link
Contributor Author

LiWeiJie commented Feb 9, 2023

Hope to support lance in pyspark's spark data source

@andrei-ionescu
Copy link

Is there a Scala implementation for Lance format?

@changhiskhan
Copy link
Contributor

Is there a Scala implementation for Lance format?

Not yet. We're still debating how to do it long term. The data will come out as Arrow which has a JVM impl already.

If you're interested in hacking on one let us know! [email protected] would love to collaborate

@zhenyu
Copy link

zhenyu commented May 4, 2023

+1 for the feature!

@wjones127
Copy link
Contributor

TBD: DataSourceV2 or TableProvider?

@wjones127 wjones127 added java enhancement New feature or request labels Mar 12, 2024
@wjones127 wjones127 mentioned this issue Mar 15, 2024
20 tasks
@rjurney
Copy link

rjurney commented Feb 3, 2025

This would be really cool... want to use Lance with PySpark.

@SaintBacchus
Copy link
Collaborator

Lance spark connector can already be used in our case. But now it misses some docs about how to use that. I will write it soon.

@rjurney
Copy link

rjurney commented Feb 5, 2025

I couldn't find a lance spark connector. Can you post a URL and then I guess this ticket can be closed?

@SaintBacchus
Copy link
Collaborator

Now there are no pre-build jars for lance spark connectors. If you want to use it now, you should build it yourself.

I submit a new PR for how to build the java package and use the spark connector #3433.
But I think it is still hard to understand. Maybe you can wait a few days until the new version of Lance is released. We will publish the pre-build jars into the Maven repo and write some docs about how to use the spark connectors.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request java
Projects
None yet
Development

No branches or pull requests

7 participants