Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use the
/dbfs/put
API endpoint to upload smaller DBFS files #1951base: main
Are you sure you want to change the base?
Use the
/dbfs/put
API endpoint to upload smaller DBFS files #1951Changes from all commits
c4cea1a
91a2dfa
06af01c
4b484fd
04cae2f
f690f0a
da46c14
78b9788
2717aca
0ce50fa
63e599c
932aeee
9d8ba09
09bf4fa
cf51636
be62ead
7084392
ee80173
69fdd97
92e97ad
95f41b1
8ec1e07
890b48f
f70c472
1e2545e
6991dea
583637a
9552131
7ab9fb7
ac37ca0
f4623eb
ee9499b
e9b0afb
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This new approach really belongs in the SDK.
There is an existing interface for dealing with DBFS files (that is called here).
The details of streaming a file or doing a single put call can be abstracted there and surfaced here with a dedicated
FileMode
to indicate whether it should be a single call or multiple calls.The size of a file can be retrieved through the
io.Seeker
interface.The change here should really be limited to determining the file mode and not the implementation.
The SDK guarantees the correctness of the implementation in either streaming or single-call mode.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Given that DBFS is a service that will be deprecated, at least the public-facing part I think we should just keep it in the CLI rather than invest time to define the interfaces on the SDK side and using them here.
This PR is really meant to address the regression from the legacy Databricks CLI.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Happy to move it to the SDK if you disagree.