-
Notifications
You must be signed in to change notification settings - Fork 203
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add git to filesystem source 301 devel #893
Add git to filesystem source 301 devel #893
Conversation
- includes comments about code issues. - for more discussion see main PR at dlt-hub/verified-sources#312
Remove these in PR
✅ Deploy Preview for dlt-hub-docs canceled.
|
Supporting parameters in fsspec URLs requires some changes in
Points re supporting fsspec url arguments: FileItem.file_url has conflicting responsibilities:
FileItemDict gets its file_url from the output of case 4 - glob_files() . So if the netloc is stripped there, it will cause unpredictable results for cases 1, 2, 3. Options for resolving that:
I can see the In general for FileItemDict, not sure about this factory: questions below
|
Fsspec urls can contain some strange things that Workaround here is to call fs_client._strip_protocol, which is extended by quite a few fsspec implementations.
But _strip_protocol is an _internal method. The only public way to get fsspec to return a stripped url is via There's similar url parsing issue in
That |
Final little issue with fsspec url parameters. How to redact sensitive info.
An fssepec implementation might:
|
Support git filesystem Resource per dlt-hub/verified-sources#301
May also include general improvements to dlt fsspec handling.