One or multiple resources #7

loleg · 2019-07-08T09:54:10Z

Please clarify the behaviour of the library in respect to having one or multiple resources in the Data Package, i.e. under what conditions to expect the read_datapackage call to directly return the DataFrame vs. an array of them. It seems to me that the latter only works when there are multiple compatible types (CSV or GeoJSON). This is also not very logical, and error-prone when trying to build an application for arbitrary data input.

The text was updated successfully, but these errors were encountered:

rgieseke · 2019-07-08T10:17:12Z

Can you elaborate? Clarify the documentation?

loleg · 2019-07-08T10:44:46Z

I'd be happy to update the doc, but I first would want to make sure that this behavior is "by design".

What about a way to make it more explicit? For example, with a top() function that returns the first resource in the package. Do other datapackage-reader libraries do it similarly, i.e. is this specced somewhere?

rgieseke · 2019-07-08T11:05:19Z

An update would be great, I think you described the behaviour correctly - it's definitely not well documented atm and the silent discarding probably can be confusing.

https://github.com/frictionlessdata/datapackage-py has a more general approach where you can/need to iterate over "resources".

augusto-herrmann · 2019-07-17T14:41:50Z

There is already a package that supports reading multiple resources of a data package into a Pandas Dataframe. Even though the last commit was in 2017, at first glance it seems to offer more functionality than this one. @danfowler even did a post about it on the Open Knowledge Labs Blog. Would it make sense to merge these efforts?

rgieseke · 2019-07-21T10:26:20Z

@augusto-herrmann Don't know, when i started this tool i wanted something that would quickly load CSVs from a Data Package into Pandas DataFrames. I think the scope of the tableschema tool was more general and requires more knowledge of the DataPackage toolchain.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

One or multiple resources #7

One or multiple resources #7

loleg commented Jul 8, 2019

rgieseke commented Jul 8, 2019

loleg commented Jul 8, 2019

rgieseke commented Jul 8, 2019

augusto-herrmann commented Jul 17, 2019

rgieseke commented Jul 21, 2019

One or multiple resources #7

One or multiple resources #7

Comments

loleg commented Jul 8, 2019

rgieseke commented Jul 8, 2019

loleg commented Jul 8, 2019

rgieseke commented Jul 8, 2019

augusto-herrmann commented Jul 17, 2019

rgieseke commented Jul 21, 2019