[Python] Support serialization of Arrow files on disk without the identifier "Feather" #38515

jason-s · 2023-10-30T16:30:31Z

Describe the enhancement requested

The documentation for Arrow Columnar Format suggests that the separate Feather project has been subsumed into Arrow, and that it (Feather) is really just the canonical serialization format for Arrow tables:

We recommend the “.arrow” extension for files created with this format. Note that files created with this format are sometimes called “Feather V2” or with the “.feather” extension, the name and the extension derived from “Feather (V1)”, which was a proof of concept early in the Arrow project for language-agnostic fast data frame storage for Python (pandas) and R.

The Python support of Arrow serialization still uses the identifier feather: (see the Cookbook)

Once we have a table, it can be written to a Feather File using the functions provided by the pyarrow.feather module
import pyarrow.feather as ft

ft.write_feather(table, 'example.feather')

This functionality should be kept as is, for backwards compatibility, but I wonder if the pyarrow module should just have a write() function, without requiring the need to import the pyarrow.feather package or use the term feather. This would help to reduce confusion about file extensions and the relationship between "Arrow" and "Feather".

Component(s)

Python

The text was updated successfully, but these errors were encountered:

jason-s · 2023-10-30T16:30:45Z

See also apache/arrow-cookbook#329

lidavidm · 2025-02-17T07:24:27Z

This came up again: apache/arrow-site#586 (comment)

We should:

Deprecate pyarrow.feather
Add write_table etc to pyarrow.ipc to be consistent with pyarrow.csv and pyarrow.parquet
- Additionally, why is it pyarrow.parquet.write_table but then pyarrow.csv.write_csv and pyarrow.feather.write_feather? We should be consistent.

jason-s added the Type: enhancement label Oct 30, 2023

github-actions bot added the Component: Python label Oct 30, 2023

jason-s mentioned this issue Oct 30, 2023

Recommended file extension for Feather files apache/arrow-cookbook#329

Open

lidavidm mentioned this issue Feb 17, 2025

[Website] Add "Data wants to be free" apache/arrow-site#586

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Python] Support serialization of Arrow files on disk without the identifier "Feather" #38515

[Python] Support serialization of Arrow files on disk without the identifier "Feather" #38515

jason-s commented Oct 30, 2023

jason-s commented Oct 30, 2023

lidavidm commented Feb 17, 2025

[Python] Support serialization of Arrow files on disk without the identifier "Feather" #38515

[Python] Support serialization of Arrow files on disk without the identifier "Feather" #38515

Comments

jason-s commented Oct 30, 2023

Describe the enhancement requested

Component(s)

jason-s commented Oct 30, 2023

lidavidm commented Feb 17, 2025