Skip to content

Commit

Permalink
Updating downloads
Browse files Browse the repository at this point in the history
  • Loading branch information
ev2900 committed Jan 22, 2025
1 parent 07c5601 commit c461ea6
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# Glue Aggregate Small Parquet Files

<img width="275" alt="map-user" src="https://img.shields.io/badge/cloudformation template deployments-29-blue"> <img width="85" alt="map-user" src="https://img.shields.io/badge/views-4088-green"> <img width="125" alt="map-user" src="https://img.shields.io/badge/unique visits-1244-green">
<img width="275" alt="map-user" src="https://img.shields.io/badge/cloudformation template deployments-29-blue"> <img width="85" alt="map-user" src="https://img.shields.io/badge/views-4127-green"> <img width="125" alt="map-user" src="https://img.shields.io/badge/unique visits-1244-green">

When storing data in S3 it is important to consider the size of files you store in S3. Parquet files have an ideal file size of 512 MB - 1 GB. Storing data in many small files can decrease the performance of data processing tools ie. Spark.

Expand Down

0 comments on commit c461ea6

Please sign in to comment.