Commit

Updating downloads
ev2900 committed Nov 11, 2024
1 parent ba067cd commit 09dac59
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -1,6 +1,6 @@
# Glue Aggregate Small Parquet Files

- <img width="275" alt="map-user" src="https://img.shields.io/badge/cloudformation template deployments-25-blue"> <img width="85" alt="map-user" src="https://img.shields.io/badge/views-3885-green"> <img width="125" alt="map-user" src="https://img.shields.io/badge/unique visits-1118-green">
+ <img width="275" alt="map-user" src="https://img.shields.io/badge/cloudformation template deployments-25-blue"> <img width="85" alt="map-user" src="https://img.shields.io/badge/views-3885-green"> <img width="125" alt="map-user" src="https://img.shields.io/badge/unique visits-1138-green">

When storing data in S3, it is important to consider the size of the files you store. Parquet files have an ideal size of 512 MB to 1 GB. Storing data in many small files can decrease the performance of data processing tools such as Spark.
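
To make the sizing guidance concrete, here is a minimal sketch of how one might estimate the number of output files needed when compacting many small files. The function name `plan_output_files` and the 512 MB default are assumptions chosen for illustration (the default is the lower bound of the range quoted above); this is not taken from the repository's Glue job.

```python
import math

MB = 1024 ** 2
GB = 1024 ** 3


def plan_output_files(file_sizes_bytes, target_bytes=512 * MB):
    """Estimate how many output files keep each one near target_bytes.

    Hypothetical helper for illustration; target_bytes defaults to the
    lower end of the 512 MB to 1 GB range mentioned in the README.
    """
    if target_bytes <= 0:
        raise ValueError("target_bytes must be positive")
    total = sum(file_sizes_bytes)
    # Always plan at least one file, even for empty or tiny inputs.
    return max(1, math.ceil(total / target_bytes))


# Example input: 10,000 small files of 1 MB each.
small_files = [1 * MB] * 10_000
num_files = plan_output_files(small_files)
print(num_files)
```

In a Spark job, a count like this could be passed to `DataFrame.repartition(num_files)` before writing Parquet. Note that compressed on-disk sizes only approximate the size of the rewritten output, so the result is a starting point rather than an exact figure.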
