Updating downloads
ev2900 committed Nov 24, 2024
1 parent ac71eaa commit 2d667ec
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -1,6 +1,6 @@
# Glue Aggregate Small Parquet Files

-<img width="275" alt="map-user" src="https://img.shields.io/badge/cloudformation template deployments-29-blue"> <img width="85" alt="map-user" src="https://img.shields.io/badge/views-3939-green"> <img width="125" alt="map-user" src="https://img.shields.io/badge/unique visits-1164-green">
+<img width="275" alt="map-user" src="https://img.shields.io/badge/cloudformation template deployments-29-blue"> <img width="85" alt="map-user" src="https://img.shields.io/badge/views-3939-green"> <img width="125" alt="map-user" src="https://img.shields.io/badge/unique visits-1171-green">

When storing data in S3, it is important to consider the size of the files you store. Parquet files have an ideal file size of 512 MB to 1 GB. Storing data in many small files can decrease the performance of data processing tools such as Spark.
