Dev.to
5/10/2026
When does Iceberg beat Parquet+projection on AWS Glue, and when doesn't it?

Short summary

The author built a test project that compares Apache Iceberg with Parquet plus partition projection on AWS Glue, running both through the same streaming pipeline (Kinesis → Firehose → Athena), and documents which storage format wins under which conditions. The post gives the rationale behind design decisions for each read mode (DynamicFrame vs Spark vs Iceberg), covers the trade-offs, and describes TDD-friendly architectural patterns. It also includes practical guidance on partition projection constraints and cost implications.

  • Compares Iceberg vs Parquet+projection on an identical AWS Glue streaming pipeline
  • Explains read mode trade-offs and cost/performance implications for each approach
  • Shares architectural patterns including partition projection constraints and TDD code organization
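
For readers unfamiliar with partition projection, the sketch below shows roughly what the Parquet side of such a comparison has to configure. It is not taken from the author's project: the bucket path, the `dt` partition column, the start date, and both helper function names are assumptions made for illustration. The property keys are the standard Athena partition projection table properties.

```python
def projection_properties(location, column="dt", start="2024-01-01", fmt="yyyy-MM-dd"):
    """Build Athena partition projection properties for a single date column.

    All argument values are illustrative assumptions, not the author's settings.
    """
    base = location.rstrip("/")
    return {
        "projection.enabled": "true",
        f"projection.{column}.type": "date",
        # Projection needs an explicit range; NOW keeps the upper bound open.
        f"projection.{column}.range": f"{start},NOW",
        f"projection.{column}.format": fmt,
        f"projection.{column}.interval": "1",
        f"projection.{column}.interval.unit": "DAYS",
        # Athena substitutes ${dt} with each projected partition value.
        "storage.location.template": f"{base}/{column}=${{{column}}}/",
    }


def render_tblproperties(props):
    """Render the properties as a TBLPROPERTIES clause for a CREATE TABLE statement."""
    pairs = ",\n  ".join(f"'{key}' = '{value}'" for key, value in props.items())
    return f"TBLPROPERTIES (\n  {pairs}\n)"


if __name__ == "__main__":
    # Hypothetical bucket and prefix.
    props = projection_properties("s3://example-bucket/events/")
    print(render_tblproperties(props))
```

The point of the sketch is that the partition layout, range, and format must be declared up front in table properties, which is one kind of constraint the post refers to. Iceberg tables track partitions in their own metadata instead.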

Generated with AI, which can make mistakes.
