Skip to main content
DevOps & Cloud

Data Lake Architecture on AWS

Serverless data lake implementation on AWS using S3, Lambda, Glue, and Athena. Processes and stores ~20GB daily with automated partitioning and compression.

AWS PySpark Python Terraform
December 2025 156 stars 45 forks

Repository Statistics

156
GitHub Stars
45
Forks
Python
Primary Language