DevOps & Cloud
Data Lake Architecture on AWS
Serverless data lake implementation on AWS using S3, Lambda, Glue, and Athena. Processes and stores ~20GB daily with automated partitioning and compression.
AWS
PySpark
Python
Terraform
December 2025
156 stars
45 forks
Repository Statistics
156
GitHub Stars
45
Forks
Python
Primary Language