This project provides a comprehensive data pipeline solution to ETL Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.
-
Updated
Jun 3, 2024 - Python