Block 130: Mini-Project: Mini Analytics Database
Build a complete ETL + analytics pipeline.
Concepts
- Combining file I/O, pandas, SQLite, and visualization
- Full data lifecycle in one project
Code Examples
See exercise below.
Exercise
Pipeline: load a CSV → clean with pandas → store in SQLite → run 3 analysis queries (groupby, filter, aggregate) → visualize one result as a bar chart → write findings to a markdown summary. Bonus: schedule the pipeline to re-run on new data files.
Homework
Reflection: Sketch the architecture of a real production data pipeline you could build with skills from this week. Label each component.