DESIGN A DATA WAREHOUSE

Problem: Design a petabyte-scale data warehouse for analytics. Key Considerations: - Columnar storage (Parquet/ORC) for analytical query efficiency - ETL/ELT pipelines using Apache Airflow for orchestration - Star schema or…

DESIGN A LOG AGGREGATION SYSTEM

Problem: Design a system to collect, store, and query logs from thousands of services. Key Considerations: - Log shippers (Fluentd/Logstash) on each host - Kafka as log buffer - Elasticsearch…