Automated big-data cluster lifecycle (deploy/ops/maintenance) and improved reliability/efficiency for large-scale environments.
Built a data pipeline to aggregate resource usage and generate cost/profit outputs for multi-cluster environments.
Pulled video metadata and used LLM-based extraction to build a structured dataset for downstream analysis.