Data Scientist, AI — AIG
- Led gradient boosting risk-adjustment models on claims and clinical data, lifting scoring precision +28% for value-based care underwriting.
- Orchestrated PySpark/Snowflake pipelines across 2 TB/day of healthcare data, cutting feature engineering latency −37%.
- Designed transformer-based NLP to extract structured medical entities from unstructured clinical notes, boosting accuracy +26%.
- Established MLflow + CI/CD experimentation flow, shortening model deployment timelines −43% with full audit traceability.
- Built unsupervised anomaly detection for fraud, raising detection precision +24% on large-scale claim systems.
- Containerized inference with Docker / Kubernetes for batch and real-time prediction across cloud environments.