Building intelligent systems at the intersection of machine learning and healthcare. Currently pursuing my B.S. in Statistics & Data Science at UC Santa Barbara.
Machine learning solutions for real-world healthcare challenges
Built Random Forest and XGBoost models to predict prostate cancer treatment responses from gene expression data (TCGA-PRAD), achieving 70% AUC. Implemented deep learning classifiers using TensorFlow for kidney cancer data analysis.
Developed an XGBoost model to predict inpatient procedure costs from SPARCS data, achieving 91% R² with Optuna hyperparameter tuning. Added fairness audits, drift simulations, and anomaly detection, reducing manual QA time by 50%.
Built a modular ML audit framework to catch drift, data leakage, fairness issues, and schema mismatches. Automated 7+ audit workflows with CLI + SQL support, integrating SHAP explainability and demographic bias breakdowns.
Driving innovation through data science and machine learning
I'm a Data Scientist with a passion for leveraging machine learning to solve complex healthcare challenges. Currently pursuing my B.S. in Statistics & Data Science and B.A. in Economics at UC Santa Barbara, I've had the privilege of working with leading healthcare institutions to develop impactful ML solutions.
My experience in the healthcare analytics and health tech industry has given me unique insights into how data science can transform patient care. At Houston Methodist's Medical AI Lab, I developed predictive models for cancer treatment responses, while my work at Boston Children's Hospital focused on genomic sequence analysis and computational biology.
I specialize in building end-to-end machine learning pipelines, from feature engineering and model development to deployment and monitoring. My projects emphasize not just accuracy, but also interpretability, fairness, and real-world applicability.
Model Accuracy
Genomic Sequences Analyzed
Audit Time Reduction
Let's discuss how we can work together
San Francisco, CA