Curriculum Vitae

Last updated: February 25, 2026.

Skills

  • Python: pandas, NumPy, SciPy, statsmodels, scikit-learn, PyTorch, SpaCy
  • R: tidyverse, ggplot2, survey, survival, lme4, tidymodels, multcomp, car
  • SQL: SQLite, PostgreSQL, Snowflake SQL
  • Big Data & Cloud: Apache Spark, SparkR, PySpark, AWS, parallelization
  • GIS: ArcGIS, R (raster, sf), Python (GeoPandas)
  • Visualization & Tools: Tableau, Power BI, Git/GitHub, JavaScript, HTML, CSS, LaTeX, SAS, SPSS
  • Statistical & Bayesian Analysis: generalized linear models, survival analysis, Bayesian modeling, post-stratification weighting
  • Experimental Design & Causal Inference: study design, observational analysis, causal structure, model diagnostics
  • Machine Learning & AI: supervised/unsupervised learning, deep learning, neural nets, LLMs, AI Agents, NLP, OCR, time-series modeling
  • Health Data & Interoperability: EHR, claims data, ICD-9/10 CM, SNOMED CT, CPT, RxNorm, LOINC HL7, FHIR

Education

University of California, Berkeley — School of Public Health

Aug 2025–May 2026

M.P.H. Epidemiology & Biostatistics; Graduate Certificate in Health Management

  • Select Coursework: Biostatistics, Causal Inference for Public Health, Applied Machine Learning, Healthcare Information Systems, Epidemiologic Methods and Analysis, Multivariate Statistics, GIS and Spatial Analysis, Data Management, Health Care Finance, Strategic Management and the Health Sector

University of California, Berkeley — College of Letters & Science

Aug 2021–May 2025

B.A. Public Health, Honors; Data Science Minor

  • Select Coursework: Epidemiology and Human Disease, Health Policy and Management, Principles and Techniques of Data Science, Probability and Statistics, Computer Programs, Linear Algebra and Differential Equations, Data Ethics, Biochemistry and Molecular Biology, General and Organic Chemistry

Experience

Remais Lab at UC Berkeley

Sep 2023–Present

Researcher

  • Designed and operated scalable, pipelines to analyze 20+ years of HCUP National Inpatient Sample EHR data for high-throughput population health analysis.
  • Built and documented Spark utilities to support secure cloud-based analysis of Oracle Cerner Real World EHR, enabling secure, controlled team access.
  • Engineered webscraping tools for diagnostic code normalization, streamlining disease coding and reducing misclassification in analyses and surveillance.
  • Supported studies of invasive fungal diseases by applying GLMs and Bayesian post-stratification weighting to generate adjusted national incidence estimates.
  • Led automation of a predictive modeling pipeline for Valley Fever, integrating climate and case surveillance data to deliver reproducible, version-controlled forecasts that provided early warnings and preventive strategies for public health and environmental safety stakeholders.

D-Lab at UC Berkeley

Aug 2025–Present

Data Science Consultant

  • Collaborate with faculty, research teams, and graduate students to translate complex statistical methods into actionable insights for academic projects.
  • Provide consulting support across statistical modeling, causal inference, ML, and applied research design, handling 30+ consulting tickets per semester.
  • Helped researchers integrate large language models (LLMs) into project workflows to support text analysis, automation, and research productivity.
  • Manage consulting tickets using GitHub request repository to standardize workflows, ensuring transparent collaboration and reliable project records.

UC Berkeley School of Public Health

Aug 2025-Dec 2025

Graduate Student Instructor (PH 142: Introduction to Probability and Statistics for Biology and Public Health)

  • Taught and facilitated graduate and upper-division biostatistics course, equipping students with core statistical concepts and their health applications.
  • Developed detailed lesson plans and instructional materials, leading 2 weekly lab sections focused on applied statistical computing and R programming.
  • Created R scripts for assignments, laboratory exercises, and exam material, pushing them to GitHub to facilitate reproducibility and teaching team access.
  • Provided individualized guidance on final projects, meeting throughout the semester to support design, statistical analysis, and scientific communication.

Marin County Health & Human Services

Nov 2024–May 2025

Informatics Fellow

  • Created a department-wide data inventory repository to document data pipelines, supporting future county standardization and automation efforts.
  • Built automated R/Python pipelines using OCR to extract data from Poison Control and Coroner’s Office PDFs, improving surveillance capability and speed.
  • Designed NLP models to identify and characterize bystander CPR events in EMS narratives, enabling novel analyses of pre-hospital intervention patterns.
  • Blueprinted and trialed PyTorch scripts for extracting data from handwritten forms, utilizing image preprocessing and neural handwriting recognition models.
  • Developed and maintained interactive Power BI and Tableau dashboards that delivered timely surveillance insights to internal teams and external partners.

OtisHealth

Aug 2024–Dec 2024

Healthcare Consultant

  • Built machine learning models utilizing time-series actigraphy data to classify depressive states, highlighting wearables' role in early mental health detection.
  • Conducted meta-analysis on patient-reported and wearable data, assessing integration methods, data quality challenges, and clinical adoption in healthcare.
  • Deployed Qualtrics survey capturing physical activity and mental health data from 4,000+ Bay Area residents, garnering actionable insights for stakeholders.
  • Synthesized findings into strategic recommendations for integrating patient-reported data into health infrastructure, bridging research and policy translation.

UCSF Department of Epidemiology and Biostatistics

May 2023-July 2023

Research Fellow

  • Completed graduate-level training in research design via EPI 202: Designing Clinical Research, with emphasis on causal structure and protocol development.
  • Evaluated real-world clinical and population health studies through cohort-based discussion, emphasizing model assumptions and statistical interpretations.
  • Conducted an in-depth literature review and methodological critique of my mentor’s faculty-led study on healthcare utilization among adults experiencing homelessness, evaluating study design, sampling, weighting, and modeling strategy, and presented findings to peers.

Medical Reallocation Initiative

Feb 2022–May 2024

Chief Operating Officer

  • Supported the organization's global impact by redistributing $2M+ in surplus medical supplies to underserved countries through organized volunteer events.
  • Organized developmental project with professor Hanan Maclaren to establish a clinic in Sierra Leone, Africa by providing fundraising and medical supplies.
  • Spearheaded external partnership with NGO 33 Spine Align, supporting critical spinal orthopedic care in West Africa by creating a medical supply pipeline.

Publications

Peer-Reviewed Publications

  • Bartels JGE, Camponuri SK, Snow TT, Morgan Bustamante BL, Kane NJ, Reynolds RM, Lee A, Hoffman MA, White TC, Remais JV, Head JR. (2025). Updating the Epidemiology of Blastomycosis and Histoplasmosis in the United States Using National Electronic Health Record Data, 2013–2023. The Journal of Infectious Diseases. https://doi.org/10.1093/infdis/jiaf472.
  • Morgan Bustamante BL, Martinez EG, Lee A, Kane NJ, Camponuri SK, Reynolds RM, Snow TT, Bartels JGE, Hoffman M, Remais J, et al. (2026). Epidemiology of Aspergillosis Diagnoses in U.S. Adults Using a National EHR Database, 2013–2023. Open Forum Infectious Diseases. https://doi.org/10.1093/ofid/ofag094.

Manuscripts Under Review

  • Morgan Bustamante BL, Bartels JGE, Lee A, Kane NJ, Reynolds RM, White TC, Hoffman MA, Head JR, Remais JV. (2025). A Comparative Analysis Estimating Aspergillosis and Histoplasmosis Encounters among Inpatients in the United States Using Multi-Regional Electronic Health Record and National Discharge Data, 2014–2020. AJE Advances: Research in Epidemiology.

Licenses & Certifications

  • CCPHIT — Public Health Informatics & Data Systems Certificate (Issued May 2025)
  • CITI — Biomedical Research Certification (Issued Oct 2024)
  • CITI — Social & Behavioral Research Certification (Issued Oct 2024)