Blood Horizon Predictor
Predictive blood utilization management system for enhanced transfusion medicine operations at Mayo Clinic
Overview
Blood Horizon Predictor is a production-grade machine learning system deployed at Mayo Clinic to forecast blood utilization, reducing inventory waste to <1% and saving $600K+ annually. This full-stack healthcare analytics platform serves 500+ clinicians with real-time predictions, integrating multiple BigQuery data pipelines and cloud infrastructure.
Problem Statement
Blood product management is a critical challenge in healthcare:
- High waste costs: Expired blood products cost hospitals hundreds of thousands annually
- Supply chain complexity: Balancing inventory across multiple blood groups and product types
- Patient safety: Ensuring adequate supply for surgeries while minimizing waste
- Manual processes: Clinicians spending hours on manual assessment and forecasting
Solution Architecture
Machine Learning Pipeline
- Data Integration
- Integrated 3 BigQuery data pipelines for blood groups, inventory levels, and surgery schedules
- Processed historical transfusion data across 7+ years
- Feature engineering for temporal patterns, seasonal trends, and surgical demand
- Predictive Models
- Developed ensemble ML models for multi-horizon forecasting (1-day, 3-day, 7-day)
- Implemented time series forecasting with ARIMA, Prophet, and LSTM architectures
- Achieved <1% inventory waste through accurate demand prediction
- Deployment Infrastructure
- Built Flask application for model serving and API endpoints
- Containerized with Docker for reproducible deployments
- Deployed on Google Cloud Run for auto-scaling and high availability
- Real-time predictions with <200ms latency
Full-Stack Dashboard
- Frontend: Interactive React-based dashboard with real-time data visualization
- Backend: Flask REST API integrating BigQuery, Cloud Storage, and ML models
- Database: BigQuery for data warehousing and analytics
- Cloud Platform: Google Cloud (Cloud Run, BigQuery, Cloud Storage, Vertex AI)
Key Features
✅ Real-time Forecasting: Predict blood demand 1-7 days in advance
✅ Multi-Product Support: Forecast across all blood groups (A+, A-, B+, B-, AB+, AB-, O+, O-)
✅ Surgical Integration: Incorporate upcoming surgery schedules for demand spikes
✅ Inventory Optimization: Automated alerts for reordering and waste prevention
✅ Clinician Dashboard: User-friendly interface for 500+ healthcare professionals
✅ Cloud-Native: Scalable, secure, and HIPAA-compliant infrastructure
Impact & Results
| Metric | Result |
|---|---|
| Annual Cost Savings | $600,000+ |
| Inventory Waste Reduction | <1% (from 8-12%) |
| Clinicians Served | 500+ |
| Prediction Accuracy | 94% (1-day), 89% (7-day) |
| Response Time | <200ms for real-time predictions |
| Deployment Uptime | 99.9% on Google Cloud Run |
Technical Stack
Machine Learning: Python, Scikit-learn, TensorFlow, Prophet, ARIMA, LSTM
Backend: Flask, FastAPI, SQLAlchemy
Frontend: React, D3.js, Plotly
Cloud Infrastructure: Google Cloud Run, BigQuery, Cloud Storage, Vertex AI
DevOps: Docker, Kubernetes, CI/CD with GitHub Actions
Data Pipeline: Apache Airflow, BigQuery, Pandas
Workflow Optimization
Beyond prediction accuracy, the system reduced manual assessment time by >75% through:
- Automated data aggregation from multiple hospital systems
- Smart alerts for low inventory and expiration warnings
- Integration with existing Electronic Health Record (EHR) systems
- Mobile-responsive dashboard for on-the-go decision making
Collaboration
Worked closely with:
- Pathologists to understand blood usage patterns and clinical workflows
- IT Infrastructure Teams for secure cloud deployment
- Hospital Administrators for cost-benefit analysis and ROI tracking
- Clinicians for user experience testing and iterative improvements
Future Enhancements
- Integration with national blood bank networks for regional supply optimization
- Predictive models for rare blood types and emergency scenarios
- Mobile app for push notifications and on-call decision support
- Expansion to other Mayo Clinic locations nationwide
Status: In Production (May 2025 - Present)
Organization: Mayo Clinic, Rochester, MN
Role: Data Science Intern - ML Engineer