- Williams, Stephen A;
- Kivimaki, Mika;
- Langenberg, Claudia;
- Hingorani, Aroon D;
- Casas, JP;
- Bouchard, Claude;
- Jonasson, Christian;
- Sarzynski, Mark A;
- Shipley, Martin J;
- Alexander, Leigh;
- Ash, Jessica;
- Bauer, Tim;
- Chadwick, Jessica;
- Datta, Gargi;
- DeLisle, Robert Kirk;
- Hagar, Yolanda;
- Hinterberg, Michael;
- Ostroff, Rachel;
- Weiss, Sophie;
- Ganz, Peter;
- Wareham, Nicholas J
Proteins are effector molecules that mediate the functions of genes1,2 and modulate comorbidities3-10, behaviors and drug treatments11. They represent an enormous potential resource for personalized, systemic and data-driven diagnosis, prevention, monitoring and treatment. However, the concept of using plasma proteins for individualized health assessment across many health conditions simultaneously has not been tested. Here, we show that plasma protein expression patterns strongly encode for multiple different health states, future disease risks and lifestyle behaviors. We developed and validated protein-phenotype models for 11 different health indicators: liver fat, kidney filtration, percentage body fat, visceral fat mass, lean body mass, cardiopulmonary fitness, physical activity, alcohol consumption, cigarette smoking, diabetes risk and primary cardiovascular event risk. The analyses were prospectively planned, documented and executed at scale on archived samples and clinical data, with a total of ~85 million protein measurements in 16,894 participants. Our proof-of-concept study demonstrates that protein expression patterns reliably encode for many different health issues, and that large-scale protein scanning12-16 coupled with machine learning is viable for the development and future simultaneous delivery of multiple measures of health. We anticipate that, with further validation and the addition of more protein-phenotype models, this approach could enable a single-source, individualized so-called liquid health check.