- Pearson, Katelin D;
- Nelson, Gil;
- Aronson, Myla FJ;
- Bonnet, Pierre;
- Brenskelle, Laura;
- Davis, Charles C;
- Denny, Ellen G;
- Ellwood, Elizabeth R;
- Goëau, Hervé;
- Heberling, J Mason;
- Joly, Alexis;
- Lorieul, Titouan;
- Mazer, Susan J;
- Meineke, Emily K;
- Stucky, Brian J;
- Sweeney, Patrick;
- White, Alexander E;
- Soltis, Pamela S
Machine learning (ML) has great potential to drive scientific discovery by harvesting data from images of herbarium specimens-preserved plant material curated in natural history collections-but ML techniques have only recently been applied to this rich resource. ML has particularly strong prospects for the study of plant phenological events such as growth and reproduction. As a major indicator of climate change, driver of ecological processes, and critical determinant of plant fitness, plant phenology is an important frontier for the application of ML techniques for science and society. In the present article, we describe a generalized, modular ML workflow for extracting phenological data from images of herbarium specimens, and we discuss the advantages, limitations, and potential future improvements of this workflow. Strategic research and investment in specimen-based ML methods, along with the aggregation of herbarium specimen data, may give rise to a better understanding of life on Earth.