- Andreas, Jacob;
- Beguš, Gašper;
- Bronstein, Michael;
- Diamant, Roee;
- Delaney, Denley;
- Gero, Shane;
- Goldwasser, Shafi;
- Gruber, David;
- de Haas, Sarah;
- Malkin, Peter;
- Pavlov, Nikolay;
- Payne, Roger;
- Petri, Giovanni;
- Rus, Daniela;
- Sharma, Pratyusha;
- Tchernov, Dan;
- Tønnesen, Pernille;
- Torralba, Antonio;
- Vogt, Daniel;
- Wood, Robert
Machine learning has been advancing dramatically over the past decade. Most strides are human-based applications due to the availability of large-scale datasets; however, opportunities are ripe to apply this technology to more deeply understand non-human communication. We detail a scientific roadmap for advancing the understanding of communication of whales that can be built further upon as a template to decipher other forms of animal and non-human communication. Sperm whales, with their highly developed neuroanatomical features, cognitive abilities, social structures, and discrete click-based encoding make for an excellent model for advanced tools that can be applied to other animals in the future. We outline the key elements required for the collection and processing of massive datasets, detecting basic communication units and language-like higher-level structures, and validating models through interactive playback experiments. The technological capabilities developed by such an undertaking hold potential for cross-applications in broader communities investigating non-human communication and behavioral research.