Lawrence Berkeley National Laboratory
A machine learning approach for packet loss prediction in science flows
- Author(s): Giannakou, A
- Dwivedi, D
- Peisert, S
- et al.
Published Web Locationhttps://doi.org/10.1016/j.future.2019.07.053
© 2019 Elsevier B.V. Science networks and their hosted applications require large and frequent data transfers, but these transfers are subject to network performance degradation, including queuing delays and packet drops. However, well known network dynamics along with limited instrumentation access complicate the creation of an accurate method that predicts different performance aspects of data transfers. In this study, we develop a lightweight machine learning tool to predict end-to-end packet retransmission in science flows of arbitrary size. We also identify the minimum set of necessary path and host measurements needed as input features in our predictor in order to achieve high accuracy. In our evaluation process our predictor demonstrated low training times and was able to provide accurate estimates (97%–99%) for packet retransmissions of data transfers of arbitrary sizes. The results also manifest that the our solution was able to predict retransmit behavior reasonably well (66%) even for previously unseen data if training and testing datasets had similar statistics.