Lawrence Berkeley National Laboratory
Reverse image search for scientific data within and beyond the visible spectrum
- Author(s): Araujo, FHD
- Silva, RRV
- Medeiros, FNS
- Parkinson, DD
- Hexemer, A
- Carneiro, CM
- Ushizima, DM
- et al.
Published Web Locationhttps://doi.org/10.1016/j.eswa.2018.05.015
© 2018 Elsevier Ltd The explosion in the rate, quality and diversity of image acquisition instruments has propelled the development of expert systems to organize and query image collections more efficiently. Recommendation systems that handle scientific images are rare, particularly if records lack metadata. This paper introduces new strategies to enable fast searches and image ranking from large pictorial datasets with or without labels. The main contribution is the development of pyCBIR, a deep neural network software to search scientific images by content. This tool exploits convolutional layers with locality sensitivity hashing for querying images across domains through a user-friendly interface. Our results report image searches over databases ranging from thousands to millions of samples. We test pyCBIR search capabilities using three convNets against four scientific datasets, including samples from cell microscopy, microtomography, atomic diffraction patterns, and materials photographs to demonstrate 95% accurate recommendations in most cases. Furthermore, all scientific data collections are released.