Discovering new materials underlies many innovations, and the traditional Edisonian ap- proach of brute-force screening through material databases is inefficient given the massive combinatorial search space of most material classes. Here, I target a particular class of materials that are porous and tunable, metal-organic frameworks (MOFs), where one could easily generate millions to billions of possible materials. I use various machine learning techniques such as Bayesian optimization trained on small quantities of data to identify promising candidates for methane storage as a proof-of-concept and air separations as a new material design problem. In both these studies, I show that Bayesian optimization only requires a small fraction (roughly 0.1%) of the dataset to identify candidate materials. In addition, the underlying surrogate model in Bayesian optimization, Gaussian processes, can be constructed to predict using multiple fidelities of information, thus ameliorating model inaccuracies due to a small quantity of data. Machine learning can also be used to directly learn energies and forces, which can be used to understand adsorption mechanisms in complex materials. I use this to identify new binding mechanisms of CO2 in known amine-appended MOFs and show that the same models can be used to propose new amine candidates suitable for direct air capture or improved carbon capture capacities. These two flavors of machine learning show how the materials discovery process can be accelerated dramatically by circumventing the poor computational scaling of most ab initio methods.
Cookie SettingseScholarship uses cookies to ensure you have the best experience on our website. You can manage which cookies you want us to use.Our Privacy Statement includes more details on the cookies we use and how we protect your privacy.