Advancing materials science through next-generation machine learning - LAAS-Micro Nano Bio Technologies
Journal Articles Current Opinion in Solid State and Materials Science Year : 2024

Advancing materials science through next-generation machine learning

Abstract

For over a decade, machine learning (ML) models have been making strides in computer vision and natural language processing (NLP), demonstrating high proficiency in specialized tasks.

The emergence of large-scale language and generative image models, such as ChatGPT and Stable Diffusion, has significantly broadened the accessibility and application scope of these technologies. Traditional predictive models are typically constrained to mapping input data to numerical values or predefined categories, limiting their usefulness beyond their designated tasks.

In contrast, contemporary models employ representation learning and generative modeling, enabling them to extract and encode key insights from a wide variety of data sources and decode them to create novel responses for desired goals. They can interpret queries phrased in natural language to deduce the intended output. In parallel, the application of ML techniques in materials science has advanced considerably, particularly in areas like inverse design, material prediction, and atomic modeling. Despite these advancements, the current models are overly specialized, hindering their potential to supplant established industrial processes. Materials science, therefore, necessitates the creation of a comprehensive, versatile model capable of interpreting humanreadable inputs, intuiting a wide range of possible search directions, and delivering precise solutions. To realize such a model, the field must adopt cutting-edge representation, generative, and foundation model techniques tailored to materials science. A pivotal component in this endeavor is the establishment of an extensive, centralized dataset encompassing a broad spectrum of research topics. This dataset could be assembled by crowdsourcing global research contributions and developing models to extract data from existing literature and represent them in a homogenous format. A massive dataset can be used to train a central model that learns the underlying physics of the target areas, which can then be connected to a variety of specialized downstream tasks.

Ultimately, the envisioned model would empower users to intuitively pose queries for a wide array of desired outcomes. It would facilitate the search for existing data that closely matches the soughtafter solutions and leverage its understanding of physics and material-behavior relationships to innovate new solutions when pre-existing ones fall short.

Fichier principal
Vignette du fichier
perspective_COSSMS_final.pdf (655.97 Ko) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-04760148 , version 1 (30-10-2024)

Identifiers

Cite

Rohit Unni, Mingyuan Zhou, Peter Wiecha, Yuebing Zheng. Advancing materials science through next-generation machine learning. Current Opinion in Solid State and Materials Science, 2024, 30, pp.101157. ⟨10.1016/j.cossms.2024.101157⟩. ⟨hal-04760148⟩
0 View
0 Download

Altmetric

Share

More