Skip to main content

NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.

Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.

U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Application of Data Science Tools to Quantify and Distinguish between Structures and Models in Molecular Dynamics Datasets

Published

Author(s)

Surya R. Kalidindi, Joshua Gomberg, Zachary Trautt, Chandler Becker

Abstract

Structure quantification is key to successful mining and extraction of core materials knowledge from both multiscale simulations as well as multiscale experiments. The main challenge stems from the need to transform the inherently high dimensional representations demanded by the rich hierarchical material structure into useful, high value, low dimensional representations. In this paper, we develop and demonstrate the merits of a data-driven approach for addressing this challenge at the atomic scale. The approach presented here is built on prior successes demonstrated for mesoscale representations of material internal structure, and involves three main steps: (i) digital representation of the material structure, (ii) extraction of a comprehensive set of structure measures using the framework of n-point spatial correlations, and (iii) identification of data-driven low dimensional measures using principal component analyses. These novel protocols, applied on an ensemble of structure datasets output from molecular dynamics (MD) simulations, have successfully classified the datasets based on several model input parameters such as the interatomic potential and the temperature used in the MD simulations.
Citation
Nanotechnology

Keywords

multiscale modeling, principal component analysis, molecular dynamics

Citation

Kalidindi, S. , Gomberg, J. , Trautt, Z. and Becker, C. (2015), Application of Data Science Tools to Quantify and Distinguish between Structures and Models in Molecular Dynamics Datasets, Nanotechnology, [online], https://doi.org/10.1088/0957-4484/26/34/344006, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=918634 (Accessed October 24, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created August 2, 2015, Updated October 14, 2021
Was this page helpful?