NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.
Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.
An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
Application of Data Science Tools to Quantify and Distinguish between Structures and Models in Molecular Dynamics Datasets
Published
Author(s)
Surya R. Kalidindi, Joshua Gomberg, Zachary Trautt, Chandler Becker
Abstract
Structure quantification is key to successful mining and extraction of core materials knowledge from both multiscale simulations as well as multiscale experiments. The main challenge stems from the need to transform the inherently high dimensional representations demanded by the rich hierarchical material structure into useful, high value, low dimensional representations. In this paper, we develop and demonstrate the merits of a data-driven approach for addressing this challenge at the atomic scale. The approach presented here is built on prior successes demonstrated for mesoscale representations of material internal structure, and involves three main steps: (i) digital representation of the material structure, (ii) extraction of a comprehensive set of structure measures using the framework of n-point spatial correlations, and (iii) identification of data-driven low dimensional measures using principal component analyses. These novel protocols, applied on an ensemble of structure datasets output from molecular dynamics (MD) simulations, have successfully classified the datasets based on several model input parameters such as the interatomic potential and the temperature used in the MD simulations.
Kalidindi, S.
, Gomberg, J.
, Trautt, Z.
and Becker, C.
(2015),
Application of Data Science Tools to Quantify and Distinguish between Structures and Models in Molecular Dynamics Datasets, Nanotechnology, [online], https://doi.org/10.1088/0957-4484/26/34/344006, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=918634
(Accessed October 24, 2025)