Comparison of dissimilarity measures for cluster analysis of X-ray diffraction data from combinatorial libraries

Aaron Gilad Kusne; Ichiro Takeuchi; Yuma Iwasaki

Published

February 3, 2017

Author(s)

Aaron Gilad Kusne, Ichiro Takeuchi, Yuma Iwasaki

Abstract

Machine learning techniques have proven invaluable for managing the ever-growing volume of materials research data produced as high-throughput materials simulation, fabrication, and characterization continue to develop. In particular, machine learning techniques have been shown to rapidly and automatically identify potential composition–phase maps from structural characterization data of composition spread libraries, enabling rapid materials fabrication-structure-property analysis and functional materials discovery. A key issue in developing an automated phase-diagram determination method is the choice of dissimilarity measure, or kernel function. The desired measure reduces the impact of confounding structural data issues on analysis performance. These issues include peak height changes and peak shifting due to lattice constant change as a function of composition. In this work, we investigate the choice of dissimilarity measure in X-ray diffraction-based structure analysis and the impact of that choice on the performance of automatic composition–phase map determination. Nine dissimilarity measures are investigated for their impact on the analysis of X-ray diffraction patterns for an Fe–Co–Ni ternary alloy composition spread. The cosine, Pearson correlation coefficient, and Jensen–Shannon divergence measures are shown to provide the best performance in the presence of peak height change and peak shifting (due to lattice constant change) when the magnitude of peak shifting is unknown. With prior knowledge of the maximum peak shifting, dynamic time warping in a normalized constrained mode provides the best performance. This work also demonstrates a strategy for rapid analysis of large numbers of X-ray diffraction patterns in general, beyond data from combinatorial libraries.
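As a rough illustration of four of the measures named in the abstract, the Python sketch below computes cosine, Pearson, and Jensen–Shannon dissimilarities, plus a window-constrained dynamic time warping cost, on synthetic patterns. It is not the authors' implementation. The Gaussian peak model, the peak positions, the 0.1-degree grid, the base-2 logarithm, the absolute-difference local cost, and the band-style window constraint are all assumptions made for this example, and the normalization used in the paper's "normalized constrained mode" is not specified in the abstract, so none is applied here.

```python
import numpy as np


def cosine_dissimilarity(p, q):
    """1 - cosine similarity; unchanged by rescaling either pattern."""
    return 1.0 - np.dot(p, q) / (np.linalg.norm(p) * np.linalg.norm(q))


def pearson_dissimilarity(p, q):
    """1 - Pearson correlation; unchanged by rescaling or a constant offset."""
    return 1.0 - np.corrcoef(p, q)[0, 1]


def jensen_shannon_divergence(p, q):
    """Jensen-Shannon divergence (base 2) of patterns normalized to unit sum."""
    p = p / p.sum()
    q = q / q.sum()
    m = 0.5 * (p + q)

    def kl(a, b):
        mask = a > 0  # terms with a == 0 contribute nothing
        return np.sum(a[mask] * np.log2(a[mask] / b[mask]))

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def constrained_dtw(p, q, window):
    """DTW cost where index i may only align with j if |i - j| <= window."""
    n, m = len(p), len(q)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(max(1, i - window), min(m, i + window) + 1):
            d = abs(p[i - 1] - q[j - 1])  # assumed local cost
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]


def gaussian_pattern(two_theta, centers, heights, width=0.3):
    """Hypothetical diffraction pattern: a sum of Gaussian peaks."""
    pattern = np.zeros_like(two_theta)
    for c, h in zip(centers, heights):
        pattern += h * np.exp(-0.5 * ((two_theta - c) / width) ** 2)
    return pattern


# Synthetic test cases (all values are illustrative assumptions).
two_theta = np.linspace(40.0, 60.0, 201)                              # 0.1 degree steps
reference = gaussian_pattern(two_theta, [45.0, 52.0], [1.0, 0.5])
scaled = 3.0 * reference                                              # peak height change only
shifted = gaussian_pattern(two_theta, [45.2, 52.2], [1.0, 0.5])       # small peak shift
other_phase = gaussian_pattern(two_theta, [43.0, 57.0], [1.0, 0.5])   # different peak set

for name, pattern in [("scaled", scaled), ("shifted", shifted), ("other", other_phase)]:
    print(
        name,
        cosine_dissimilarity(reference, pattern),
        pearson_dissimilarity(reference, pattern),
        jensen_shannon_divergence(reference, pattern),
        constrained_dtw(reference, pattern, window=3),
    )
```

In this toy setting the first three measures are essentially zero for the rescaled pattern, which is the insensitivity to peak height change that the abstract asks for. The `window` argument plays the role of prior knowledge of the maximum peak shift: here a shift of 0.2 degrees is two grid points, so a window of three points lets the warping path absorb it.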

Citation

npj Computational Materials

Pub Type

Journals

Download Paper

https://doi.org/10.1038/s41524-017-0006-2

Keywords

dissimilarity measures, X-ray diffraction, composition spread, machine learning, materials informatics

Statistical analysis, Materials characterization and Applied AI

Citation

Kusne, A., Takeuchi, I. and Iwasaki, Y. (2017), Comparison of dissimilarity measures for cluster analysis of X-ray diffraction data from combinatorial libraries, npj Computational Materials, [online], https://doi.org/10.1038/s41524-017-0006-2, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=921209 (Accessed September 28, 2025)

Created February 3, 2017, Updated October 14, 2021
