Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Unsupervised pharmaceutical polymorph identification and multicomponent particle mapping of ToF-SIMS data by non-negative matrix factorization

Published

Author(s)

Thomas P. Forbes, J Greg Gillen, Amanda Souna, Jeffrey Lawrence

Abstract

Crystal polymorphism of pharmaceutical compounds directly impacts resulting physicochemical characteristics, a critical aspect in active pharmaceutical ingredient (API) production. Tools to characterize and chemically map these polymorphs at the single particle scale remain important to advancing directed manufacture of targeted polymorphs. Here, time-of-flight secondary ion mass spectrometry (ToF-SIMS) was employed for chemically imaging inkjet printed acetaminophen samples. ToF-SIMS generates large datasets of high spatial resolution images. Extracting relevant data and peaks of interest can be laborious for, and biased by, users. Advances in machine learning approaches have introduced many supervised and unsupervised methods for data analysis. In this study, we apply non-negative matrix factorization (NMF) for the unsupervised analysis of ToF-SIMS chemical image data. More specifically, an expanded variant of NMF, NMFk, was employed to determine the dataset's latent dimensionality. NMFk combines the spectral unmixing of traditional NMF with k-means clustering of the resulting factors and an optimization of the reconstruction and clustering. The method was used to identify the number of polymorph phases – and their representative mass spectra – generated from inkjet printed acetaminophen samples. Amorphous, crystalline form I, and crystalline form II polymorphs were observed. The learned polymorph mass spectra were then used to map the learned polymorphs onto subsequent particle samples of acetaminophen. Finally, NMFk also enabled the decomposition of mixed particle samples (i.e., migraine medicine), learning the number of compounds and their composition. The extracted constituent phase mass spectra – representing single compounds – were searched against mass spectral libraries for identification.
Citation
Analytical Chemistry
Volume
94
Issue
47

Keywords

Unsupervised machine learning, Non-negative matrix factorization, ToF-SIMS, Polymorph, Pharmaceuticals, Chemical mapping

Citation

Forbes, T. , Gillen, J. , Souna, A. and Lawrence, J. (2022), Unsupervised pharmaceutical polymorph identification and multicomponent particle mapping of ToF-SIMS data by non-negative matrix factorization, Analytical Chemistry, [online], https://doi.org/10.1021/acs.analchem.2c03913, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=935288 (Accessed February 8, 2023)
Created November 15, 2022, Updated November 29, 2022