
Dark solitons in Bose–Einstein condensates: a dataset for many-body physics research



Amilson R. Fritsch, Shangjie Guo, Sophia Koh, Ian Spielman, Justyna Zwolak


We establish a dataset of over 1.6 × 10^4 experimental images of Bose–Einstein condensates containing solitonic excitations to enable machine learning (ML) for many-body physics research. About 33% of this dataset has manually assigned and carefully curated labels. The remainder is automatically labeled using SolDet, an implementation of a physics-informed ML data analysis framework, consisting of a convolutional-neural-network-based classifier and object detector as well as a statistically motivated physics-informed classifier and a quality metric. This technical note constitutes the definitive reference of the dataset, providing an opportunity for the data science community to develop more sophisticated analysis tools, to further understand nonlinear many-body physics, and even to advance cold atom experiments.
Machine Learning: Science and Technology


dataset, dark solitons, machine learning, supervised learning


Fritsch, A., Guo, S., Koh, S., Spielman, I. and Zwolak, J. (2022), Dark solitons in Bose–Einstein condensates: a dataset for many-body physics research, Machine Learning: Science and Technology, [online] (Accessed February 26, 2024)
Created December 21, 2022, Updated December 29, 2022