TY - JOUR T1 - Dark Solitons in Bose-Einstein Condensates: A Dataset for Many-body Physics Research Y1 - 2022 A1 - Amilson R. Fritsch A1 - Shangjie Guo A1 - Sophia M. Koh A1 - I. B. Spielman A1 - Justyna P. Zwolak AB -

We establish a dataset of over 1.6×104 experimental images of Bose-Einstein condensates containing solitonic excitations to enable machine learning (ML) for many-body physics research. About 33 % of this dataset has manually assigned and carefully curated labels. The remainder is automatically labeled using SolDet -- an implementation of a physics-informed ML data analysis framework -- consisting of a convolutional-neural-network-based classifier and object detector as well as a statistically motivated physics-informed classifier and a quality metric. This technical note constitutes the definitive reference of the dataset, providing an opportunity for the data science community to develop more sophisticated analysis tools, to further understand nonlinear many-body physics, and even advance cold atom experiments.

UR - https://arxiv.org/abs/2205.09114 ER -