After checking that the number of topics seemed reasonable, we labeled the individual topics by hand by reading the top 20 statements for each topic and making our best guess what the cluster was about. 
Some of the topics correspond well with one of the standard templates we encourage authors to use on our Data Sharing Policy page \cite{z3qpxr} and were easy to label, such as "#5: Third party restrictions" which matched with "Data subject to third party restrictions".
Other labels were more problematic. "6: Uncharacterizable" was a cluster that included experimental sections and actual data that the authors had copied and pasted into the Data Availability statement, perhaps highlighting the need for better author instructions. "7: Mixed" had many different kinds of statements that the LDA algorithm with the given parameters had combined. Tuning the text preprocessing parameters or the LDA parameters (number of topics, other hyperparameters) might resolve this mixed topic.
Some labels are also repeated. Topics 8 and 9 are both examples of "Available on reasonable request", although the LDA algorithm has resolved them into two separate topics based on the words contained in the statements themselves.
The percentages of each topic are shown in Figure 3.