The data file I added contained multiple columns, but in the assay annotation, I just pointed to the file. Is there a way to specify which column belongs to which sample?
Yes! For this, you can make use of data fragment selectors.
There exist various different file formats and each requires its own format for annotating fragments as well.
In this case, we have a tabular file format, for which our tooling provides help.
We will now add fragment selectors to the previously created annotation table to connect our sample to specific data points.
This way, provenance is persistently tracked. Data scientists down the line will thank you for the increase in machine-actionability.
ARCitect
In the “SugarMeasurement” assay, open the “Sugar Measurement” table
On top of the annotation table, open the “Data Annotator” tool
Click Choose a file, find and select sugar_result.csv and click Open
On top switch from “autodetect” to “output”
Select only the 6(!) Data columns (RT_M1, RT_M2, RT_M3, Cold_M1, Cold_M2, Cold_M3). These are the direct results of the 6 samples.
Wow, this is very verbose, even though it only took a few clicks! And you say data scientists will now finally be content?
Yes, but with a final bit of annotation you might even raise a smile from them.
Using data fragment selectors in the annotation table, you described the provenance between your samples and your data fragments.
In addition, you can further annotate these data fragments and even those which do not directly originate from any of the samples.
For example, the column containing the names of the sugars. This annotation is done in the datamap.
We’ll now use the datamap to annotate all columns in the file, even the ones we didn’t cover in the annotation table.
ARCitect
Go back to assay metadata (the sheet named Assay)
In the Datamap section click Add Datamap to add a sheet Data Map
Open the Data Map table sheet at the bottom
Open the “Data Annotator” tool and select the same file (sugar_result.csv) as before.
Click on Open Annotator and this time select all 7 columns and click Submit
From the same file select all 7 columns
The first column of my result table “sugar_result.csv” contains the name of the measured sugar, that is the molecule name.
The other six columns contain the measured data: molecule count in mmol / kg.
Add the following annotation to describe what the columns in “sugar_result.csv”