Dear TA1-2,
As discussed airrflow requires some columns in the samplesheet that have to be present but don’t necessarily have to be filled.
We can’t make these columns optional because they are required for the AIRR Data commons standard, so when we use the functions airr::read_rearrangement it would throw an error otherwise.
I put together a little excel which assigns the samplesheet columns to “required_pipeline” which is whether or not the column must be present and “actually_required” which tells whether or not the column must be filled with sensible values so that the pipeline can run correctly.
Please be in touch if you have further questions or if something is unclear.
Best,
Mark