What is the difference between the 'curated' and 'calculated' receptor sequence information?

Users will see in the IEDB receptor exports that there are two groups of fields for the VDJ Genes and for the CDR 1, 2, 3 and junction sequences; described as either ‘curated’ or ‘calculated’. Below is the difference between the two types:

  • Curated - The curated data are as reported by our team of curators. Typically, these values are as the authors specified them in the original publication, unless informed changes were made to support standardization.
  • Calculated - The calculated sequences and genes are the output of our standardization and validation pipeline. A combination of different tools is used to ensure sequence information is presented according to IMGT numbering, and VDJ information matches valid IMGT nomenclature for alleles, genes or subgroups for the given species.

Users are recommended to use the calculated data fields in their computational analyses. The curated fields are kept for reference, and can be used to cross-check information on a case-by-case basis.