Release 4 5 Snapshot #1

This page is part of the FHIR Specification (v4.0.1: R4 - Mixed Normative and STU ) in it's permanent home (it will always be available at this URL). (v5.0.0-snapshot1: R5 Snapshot #1). The current version which supercedes this version is 5.0.0 . For a full list of available versions, see the Directory of published versions . Page versions: R5 R4B R4

Clinical Genomics Work Group Maturity Level : 1   Trial Use Security Category : Patient Compartments : Patient

Detailed Descriptions for the elements in the MolecularSequence resource.

MolecularSequence
Element Id MolecularSequence
Definition

Raw data describing a biological sequence.

Cardinality 0..*
Type DomainResource
Summary true false
Invariants
Defined on this element
msq-3 Rule Only 0 and 1 are valid for coordinateSystem coordinateSystem = 1 or coordinateSystem = 0
MolecularSequence.identifier
Element Id MolecularSequence.identifier
Definition

A unique identifier for this particular sequence instance. This is a FHIR-defined id.

Note This is a business identifier, not a resource identifier (see discussion )
Cardinality 0..*
Type Identifier
Requirements

Allows sequences to be distinguished and referenced.

Summary true
MolecularSequence.type
Element Id MolecularSequence.type
Definition

Amino Acid Sequence/ DNA Sequence / RNA Sequence.

Cardinality 0..1
Terminology Binding sequenceType ( Required )
Type code
Summary true
MolecularSequence.coordinateSystem
Element Id MolecularSequence.coordinateSystem
Definition

Whether the sequence is numbered starting at 0 (0-based numbering or coordinates, inclusive start, exclusive end) or starting at 1 (1-based numbering, inclusive start and inclusive end).

Cardinality 1..1
Type integer
Summary true
MolecularSequence.patient
Element Id MolecularSequence.patient
Definition

The patient whose sequencing results are described by this resource.

Cardinality 0..1
Type Reference ( Patient )
Summary true
MolecularSequence.specimen
Element Id MolecularSequence.specimen
Definition

Specimen used for sequencing.

Cardinality 0..1
Type Reference ( Specimen )
Summary true
MolecularSequence.device
Element Id MolecularSequence.device
Definition

The method for sequencing, for example, chip information.

Cardinality 0..1
Type Reference ( Device )
Summary true
MolecularSequence.performer
Element Id MolecularSequence.performer
Definition

The organization or lab that should be responsible for this result.

Cardinality 0..1
Type Reference ( Organization )
Summary true
MolecularSequence.quantity
Element Id MolecularSequence.quantity
Definition

The number of copies of the sequence of interest. (RNASeq).

Cardinality 0..1
Type Quantity
Summary true
MolecularSequence.referenceSeq
Element Id MolecularSequence.referenceSeq
Definition

A sequence that is used as a reference to describe variants that are present in a sequence analyzed.

Cardinality 0..1
Summary true
Invariants
Defined on this element
msq-5 Rule GenomeBuild and chromosome must be both contained if either one of them is contained (chromosome.empty() and genomeBuild.empty()) or (chromosome.exists() and genomeBuild.exists())
msq-6 Rule Have and only have one of the following elements in referenceSeq : 1. genomeBuild ; 2 referenceSeqId; 3. referenceSeqPointer; 4. referenceSeqString; (genomeBuild.count()+referenceSeqId.count()+ referenceSeqPointer.count()+ referenceSeqString.count()) = 1
MolecularSequence.referenceSeq.chromosome
Element Id MolecularSequence.referenceSeq.chromosome
Definition

Structural unit composed of a nucleic acid molecule which controls its own replication through the interaction of specific proteins at one or more origins of replication ( SO:0000340 ).

Cardinality 0..1
Terminology Binding chromosome-human ( Example )
Type CodeableConcept
Summary true
MolecularSequence.referenceSeq.genomeBuild
Element Id MolecularSequence.referenceSeq.genomeBuild
Definition

The Genome Build used for reference, following GRCh build versions e.g. 'GRCh 37'. Version number must be included if a versioned release of a primary build was used.

Cardinality 0..1
Type string
Summary true
MolecularSequence.referenceSeq.orientation
Element Id MolecularSequence.referenceSeq.orientation
Definition

A relative reference to a DNA strand based on gene orientation. The strand that contains the open reading frame of the gene is the "sense" strand, and the opposite complementary strand is the "antisense" strand.

Cardinality 0..1
Terminology Binding orientationType ( Required )
Type code
Summary true
MolecularSequence.referenceSeq.referenceSeqId
Element Id MolecularSequence.referenceSeq.referenceSeqId
Definition

Reference identifier of reference sequence submitted to NCBI. It must match the type in the MolecularSequence.type field. For example, the prefix, “NG_” identifies reference sequence for genes, “NM_” for messenger RNA transcripts, and “NP_” for amino acid sequences.

Cardinality 0..1
Terminology Binding E n s e m b l ENSEMBL ( Example )
Type CodeableConcept
Summary true
MolecularSequence.referenceSeq.referenceSeqPointer
Element Id MolecularSequence.referenceSeq.referenceSeqPointer
Definition

A pointer to another MolecularSequence entity as reference sequence.

Cardinality 0..1
Type Reference ( MolecularSequence )
Summary true
MolecularSequence.referenceSeq.referenceSeqString
Element Id MolecularSequence.referenceSeq.referenceSeqString
Definition

A string like "ACGT".

Cardinality 0..1
Type string
Summary true
MolecularSequence.referenceSeq.strand
Element Id MolecularSequence.referenceSeq.strand
Definition

An absolute reference to a strand. The Watson strand is the strand whose 5'-end is on the short arm of the chromosome, and the Crick strand as the one whose 5'-end is on the long arm.

Cardinality 0..1
Terminology Binding strandType ( Required )
Type code
Summary true
MolecularSequence.referenceSeq.windowStart
Element Id MolecularSequence.referenceSeq.windowStart
Definition

Start position of the window on the reference sequence. If the coordinate system is either 0-based or 1-based, then start position is inclusive.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.referenceSeq.windowEnd
Element Id MolecularSequence.referenceSeq.windowEnd
Definition

End position of the window on the reference sequence. If the coordinate system is 0-based then end is exclusive and does not include the last position. If the coordinate system is 1-base, then end is inclusive and includes the last position.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.variant
Element Id MolecularSequence.variant
Definition

The definition of variant here originates from Sequence ontology ( variant_of ). This element can represent amino acid or nucleic sequence change(including insertion,deletion,SNP,etc.) It can represent some complex mutation or segment variation with the assist of CIGAR string.

Cardinality 0..*
Summary true
MolecularSequence.variant.start
Element Id MolecularSequence.variant.start
Definition

Start position of the variant on the reference sequence. If the coordinate system is either 0-based or 1-based, then start position is inclusive.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.variant.end
Element Id MolecularSequence.variant.end
Definition

End position of the variant on the reference sequence. If the coordinate system is 0-based then end is exclusive and does not include the last position. If the coordinate system is 1-base, then end is inclusive and includes the last position.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.variant.observedAllele
Element Id MolecularSequence.variant.observedAllele
Definition

An allele is one of a set of coexisting sequence variants of a gene ( SO:0001023 ). Nucleotide(s)/amino acids from start position of sequence to stop position of sequence on the positive (+) strand of the observed sequence. When the sequence type is DNA, it should be the sequence on the positive (+) strand. This will lay in the range between variant.start and variant.end.

Cardinality 0..1
Type string
Summary true
MolecularSequence.variant.referenceAllele
Element Id MolecularSequence.variant.referenceAllele
Definition

An allele is one of a set of coexisting sequence variants of a gene ( SO:0001023 ). Nucleotide(s)/amino acids from start position of sequence to stop position of sequence on the positive (+) strand of the reference sequence. When the sequence type is DNA, it should be the sequence on the positive (+) strand. This will lay in the range between variant.start and variant.end.

Cardinality 0..1
Type string
Summary true
MolecularSequence.variant.cigar
Element Id MolecularSequence.variant.cigar
Definition

Extended CIGAR string for aligning the sequence with reference bases. See detailed documentation here .

Cardinality 0..1
Type string
Summary true
MolecularSequence.variant.variantPointer
Element Id MolecularSequence.variant.variantPointer
Definition

A pointer to an Observation containing variant information.

Cardinality 0..1
Type Reference ( Observation )
Summary true
MolecularSequence.observedSeq
Element Id MolecularSequence.observedSeq
Definition

Sequence that was observed. It is the result marked by referenceSeq along with variant records on referenceSeq. This shall start from referenceSeq.windowStart and end by referenceSeq.windowEnd.

Cardinality 0..1
Type string
Summary true
MolecularSequence.quality
Element Id MolecularSequence.quality
Definition

An experimental feature attribute that defines the quality of the feature in a quantitative way, such as a phred quality score ( SO:0001686 ).

Cardinality 0..*
Summary true
MolecularSequence.quality.type
Element Id MolecularSequence.quality.type
Definition

INDEL / SNP / Undefined variant.

Cardinality 1..1
Terminology Binding qualityType ( Required )
Type code
Summary true
MolecularSequence.quality.standardSequence
Element Id MolecularSequence.quality.standardSequence
Definition

Gold standard sequence used for comparing against.

Cardinality 0..1
Terminology Binding F d a- standard sequence FDA-StandardSequence ( Example )
Type CodeableConcept
Summary true
MolecularSequence.quality.start
Element Id MolecularSequence.quality.start
Definition

Start position of the sequence. If the coordinate system is either 0-based or 1-based, then start position is inclusive.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.quality.end
Element Id MolecularSequence.quality.end
Definition

End position of the sequence. If the coordinate system is 0-based then end is exclusive and does not include the last position. If the coordinate system is 1-base, then end is inclusive and includes the last position.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.quality.score
Element Id MolecularSequence.quality.score
Definition

The score of an experimentally derived feature such as a p-value ( SO:0001685 ).

Cardinality 0..1
Type Quantity
Summary true
MolecularSequence.quality.method
Element Id MolecularSequence.quality.method
Definition

Which method is used to get sequence quality.

Cardinality 0..1
Terminology Binding F d a- method FDA-Method ( Example )
Type CodeableConcept
Summary true
MolecularSequence.quality.truthTP
Element Id MolecularSequence.quality.truthTP
Definition

True positives, from the perspective of the truth data, i.e. the number of sites in the Truth Call Set for which there are paths through the Query Call Set that are consistent with all of the alleles at this site, and for which there is an accurate genotype call for the event.

Cardinality 0..1
Type decimal
Summary true
MolecularSequence.quality.queryTP
Element Id MolecularSequence.quality.queryTP
Definition

True positives, from the perspective of the query data, i.e. the number of sites in the Query Call Set for which there are paths through the Truth Call Set that are consistent with all of the alleles at this site, and for which there is an accurate genotype call for the event.

Cardinality 0..1
Type decimal
Summary true
MolecularSequence.quality.truthFN
Element Id MolecularSequence.quality.truthFN
Definition

False negatives, i.e. the number of sites in the Truth Call Set for which there is no path through the Query Call Set that is consistent with all of the alleles at this site, or sites for which there is an inaccurate genotype call for the event. Sites with correct variant but incorrect genotype are counted here.

Cardinality 0..1
Type decimal
Summary true
MolecularSequence.quality.queryFP
Element Id MolecularSequence.quality.queryFP
Definition

False positives, i.e. the number of sites in the Query Call Set for which there is no path through the Truth Call Set that is consistent with this site. Sites with correct variant but incorrect genotype are counted here.

Cardinality 0..1
Type decimal
Summary true
MolecularSequence.quality.gtFP
Element Id MolecularSequence.quality.gtFP
Definition

The number of false positives where the non-REF alleles in the Truth and Query Call Sets match (i.e. cases where the truth is 1/1 and the query is 0/1 or similar).

Cardinality 0..1
Type decimal
Summary true
MolecularSequence.quality.precision
Element Id MolecularSequence.quality.precision
Definition

QUERY.TP / (QUERY.TP + QUERY.FP).

Cardinality 0..1
Type decimal
Summary true
MolecularSequence.quality.recall
Element Id MolecularSequence.quality.recall
Definition

TRUTH.TP / (TRUTH.TP + TRUTH.FN).

Cardinality 0..1
Type decimal
Summary true
MolecularSequence.quality.fScore
Element Id MolecularSequence.quality.fScore
Definition

Harmonic mean of Recall and Precision, computed as: 2 * precision * recall / (precision + recall).

Cardinality 0..1
Type decimal
Summary true
MolecularSequence.quality.roc
Element Id MolecularSequence.quality.roc
Definition

Receiver Operator Characteristic (ROC) Curve to give sensitivity/specificity tradeoff.

Cardinality 0..1
Summary true
MolecularSequence.quality.roc.score
Element Id MolecularSequence.quality.roc.score
Definition

Invidual data point representing the GQ (genotype quality) score threshold.

Cardinality 0..*
Type integer
Summary true
MolecularSequence.quality.roc.numTP
Element Id MolecularSequence.quality.roc.numTP
Definition

The number of true positives if the GQ score threshold was set to "score" field value.

Cardinality 0..*
Type integer
Summary true
MolecularSequence.quality.roc.numFP
Element Id MolecularSequence.quality.roc.numFP
Definition

The number of false positives if the GQ score threshold was set to "score" field value.

Cardinality 0..*
Type integer
Summary true
MolecularSequence.quality.roc.numFN
Element Id MolecularSequence.quality.roc.numFN
Definition

The number of false negatives if the GQ score threshold was set to "score" field value.

Cardinality 0..*
Type integer
Summary true
MolecularSequence.quality.roc.precision
Element Id MolecularSequence.quality.roc.precision
Definition

Calculated precision if the GQ score threshold was set to "score" field value.

Cardinality 0..*
Type decimal
Summary true
MolecularSequence.quality.roc.sensitivity
Element Id MolecularSequence.quality.roc.sensitivity
Definition

Calculated sensitivity if the GQ score threshold was set to "score" field value.

Cardinality 0..*
Type decimal
Summary true
MolecularSequence.quality.roc.fMeasure
Element Id MolecularSequence.quality.roc.fMeasure
Definition

Calculated fScore if the GQ score threshold was set to "score" field value.

Cardinality 0..*
Type decimal
Summary true
MolecularSequence.readCoverage
Element Id MolecularSequence.readCoverage
Definition

Coverage (read depth or depth) is the average number of reads representing a given nucleotide in the reconstructed sequence.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.repository
Element Id MolecularSequence.repository
Definition

Configurations of the external repository. The repository shall store target's observedSeq or records related with target's observedSeq.

Cardinality 0..*
Summary true
MolecularSequence.repository.type
Element Id MolecularSequence.repository.type
Definition

Click and see / RESTful API / Need login to see / RESTful API with authentication / Other ways to see resource.

Cardinality 1..1
Terminology Binding repositoryType ( Required )
Type code
Summary true
MolecularSequence.repository.url
Element Id MolecularSequence.repository.url
Definition

URI of an external repository which contains further details about the genetics data.

Cardinality 0..1
Type uri
Summary true
MolecularSequence.repository.name
Element Id MolecularSequence.repository.name
Definition

URI of an external repository which contains further details about the genetics data.

Cardinality 0..1
Type string
Summary true
MolecularSequence.repository.datasetId
Element Id MolecularSequence.repository.datasetId
Definition

Id of the variant in this external repository. The server will understand how to use this id to call for more info about datasets in external repository.

Cardinality 0..1
Type string
Summary true
MolecularSequence.repository.variantsetId
Element Id MolecularSequence.repository.variantsetId
Definition

Id of the variantset in this external repository. The server will understand how to use this id to call for more info about variantsets in external repository.

Cardinality 0..1
Type string
Summary true
MolecularSequence.repository.readsetId
Element Id MolecularSequence.repository.readsetId
Definition

Id of the read in this external repository.

Cardinality 0..1
Type string
Summary true
MolecularSequence.pointer
Element Id MolecularSequence.pointer
Definition

Pointer to next atomic sequence which at most contains one variant.

Cardinality 0..*
Type Reference ( MolecularSequence )
Summary true
MolecularSequence.structureVariant
Element Id MolecularSequence.structureVariant
Definition

Information about chromosome structure variation.

Cardinality 0..*
Summary true
MolecularSequence.structureVariant.variantType
Element Id MolecularSequence.structureVariant.variantType
Definition

Information about chromosome structure variation DNA change type.

Cardinality 0..1
Terminology Binding LOINC LL379-9 answerlist :
Type CodeableConcept
Summary true
MolecularSequence.structureVariant.exact
Element Id MolecularSequence.structureVariant.exact
Definition

Used to indicate if the outer and inner start-end values have the same meaning.

Cardinality 0..1
Type boolean
Summary true
MolecularSequence.structureVariant.length
Element Id MolecularSequence.structureVariant.length
Definition

Length of the variant chromosome.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.structureVariant.outer
Element Id MolecularSequence.structureVariant.outer
Definition

Structural variant outer.

Cardinality 0..1
Summary true
MolecularSequence.structureVariant.outer.start
Element Id MolecularSequence.structureVariant.outer.start
Definition

Structural variant outer start. If the coordinate system is either 0-based or 1-based, then start position is inclusive.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.structureVariant.outer.end
Element Id MolecularSequence.structureVariant.outer.end
Definition

Structural variant outer end. If the coordinate system is 0-based then end is exclusive and does not include the last position. If the coordinate system is 1-base, then end is inclusive and includes the last position.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.structureVariant.inner
Element Id MolecularSequence.structureVariant.inner
Definition

Structural variant inner.

Cardinality 0..1
Summary true
MolecularSequence.structureVariant.inner.start
Element Id MolecularSequence.structureVariant.inner.start
Definition

Structural variant inner start. If the coordinate system is either 0-based or 1-based, then start position is inclusive.

Cardinality 0..1
Type integer
Summary true
MolecularSequence.structureVariant.inner.end
Element Id MolecularSequence.structureVariant.inner.end
Definition

Structural variant inner end. If the coordinate system is 0-based then end is exclusive and does not include the last position. If the coordinate system is 1-base, then end is inclusive and includes the last position.

Cardinality 0..1
Type integer
Summary true