Request Parameter Definitions¶
Component definitions for requestProfiles
. The definition of each parameter as a component allows for reuse across multiple request patterns but also for e.g. easy referencing in OpenAPI endpoints.
AdjacencyAccession
¶
Description¶
A GA4GH RefGet identifier for the adjacent sequence in adjacency/fusion scenarios.
Definitions¶
-
versions
:VQS
-
$ref
:#/$defs/RefgetAccession
AdjacencyStart
¶
Description¶
Start position or range (i.e. in case of a fuzzy breakpoint)
on an adjacent sequence in case of a sequence adjecency
Status:
- PROPOSED FOR BEACON v2.n
- based on VRS v2 with modification
Definitions¶
-
versions
:VQS
-
oneOf
:integer
$ref: #/$defs/Range
AdjacencyEnd
¶
Description¶
End position or range (i.e. in case of a fuzzy breakpoint) on an
adjacent sequence in case of a sequence adjecency
Status:
- PROPOSED FOR BEACON v2.n
- based on VRS v2 with modification
Definitions¶
-
versions
:VQS
-
oneOf
:integer
$ref: #/$defs/Range
AminoacidChange
¶
Description¶
Aminoacid alteration of interest in 1 letter format.
Origin: Beacon v2.0
Definitions¶
-
versions
:v2.0
VQS
-
type
:string
-
examples
:V600E
M734V
CopyChange
¶
Description¶
MUST use a primaryCode representing one of:
- EFO:0030069: complete genomic loss
- EFO:0020073: high-level loss
- EFO:0030068: low-level loss
- EFO:0030067: loss
- EFO:0030064: regional base ploidy
- EFO:0030070: gain
- EFO:0030071: low-level gain
- EFO:0030072: high-level gain
Endpoints are expected to provide query expansion according to the hierarchy
of the terms:
- EFO:0030064
- EFO:0030067
|- EFO:0030068
\- EFO:0020073
\- EFO:0030069
- EFO:0030070
|- EFO:0030071
\- EFO:0030072
Origin: Beacon v2.n, based on VRS v1.3+
Definitions¶
-
versions
:VQS
-
type
:string
-
enum
:EFO:0030069
EFO:0020073
EFO:0030068
EFO:0030067
EFO:0030064
EFO:0030070
EFO:0030071
EFO:0030072
GeneId
¶
Description¶
- A gene identifier
- It is strongly suggested to use a symbol following
the HGNC (www.genenames.org) nomenclature.
Origin: Beacon v2.0
Definitions¶
-
versions
:v2.0
VQS
-
type
:string
-
examples
:BRAF
SCN5A
GenomicFeature
¶
Description¶
Descriptor of the class of genomic feature affected by the variant. Values
from SO (Sequence ontology) are recommended, e.g.
SO:0001623
for 5 prime UTR variantsSO:0000605
for intergenic variants
Note: While genes are features in SO they have a distinct parameter geneId
.
Definitions¶
type
:string
GenomicAlleleShortForm
¶
Description¶
HGVSId descriptor
Origin: Beacon v2.0
Definitions¶
-
versions
:v2.0
VQS
-
type
:string
-
examples
:NM_004006.2:c.4375C>T
MolecularEffect
¶
Description¶
Desscriptor of the molecular consequence generated by the variant. Use of
Ontology terms from SO (Sequence Ontology) are recommended, e.g.
SO:0001583
for finding missense variantsSO:0001580
for finding changes the coding sequenceSO:0001587
for finding variants with a gained stop codon
The logical path for matching to the Beacon v2 model here would be
genomicVariations.molecularAttributes.molecularEffects.id
.
Definitions¶
-
type
:string
-
examples
:SO:0001583
SO:0001580
SO:0001587
Range
¶
Description¶
An array of 2 integer values. If referring to sequence positions the
"0-based, inclusive" format is used.
Definitions¶
-
versions
:VQS
-
type
:array
items
:type
:integer
minimum
:0
minItems
:2
maxItems
:2
RefgetAccession
¶
Description¶
A GA4GH RefGet identifier for the reference sequence, i.e. either a
computed GA4GH checksum or other unique namespaced identifier supported
by the server. It replaces the referenceName
and assemblyId
parameters
according to the VRS v2 definition.
Origin: VRS v2
Reference: GA4GH RefGet
Definitions¶
-
versions
:VQS
-
type
:string
-
examples
:refseq:NC_000009.12
ga4gh:SQ.S_KjnFVz-FE7M0W6yoaUDgYxLPc1jyWU
RepeatSubunitLength
¶
Description¶
The number of residues in a repeat subunit. In contrast to the VRS model we
allow for range queries (e.g. all repeats with subunits of 1 or 2).
Origin: VRS v2
Definitions¶
-
versions
:VQS
-
oneOf
:integer
$ref: #/$defs/Range
RequestProfileId
¶
Description¶
The requestProfile
parameter here allows beacons to check the type of query
being performed and to compare the provided request parameters for conformity
with the expected query profile.
The parameter definition here is a placeholder; the definitions of typed queries
will use constant values for their requestProfile
parameter.
Origin: Beacon v2+
Definitions¶
-
versions
:VQS
-
type
:string
Sequence
¶
Description¶
DNA bases.
- Accepted values:
[ACGTN]*
- N is a wildcard, that denotes the position of any base,
and can be used as a standalone base of any type or within a partially known
sequence. As example, a query ofANNT
the Ns can take take any form of [ACGT]
and will matchANNT
,ACNT
,ACCT
,ACGT
... and so forth.
Origin: VRS v1.n
TODO: Review use of base characters.
Definitions¶
-
versions
:VQS
-
type
:string
SequenceLength
¶
Description¶
The sequence length when querying the conceptual representation of a
sequence according to a VRS ReferenceLengthExpression
class.
Additionally, the sequenceLength
parameter can be used to limit the length
of matched variants, e.g. by specifying a range of lengths for copyChange
matches.
Origin: VRS v2
Definitions¶
-
versions
:VQS
-
oneOf
:integer
$ref: #/$defs/Range
SequenceStart
¶
Description¶
Start position or range (i.e. in case of a fuzzy breakpoint) on a sequence.
Status: PROPOSED FOR BEACON v2.n, based on VRS v2 with modification
Definitions¶
-
versions
:VQS
-
oneOf
:integer
$ref: #/$defs/Range
SequenceEnd
¶
Description¶
End position or range (i.e. in case of a fuzzy breakpoint) on a sequence.
Status: PROPOSED FOR BEACON v2.n, based on VRS v2 with modification
Definitions¶
-
versions
:VQS
-
oneOf
:integer
$ref: #/$defs/Range
VariantId
¶
Description¶
- A variant identifier such as a VRSid, ClinVar id, dbSNP rsID or a
COSMIC identifier - In the default data model this query parameter corresponds to
identifiers.variantAlternateIds
but potentially can map to other
identifiers as well. However, no specific query parameterwas defined
in the Beacon v2 default model.
Status: PROPOSED FOR BEACON v2.n
Definitions¶
-
versions
:VQS
-
type
:string
-
examples
:ClinGen:CA152954
dbSNP:rs587780345
VRStype
¶
Description¶
Type of the variation according to the VRS model. Examples are here e.g.
Adjacency
or Allele
.
Origin: VRS v2
Definitions¶
-
versions
:VQS
-
type
:string
-
enum
:Adjacency
Allele
CisPhasedBlock
CopyNumberChange
CopyNumberCount
DerivativeMolecule
Terminus
Assembly
¶
Description¶
Genomic assembly accession and version as RefSqeq assembly accession
(e.g. "GCF_000001405.39") or a versioned assembly name or synonym such as UCSC
Genome Browser assembly (e.g. "hg38") or Genome Reference Consortium Human (e.g.
"GRCh38.p13") names.
DEPRECATION NOTE: The use of a assembly specific sequence identifier obviates
this parameter. Not part of VRS v2 aligned model versions.
Definitions¶
-
versions
:v1
v2.0
v2.1
-
type
:string
-
example
:GCF_000001405.39
hg38
GRCh38.p13
RefSeqId
¶
Description¶
Reference sequence id for genomic reference sequence in which variant
coordinates are given, e.g. "refseq:NC_000009.12" for human chromosome 9 in the
GRCh38 assembly. The use of the assembly specific RefSeqId is recommended
although alternatively names, synonymous or aliases e.g. "chr9" could be
used in conjunction with an Assembly
parameter.
DEPRECATION NOTE: To be replaced with the RefgetAccession
from
VRS v2.
Definitions¶
-
versions
:v1
v2.0
v2.1
-
type
:string
-
example
:refseq:NC_000009.12
chr9
NC_012920.1
ReferenceBases
¶
Description¶
The reference bases for the variant at the indicated position. It is based
on the VCF concept of having (anchored) reference bases at an indicated
genomic location in combination with alternateBases
to define their
replacement. In contrast, standards such as GA4GH VRS only indicate the
sequence
observed at a given base position, including the use of an empty
sequence together with start
+ end
positions with end - start > 0
to indicate deletions.
Origin: VCF derived (optional) use in Beacon v0.3 -> v2.1
Status: LEGACY
Definitions¶
-
versions
:v1
v2.0
v2.1
-
$ref
:#/$defs/Sequence
AlternateBases
¶
Description¶
The bases of a sequence variant at a given position differing from the
reference sequence, as defined by the referenceBases
parameter. Please
see refereenceBases
for further information.
Origin: VCF derived use in Beacon v0.3 -> v2.1
Status: LEGACY
Definitions¶
-
versions
:v1
v2.0
v2.1
-
$ref
:#/$defs/Sequence
VariantType
¶
Description¶
The variantType
is used to query variants which are not defined through
a sequence of one or more bases using the alternateBases
parameter. This
VCF derived parameter is being replaced by the more specific VRS derived
parameters such as copyChange
.
(Legacy) Examples here are e.g. structural variants:
DUP
- increased allelic count of material from the genomic region between
start
and end
positions
- no assumption about the placement of the additional sequences is being
made (i.e. no in situ requirement as tandem duplications)
DEL: deletion of sequence following start
In contrast to the updated VRS based v2.n parameters such as copyChange
the Beacon v1.1 -> v2.1 query model is not prescriptive with regard to the
values allowed for variantType
with use of extended types (e.g. EFO:0030063
)
being permitted. However, a support for the basic CNV types above - where
represented in the data - is recommended.
Status: LEGACY with potential use in v2.n for non-CNV parameters
Note: The VRS v2 copyChange
is now a partial and more specific replacement
over variantType
for copy number variations. However, additional
concepts so far have not been covered and might warrant use of an
additional parameter (variantClass
?).
Definitions¶
-
versions
:v1
v2.0
v2.1
-
type
:string
-
examples
:EFO:0030070
DUP
DEL
EFO:0030069
Start
¶
Description¶
NOTE: This parameter will be potentially replaced by the VRS based definition
which uses either an integer or a Range (2 integers) in contrast to
the use of an array with 1 or 2 integers here. The difference lies in
the format of "1 integer array" versus "1 integer".
Precise or fuzzy start coordinate position(s), allele locus
(0-based, inclusive).
start
only:- for single positions, e.g. the start of a specified sequence
alteration where the size is given through the specifiedalternateBases
- typical use are queries for SNV and small InDels
- the use of
start
without anend
parameter requires the use of
alternateBases
- 1 value in both
start
andend
: - for searching any variant falling fully or partially within the range
betweenstart
andend
(a.k.a. "range query") - additional use of
variantType
ORalternateBases
can limit the
scope of the query - by convention, partial overlaps of variants with the indicated genomic
range are accepted; for specific overlap requirements the 4-parameter
"Bracket Queries" should be employed - 2 values in both
start
andend
for constructing a "Bracket Query": - can be used to match any contiguous genomic interval, e.g. for querying
imprecise positions - identifies all structural variants starting between
start[0]
andstart[1]
,
and ending betweenend[0]
<->end[1]
- single or double sided precise matches can be achieved by setting
start[1]=start[0]+1
andend[1]=end[0]+1
Definitions¶
-
versions
:v2.0
v2.1
-
type
:array
-
items
:type
:integer
format
:int64
minimum
:0
-
minItems
:1
-
maxItems
:2
End
¶
Description¶
Notes¶
See the start
parameter for information on the potential replacement
of this parameter with the VRS based definition.
Precise or bracketing the end of the variants of interest:
- (0-based, exclusive) - see
start
- for bracket queries, provide 2 values (e.g. [111,222]).
Definitions¶
-
versions
:v2.0
v2.1
-
type
:array
-
items
:type
:integer
format
:int64
minimum
:1
-
minItems
:1
-
maxItems
:2
MateName
¶
Description¶
Notes¶
- while the
mateName
parameter was originally defined for Beacon v1.1
it was never properly documented and is not considered a part of the
supported Beacon v2.n specification. It is now fully implemented in the
VRS v2 basedadjacencyAccession
parameter.
Status: DEPRECATED in v2.n
Definitions¶
-
versions
:v1.1
v2.0
v2.1
-
$ref
:#/$defs/RefSeqId
MateStart
¶
Description¶
genomic start position of fusion partner breakpoint region
Status: DEPRECATED in v2.n (see mateName
)
Definitions¶
-
versions
:v1.1
v2.0
v2.1
-
type
:integer
MateEnd
¶
Description¶
genomic end position of fusion partner breakpoint region
Status: DEPRECATED in v2.n (see mateName
)
Definitions¶
-
versions
:v1.1
v2.0
v2.1
-
type
:integer
VariantMinLength
¶
Description¶
- Minimum length in bases of a genomic variant
- This is an optional parameter without prescribed use. While a length is
commonly available for structural variants such as copy number variations,
it is recommended that length based queries should also be supported for
variants with indicated referenceBases and alternateBases, to enable
length-specific wildcard queries.
Origin: Beacon v2.0
Status: DEPRECATED in v2.n (see sequenceLength
)
Definitions¶
-
versions
:v2.0
v2.1
-
type
:integer
-
format
:int64
-
minimum
:0
VariantMaxLength
¶
Description¶
- Maximum length in bases of a genomic variant.
- This is an optional parameter without prescribed use. While a length is
commonly available for structural variants such as copy number variations,
it is recommended that length based queries should also be supported for
variants with indicated referenceBases and alternateBases, to enable
length-specific wildcard queries.
Status: DEPRECATED in v2.n (see sequenceLength
)
Origin: Beacon v2.0
Definitions¶
-
versions
:v2.0
v2.1
-
type
:integer
-
format
:int64
-
minimum
:1