Pedigree¶
This element is used to represent a pedigree to describe the family relationships of each sample along with their gender and phenotype (affected status). PED files are typically used by software for genetic linkage analysis. The phenopacket schema uses conventions similar to those of PED files to promote interoperability between existing PED files and PED software, but does not actually store a PED file. See the detailed description at the PLINK website for more information about PED files.
The information in this element can be used by programs for analysis of a multi-sample VCF file with exome or genome sequences of members of a family, some of whom are affected by a Mendelian disease.
Data model
Field | Type | Status | Description |
---|---|---|---|
persons | list of Person | required | list of family members in this pedigree |
The pedigree is simply a list of Person objects. These objects are meant to reflect the elements of a PED file.
Person¶
Field | Type | Status | Description |
---|---|---|---|
family_id | string | required | application specific identifier |
individual_id | string | required | application specific identifier |
paternal_id | string | required | application specific identifier |
maternal_id | string | required | application specific identifier |
sex | Sex | required | see text |
affected_status | AffectedStatus | required | see text |
Example
{
"persons": [
{
"familyId": "family 1",
"individualId": "kindred 1A",
"paternalId": "FATHER",
"maternalId": "MOTHER",
"sex": "MALE",
"affectedStatus": "AFFECTED"
},
{
"familyId": "family 1",
"individualId": "kindred 1B",
"paternalId": "FATHER",
"maternalId": "MOTHER",
"sex": "FEMALE",
"affectedStatus": "AFFECTED"
},
{
"familyId": "family 1",
"individual_id": "MOTHER",
"paternalId": "0",
"maternalId": "0",
"sex": "FEMALE",
"affectedStatus": "UNAFFECTED"
},
{
"familyId": "family 1",
"individualId": "FATHER",
"sex": "MALE",
"paternalId": "0",
"maternalId": "0",
"affectedStatus": "UNAFFECTED"
}
]
}
AffectedStatus¶
This element is an enumeration to
Name | Description |
---|---|
MISSING | It is unknown if the individual has the affected phenotype |
UNAFFECTED | The individual does not show the affected phenotype of the proband |
AFFECTED | The individual has the affected phenotype of the proband |
In a PED file, affected persons are encoded with “2”, and unaffecteds by “1” (a “0” is used if no information is available). Instead, Phenopackets uses an enumeration as shown in the table. In a PED file, the sex of individuals is encoded as a “1” for females, “2” for males, and “0” for unknown. Phenopackets uses Sex instead.
The message is made up of a list of Person
elements (the Person element is defined within the Pedigree element).
Each Person element is equivalent to one row of a PED file.
The family ID and the individual IDs may be made up of letters and digits, and the combination of family and individual ID should uniquely identify each person represented in the PED file. The parents of a person in the pedigree are shown with the corresponding individual IDs. Individuals whose parents are not represented in the PED file are known as founders; their parents are represented by a zero (“0”) in the columns for mother and father. Finally, the sex and the affected (disease) status of the person are shown.
If a Phenopacket
is used to represent any of the individuals listed in the Pedigree
, then it is essential that
the individual_id
used in the pedigree matches the id
of the subject
of the Phenopacket
. It is allowable
for the Pedigree
to have individuals that do not have an associated Phenopacket
. This is useful, for instance,
if the Pedigree
is being used to store the affected/not affected status of family members being examined by exome or genome
sequencing. In this case (i.e. where there are no associated phenopackets for the Pedigree.individual_id
), it is
expected that the individual_id
elements match the sample identifiers of the exome/genome file.
The Pedigree object does not support reporting multiple phenotypes in one individual. The phenotype represented by the affectation status is whether the disease is present or not. If this is desired, then one would have to create full phenopackets for each individual in a family.