phenotype.hpoa output

The phenotype.hpoa file is is the core download of disease annotation data for the HPO project. Internally, the HPO project uses one so-called "small file" for each disease; information from these files is processed to make the phenotype.hpoa project, which is offered for download. The small files are not offered for download at this time.

GA4GH phenotools has functionality to export a cohort of phenopackets to small file format. For convenience, we explain this format here and explain the assumptions to code makes to generate the files.

Format

#diseaseIDdiseaseNamephenotypeIDphenotypeNameonsetIDonsetNamefrequencysexnegationmodifierdescriptionpublicationevidencebiocuration
OMIM:605275Noonan syndrome 2HP:0011636Abnormal coronary artery origin1/6PMID:30368668PCSHPO:probinson[2021-05-21]
OMIM:605275Noonan syndrome 2HP:0001928Abnormality of coagulation0/4PMID:30368668PCSORCID:0000-0002-0736-9199[2024-10-02]
OMIM:605275Noonan syndrome 2HP:0000766Abnormal sternum morphologyPMID:29469822PCSHPO:skoehler[2019-04-18]; HPO:probinson[2021-05-21]
OMIM:605275Noonan syndrome 2HP:0006721Acute lymphoblastic leukemia3/20PMID:29469822PCSORCID:0000-0002-0736-9199[2024-04-01]
OMIM:605275Noonan syndrome 2HP:0030674Antenatal onset11/18PMID:29469822PCSORCID:0000-0002-0736-9199[2024-04-01]
OMIM:605275Noonan syndrome 2HP:0001134Anterior polar cataract1/20PMID:29469822PCSORCID:0000-0002-0736-9199[2024-04-01]
OMIM:605275Noonan syndrome 2HP:0000463Anteverted nares3/20PMID:29469822PCSORCID:0000-0002-0736-9199[2024-04-01]
OMIM:605275Noonan syndrome 2HP:0002804Arthrogryposis multiplex congenita2/20PMID:29469822PCSHPO:skoehler[2019-04-18]; HPO:probinson[2021-05-21]
OMIM:605275Noonan syndrome 2HP:0001631Atrial septal defect2/3PMID:30368668PCSHPO:probinson[2021-05-21]

Note that this file has one line that specifies the frequency of each HPO term in the cohort of individuals described in the PMID. It is thus possible to have multiple likes for the same HPO with data from different PMIDs.