Pheno4J: a gene to phenotype graph database


Efficient storage and querying of large amounts of genetic and phenotypic data are crucial to contemporary clinical genetic research. The sparsity and sheer volume of these data pose computational challenges for classical relational databases. Our Java-based solution loads annotated genetic variants and well-phenotyped patients into a graph database, allowing fast, efficient storage and querying of large volumes of structured genetic and phenotypic data. This abstracts away technical problems and lets researchers focus on the science rather than the implementation. We have also developed an accompanying web server with endpoints to facilitate querying of the database.
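To illustrate why a graph model suits this kind of sparse, highly connected data, the following is a minimal in-memory sketch in Python. It is not the Pheno4J implementation or schema: the class `PhenoGraph`, the node labels (`Patient`, `Phenotype`, `Variant`, `Gene`) and all identifiers are hypothetical placeholders chosen for illustration, and a plain dictionary stands in for a real graph database.

```python
from collections import defaultdict


class PhenoGraph:
    """Tiny in-memory labelled graph; a stand-in for a real graph database."""

    def __init__(self):
        # Maps a (label, id) node to the set of (label, id) nodes it is linked to.
        self.edges = defaultdict(set)

    def link(self, a, b):
        # Store the relationship in both directions so it can be walked either way.
        self.edges[a].add(b)
        self.edges[b].add(a)

    def neighbours(self, node, label):
        # Neighbours of `node` that carry the given label, in sorted order.
        return sorted(n for n in self.edges[node] if n[0] == label)

    def genes_for_phenotype(self, term_id):
        # Walk phenotype -> patient -> variant -> gene and collect gene ids.
        genes = set()
        for patient in self.neighbours(("Phenotype", term_id), "Patient"):
            for variant in self.neighbours(patient, "Variant"):
                for gene in self.neighbours(variant, "Gene"):
                    genes.add(gene[1])
        return sorted(genes)


# Hypothetical toy data: only relationships that exist are stored,
# so sparsity costs nothing (no empty patient-by-variant cells).
graph = PhenoGraph()
graph.link(("Variant", "var1"), ("Gene", "GENE_X"))
graph.link(("Variant", "var2"), ("Gene", "GENE_Y"))
graph.link(("Variant", "var3"), ("Gene", "GENE_Z"))

graph.link(("Patient", "P1"), ("Phenotype", "PHENO_A"))
graph.link(("Patient", "P1"), ("Variant", "var1"))

graph.link(("Patient", "P2"), ("Phenotype", "PHENO_A"))
graph.link(("Patient", "P2"), ("Phenotype", "PHENO_B"))
graph.link(("Patient", "P2"), ("Variant", "var2"))

graph.link(("Patient", "P3"), ("Phenotype", "PHENO_B"))
graph.link(("Patient", "P3"), ("Variant", "var3"))

print(graph.genes_for_phenotype("PHENO_A"))
print(graph.genes_for_phenotype("PHENO_B"))
```

In a relational layout the same question would typically require joins across several large, mostly empty tables; here the query is a short walk along stored relationships, which is the property a graph database exploits at scale.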

Availability and implementation

The Java and Python code are available at


Supplementary information

Supplementary data are available at Bioinformatics online.
