The human aldehyde dehydrogenase (ALDH) gene superfamily consists of 19 genes encoding enzymes critical for NAD(P)+-dependent oxidation of endogenous and exogenous aldehydes, including drugs and environmental toxicants. Mutations in ALDH genes are the molecular basis of several disease states (e.g., Sjögren-Larsson syndrome, pyridoxine-dependent seizures, and type II hyperprolinemia) and may contribute to the etiology of complex diseases such as cancer and Alzheimer's disease. The aim of this nomenclature update was to identify alternatively spliced transcriptional variants, principally for the human ALDH genes.
Methods
Data-mining methods were used to retrieve all human ALDH sequences. Alternatively spliced transcriptional variants were determined based on (i) criteria for sequence integrity and genomic alignment; (ii) evidence of multiple independent cDNA sequences corresponding to a variant sequence; and (iii) if available, empirical evidence of variants from the literature.
Results and conclusion
Alternatively spliced transcriptional variants and their encoded proteins exist for most of the human ALDH genes; however, their function and significance remain to be established. Compared with the human genome, the rat and mouse genomes contain an additional gene, Aldh1a7, in the ALDH1A subfamily. To avoid confusion when identifying splice variants in various genomes, nomenclature guidelines for naming such alternative transcriptional variants and proteins are recommended herein. In addition, a web database (www.aldh.org) has been developed to provide up-to-date information and nomenclature guidelines for the ALDH superfamily.