search
  • Type
    Clear
  • Country
    Clear
  • Compatibility Level
  • Thematic
  • Jurisdiction

  • Data Repository
  • CN
  • ES

  • The database is gene-centered and organized by paralog family. It focused on the paralogs and the duplication events in the evolution. The paralog families and paralogons can be searched by text or sequence, and are downloadable from the website in plain text files.

    more_vert
  • more_vert
  • DisGeNET is a discovery platform containing one of the largest collections available of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models, and the scientific literature, and covers the whole landscape of human diseases. The current version of DisGeNET (v7.0) contains 1,134,942 gene-disease associations (GDAs), between 21,671 genes and 30,170 diseases, disorders, traits, and clinical or abnormal human phenotypes, and 369,554 variant-disease associations (VDAs), between 194,515 variants and 14,155 diseases, traits, and phenotypes. The data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype-phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, a REST API, and an R package.

    more_vert
  • Virtual Chinese Genome Database (VCGDB) is a genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. We are unsure when this database was last updated, and as such we have marked this record as Uncertain. Please contact us if you have any information on its current status.

    more_vert
  • FCP is a publicly accessible web tool dedicated to analysing the current state and trends on the population of available structures along the classification schemes of enzymes and nuclear receptors, offering both graphical and quantitative data on the degree of functional coverage in that portion of the proteome by existing structures, as well as on the bias observed in the distribution of those structures among proteins.

    more_vert
  • altitude; temperature

    more_vert
  • more_vert
  • The PR2 reference sequence database began as part of the BioMarks project from previous work in the Plankton Group of the Station Biologique of Roscoff. It aims to provide a reference database of carefully annotated 18S rRNA sequences using nine unique taxonomic fields (from domain to species). At present, it contains over 240,000 sequences. Although it focuses on protists, it also contains sequences from metazoa, fungi and plants as well a limited set of 16S sequences from plastids and bacteria. Several metadata fields are available for many sequences, including geo-localisation, whether it originates from a culture or a natural sample, and host type. The annotation of PR2 is performed by experts in each of the taxonomic groups.

    more_vert
  • more_vert
  • Internal Control Genes (ICG) is a wiki-based knowledgebase of internal control genes (or reference genes) for RT-qPCR normalization in a variety of species across three domains of life. Based on community curation, ICG provides curated data from a large volume of literature and provides information on internal control genes corresponding to specific experimental conditions for both model and non-model organisms.

    more_vert
  • chevron_left
  • 1
  • 2
  • 3
  • 4
  • 5
  • chevron_right
255 Data sources
  • The database is gene-centered and organized by paralog family. It focused on the paralogs and the duplication events in the evolution. The paralog families and paralogons can be searched by text or sequence, and are downloadable from the website in plain text files.

    more_vert
  • more_vert
  • DisGeNET is a discovery platform containing one of the largest collections available of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models, and the scientific literature, and covers the whole landscape of human diseases. The current version of DisGeNET (v7.0) contains 1,134,942 gene-disease associations (GDAs), between 21,671 genes and 30,170 diseases, disorders, traits, and clinical or abnormal human phenotypes, and 369,554 variant-disease associations (VDAs), between 194,515 variants and 14,155 diseases, traits, and phenotypes. The data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype-phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, a REST API, and an R package.

    more_vert
  • Virtual Chinese Genome Database (VCGDB) is a genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. We are unsure when this database was last updated, and as such we have marked this record as Uncertain. Please contact us if you have any information on its current status.

    more_vert
  • FCP is a publicly accessible web tool dedicated to analysing the current state and trends on the population of available structures along the classification schemes of enzymes and nuclear receptors, offering both graphical and quantitative data on the degree of functional coverage in that portion of the proteome by existing structures, as well as on the bias observed in the distribution of those structures among proteins.

    more_vert
  • altitude; temperature

    more_vert
  • more_vert
  • The PR2 reference sequence database began as part of the BioMarks project from previous work in the Plankton Group of the Station Biologique of Roscoff. It aims to provide a reference database of carefully annotated 18S rRNA sequences using nine unique taxonomic fields (from domain to species). At present, it contains over 240,000 sequences. Although it focuses on protists, it also contains sequences from metazoa, fungi and plants as well a limited set of 16S sequences from plastids and bacteria. Several metadata fields are available for many sequences, including geo-localisation, whether it originates from a culture or a natural sample, and host type. The annotation of PR2 is performed by experts in each of the taxonomic groups.

    more_vert
  • more_vert
  • Internal Control Genes (ICG) is a wiki-based knowledgebase of internal control genes (or reference genes) for RT-qPCR normalization in a variety of species across three domains of life. Based on community curation, ICG provides curated data from a large volume of literature and provides information on internal control genes corresponding to specific experimental conditions for both model and non-model organisms.

    more_vert
  • chevron_left
  • 1
  • 2
  • 3
  • 4
  • 5
  • chevron_right