search
  • Type
    Clear
  • Country
    Clear
  • Compatibility Level
  • Thematic
  • Jurisdiction

  • Data Repository
  • IT
  • ES

  • The Eurac Research CLARIN Centre (ERCC) is a dedicated repository for language data. It is hosted by the Institute for Applied Linguistics (IAL) at Eurac Research, a private research centre based in Bolzano, South Tyrol, Italy. The Centre is part of the Europe-wide CLARIN infrastructure, which means that it follows well-defined international standards for (meta)data and procedures and is well-embedded in the wider European Linguistics infrastructure. The repository hosts data collected at the IAL, but is also open for data deposits from external collaborators.

    more_vert
  • wFleaBase includes data from all species of the genus, yet the primary species are Daphnia pulex and Daphnia magna, because of the broad set of genomic tools that have already been developed for these animals.

    more_vert
  • DisGeNET is a discovery platform containing one of the largest collections available of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models, and the scientific literature, and covers the whole landscape of human diseases. The current version of DisGeNET (v7.0) contains 1,134,942 gene-disease associations (GDAs), between 21,671 genes and 30,170 diseases, disorders, traits, and clinical or abnormal human phenotypes, and 369,554 variant-disease associations (VDAs), between 194,515 variants and 14,155 diseases, traits, and phenotypes. The data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype-phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, a REST API, and an R package.

    more_vert
  • more_vert
  • FCP is a publicly accessible web tool dedicated to analysing the current state and trends on the population of available structures along the classification schemes of enzymes and nuclear receptors, offering both graphical and quantitative data on the degree of functional coverage in that portion of the proteome by existing structures, as well as on the bias observed in the distribution of those structures among proteins.

    more_vert
  • more_vert
  • The PR2 reference sequence database began as part of the BioMarks project from previous work in the Plankton Group of the Station Biologique of Roscoff. It aims to provide a reference database of carefully annotated 18S rRNA sequences using nine unique taxonomic fields (from domain to species). At present, it contains over 240,000 sequences. Although it focuses on protists, it also contains sequences from metazoa, fungi and plants as well a limited set of 16S sequences from plastids and bacteria. Several metadata fields are available for many sequences, including geo-localisation, whether it originates from a culture or a natural sample, and host type. The annotation of PR2 is performed by experts in each of the taxonomic groups.

    more_vert
  • MobiDB is a database of intrinsically disordered regions (IDRs) and related features from various sources and prediction tools. Different levels of reliability and different features are reported as different and independent annotations. The database features three levels of annotation: manually curated, indirect and predicted. MobiDB annotates the binding modes of disordered proteins, whether they undergo disorder-to-order transitions or remain disordered in the bound state. In addition, disordered regions undergoing liquid-liquid phase separation or post-translational modifications are defined.

    more_vert
  • more_vert
  • more_vert
  • chevron_left
  • 1
  • 2
  • 3
  • 4
  • 5
  • chevron_right
187 Data sources
  • The Eurac Research CLARIN Centre (ERCC) is a dedicated repository for language data. It is hosted by the Institute for Applied Linguistics (IAL) at Eurac Research, a private research centre based in Bolzano, South Tyrol, Italy. The Centre is part of the Europe-wide CLARIN infrastructure, which means that it follows well-defined international standards for (meta)data and procedures and is well-embedded in the wider European Linguistics infrastructure. The repository hosts data collected at the IAL, but is also open for data deposits from external collaborators.

    more_vert
  • wFleaBase includes data from all species of the genus, yet the primary species are Daphnia pulex and Daphnia magna, because of the broad set of genomic tools that have already been developed for these animals.

    more_vert
  • DisGeNET is a discovery platform containing one of the largest collections available of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models, and the scientific literature, and covers the whole landscape of human diseases. The current version of DisGeNET (v7.0) contains 1,134,942 gene-disease associations (GDAs), between 21,671 genes and 30,170 diseases, disorders, traits, and clinical or abnormal human phenotypes, and 369,554 variant-disease associations (VDAs), between 194,515 variants and 14,155 diseases, traits, and phenotypes. The data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype-phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, a REST API, and an R package.

    more_vert
  • more_vert
  • FCP is a publicly accessible web tool dedicated to analysing the current state and trends on the population of available structures along the classification schemes of enzymes and nuclear receptors, offering both graphical and quantitative data on the degree of functional coverage in that portion of the proteome by existing structures, as well as on the bias observed in the distribution of those structures among proteins.

    more_vert
  • more_vert
  • The PR2 reference sequence database began as part of the BioMarks project from previous work in the Plankton Group of the Station Biologique of Roscoff. It aims to provide a reference database of carefully annotated 18S rRNA sequences using nine unique taxonomic fields (from domain to species). At present, it contains over 240,000 sequences. Although it focuses on protists, it also contains sequences from metazoa, fungi and plants as well a limited set of 16S sequences from plastids and bacteria. Several metadata fields are available for many sequences, including geo-localisation, whether it originates from a culture or a natural sample, and host type. The annotation of PR2 is performed by experts in each of the taxonomic groups.

    more_vert
  • MobiDB is a database of intrinsically disordered regions (IDRs) and related features from various sources and prediction tools. Different levels of reliability and different features are reported as different and independent annotations. The database features three levels of annotation: manually curated, indirect and predicted. MobiDB annotates the binding modes of disordered proteins, whether they undergo disorder-to-order transitions or remain disordered in the bound state. In addition, disordered regions undergoing liquid-liquid phase separation or post-translational modifications are defined.

    more_vert
  • more_vert
  • more_vert
  • chevron_left
  • 1
  • 2
  • 3
  • 4
  • 5
  • chevron_right