1
AlphaFold, the state-of-the-art AI system developed by DeepMind, is able to computationally predict protein structures with unprecedented accuracy and speed. Working in partnership with EMBL’s European Bioinformatics Institute (EMBL-EBI), DeepMind released over 200 million protein structure predictions by AlphaFold that are freely and openly available to the global scientific community. Included are nearly all catalogued proteins known to science.
Go to Database
2
This page gives access to the digitized version of the Roche Applied Science "Biochemical Pathways" wall chart. The electronic Biochemical Pathways allows the user to search the wall charts with keywords, set focus effects, activate filtering functions and zooming in on the details and elements of interest.
Go to Database
3
BioCyc is a collection of 18,030 Pathway/Genome Databases, plus software tools for exploring them. Key aspects of BioCyc data: - Quality data curated from tens of thousands of publications, including curated databases for E. coli, B. subtilis, S. cerevisiae, and H. sapiens - Computationally predicted metabolic pathways and operons (bacteria and archaea) - Data integrated from other databases including gene essentiality, regulatory networks, protein features, and GO annotations
Go to Database
Database Tutorial
4
The Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
Go to Database
Database Tutorial
5
BRENDA (The Comprehensive Enzyme Information System) is an information system representing one of the most comprehensive enzyme repositories. It comprises molecular and biochemical information on enzymes that have been classified by the IUBMB. Every classified enzyme is characterized with respect to its catalyzed biochemical reaction. Kinetic properties of the corresponding reactants are described in detail.
Go to Database
Database Tutorial
6
Colibri provides a complete dataset of DNA and protein sequences derived from the paradigm strain E. coli K-12, linked to the relevant annotations and functional assignments. It allows one to easily browse through these data and retrieve information, using various criteria (gene names, location, keywords, etc.).
Go to Database
7
ExPASy is the SIB Bioinformatics Resource Portal which provides access to scientific databases and software tools in different areas of life sciences including proteomics, genomics, phylogeny, systems biology, population genetics, transcriptomics etc.
Go to Database
8
The Human Protein Reference Database represents a centralized platform to visually depict and integrate information pertaining to domain architecture, post-translational modifications, interaction networks and disease association for each protein in the human proteome. All the information in HPRD has been manually extracted from the literature by expert biologists who read, interpret and analyze the published data.
Go to Database
9
Tools for in silico experiments with complete genomes. The list of genomes and associated data are updated shortly after their availability at NCBI.
Go to Database
10
The Interactive Metabolic Pathways Map shows biochemical reactions of the main biological pathways in life in an overview. The "backbone" of the map is the Glycolytic Pathway followed by the TCA (Krebs) Cycle and the Respiratory Chain which together lead to the synthesis of ATP by ATP Synthase. Many biosynthetic and breakdown pathways of metabolism such as carbohydrates, amino acids, lipids are associated with this backbone and are differentiated by the use of color. Metabolites, enzymes, and selected pathways are searchable and interactive. Some 550 reactions are identified by their IUBMB Enzyme Commission (EC) numbers which are then indexed.
Go to Database
11
KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. Databases: - KEGG PATHWAY: Pathway information - KEGG GENES: Genomicinformation - KEGG LIGAND: Chemical information - KEGG BRITE: Binary relations and hierarchies etc.
Go to Database
Database Tutorial
12
MPEx is a java-based tool for exploring the topology and other features of membrane proteins by means of hydropathy plots, using thermodynamic principles developed in the Stephen Whitelaboratory of the University of California.
Go to Database
13
MPtopo is a curated database of membrane proteins with experimentally validated transmembrane (TM) segments. The database is maintained in SQL form using the MySQL server.
Go to Database
14
A current list of membrane protein structures determined by x-ray and electron diffraction with links to the Protein Data Bank and other useful sites.
Go to Database
15
Entrez is the integrated, text-based search and retrieval system used at NCBI for the major databases, including PubMed, Nucleotide and Protein Sequences, Protein Structures, Complete Genomes, Taxonomy, and others.
Go to Database
16
Pfam is a collection of protein families and domains. Pfam contains multiple protein alignments and profile-HMMs of these families. Pfam is a semi-automatic protein family database, which contains aims to be comprehensive as well as accurate. For each family in Pfam you can look at multiple alignments, view protein domain architectures, examine species distribution, follow links to other databases or view known protein structures.
Go to Database
17
Integrated protein informatics resource for genomic, proteomic and systems biology research.
Go to Database
18
Database of protein domains, families and functional sites. PROSITE consists of documentation entries describing protein domains, families and functional sites as well as associated patterns and profiles to identify them. PROSITE is complemented by ProRule, a collection of rules based on profiles and patterns, which increases the discriminatory power of profiles and patterns by providing additional information about functionally and/or structurally critical amino acids.
Go to Database
19
A Resource for Studying Biological Macromolecules. The PDB archive contains information about experimentally-determined structures of proteins, nucleic acids, and complex assemblies. As a member of the wwPDB, the RCSB PDB curates and annotates PDB data according to agreed upon standards. The RCSB PDB also provides a variety of tools and resources. Users can perform simple and advanced searches based on annotations relating to sequence, structure and function. These molecules are visualized, downloaded, and analyzed by users who range from students to specialized scientists.
Go to Database
20
Nearly all proteins have structural similarities with other proteins and, in some of these cases, share a common evolutionary origin. The SCOP database, created by manual inspection and abetted by a battery of automated methods, aims to provide a detailed and comprehensive description of the structural and evolutionary relationships between all proteins whose structure is known. As such, it provides a broad survey of all known protein folds, detailed information about the close relatives of any particular protein, and a framework for future research and classification.
Go to Database