bioinformaticsing: 蛋白序列分析

Finding tools/databases

The Online Bioinformatics Resources Collection (OBRC) - contains annotations and links for 2362 bioinformatics databases and software tools.
NAR Searchable database of summary papers
Bioinformatics Links Directory

Finding a sequence

Entrez Protein database at NCBI: The protein entries in the Entrez search and retrieval system have been compiled from a variety of sources, including SwissProt, PIR, PRF, PDB, and translations from annotated coding regions in GenBank and RefSeq.

NCBI Handbook:

Finding a sequence

UniProt – you can accomplish a lot here!

The mission of UniProt is to provide the scientific community with a comprehensive, high-quality and freely accessible resource of protein sequence and functional information.
In 2002, PIR, EBI and SIB, were awarded a grant from NIH to create UniProt, a single worldwide database of protein sequence and function, by unifying the PIR-PSD, Swiss-Prot, and TrEMBL databases.

UniProt help

Outreach and education resources

Protein Spotlight One month, one protein.
2Can Support Portal The bioinformatics educational resource. (tutorials)
e-Proxemis Bioinformatics learning portal for proteomics.

ExPASy

ExPASy (Expert Protein Analysis System)

The proteomics server of the Swiss Institute of Bioinformatics (SIB) is dedicated to the analysis of protein sequences and structures as well as 2-D PAGE (Disclaimer / References / Linking to ExPASy).

Databases
Tools & Software
Education & Services
Links
Announcements
Mirror Sites
Job openings

ExPASy Proteomics tools

ExPASy Proteomics tools available on a variety of topics, such as:

Other proteomics tools
DNA -> Protein
Similarity searches
Pattern and profile searches
Post-translational modification prediction
Topology prediction
Primary structure analysis
Secondary structure prediction
Tertiary structure
Sequence alignment
Gateways
Phylogenetic analysis
Biological text analysis

Other proteomics resources (info from Online Bioinformatics Resources Collection)

PEP — Predictions for Entire Proteomes

The database contains summaries of analyses of protein sequences (open reading frames) from a range of organisms representing all three major kingdoms of life: eukaryotes, prokaryotes and archaea.
All proteins publicly available for organisms were aligned against SWISS-PROT, TrEMBL and PDB.
Additionally, the following annotations are provided: secondary structure, transmembrane helices, coiled coils, regions of low complexity, signal peptides, PROSITE motifs, nuclear localization signals and classes of cellular function. Proteins that contain long regions without regular secondary structure are also identified.

ISPIDER Central – an integrated database web-server for proteomics

ISPIDER Central Proteomic Database search is an integration service offering novel search capabilities over leading, mature, proteomic repositories including PRoteomics IDEntifications database (PRIDE), PepSeeker, PeptideAtlas and the Global Proteome Machine.
It enables users to search for proteins and peptides that have been characterised in mass spectrometry-based proteomics experiments from different groups, stored in different databases, and view the collated results with specialist viewers/clients.

Motif searching, predictive methods

PROSITE - Protein families and domains.

Database of biologically significant sites, patterns and profiles that help to reliably identify to which known protein family (if any) a new sequence belongs. Currently contains patterns and profiles specific for >1000 protein families or domains.

List of PROSITE entries - Background information for each of these protein signatures is provided.

Sequence manipulation

Sequence Manipulation Suite

The Sequence Manipulation Suite is a collection of JavaScript programs for generating, formatting, and analyzing short DNA and protein sequences. It is commonly used by molecular biologists, for teaching, and for program and algorithm testing.
See the about the Sequence Manipulation Suite page for more information about individual Sequence Manipulation Suite programs.
You can easily mirror the Sequence Manipulation Suite on your own web site, or you can use it off-line.

Sequence alignments

ClustalW2 – also available from the UniProt site…

a general purpose multiple sequence alignment program for DNA or proteins. It produces biologically meaningful multiple sequence alignments of divergent sequences. It calculates the best match for the selected sequences, and lines them up so that the identities, similarities and differences can be seen. Evolutionary relationships can be seen via viewing Cladograms or Phylograms.
ClustalW@ FAQ includes information about supported sequence formats
Download Clustal to run locally
Help documentation
Multiple sequence alignment with the Clustal series of programs. Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD. Nucleic Acids Res. 2003 Jul 1;31(13):3497-500.

Sequences

Alcohol dehydrogenase [human]:http://www.uniprot.org/uniprot/P00325
30S ribosomal protein S12 [E. coli]:http://www.uniprot.org/uniprot/P0A7S3
Glutathione S-transferase [rat]:http://www.uniprot.org/uniprot/P04905

Other resources/helpful links

NCBI Mini-Course, “Making Sense of DNA and Protein Sequences”, by Medha Bhagwat and David Wheeler.

Special ExPASY features:

List of on-line experts.
Life Science Directory - The ExPASy list of Biomolecular servers.

NCBI’s Proteome Analysis tools:

Search the Conserved Domain Database using RPS-BLAST [CDD]
Search by domain architecture [CDART]
COGs - Phylogenetic classification of proteins encoded in complete genomes
Search the Entrez Protein [Protein Sequences]
Entrez Genome [Complete Genomes]
TaxPlot - 3-Way comparison of Proteomes

Other Proteomics Websites:

Protein - Functional Analysis - from EBI
Protein - Structural Analysis - from EBI
Protein Information Resources

References

Baxevanis, A.D. and Ouellette, B.F.F., eds., Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, third edition. Wiley, 2005. ISBN 0-471-47878-4
Geer, R.C., Messersmith, D.J, Alpi, K., Bhagwat, M., Chattopadhyay, A., Gaedeke, N., Lyon, J., Minie, M.E., Morris, R.C., Ohles, J.A., Osterbur, D.L. & Tennant, M.R. 2002. NCBI Advanced Workshop for Bioinformatics Information Specialists. [Online] Protein Analysis. http://www.ncbi.nlm.nih.gov/Class/NAWBIS/. [date revised July 23, 2006; date cited February 15, 2009]
Chen, YB, Chattopadhyay A., Bergen P., Gadd C and Tannery N. 2007. The online Bioinformatics resources collection at the University of Pittsburgh Health Sciences Library System - A one-stop gateway to online Bioinformatics databases and software tools. Nucleic Acids Research 2007 Database Issue, 35:D780-D785 http://www.hsls.pitt.edu/guides/genetics/obrc [date cited February 15, 2009]

bioinformaticsing

2009年2月19日星期四

蛋白序列分析

Finding tools/databases

Finding a sequence

Finding a sequence

ExPASy

ExPASy Proteomics tools

Other proteomics resources (info from Online Bioinformatics Resources Collection)

Motif searching, predictive methods

Sequence manipulation

Sequence alignments

Sequences

Other resources/helpful links

References

没有评论:

发表评论

ad

关注者

博客归档

我的简介

文子天空