The dataset used to train the current version of PSORTb (v.2.0) is available below in FASTA format. The dataset contains 1591 Gram-negative and 576 Gram-positive proteins. Each protein's record contains the NCBI GI number, name, and experimentally verified localization site on the FASTA header line.

If you make use of the PSORTb v.2.0 training dataset in your research, please cite:

J.L. Gardy, M.R. Laird, F. Chen, S. Rey, C.J. Walsh, M. Ester, and F.S.L. Brinkman (2005) PSORTb v.2.0: expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis, Bioinformatics 21(5):617-623.

PSORTdb v.2.0 Gram-negative sequences:
PSORTdb v.2.0 Gram-positive sequences:
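
Because the annotations live on the FASTA header line, a small reader is enough to pull them out once a file has been downloaded. The following is a minimal sketch in Python, not part of PSORTb itself: the function names `read_fasta` and `count_records` are hypothetical, and the sample records are invented placeholders that do not reproduce the dataset's actual header layout. The header is returned unparsed, since the exact arrangement of the GI number, name, and localization site should be checked against the downloaded files.

```python
import io
from typing import Iterator, TextIO, Tuple


def read_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from a FASTA-formatted text stream.

    The header is returned without the leading '>' and is otherwise left
    untouched; how its fields are delimited is not assumed here.
    """
    header = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence lines may be wrapped; collect and join them later.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def count_records(handle: TextIO) -> int:
    """Count the FASTA records in a text stream."""
    return sum(1 for _ in read_fasta(handle))


# Invented placeholder data, used only to show the call pattern.
sample = io.StringIO(">record1 placeholder header\nMKT\nAAL\n>record2 placeholder header\nMSS\n")
for header, sequence in read_fasta(sample):
    print(header, len(sequence))
```

Running `count_records` on each downloaded file and comparing the result with the stated totals (1591 Gram-negative and 576 Gram-positive proteins) is a quick way to confirm that a download is complete.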