SpliceProt 2.0: A sequence repository of human, mouse, and rat proteoforms

Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil. / Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Laboratório de Toxinologia. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Laboratório de Toxinologia. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil. / Universidade Federal de São Carlos. Departamento de Genética e Evolução. São Carlos, SP, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.

Abstract

SpliceProt 2.0 is a public proteogenomics database that aims to list the sequences of known proteins and potential new proteoforms in the human, mouse, and rat proteomes. This updated repository provides a broader range of computationally translated proteins and can, for example, support proteomic validation of splice variants absent from the reference UniProtKB/SwissProt database. We demonstrate the value of SpliceProt 2.0 for predicting orthologous proteins between humans and murines based on transcript reconstruction, sequence annotation, and detection at the transcriptome and proteome levels. In this release, the annotation data used to reconstruct transcripts with the ternary matrix methodology were acquired from new databases such as Ensembl, UniProt, and APPRIS. Another innovation in the pipeline is the exclusion of transcripts predicted to be susceptible to degradation through the nonsense-mediated decay (NMD) pathway. Taken together, our repository and its applications represent a valuable resource for the proteogenomics community.

Publisher

MDPI

Citation

SANTOS, Letícia Graziela Costa et al. SpliceProt 2.0: A sequence repository of human, mouse, and rat proteoforms. Int. J. Mol. Sci., v. 25, n. 1183, p. 1-24, 2024.

DOI

https://doi.org/10.3390/ijms25021183

ISSN

1422-0067

Please use this identifier to cite or link to this item: https://www.arca.fiocruz.br/handle/icict/62821

Copyright

Open access
