Secretome Tool
Databases and published papers were identified providing lists of secreted proteins, transmembrane or membrane proteins, signal peptides, G-protein coupled receptors, or proteins existing in the extracellular region. Each database was then searched for human-specific proteins. Additional supporting materials are available on request.
Mapping of identifiers
Maps between Affymetrix hgu133plus2 probeset identifiers and gene symbol, Uniprot Gene Symbol, GENBANK ID, Unigene ID, PDB protein ID, Uniprot Accession, and Uniprot ID were obtained from BioMART - ENSEMBLE GENES 63, Homo sapiens genes GRCh37.p3 (http://www.biomart.org, Sept. 21, 2011), selecting two external IDs per map only - hgu133plus2 probeset and one of the above identifiers. An additional candidate map for Uniprot ID was obtained from DAVID v6.7 (http://david.abcc.ncifcrf.gov/conversion.jsp, Sept. 21, 2011). The DAVID map contained conversions for all Affymetrix 3’ arrays, so these were further narrowed down to hgu133plus2 maps only, using the hgu133plus2.db Bioconductor package (v. 2.4.5). Identifiers from each secretome source database were checked against each map, and the map containing the highest fraction of these identifiers was used. By this method, all identifiers were mapped to zero, one, or more Affymetrix hgu133plus2 probeset identifiers in one step. For each probeset identifier, a record was kept of which databases identified it, and which original identifiers mapped to it, to provide a record of how each probeset was identified. All computations were performed in the R statistical environment v. 2.12.1 (R development Core Team, 2010).
Contact Information
Steven Skates