Skip to content. | Skip to navigation

National Cancer Institute U.S. National Institutes of Health


Personal tools

You are here: Home / Protocols / Basophile: Accurate Fragment Charge State Prediction Improves Peptide Identification Rates
Not an EDRN Protocol

Basophile: Accurate Fragment Charge State Prediction Improves Peptide Identification Rates

Wang, DongVanderbilt University Medical Center

No involved investigator sites defined.

In shotgun proteomics, database search algorithms rely on fragmentation models to predict fragment ions that should be observed for a given peptide sequence. The most widely used strategy (Naive model) is oversimplified, cleaving all peptide bonds with equal probability to produce fragments of all charges below that of the precursor ion. More accurate models, based on fragmentation simulation, are too computationally intensive for on-the-fly use in database search algorithms. We have created an ordinal-regression-based model called Basophile that takes fragment size and basic residue distribution into account when determining the charge retention during CID/higher-energy collision induced dissociation (HCD) of charged peptides. This model improves the accuracy of predictions by reducing the number of unnecessary fragments that are routinely predicted for highly-charged precursors. Basophile increased the identification rates by 26% (on average) over the Naive model, when analyzing triply-charged precursors from ion trap data. Basophile achieves simplicity and speed by solving the prediction problem with an ordinal regression equation, which can be incorporated into any database search software for shotgun proteomic identification.

1. Assemble a set of highly-confident peptide identifications as a training set. 2. Generate a ordinal logistic regression model that describes charge segregation. 3. Test the model in peptide identification by modifying predicted fragments.
Biostatistics in R environment.

There are currently no biomarkers annotated for this protocol.

Announcement 11/20/2014

New Round of EDRN FOAs

The RFAs for EDRN have been released:
- Biomarker Developmental Laboratories (U01),
- Clinical Validation Centers (U01),
- Biomarker Reference Laboratories (U24),
- Data Management and Coordinating Center (U24).

EDRN Renewal flyer NOTE-New receipt deadline for applications submitted for all EDRN FOAs is January 20, 2015, by 5:00 PM local time of applicant organization.

There will be a Pre-Application webinar to discuss each of the four individual EDRN FOAs on Tuesday, December 2nd, 2014, from 1pm-5pm (Eastern). Potential applicants interested in participating in the webinar should send a message to Dr. Sharmistha Ghosh ( no later than 5:00 p.m. (EST) November 21, 2014. Please mention the FOA of interest in the subject line.

Announcement 10/07/2014

EDRN Patient Advocates will host an EDRN Advocacy Educational Webinar, Biomarkers for Prostate Cancer Detection and Monitoring, on Monday, January 12th, 2015, at 1 p.m. EDT / 10 a.m. PDT. Registration is not required for this. Please click for more information.