Calculate sequence coverage for each identified protein.
calculate_sequence_coverage(data, protein_sequence, peptides)
a data frame containing at least the protein sequence and the identified peptides as columns.
a character column in the data
data frame that contains protein
sequences. Can be obtained by using the function fetch_uniprot()
a character column in the data
data frame that contains the identified
peptides.
A new column in the data
data frame containing the calculated sequence coverage
for each identified protein
data <- data.frame(
protein_sequence = c("abcdefghijklmnop", "abcdefghijklmnop"),
pep_stripped_sequence = c("abc", "jklmn")
)
calculate_sequence_coverage(
data,
protein_sequence = protein_sequence,
peptides = pep_stripped_sequence
)
#> protein_sequence pep_stripped_sequence coverage
#> 1 abcdefghijklmnop abc 50
#> 2 abcdefghijklmnop jklmn 50