Skip to contents

Search for published T cell receptor beta CDR3 amino acid sequences with known antigen specificity in a list of data frames.

Usage

searchDB(study_table, dbname = "all", chain = "trb")

Arguments

study_table

A tibble generated by the LymphoSeq2 functions readImmunoSeq(), productiveSeq(), searchPublished() or topSeqs(). "junction_aa", "duplicate_frequency", and "duplicate_count" are required columns.

dbname

A vector of database source to search for the antigenic specificity of a T-cell or B-cell in the dataset: "all", "IEDB", "McPAS-TCR", "VdjDB".

chain

The receptor chain type to search in the public databases: "tra", "trb", "light", "heavy".

Value

Returns the input table annotated with the any recorded antigenic specificity from the public databases.

Examples

file_path <- system.file("extdata", "TCRB_sequencing", package = "LymphoSeq2")
study_table <- LymphoSeq2::readImmunoSeq(path = file_path, threads = 1)
study_table <- LymphoSeq2::topSeqs(study_table, top = 100)
amino_table <- LymphoSeq2::productiveSeq(study_table = study_table, aggregate = "junction_aa")
top_seqs <- LymphoSeq2::topSeqs(productive_table = amino_table, top = 1)
LymphoSeq2::searchDB(study_table = top_seqs, dbname = "all", chain = "trb")
#> # A tibble: 10 × 26
#>    repertoire_id     junction_aa v_call d_call j_call v_family d_family j_family
#>    <chr>             <chr>       <chr>  <chr>  <chr>  <chr>    <chr>    <chr>   
#>  1 TRB_CD4_949       CASSESAGST… TRBV1… TRBD2… TRBJ2… TRBV10   TRBD2    TRBJ2   
#>  2 TRB_CD8_949       CASSPAGAYY… NA     TRBD2… TRBJ2… NA       TRBD2    TRBJ2   
#>  3 TRB_CD8_CMV_369   CASSPPTGER… TRBV7… TRBD2… TRBJ2… TRBV7    TRBD2    TRBJ2   
#>  4 TRB_Unsorted_0    CASSPVSNEQ… TRBV2… TRBD2… TRBJ2… TRBV28   TRBD2    TRBJ2   
#>  5 TRB_Unsorted_1320 CASSPAGAYY… NA     TRBD2… TRBJ2… NA       TRBD2    TRBJ2   
#>  6 TRB_Unsorted_1496 CASSPAGAYY… NA     TRBD2… TRBJ2… NA       TRBD2    TRBJ2   
#>  7 TRB_Unsorted_32   CASSLQGREK… TRBV2… TRBD1… TRBJ1… TRBV27   TRBD1    TRBJ1   
#>  8 TRB_Unsorted_369  CASSQDWERL… TRBV1… NA     TRBJ2… TRBV14   NA       TRBJ2   
#>  9 TRB_Unsorted_83   CASSLQGREK… TRBV2… TRBD1… TRBJ1… TRBV27   TRBD1    TRBJ1   
#> 10 TRB_Unsorted_949  CASSPAGAYY… NA     TRBD2… TRBJ2… NA       TRBD2    TRBJ2   
#> # ℹ 18 more variables: reading_frame <chr>, duplicate_count <dbl>,
#> #   duplicate_frequency <dbl>, tra_cdr3_aa <chr>, gene <chr>, epitope <chr>,
#> #   pathology <chr>, antigen <chr>, tra_v_call <chr>, tra_j_call <chr>,
#> #   mhc_allele <chr>, reference <chr>, score <dbl>, cell_type <chr>,
#> #   source <chr>, trb_v_call <chr>, trb_j_call <chr>, Species <chr>