Title: | Identification of gene fusions using paired-end sequencing |
---|---|
Description: | DNAfusion can identify gene fusions such as EML4-ALK based on paired-end sequencing results. This package was developed using position deduplicated BAM files generated with the AVENIO Oncology Analysis Software. These files are made using the AVENIO ctDNA surveillance kit and Illumina Nextseq 500 sequencing. This is a targeted hybridization NGS approach and includes ALK-specific but not EML4-specific probes. |
Authors: | Christoffer Trier Maansson [aut, cre] , Emma Roger Andersen [ctb, rev], Maiken Parm Ulhoi [dtc], Peter Meldgaard [dtc], Boe Sandahl Sorensen [rev, fnd] |
Maintainer: | Christoffer Trier Maansson <[email protected]> |
License: | GPL-3 |
Version: | 1.9.0 |
Built: | 2024-10-30 05:26:15 UTC |
Source: | https://github.com/bioc/DNAfusion |
This function identifies the basepairs following the ALK breakpoint.
ALK_sequence(reads, basepairs = 20, genome = "hg38")
ALK_sequence(reads, basepairs = 20, genome = "hg38")
reads |
|
basepairs |
|
genome |
|
If EML4-ALK is detected, returns a table
of identified
ALK basepairs with the number of corresponding reads for each sequence.
If no spanning reads in ALK is detected an empty GAlignments
object is returned.If no EML4-ALK is detected "No EML4-ALK was detected"
is returned.
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") ALK_sequence(EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), basepairs=20, genome="hg38") ALK_sequence(EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2), basepairs=20, genome="hg38")
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") ALK_sequence(EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), basepairs=20, genome="hg38") ALK_sequence(EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2), basepairs=20, genome="hg38")
This function identifies the genomic position in EML4 or ALK, where the breakpoint has happened.
break_position(reads, gene, genome = "hg38")
break_position(reads, gene, genome = "hg38")
reads |
|
gene |
|
genome |
|
If EML4-ALK is detected, it returns a table
of genomic positions
with the number of corresponding reads for each sequence.
If no spanning reads in EML4 or ALK is detected
an empty GAlignments
object is returned. If no EML4-ALK is detected
"No EML4-ALK was detected" is returned.
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") break_position(EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2),gene="EML4",genome="hg38") break_position(EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2),gene="ALK",genome="hg38") break_position(EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2),gene="EML4",genome="hg38") break_position(EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2),gene="ALK",genome="hg38")
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") break_position(EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2),gene="EML4",genome="hg38") break_position(EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2),gene="ALK",genome="hg38") break_position(EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2),gene="EML4",genome="hg38") break_position(EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2),gene="ALK",genome="hg38")
This function identifies the read depth at the basepair before the breakpoint in EML4 or ALK
break_position_depth(file, reads, gene, genome = "hg38")
break_position_depth(file, reads, gene, genome = "hg38")
file |
The name of the file which the data are to be read from. |
reads |
|
gene |
|
genome |
|
If EML4-ALK is detected a single integer
corresponding
to the read depth at the breakpoint is returned.
If no spanning reads in EML4 or ALK is detected
an empty GAlignments object is returned. If no EML4-ALK is detected
"No EML4-ALK was detected" is returned.
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") break_position_depth(file=H3122_bam, EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), gene="ALK",genome="hg38") break_position_depth(file=H3122_bam, EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), gene="EML4",genome="hg38") break_position_depth(file=HCC827_bam, EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2), gene="ALK",genome="hg38") break_position_depth(file=H3122_bam, EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), gene="EML4",genome="hg38")
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") break_position_depth(file=H3122_bam, EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), gene="ALK",genome="hg38") break_position_depth(file=H3122_bam, EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), gene="EML4",genome="hg38") break_position_depth(file=HCC827_bam, EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2), gene="ALK",genome="hg38") break_position_depth(file=H3122_bam, EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), gene="EML4",genome="hg38")
This functions collects the results from the other functions of the package.
EML4_ALK_analysis(file, genome = "hg38", mates = 2, basepairs = 20)
EML4_ALK_analysis(file, genome = "hg38", mates = 2, basepairs = 20)
file |
The name of the file which the data are to be read from. |
genome |
|
mates |
|
basepairs |
|
A list
object with
clipped_reads corresponding to EML4_ALK_detection()
,
last_EML4 corresponding to EML4_sequence()
,
first_ALK corresponding to ALK_sequence()
,
breakpoint_ALK corresponding to break_position()
, gene = "ALK",
breakpoint_EML4 corresponding to break_position()
,gene = "EML4",
read_depth_ALK corresponding to break_position_depth()
.gene = "ALK",
and read_depth_EML4 corresponding to break_position_depth()
,gene = "EML4".
If no EML4-ALK is detected an empty GAlignments
is returned.
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") EML4_ALK_analysis(file=H3122_bam, genome="hg38", mates=2, basepairs=20) EML4_ALK_analysis(file=HCC827_bam, genome="hg38", mates=2, basepairs=20)
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") EML4_ALK_analysis(file=H3122_bam, genome="hg38", mates=2, basepairs=20) EML4_ALK_analysis(file=HCC827_bam, genome="hg38", mates=2, basepairs=20)
This function identifies the genomic position in ALK and EML4 where the breakpoint has happened. This function looks for ALK-EML4 and EML4-ALK mate pair reads in the BAM file.
EML4_ALK_detection(file, genome = "hg38", mates = 2)
EML4_ALK_detection(file, genome = "hg38", mates = 2)
file |
The name of the file which the data are to be read from. |
genome |
|
mates |
|
A GAlignments
object with soft-clipped reads representing
ALK-EML4 and EML4-ALK is returned. If no ALK-EML4 or EML4-ALK is detected
the GAlignments
is empty.
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2) EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2)
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2) EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2)
This function identifies the basepairs leading up to the EML4 breakpoint.
EML4_sequence(reads, basepairs = 20, genome = "hg38")
EML4_sequence(reads, basepairs = 20, genome = "hg38")
reads |
|
basepairs |
|
genome |
|
If EML4-ALK is detected, returns a table
of identified
EML4 basepairs with the number of corresponding reads for each sequence.
If no EML4-ALK is detected "No EML4-ALK was detected" is returned.
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") EML4_sequence(EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), basepairs=20, genome="hg38") EML4_sequence(EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2), basepairs=20, genome="hg38")
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") EML4_sequence(EML4_ALK_detection(file=H3122_bam, genome="hg38", mates=2), basepairs=20, genome="hg38") EML4_sequence(EML4_ALK_detection(file=HCC827_bam, genome="hg38", mates=2), basepairs=20, genome="hg38")
This function identifies ALK-EML4 variants using the intron of the breakpoint of EML4
find_variants(file, genome = "hg38")
find_variants(file, genome = "hg38")
file |
The name of the file which the data are to be read from. |
genome |
|
A dataframe
of the ALK-EML4 variant is returned.
If no variant is detected, "No ALK-EML4 was detected" is returned.
If the variant is not classified a list
with identified introns with
breakpoints is returned.
If the breakpoint could not be identified in either of the genes a list
with identified introns with breakpoints is returned.
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") find_variants(file=H3122_bam,genome="hg38") find_variants(file=HCC827_bam,genome="hg38")
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") find_variants(file=H3122_bam,genome="hg38") find_variants(file=HCC827_bam,genome="hg38")
This function identifies the introns of ALK and EML4 where the breakpoint has happened.
introns_ALK_EML4(file, genome = "hg38")
introns_ALK_EML4(file, genome = "hg38")
file |
The name of the file which the data are to be read from. |
genome |
|
Adataframe
of the ALK- and EML4-intron of the breakpoint is returned
corresponding to the transcript ENST00000389048.8 for ALK and
ENST00000318522.10 for EML4.
If the breakpoint is not located in introns of ALK or EML4,
"Breakpoint not located in intron of ALK" or
"Breakpoint not located in intron of EML4" is returned.
If no EML4-ALK is detected “No EML4-ALK was detected” is returned.
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") introns_ALK_EML4(file=H3122_bam,genome="hg38") introns_ALK_EML4(file=HCC827_bam,genome="hg38")
H3122_bam <- system.file("extdata", "H3122_EML4.bam", package="DNAfusion") HCC827_bam <- system.file("extdata", "HCC827_EML4.bam", package="DNAfusion") introns_ALK_EML4(file=H3122_bam,genome="hg38") introns_ALK_EML4(file=HCC827_bam,genome="hg38")