Package 'R3CPET' reference manual

Title:	3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process
Description:	The package provides a method to infer the set of proteins that are more probably to work together to maintain chormatin interaction given a ChIA-PET experiment results.
Authors:	Djekidel MN, Yang Chen et al.
Maintainer:	Mohamed Nadhir Djekidel <[email protected]>
License:	GPL (>=2)
Version:	1.39.0
Built:	2025-03-31 07:17:37 UTC
Source:	https://github.com/bioc/R3CPET

3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process

Description

The main goal of 3CPET is to try to infer the set of protein networks that are likely to work together inorder to maintain chromatin loops obtained by a ChIA-PET experiment. It is based on an idea silimar to the one used for document classification. It starts first by building a PPI network for each chromatin interaction, then uses an HDLA (Hierarchical Dirichlet Latent Allocation) model to infer the set of networks that are enriched together.

Details

Package:	R3CPET
Type:	Package
Version:	1.0
Date:	2013-11-23
License:	GPL (>= 3.0)

Author(s)

Written by M.N.Djekidel Maintainer: Mohamed Nadhir Djekidel <[email protected]>

References

M.N Djekidel et al,3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, in press, 2015

Add the gene expression attribute to the graph nodes

Description

This method is a kinda of helper method, it helps the user to add for each node in the inferred chromatin maintainer network the RPKM attributes. It is useful if the user want to save the networks as ".gml" files and visualize them using software such as Gephi or Cytoscape. Or maybe if he wants to know which networks are highly expressed then others.

Usage

## S4 method for signature 'ChromMaintainers,data.frame'
annotateExpression(object, RPKMS)
## S4 method for signature 'ChromMaintainers,data.frame'
annotateExpression(object, RPKMS)

Arguments

`object`	a `ChromMaintainers` object in which the networks are already present
`RPKMS`	a two columns `data.frame`, where the first column contains the name of the gene and the second contains the expression values

Value

A ChromMaintainers object in which the networks are annotated.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples


  ## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")    

## Not run: 
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  x
  
## build the different indexes
x <- createIndexes(x)

## build networks connecting each interacting regions
nets<- buildNetworks(x)

## infer the networks
hlda<- InferNetworks(nets)

networks(hlda)

## Annotate networks
hlda<- annotateExpression(hlda,as.data.frame(RPKMS))

## Notice the addition of the RPKM attribute to each network
networks(hlda)

## End(Not run)
## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")    

## Not run: 
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  x
  
## build the different indexes
x <- createIndexes(x)

## build networks connecting each interacting regions
nets<- buildNetworks(x)

## infer the networks
hlda<- InferNetworks(nets)

networks(hlda)

## Annotate networks
hlda<- annotateExpression(hlda,as.data.frame(RPKMS))

## Notice the addition of the RPKM attribute to each network
networks(hlda)

## End(Not run)

Biogrid Network

Description

loads an igraph object that contains the Biogrid V 2.0.49 PPI .

Usage

data(Biogrid)data(Biogrid)

Value

an igraph named PPI.Biogrid.

Source

http://thebiogrid.org/

Examples

data(Biogrid)
PPI.Biogrid
data(Biogrid)
PPI.Biogrid

Building interaction networks connecting interacting regions

Description

This methods uses the background PPI to try to build an interaction network that connects each interacting regions. If a regionA interacts with a regionB and if $TF_A$ is the list of TF in regionA and $TF_B$ is the list of TF in regionB, than we use the loaded PPI as a background network to connect each TF from $TF_A$ to each TF in $TF_B$ .

We suppose that a minimum number of physical interactions (minimum energy) are needed to connection each TF to the other. Thus, we take the shortest path in the PPI. at this stage, each network is a collection of edges.

Usage

## S4 method for signature 'ChiapetExperimentData'
buildNetworks(object, minFreq = 0.25, maxFreq = 0.75)
## S4 method for signature 'ChiapetExperimentData'
buildNetworks(object, minFreq = 0.25, maxFreq = 0.75)

Arguments

`object`	a `ChiapetExperimentData` object in which the interactions and TFBS and PPI are already loaded. Check `loadPETs`, `loadTFBS`, `loadPPI` for more info.
`minFreq`	After constructing the networks for all the interacting regions all edges that appear in less than `minFreq` of the networks are considered to be outliers.
`maxFreq`	After constructing the networks for all the interacting regions all edges that appear in more than `maxFreq` of the networks are considered to be interactions involving general TF and are removed.

Value

A NetworkCollection object that contain the list of all the constructed networks and their sizes.

NOTE: interactions for which no TF was bound or no networks could be constructed or which was empty after filtering will not be considered.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples

    
  ## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
  
   ## Not run:   
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)  

  ## build networks connecting each interacting regions
  nets<- buildNetworks(x)
  nets

## End(Not run)
## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
  
   ## Not run:   
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)  

  ## build networks connecting each interacting regions
  nets<- buildNetworks(x)
  nets

## End(Not run)

3CPET used raw data

Description

The ChiapetExperimentData class is a container for storing the set of raw data used by 3CPET to do the prediction.

Usage

ChiapetExperimentData(pet='', tfbs='', ppi=NULL,  
                      ## loadPETs options
                      IsBed=TRUE, petHasHeader=FALSE, dist=1000,
                      ## loadTFBS options
                      tfbsHasHeader=FALSE,
                      ## loadPPI options
                      ppiType=c("HPRD","Biogid"),
                      filter=FALSE, term="GO:0005634", annot=NULL,
                      RPKM= NULL, threshold=1
                      )
ChiapetExperimentData(pet='', tfbs='', ppi=NULL,  
                      ## loadPETs options
                      IsBed=TRUE, petHasHeader=FALSE, dist=1000,
                      ## loadTFBS options
                      tfbsHasHeader=FALSE,
                      ## loadPPI options
                      ppiType=c("HPRD","Biogid"),
                      filter=FALSE, term="GO:0005634", annot=NULL,
                      RPKM= NULL, threshold=1
                      )

Arguments

`pet`	(optional) a ChIAP-PET interactions file path or a `GRanges` object. The `GRanges` object should have a column named `PET_ID`. details on the file format can be found on the `loadPETs` help page.
`tfbs`	(optional) a file path to a BED file containing transcription factors binding site or a `GRanges` object. The `GRanges` object should have a metadata column named `TF`.
`ppi`	(optional) an `igraph` object that contains a user defined protein-protein interaction network. if this parameter is not specified, the `ppiType` paramter will be used.
`IsBed`	(optional) considered only if the `pet` parameter is a file path. More info about this paramters can be found in the `loadPETs` help page.
`petHasHeader`	(optional) `logical`. Indicates if the ChIA-PET interactions file has a header or not.
`dist`	(optional) The size of the region to consider arround the center of the interacting regions.
`tfbsHasHeader`	(optional) `logical`. Indicates if the TFBS file has a header or not.
`ppiType`	(optional) considered only if the `ppi` paramter is not specified. This paramter tell the pakage to load one of the PPI (`HPRD` or `Biogrid`) shiped with the package.
`filter`	(optional) `logical`. whether of not to filter the PPI network. if the `RPKM` paramter is specified then the RPKM dataset incorporated with the package will be used. if you want to to your own way of filtering, you ca set `filter = FALSE` and pass an already processed PPI to the `ppi` paramterer.
`term`	(optional) the GO term used to filter the nodes of the PPI. this is different from the `filter` parameter. in the `filter` parameter the PPI nodes are filtered by their gene expression, while in the `term` parameter they are filtered by their genomic location. by default `"GO:0005634"` is used for filtering.
`annot`	(optional) the annotation dataset used for filtering by default the `geneLocations` is used. The user can pass a custom `data.frame`. For more details check the `loadPPI` help page.
`RPKM`	(optional) a `data.frame` object that contains the expression value of each genes. by default the `RPKMS` dataset will be used (expression value in K562 celline). For more information about the format of the data passed to this parameter please check the `loadPPI`
`threshold`	(optional) threshold value used to filter gene expression. default: 1.

Details

The ChiapetExperimentData class stores the genomic coordinates of the ChIA-PET interactions, the binding sites of the different transcription factor (TFBS) and the background protein-protein interaction (PPI) network used to infer the final chromatin maintainer networks.

Value

Constructs a ChiapetExperimentData object with the specified fields populated.

Slots

pet: : Object of class GRanges that stores the genomic coordinated of the interactions. it can be populated using the method loadPETs
tfbs: : Object of class GRanges that stores the TF binding site. it can be populated using the method loadTFBS. NOTE: the TFBS locations can be obtained from a ChIP-Seq experiment or a motif finding software. for more information on the format of the provided data check loadTFBS
ppi: Object of class "igraph" used as the background PPI for further analysis. it can be populated using the method loadPPI
.dt: Object of class "list" contains a collection of data.table serving as indexes used internally by the package (not expected to be manipulated by the user). it can be populated using the method createIndexes

Accessors

The following methods can be used to get the content of a ChiapetExperimentData object x :

pet(x), pet(x) <- value: Get ChIA-PET interactions encoded as a GRanges object in x. The returned GRanges objects contains an attribute PET_ID in which the left side have an id of the form PET#\d+\.1 and the right side interaction have an id of the form PET#\d+\.2. for more information check loadPETs

          seqnames             ranges strand |      PET_ID
             <Rle>          <IRanges>  <Rle> | <character>
      [1]     chr1 [1240734, 1242734]      * |     PET#1.1
      [2]     chr1 [1242224, 1244224]      * |     PET#1.2
      [3]     chr1 [1282208, 1284208]      * |     PET#2.1
      [4]     chr1 [1283334, 1285334]      * |     PET#2.2
      [5]     chr1 [1370371, 1372371]      * |     PET#3.1
      [6]     chr1 [1371822, 1373822]      * |     PET#3.2

tfbs(x), tfbs(x) <- value: Get the GRanges storing the transcription factor binding sites.

ppi(x), ppi(x) <- value: Returns an igraph object used as a background PPI. check the loadPPI for more information.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Li G, Fullwood MJ, Xu H et al.ChIA-PET tool for comprehensive chromatin interaction analysis with paired-end tag sequencing. Genome Biology 2010, 11(2):R22

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples

## for example Reading ChIA-PET interaction results generated from ChIA-PET tool
## it should be formatted as follow:

## -------------------------------------------------------------------------------------------
## chromleft startleft endleft chromright startright endright counts      pvalue      qvalue
##      chr1    872113  879175       chr1     933836   938416     12 1.84529e-30 6.90983e-28
##      chr1    874165  879175       chr1     933340   938306     10 1.23139e-25 3.58932e-23
##      chr1    889676  896594       chr1     933897   938982     13 4.91311e-36 2.33753e-33
##      chr1    898753  907581       chr1     931133   939571     19 0.00000e+00 0.00000e+00
##      chr1    910103  918775       chr1     930834   938627     15 2.20004e-43 1.32812e-40
##      chr1    919314  922154       chr1     934212   937864      6 3.70292e-21 7.88551e-19  
##---------------------------------------------------------------------------------------------

## The counts, pvalue and qvalue fields are not considered in our case 
## it is up to the user to filter the interactions.

## The TFBS should be a BED file that contain the chromosome, start, end and the TF name 

## Not run: 

  ## load the different datasets
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
  
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  x
  
  ## Pass objects instead of files.
  pet <- sample(pet(x),size = 20,replace = TRUE )
  tfbs <- sample(tfbs(x), size=300, replace=TRUE)
  ppi <- ppi(x)
  tst <- ChiapetExperimentData(pet = pet, tfbs= tfbs, ppi=ppi) 
  tst <- createIndexes(tst)
  tst

## End(Not run)
## for example Reading ChIA-PET interaction results generated from ChIA-PET tool
## it should be formatted as follow:

## -------------------------------------------------------------------------------------------
## chromleft startleft endleft chromright startright endright counts      pvalue      qvalue
##      chr1    872113  879175       chr1     933836   938416     12 1.84529e-30 6.90983e-28
##      chr1    874165  879175       chr1     933340   938306     10 1.23139e-25 3.58932e-23
##      chr1    889676  896594       chr1     933897   938982     13 4.91311e-36 2.33753e-33
##      chr1    898753  907581       chr1     931133   939571     19 0.00000e+00 0.00000e+00
##      chr1    910103  918775       chr1     930834   938627     15 2.20004e-43 1.32812e-40
##      chr1    919314  922154       chr1     934212   937864      6 3.70292e-21 7.88551e-19  
##---------------------------------------------------------------------------------------------

## The counts, pvalue and qvalue fields are not considered in our case 
## it is up to the user to filter the interactions.

## The TFBS should be a BED file that contain the chromosome, start, end and the TF name 

## Not run: 

  ## load the different datasets
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
  
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  x
  
  ## Pass objects instead of files.
  pet <- sample(pet(x),size = 20,replace = TRUE )
  tfbs <- sample(tfbs(x), size=300, replace=TRUE)
  ppi <- ppi(x)
  tst <- ChiapetExperimentData(pet = pet, tfbs= tfbs, ppi=ppi) 
  tst <- createIndexes(tst)
  tst

## End(Not run)

Chomatin maintainer networks

Description

The ChromMaintainers holds information about the inferred network by the method InferNetworks. It contains the list of inferred networks as igraph object, a list of edges and a list of proteins. In addition to an HLDAResult object that contains the final probabilities calculated by the HLDA algorithm.

Usage

 ChromMaintainers( maintainers,topEdges,topNodes, clusRes = NULL, networks = list())
ChromMaintainers( maintainers,topEdges,topNodes, clusRes = NULL, networks = list())

Arguments

`maintainers`	Object of class `"HLDAResult"` that contains the HLDA results.
`topEdges`	a `"matrix"` containing the list the top edges per network.
`topNodes`	a `"matrix"` containing the list the top proteins per network.
`networks`	the list of infered networks as an `igraph` objects list
`clusRes`	Object of class `"cluesOrSota"` describing the assignment of each DNA-interaction to a chromatin-maintainer network according to their enrichment.

Value

a ChromMaintainers object.

Accesors

if x is a ChromMaintainers object the following accessors can be applied :

networks(x): gets the list of networks as igraph objects
topNodes(x): gets a matrix object that contains the list of top proteins per network
topEdges(x): gets a matrix object that contains the list of top proteins per network
getClusters(x): returns the clustering results of DNA-interaction into groups according to their partnership enrichment profile to the set of inferred chromatin maintainer networks.

Methods

Many plotting and annotation methods are associated with this class.

annotateExpression(object, RPKMS): To add the gene expression attribute to the igraph objects
clusterInteractions(object, method, nbClus ): To cluster the DNA-interactions according to their partnership enrichment profile.
GenerateGmlNetworks(object,...): Creates the list of igraph object from the topEdges matrix
outputGenesPerClusterToDir(hdaRes,data,path, ...): get the list genes belonging to each DNA-interaction cluster.
getRegionsIncluster(hdaRes,data, cluster, ...): returns the coordinates of the DNA interactions for a given cluster.
GOEnrich.networks(object, pval=0.05, GOlimit= 5,path=""): do a GO enrichment of the elements of each inferred network.
plot3CPETRes(object, path, W, H, type,byEdge, netPerRow, layoutfct, ...): provide different type of plots to visualize the results
visualizeCircos(object, data, cluster, chrLenghts): Draws a circos plot of the DNA interactions in a given cluster.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

https://www.cs.princeton.edu/~blei/topicmodeling.html (C. Wang's hdp code)

Chong Wang, John Paisley and David M. Blei, Online variational inference for the hierarchical Dirichlet process .In AISTATS 2011

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples

showClass("ChromMaintainers")
showClass("ChromMaintainers")

Human chromosom lenghts

Description

This dataset contains the human chromosoms lengths

Examples

data(chromosoms)
Chromosoms
data(chromosoms)
Chromosoms

Wrapper for `clues` and `sota` S3 classes

Description

This is an S4 virtual union class that defines a new object that can be a sota or a clues class. Now that the clues method is deprecated, only the sota method is supported.

Definition

setClassUnion("cluesOrSota", c("sota","NULL"))

Objects from the Class

A virtual Class: No objects may be created from it.

Methods

No methods defined with class "cluesOrSota" in the signature.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Herrero, J., Valencia, A, and Dopazo, J. (2005).A hierarchical unsupervised growing neural network for clustering gene expression patterns. Bioinformatics, 17, 126-136.

Examples

showClass("cluesOrSota")
showClass("cluesOrSota")

Grouping DNA interactions by enrichment profile

Description

This method aims at clustering the DNA interactions according to their partnership probability to the inferred chromatin maintainer networks.

Usage

## S4 method for signature 'ChromMaintainers'
clusterInteractions(object, method="sota", nbClus=20 )
## S4 method for signature 'ChromMaintainers'
clusterInteractions(object, method="sota", nbClus=20 )

Arguments

`object`	(Required) a non-empty `ChromMaintainers` object
`method`	(optional)used to specify the method to use. Only the `method = "sota"` is supported for the moment, the `method='clues'` is deprecated. The user needs to specify the number of clusters by setting the parameter `nbClus`, by default it is set to 20.
`nbClus`	(optional) The user-specified number of clusters. It is taken into consideration only if `method = sota`.

Value

A ChromMaintainers object in which the clusRes is populated as a sota.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Herrero, J., Valencia, A, and Dopazo, J. (2005). A hierarchical unsupervised growing neural network for clustering gene expression patterns. Bioinformatics, 17, 126-136.

Examples


 data(RPKMS)

  ## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
## Not run: 
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  x

  ## build networks connecting each interacting regions
  nets<- buildNetworks(x)

  ## infer the networks
  hlda<- InferNetworks(nets)

  #cluster
  hlda<- clusterInteractions(hlda)

  #Display heatmap
  plot3CPETRes(hlda,type="heatmap")
  hlda

## End(Not run)
data(RPKMS)

  ## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
## Not run: 
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  x

  ## build networks connecting each interacting regions
  nets<- buildNetworks(x)

  ## infer the networks
  hlda<- InferNetworks(nets)

  #cluster
  hlda<- clusterInteractions(hlda)

  #Display heatmap
  plot3CPETRes(hlda,type="heatmap")
  hlda

## End(Not run)

Create centered interactions

Description

This helper method can be used to create a "bed" file in which the coordinates of the regions are the centre of the interactions in the raw data. in R3CPET we suppose that the centre of the interactions are the most enriched when doing read mapping ,thus, we consider just the region around the centre to detect the TFBS.

Usage

## S4 method for signature 'character'
CreateCenteredBED(file, header=TRUE,dist=1000)
## S4 method for signature 'character'
CreateCenteredBED(file, header=TRUE,dist=1000)

Arguments

`file`	a `character` indicating the location of the rawdata file. the file should be a six column "bed" file in which the fist 3 columns indicate the left side interaction (chr, start, stop) and the other 3 columns indicate the right side interaction.
`header`	`logical`, indicates if the first line in the file is a header.
`dist`	`numeric`, indicated the distance around the center of the region to take.

Value

A 4 columns data.frame object, in which the first 3 columns indicate the location of the region and the 4th on indicate its name. The names are of the format PET#\w+.1 for the left side regions and PET#\w+.2 for the right side ones.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples


## get interactions file location
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")

res <- CreateCenteredBED(petFile, header=TRUE, dist=1000)
head(res)
## get interactions file location
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")

res <- CreateCenteredBED(petFile, header=TRUE, dist=1000)
head(res)

Preparing TF indexes per region

Description

After loading the interactions and the TFBS, the createIndexes method can be used to build indexes for fast look-up for which which TF are located in which region. This method is an intermediate step needed for further analysis.

Usage

## S4 method for signature 'ChiapetExperimentData'
createIndexes(object, minOverlap = 50)
## S4 method for signature 'ChiapetExperimentData'
createIndexes(object, minOverlap = 50)

Arguments

`object`	a `ChiapetExperimentData` object in which the interactions and TFBS are already loaded. Check `loadPETs` and `loadTFBS` for more info.
`minOverlap`	The minimum overlap between a TF binding site and a region, to consider a TF as binding to that region. The default value is `50`.

Value

A ChiapetExperimentData object in which the .dt slot is populated as a data.table object.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples


  ## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

## Not run: 
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  x

## End(Not run)
## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

## Not run: 
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  x

## End(Not run)

Explore results in a web browser

Description

To facilitate the interaction of the user with the package, we added an additional web interface using the shiny package. The user can check some basic statistics about the row data and he can explore the results and generate some graphs.

Usage

## S4 method for signature 
## 'ChiapetExperimentData,NetworkCollection,ChromMaintainers'
createServer(x,nets,hlda)
## S4 method for signature 
## 'ChiapetExperimentData,NetworkCollection,ChromMaintainers'
createServer(x,nets,hlda)

Arguments

`x`	a `ChiapetExperimentData` object in which the interactions, TFBS and the index tables are already created.
`nets`	a `NetworkCollection` object containing the list of the used TF and the initial interaction networks.
`hlda`	a `ChromMaintainers` object in which the results are already calculated.

Value

A webpage is opened.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples


  ## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
  
## Not run: 
  data(RPKMS)
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)

  ## build networks connecting each interacting regions
  nets<- buildNetworks(x)

  ## infer the networks and do the clustering 
  hlda<- InferNetworks(nets)
  hlda<- clusterInteractions(hlda)

  ## Run the server
  createServer(x, nets, hlda)


## End(Not run)
## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
  
## Not run: 
  data(RPKMS)
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)

  ## build networks connecting each interacting regions
  nets<- buildNetworks(x)

  ## infer the networks and do the clustering 
  hlda<- InferNetworks(nets)
  hlda<- clusterInteractions(hlda)

  ## Run the server
  createServer(x, nets, hlda)


## End(Not run)

Ensemble to HGNC conversion

Description

This helper method uses the biomaRt package to convert Ensembl ids to HGNC ids.

Usage

EnsemblToHGNC(EnsemblIDs)
EnsemblToHGNC(EnsemblIDs)

Arguments

EnsemblIDs

a character vector with Ensembl IDs.

Value

returns a data.frame containing the Ensembl ID and his corresponding HGNC gene id and Name plus a description of the gene.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples

## Not run: 
EnsemblIDs<-c("ENSG00000164548","ENSG00000118515","ENSG00000105705",
        "ENSG00000177414","ENSG00000108179")

EnsemblToHGNC(EnsemblIDs)

## End(Not run)
## Not run: 
EnsemblIDs<-c("ENSG00000164548","ENSG00000118515","ENSG00000105705",
        "ENSG00000177414","ENSG00000108179")

EnsemblToHGNC(EnsemblIDs)

## End(Not run)

Entrez to HGNC conversion

Description

This helper method uses the biomaRt package to convert Entrez ids to HGNC icS.

Usage

EntrezToHGNC(EntrezID)
EntrezToHGNC(EntrezID)

Arguments

EntrezID

a character vector with Entrez IDs.

Value

returns a data.frame containing the Entrez ID and his corresponding HGNC gene id and Name plus a description of the gene.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples

## Not run: 
EntrezID <-c("2114","9757","5886","9373","6921",
        "4088","7006","6196","10054","10945")

EntrezToHGNC(EntrezID)

## End(Not run)
## Not run: 
EntrezID <-c("2114","9757","5886","9373","6921",
        "4088","7006","6196","10054","10945")

EntrezToHGNC(EntrezID)

## End(Not run)

Nucleus located genes

Description

This dataset contains a data.frame containing the set of genes that are located in the nucleus "GO:0005634".

Usage

data(geneLocations)data(geneLocations)

Value

data.frame containing genes located at the nucleus.

Examples

data(geneLocations)
head(geneLocations.nucleus)
data(geneLocations)
head(geneLocations.nucleus)

Generate a list of `igraph` networks

Description

This methods converts the networks slot of a ChromMaintainers object, it reads the topEdge slot and convert it into a list of igraph objectts.

Usage

## S4 method for signature 'ChromMaintainers'
GenerateNetworks(object,...)
## S4 method for signature 'ChromMaintainers'
GenerateNetworks(object,...)

Arguments

`object`	a `ChromMaintainers` object
`...`	future options not considered for the moment.

Value

Returns ChromMaintainers object in which the networks slot is populated.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples


  ## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
## Not run: 
  data(RPKMS)
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  ## build networks connecting each interacting regions
  nets<- buildNetworks(x)

  ## infer the networks
  hlda<- InferNetworks(nets)

  hlda <- GenerateNetworks(hlda)
  networks(hlda)

## End(Not run)
## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
## Not run: 
  data(RPKMS)
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## build the diffrent indexes
  x <- createIndexes(x)
  ## build networks connecting each interacting regions
  nets<- buildNetworks(x)

  ## infer the networks
  hlda<- InferNetworks(nets)

  hlda <- GenerateNetworks(hlda)
  networks(hlda)

## End(Not run)

list of interactions per cluster

Description

This method can be used to retrieve the genomic coordinated of the DNA-interactions in each cluster.

Usage

## S4 method for signature 'ChromMaintainers,ChiapetExperimentData,numeric'
getRegionsIncluster(hdaRes,data, cluster=1, ...)
## S4 method for signature 'ChromMaintainers,ChiapetExperimentData,numeric'
getRegionsIncluster(hdaRes,data, cluster=1, ...)

Arguments

`hdaRes`	a `ChromMaintainers` object in which the clusters are already calculated
`data`	a `ChiapetExperimentData` object that contains the genomic location of all the interactions.
`cluster`	The ID of the cluster for which we want to get the list of the involved regions.
`...`	additional parameters not used for the moment.

Value

a GRanges object is returned

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples

    ## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")
## Not run: 
    data(RPKMS)
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE)
    ## build the different indexes
    x <- createIndexes(x)
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks and do the clustering
    hlda<- InferNetworks(nets)
    hlda<- clusterInteractions(hlda)

    ## return the DNA-interactions in cluster 3
    getRegionsIncluster(hlda,x,cluster=3)


## End(Not run)
## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")
## Not run: 
    data(RPKMS)
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE)
    ## build the different indexes
    x <- createIndexes(x)
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks and do the clustering
    hlda<- InferNetworks(nets)
    hlda<- clusterInteractions(hlda)

    ## return the DNA-interactions in cluster 3
    getRegionsIncluster(hlda,x,cluster=3)


## End(Not run)

list of interactions per network

Description

This method can be used to retrieve the genomic coordinated of the DNA-interactions enriched for each network given a certain threshold.

Usage

## S4 method for signature 'ChromMaintainers,ChiapetExperimentData,numeric'
getRegionsInNetwork(hdaRes,data, net=1, thr=0.5, ...)
## S4 method for signature 'ChromMaintainers,ChiapetExperimentData,numeric'
getRegionsInNetwork(hdaRes,data, net=1, thr=0.5, ...)

Arguments

`hdaRes`	a `ChromMaintainers` object which already contains the calculated results
`data`	a `ChiapetExperimentData` object that contains the genomic location of all the interactions.
`net`	The ID of the network for which we want to get the list of the involved regions.
`thr`	Specify the minimum enrichment value to consider an interaction to be controlled by the network. by default it is 0.5
`...`	additional parameters not used for the moment.

Value

a GRanges object is returned

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples

    ## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

## Not run: 
    data(RPKMS)    
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE)
    ## build the different indexes
    x <- createIndexes(x)

    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks and do the clustering
    hlda<- InferNetworks(nets)

    ## return the DNA-interactions in cluster 3
    getRegionsIncluster(hlda,x,net=3)


## End(Not run)
## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

## Not run: 
    data(RPKMS)    
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE)
    ## build the different indexes
    x <- createIndexes(x)

    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks and do the clustering
    hlda<- InferNetworks(nets)

    ## return the DNA-interactions in cluster 3
    getRegionsIncluster(hlda,x,net=3)


## End(Not run)

GO enrichment methods

Description

This helper methods can be called to do GO enrichment by using the DAVID web service.

GOEnrich.networks can be used to do a GO enrichment of the chromatin maintainer networks.

GOEnrich.folder can be called to do a GO enrichment on the gene-list files generated by the method outputGenesPerClusterToDir.

There is a 5 secs delay between each request to not avoid being rejected by the server.

Usage

## S4 method for signature 'character'
GOEnrich.folder(folder, fdr=0.05,GOlimit=20)

## S4 method for signature 'ChromMaintainers'
GOEnrich.networks(object, fdr=0.05, GOlimit= 5,path="")
## S4 method for signature 'character'
GOEnrich.folder(folder, fdr=0.05,GOlimit=20)

## S4 method for signature 'ChromMaintainers'
GOEnrich.networks(object, fdr=0.05, GOlimit= 5,path="")

Arguments

`folder`	name of the folder that contains the gene-list files. The files are supposed to have a `.txt` extension. The first column of each file is supposed to contain the genes EntrezID.
`object`	a `"ChromMaintainers"` objects with the `topNodes` already calculated.
`fdr`	cut-off value GO terms with fdr value <= fdr will be considered. Benjamini-Hochberg FDR is used.
`GOlimit`	the number of top GO terms to return.
`path`	the path where to store the generated plot (pdf file). if not specified the plot will be displayed.

Value

Returns a list of data.frame that contain the GO results for each file (or network).

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

http://david.abcc.ncifcrf.gov/ (DAVID website)

Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nature Protoc. 2009;4(1):44-57.

Examples

    ## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

 ## Not run: 

    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)
  
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)

    ## Get the list of genes in each cluster by default 
    ## a folder ClustersGenes will be created
    outputGenesPerClusterToDir(hlda,x)

    ## GO enrichment 
    GOEnrich.folder(folder="ClustersGenes/")


## End(Not run)
## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

 ## Not run: 

    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)
  
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)

    ## Get the list of genes in each cluster by default 
    ## a folder ClustersGenes will be created
    outputGenesPerClusterToDir(hlda,x)

    ## GO enrichment 
    GOEnrich.folder(folder="ClustersGenes/")


## End(Not run)

Class `"HLDAResult"`

Description

This class is a container for the results generated by the HLDA algorithm

Usage

 HLDAResult(docPerTopic,wordsPerTopic ,betas)
HLDAResult(docPerTopic,wordsPerTopic ,betas)

Arguments

`docPerTopic`	Object of class `"matrix"` describing the partnership of each document in a topic. In our case a documents is the protein interaction networks associated with each DNA interaction and the topics are the inferred chromatin maintainer networks.
`wordsPerTopic`	Object of class `"matrix"` describing the partnership of each word to a topic. In our case a word is an edge and the topics are the chromatin maintainer networks.
`betas`	Object of class `"numeric"` the beta values for each network. Basically, it shows how popular is the network.

Value

an HLDAResult object.

Accessors

For a given HLDAResult object the following accesor functions can be used:

docPerTopic(x): gets the content of the docPerTopic matrix.
wordsPerTopic(x): gets the content of the wordsPerTopic matrix.
betas(x): gets the betas values.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Chong Wang, John Paisley and David M. Blei, Online variational inference for the hierarchical Dirichlet process .In AISTATS 2011

Mohamed Nadhir D, Yang C et al, 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples

showClass("HLDAResult")
showClass("HLDAResult")

HPRD protein interaction Network

Description

loads an igraph object that contains the HRPD PPI release 9.

Usage

data(HPRD)data(HPRD)

Value

an igraph object named PPI.HPRD.

Source

http://hprd.org/

Examples

data(HPRD)
PPI.HPRD
data(HPRD)
PPI.HPRD

Network construction using Hierarchical Dirichlet Process

Description

This methods applies a Hierarchical Dirichlet Process (HDP) algorithm on the collection of proteins networks to infer the set of chromatin loop-maintainer proteins. HDP are non-parametric Bayesian models widely used in document classification as it enables us to model datasets with a mixtures of classes. In our case, we suppose that different kinds of networks are involved in maintaining the different loops. Thus, to make an analogy with topic modeling, we each DNA-interaction maintaining protein network as a document and each edge in this network as word. Thus, the task is to say which word (edge) belongs to which topic (chromatin-maintainer family). The method implementation is based on the C++ code of Chong Wang and David Blei with adaptation to Rcpp and removal of the dependency on the Gnu Scientific Library.

Usage

## S4 method for signature 'NetworkCollection'
InferNetworks(object,thr =0.5,max_iter = 500L, max_time = 3600L, ...)
## S4 method for signature 'NetworkCollection'
InferNetworks(object,thr =0.5,max_iter = 500L, max_time = 3600L, ...)

Arguments

`object`	a `NetworkCollection` object in which the list of protein interactions associated with each DNA interaction is already populated.
`thr`	Used to select the top protein interaction in each inferred chromatin-maintainer family. In HDP each topic (Chromatin-maintainer family) is considered as a distribution over words (edges), thus, for each topic we consider the words that capture `threshold` percent of the topic to be the top words. For example, in topic1, we first rank edges by partnership probability to topic1 in a decreasing order, and we take the top edges that capture 50% of the partnership.
`max_iter`	maximum number of iterations (befault 500).
`max_time`	maximum runing time (3600 sec).
`...`	You can pass additional paramters to control the behaviour of the HDP model. The possible paramters are eta, alpha and gamma. eta controls how edges are assigned to CMNs on the global level. smaller eta values will lead to sparce edge-to-CMN assignment, which eta >1 leads to more uniform assignments. gamma on the other hand, controls the number of CMNs, smaller gamma values will produce a small number of CMNs and gamma>1 will favor the generation of more. alpha controls the sparcity at the local PPI. smaller alpha value force edges to be conrolled by a small number of CMNs, while lagrger values leads to more uniform distribution. By default eta = 0.01, gamma =1 and alpha =1.

Value

Returns a ChromMaintainers object that contains the list of inferred networks and the probability of each edge in each network.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

https://www.cs.princeton.edu/~blei/topicmodeling.html (C. Wang's hdp code)

Chong Wang, John Paisley and David M. Blei, Online variational inference for the hierarchical Dirichlet process .In AISTATS 2011

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples

    ## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
## Not run: 
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)
    hlda

## End(Not run)
## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
## Not run: 
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)
    hlda

## End(Not run)

Parsing ChIA-PET interaction data

Description

This method loads the ChIA-PET interactions data into a GRanges object from a file generated by ChIA-PET tool or an already formatted BED file.

Usage

## S4 method for signature 'ChiapetExperimentData,character'
loadPETs(object, petFile, IsBed=TRUE, header=TRUE, dist=1000)
## S4 method for signature 'ChiapetExperimentData,character'
loadPETs(object, petFile, IsBed=TRUE, header=TRUE, dist=1000)

Arguments

`object`	(Required) a `ChiapetExperimentData` object
`petFile`	(Required) path to the file to parse. two types of formats are accepted. If the file comes from the ChIA-PET tool or has the same format used in the ENCODE project chromleft startleft endleft chromright startright endright chr1 872113 879175 chr1 933836 938416 chr1 874165 879175 chr1 933340 938306 chr1 889676 896594 chr1 933897 938982 chr1 898753 907581 chr1 931133 939571 chr1 910103 918775 chr1 930834 938627 chr1 919314 922154 chr1 934212 937864 only the the coordinates of the left and the right regions are considered (which means the first 6 columns), The additional columns are just ignored. We suppose that the user has already selected the interactions that sound significant for him. if this kind of file is provided the `IsBed` parameter should be set to `FASLE`. if the file has no header the user needs to set `header=FALSE`. if `IsBed` parameter is provided, the provided file should have 4 columns, three to describe the region location and the forth column to indicate the ID of the interaction and its position (1: for left and 2: for right), according to following pattern `PET#\d\.1` or `PET#\d\.2`. chr1 1241234 1242234 PET#1.1 chr1 1242724 1243724 PET#1.2 chr1 1282708 1283708 PET#2.1 chr1 1283834 1284834 PET#2.2 chr1 1370871 1371871 PET#3.1 chr1 1372322 1373322 PET#3.2
`IsBed`	(optional) The flag indicates whether the provided file has a 4 columns BED (when `IsBed = TRUE` ) format or has the ChIA-PET tool format (when `IsBed = FALSE`). By default, `IsBed = TRUE`
`header`	(optional) Indicates whether or not the first line of the file should be considered as a header. by default it is `TRUE`
`dist`	(optional) This parameter indicates the size of the up- and down-stream regions (in bp) to consider around the center of each region. 3CPET is based on the assumption that the real ChIP-Seq signal is more enriched around the center of the regions and get more depleted when moving further in both directions. Thus, when the user provides a file that has a ChIA-PET tool format (`IsBed = TRUE`), 3CPET will first center the regions and consider only `distance` bp up- and down-stream of the center of each region. A default value of 1000bp is used.

Value

A ChiapetExperimentData object in which the ppi is populated as a GRanges object.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Li G, Fullwood MJ, Xu H et al.ChIA-PET tool for comprehensive chromatin interaction analysis with paired-end tag sequencing. Genome Biology 2010, 11(2):R22

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples

## Create a ChiapetExperimentData object
x <- ChiapetExperimentData(ppiType= "HPRD")

## load the different datasets (where the file has a Chia-PET tool format )
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")
x <- loadPETs(x, petFile=petFile, IsBed=FALSE)

## when loading an already formatted BED file
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_centered.bed")
x <- loadPETs(x, petFile=petFile, IsBed=TRUE, header=FALSE)

pet(x)
## Create a ChiapetExperimentData object
x <- ChiapetExperimentData(ppiType= "HPRD")

## load the different datasets (where the file has a Chia-PET tool format )
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")
x <- loadPETs(x, petFile=petFile, IsBed=FALSE)

## when loading an already formatted BED file
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_centered.bed")
x <- loadPETs(x, petFile=petFile, IsBed=TRUE, header=FALSE)

pet(x)

Setting the background protein-protein interaction

Description

This method enables the user to define the PPI network as a background network. The user can provide his own PPI or use the HPRD or the Biogrid PPI incorporated in the package.

Usage

## S4 method for signature 'ChiapetExperimentData'
loadPPI(object,type=c("HPRD","Biogid"),customPPI= NULL, 
              filter=FALSE, term ="GO:0005634", annot=NULL, 
              RPKM= NULL, threshold=1 )
## S4 method for signature 'ChiapetExperimentData'
loadPPI(object,type=c("HPRD","Biogid"),customPPI= NULL, 
              filter=FALSE, term ="GO:0005634", annot=NULL, 
              RPKM= NULL, threshold=1 )

Arguments

`object`	a `ChiapetExperimentData` to load the PPI into as an `igraph` object
`type`	if `customPPI = NULL`, this parameter indicats which of the available PPI to use `HPRD` or `Biogrid` (build 3.2.100). by default, the `HPRD` network is used.
`customPPI`	If the user wants to use his own PPI interaction network (for example for another species), he can provide an `igraph` object or a path to an `"ncol"` formatted graph (a file with two columns indicating the interacting parts and an optional third column to indicate the weight). It is preferred that the user provides an `igraph` object, to avoid any problems when parsing the `"ncol"` file.
`filter`	This parameter indicates whether the user want to filter the selected PPI or not. `filter = FALSE` means that the PPI (provided by the user or by the `HPRD` and `Biogrid` networks) will be used as is. if `filter = TRUE`,the proteins in the PPI will be filtered according to their position in cell ( by default proteins located in the nucleus are kept and the other removed). In addition to the filtering by location, if the user wants just to keep the proteins that show a certain amount of expression he can provide a gene expression table to the `RPKM` parameter and set the `threshold` for filtering.
`term`	The GO term used to for filtering. By default, only protein that are located in the nucleus are kept (`term = "GO:0005634"`). If the user want to use another annotation table he can pass a `data.frame` object to the parameter `annot`.
`annot`	If the user wants to provide his own annotation data-set, he can pass a `data.frame` object to this parameter. The passed `data.frame` should have at least two columns named respectively: `geneSymbol` that contains the gene names, and `cellular_component_term` that contain the term. geneSymbol cellular_component_term FAU Ribosome (GO:0005840); Nucleolus (GO:0005730) ALDH3A1 Cytoplasm (GO:0005737); Nucleus (GO:0005634) ASCL1 Nucleus (GO:0005634); Cytoplasm (GO:0005737)
`RPKM`	A `data.frame` that contains the expression of each gene in the PPI. The `data.frame` should at least have 2 columns. The first one contains the gene symbol (should be the same as the one used in the PPI) and the second gives the expression.
`threshold`	Threshold value used to filter gene expression. All genes with expression value less than `threshold` are removed.

Value

A ChiapetExperimentData object in which the ppi slot is populated as an igraph object filtered according to the specified conditions.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Prasad, T. S. K. et al. (2009) Human Protein Reference Database - 2009 Update. Nucleic Acids Research. 37, D767-72.

Chatr-Aryamontri A, Breitkreutz BJ et al. The BioGRID Interaction Database: 2013 update. Nucleic Acids Res. 2012 Nov 30

M.N Djekidel et al,3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, in press, 2015

Examples


## Create a ChiapetExperimentData object
x <- ChiapetExperimentData(ppiType= "HPRD")

## Loading the default HPRD network without filtering

x <- loadPPI(x,type="HPRD")
ppi(x)

## Using the HPRD network and filtering using the GO:0005634
x <- loadPPI(x,type="HPRD", filter=TRUE)
ppi(x)

data(RPKMS)
x <- loadPPI(x,type="HPRD",filter=TRUE,annot= NULL, RPKM= RPKMS, threshold = 5)
ppi(x)
## Create a ChiapetExperimentData object
x <- ChiapetExperimentData(ppiType= "HPRD")

## Loading the default HPRD network without filtering

x <- loadPPI(x,type="HPRD")
ppi(x)

## Using the HPRD network and filtering using the GO:0005634
x <- loadPPI(x,type="HPRD", filter=TRUE)
ppi(x)

data(RPKMS)
x <- loadPPI(x,type="HPRD",filter=TRUE,annot= NULL, RPKM= RPKMS, threshold = 5)
ppi(x)

Loading TF binding sites

Description

This methods reads a BED file that contains the peak positions of different TF. All the TF peaks should be merged into one BED file that contains 4 columns that respectively contain the chromosome name, peak start, peak end, TF name.

Usage

## S4 method for signature 'ChiapetExperimentData,character'
loadTFBS(object, tfbsFile,header=FALSE, ...)
## S4 method for signature 'ChiapetExperimentData,character'
loadTFBS(object, tfbsFile,header=FALSE, ...)

Arguments

object

(Required) a ChiapetExperimentData object

tfbsFile

(Required): path the BED file containing the position of the different TF binding site All the TF binding sites should be merged in this file as showed in this example:

 chr1  569820  569998 BHLHE40
 chr1  936071  936346 BHLHE40
 chr1 1014795 1015082 BHLHE40
 ...........................
 ...........................
chrY 13485240 13485769 ZBTB33
chrY 13488718 13489030 ZBTB33
chrY 15016340 15016848 ZBTB33
chrY 58843918 58844104 ZBTB33

(optional) indicates if the provided BED file has a header or node. by default header=FALSE

...

reserved for later use.

Value

A ChiapetExperimentData object in which the tfbs slot is populated as a GRanges object.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples

## Create a ChiapetExperimentData object
x <- ChiapetExperimentData(ppiType= "HPRD")

## load TFBS
tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")
x <- loadTFBS(x,tfbsFile=tfbsFile)
tfbs(x)
## Create a ChiapetExperimentData object
x <- ChiapetExperimentData(ppiType= "HPRD")

## load TFBS
tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")
x <- loadTFBS(x,tfbsFile=tfbsFile)
tfbs(x)

protein interaction networks maintaining DNA loops

Description

The class NetworkCollection stores information about the set of protein networks that maintains DNA interactions.

Usage

 NetworkCollection(networks, sizes,  TFCollection)
NetworkCollection(networks, sizes,  TFCollection)

Arguments

`networks`	a `list` of `list`, each list contains the set of edges in each network
`sizes`	the sizes of each network. The should correspond to the sizes of the networks
`TFCollection`	the set of all the TF involved in all the interactions.

Details

The NetworkCollection contains three main information: (i) the set of edges in each network maintaining each DNA loop, (ii) the number of edges in each network and (iii) the set of TF involved in all the networks.

Value

a NetworkCollection object.

Accesors

networks: gets the list of networks
sizes: gets the vector containing the size of each network
TF: gets the list of involved TF (after filtering)

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples

showClass("NetworkCollection")
showClass("NetworkCollection")

List of genes in each cluster

Description

This helper methods get the set of genes located in the DNA-regions in each cluster. A folder that contain a bunch of .txt files (one for each cluster) is generated. We consider a gene to be part of a cluster if the (-1500bp, +500bp) around its TSS intersects with one of the DNA regions of the cluster.

Usage

## S4 method for signature 'ChromMaintainers,ChiapetExperimentData'
outputGenesPerClusterToDir(hdaRes,data,path="ClustersGenes", ...)
## S4 method for signature 'ChromMaintainers,ChiapetExperimentData'
outputGenesPerClusterToDir(hdaRes,data,path="ClustersGenes", ...)

Arguments

`hdaRes`	a `ChromMaintainers` object in which the DNA-interaction are already clustered.
`data`	a `ChiapetExperimentData` object containing information about the interactions.
`path`	path of the folder to create. by default a folder named `ClustersGenes` is created in the current working directory.
`...`	additional parameters, not used for the moment.

Value

The specified folder is created with a list .txt files that contain the list of genes.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples


    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
## Not run: 
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)

    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)
    hlda<- clusterInteractions(hlda)

    ## get the list of genes per cluster.
    outputGenesPerClusterToDir(hlda,x)


## End(Not run)
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
## Not run: 
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)

    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)
    hlda<- clusterInteractions(hlda)

    ## get the list of genes per cluster.
    outputGenesPerClusterToDir(hlda,x)


## End(Not run)

List of genes controlled by each network

Description

This helper methods get the set of genes located in the DNA-regions controlled by each network. A folder that contains a bunch of .txt files (one for each network) is generated. We consider (-2500bp, +2500bp) around the TSS of gene located in a region showing 0.5 or more enrichment for the network.

Usage

## S4 method for signature 'ChromMaintainers,ChiapetExperimentData'
outputGenesPerNetworkToDir(hdaRes,data,path="NetworksGenes", ...)
## S4 method for signature 'ChromMaintainers,ChiapetExperimentData'
outputGenesPerNetworkToDir(hdaRes,data,path="NetworksGenes", ...)

Arguments

`hdaRes`	a `ChromMaintainers` object containing the enrichment values.
`data`	a `ChiapetExperimentData` object containing information about the interactions.
`path`	path of the folder to create. by default a folder named `NetworksGenes` is created in the current working directory.
`...`	additional parameters, not used for the moment.

Value

The specified folder is created with a list .txt files each for each network that contain the list of genes.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples

  ## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")    
## Not run: 
    data(RPKMS)
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)

    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)

    ## get the list of genes per network.
    outputGenesPerNetworkToDir(hlda,x)


## End(Not run)
## get the different datasets path
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")    
## Not run: 
    data(RPKMS)
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)

    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)

    ## get the list of genes per network.
    outputGenesPerNetworkToDir(hlda,x)


## End(Not run)

Plotting clustering results

Description

This method enables the user the generate different types of plots to visualize the results.

Usage

## S4 method for signature 'ChromMaintainers'
plot3CPETRes(object, path="", W=14, H=7 , 
          type=c("heatmap","clusters","curve","avgCurve","netSim", "networks"),
          byEdge=TRUE, layoutfct=layout.kamada.kawai, ...)
## S4 method for signature 'ChromMaintainers'
plot3CPETRes(object, path="", W=14, H=7 , 
          type=c("heatmap","clusters","curve","avgCurve","netSim", "networks"),
          byEdge=TRUE, layoutfct=layout.kamada.kawai, ...)

Arguments

`object`	(Required) a `ChromMaintainers` object that contains the results
`path`	(optional) path where to save the plots should be `".pdf"`. if not provided the plot will be displayed on the screen.
`W`	(optional) The width of the plot in the pdf file.by default it is 14 inch
`H`	(optional) The Height of the plot in the pdf file.by default it is 7 inch
`type`	type of the plot to generate. It can support the following values ((default: "heatmap")): `"heatmap"` : Generates a heatmap of the partnership of each DNA interaction to a chromatin maintainer network. Each column is a DNA interaction and each row a chromatin maintainer network. `"clusters"` : generates a pair-wise scatter plots of all clusters. Note: only supported if the `sota` method was applied. `"curve"` : for each cluster plots the enrichment profile of all the elements. `"avgCurve"` : draws the average curve of the enrichment profile for each clusters. `"netSim"` : plots a heatmap showing the percentage of common proteins or edges between the chromatin maintainer networks. Note: generally the similarity between networks is so small so the user can set the value in the diagonal to zero and then re-plot or plot the dissimilarity plot. if `byEdge = TRUE` a similarity based on the common edges is calculated otherwise by common nodes. `"networks"` : plots all the networks. if this option is chosen a pdf file named `"AllGraphs.pdf"` is generated in the current working unless the `path` parameter is explicitly determined. To get a finer control, the user can specify the type of layout to use, by default the `layout.kamada.kawai` is used. For additional layout functions you can check the `igraph` package. The reason we generate a pdf file because there is a lot of networks and it will not be convenient to display them in one plot, or generating multiple plots.
`byEdge`	(optional) if `TRUE` and `type = "netSim"` then the a heatmap showing the similarity between the chromatin maintainer networks by common edges. if `FALSE` the similarity is calculated based on the number of common nodes.
`layoutfct`	(optional) The graph layout algorithm to use. by default the `layout.kamada.kawai` is used. Additional functions are available in the `igraph` package.
`...`	options for future use.

Value

Different types of values are returned depending on the type of the plot selected.

"heatmap": returns a list generated by the pheatmap method, however it is always empty.
"clusters","curve","avgCurve": returns a list describing the number of plots per row and column.
"netSim": returns a list that contains a ggplot2 object and the similarity matrix
"networks": returns a list of ggplot2 objects, one per network.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples


    ## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

## Not run: 
    data(RPKMS)
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)
    
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)

    ## cluster results
    hlda<- clusterInteractions(hlda)

    ## plot a heatmap
    plot3CPETRes(hlda,type="heatmap")

    ## plot clusters pair-wise scatter plots
    plot3CPETRes(hlda,type="clusters")

    ## enrichment plot for the elements in each network
    plot3CPETRes(hlda,type="curve")

    ##  average enrichment plot for the elements in each network
    plot3CPETRes(hlda,type="avgCurve")

    ## heatmap showing the similarity between the different network
    plot3CPETRes(hlda,type="netSim")

    ## plot all the networks in the file "AllGraphs.pdf" 
    nets_plot <- plot3CPETRes(hlda,type="networks")

    ## plot one of the networks
    plot(nets_plot[[3]])


## End(Not run)
## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

## Not run: 
    data(RPKMS)
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    ## build the different indexes
    x <- createIndexes(x)
    
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)

    ## cluster results
    hlda<- clusterInteractions(hlda)

    ## plot a heatmap
    plot3CPETRes(hlda,type="heatmap")

    ## plot clusters pair-wise scatter plots
    plot3CPETRes(hlda,type="clusters")

    ## enrichment plot for the elements in each network
    plot3CPETRes(hlda,type="curve")

    ##  average enrichment plot for the elements in each network
    plot3CPETRes(hlda,type="avgCurve")

    ## heatmap showing the similarity between the different network
    plot3CPETRes(hlda,type="netSim")

    ## plot all the networks in the file "AllGraphs.pdf" 
    nets_plot <- plot3CPETRes(hlda,type="networks")

    ## plot one of the networks
    plot(nets_plot[[3]])


## End(Not run)

Plot interaction on a genomic track

Description

This helper method can be used to display a genomic track for a certain location that contains the chromosome and the related interactions if any.

Usage

## S4 method for signature 'ChiapetExperimentData,GRanges'
plotTrack(object, range)
## S4 method for signature 'ChiapetExperimentData,GRanges'
plotTrack(object, range)

Arguments

`object`	a `ChiapetExperimentData` object which contains the raw data.
`range`	The genomic coordinates of the location to display

Value

a ggbio::track object

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples


    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

    ## Not run: 
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    gr <- GRanges("chr1",IRanges(start=100000,end=300000))
    plotTrack(x,gr)

## End(Not run)
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

    ## Not run: 
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
    gr <- GRanges("chr1",IRanges(start=100000,end=300000))
    plotTrack(x,gr)

## End(Not run)

Loading the raw data all at once

Description

Instead of loading the data one at a time and then creating the index using the methods loadPETs,loadTFBS and createIndexes. The user can directly use the method PrepareData to do that.

Usage

## S4 method for signature 'character,character,logical'
PrepareData(petFile,tfbsFile, petIsBed=TRUE)
## S4 method for signature 'character,character,logical'
PrepareData(petFile,tfbsFile, petIsBed=TRUE)

Arguments

`petFile`	a `character` specifying the path to the interaction file. it the file is in a "bed" format `petIsBed` should be `TRUE`. The data should be formated as described in `loadPETs`.
`tfbsFile`	a `character` specifying the path to the transcription factors binding site file. The data should be formated as described in `loadTFBS`.
`petIsBed`	a logical value specifying if the interaction file is in a "bed" format or not.

Value

A ChiapetExperimentData object in which the pet,tfbs and .dt slots populated .

Author(s)

Mohamed Nadhir Djekidel ([email protected])

References

Mohamed Nadhir D, Yang C et al 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process, ....

Examples


    ## get interactions file location
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")

    ## get the TFBS file location
    tfFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")

    ## Not run: 
    ## load the data
    x<- PrepareData(petFile, tfFile, FALSE)
    x
    
## End(Not run)
## get interactions file location
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")

    ## get the TFBS file location
    tfFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")

    ## Not run: 
    ## load the data
    x<- PrepareData(petFile, tfFile, FALSE)
    x
    
## End(Not run)

A gene expression dataset

Description

A gene expression dataset for the K562 cell line.

Usage

data(RPKMS)data(RPKMS)

Value

a data.frame containing genes expression in K562 cells.

Examples

data(RPKMS)
head(RPKMS)
data(RPKMS)
head(RPKMS)

update top maintain networks elements

Description

This helper method can be used to update the list of interactions constituting the chromatin maintainer networks by changing the threshold.

Usage

## S4 method for signature 'ChromMaintainers,NetworkCollection,numeric'
updateResults(object,nets,thr=0.5)
## S4 method for signature 'ChromMaintainers,NetworkCollection,numeric'
updateResults(object,nets,thr=0.5)

Arguments

`object`	a `ChromMaintainers` object in which the clusters are already calculated
`nets`	a `NetworkCollection` used to get the vocabulary list.
`thr`	The cut-off threshold. the top edges that have an enrichment value that sum up to a value >= `thr` are considered.

Value

a ChromMaintainers object in which the topNodes and topEdges tables are updated.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples


    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
    ## Not run: 
    data(RPKMS)
    
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE)     
    ## build the different indexes
    x <- createIndexes(x)    
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)
    topNodes(hlda)

    hlda <- updateResults(hlda, nets, 0.4)
    topNodes(hlda)
    
## End(Not run)
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
    
    ## Not run: 
    data(RPKMS)
    
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE)     
    ## build the different indexes
    x <- createIndexes(x)    
    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)
    topNodes(hlda)

    hlda <- updateResults(hlda, nets, 0.4)
    topNodes(hlda)
    
## End(Not run)

Generate circos plot per cluster

Description

This method generates a basic circos plot of the chromatin interaction in a given cluster.

Usage

## S4 method for signature 'ChromMaintainers,ChiapetExperimentData,numeric'
visualizeCircos(object, data, cluster = 1, chrLenghts = NULL)
## S4 method for signature 'ChromMaintainers,ChiapetExperimentData,numeric'
visualizeCircos(object, data, cluster = 1, chrLenghts = NULL)

Arguments

`object`	a `ChromMaintainers` object in which the clusters are already calculated.
`data`	a `ChiapetExperimentData` containing the interaction data.
`cluster`	the number of the cluster to display
`chrLenghts`	the chromatin lengths. if not provided the package suppose it is a human chromatin and uses the corresponding lengths. Change it if you are using another species.

Value

circos: a GRanges object that contains the coordinate of the left side interactions. The right side interactions can be accessed by writing circos\$to.gr.
plot: a ggplot object

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples


    ## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

## Not run: 
    data(RPKMS)
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE)
    ## build the different indexes
    x <- createIndexes(x)

    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)
    hlda<- clusterInteractions(hlda)

    visualizeCircos(hlda,x, cluster=3)

## End(Not run)
## get the different datasets path
    petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
    tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  

## Not run: 
    data(RPKMS)
    x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE)
    ## build the different indexes
    x <- createIndexes(x)

    ## build networks connecting each interacting regions
    nets<- buildNetworks(x)

    ## infer the networks
    hlda<- InferNetworks(nets)
    hlda<- clusterInteractions(hlda)

    visualizeCircos(hlda,x, cluster=3)

## End(Not run)

Display a Circos plot of ChIA-pet interactions

Description

This method can be used to draw a circos plot of the chromatin interactions located in the given genomic range.

Usage

## S4 method for signature 'ChiapetExperimentData,GRanges'
visualizeInteractions(object, range)
## S4 method for signature 'ChiapetExperimentData,GRanges'
visualizeInteractions(object, range)

Arguments

`object`	a `ChiapetExperimentData` in which the interactions are already loaded. Check `loadPETs` for more info.
`range`	a `GRanges` object containing the genomic region of interest.

Value

A ciros plot of the selected region is displayed and a list containing the following objects is returned.

circos :: a GRanges object that contains the involved chromatin interactions.
plot :: a ggplot object containing plot.

Author(s)

Mohamed Nadhir Djekidel ([email protected])

Examples

  
  petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
  
  
  ## Not run: 
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## plot intractions in the region of interest
  gr <- GRanges("chr1", IRanges(1240000,10300000))
  p <- visualizeInteractions(x, gr)
  p
  
## End(Not run)
petFile <- file.path(system.file("example",package="R3CPET"),"HepG2_interactions.txt")  
  tfbsFile <- file.path(system.file("example",package="R3CPET"),"HepG2_TF.txt.gz")  
  
  
  ## Not run: 
  x <- ChiapetExperimentData(pet = petFile, tfbs=  tfbsFile, IsBed = FALSE, ppiType="HPRD", filter= TRUE) 
  ## plot intractions in the region of interest
  gr <- GRanges("chr1", IRanges(1240000,10300000))
  p <- visualizeInteractions(x, gr)
  p
  
## End(Not run)

Package 'R3CPET'

Help Index

3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process

Description

Details

Author(s)

References

See Also

Add the gene expression attribute to the graph nodes

Description

Usage

Arguments

Value

Author(s)

See Also

Examples

Biogrid Network

Description

Usage

Value

Source

Examples

Building interaction networks connecting interacting regions

Description

Usage

Arguments

Value

Author(s)

References

See Also

Examples

3CPET used raw data

Description

Usage

Arguments

Details

Value

Slots

Accessors

Author(s)

References

See Also

Examples

Chomatin maintainer networks

Description

Usage

Arguments

Value

Accesors

Methods

Author(s)

References

See Also

Examples

Human chromosom lenghts

Description

Examples

Wrapper for clues and sota S3 classes

Description

Definition

Objects from the Class

Methods

Author(s)

References

See Also

Examples

Grouping DNA interactions by enrichment profile

Description

Usage

Arguments

Value

Author(s)

References

See Also

Examples

Create centered interactions

Description

Usage

Arguments

Value

Wrapper for `clues` and `sota` S3 classes

Generate a list of `igraph` networks