NEWS
DaMiRseq 2.6.0
- Major: We fixed a bug in DaMiR.ModelSelect. Now optimal models
are correctly selected;
- Major: Now users can plot specific graphs in DaMiR.Allplot and
we added new plots;
- Minor: We modified the color scale in corrplot
DaMiRseq 2.0.0
- Since version 2.0.0 of the software, DaMiRseq offers a solution
to solve two distinct problems, in supervised learning analysis:
(i) finding a small set of robust features, and (ii) building the
most reliable model to predict new samples;
- The functions DaMiR.EnsembleLearning2cl_Training,
EnsembleLearning2cl_Test and EnsembleLearning2cl_Predict were
deprecated and replaced by DaMiR.EnsL_Train, DaMiR.EnsL_Test
and DaMiR.EnsL_Predict, respectively;
- We have created a new function DaMiR.ModelSelect to select the
best model in a machine learning analysis;
- We have created two new functions DaMiR.iTSnorm and DaMiR.iTSadjust
to normalize and adjust the gene espression of independent test sets;
- Two types of expression value distribution plot were added
to the DaMiR.Allplot function.
DaMiRseq 1.5.2
- The DaMiR.normalization function embeds also the 'logcpm'
normalization.
- Now, DaMiR.EnsembleLearning calculates also the Positive
Predicted Values (PPV) and the Negative Predicted Values (NPV).
- Three new functions have been implemented for the binary
classification task: DaMiR.EnsembleLearning2cl_Training,
DaMiR.EnsembleLearning2cl_Test and DaMiR.EnsembleLearning2cl_Predict.
The first one allows the user to implement the training task and
to select the model with the highest accuracy or the average
accuracy; the second function allows the user to test the
selected classification model on a test set defined by the
user; the last function allows the user to predict the class
of new samples.
- Removed black dots in the violin plots.
DaMiRseq 1.4.1
- Adjusted Sensitivity and Specificity calculations.
DaMiRseq 1.4
Relevant modifications
- DaMiRseq performs both binary and multi-class classification
analysis.
- The 'Stacking' meta-learner can be composed by the user,
setting the new parameter 'cl_type' of the
DaMiR.EnsembleLearning function. Any combination up to 8
classifiers ('RF', 'NB', 'kNN', 'SVM', 'LDA', 'LR', 'NN',
'PLS') is now allowed.
- If the dataset is imbalanced, a 'Down-Sampling' strategy is
automatically applied.
- The DaMiR.FSelect function has the new argument, called
nPlsIter, which allows the user to have a more robust features
set. In fact, several feature sets are generated by the 'bve_pl'
function, setting 'nPLSIter' parameter greater than 1. Finally,
an intersection among all the feature sets is performed to return
those features which constantly occur in all runs. However,
by default, nPlsIter = 1.
Minor modifications and bugs fixed
- DaMiR.Allplot accepts also 'matrix' objects.
- The DaMiR.normalization function estimates the dispersion, through
the parameter 'nFitType'.
- In the DaMiR.normalization function, the gene filtering is
disabled if 'minCount = 0'.
- In the DaMiR.EnsembleLearning function, the method for implementing
the Logistic Regression has been changed to allow multi-class
comparisons; instead of the native 'lm' function, the 'bayesglm'
method is now used.
- The new parameter 'second.var' of the 'DaMiR.SV' function, allows
the user to take into account a secondary variable of interest
(factorial or numerical) that the user does not wish to correct for,
during the sv identification.
DaMiRseq 1.3.7
- DaMiRseq performs multi-class classification analysis.
- The Stacking meta-learner can be composed by the user,
setting the new parameter 'cl_type' of the
DaMiR.EnsembleLearning() function. Any combination
of the 8 classifiers is now allowed.
- If the dataset is imbalanced, a 'Down-Sampling'
strategy is automatically applied.
- The DaMiR.FSelect() function has the new
argument, called 'nPlsIter', which allows the
user to have a more robust features set. In fact,
several feature sets are generated by the bve_pls()
fuction (embedded in DaMiR.FSelect()), setting
'nPLSIter' parameter greater than 1.
Finally, an intersection among all the feature sets
is performed to return those features which constantly
occur in all runs. However, by default, 'nPlsIter = 1'.
- DaMiR.Allplot() accepts also 'matrix' objects as well as NA values
(which are not plotted).
- The DaMiR.normalization() function estimates the
dispersion, through the parameter 'nFitType'; as in
DESeq2 package, the argument can be
'parametric' (default), 'local' and 'mean'.
- In the DaMiR.normalization() function, the gene
filtering is desabled if 'minCount = 0'.
- In the DaMiR.EnsembleLearning() function, the method for
implementing the Logistic Regression has been changed
to allow multi-class comparisons; instead of the native
'lm' function, 'bayesglm' method implemented in the caret
'train' function, properly set, is now used.
- The new parameter 'second.var' of the DaMiR.SV() function,
allows the user to take into account a secondary variable
of interest (factorial or numerical) that the user does
not wish to correct for, during the sv identification.