Parameters and Configuration¶
Lasso.bag¶
-
mat¶ Type: data.frame Default: noneindependent variables
-
out.mat¶ Type: data.frame Default: nonedependent variables, which contains one column or two columns
-
bootN¶ Type: numeric Default: 1000the size of resampled sample, only valid when permutation set to TRUE
-
imputeN¶ Type: numeric Default: 1000the initial permutation times, only valid when permutation set to TRUE
-
imputeN.max¶ Type: numeric Default: 2000the max permutation times. Regardless of whether p has meet the requirement,, only valid when permutation set to TRUE
-
permut.increase¶ Type: numeric Default: 1000if the initial imputeN times of permutation doesn’t meet the requirement, then we add ??permut.increase times of permutation?? to get more random/permutation values, only valid when permutation set to TRUE
-
boot.rep¶ Type: bool Default: TRUEwhether :”sampling with return” or not, only valid when permutation set to TRUE
-
a.family¶ Type: string Default: nonewhat kind of regression method to use, it should match the type of out.mat
-
parallel¶ Type: bool Default: FALSEwhether the script run in parallel, you need to set n.cores in case this package conquers all your cpu resource
-
fit.pareto¶ Type: string Default: mlethe method of fitting Generalized Pareto Distribution, alternative choice is “gd”, for gradient descend
-
permutation¶ Type: string Default: TRUEto decide whether to do permutation test, if set TRUE, no p value returns
-
n.cores¶ Type: numeric Default: 1how many cores/process to be assigned for this function, in Windows, you have to set this to 1
-
rd.seed¶ Type: numeric Default: 89757it is the random seed of this function, in case some of the experiments need to be reappeared
-
plot.freq¶ Type: string/bool Default: FALSEwhether to show all the non-zero frequency in the final barplot or not. If “full”, all the features(including zero frequency) will be plotted. If “part”, all the non-zero features will be plotted. If “not”, will not print the plot.
-
plot.out¶ Type: string/bool Default: FALSEthe path or file’s name to save the plot. If set to FALSE, no plot will be output. If you run this function in Linux command line, you don’t have to set this param for the plot.freq will output your plot to your current working directory with name “Rplot.pdf”.Default to FALSE
LessPermutation¶
-
X¶ Type: vector Default: nonea union of input data, e.g. c(1,2,3,4,5,6).
-
x0¶ Type: numeric Default: nonethe observed value
-
fitting.method¶ Type: string Default: gdthe fitting method of General Pareto Distribution(GPD)
-
search.step¶ Type: numeric Default: 0.01the length of step (this param * length(X)) to find threshold, default 0.01
-
fit.cutoff¶ Type: numeric Default: 0.05the cutoff of p value to judge whether it fits well to GPD, default 0.05
-
when.to.fit¶ Type: numeric Default: 0.05a cutoff to tell how many sample values are bigger than the observed value then we don’t need to fit GPD. it is a portion.