New CRAN release after archival.
type
argument.R CMD CHECK
. #38Fixed an issue with the HDRDA classifier’s predict
function. The posterior probabilities did not sum to 1 because they were
unnormalized. #34
Fixed another issue with the HDRDA classifier’s
predict
function, where the class names were incorrect when
predicting a single observation. #34
Improved docs throughout the package to pass
R CMD CHECK
. #35
The predict
function now returns
posterior-probability estimates for each classifier.
The object returned by cv_hdrda()
can be plotted. A
heatmap is produced using ggplot2
to illustrate the
cross-validation error rate for each tuning-parameter pair
considered.
The predict
function for the HDRDA classifier is now
substantially faster when classifying a large number of observations.
#33
The cross-validation helper function cv_hdrda()
for
the HDRDA classifier now returns a trained classifier rather than the
optimal model.
cv_hdrda()
also has an optional verbose
argument to dump summary information while the cross-validation is
running.
Fixed issue with classifiers’ documentation not appearing in help index. #26
Better handling of HDRDA when its tuning parameters are both 0.
Corrected calculation of W_k and Q_k in HDRDA classifier.
Added unit tests for HDRDA.
Can now specify population means in
generate_blockdiag()
.
Added unit tests for generate_blockdiag()
.
Updated man docs with roxygen2 4.0.
Added log_determinant()
helper function to calculate
the log-determinant of a matrix.
The High-Dimensional Regularized Discriminant Analysis (HDRDA)
classifier from Ramey, Stein, and Young (2014) implemented in
hdrda()
has been revamped to improve its computational
performance.
lda_pseudo()
is an implementation of Linear
Discriminant Analysis (LDA) with the Moore-Penrose
Pseudo-Inverse
lda_schafer()
is an implementation of Linear
Discriminant Analysis (LDA) using the covariance matrix estimator from
Schafer and Strimmer (2005)
lda_thomaz()
is an implementation of Linear
Discriminant Analysis (LDA) using the covariance matrix estimator from
Thomaz, Kitani, and Gillies (2006)
mdeb()
is an implementation of the Minimum Distance
Empirical Bayesian Estimator (MDEB) classifier from Srivistava and
Kubokawa (2007)
mdmeb()
is an implementation of the Minimum Distance
Rule using Modified Empirical Bayes (MDMEB) classifier from Srivistava
and Kubokawa (2007)
mdmp()
is an implementation of the Minimum Distance
Rule using Moore-Penrose Inverse (MDMP) classifier from Srivistava and
Kubokawa (2007)
smdlda()
is an implementation of the
Shrinkage-mean-based Diagonal Linear Discriminant Analysis (SmDLDA) from
Tong, Chen, and Zhao (2012)
smdqda()
is an implementation of the
Shrinkage-mean-based Diagonal Quadratic Discriminant Analysis (SmDQDA)
from Tong, Chen, and Zhao (2012)
Added a summary function for hdrda
classifiers
First version of the sparsediscrim
package. With
this package, we aim to provide a large collection of regularized and
sparse discriminant analysis classifiers intended for high-dimensional
classification.
hdrda()
is an implementation of the High-Dimensional
Regularized Discriminant Analysis classifier from Ramey, Stein, and
Young (2014).
dlda()
is an implementation of the Diagonal Linear
Discriminant Analysis classifier from Dudoit, Fridlyand, and Speed
(2002).
dqda()
is an implementation of the Diagonal
Quadratic Discriminant Analysis classifier from Dudoit, Fridlyand, and
Speed (2002).
sdlda()
is an implementation of the Shrinkage-based
Diagonal Linear Discriminant Analysis classifier from Pang, Tong, and
Zhao (2009).
sdqda()
is an implementation of the Shrinkage-based
Diagonal Quadratic Discriminant Analysis classifier from Pang, Tong, and
Zhao (2009).
generate_blockdiag()
generates random variates from
K multivariate normal populations, where each class is generated with a
constant mean vector and a covariance matrix consisting of
block-diagonal autocorrelation matrices.
generate_intraclass()
generates random variates from
K multivariate normal populations, where class is generated with a
constant mean vector and an intraclass covariance matrix.
cv_partition()
randomly partitions data for
cross-validation.
no_intercept()
removes the intercept term from a
formula if it is included.
cov_mle()
computes the maximum likelihood estimator
for the sample covariance matrix under the assumption of multivariate
normality.
cov_pool()
computes the pooled maximum likelihood
estimator for the common covariance matrix under the assumption of
multivariate normality.
cov_eigen()
computes the eigenvalue decomposition of
the maximum likelihood estimators of the covariance matrices for the
given data matrix. We provide an option to calculate the eigenvalue
decomposition using the Fast Singular Value Decomposition, which can
greatly expedite the eigenvalue decomposition for very tall data (large
n, small p) or very wide data (small n, large p).