Version 0.2#
January 1, 2017
Changelog#
Bug fixes#
Fixed a bug in
under_sampling.NearMiss
which was not picking the right samples during under sampling for the method 3. By Guillaume Lemaitre.Fixed a bug in
ensemble.EasyEnsemble
, correction of therandom_state
generation. By Guillaume Lemaitre and Christos Aridas.Fixed a bug in
under_sampling.RepeatedEditedNearestNeighbours
, add additional stopping criterion to avoid that the minority class become a majority class or that a class disappear. By Guillaume Lemaitre.Fixed a bug in
under_sampling.AllKNN
, add stopping criteria to avoid that the minority class become a majority class or that a class disappear. By Guillaume Lemaitre.Fixed a bug in
under_sampling.CondensedNeareastNeigbour
, correction of the list of indices returned. By Guillaume Lemaitre.Fixed a bug in
ensemble.BalanceCascade
, solve the issue to obtain a single array if desired. By Guillaume Lemaitre.Fixed a bug in
pipeline.Pipeline
, solve to embedPipeline
in otherPipeline
. #231 by Christos Aridas.Fixed a bug in
pipeline.Pipeline
, solve the issue to put to sampler in the samePipeline
. #188 by Christos Aridas.Fixed a bug in
under_sampling.CondensedNeareastNeigbour
, correction of the shape ofsel_x
when only one sample is selected. By Aliaksei Halachkin.Fixed a bug in
under_sampling.NeighbourhoodCleaningRule
, selecting neighbours instead of minority class misclassified samples. #230 by Aleksandr Loskutov.Fixed a bug in
over_sampling.ADASYN
, correction of the creation of a new sample so that the new sample lies between the minority sample and the nearest neighbour. #235 by Rafael Wampfler.
New features#
Added AllKNN under sampling technique. By Dayvid Oliveira.
Added a module
metrics
implementing some specific scoring function for the problem of balancing. #204 by Guillaume Lemaitre and Christos Aridas.
Enhancement#
Added support for bumpversion. By Guillaume Lemaitre.
Validate the type of target in binary samplers. A warning is raised for the moment. By Guillaume Lemaitre and Christos Aridas.
Change from
cross_validation
module tomodel_selection
module forsklearn
deprecation cycle. By Dayvid Oliveira and Christos Aridas.
API changes summary#
size_ngh
has been deprecated incombine.SMOTEENN
. Usen_neighbors
instead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_ngh
has been deprecated inunder_sampling.EditedNearestNeighbors
. Usen_neighbors
instead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_ngh
has been deprecated inunder_sampling.CondensedNeareastNeigbour
. Usen_neighbors
instead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_ngh
has been deprecated inunder_sampling.OneSidedSelection
. Usen_neighbors
instead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_ngh
has been deprecated inunder_sampling.NeighbourhoodCleaningRule
. Usen_neighbors
instead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_ngh
has been deprecated inunder_sampling.RepeatedEditedNearestNeighbours
. Usen_neighbors
instead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_ngh
has been deprecated inunder_sampling.AllKNN
. Usen_neighbors
instead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.Two base classes
BaseBinaryclassSampler
andBaseMulticlassSampler
have been created to handle the target type and raise warning in case of abnormality. By Guillaume Lemaitre and Christos Aridas.Move
random_state
to be assigned in theSamplerMixin
initialization. By Guillaume Lemaitre.Provide estimators instead of parameters in
combine.SMOTEENN
andcombine.SMOTETomek
. Therefore, the list of parameters have been deprecated. By Guillaume Lemaitre and Christos Aridas.k
has been deprecated inover_sampling.ADASYN
. Usen_neighbors
instead. #183 by Guillaume Lemaitre.k
andm
have been deprecated inover_sampling.SMOTE
. Usek_neighbors
andm_neighbors
instead. #182 by Guillaume Lemaitre.n_neighbors
acceptKNeighborsMixin
based object forunder_sampling.EditedNearestNeighbors
,under_sampling.CondensedNeareastNeigbour
,under_sampling.NeighbourhoodCleaningRule
,under_sampling.RepeatedEditedNearestNeighbours
, andunder_sampling.AllKNN
. #109 by Guillaume Lemaitre.
Documentation changes#
Replace some remaining
UnbalancedDataset
occurences. By Francois Magimel.Added doctest in the documentation. By Guillaume Lemaitre.