Implementation of the GiniIndex With Laplace correction as separation criterion.
Large Gini Indices (maximum 0.5) mean , that the sample is well mixed (same amount of signal and bkg) bkg.
Small Indices mean, well separated.
\[Gini(Sample M) = 1 - (\frac{c(1)}{N})^2 - (\frac{c(2)}{N})^2 .... - (\frac{c(k)}{N})^2 \]
Where:
\( M \) is a sample of whatever \( N \) elements (events) that belong to \( K \) different classes.
\( c(k) \) is the number of elements that belong to class \( k \) for just Signal and Background classes this boils down to:
The Laplace's correction to the probability distribution would turn the \( \frac{c(1)}{N} \) into \( \frac{(c(1)+1)}{(N+2)} \) using this the simple Gini Index for two classes
\[Gini(Sample) = \frac{2sb}{(s+b)^2} \]
turns into
\[GiniLaplace(Sample) = \frac{2(sb+s+b+1)}{(s+b+2)^2} \]
Definition at line 59 of file GiniIndexWithLaplace.h.
Public Member Functions | |
| GiniIndexWithLaplace () | |
| GiniIndexWithLaplace (const GiniIndexWithLaplace &g) | |
| virtual | ~GiniIndexWithLaplace () |
| const TString & | GetName () |
| virtual Double_t | GetSeparationGain (const Double_t nSelS, const Double_t nSelB, const Double_t nTotS, const Double_t nTotB) |
| Separation Gain: the measure of how the quality of separation of the sample increases by splitting the sample e.g. | |
| Double_t | GetSeparationIndex (const Double_t s, const Double_t b) override |
| TClass * | IsA () const override |
| void | Streamer (TBuffer &) override |
| void | StreamerNVirtual (TBuffer &ClassDef_StreamerNVirtual_b) |
Static Public Member Functions | |
| static TClass * | Class () |
| static const char * | Class_Name () |
| static constexpr Version_t | Class_Version () |
| static const char * | DeclFileName () |
Protected Attributes | |
| TString | fName |
| Double_t | fPrecisionCut |
#include <TMVA/GiniIndexWithLaplace.h>
|
inline |
Definition at line 64 of file GiniIndexWithLaplace.h.
|
inline |
Definition at line 67 of file GiniIndexWithLaplace.h.
|
inlinevirtual |
Definition at line 70 of file GiniIndexWithLaplace.h.
|
static |
|
static |
|
inlinestaticconstexpr |
Definition at line 77 of file GiniIndexWithLaplace.h.
|
inlinestatic |
Definition at line 77 of file GiniIndexWithLaplace.h.
|
inlineinherited |
Definition at line 104 of file SeparationBase.h.
|
virtualinherited |
Separation Gain: the measure of how the quality of separation of the sample increases by splitting the sample e.g.
into a "left-node" and a "right-node" (N * Index_parent) - (N_left * Index_left) - (N_right * Index_right) this is then the quality criterion which is optimized for when trying to increase the information in the system (making the best selection
Reimplemented in TMVA::SdivSqrtSplusB.
Definition at line 100 of file SeparationBase.cxx.
|
overridevirtual |
Implements TMVA::SeparationBase.
Definition at line 81 of file GiniIndexWithLaplace.cxx.
|
inlineoverridevirtual |
Reimplemented from TMVA::SeparationBase.
Definition at line 77 of file GiniIndexWithLaplace.h.
|
overridevirtual |
Reimplemented from TMVA::SeparationBase.
|
inline |
Definition at line 77 of file GiniIndexWithLaplace.h.
|
protectedinherited |
Definition at line 108 of file SeparationBase.h.
|
protectedinherited |
Definition at line 110 of file SeparationBase.h.