Classification example of TMVA based on public Higgs UCI dataset

The UCI data set is a public HIGGS data set , see http://archive.ics.uci.edu/ml/datasets/HIGGS used in this paper: Baldi, P., P. Sadowski, and D. Whiteson. “Searching for Exotic Particles in High-energy Physics with Deep Learning.” Nature Communications 5 (July 2, 2014).

******************************************************************************
*Tree    :sig_tree  : tree                                                   *
*Entries :    10000 : Total =         1177229 bytes  File  Size =     785298 *
*        :          : Tree compression factor =   1.48                       *
******************************************************************************
*Br    0 :Type      : Type/F                                                 *
*Entries :    10000 : Total  Size=      40556 bytes  File Size  =        307 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression= 130.54     *
*............................................................................*
*Br    1 :lepton_pT : lepton_pT/F                                            *
*Entries :    10000 : Total  Size=      40581 bytes  File Size  =      30464 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.32     *
*............................................................................*
*Br    2 :lepton_eta : lepton_eta/F                                          *
*Entries :    10000 : Total  Size=      40586 bytes  File Size  =      28650 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.40     *
*............................................................................*
*Br    3 :lepton_phi : lepton_phi/F                                          *
*Entries :    10000 : Total  Size=      40586 bytes  File Size  =      30508 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.31     *
*............................................................................*
*Br    4 :missing_energy_magnitude : missing_energy_magnitude/F              *
*Entries :    10000 : Total  Size=      40656 bytes  File Size  =      35749 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.12     *
*............................................................................*
*Br    5 :missing_energy_phi : missing_energy_phi/F                          *
*Entries :    10000 : Total  Size=      40626 bytes  File Size  =      36766 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.09     *
*............................................................................*
*Br    6 :jet1_pt   : jet1_pt/F                                              *
*Entries :    10000 : Total  Size=      40571 bytes  File Size  =      32298 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.24     *
*............................................................................*
*Br    7 :jet1_eta  : jet1_eta/F                                             *
*Entries :    10000 : Total  Size=      40576 bytes  File Size  =      28467 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.41     *
*............................................................................*
*Br    8 :jet1_phi  : jet1_phi/F                                             *
*Entries :    10000 : Total  Size=      40576 bytes  File Size  =      30399 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.32     *
*............................................................................*
*Br    9 :jet1_b-tag : jet1_b-tag/F                                          *
*Entries :    10000 : Total  Size=      40586 bytes  File Size  =       5087 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   7.88     *
*............................................................................*
*Br   10 :jet2_pt   : jet2_pt/F                                              *
*Entries :    10000 : Total  Size=      40571 bytes  File Size  =      31561 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.27     *
*............................................................................*
*Br   11 :jet2_eta  : jet2_eta/F                                             *
*Entries :    10000 : Total  Size=      40576 bytes  File Size  =      28616 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.40     *
*............................................................................*
*Br   12 :jet2_phi  : jet2_phi/F                                             *
*Entries :    10000 : Total  Size=      40576 bytes  File Size  =      30547 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.31     *
*............................................................................*
*Br   13 :jet2_b-tag : jet2_b-tag/F                                          *
*Entries :    10000 : Total  Size=      40586 bytes  File Size  =       5031 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   7.97     *
*............................................................................*
*Br   14 :jet3_pt   : jet3_pt/F                                              *
*Entries :    10000 : Total  Size=      40571 bytes  File Size  =      30642 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.31     *
*............................................................................*
*Br   15 :jet3_eta  : jet3_eta/F                                             *
*Entries :    10000 : Total  Size=      40576 bytes  File Size  =      28955 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.38     *
*............................................................................*
*Br   16 :jet3_phi  : jet3_phi/F                                             *
*Entries :    10000 : Total  Size=      40576 bytes  File Size  =      30433 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.32     *
*............................................................................*
*Br   17 :jet3_b-tag : jet3_b-tag/F                                          *
*Entries :    10000 : Total  Size=      40586 bytes  File Size  =       4879 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   8.22     *
*............................................................................*
*Br   18 :jet4_pt   : jet4_pt/F                                              *
*Entries :    10000 : Total  Size=      40571 bytes  File Size  =      29189 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.37     *
*............................................................................*
*Br   19 :jet4_eta  : jet4_eta/F                                             *
*Entries :    10000 : Total  Size=      40576 bytes  File Size  =      29311 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.37     *
*............................................................................*
*Br   20 :jet4_phi  : jet4_phi/F                                             *
*Entries :    10000 : Total  Size=      40576 bytes  File Size  =      30525 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.31     *
*............................................................................*
*Br   21 :jet4_b-tag : jet4_b-tag/F                                          *
*Entries :    10000 : Total  Size=      40586 bytes  File Size  =       4725 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   8.48     *
*............................................................................*
*Br   22 :m_jj      : m_jj/F                                                 *
*Entries :    10000 : Total  Size=      40556 bytes  File Size  =      34991 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.15     *
*............................................................................*
*Br   23 :m_jjj     : m_jjj/F                                                *
*Entries :    10000 : Total  Size=      40561 bytes  File Size  =      34460 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.16     *
*............................................................................*
*Br   24 :m_lv      : m_lv/F                                                 *
*Entries :    10000 : Total  Size=      40556 bytes  File Size  =      32232 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.24     *
*............................................................................*
*Br   25 :m_jlv     : m_jlv/F                                                *
*Entries :    10000 : Total  Size=      40561 bytes  File Size  =      34598 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.16     *
*............................................................................*
*Br   26 :m_bb      : m_bb/F                                                 *
*Entries :    10000 : Total  Size=      40556 bytes  File Size  =      35012 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.14     *
*............................................................................*
*Br   27 :m_wbb     : m_wbb/F                                                *
*Entries :    10000 : Total  Size=      40561 bytes  File Size  =      34493 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.16     *
*............................................................................*
*Br   28 :m_wwbb    : m_wwbb/F                                               *
*Entries :    10000 : Total  Size=      40566 bytes  File Size  =      34410 *
*Baskets :        1 : Basket Size=    1500672 bytes  Compression=   1.16     *
*............................................................................*
DataSetInfo              : [dataset] : Added class "Signal"
                         : Add Tree sig_tree of type Signal with 10000 events
DataSetInfo              : [dataset] : Added class "Background"
                         : Add Tree bkg_tree of type Background with 10000 events
Factory                  : Booking method: ␛[1mLikelihood␛[0m
                         : 
Factory                  : Booking method: ␛[1mFisher␛[0m
                         : 
Factory                  : Booking method: ␛[1mBDT␛[0m
                         : 
                         : Rebuilding Dataset dataset
                         : Building event vectors for type 2 Signal
                         : Dataset[dataset] :  create input formulas for tree sig_tree
                         : Building event vectors for type 2 Background
                         : Dataset[dataset] :  create input formulas for tree bkg_tree
DataSetFactory           : [dataset] : Number of events in input trees
                         : 
                         : 
                         : Number of training and testing events
                         : ---------------------------------------------------------------------------
                         : Signal     -- training events            : 7000
                         : Signal     -- testing events             : 3000
                         : Signal     -- training and testing events: 10000
                         : Background -- training events            : 7000
                         : Background -- testing events             : 3000
                         : Background -- training and testing events: 10000
                         : 
DataSetInfo              : Correlation matrix (Signal):
                         : ----------------------------------------------------------------
                         :             m_jj   m_jjj    m_lv   m_jlv    m_bb   m_wbb  m_wwbb
                         :    m_jj:  +1.000  +0.777  +0.010  +0.107  +0.036  +0.517  +0.532
                         :   m_jjj:  +0.777  +1.000  +0.006  +0.083  +0.157  +0.682  +0.669
                         :    m_lv:  +0.010  +0.006  +1.000  +0.111  -0.026  +0.011  +0.023
                         :   m_jlv:  +0.107  +0.083  +0.111  +1.000  +0.325  +0.550  +0.555
                         :    m_bb:  +0.036  +0.157  -0.026  +0.325  +1.000  +0.463  +0.347
                         :   m_wbb:  +0.517  +0.682  +0.011  +0.550  +0.463  +1.000  +0.912
                         :  m_wwbb:  +0.532  +0.669  +0.023  +0.555  +0.347  +0.912  +1.000
                         : ----------------------------------------------------------------
DataSetInfo              : Correlation matrix (Background):
                         : ----------------------------------------------------------------
                         :             m_jj   m_jjj    m_lv   m_jlv    m_bb   m_wbb  m_wwbb
                         :    m_jj:  +1.000  +0.804  +0.017  +0.125  +0.007  +0.381  +0.394
                         :   m_jjj:  +0.804  +1.000  +0.025  +0.159  +0.153  +0.535  +0.520
                         :    m_lv:  +0.017  +0.025  +1.000  +0.114  +0.042  +0.064  +0.069
                         :   m_jlv:  +0.125  +0.159  +0.114  +1.000  +0.286  +0.592  +0.542
                         :    m_bb:  +0.007  +0.153  +0.042  +0.286  +1.000  +0.623  +0.441
                         :   m_wbb:  +0.381  +0.535  +0.064  +0.592  +0.623  +1.000  +0.878
                         :  m_wwbb:  +0.394  +0.520  +0.069  +0.542  +0.441  +0.878  +1.000
                         : ----------------------------------------------------------------
DataSetFactory           : [dataset] :  
                         : 
Factory                  : Booking method: ␛[1mDNN_GPU␛[0m
                         : 
                         : Parsing option string: 
                         : ... "!H:V:ErrorStrategy=CROSSENTROPY:VarTransform=G:WeightInitialization=XAVIER:InputLayout=1|1|7:BatchLayout=1|128|7:Layout=DENSE|64|TANH,DENSE|64|TANH,DENSE|64|TANH,DENSE|64|TANH,DENSE|1|LINEAR:TrainingStrategy=LearningRate=1e-3,Momentum=0.9,ConvergenceSteps=10,BatchSize=128,TestRepetitions=1,MaxEpochs=20,WeightDecay=1e-4,Regularization=None,Optimizer=ADAM,ADAM_beta1=0.9,ADAM_beta2=0.999,ADAM_eps=1.E-7,DropConfig=0.0+0.0+0.0+0.:Architecture=GPU"
                         : The following options are set:
                         : - By User:
                         :     <none>
                         : - Default:
                         :     Boost_num: "0" [Number of times the classifier will be boosted]
                         : Parsing option string: 
                         : ... "!H:V:ErrorStrategy=CROSSENTROPY:VarTransform=G:WeightInitialization=XAVIER:InputLayout=1|1|7:BatchLayout=1|128|7:Layout=DENSE|64|TANH,DENSE|64|TANH,DENSE|64|TANH,DENSE|64|TANH,DENSE|1|LINEAR:TrainingStrategy=LearningRate=1e-3,Momentum=0.9,ConvergenceSteps=10,BatchSize=128,TestRepetitions=1,MaxEpochs=20,WeightDecay=1e-4,Regularization=None,Optimizer=ADAM,ADAM_beta1=0.9,ADAM_beta2=0.999,ADAM_eps=1.E-7,DropConfig=0.0+0.0+0.0+0.:Architecture=GPU"
                         : The following options are set:
                         : - By User:
                         :     V: "True" [Verbose output (short form of "VerbosityLevel" below - overrides the latter one)]
                         :     VarTransform: "G" [List of variable transformations performed before training, e.g., "D_Background,P_Signal,G,N_AllClasses" for: "Decorrelation, PCA-transformation, Gaussianisation, Normalisation, each for the given class of events ('AllClasses' denotes all events of all classes, if no class indication is given, 'All' is assumed)"]
                         :     H: "False" [Print method-specific help message]
                         :     InputLayout: "1|1|7" [The Layout of the input]
                         :     BatchLayout: "1|128|7" [The Layout of the batch]
                         :     Layout: "DENSE|64|TANH,DENSE|64|TANH,DENSE|64|TANH,DENSE|64|TANH,DENSE|1|LINEAR" [Layout of the network.]
                         :     ErrorStrategy: "CROSSENTROPY" [Loss function: Mean squared error (regression) or cross entropy (binary classification).]
                         :     WeightInitialization: "XAVIER" [Weight initialization strategy]
                         :     Architecture: "GPU" [Which architecture to perform the training on.]
                         :     TrainingStrategy: "LearningRate=1e-3,Momentum=0.9,ConvergenceSteps=10,BatchSize=128,TestRepetitions=1,MaxEpochs=20,WeightDecay=1e-4,Regularization=None,Optimizer=ADAM,ADAM_beta1=0.9,ADAM_beta2=0.999,ADAM_eps=1.E-7,DropConfig=0.0+0.0+0.0+0." [Defines the training strategies.]
                         : - Default:
                         :     VerbosityLevel: "Default" [Verbosity level]
                         :     CreateMVAPdfs: "False" [Create PDFs for classifier outputs (signal and background)]
                         :     IgnoreNegWeightsInTraining: "False" [Events with negative weights are ignored in the training (but are included for testing and performance evaluation)]
                         :     RandomSeed: "0" [Random seed used for weight initialization and batch shuffling]
                         :     ValidationSize: "20%" [Part of the training data to use for validation. Specify as 0.2 or 20% to use a fifth of the data set as validation set. Specify as 100 to use exactly 100 events. (Default: 20%)]
DNN_GPU                  : [dataset] : Create Transformation "G" with events from all classes.
                         : 
                         : Transformation, Variable selection : 
                         : Input : variable 'm_jj' <---> Output : variable 'm_jj'
                         : Input : variable 'm_jjj' <---> Output : variable 'm_jjj'
                         : Input : variable 'm_lv' <---> Output : variable 'm_lv'
                         : Input : variable 'm_jlv' <---> Output : variable 'm_jlv'
                         : Input : variable 'm_bb' <---> Output : variable 'm_bb'
                         : Input : variable 'm_wbb' <---> Output : variable 'm_wbb'
                         : Input : variable 'm_wwbb' <---> Output : variable 'm_wwbb'
                         : Will now use the GPU architecture !

 
## Declare Factory
 
 
## Create the Factory class. Later you can choose the methods
## whose performance you'd like to investigate.
 
## The factory is the major TMVA object you have to interact with. Here is the list of parameters you need to pass
 
## - The first argument is the base of the name of all the output
## weightfiles in the directory weight/ that will be created with the
##    method parameters
 
## - The second argument is the output file for the training results
 
## - The third argument is a string option defining some general configuration for the TMVA session. For example all TMVA output can be suppressed by removing the "!" (not) in front of the "Silent" argument in the option string
 
import os
 
import ROOT
 
TMVA = ROOT.TMVA
TFile = ROOT.TFile
 
TMVA.Tools.Instance()
 
# options to control used methods
useLikelihood = True  # likelihood based discriminant
useLikelihoodKDE = False  # likelihood based discriminant
useFischer = True  # Fischer discriminant
useMLP = False  # Multi Layer Perceptron (old TMVA NN implementation)
useBDT = True  # Boosted Decision Tree
useDL = True  # TMVA Deep learning ( CPU or GPU)
useKeras = True  # Use Keras Deep Learning via PyMVA
 
if ROOT.gSystem.GetFromPipe("root-config --has-tmva-pymva") == "yes":
    TMVA.PyMethodBase.PyInitialize()
else:
    useKeras = False  # cannot use Keras if PYMVA is not available
 
if useKeras:
    try:
        pass
    except:
        ROOT.Warning("TMVA_Higgs_Classification", "Skip using Keras since tensorflow is not available")
        useKeras = False
 
outputFile = TFile.Open("Higgs_ClassificationOutput.root", "RECREATE")
factory = TMVA.Factory(
    "TMVA_Higgs_Classification", outputFile, V=False, ROC=True, Silent=False, Color=True, AnalysisType="Classification"
)
 
 
## Setup Dataset(s)
 
# Define now input data file and signal and background trees
 
inputFileName = str(ROOT.gROOT.GetTutorialDir()) + "/machine_learning/data/Higgs_data.root"
 
inputFile = TFile.Open(inputFileName)
if inputFile is None:
   raise FileNotFoundError("Input file is not found - exit")
 
 
# --- Register the training and test trees
signalTree = inputFile.Get("sig_tree")
backgroundTree = inputFile.Get("bkg_tree")
signalTree.Print()
 
## Declare DataLoader(s)
 
# The next step is to declare the DataLoader class that deals with input variables
# Define the input variables that shall be used for the MVA training
# note that you may also use variable expressions, which can be parsed by TTree::Draw( "expression" )]
loader = TMVA.DataLoader("dataset")
 
loader.AddVariable("m_jj")
loader.AddVariable("m_jjj")
loader.AddVariable("m_lv")
loader.AddVariable("m_jlv")
loader.AddVariable("m_bb")
loader.AddVariable("m_wbb")
loader.AddVariable("m_wwbb")
 
# We set now the input data trees in the TMVA DataLoader class
# global event weights per tree (see below for setting event-wise weights)
signalWeight = 1.0
backgroundWeight = 1.0
# You can add an arbitrary number of signal or background trees
loader.AddSignalTree(signalTree, signalWeight)
loader.AddBackgroundTree(backgroundTree, backgroundWeight)
 
# Set individual event weights (the variables must exist in the original TTree)
#   for signal    : factory->SetSignalWeightExpression    ("weight1*weight2");
#   for background: factory->SetBackgroundWeightExpression("weight1*weight2");
#   loader->SetBackgroundWeightExpression( "weight" );
 
 
# Apply additional cuts on the signal and background samples (can be different)
mycuts = ROOT.TCut("")  # for example: TCut mycuts = "abs(var1)<0.5 && abs(var2-0.5)<1";
mycutb = ROOT.TCut("")  # for example: TCut mycutb = "abs(var1)<0.5";
 
# Tell the factory how to use the training and testing events
#
# If no numbers of events are given, half of the events in the tree are used
# for training, and the other half for testing:
#    loader->PrepareTrainingAndTestTree( mycut, "SplitMode=random:!V" );
# To also specify the number of testing events, use:
 
loader.PrepareTrainingAndTestTree(
    mycuts, mycutb, nTrain_Signal=7000, nTrain_Background=7000, SplitMode="Random", NormMode="NumEvents", V=False
)
 
## Booking Methods
 
# Here we book the TMVA methods. We book first a Likelihood based on KDE (Kernel Density Estimation), a Fischer discriminant, a BDT
# and a shallow neural network
# Likelihood ("naive Bayes estimator")
if useLikelihood:
    factory.BookMethod(
        loader,
        TMVA.Types.kLikelihood,
        "Likelihood",
        H=True,
        V=False,
        TransformOutput=True,
        PDFInterpol="Spline2:NSmoothSig[0]=20:NSmoothBkg[0]=20:NSmoothBkg[1]=10",
        NSmooth=1,
        NAvEvtPerBin=50,
    )
 
# Use a kernel density estimator to approximate the PDFs
if useLikelihoodKDE:
    factory.BookMethod(
        loader,
        TMVA.Types.kLikelihood,
        "LikelihoodKDE",
        H=False,
        V=False,
        TransformOutput=False,
        PDFInterpol="KDE",
        KDEtype="Gauss",
        KDEiter="Adaptive",
        KDEFineFactor=0.3,
        KDEborder=None,
        NAvEvtPerBin=50,
    )
 
# Fisher discriminant (same as LD)
if useFischer:
    factory.BookMethod(
        loader,
        TMVA.Types.kFisher,
        "Fisher",
        H=True,
        V=False,
        Fisher=True,
        VarTransform=None,
        CreateMVAPdfs=True,
        PDFInterpolMVAPdf="Spline2",
        NbinsMVAPdf=50,
        NsmoothMVAPdf=10,
    )
 
# Boosted Decision Trees
if useBDT:
    factory.BookMethod(
        loader,
        TMVA.Types.kBDT,
        "BDT",
        V=False,
        NTrees=200,
        MinNodeSize="2.5%",
        MaxDepth=2,
        BoostType="AdaBoost",
        AdaBoostBeta=0.5,
        UseBaggedBoost=True,
        BaggedSampleFraction=0.5,
        SeparationType="GiniIndex",
        nCuts=20,
    )
 
# Multi-Layer Perceptron (Neural Network)
if useMLP:
    factory.BookMethod(
        loader,
        TMVA.Types.kMLP,
        "MLP",
        H=False,
        V=False,
        NeuronType="tanh",
        VarTransform="N",
        NCycles=100,
        HiddenLayers="N+5",
        TestRate=5,
        UseRegulator=False,
    )
 
## Here we book the new DNN of TMVA if we have support in ROOT. We will use GPU version if ROOT is enabled with GPU
 
 
## Booking Deep Neural Network
 
# Here we define the option string for building the Deep Neural network model.
 
#### 1. Define DNN layout
 
# The DNN configuration is defined using a string. Note that whitespaces between characters are not allowed.
 
# We define first the DNN layout:
 
# - **input layout** :   this defines the input data format for the DNN as  ``input depth | height | width``.
# In case of a dense layer as first layer the input layout should be  ``1 | 1 | number of input variables`` (features)
# - **batch layout**  : this defines how are the input batch. It is related to input layout but not the same.
# If the first layer is dense it should be ``1 | batch size ! number of variables`` (features)
 
# *(note the use of the character `|` as  separator of  input parameters for DNN layout)*
 
# note that in case of only dense layer the input layout could be omitted but it is required when defining more
# complex architectures
 
# - **layer layout** string defining the layer architecture. The syntax is
# - layer type (e.g. DENSE, CONV, RNN)
# - layer parameters (e.g. number of units)
# - activation function (e.g  TANH, RELU,...)
 
# *the different layers are separated by the ``","`` *
 
#### 2. Define Training Strategy
 
# We define here the training strategy parameters for the DNN. The parameters are separated by the ``","`` separator.
# One can then concatenate different training strategy with different parameters. The training strategy are separated by
# the ``"|"`` separator.
 
# - Optimizer
# - Learning rate
# - Momentum (valid for SGD and RMSPROP)
# - Regularization and Weight Decay
# - Dropout
# - Max number of epochs
# - Convergence steps. if the test error will not decrease after that value the training will stop
# - Batch size (This value must be the same specified in the input layout)
# - Test Repetitions (the interval when the test error will be computed)
 
 
#### 3. Define general DNN options
 
# We define the general DNN options concatenating in the final string the previously defined layout and training strategy.
# Note we use the ``":"`` separator to separate the different higher level options, as in the other TMVA methods.
# In addition to input layout, batch layout and training strategy we add now:
 
# - Type of Loss function (e.g. CROSSENTROPY)
# - Weight Initizalization (e.g XAVIER, XAVIERUNIFORM, NORMAL )
# - Variable Transformation
# - Type of Architecture (e.g. CPU, GPU, Standard)
 
# We can then book the DL method using the built option string
if useDL:
    useDLGPU = ROOT.gSystem.GetFromPipe("root-config --has-tmva-gpu") == "yes"
 
    # Define DNN layout
    # Define Training strategies
    # one can catenate several training strategies
    training1 = ROOT.TString(
        "LearningRate=1e-3,Momentum=0.9,"
        "ConvergenceSteps=10,BatchSize=128,TestRepetitions=1,"
        "MaxEpochs=20,WeightDecay=1e-4,Regularization=None,"
        "Optimizer=ADAM,ADAM_beta1=0.9,ADAM_beta2=0.999,ADAM_eps=1.E-7,"  # ADAM default parameters
        "DropConfig=0.0+0.0+0.0+0."
    )
    #   training2 = ROOT.TString("LearningRate=1e-3,Momentum=0.9"
    #                      "ConvergenceSteps=10,BatchSize=128,TestRepetitions=1,"
    #                       "MaxEpochs=20,WeightDecay=1e-4,Regularization=None,"
    #                       "Optimizer=SGD,DropConfig=0.0+0.0+0.0+0.")
 
    # General Options.
    dnnMethodName = ROOT.TString("DNN_CPU")
 
    if useDLGPU:
        arch = "GPU"
        dnnMethodName = "DNN_GPU"
    else:
        arch = "CPU"
 
    factory.BookMethod(
        loader,
        TMVA.Types.kDL,
        dnnMethodName,
        H=False,
        V=True,
        ErrorStrategy="CROSSENTROPY",
        VarTransform="G",
        WeightInitialization="XAVIER",
        InputLayout="1|1|7",
        BatchLayout="1|128|7",
        Layout="DENSE|64|TANH,DENSE|64|TANH,DENSE|64|TANH,DENSE|64|TANH,DENSE|1|LINEAR",
        TrainingStrategy=training1,
        Architecture=arch,
    )
 
# Keras DL
if useKeras:
    ROOT.Info("TMVA_Higgs_Classification", "Building Deep Learning keras model")
    # create Keras model with 4 layers of 64 units and relu activations
    from tensorflow.keras.layers import Dense
    from tensorflow.keras.models import Sequential
    from tensorflow.keras.optimizers import Adam
 
    model = Sequential()
    model.add(Dense(64, activation="relu", input_dim=7))
    model.add(Dense(64, activation="relu"))
    model.add(Dense(64, activation="relu"))
    model.add(Dense(64, activation="relu"))
    model.add(Dense(2, activation="sigmoid"))
    model.compile(loss="binary_crossentropy", optimizer=Adam(learning_rate=0.001), weighted_metrics=["accuracy"])
    model.save("model_higgs.keras")
    model.summary()
 
    if not os.path.exists("model_higgs.keras"):
        raise FileNotFoundError("Error creating Keras model file - skip using Keras")
    else:
        # book PyKeras method only if Keras model could be created
        ROOT.Info("TMVA_Higgs_Classification", "Booking Deep Learning keras model")
        factory.BookMethod(
            loader,
            TMVA.Types.kPyKeras,
            "PyKeras",
            H=True,
            V=False,
            VarTransform=None,
            FilenameModel="model_higgs.keras",
            FilenameTrainedModel="trained_model_higgs.keras",
            NumEpochs=20,
            BatchSize=100,
        )
#            GpuOptions="allow_growth=True",
#        )  # needed for RTX NVidia card and to avoid TF allocates all GPU memory
 
 
## Train Methods
 
# Here we train all the previously booked methods.
 
factory.TrainAllMethods()
## Test  all methods
 
# Now we test and evaluate all methods using the test data set
factory.TestAllMethods()
 
factory.EvaluateAllMethods()
 
# after we get the ROC curve and we display
 
c1 = factory.GetROCCurve(loader)
c1.Draw()
# at the end we close the output file which contains the evaluation result of all methods and it can be used by TMVAGUI
# to display additional plots
 
outputFile.Close()

Author: Harshal Shende

Definition in file TMVA_Higgs_Classification.py.