doc/v610/portfolio_8C_source.html

 /// \file
 /// \ingroup tutorial_quadp
 /// \notebook
 /// This macro shows in detail the use of the quadratic programming package quadp .
 /// Running this macro :
 ///
 /// ~~~{.cpp}
 ///     .x portfolio.C+
 /// ~~~
 ///
 /// or
 ///
 /// ~~~{.cpp}
 /// gSystem->Load("libQuadp");
 /// .L portFolio.C+; portfolio()
 /// ~~~
 ///
 /// Let's first review what we exactly mean by "quadratic programming" :
 ///
 /// We want to minimize the following objective function :
 ///
 ///   \f$ c^T x + ( 1/2 ) x^T Q x \f$  wrt. the vector \f$ x \f$
 ///
 ///  \f$ c \f$ is a vector and \f$ Q \f$ a symmetric positive definite matrix
 ///
 /// You might wonder what is so special about this objective which is quadratic in
 /// the unknowns, that can not be done by Minuit/Fumili . Well, we have in addition
 /// the following boundary conditions on \f$ x \f$:
 ///
 ///  \f[
 ///          A x =  b \\
 ///  clo \le  C x \le cup \\
 ///  xlo \le    x \le xup
 ///  \f]  where A and C are arbitrary matrices and the rest are vectors
 ///
 /// Not all these constraints have to be defined . Our example will only use \f$ xlo \f$,
 /// \f$ A \f$ and \f$ b \f$
 /// Still, this could be handled by a general non-linear minimizer like Minuit by introducing
 /// so-called "slack" variables . However, quadp is tailored to objective functions not more
 /// complex than being quadratic . This allows usage of solving techniques which are even
 /// stable for problems involving for instance 500 variables, 100 inequality conditions
 /// and 50 equality conditions .
 ///
 /// Enough said about quadratic programming, let's return to our example .
 /// Suppose, after a long day of doing physics, you have a look at your investments and
 /// realize that an early retirement is not possible, given the returns of your stocks .
 /// So what now ? ROOT to the rescue ...
 ///
 /// In 1990 Harry Markowitz was awarded the Nobel prize for economics: " his work provided new tools
 /// for weighing the risks and rewards of different investments and for valuing corporate stocks and bonds" .
 /// In plain English, he developed the tools to balance greed and fear, we want the maximum return
 /// with the minimum amount of risk. Our stock portfolio should be at the
 /// ["Efficient Frontier"](see http://www.riskglossary.com/articles/efficient_frontier.htm).
 /// To quantify better the risk we are willing to take, we define a utility function \f$ U(x) \f$. It describes
 /// as a function of our total assets \f$ x \f$, our "satisfaction" . A common choice is \f$ 1-exp(-k*x) \f$ (the reason for
 /// the exponent will be clear later)  . The parameter \f$ k \f$ is the risk-aversion factor . For small values of \f$ k \f$
 /// the satisfaction is small for small values of \f$ x \f$; by increasing \f$ x \f$ the satisfaction can still be increased
 /// significantly . For large values of \f$ k \f$, \f$ U(x) \f$ increases rapidly to 1, there is no increase in satisfaction
 /// for additional dollars earned .
 ///
 /// In summary :
 ///  - small \f$ k \f$ ==> risk-loving investor
 ///  - large \f$ k \f$ ==> risk-averse investor
 ///
 /// Suppose we have for nrStocks the historical daily returns \f$ r = closing_price(n) - closing_price(n-1) \f$.
 /// Define a vector \f$ x \f$ of length of \f$ nrStocks \f$, which contains the fraction of our money invested in
 /// each stock . We can calculate the average daily return \f$ z \f$ of our portfolio and its variance using
 /// the portfolio covariance Covar :
 ///
 ///  \f$ z = r^T x \f$  and \f$ var = x^T Covar x \f$
 ///
 /// Assuming that the daily returns have a Normal distribution, \f$ N(x) \f$, so will \f$ z \f$ with mean \f$ r^T x \f$
 /// and variance \f$ x^T Covar x \f$
 ///
 /// The expected value of the utility function is :
 ///
 ///  \f[
 /// E(u(x)) = Int (1-exp(-k*x) N(x) dx \\
 ///         = 1-exp(-k (r^T x - 0.5 k x^T Covar x) ) \\
 ///  \f]
 ///
 /// Its value is maximised by maximising \f$ r^T x -0.5 k x^T Covar x \f$
 /// under the condition \f$ sum (x_i) = 1 \f$, meaning we want all our money invested and
 /// \f$ x_i \ge 0 \f$, we can not "short" a stock
 ///
 /// For 10 stocks we got the historical daily data for Sep-2000 to Jun-2004:
 ///
 ///  - GE   : General Electric Co
 ///  - SUNW : Sun Microsystems Inc
 ///  - QCOM : Qualcomm Inc
 ///  - BRCM : Broadcom Corp
 ///  - TYC  : Tyco International Ltd
 ///  - IBM  : International Business Machines Corp
 ///  - AMAT : Applied Materials Inc
 ///  - C    : Citigroup Inc
 ///  - PFE  : Pfizer Inc
 ///  - HD   : Home Depot Inc
 ///
 /// We calculate the optimal portfolio for 2.0 and 10.0 .
 ///
 /// Food for thought :
 /// - We assumed that the stock returns have a Normal distribution . Check this assumption by
 ///   histogramming the stock returns !
 /// - We used for the expected return in the objective function, the flat average over a time
 ///   period . Investment firms will put significant resources in improving the return prediction .
 /// - If you want to trade significant number of shares, several other considerations have
 ///   to be taken into account :
 ///    +  If you are going to buy, you will drive the price up (so-called "slippage") .
 ///       This can be taken into account by adding terms to the objective
 ///       (Google for "slippage optimization")
 ///    +  FTC regulations might have to be added to the inequality constraints
 /// - Investment firms do not want to be exposed to the "market" as defined by a broad
 ///   index like the S&P and "hedge" this exposure away . A perfect hedge this can be added
 ///    as an equality constrain, otherwise add an inequality constrain .
 ///
 /// \macro_image
 /// \macro_output
 /// \macro_code
 ///
 /// \author Eddy Offermann

 #include "Riostream.h"
 #include "TCanvas.h"
 #include "TFile.h"
 #include "TMath.h"
 #include "TTree.h"
 #include "TArrayF.h"
 #include "TH1.h"
 #include "TF1.h"
 #include "TLegend.h"
 #include "TSystem.h"

 #include "TMatrixD.h"
 #include "TMatrixDSym.h"
 #include "TVectorD.h"
 #include "TQpProbDens.h"
 #include "TGondzioSolver.h"

 const Int_t nrStocks = 10;
 static const Char_t *stocks[] =
      {"GE","SUNW","QCOM","BRCM","TYC","IBM","AMAT","C","PFE","HD"};

 class TStockDaily  {
 public:
   Int_t fDate;
   Int_t fOpen;     // 100*open_price
   Int_t fHigh;     // 100*high_price
   Int_t fLow;      // 100*low_price
   Int_t fClose;    // 100*close_price
   Int_t fVol;
   Int_t fCloseAdj; // 100*close_price adjusted for splits and dividend

   TStockDaily() {
      fDate = fVol = fOpen = fHigh = fLow = fClose = fCloseAdj = 0;
   }
   virtual ~TStockDaily() {}

   ClassDef(TStockDaily,1)
 };

 //---------------------------------------------------------------------------
 Double_t RiskProfile(Double_t *x, Double_t *par) {
   Double_t riskFactor = par[0];
   return 1-TMath::Exp(-riskFactor*x[0]);
 }

 //---------------------------------------------------------------------------
 TArrayF &StockReturn(TFile *f,const TString &name,Int_t sDay,Int_t eDay)
 {
   TTree *tDaily = (TTree*)f->Get(name);
   TStockDaily *data = 0;
   tDaily->SetBranchAddress("daily",&data);
   TBranch *b_closeAdj = tDaily->GetBranch("fCloseAdj");
   TBranch *b_date     = tDaily->GetBranch("fDate");

   //read only the "adjusted close" branch for all entries
   const Int_t nrEntries = (Int_t)tDaily->GetEntries();
   TArrayF closeAdj(nrEntries);
   for (Int_t i = 0; i < nrEntries; i++) {
     b_date->GetEntry(i);
     b_closeAdj->GetEntry(i);
     if (data->fDate >= sDay && data->fDate <= eDay)
       closeAdj[i] = data->fCloseAdj/100.;
   }

   TArrayF *r = new TArrayF(nrEntries-1);
   for (Int_t i = 1; i < nrEntries; i++)
 //    (*r)[i-1] = closeAdj[i]-closeAdj[i-1];
     (*r)[i-1] = closeAdj[i]/closeAdj[i-1];

   return *r;
 }

 #ifndef __MAKECINT__
 //---------------------------------------------------------------------------
 TVectorD OptimalInvest(Double_t riskFactor,TVectorD r,TMatrixDSym Covar)
 {
 // what the quadratic programming package will do:
 //
 //  minimize    c^T x + ( 1/2 ) x^T Q x
 //  subject to                A x  = b
 //                    clo <=  C x <= cup
 //                    xlo <=    x <= xup
 // what we want :
 //
 //  maximize    c^T x - k ( 1/2 ) x^T Q x
 //  subject to        sum_x x_i = 1
 //                    0 <= x_i

   // We have nrStocks weights to determine,
   // 1 equality- and 0 inequality- equations (the simple square boundary
   // condition (xlo <= x <= xup) does not count)

   const Int_t nrVar     = nrStocks;
   const Int_t nrEqual   = 1;
   const Int_t nrInEqual = 0;

   // flip the sign of the objective function because we want to maximize
   TVectorD    c = -1.*r;
   TMatrixDSym Q = riskFactor*Covar;

   // equality equation
   TMatrixD A(nrEqual,nrVar); A = 1;
   TVectorD b(nrEqual); b = 1;

   // inequality equation
   //
   // - although not applicable in the current situation since nrInEqual = 0, one
   //   has to specify not only clo and cup but also an index vector iclo and icup,
   //   whose values are either 0 or 1 . If iclo[j] = 1, the lower boundary condition
   //   is active on x[j], etc. ...

   TMatrixD C   (nrInEqual,nrVar);
   TVectorD clo (nrInEqual);
   TVectorD cup (nrInEqual);
   TVectorD iclo(nrInEqual);
   TVectorD icup(nrInEqual);

   // simple square boundary condition : 0 <= x_i, so only xlo is relevant .
   // Like for clo and cup above, we have to define an index vector ixlo and ixup .
   // Since each variable has the lower boundary, we can set the whole vector
   // ixlo = 1

   TVectorD xlo (nrVar); xlo  = 0;
   TVectorD xup (nrVar); xup  = 0;
   TVectorD ixlo(nrVar); ixlo = 1;
   TVectorD ixup(nrVar); ixup = 0;

   // setup the quadratic programming problem . Since a small number of variables are
   // involved and "Q" has everywhere entries, we chose the dense version "TQpProbDens" .
   // In case of a sparse formulation, simply replace all "Dens" by "Sparse" below and
   // use TMatrixDSparse instead of TMatrixDSym and TMatrixD

   TQpProbDens *qp = new TQpProbDens(nrVar,nrEqual,nrInEqual);

   // stuff all the matrices/vectors defined above in the proper places

   TQpDataDens *prob = (TQpDataDens *)qp->MakeData(c,Q,xlo,ixlo,xup,ixup,A,b,C,clo,iclo,cup,icup);

   // setup the nrStock variables, vars->fX will contain the final solution

   TQpVar      *vars  = qp->MakeVariables(prob);
   TQpResidual *resid = qp->MakeResiduals(prob);

   // Now we have to choose the method of solving, either TGondzioSolver or TMehrotraSolver
   // The Gondzio method is more sophisticated and therefore numerically more involved
   // If one want the Mehrotra method, simply replace "Gondzio" by "Mehrotra" .

   TGondzioSolver *s = new TGondzioSolver(qp,prob);
   const Int_t status = s->Solve(prob,vars,resid);

   const TVectorD weight = vars->fX;

   delete qp; delete prob; delete vars; delete resid; delete s;
   if (status != 0) {
     cout << "Could not solve this problem." <<endl;
     return TVectorD(nrStocks);
   }

   return weight;
 }
 #endif

  //---------------------------------------------------------------------------
 void portfolio()
 {
    const Int_t sDay = 20000809;
    const Int_t eDay = 20040602;

    const char *fname = "stock.root";
    TFile *f = 0;
    if (!gSystem->AccessPathName(fname)) {
       f = TFile::Open(fname);
    } else if (!gSystem->AccessPathName(Form("%s/quadp/%s", TROOT::GetTutorialDir().Data(), fname))) {
       f = TFile::Open(Form("%s/quadp/%s", TROOT::GetTutorialDir().Data(), fname));
    } else {
       printf("accessing %s file from http://root.cern.ch/files\n",fname);
       f = TFile::Open(Form("http://root.cern.ch/files/%s",fname));
    }
    if (!f) return;

    TArrayF *data = new TArrayF[nrStocks];
    for (Int_t i = 0; i < nrStocks; i++) {
       const TString symbol = stocks[i];
       data[i] = StockReturn(f,symbol,sDay,eDay);
    }

    const Int_t nrData = data[0].GetSize();

    TVectorD r(nrStocks);
    for (Int_t i = 0; i < nrStocks; i++)
       r[i] = data[i].GetSum()/nrData;

    TMatrixDSym Covar(nrStocks);
    for (Int_t i = 0; i < nrStocks; i++) {
       for (Int_t j = 0; j <= i; j++) {
          Double_t sum = 0.;
          for (Int_t k = 0; k < nrData; k++) {
             sum += (data[i][k] - r[i]) * (data[j][k] - r[j]);
          }
          Covar(i,j) = Covar(j,i) = sum/nrData;
       }
    }

    const TVectorD weight1 = OptimalInvest(2.0,r,Covar);
    const TVectorD weight2 = OptimalInvest(10.,r,Covar);

    cout << "stock     daily  daily   w1     w2" <<endl;
    cout << "symb      return sdv              " <<endl;
    for (Int_t i = 0; i < nrStocks; i++)
       printf("%s\t: %.3f  %.3f  %.3f  %.3f\n",stocks[i],r[i],TMath::Sqrt(Covar[i][i]),weight1[i],weight2[i]);

    TCanvas *c1 = new TCanvas("c1","Portfolio Optimizations",10,10,800,900);
    c1->Divide(1,2);

    // utility function / risk profile

    c1->cd(1);
    gPad->SetGridx();
    gPad->SetGridy();

    TF1 *f1 = new TF1("f1",RiskProfile,0,2.5,1);
    f1->SetParameter(0,2.0);
    f1->SetLineColor(49);
    f1->Draw("AC");
    f1->GetHistogram()->SetXTitle("dollar");
    f1->GetHistogram()->SetYTitle("utility");
    f1->GetHistogram()->SetMinimum(0.0);
    f1->GetHistogram()->SetMaximum(1.0);
    TF1 *f2 = new TF1("f2",RiskProfile,0,2.5,1);
    f2->SetParameter(0,10.);
    f2->SetLineColor(50);
    f2->Draw("CSAME");

    TLegend *legend1 = new TLegend(0.50,0.65,0.70,0.82);
    legend1->AddEntry(f1,"1-exp(-2.0*x)","l");
    legend1->AddEntry(f2,"1-exp(-10.*x)","l");
    legend1->Draw();

    // vertical bar chart of portfolio distribution

    c1->cd(2);
    TH1F *h1 = new TH1F("h1","Portfolio Distribution",nrStocks,0,0);
    TH1F *h2 = new TH1F("h2","Portfolio Distribution",nrStocks,0,0);
    h1->SetStats(0);
    h1->SetFillColor(49);
    h2->SetFillColor(50);
    h1->SetBarWidth(0.45);
    h1->SetBarOffset(0.1);
    h2->SetBarWidth(0.4);
    h2->SetBarOffset(0.55);
    for (Int_t i = 0; i < nrStocks; i++) {
       h1->Fill(stocks[i],weight1[i]);
       h2->Fill(stocks[i],weight2[i]);
    }

    h1->Draw("BAR2 HIST");
    h2->Draw("BAR2SAME HIST");

    TLegend *legend2 = new TLegend(0.50,0.65,0.70,0.82);
    legend2->AddEntry(h1,"high risk","f");
    legend2->AddEntry(h2,"low  risk","f");
    legend2->Draw();
 }
TSystem::AccessPathName
virtual Bool_t AccessPathName(const char *path, EAccessMode mode=kFileExists)
Returns FALSE if one can access a file using the specified access mode.
Definition: TSystem.cxx:1272

par
double par[1]
Definition: unuranDistr.cxx:38

TH1::SetBarOffset
virtual void SetBarOffset(Float_t offset=0.25)
Definition: TH1.h:337

TH1::Fill
virtual Int_t Fill(Double_t x)
Increment bin with abscissa X by 1.
Definition: TH1.cxx:3126

sum
static long int sum(long int i)
Definition: Factory.cxx:2162

TH1::SetMaximum
virtual void SetMaximum(Double_t maximum=-1111)
Definition: TH1.h:375

TLegend
This class displays a legend box (TPaveText) containing several legend entries.
Definition: TLegend.h:23

TROOT::GetTutorialDir
static const TString & GetTutorialDir()
Get the tutorials directory in the installation. Static utility function.
Definition: TROOT.cxx:2865

c1
return c1
Definition: legend1.C:41

TQpResidual
Definition: TQpResidual.h:61

TQpProbDens.h

TLegend::Draw
virtual void Draw(Option_t *option="")
Draw this legend with its current attributes.
Definition: TLegend.cxx:452

ROOT::Experimental::TH1F
THist< 1, float, THistStatContent, THistStatUncertainty > TH1F
Definition: THist.hxx:311

TTree.h

TFile
A ROOT file is a suite of consecutive data records (TKey instances) with a well defined format...
Definition: TFile.h:46

TCanvas::cd
TVirtualPad * cd(Int_t subpadnumber=0)
Set current canvas & pad.
Definition: TCanvas.cxx:679

RegressionKeras.name
name
Definition: RegressionKeras.py:30

TVectorT
TVectorT.
Definition: TMatrixTBase.h:77

TDirectoryFile::Get
virtual TObject * Get(const char *namecycle)
Return pointer to object identified by namecycle.
Definition: TDirectoryFile.cxx:887

TH1::SetMinimum
virtual void SetMinimum(Double_t minimum=-1111)
Definition: TH1.h:376

TFile.h

ApplicationClassificationKeras.data
data
Definition: ApplicationClassificationKeras.py:17

TString
Basic string class.
Definition: TString.h:129

TH1F
tomato 1-D histogram with a float per channel (see TH1 documentation)}
Definition: TH1.h:551

TArrayF
Array of floats (32 bits per element).
Definition: TArrayF.h:27

TQpProbDens::MakeResiduals
virtual TQpResidual * MakeResiduals(const TQpDataBase *data)
Setup the residuals.
Definition: TQpProbDens.cxx:172

Int_t
int Int_t
Definition: RtypesCore.h:41

TH1::SetYTitle
virtual void SetYTitle(const char *title)
Definition: TH1.h:390

TF1::Draw
virtual void Draw(Option_t *option="")
Draw this function with its current attributes.
Definition: TF1.cxx:1087

TGondzioSolver.h

TQpVar::fX
TVectorD fX
Definition: TQpVar.h:91

ROOT::Math::Cephes::A
static double A[]
Definition: SpecFuncCephes.cxx:170

TGondzioSolver
Definition: TGondzioSolver.h:56

TH1::SetBarWidth
virtual void SetBarWidth(Float_t width=0.5)
Definition: TH1.h:338

TMatrixT
TMatrixT.
Definition: TMatrixDfwd.h:22

TFile::Open
static TFile * Open(const char *name, Option_t *option="", const char *ftitle="", Int_t compress=1, Int_t netopt=0)
Create / open a file.
Definition: TFile.cxx:3909

TVectorD.h

x
Double_t x[n]
Definition: legend1.C:17

ClassDef
#define ClassDef(name, id)
Definition: Rtypes.h:297

TTree::SetBranchAddress
virtual Int_t SetBranchAddress(const char *bname, void *add, TBranch **ptr=0)
Change branch address, dealing with clone trees properly.
Definition: TTree.cxx:7873

TSystem.h

Data
std::vector< std::vector< double > > Data
Definition: ParallelTest.cxx:55

TMatrixDSym.h

TCanvas.h

h1
TH1F * h1
Definition: legend1.C:5

TMatrixTSym< Double_t >

TQpDataDens
Definition: TQpDataDens.h:62

TVectorD
TVectorT< Double_t > TVectorD
Definition: TVectorDfwd.h:22

TTree::GetBranch
virtual TBranch * GetBranch(const char *name)
Return pointer to the branch with the given name in this tree or its friends.
Definition: TTree.cxx:4979

TAttLine::SetLineColor
virtual void SetLineColor(Color_t lcolor)
Set the line color.
Definition: TAttLine.h:40

TLegend.h

TGondzioSolver::Solve
virtual Int_t Solve(TQpDataBase *prob, TQpVar *iterate, TQpResidual *resid)
Solve the quadratic programming problem as formulated through prob, store the final solution in itera...
Definition: TGondzioSolver.cxx:129

ROOT::Math::Cephes::C
static double C[]
Definition: SpecFuncCephes.cxx:187

r
TRandom2 r(17)

gSystem
R__EXTERN TSystem * gSystem
Definition: TSystem.h:539

TH1::Draw
virtual void Draw(Option_t *option="")
Draw this histogram with options.
Definition: TH1.cxx:2851

TArray::GetSize
Int_t GetSize() const
Definition: TArray.h:47

TAttFill::SetFillColor
virtual void SetFillColor(Color_t fcolor)
Set the fill area color.
Definition: TAttFill.h:37

Form
char * Form(const char *fmt,...)

TBranch::GetEntry
virtual Int_t GetEntry(Long64_t entry=0, Int_t getall=0)
Read all leaves of entry and return total number of bytes read.
Definition: TBranch.cxx:1291

Riostream.h

TCanvas
The Canvas class.
Definition: TCanvas.h:31

TF1.h

TMath::Exp
Double_t Exp(Double_t x)
Definition: TMath.h:622

f
double f(double x)
Definition: testIntegration.cxx:12

Double_t
double Double_t
Definition: RtypesCore.h:55

TLegend::AddEntry
TLegendEntry * AddEntry(const TObject *obj, const char *label="", Option_t *option="lpf")
Add a new entry to this legend.
Definition: TLegend.cxx:359

TTree::GetEntries
virtual Long64_t GetEntries() const
Definition: TTree.h:381

Char_t
char Char_t
Definition: RtypesCore.h:29

TH1::SetXTitle
virtual void SetXTitle(const char *title)
Definition: TH1.h:389

TMatrixD.h

TQpVar
Definition: TQpVar.h:59

TPad::Divide
virtual void Divide(Int_t nx=1, Int_t ny=1, Float_t xmargin=0.01, Float_t ymargin=0.01, Int_t color=0)
Automatic pad generation by division.
Definition: TPad.cxx:1135

f2
double f2(const double *x)
Definition: testIntegration.cxx:16

TQpProbDens
Definition: TQpProbDens.h:60

TF1
1-Dim function class
Definition: TF1.h:150

f1
TF1 * f1
Definition: legend1.C:11

b
you should not use this method at all Int_t Int_t Double_t Double_t Double_t Int_t Double_t Double_t Double_t Double_t b
Definition: TRolke.cxx:630

TH1.h

gPad
#define gPad
Definition: TVirtualPad.h:284

TTree
A TTree object has a header with a name and a title.
Definition: TTree.h:78

TF1::GetHistogram
virtual TH1 * GetHistogram() const
Return a pointer to the histogram used to visualise the function.
Definition: TF1.cxx:1311

TF1::SetParameter
virtual void SetParameter(Int_t param, Double_t value)
Definition: TF1.h:578

TMath::Sqrt
Double_t Sqrt(Double_t x)
Definition: TMath.h:591

TQpProbDens::MakeData
virtual TQpDataBase * MakeData(Double_t *c, Double_t *Q, Double_t *xlo, Bool_t *ixlo, Double_t *xup, Bool_t *ixup, Double_t *A, Double_t *bA, Double_t *C, Double_t *clo, Bool_t *iclo, Double_t *cup, Bool_t *icup)
Setup the data.
Definition: TQpProbDens.cxx:80

TMath.h

TBranch
A TTree is a list of TBranches.
Definition: TBranch.h:57

TArrayF.h

TH1::SetStats
virtual void SetStats(Bool_t stats=kTRUE)
Set statistics option on/off.
Definition: TH1.cxx:8103

ROOT::Math::Cephes::Q
static double Q[]
Definition: SpecFuncCephes.cxx:294

TQpProbDens::MakeVariables
virtual TQpVar * MakeVariables(const TQpDataBase *data)
Setup the variables.
Definition: TQpProbDens.cxx:182