Use Range to limit the amount of data processed.
This tutorial shows how to express the concept of ranges when working with the RDataFrame.
import ROOT
def fill_tree(treeName, fileName):
df.Define("b1", "(int) rdfentry_")\
.Define("b2", "(float) rdfentry_ * rdfentry_").Snapshot(treeName, fileName)
fileName = "df006_ranges_py.root"
treeName = "myTree"
fill_tree(treeName, fileName)
c_all = d.Count()
d_0_30 = d.Range(30)
c_0_30 = d_0_30.Count()
d_15_end = d.Range(15, 0)
c_15_end = d_15_end.Count()
d_15_end_3 = d.Range(15, 0, 3)
c_15_end_3 = d_15_end_3.Count()
d_0_50 = d.Range(50)
c_0_50_odd_b1 = d_0_50.Filter("1 == b1 % 2").Count()
c_0_3_after_even_b1 = d.Filter(
"0 == b1 % 2").
Range(0, 3).Count()
print("Usage of ranges:")
print(" - All entries:", c_all.GetValue())
print(" - Entries from 0 to 30:", c_0_30.GetValue())
print(" - Entries from 15 onwards:", c_15_end.GetValue())
print(" - Entries from 15 onwards in steps of 3:", c_15_end_3.GetValue())
print(" - Entries from 0 to 50, odd only:", c_0_50_odd_b1.GetValue())
print(" - First three entries of all even entries:", c_0_3_after_even_b1.GetValue())
ROOT's RDataFrame offers a modern, high-level interface for analysis of data stored in TTree ,...
Usage of ranges:
- All entries: 100
- Entries from 0 to 30: 30
- Entries from 15 onwards: 85
- Entries from 15 onwards in steps of 3: 29
- Entries from 0 to 50, odd only: 25
- First three entries of all even entries: 3
- Date
- March 2017
- Author
- Danilo Piparo (CERN)
Definition in file df006_ranges.py.