Wrapper sink that coalesces cluster column page writes.
TODO(jblomer): The interplay between the derived class and RPageSink is not yet optimally designed for page storage wrapper classes like this one. For example, header and footer serialization is done twice. To be revised.
Definition at line 43 of file RPageSinkBuf.hxx.
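As an illustration of what "coalescing" column page writes can mean, here is a minimal, self-contained sketch of a buffering wrapper. All types in it (`Page`, `Sink`, `RecordingSink`, `BufferedSink`) and the helper `WriteInterleaved` are hypothetical stand-ins invented for this example; they are not the ROOT classes and do not mirror their signatures. The sketch assumes that the wrapper holds pages back per column and forwards them to the inner sink column by column when the cluster is committed.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified stand-ins; none of these are the ROOT types.
struct Page {
   std::size_t columnId;
   std::string payload;
};

class Sink {
public:
   virtual ~Sink() = default;
   virtual void CommitPage(const Page &page) = 0;
   virtual void CommitCluster() = 0;
};

// Inner sink that only records the column id of each page in arrival order.
class RecordingSink final : public Sink {
public:
   std::vector<std::size_t> fWrittenColumns;
   void CommitPage(const Page &page) final { fWrittenColumns.push_back(page.columnId); }
   void CommitCluster() final {}
};

// Wrapper that buffers pages per column and forwards them grouped by column
// when the cluster is committed.
class BufferedSink final : public Sink {
   std::unique_ptr<Sink> fInner;
   std::vector<std::vector<Page>> fBuffered; // indexed by column id

public:
   BufferedSink(std::unique_ptr<Sink> inner, std::size_t nColumns)
      : fInner(std::move(inner)), fBuffered(nColumns)
   {
   }

   void CommitPage(const Page &page) final { fBuffered.at(page.columnId).push_back(page); }

   void CommitCluster() final
   {
      for (auto &pages : fBuffered) {
         for (const auto &page : pages)
            fInner->CommitPage(page);
         pages.clear();
      }
      fInner->CommitCluster();
   }
};

// Commits pages of two columns in interleaved order and returns the order
// in which the inner sink received them.
std::vector<std::size_t> WriteInterleaved()
{
   auto inner = std::make_unique<RecordingSink>();
   RecordingSink *observer = inner.get(); // non-owning, valid while `sink` lives
   BufferedSink sink(std::move(inner), 2);
   sink.CommitPage({0, "a"});
   sink.CommitPage({1, "b"});
   sink.CommitPage({0, "c"});
   sink.CommitPage({1, "d"});
   sink.CommitCluster();
   return observer->fWrittenColumns;
}
```

In this sketch the pages are committed in the order 0, 1, 0, 1, but the inner sink receives them grouped by column, so pages of the same column end up adjacent in the output.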
Classes | |
class | RColumnBuf |
A buffered column. More... | |
struct | RCounters |
I/O performance counters that get registered in fMetrics. More... | |
Public Member Functions | |
RPageSinkBuf (const RPageSinkBuf &)=delete | |
RPageSinkBuf (RPageSinkBuf &&)=default | |
RPageSinkBuf (std::unique_ptr< RPageSink > inner) | |
~RPageSinkBuf () override | |
RNTupleMetrics & | GetMetrics () final |
Returns the default metrics object. Subclasses might alternatively provide their own metrics object by overriding this. | |
RPageSinkBuf & | operator= (const RPageSinkBuf &)=delete |
RPageSinkBuf & | operator= (RPageSinkBuf &&)=default |
void | ReleasePage (RPage &page) final |
Every page store needs to be able to free pages it handed out. | |
RPage | ReservePage (ColumnHandle_t columnHandle, std::size_t nElements) final |
Get a new, empty page for the given column that can be filled with up to nElements. | |
void | UpdateSchema (const RNTupleModelChangeset &changeset, NTupleSize_t firstEntry) final |
Incorporate incremental changes to the model into the ntuple descriptor. | |
Public Member Functions inherited from ROOT::Experimental::Detail::RPageSink | |
RPageSink (const RPageSink &)=delete | |
RPageSink (RPageSink &&)=default | |
RPageSink (std::string_view ntupleName, const RNTupleWriteOptions &options) | |
~RPageSink () override | |
ColumnHandle_t | AddColumn (DescriptorId_t fieldId, const RColumn &column) final |
Register a new column. | |
std::uint64_t | CommitCluster (NTupleSize_t nEntries) |
Finalize the current cluster and create a new one for the following data. | |
void | CommitClusterGroup () |
Write out the page locations (page list envelope) for all the committed clusters since the last call of CommitClusterGroup (or the beginning of writing). | |
void | CommitDataset () |
Finalize the current cluster and the entire data set. | |
void | CommitPage (ColumnHandle_t columnHandle, const RPage &page) |
Write a page to the storage. The column must have been added before. | |
void | CommitSealedPage (DescriptorId_t physicalColumnId, const RPageStorage::RSealedPage &sealedPage) |
Write a preprocessed page to storage. The column must have been added before. | |
void | CommitSealedPageV (std::span< RPageStorage::RSealedPageGroup > ranges) |
Write a vector of preprocessed pages to storage. The corresponding columns must have been added before. | |
void | Create (RNTupleModel &model) |
Physically creates the storage container to hold the ntuple (e.g., keys in a TFile or an S3 bucket). To do so, Create() calls CreateImpl() after updating the descriptor. | |
void | DropColumn (ColumnHandle_t) final |
Unregisters a column. | |
EPageStorageType | GetType () final |
Whether the concrete implementation is a sink or a source. | |
const RNTupleWriteOptions & | GetWriteOptions () const |
Returns the sink's write options. | |
RPageSink & | operator= (const RPageSink &)=delete |
RPageSink & | operator= (RPageSink &&)=default |
Public Member Functions inherited from ROOT::Experimental::Detail::RPageStorage | |
RPageStorage (const RPageStorage &other)=delete | |
RPageStorage (RPageStorage &&other)=default | |
RPageStorage (std::string_view name) | |
virtual | ~RPageStorage () |
const std::string & | GetNTupleName () const |
Returns the NTuple name. | |
RPageStorage & | operator= (const RPageStorage &other)=delete |
RPageStorage & | operator= (RPageStorage &&other)=default |
void | SetTaskScheduler (RTaskScheduler *taskScheduler) |
Protected Member Functions | |
RNTupleLocator | CommitClusterGroupImpl (unsigned char *serializedPageList, std::uint32_t length) final |
Returns the locator of the page list envelope of the given buffer that contains the serialized page list. | |
std::uint64_t | CommitClusterImpl (NTupleSize_t nEntries) final |
Returns the number of bytes written to storage (excluding metadata) | |
void | CommitDatasetImpl (unsigned char *serializedFooter, std::uint32_t length) final |
RNTupleLocator | CommitPageImpl (ColumnHandle_t columnHandle, const RPage &page) final |
RNTupleLocator | CommitSealedPageImpl (DescriptorId_t physicalColumnId, const RSealedPage &sealedPage) final |
void | CreateImpl (const RNTupleModel &model, unsigned char *serializedHeader, std::uint32_t length) final |
Protected Member Functions inherited from ROOT::Experimental::Detail::RPageSink | |
virtual std::vector< RNTupleLocator > | CommitSealedPageVImpl (std::span< RPageStorage::RSealedPageGroup > ranges) |
Vector commit of preprocessed pages. | |
void | EnableDefaultMetrics (const std::string &prefix) |
Enables the default set of metrics provided by RPageSink. | |
RSealedPage | SealPage (const RPage &page, const RColumnElementBase &element, int compressionSetting) |
Helper for streaming a page. | |
Protected Member Functions inherited from ROOT::Experimental::Detail::RPageStorage | |
void | WaitForAllTasks () |
Private Attributes | |
std::vector< RColumnBuf > | fBufferedColumns |
Vector of buffered column pages. Indexed by column id. | |
std::unique_ptr< RCounters > | fCounters |
std::unique_ptr< RNTupleModel > | fInnerModel |
The buffered page sink maintains a copy of the RNTupleModel for the inner sink. | |
std::unique_ptr< RPageSink > | fInnerSink |
The inner sink, responsible for actually performing I/O. | |
RNTupleMetrics | fMetrics |
Additional Inherited Members | |
Public Types inherited from ROOT::Experimental::Detail::RPageStorage | |
using | ColumnHandle_t = RColumnHandle |
The column handle identifies a column with the current open page storage. | |
using | SealedPageSequence_t = std::deque< RSealedPage > |
Static Public Member Functions inherited from ROOT::Experimental::Detail::RPageSink | |
static std::unique_ptr< RPageSink > | Create (std::string_view ntupleName, std::string_view location, const RNTupleWriteOptions &options=RNTupleWriteOptions()) |
Guess the concrete derived page sink from the file name (location). | |
Static Protected Member Functions inherited from ROOT::Experimental::Detail::RPageSink | |
static RSealedPage | SealPage (const RPage &page, const RColumnElementBase &element, int compressionSetting, void *buf) |
Seal a page using the provided buffer. | |
Protected Attributes inherited from ROOT::Experimental::Detail::RPageSink | |
std::unique_ptr< RNTupleCompressor > | fCompressor |
Helper to zip pages and header/footer; includes a 16MB (kMAXZIPBUF) zip buffer. | |
std::unique_ptr< RCounters > | fCounters |
RNTupleDescriptorBuilder | fDescriptorBuilder |
RNTupleMetrics | fMetrics |
std::uint64_t | fNextClusterInGroup = 0 |
Remembers the starting cluster id for the next cluster group. | |
std::vector< RClusterDescriptor::RColumnRange > | fOpenColumnRanges |
Keeps track of the number of elements in the currently open cluster. Indexed by column id. | |
std::vector< RClusterDescriptor::RPageRange > | fOpenPageRanges |
Keeps track of the written pages in the currently open cluster. Indexed by column id. | |
std::unique_ptr< RNTupleWriteOptions > | fOptions |
NTupleSize_t | fPrevClusterNEntries = 0 |
Used to calculate the number of entries in the current cluster. | |
Protected Attributes inherited from ROOT::Experimental::Detail::RPageStorage | |
std::string | fNTupleName |
RTaskScheduler * | fTaskScheduler = nullptr |
#include <ROOT/RPageSinkBuf.hxx>
|
explicit |
Definition at line 36 of file RPageSinkBuf.cxx.
|
delete |
|
default |
|
override |
Definition at line 48 of file RPageSinkBuf.cxx.
|
final protected virtual |
Returns the locator of the page list envelope of the given buffer that contains the serialized page list.
Typically, the implementation takes care of compressing and writing the provided buffer.
Implements ROOT::Experimental::Detail::RPageSink.
Definition at line 192 of file RPageSinkBuf.cxx.
|
final protected virtual |
Returns the number of bytes written to storage (excluding metadata)
Implements ROOT::Experimental::Detail::RPageSink.
Definition at line 147 of file RPageSinkBuf.cxx.
|
final protected virtual |
Implements ROOT::Experimental::Detail::RPageSink.
Definition at line 200 of file RPageSinkBuf.cxx.
|
final protected virtual |
Implements ROOT::Experimental::Detail::RPageSink.
Definition at line 104 of file RPageSinkBuf.cxx.
|
final protected virtual |
Implements ROOT::Experimental::Detail::RPageSink.
Definition at line 137 of file RPageSinkBuf.cxx.
|
final protected virtual |
Implements ROOT::Experimental::Detail::RPageSink.
Definition at line 56 of file RPageSinkBuf.cxx.
|
inline final virtual |
Returns the default metrics object. Subclasses might alternatively provide their own metrics object by overriding this.
Reimplemented from ROOT::Experimental::Detail::RPageSink.
Definition at line 154 of file RPageSinkBuf.hxx.
|
delete |
|
default |
|
final virtual |
Every page store needs to be able to free pages it handed out.
Sinks and sources, however, have different means of allocating pages.
Implements ROOT::Experimental::Detail::RPageStorage.
Definition at line 212 of file RPageSinkBuf.cxx.
|
final virtual |
Get a new, empty page for the given column that can be filled with up to nElements.
If nElements is zero, the page sink picks an appropriate size.
Implements ROOT::Experimental::Detail::RPageSink.
Definition at line 207 of file RPageSinkBuf.cxx.
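The reserve/release contract described above can be sketched as follows. This is a simplified illustration, not the ROOT implementation: `PageSketch`, `SinkSketch`, the extra `elementSize` parameter, and the 64 KiB default page size are all assumptions made for the example. It shows one plausible way a sink could pick a size when `nElements` is zero and how a page it handed out is returned to it.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>

// Hypothetical page type: a raw buffer plus its capacity in elements.
struct PageSketch {
   void *fBuffer = nullptr;
   std::size_t fElementSize = 0;
   std::size_t fMaxElements = 0;
};

// Hypothetical sink that allocates pages with malloc and frees them on release.
class SinkSketch {
   // Assumed default page size in bytes; the real value is not taken from ROOT.
   static constexpr std::size_t kDefaultPageBytes = 64 * 1024;

public:
   PageSketch ReservePage(std::size_t elementSize, std::size_t nElements)
   {
      PageSketch page;
      if (elementSize == 0)
         return page; // nothing sensible to allocate
      if (nElements == 0)
         nElements = kDefaultPageBytes / elementSize; // the sink picks the size
      page.fBuffer = std::malloc(elementSize * nElements);
      page.fElementSize = elementSize;
      page.fMaxElements = nElements;
      return page;
   }

   // Frees the buffer and resets the page so it cannot be released twice.
   void ReleasePage(PageSketch &page)
   {
      std::free(page.fBuffer);
      page = PageSketch{};
   }
};
```

Routing the release through the object that reserved the page keeps the allocation strategy private to that object, which is why a sink and a source can each use a different one.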
|
final virtual |
Incorporate incremental changes to the model into the ntuple descriptor.
This happens, e.g., if new fields were added after the initial call to RPageSink::Create(RNTupleModel &). firstEntry specifies the global index for the first stored element in the added columns.
Reimplemented from ROOT::Experimental::Detail::RPageSink.
Definition at line 64 of file RPageSinkBuf.cxx.
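To make the role of `firstEntry` concrete, the following sketch maps a global entry number to a position within the elements actually stored for a late-added column. `StoredElementIndex` is a hypothetical helper, not a ROOT function, and it assumes the simplest case of exactly one stored element per entry (which does not hold for collection columns).

```cpp
#include <cassert>
#include <cstddef>
#include <optional>

// Hypothetical helper, not part of ROOT. Assumes one stored element per entry.
// Returns the position of the entry's element among the elements stored for a
// column that was added when the dataset already held `firstEntry` entries.
std::optional<std::size_t> StoredElementIndex(std::size_t globalEntry, std::size_t firstEntry)
{
   if (globalEntry < firstEntry)
      return std::nullopt; // the column did not exist yet for this entry
   return globalEntry - firstEntry;
}
```

Under these assumptions, a column added with `firstEntry` equal to 10 has nothing stored for entries 0 through 9, and entry 12 maps to the third stored element (index 2).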
|
private |
Vector of buffered column pages. Indexed by column id.
Definition at line 132 of file RPageSinkBuf.hxx.
|
private |
Definition at line 124 of file RPageSinkBuf.hxx.
|
private |
The buffered page sink maintains a copy of the RNTupleModel for the inner sink.
For the unbuffered case, the RNTupleModel is instead managed by an RNTupleWriter.
Definition at line 130 of file RPageSinkBuf.hxx.
|
private |
The inner sink, responsible for actually performing I/O.
Definition at line 127 of file RPageSinkBuf.hxx.
|
private |
Definition at line 125 of file RPageSinkBuf.hxx.