RDataFrame
RDataFrame is a high-level data analysis interface provided by the ROOT framework. It offers a declarative, dataframe-style API for processing data stored in ROOT TTrees, enabling concise pipelines and scalable performance for large datasets.
The API emphasizes lazy evaluation and automatic parallelization. Users compose a sequence of operations such as
Key concepts and operations include filters to select events, Define to create new columns or derived quantities,
Under the hood, RDataFrame constructs an execution graph representing the pipeline and generates efficient code that
RDataFrame is part of ROOT’s data-analysis toolkit and is accessible from C++ and via PyROOT in Python,