Substrait Modifications in Gluten
Substrait is a project aiming to create a well-defined, cross-language specification for data compute operations. Since it is still under active development, there are some lacking representations for Gluten needed computing operations. At the same time, some existing representations need to be modified a bit to satisfy the needs of computing.
In Gluten, the base version of Substrait is v0.23.0
. This page records all the Gluten changes to Substrait proto files for reference. It is preferred to upstream these changes to Substrait, but for those cannot be upstreamed, alternatives like AdvancedExtension
could be considered.
Modifications to algebra.proto
- Added
JsonReadOptions
andTextReadOptions
inFileOrFiles
(#1584). - Changed join type
JOIN_TYPE_SEMI
toJOIN_TYPE_LEFT_SEMI
andJOIN_TYPE_RIGHT_SEMI
(#408). - Added
WindowRel
, addedcolumn_name
andwindow_type
inWindowFunction
, changedUnbounded
inWindowFunction
intoUnbounded_Preceding
andUnbounded_Following
, and added WindowType(#485). - Added
output_schema
in RelRoot(#1901). - Added
ExpandRel
(#1361). - Added
GenerateRel
(#574). - Added
PartitionColumn
inLocalFiles
(#2405). - Added
WriteRel
(#3690).