Implemented Upper/lower for REE #16969
@rich-t-kid-datadog Is this ready? It looks fully implemented and has unit tests. I think this is a great way to start REE support, which I'm also interested in. If we merge this PR, I think we can uncomment parts of the tests in #16715 so that it isn't just merging a commented-out file.
    if value_index.data_type() == &DataType::Utf8 {
        case_conversion_run_array::<i32, _>(
            array,
            op,
            name,
            &run_index.data_type(),
        )
    } else if value_index.data_type() == &DataType::LargeUtf8 {
What about Utf8View? Thinking about it, it doesn't seem to make sense to have an REE array with a value data type of Utf8View. I haven't dug deep enough to verify this, though. I ask mostly for my own understanding.
Thank you for your contribution. Unfortunately, this pull request is stale because it has been open 60 days with no activity. Please remove the stale label or comment or this will be closed in 7 days.
Which issue does this PR close?
Work towards closing the REE epic.
Rationale for this change
Adding a RunEndEncoded branch in case_conversion allows REE arrays with strings as the value type to be converted to upper or lower case using the DataFusion UDFs.
What changes are included in this PR?
Allows the Lower/Upper UDFs to be called on run-end encoded arrays.
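To illustrate why case conversion suits run-end encoding, here is a minimal sketch that uses only the Rust standard library. It is not the code in this PR and does not use the arrow-rs RunArray API: the RunEncodedStrings type and its map_values and decode methods are hypothetical names made up for this example. The idea it shows is that the conversion only needs to touch the one stored value per run, while the run ends are carried over unchanged.

```rust
/// Hypothetical stand-in for a run-end encoded string array.
/// This is NOT the arrow-rs `RunArray` type; it only mirrors the layout.
#[derive(Debug, Clone, PartialEq)]
struct RunEncodedStrings {
    /// Exclusive logical end offset of each run, strictly increasing.
    run_ends: Vec<i32>,
    /// One value per run; `None` represents a null run.
    values: Vec<Option<String>>,
}

impl RunEncodedStrings {
    /// Applies `op` to each stored value once and reuses the run ends as-is.
    fn map_values<F: Fn(&str) -> String>(&self, op: F) -> Self {
        RunEncodedStrings {
            run_ends: self.run_ends.clone(),
            values: self
                .values
                .iter()
                .map(|v| v.as_deref().map(&op))
                .collect(),
        }
    }

    /// Expands the runs into the logical (decoded) sequence of values.
    fn decode(&self) -> Vec<Option<String>> {
        let mut out = Vec::new();
        let mut start = 0;
        for (end, value) in self.run_ends.iter().zip(&self.values) {
            for _ in start..*end {
                out.push(value.clone());
            }
            start = *end;
        }
        out
    }
}
```

With this sketch, calling map_values(|s| s.to_uppercase()) on an array with run ends [2, 3, 5] performs three string conversions rather than five, and the result keeps the same run ends.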
Are these changes tested?
Yes, both the upper and lower functions have tests attached for REE arrays.
Are there any user-facing changes?
This is an extensible change only. Users will see no changes unless they explicitly opt in to use the UDF with REE arrays.