Combine_cubes #540

VictorVerhaert · 2025-06-02T15:27:26Z

Adds a new experimental combine_cubes process that aims to take complexity away from merge_cubes.

The two processes can now each have a distinct use: one for performing operations that take two cubes as input, and one for merging data with a possible overlap.

Related issue: #280


* `filter_spatial`: Clarified that masking gets applied for the given geometries. Open-EO#469
* `filter_bbox`: Clarified that the bounding box is reprojected to the CRS of the spatial data cube dimensions if required.

---------

Co-authored-by: Stefaan Lippens <[email protected]>


* divide, ln, log, mod: Clarified behavior for 0 input / infinity results
* Trigonometric functions: Clarified that NaN is returned outside of their defined ranges and the output value range for some processes
* Clarified for various mathematical functions the defined input and output ranges. Mention that `NaN` is returned outside of the defined input range where possible.
* Remove NaN


* Add `export_collection`, `export_workspace`, `stac_update`; `save_results` returns the STAC resource instead of boolean `true` Open-EO/openeo-api#376
* Update stac_update/modify
* Added details about STAC support.
* Update meta/implementation.md

Co-authored-by: Matthias Mohr <[email protected]>

---------

Co-authored-by: Michele Claus <[email protected]>


m-mohr · 2025-06-02T16:12:30Z

Looks pretty good at first look.

I'm wondering whether the process parameter needs to be required?

Is this process meant to be merge cubes but only return the overlap or is this process something else? I feel the issue was asking for the first and the process is slightly different from it ("combine").
The combine term seems a bit too close to merge though if it's the first option. It would be great if we could more easily distinguish what the processes do. If we'd start from scratch, we may name them merge_cubes_union and merge_cubes_intersection, but that's too late. Maybe we could call this process intersect_cubes or so?

VictorVerhaert · 2025-06-03T07:51:03Z

The initial draft of this PR was indeed written with a forced overlap_resolver in mind, but not necessarily only the intersection. This is a use case I often come across, which indirectly leads to the overlap/intersection mode from the issue but goes a bit wider.
With combine_cubes you could implement an intersection, but it allows for more, e.g. a left join or A - B where A=0 if A == nodata. The main feature is that a process is forced on both inputs and the user handles no-data in the provided process.
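To make the "process is forced on both inputs" idea concrete, here is a minimal pure-Python sketch. It is not the proposed process definition: the helper `combine_pixels`, the two callbacks, and the use of `None` as the no-data marker are all assumptions made for illustration, and the cubes are reduced to flat lists of already aligned pixels.

```python
from typing import Callable, List, Optional

# Assumption: no-data is modeled as None in this toy example.
Value = Optional[float]
NODATA: Value = None


def combine_pixels(
    a: List[Value], b: List[Value], process: Callable[[Value, Value], Value]
) -> List[Value]:
    """Call `process` on every aligned pixel pair, no-data pixels included."""
    if len(a) != len(b):
        raise ValueError("the two cubes must be aligned pixel by pixel")
    return [process(x, y) for x, y in zip(a, b)]


def a_minus_b(x: Value, y: Value) -> Value:
    """A - B where A counts as 0 if A is no-data; B being no-data stays no-data."""
    if y is NODATA:
        return NODATA
    return (0.0 if x is NODATA else x) - y


def overlap_only_sum(x: Value, y: Value) -> Value:
    """Intersection-style behavior: a result only where both inputs have data."""
    if x is NODATA or y is NODATA:
        return NODATA
    return x + y


cube_a = [5.0, NODATA, 2.0]
cube_b = [1.0, 3.0, NODATA]

difference = combine_pixels(cube_a, cube_b, a_minus_b)
overlap_sum = combine_pixels(cube_a, cube_b, overlap_only_sum)
```

The same driver yields either behavior; only the user-supplied callback decides what happens to no-data.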

As for the naming, perhaps it would be clearer to borrow the term binary_operation from math definitions. Pandas uses the terms merge and combine as well, which might make it intuitive for users, or we could use join as used in SQL.
Brainstorm:

apply_binary
apply_binary_cubes
apply_binary_cube_operation
apply_binary_process
apply_cubes (dangerous imo)
apply_on_overlap
apply_on_intersection
combine_binary_cubes
join_cubes (personally not a fan, but including it for the sake of brainstorming)
unite_cubes (ambiguous with merge_cubes)
conflate_cubes
fuse_cubes (ambiguous with merge_cubes)

soxofaan · 2025-06-03T07:51:11Z

> I'm wondering whether the process parameter needs to be required?

Yes, I think that's the core feature of this process:
the user wants to combine cube1 and cube2 with an explicit operation (add values, take the difference, calculate a ratio, ...).

It's like apply_dimension or reduce_dimension but instead of working on values along a dimension of a single cube, you work with values from two different cubes.
And to make that work, your cubes have to be "aligned" properly, which practically means we only work on the intersection and drop all the rest.
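As a rough illustration of why alignment implies working on the intersection, the sketch below models each cube as a dict from a dimension label to a value. The function name `combine_on_intersection` and the dict model are assumptions for this example only, not part of the proposal or of any client API.

```python
from typing import Callable, Dict

# Assumption: a cube is reduced to a mapping from a dimension label to a value.
Cube = Dict[str, float]


def combine_on_intersection(
    cube1: Cube, cube2: Cube, process: Callable[[float, float], float]
) -> Cube:
    """Apply `process` to values that share a label; drop all other labels."""
    shared_labels = cube1.keys() & cube2.keys()
    return {
        label: process(cube1[label], cube2[label])
        for label in sorted(shared_labels)
    }


cube1 = {"2025-01-01": 8.0, "2025-01-02": 6.0, "2025-01-03": 4.0}
cube2 = {"2025-01-02": 2.0, "2025-01-03": 1.0, "2025-01-04": 5.0}

# The ratio only exists where both cubes provide a value.
ratio = combine_on_intersection(cube1, cube2, lambda a, b: a / b)
```

Labels present in only one cube ("2025-01-01" and "2025-01-04" here) have no partner value for the operation, so they cannot appear in the result.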

> Is this process meant to be merge cubes but only return the overlap or is this process something else? I feel the issue was asking for the first and the process is slightly different from it ("combine").

So it's a bit the other way around: working on the intersection is the consequence of requiring an operation to combine the cubes

soxofaan · 2025-06-13T13:35:05Z

FYI: another user support issue that, after hours of digging, again turned out to be about unexpected results from merge_cubes with an unused overlap resolver:
https://forum.dataspace.copernicus.eu/t/inconsistent-rbr-pixel-values-using-openeo-sentinel-2-data/3804

VictorVerhaert · 2025-06-13T13:40:35Z

I was thinking about making it clear(er) in the description that band names should be equal as well (or the cubes should not have a band dimension), as I anticipate that users might overlook these labels.
The Python client implementation should also check this and raise a warning if there are no overlapping band names.
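A client-side check along these lines could look roughly like the sketch below. It uses only the standard `warnings` module; the function `check_band_overlap` is hypothetical and does not exist in the openEO Python client, and band labels are assumed to be plain strings.

```python
import warnings
from typing import List


def check_band_overlap(bands1: List[str], bands2: List[str]) -> List[str]:
    """Return the band labels both cubes share; warn if there are none.

    Hypothetical helper for illustration, not part of any existing client.
    """
    shared = [band for band in bands1 if band in bands2]
    if not shared:
        warnings.warn(
            "The cubes have no band names in common, so combining them "
            "would produce an empty result along the band dimension.",
            UserWarning,
        )
    return shared


shared = check_band_overlap(["B04", "B08"], ["B08", "B11"])
```

A warning rather than an exception keeps the check advisory, which matches the suggestion above; whether the backend should reject such a request is a separate question.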

clausmichele · 2025-11-06T16:10:45Z

Hello! This might be interesting for me too, did you implement it at VITO @VictorVerhaert @soxofaan ?
I'm facing strange issues with merge_cubes which might require a different process, see cloudinsar/s1-workflows#67

soxofaan · 2025-11-07T08:22:37Z

No, we didn't implement this yet.

If I understand your issue cloudinsar/s1-workflows#67 correctly, I think you still want the "classic" merge_cubes there, as you want to fuse partially incomplete cubes to get the "union".
combine_cubes is for doing calculations with values from the intersection of cubes, discarding everything outside of the intersection.
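The union versus intersection distinction can be sketched with the same kind of toy dict model. Both functions below are illustrative stand-ins written for this example, not the actual process implementations, and raising `ValueError` for a missing resolver is an assumption of the sketch.

```python
from typing import Callable, Dict, Optional

# Assumption: a cube is reduced to a mapping from a dimension label to a value.
Cube = Dict[str, float]
Resolver = Callable[[float, float], float]


def merge_union(
    cube1: Cube, cube2: Cube, overlap_resolver: Optional[Resolver] = None
) -> Cube:
    """Keep every label from both cubes; the resolver runs only on overlaps."""
    merged = dict(cube1)
    for label, value in cube2.items():
        if label in merged:
            if overlap_resolver is None:
                raise ValueError("cubes overlap but no overlap resolver given")
            merged[label] = overlap_resolver(merged[label], value)
        else:
            merged[label] = value
    return merged


def combine_intersection(cube1: Cube, cube2: Cube, process: Resolver) -> Cube:
    """Keep only labels present in both cubes and always apply the process."""
    return {
        label: process(cube1[label], cube2[label])
        for label in cube1
        if label in cube2
    }


partial1 = {"t1": 1.0, "t2": 2.0}
partial2 = {"t2": 10.0, "t3": 30.0}

union = merge_union(partial1, partial2, lambda a, b: a + b)
intersection = combine_intersection(partial1, partial2, lambda a, b: a + b)
```

If the two inputs share no labels, `merge_union` never calls the resolver and silently returns the plain union, which is the kind of surprise described in the forum thread linked above; `combine_intersection` would return an empty result instead.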

m-mohr and others added 28 commits May 25, 2023 13:30

clean-up

7b558ac

aggregate_spatial_window typo fix (Open-EO#446)

7a29378

Issue Open-EO#460 doc crossreferences between filter_bbox/filter_spat…

0833d4e

…ial/filter_vector (Open-EO#462)

Move tests to dev

c130dd7

Merge remote-tracking branch 'origin/draft' into draft

836a84b

Use x \ y instead of a \ b

c2d77e2

sqrt: Clarified that NaN is returned for negative numbers Open-EO#474…

13c3f85

… (Open-EO#475)

clip: Throw an exception if min > max Open-EO#472 (Open-EO#477)

4fd92b2

array_append: Added number type for labels to be consistent with …

ab4a62e

…other processes. Default to numerical index instead of string. (Open-EO#478)

between: Clarify that null is passed through

d8cf96a

eq and neq: Explicitly set the minimum value for the delta para…

899b824

…meter.

Clarify linear_scale_range

ab2e6c2

Added uniqueness contraints and clarified DimensionNotAvailable excep…

4274215

…tion in temporal aggregations

Remove unused exception from aggregate_temporal_period, clarified wee…

d5d0a18

…k definition

Renamed create_data_cube to create_cube. Open-EO#68

47b45d4

Update docgen

9363471

Update CI

469e06f

apply_polygon: rename polygons parameter to geometries (Open-EO…

a2e3780

…#511) (Open-EO#518)

Fix formatting

9532cb1

Fix cum* process descriptions and make ignore_nodata descriptions ali…

0390558

…gn better with the other reducers Open-EO#522

apply_polygon: datacube instead of raster-cube Open-EO#524 (Open-EO#525)

0100f9a

Simplify some enum types (Open-EO#529)

4422c60

Guidelines and processes to run OGC API - Processes / CWL / EOAP (Ope…

fd26251

…n-EO#520) * Implementation guidelines for EOAP Open-EO#507

Fix broken epsg href (Open-EO#539)

52fa980

added combine_cubes proposal

4448756

VictorVerhaert mentioned this pull request Jun 2, 2025

"Overlap-only" mode for merge_cubes #280

Open

m-mohr force-pushed the draft branch from a6b9196 to cd73c5d Compare July 16, 2025 18:44


5 participants
