Introduce e2e testing with testcontainers by gopidesupavan · Pull Request #54072 · apache/airflow

gopidesupavan · 2025-08-03T18:42:08Z

Introducing End to End testing to test against real environment with dags.

We can write all the possible scenario dags and test on real environment.
For now i have added a simple dag that triggers dag and validates xcom value.
Extend this to more complex scenario to validate.

My plan is it have a schedule run frequently similar to canary to validate all the tests thats under e2e.
edit: Default it runs in canary after prod image built.

We can utilise this one to test on RC releases likely provide input to image version and run all the tests. So once release manager cuts RC and released then he can trigger this test suite. IMHO very useful to test.

Add breeze support
Add documentation

^ Add meaningful description above
Read the Pull Request Guidelines for more information.
In case of fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in a newsfragment file, named {pr_number}.significant.rst or {issue_number}.significant.rst, in airflow-core/newsfragments.

gopidesupavan · 2025-08-03T20:35:55Z

Should we have e2e testing separate repo and keep all the tests there and uses them part of workflow? 🤔

potiuk · 2025-08-03T20:42:23Z

Should we have e2e testing separate repo and keep all the tests there and uses them part of workflow? 🤔

I think separate repo is generally a bad idea. Monorepo has it's drawbacks - but it has also benefits - for example much easier syncing of dependencie with workspace, keeping single contribuition workflow, being able to stop PR from merging when the tests fail etc. I can't see any real benefit of having separate repo for tests that should essentially be run as part of the same workflow as all other tests (especially when we have uv sync and workspaces.

gopidesupavan · 2025-08-03T20:50:49Z

Should we have e2e testing separate repo and keep all the tests there and uses them part of workflow? 🤔

I think separate repo is generally a bad idea. Monorepo has it's drawbacks - but it has also benefits - for example much easier syncing of dependencie with workspace, keeping single contribuition workflow, being able to stop PR from merging when the tests fail etc. I can't see any real benefit of having separate repo for tests that should essentially be run as part of the same workflow as all other tests (especially when we have uv sync and workspaces.

agree make sense :) do you think we should run these e2e tests part PR's, i am thinking of we can do it in schedule, i remember @kaxil was mentioning there were around 200+ test scenario combinations for xcoms itself so i think it would fit in schedule run? WDYT?

potiuk · 2025-08-03T20:53:32Z

for xcoms itself so i think it would fit in schedule run? WDYT?

a) we can improve selective tests to trigger a relevant subset of those
b) we can run full set in canary runs

gopidesupavan · 2025-08-03T21:07:31Z

for xcoms itself so i think it would fit in schedule run? WDYT?

a) we can improve selective tests to trigger a relevant subset of those b) we can run full set in canary runs

yes that also works good idea ;)

gopidesupavan · 2025-08-21T11:10:17Z

getting this one ready today, we should be able to catch more errors running actual dag scenarios, like the recent secrets issue.

gopidesupavan · 2025-08-21T11:47:56Z

@vatsrahul1001 am adding e2e test suite to CI and this can be triggered part of regular CI or after rc release it helps detecting any issues by running e2e tests.

Could you please help me what test suite you regularly run and the all the example dags you have setup for testing? i will add it to test suite.

@kaxil please add your thoughts on this if you have anything in mind to cover specifically combination of scenarios. i see you mentioned here around 254 combinations it would be good to know those :) #53130 (comment)

kaxil · 2025-08-21T12:35:39Z

@vatsrahul1001 am adding e2e test suite to CI and this can be triggered part of regular CI or after rc release it helps detecting any issues by running e2e tests.

Could you please help me what test suite you regularly run and the all the example dags you have setup for testing? i will add it to test suite.

@kaxil please add your thoughts on this if you have anything in mind to cover specifically combination of scenarios. i see you mentioned here around 254 combinations it would be good to know those :) #53130 (comment)

Hey @gopidesupavan 👋

It is

airflow/task-sdk/tests/task_sdk/execution_time/test_task_runner.py

Lines 1391 to 1426 in 00b3b14

    
               @pytest.mark.parametrize( 
        
                   "map_indexes", 
        
                   [ 
        
                       pytest.param(-1, id="not_mapped_index"), 
        
                       pytest.param(1, id="single_map_index"), 
        
                       pytest.param([0, 1], id="multiple_map_indexes"), 
        
                       pytest.param((0, 1), id="any_iterable_multi_indexes"), 
        
                       pytest.param(None, id="index_none"), 
        
                       pytest.param(NOTSET, id="index_not_set"), 
        
                   ], 
        
               ) 
        
               @pytest.mark.parametrize( 
        
                   "task_ids", 
        
                   [ 
        
                       pytest.param("push_task", id="single_task"), 
        
                       pytest.param(["push_task1", "push_task2"], id="tid_multiple_tasks"), 
        
                       pytest.param({"push_task1", "push_task2"}, id="tid_any_iterable"), 
        
                       pytest.param(None, id="tid_none"), 
        
                       pytest.param(NOTSET, id="tid_not_set"), 
        
                   ], 
        
               ) 
        
               @pytest.mark.parametrize( 
        
                   "xcom_values", 
        
                   [ 
        
                       pytest.param("hello", id="string_value"), 
        
                       pytest.param("'hello'", id="quoted_string_value"), 
        
                       pytest.param({"key": "value"}, id="json_value"), 
        
                       pytest.param([], id="empty_list_no_xcoms_found"), 
        
                       pytest.param((1, 2, 3), id="tuple_int_value"), 
        
                       pytest.param([1, 2, 3], id="list_int_value"), 
        
                       pytest.param(42, id="int_value"), 
        
                       pytest.param(True, id="boolean_value"), 
        
                       pytest.param(pd.DataFrame({"col1": [1, 2], "col2": [3, 4]}), id="dataframe_value"), 
        
                   ], 
        
               ) 
        
               def test_xcom_pull(

root@6cb799c71804:/opt/airflow# pytest task-sdk/tests/task_sdk/execution_time/test_task_runner.py::TestRuntimeTaskInstance::test_xcom_pull --collect-only

...
...
==== 270 tests collected in 2.69s ======

gopidesupavan · 2025-08-21T12:46:44Z

@vatsrahul1001 am adding e2e test suite to CI and this can be triggered part of regular CI or after rc release it helps detecting any issues by running e2e tests.
Could you please help me what test suite you regularly run and the all the example dags you have setup for testing? i will add it to test suite.
@kaxil please add your thoughts on this if you have anything in mind to cover specifically combination of scenarios. i see you mentioned here around 254 combinations it would be good to know those :) #53130 (comment)

Hey @gopidesupavan 👋

It is

airflow/task-sdk/tests/task_sdk/execution_time/test_task_runner.py

Lines 1391 to 1426 in 00b3b14

@pytest.mark.parametrize(

"map_indexes",

[

pytest.param(-1, id="not_mapped_index"),

pytest.param(1, id="single_map_index"),

pytest.param([0, 1], id="multiple_map_indexes"),

pytest.param((0, 1), id="any_iterable_multi_indexes"),

pytest.param(None, id="index_none"),

pytest.param(NOTSET, id="index_not_set"),

],

)

@pytest.mark.parametrize(

"task_ids",

[

pytest.param("push_task", id="single_task"),

pytest.param(["push_task1", "push_task2"], id="tid_multiple_tasks"),

pytest.param({"push_task1", "push_task2"}, id="tid_any_iterable"),

pytest.param(None, id="tid_none"),

pytest.param(NOTSET, id="tid_not_set"),

],

)

@pytest.mark.parametrize(

"xcom_values",

[

pytest.param("hello", id="string_value"),

pytest.param("'hello'", id="quoted_string_value"),

pytest.param({"key": "value"}, id="json_value"),

pytest.param([], id="empty_list_no_xcoms_found"),

pytest.param((1, 2, 3), id="tuple_int_value"),

pytest.param([1, 2, 3], id="list_int_value"),

pytest.param(42, id="int_value"),

pytest.param(True, id="boolean_value"),

pytest.param(pd.DataFrame({"col1": [1, 2], "col2": [3, 4]}), id="dataframe_value"),

],

)

def test_xcom_pull(
root@6cb799c71804:/opt/airflow# pytest task-sdk/tests/task_sdk/execution_time/test_task_runner.py::TestRuntimeTaskInstance::test_xcom_pull --collect-only

...
...
==== 270 tests collected in 2.69s ======

cool thanks will make these combinations to execute on real dag scenario.

vatsrahul1001 · 2025-08-22T04:36:44Z

@vatsrahul1001 am adding e2e test suite to CI and this can be triggered part of regular CI or after rc release it helps detecting any issues by running e2e tests.

Could you please help me what test suite you regularly run and the all the example dags you have setup for testing? i will add it to test suite.

@kaxil please add your thoughts on this if you have anything in mind to cover specifically combination of scenarios. i see you mentioned here around 254 combinations it would be good to know those :) #53130 (comment)

This is great work and will be very helpful during the RC testing phase! A good starting point could be running all the example DAGs that already exist, as they cover a wide range of features. From there, our team can also look into contributing additional DAGs that reflect some others common use cases.

potiuk · 2025-08-22T18:56:46Z

This is great work and will be very helpful during the RC testing phase! A good starting point could be running all the example DAGs that already exist, as they cover a wide range of features. From there, our team can also look into contributing additional DAGs that reflect some others common use cases.

Agreed :)

gopidesupavan · 2025-08-22T19:30:49Z

This is great work and will be very helpful during the RC testing phase! A good starting point could be running all the example DAGs that already exist, as they cover a wide range of features. From there, our team can also look into contributing additional DAGs that reflect some others common use cases.

Agreed :)

Yeah have already plan to include them part of this. working on them currently.

gopidesupavan · 2025-08-23T11:21:46Z

Oh github actions is not allowing some patterns in file paths.

With the provided path, there will be 352 files uploaded
Artifact name is valid!
Root directory input is valid!
Error: The path for one of the files in artifact is not valid: /dag_id=example_bash_decorator/run_id=manual__2025-08-23T10:27:47.958128+00:00/task_id=also_run_this/attempt=1.log. Contains the following character:  Colon :
          
Invalid characters include:  Double quote ", Colon :, Less than <, Greater than >, Vertical bar |, Asterisk *, Question mark ?, Carriage return \r, Line feed \n
          
The following characters are not allowed in files that are uploaded due to limitations with certain file systems such as NTFS. To maintain file system agnostic behavior, these characters are intentionally not allowed to prevent potential problems with downloads on different file systems.

gopidesupavan · 2025-08-25T21:16:44Z

Finally ready for review.

gopidesupavan · 2025-08-25T21:18:56Z

Would like to get this first pass simple and added basic test and test to trigger example dags. will extend this for further xcoms/connections etc;

@vatsrahul1001 can also extend on top of this, feel free to add your suite in this :)

potiuk

Fantastic job @gopidesupavan . Just some nit comments.

potiuk · 2025-09-03T07:36:00Z

Not sure if we intend to backport it, but It might be possible actually, so I left the label,.

gopidesupavan · 2025-09-11T11:56:49Z

Not sure if we intend to backport it, but It might be possible actually, so I left the label,.

🤔 yeah not sure, these test runs in canary anyway it hints if anything breaks.

…uild

gopidesupavan · 2025-09-11T19:36:58Z

Finally green :) merging it now.

github-actions · 2025-09-11T19:37:56Z

Backport failed to create: v3-0-test. View the failure log Run details

Status	Branch	Result
❌	v3-0-test

You can attempt to backport this manually by running:

cherry_picker 0954496 v3-0-test

This should apply the commit to the v3-0-test branch and leave the commit in conflict state marking
the files that need manual conflict resolution.

After you have resolved the conflicts, you can continue the backport process by running:

cherry_picker --continue

* Introduce e2e testing with testcontainers * Fix test command * Fix test command * Upload test report * Add option to trigger with workflow_dispatch * Add test to trigger example dags * Upload logs * Upload logs * zip logs * Fix example_bash_decorator file stat function * Add breeze commands and docs * Update breeze commands * Make docker-image-tag to empty and determine in conftest for canary build * Fix mnt writable

gopidesupavan requested review from ashb and potiuk as code owners August 3, 2025 18:42

boring-cyborg bot added area:dev-tools backport-to-v3-0-test labels Aug 3, 2025

gopidesupavan marked this pull request as draft August 3, 2025 18:43

gopidesupavan force-pushed the e2e-testing branch from 11c816b to c2f9cf9 Compare August 10, 2025 20:44

gopidesupavan force-pushed the e2e-testing branch from c2f9cf9 to 645ab4d Compare August 21, 2025 10:38

gopidesupavan force-pushed the e2e-testing branch 3 times, most recently from 28abb8b to b4c2b31 Compare August 23, 2025 09:53

gopidesupavan force-pushed the e2e-testing branch from 950d049 to bc8c21f Compare August 25, 2025 20:23

gopidesupavan marked this pull request as ready for review August 25, 2025 21:16

gopidesupavan requested review from amoghrajesh and jedcunningham as code owners August 25, 2025 21:16

gopidesupavan requested a review from kaxil August 25, 2025 21:19

potiuk reviewed Sep 3, 2025

View reviewed changes

Comment thread airflow-e2e-tests/tests/airflow_e2e_tests/e2e_test_utils/clients.py

potiuk approved these changes Sep 3, 2025

View reviewed changes

potiuk added backport-to-v3-0-test and removed backport-to-v3-0-test labels Sep 3, 2025

gopidesupavan force-pushed the e2e-testing branch from f7d358e to 8c2b2ac Compare September 11, 2025 11:58

This was referenced Sep 11, 2025

Add localstack Breeze integration #54050

Merged

Use httpbingo for example-dag-decorator #55512

Merged

gopidesupavan added 14 commits September 11, 2025 16:58

Introduce e2e testing with testcontainers

bfddb01

Fix test command

3486eb4

Fix test command

cf04704

Upload test report

6b287a1

Add option to trigger with workflow_dispatch

f0a1105

Add test to trigger example dags

3ddc5ce

Upload logs

4b14720

Upload logs

c0bfe73

zip logs

d71c0bb

Fix example_bash_decorator file stat function

9604f67

Add breeze commands and docs

04643b2

Update breeze commands

4c14504

Make docker-image-tag to empty and determine in conftest for canary b…

d74eb7e

…uild

Fix mnt writable

ad4eb1c

gopidesupavan force-pushed the e2e-testing branch from e761553 to ad4eb1c Compare September 11, 2025 15:58

gopidesupavan merged commit 0954496 into apache:main Sep 11, 2025
196 checks passed

gopidesupavan deleted the e2e-testing branch September 11, 2025 19:37

Conversation

gopidesupavan commented Aug 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gopidesupavan commented Aug 3, 2025

Uh oh!

potiuk commented Aug 3, 2025

Uh oh!

gopidesupavan commented Aug 3, 2025

Uh oh!

potiuk commented Aug 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gopidesupavan commented Aug 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gopidesupavan commented Aug 21, 2025

Uh oh!

gopidesupavan commented Aug 21, 2025

Uh oh!

kaxil commented Aug 21, 2025

Uh oh!

gopidesupavan commented Aug 21, 2025

Uh oh!

vatsrahul1001 commented Aug 22, 2025

Uh oh!

potiuk commented Aug 22, 2025

Uh oh!

gopidesupavan commented Aug 22, 2025

Uh oh!

gopidesupavan commented Aug 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gopidesupavan commented Aug 25, 2025

Uh oh!

gopidesupavan commented Aug 25, 2025

Uh oh!

Uh oh!

potiuk left a comment

Choose a reason for hiding this comment

Uh oh!

potiuk commented Sep 3, 2025

Uh oh!

gopidesupavan commented Sep 11, 2025

Uh oh!

gopidesupavan commented Sep 11, 2025

Uh oh!

Uh oh!

github-actions bot commented Sep 11, 2025

Backport failed to create: v3-0-test. View the failure log Run details

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

gopidesupavan commented Aug 3, 2025 •

edited

Loading

potiuk commented Aug 3, 2025 •

edited

Loading

gopidesupavan commented Aug 3, 2025 •

edited

Loading

gopidesupavan commented Aug 23, 2025 •

edited

Loading