Nrichers/sc 15354/workflows fix notebook execution workflows by nrichers · Pull Request #1263 · validmind/documentation

nrichers · 2026-04-01T02:33:18Z

Pull Request Description

What and why?

This PR fixes the issue where notebook execution and LLM Markdown rendering fail because the generated template schema docs are referenced in an {{< include >}} statement but the file does not exist.

The solution:

- Allow writers to check in baseline template schema docs, so there is always a file to build regardless of where we're building docs
- Make template schema docs generation fail harder during CI workflows if the docs cannot be generated, as a failsafe to ensure we don't silently publish older baseline docs

The alternative solution would have been a comment marker of some kind that gets replaced with a file embed or a mostly blank template schema docs baseline, both with more downsides than the solution in this PR. Happy to talk through this, if necessary.

Changes:

- .gitignore: Removed _template-schema-generated.qmd from gitignore so baseline docs can be tracked
- README.md: Minor documentation updates
- scripts/generate_template_schema_docs.py: Modified with stricter validation to fail CI on generation errors
- site/Makefile: Changed pip to python -m pip for environment robustness
- _template-schema-generated.qmd: Added tracked baseline template schema docs
- templates/customize-document-templates.qmd: Removed comment that is no longer applicable
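
For anyone who wants a concrete picture of the "fail harder" behavior, here is a minimal sketch. It is not the actual contents of scripts/generate_template_schema_docs.py. The PR summary on this page mentions a minimum expected file size and an HTML structure check; the threshold value, the function name, and the choice to look for `<html>` tags are assumptions made for illustration.

```python
from typing import List

# Hypothetical threshold; the real script's minimum size is not shown in this PR.
MIN_EXPECTED_BYTES = 1024


def validate_generated_html(html: str, min_bytes: int = MIN_EXPECTED_BYTES) -> List[str]:
    """Return a list of problems; an empty list means the output looks sane."""
    problems = []
    size = len(html.encode("utf-8"))
    if size < min_bytes:
        problems.append(f"output is {size} bytes, expected at least {min_bytes}")
    lowered = html.lower()
    # Assumed structure check: the document should open and close an <html> element.
    if "<html" not in lowered or "</html>" not in lowered:
        problems.append("output is missing <html> or </html>")
    return problems


def exit_code_for(html: str, min_bytes: int = MIN_EXPECTED_BYTES) -> int:
    """Map validation problems to a process exit code for CI."""
    # A non-zero exit code is what makes the CI step fail instead of
    # silently publishing the tracked baseline.
    return 1 if validate_generated_html(html, min_bytes) else 0
```

In a CI step, the script would pass the value of `exit_code_for(...)` to `sys.exit`, so that a truncated or empty generation result stops the workflow.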

Fixes sc-15354

How to test

Verify that the workflows that fail in other PRs complete successfully:

Notebook execution workflow: https://github.com/validmind/documentation/actions/runs/23829214298/job/69458857557?pr=1263
Validate LLM markdown render step: https://github.com/validmind/documentation/actions/runs/23829214300/job/69458857537?pr=1263

Check that the template schema docs continue to be rendered: Customize email templates

What needs special review?

Dependencies, breaking changes, and deployment notes

Release notes

Checklist

scripts/generate_template_schema_docs.py

validbeck

@nrichers I don't think this fixed the issue (with the notebooks, at least) — the notebooks were NOT executed, they were skipped because no files were changed:

Let me pull test with a fake changed notebook file to see if it runs correctly.

nrichers · 2026-04-02T17:26:18Z

> Let me pull test with a fake changed notebook file to see if it runs correctly.

I am not that familiar with our notebook execution step, so I appreciate your extra testing here. The issue should be resolved, as there is now always a file to build for the {{< include >}}.

validbeck · 2026-04-02T17:29:11Z

> I am not that familiar with our notebook execution step, so I appreciate your extra testing here. The issue should be resolved, as there is now always a file to build for the {{< include >}}.

The notebook executions always run for staging and prod, and for PR previews it's a very easy filter for changed files in the notebooks/EXECUTED directory:

    # See if site/notebooks/ has updates
    # Checks the current PR branch against the target branch
    - name: Filter changed files
      uses: dorny/paths-filter@v2
      id: filter
      with:
        base: ${{ github.event.pull_request.base_ref }}
        ref: ${{ github.head_ref }}
        filters: |
          notebooks:
            - 'site/notebooks/EXECUTED/**'

(Just pointing out that again, just because something is documented doesn't make it transparent to the person that didn't do the designing/documenting. :p)
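
To make the skip behavior explicit, here is a rough sketch of the decision the filter step makes for PR previews. The real workflow relies on dorny/paths-filter and its glob matching; the plain prefix check below only approximates the `site/notebooks/EXECUTED/**` pattern, and the function name and sample file names are hypothetical.

```python
from typing import Iterable

# Directory watched by the filter step quoted above.
WATCHED_PREFIX = "site/notebooks/EXECUTED/"


def should_execute_notebooks(changed_files: Iterable[str],
                             watched_prefix: str = WATCHED_PREFIX) -> bool:
    """Return True if any changed file lives under the watched directory."""
    return any(path.startswith(watched_prefix) for path in changed_files)


# A PR that only touches scripts and docs: the notebook job is skipped.
docs_only = ["README.md", "scripts/generate_template_schema_docs.py"]

# The same PR plus a test notebook (hypothetical file name): the job runs.
with_notebook = docs_only + ["site/notebooks/EXECUTED/test-notebook.ipynb"]
```

With these inputs, `should_execute_notebooks(docs_only)` is `False` and `should_execute_notebooks(with_notebook)` is `True`, which matches why the first run skipped execution and why adding a test notebook triggers it.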

validbeck

Nice, it did work: https://github.com/validmind/documentation/actions/runs/23913198659/job/69740245958?pr=1263

Let me remove the test notebook before you merge.

validbeck

Actually, I just noticed this race condition:

The parallel render of the notebooks used to take longer than the full site render (which is why we pulled them out of the render and ran them parallel to begin with, to minimize runtime and prevent failures if the notebooks failed), meaning that the executed versions of the notebooks would replace the non-executed versions.

Now, the LLM markdown render step brings the runtime of the full render to longer than the execution, meaning ... the executed notebooks will always be replaced by the non-executed versions.

Can we look into chaining these steps into a separate workflow that only runs if the full site render is successful?

    - name: Install pandoc
      run: |
        sudo apt-get update
        sudo apt-get install -y pandoc

    - name: Validate LLM markdown render
      run: bash llm/render.sh && bash llm/clean.sh
      working-directory: site

nrichers · 2026-04-02T19:16:04Z

> Can we look into chaining these steps into a separate workflow that only runs if the full site render is successful?

Yes, but not in this PR. I added c7a6d11 to prevent our validate workflow from syncing the executed notebooks folder. If that works, I can update staging and prod workflows. Simplest, safest solution.
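
A toy model of the race and of the exclusion fix, for readers following along. This is a sketch under stated assumptions: a dict stands in for the S3 bucket, the file names are hypothetical, and Python's `fnmatch` is used as an approximation of how the sync's `--exclude` pattern is evaluated.

```python
import fnmatch
from typing import Dict, Optional

# Hypothetical outputs of the two parallel jobs.
SITE_RENDER = {"index.html": "site", "notebooks/EXECUTED/intro.html": "non-executed"}
NOTEBOOK_RUN = {"notebooks/EXECUTED/intro.html": "executed"}


def sync(bucket: Dict[str, str], files: Dict[str, str],
         exclude: Optional[str] = None) -> None:
    """Copy files into the bucket, skipping keys that match the exclude pattern."""
    for key, content in files.items():
        if exclude and fnmatch.fnmatch(key, exclude):
            continue
        bucket[key] = content


def publish(render_finishes_last: bool, exclude: Optional[str] = None) -> str:
    """Run both syncs in finish order and report which notebook version survives."""
    bucket: Dict[str, str] = {}
    jobs = [(NOTEBOOK_RUN, None), (SITE_RENDER, exclude)]
    if not render_finishes_last:
        jobs.reverse()
    for files, pattern in jobs:
        # Whichever job syncs last overwrites any keys it shares with the other.
        sync(bucket, files, pattern)
    return bucket["notebooks/EXECUTED/intro.html"]
```

Without an exclusion, `publish(True)` yields the non-executed notebook, which is the race described above. With `exclude="notebooks/EXECUTED/*"` on the site render's sync, the executed version survives regardless of which job finishes last.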

github-actions · 2026-04-02T19:27:42Z

Lighthouse check results

⚠️ WARN: Average accessibility score is 0.87 (required: >0.9) — Check the workflow run

Folder depth level checked: 0

Commit SHA: 41d20de

Modify the workflow to check a different depth:

0: Top-level navigation only (/index.html, /guide/guides.html, ...)
1: All first-level subdirectories (/guide/*.html, /developer/*.html, ...)
2: All second-level subdirectories (/guide/attestation/*.html, ...)

| Page | Accessibility | Performance | Best Practices | SEO |
|---|---|---|---|---|
| /developer/validmind-library.html | 0.85 | 0.68 | 1.00 | 0.82 |
| /get-started/get-started.html | 0.85 | 0.74 | 1.00 | 0.73 |
| /guide/guides.html | 0.81 | 0.68 | 1.00 | 0.82 |
| /index.html | 0.93 | 0.61 | 1.00 | 0.82 |
| /releases/all-releases.html | 0.86 | 0.69 | 1.00 | 0.73 |
| /support/support.html | 0.91 | 0.65 | 1.00 | 0.82 |
| /training/training.html | 0.85 | 0.62 | 0.96 | 0.73 |

github-actions · 2026-04-02T19:42:35Z

Execute training notebooks for PRs

✓ INFO: Live previews are available

validbeck

Looks like that worked — can you apply the same changes to staging & prod (and remove the test notebook)? 🙏🏻

nrichers · 2026-04-02T23:37:23Z

OK, applied the same --exclude directive for the executed notebooks when we sync to AWS S3 for staging and prod. Looks like we're done here, merging.

github-actions · 2026-04-02T23:38:19Z

PR Summary

This PR introduces several functional improvements in the documentation deployment and template schema generation processes:

GitHub workflow updates:
- In the production, staging, and PR preview deployment workflows, an additional exclusion pattern (notebooks/EXECUTED/*) is added to the AWS S3 sync commands. This ensures that unwanted notebook artifacts are not uploaded.
Template schema generation enhancements:
- The script responsible for generating the template schema documentation has been refactored (e.g., renaming the output variable from OUTPUT_FILE to TARGET_FILE).
- Additional sanity checks have been incorporated including a minimum expected file size and validation of HTML structure to catch generation failures early.
- Post-processing of the generated HTML is performed to remove unnecessary title and heading tags, and to clean up excessive blank lines before embedding into a Quarto document.
Documentation updates:
- The README and associated documentation file have been updated to reflect that the auto-generated template schema now overwrites the baseline output and is injected into the customize-document-templates.qmd file.
- The generated template schema file, which was previously ignored by Git, is now tracked and included in the documentation build, ensuring that users see the latest schema content.
Build and dependency adjustments:
- The Makefile now uses python -m pip install instead of plain pip install to improve environment consistency.

Together, these changes streamline the documentation deployment and ensure that the template schema documentation is reliably and accurately generated and updated whenever the backend JSON schema changes.
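
The post-processing step mentioned in the summary could be sketched as follows. This is an illustration only, not the script's actual code: the function name is hypothetical, regular expressions are an assumed technique (the real script may use an HTML parser), and it assumes that the title and h1 elements are removed together with their content.

```python
import re


def postprocess_html(html: str) -> str:
    """Strip <title> and <h1> elements and collapse excessive blank lines."""
    flags = re.IGNORECASE | re.DOTALL
    # Remove the <title> element, including its text.
    html = re.sub(r"<title\b[^>]*>.*?</title>", "", html, flags=flags)
    # Remove every <h1> element, including attributes and text.
    html = re.sub(r"<h1\b[^>]*>.*?</h1>", "", html, flags=flags)
    # Collapse three or more consecutive line breaks into a single blank line.
    html = re.sub(r"(?:[ \t]*\n){3,}", "\n\n", html)
    return html


sample = (
    "<html><head><title>Schema</title></head>"
    '<body><h1 id="t">Schema</h1>\n\n\n\n<p>Body</p></body></html>'
)
cleaned = postprocess_html(sample)
```

Stripping the title and top-level heading keeps the embedded fragment from competing with the heading structure of the Quarto page that includes it.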

Test Suggestions

Manually run the GitHub workflows (or use a staging branch) to verify that the S3 sync commands properly exclude the 'notebooks/EXECUTED/*' directory and that CloudFront invalidation is triggered as expected.
Execute the updated schema generation script to ensure that it produces an output file with a size above the defined minimum, contains valid HTML (i.e., includes both and tags), and that the post-processing (removal of title and h1 tags) functions correctly.
Confirm that the changes in the README and the inclusion of the generated file into customize-document-templates.qmd display the documentation correctly when rendered.

github-actions · 2026-04-02T23:59:15Z

Validate docs site

✓ INFO: A live preview of the docs site is available — Open the preview

nrichers added the internal label ("Not to be externalized in the release notes") Apr 1, 2026

Allow check-in of baseline template schema docs and make CI workflow fail harder if necessary

92c71e8

nrichers force-pushed the nrichers/sc-15354/workflows-fix-notebook-execution-workflows branch from b9e42ef to 92c71e8 on April 1, 2026 02:36

Fix README.md

4e1f603

nrichers requested review from nibalizer and validbeck April 1, 2026 03:16

nrichers commented Apr 1, 2026

scripts/generate_template_schema_docs.py (outdated, resolved)

nibalizer approved these changes Apr 1, 2026

nrichers commented Apr 2, 2026

scripts/generate_template_schema_docs.py (resolved)

validbeck reviewed Apr 2, 2026

validbeck self-requested a review April 2, 2026 18:02

validbeck approved these changes Apr 2, 2026

validbeck force-pushed the nrichers/sc-15354/workflows-fix-notebook-execution-workflows branch from 23cf169 to 4e1f603 on April 2, 2026 18:02

validbeck self-requested a review April 2, 2026 18:25

validbeck requested changes Apr 2, 2026

Exclude notebooks/EXECUTED/* from AWS S3 sync

c7a6d11

nrichers added 2 commits April 2, 2026 12:23

Another test notebook

9eac216

Move notebook into EXECUTED path

a6c6746

validbeck approved these changes Apr 2, 2026

nrichers added 2 commits April 2, 2026 16:34

Also --exclude executed notebooks on staging & prod

c793f28

Delete another test notebook

ea56c2b

Update scripts/generate_template_schema_docs.py

d070a40

nrichers merged commit 31541c2 into main Apr 3, 2026
6 of 7 checks passed

nrichers deleted the nrichers/sc-15354/workflows-fix-notebook-execution-workflows branch April 3, 2026 00:58

3 participants
