Skip to content

Add Glue doc to Cloud#363

Merged
kbatuigas merged 7 commits intomainfrom
DOC-1377-cloud-aws-glue
Aug 5, 2025
Merged

Add Glue doc to Cloud#363
kbatuigas merged 7 commits intomainfrom
DOC-1377-cloud-aws-glue

Conversation

@kbatuigas
Copy link
Copy Markdown
Contributor

Description

This pull request introduces new documentation and updates to support AWS Glue integration for Iceberg in Redpanda. The changes include adding a new branch to the Antora playbook, updating the navigation structure, and creating new documentation pages for AWS Glue and REST catalog integration.

Documentation updates for AWS Glue and REST Catalog integration:

  • Antora playbook update:

    • Added the branch DOC-1377-document-feature-aws-glue-support-for-iceberg to the documentation repository in local-antora-playbook.yml. This ensures the new AWS Glue-related content is included in the documentation build.
  • Navigation structure update:

    • Updated modules/ROOT/nav.adoc to add links for AWS Glue, Databricks Unity Catalog, and Snowflake/Open Catalog under a new "REST Catalog" section. Removed duplicate entries for Databricks Unity Catalog and Snowflake/Open Catalog from the previous structure.
  • New AWS Glue documentation:

    • Added a new page, iceberg-topics-aws-glue.adoc, describing how to query Iceberg topics using AWS Glue. This includes metadata such as categories and a beta flag.
  • New REST Catalog documentation:

    • Added an index page, rest-catalog/index.adoc, to document integrating Redpanda topics with managed Iceberg REST Catalogs.

Resolves https://github.com/redpanda-data/documentation-private/issues/
Review deadline:

Page previews

Checks

  • New feature
  • Content gap
  • Support Follow-up
  • Small fix (typos, links, copyedits, etc)

@netlify
Copy link
Copy Markdown

netlify bot commented Jul 22, 2025

Deploy Preview for rp-cloud ready!

Name Link
🔨 Latest commit 6b63556
🔍 Latest deploy log https://app.netlify.com/projects/rp-cloud/deploys/689290911476d2000848a4a8
😎 Deploy Preview https://deploy-preview-363--rp-cloud.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai bot commented Jul 22, 2025

Important

Review skipped

Auto incremental reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

📝 Walkthrough

Walkthrough

This change updates the documentation navigation and content for Iceberg integrations. The navigation structure under the "Iceberg" section is reorganized to add a new "rest-catalog" subsection. Three pages—AWS Glue, Databricks Unity Catalog, and Snowflake/Open Catalog—are now nested under this new subsection. A new documentation page for AWS Glue integration is introduced, and a placeholder index page for REST Catalog integrations is added. No changes to exported or public code entities are present; all changes are limited to documentation structure and content.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant DocsNav
    participant IcebergSection
    participant RestCatalogSection

    User->>DocsNav: Access "Iceberg" in navigation
    DocsNav->>IcebergSection: Display Iceberg options
    IcebergSection->>RestCatalogSection: Show "REST Catalog" subsection
    RestCatalogSection->>User: List AWS Glue, Databricks Unity, Snowflake/Open Catalog docs
Loading

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

Assessment against linked issues

Objective Addressed Explanation
Document feature: AWS Glue Support for Iceberg (DOC-1377)

Assessment against linked issues: Out-of-scope changes

No out-of-scope changes found.

Possibly related PRs

Suggested reviewers

  • paulohtb6
  • Feediver1
✨ Finishing Touches
🧪 Generate unit tests
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch DOC-1377-cloud-aws-glue

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Explain this complex logic.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai explain this code block.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and explain its main purpose.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR.
  • @coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
  • @coderabbitai generate unit tests to generate unit tests for this PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

@kbatuigas kbatuigas changed the title Add Glue pages to Cloud Add Glue doc to Cloud Jul 22, 2025
@kbatuigas kbatuigas force-pushed the DOC-1377-cloud-aws-glue branch from 4811b96 to ecb3c8f Compare August 1, 2025 16:53
@kbatuigas kbatuigas marked this pull request as ready for review August 1, 2025 16:54
@kbatuigas kbatuigas requested a review from a team as a code owner August 1, 2025 16:54
Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Nitpick comments (3)
modules/manage/pages/iceberg/rest-catalog/index.adoc (1)

1-3: Missing common page attributes

Consider adding frequently-used, discoverability-related attributes (:page-aliases:, :page-categories:, :page-beta: if applicable) to keep this index page consistent with other Iceberg index pages.

modules/manage/pages/iceberg/iceberg-topics-aws-glue.adoc (1)

3-4: Category list is unusually long

The :page-categories: line spans many orthogonal topics. Consider trimming to the most relevant ones (for example Iceberg, Integration, Tiered Storage) to avoid diluting search filters.

modules/ROOT/nav.adoc (1)

418-421: Minor naming consistency

To match other navigation labels (for example “Use Iceberg Catalogs”), consider adding explicit labels:

-*** xref:manage:iceberg/rest-catalog/index.adoc[]
+*** xref:manage:iceberg/rest-catalog/index.adoc[REST Catalogs]
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 6a4e126 and ecb3c8f.

📒 Files selected for processing (3)
  • modules/ROOT/nav.adoc (1 hunks)
  • modules/manage/pages/iceberg/iceberg-topics-aws-glue.adoc (1 hunks)
  • modules/manage/pages/iceberg/rest-catalog/index.adoc (1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: Redirect rules - rp-cloud
  • GitHub Check: Header rules - rp-cloud
  • GitHub Check: Pages changed - rp-cloud
🔇 Additional comments (1)
modules/ROOT/nav.adoc (1)

418-421: No duplicate nav entries found

I ran a grep across modules/ROOT/nav.adoc and only the intended nested references at lines 420–421 appear. There are no other occurrences of these xrefs, so the legacy links have been fully removed.

Comment on lines +1 to +6
= Query Iceberg Topics using AWS Glue
:description: Add Redpanda topics as Iceberg tables that you can query from AWS Glue Data Catalog.
:page-categories: Iceberg, Tiered Storage, Management, High Availability, Data Replication, Integration
:page-beta: true

include::ROOT:manage:iceberg/iceberg-topics-aws-glue.adoc[tag=single-source] No newline at end of file
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Verification agent

❓ Verification inconclusive

Self-include may trigger a circular include or fail to resolve

include::ROOT:manage:iceberg/iceberg-topics-aws-glue.adoc[tag=single-source] points to the same file.
Unless the single-source tag is defined outside this wrapper (for example in a shared partial), Antora will either:

  1. recurse indefinitely, or
  2. fail because the tag is not found.

Double-check the target file and tag. If the intent is to reuse content from another source module, update the path; otherwise embed the content directly.

-include::ROOT:manage:iceberg/iceberg-topics-aws-glue.adoc[tag=single-source]
+// TODO Verify include target; replace with correct path or inline content

🏁 Script executed:

#!/usr/bin/env bash
set -e
# Locate where the `single-source` tag is defined in the repo
rg -n "tag::single-source" -g "*.adoc"

Length of output: 47


Fix circular self-include in Asciidoc page
The include::ROOT:manage:iceberg/iceberg-topics-aws-glue.adoc[tag=single-source] directive in modules/manage/pages/iceberg/iceberg-topics-aws-glue.adoc references the same file and points at a single-source tag that isn’t defined anywhere. This will cause Antora to error or recurse indefinitely.

Please update the include to reference the correct external partial (or define the tag in a shared file), or inline the content directly.

• File: modules/manage/pages/iceberg/iceberg-topics-aws-glue.adoc
• Lines: 1–6

Suggested diff:

- include::ROOT:manage:iceberg/iceberg-topics-aws-glue.adoc[tag=single-source]
+ // TODO: replace with correct include path or inline this content
+ // e.g. include::ROOT:partials/iceberg-single-source.adoc[tag=single-source]
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
= Query Iceberg Topics using AWS Glue
:description: Add Redpanda topics as Iceberg tables that you can query from AWS Glue Data Catalog.
:page-categories: Iceberg, Tiered Storage, Management, High Availability, Data Replication, Integration
:page-beta: true
include::ROOT:manage:iceberg/iceberg-topics-aws-glue.adoc[tag=single-source]
= Query Iceberg Topics using AWS Glue
:description: Add Redpanda topics as Iceberg tables that you can query from AWS Glue Data Catalog.
:page-categories: Iceberg, Tiered Storage, Management, High Availability, Data Replication, Integration
:page-beta: true
// TODO: replace with correct include path or inline this content
// e.g. include::ROOT:partials/iceberg-single-source.adoc[tag=single-source]
🤖 Prompt for AI Agents
In modules/manage/pages/iceberg/iceberg-topics-aws-glue.adoc lines 1 to 6, the
include directive references the same file with a tag 'single-source' that is
not defined, causing a circular self-include error. To fix this, either remove
the include directive and inline the intended content directly, or update the
include to point to a different external file that contains the 'single-source'
tag. Ensure the included content is valid and does not cause recursive
inclusion.

@kbatuigas kbatuigas marked this pull request as ready for review August 5, 2025 19:01

== Iceberg topics with AWS Glue

A new xref:manage:iceberg/iceberg-topics-aws-glue.adoc[integration with AWS Glue Data Catalog] allows you to add Redpanda topics as Iceberg tables in your data lakehouse. The AWS Glue catalog integration is available in BYOC clusters with Redpanda version 25.2 and later.
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
A new xref:manage:iceberg/iceberg-topics-aws-glue.adoc[integration with AWS Glue Data Catalog] allows you to add Redpanda topics as Iceberg tables in your data lakehouse. The AWS Glue catalog integration is available in BYOC clusters with Redpanda version 25.2 and later.
A new xref:manage:iceberg/iceberg-topics-aws-glue.adoc[integration with AWS Glue Data Catalog] allows you to add Redpanda topics as Iceberg tables in your data lakehouse. The AWS Glue catalog integration is available in BYOC clusters with Redpanda version 25.2 and later.

do customers know which version they're running ?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, that should be available via rpk or the Cloud UI.

Copy link
Copy Markdown
Contributor

@paulohtb6 paulohtb6 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@kbatuigas kbatuigas merged commit 36c7bcc into main Aug 5, 2025
5 checks passed
@kbatuigas kbatuigas deleted the DOC-1377-cloud-aws-glue branch August 5, 2025 23:27
micheleRP added a commit that referenced this pull request Aug 7, 2025
* DOC-1435 Add gcloud commands for creating firewall rules for PSC consumer

* fix API docs similarly

* Add Glue doc to Cloud (#363)

* Add Glue pages to Cloud

* Sync with docs main

* Use docs branch with updated Cloud-specific guidance

* Apply suggestion from PM review

* Add What's New entry

* Auth configs still need to be allowlisted for 25.1 in Cloud

* Update local-antora-playbook.yml

* Update @redpanda-data/docs-extensions-and-macros (#378)

* docs: fix property alias (#379)

* Fix heading for AWS Glue entry (#380)

* minor edits

* Paul's review feedback

* copilot suggestions for consistency

* coderabbit suggestions

* move step into BYOVPC section

* DOC-1435 Add gcloud commands for creating firewall rules for PSC consumer

* fix API docs similarly

* minor edits

* Paul's review feedback

* copilot suggestions for consistency

* coderabbit suggestions

* move step into BYOVPC section

* fixes from coderabbit

* minor edit

* incorporate doc review suggestions

---------

Co-authored-by: Kat Batuigas <36839689+kbatuigas@users.noreply.github.com>
Co-authored-by: vbotbuildovich <62446873+vbotbuildovich@users.noreply.github.com>
Co-authored-by: Paulo Borges <paulohtb6@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants