[ENH] Simplified Publish API with Automatic Type Recognition by Omswastik-11 · Pull Request #1554 · openml/openml-python

Omswastik-11 · 2025-12-24T10:27:02Z

initially

from openml_sklearn.extension import SklearnExtension
from sklearn.neighbors import KNeighborsClassifier
clf = KNeighborsClassifier(n_neighbors=3)
extension = SklearnExtension()# User instantiates the extension object
knn_flow = extension.model_to_flow(clf) # User manually converts the model (estimator instance) to an OpenMLFlow object
knn_flow.publish()

API

from sklearn.neighbors import KNeighborsClassifier
import openml_sklearn  # Register the extension
import openml

clf = KNeighborsClassifier(n_neighbors=3)

openml.publish(clf)

examples/Basics/simple_flows_and_runs_tutorial.py

openml/__init__.py

fkiraly

I get this is a draft still, some early comments.

works for flows only, I would recommend to try for at least two different object types to see the dispatching challenge there.
do the extension checking inside publish and not in the usage example

Omswastik-11 · 2025-12-25T08:07:12Z

Thanks @fkiraly !!
I checked on flow , datset , task . it is working correctly but in run it is getting some server side issues.

Task 1 failed: https://test.openml.org/api/v1/xml/data/features/1 returned code 274: No features found. Additionally, dataset processed with error - None

jgyasu · 2025-12-31T10:11:19Z

The PR description is not entirely correct. This is how the interface looks currently:

from openml_sklearn.extension import SklearnExtension
from sklearn.neighbors import KNeighborsClassifier
clf = KNeighborsClassifier(n_neighbors=3)
extension = SklearnExtension()# User instantiates the extension object
knn_flow = extension.model_to_flow(clf) # User manually converts the model (estimator instance) to an OpenMLFlow object
knn_flow.publish()

But I like the idea of a unified publish. I am currently working on a design document for refactoring Extension and this design coincides mine as well, which is a good thing.

Omswastik-11 · 2025-12-31T13:11:48Z

The PR description is not entirely correct. This is how the interface looks currently:
from openml_sklearn.extension import SklearnExtension
from sklearn.neighbors import KNeighborsClassifier
clf = KNeighborsClassifier(n_neighbors=3)
extension = SklearnExtension()# User instantiates the extension object
knn_flow = extension.model_to_flow(clf) # User manually converts the model (estimator instance) to an OpenMLFlow object
knn_flow.publish()
But I like the idea of a unified publish. I am currently working on a design document for refactoring Extension and this design coincides mine as well, which is a good thing.

Thanks for the correction I used the syntax example used in example script . this unified publish was Franz's idea . https://github.com/gc-os-ai/openml-project-dev/issues/8

codecov-commenter · 2026-01-06T12:44:04Z

Codecov Report

❌ Patch coverage is 28.00000% with 18 lines in your changes missing coverage. Please review.
✅ Project coverage is 52.00%. Comparing base (7feb2a3) to head (54d92e2).

Files with missing lines	Patch %	Lines
openml/publish.py	25.00%	18 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1554      +/-   ##
==========================================
- Coverage   52.82%   52.00%   -0.83%     
==========================================
  Files          37       38       +1     
  Lines        4371     4396      +25     
==========================================
- Hits         2309     2286      -23     
- Misses       2062     2110      +48

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

openml/__init__.py

jgyasu · 2026-01-13T08:57:36Z

I have added some comments. I also feel we should not populate __init__.py with these functions, we can have them in a seperate file and use __init__.py only for imports.

for more information, see https://pre-commit.ci

Copilot

Pull request overview

Adds a new top-level openml.publish() helper to simplify publishing by automatically recognizing whether the input is an OpenML object or an external estimator handled by an installed extension (e.g., openml-sklearn).

Changes:

Introduces openml/publish.py with a unified publish(obj, name=..., tags=...) entry point and tag/name merging behavior.
Exposes publish from openml.__init__ and updates the basic flows/runs tutorial to demonstrate the simplified publishing path.
Adds unit tests covering publishing an OpenMLBase instance and publishing via the extension registry.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

File	Description
`openml/publish.py`	Implements the new unified publish API with automatic routing and tag/name handling.
`openml/__init__.py`	Exports `publish` at the package top-level and updates `__all__`.
`tests/test_openml/test_openml.py`	Adds tests validating publish behavior for OpenML objects and extension-backed models.
`examples/Basics/simple_flows_and_runs_tutorial.py`	Demonstrates automatic publishing vs manual flow construction in the tutorial.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-26T10:53:42Z

openml/__init__.py

+    "__version__",
+    "_api_calls",
    "_api_calls",


__all__ contains duplicated entries for __version__ and _api_calls. This is likely an accidental copy/paste and can cause confusing docs / from openml import * behavior. Remove the duplicate strings so each symbol appears only once in __all__.

Suggested change

"__version__",

"_api_calls",

"_api_calls",

"_api_calls",

Copilot · 2026-02-26T10:53:42Z

tests/test_openml/test_openml.py

+        model = object()
+        flow_id = openml.publish(model, name="n", tags=["x"])
+
+        get_ext_mock.assert_called_once_with(model, raise_if_no_extension=True)
+        ext_instance.model_to_flow.assert_called_once_with(model)
+        assert flow_mock.name == "n"
+        assert flow_mock.tags == ["x"]
+        flow_mock.publish.assert_called_once_with()
+        assert flow_id == "flow-id"


This test mocks flow.publish() to return a string and then asserts openml.publish(model, ...) returns that string (and names the variable flow_id). In the real codebase, OpenMLBase.publish() and OpenMLFlow.publish() return self, so openml.publish() should return the published flow object, not an ID. Update the test to reflect the actual return contract (e.g., have flow_mock.publish return flow_mock and assert the returned object is the flow).

Copilot · 2026-02-26T10:53:43Z

examples/Basics/simple_flows_and_runs_tutorial.py

@@ -77,6 +86,9 @@
 knn_flow.publish()
 print(f"knn_flow was published with the ID {knn_flow.flow_id}")

+# %% [markdown]
+# Now we'll use the auto-published flow to create and upload a run.
+


In this tutorial, both “Option A” and “Option B” are executed sequentially, which will upload/publish two flows and then overwrite knn_flow. The later text says it will use the auto-published flow, but the code that follows actually uses the manually created flow. Consider making only one option executable (or use different variables like knn_flow_auto/knn_flow_manual) to avoid double uploads and keep the narrative consistent.

geetu040

I am fine with most of the core logic, just have to double-check the extension work and tests.
There are some reviews by Copilot you would want to address.
I've left a comment for discussion.

geetu040 · 2026-02-26T13:00:21Z

openml/publish.py

+from .base import OpenMLBase
+
+
+def publish(obj: Any, *, name: str | None = None, tags: Sequence[str] | None = None) -> Any:


What's the reason for adding name and tags as a parameter to publish? I'm not necessarily against it, I would actually prefer it this way, but I am just curious about the motivation or use case behind this. Are there other parameters we can introduce like these? possibly the common attributes between resources that are used in _to_dict to create artifacts for upload.

improve publish api for users

0f21640

Omswastik-11 changed the title ~~[ENH] improve publish api for users~~ [ENH] Simplified Publish API with Automatic Type Recognition Dec 24, 2025

fkiraly reviewed Dec 24, 2025

View reviewed changes

examples/Basics/simple_flows_and_runs_tutorial.py Outdated Show resolved Hide resolved

fkiraly reviewed Dec 24, 2025

View reviewed changes

openml/__init__.py Outdated Show resolved Hide resolved

fkiraly requested changes Dec 24, 2025

View reviewed changes

Omswastik-11 added 3 commits December 25, 2025 13:20

improve doc-string

3b1d961

update __init__.py

3dfe34a

update examples

db36778

Omswastik-11 requested a review from fkiraly December 25, 2025 08:07

Omswastik-11 added 2 commits December 31, 2025 18:41

Merge branch 'main' into prototype-publish

8c600cb

Merge branch 'main' into prototype-publish

79bf2c2

Omswastik-11 marked this pull request as ready for review January 1, 2026 11:07

Merge branch 'main' into prototype-publish

8837954

Merge branch 'main' into prototype-publish

c904b1a

jgyasu suggested changes Jan 13, 2026

View reviewed changes

openml/__init__.py Outdated Show resolved Hide resolved

openml/__init__.py Outdated Show resolved Hide resolved

move publish func to a separate file

7242ee3

Omswastik-11 requested a review from jgyasu January 13, 2026 15:00

Omswastik-11 and others added 3 commits January 15, 2026 14:48

Merge branch 'main' into prototype-publish

6f141a5

[pre-commit.ci] auto fixes from pre-commit.com hooks

bdfa2cb

for more information, see https://pre-commit.ci

Merge branch 'main' into prototype-publish

88ac5fd

geetu040 assigned Omswastik-11 Jan 19, 2026

Omswastik-11 added 2 commits February 4, 2026 11:12

Merge branch 'main' into prototype-publish

76a4d3f

Merge branch 'main' into prototype-publish

54d92e2

Copilot AI review requested due to automatic review settings February 26, 2026 10:49

Copilot started reviewing on behalf of Omswastik-11 February 26, 2026 10:50 View session

Copilot AI reviewed Feb 26, 2026

View reviewed changes

This comment was marked as duplicate.

Sign in to view

geetu040 suggested changes Feb 26, 2026

View reviewed changes

		from .base import OpenMLBase


		def publish(obj: Any, *, name: str \| None = None, tags: Sequence[str] \| None = None) -> Any:

Uh oh!

Conversation

Omswastik-11 commented Dec 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

API

Uh oh!

Uh oh!

Uh oh!

fkiraly left a comment

Choose a reason for hiding this comment

Uh oh!

Omswastik-11 commented Dec 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jgyasu commented Dec 31, 2025

Uh oh!

Omswastik-11 commented Dec 31, 2025

Uh oh!

codecov-commenter commented Jan 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

jgyasu commented Jan 13, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

This comment was marked as duplicate.

Uh oh!

geetu040 left a comment

Choose a reason for hiding this comment

Uh oh!

geetu040 Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Omswastik-11 commented Dec 24, 2025 •

edited

Loading

Omswastik-11 commented Dec 25, 2025 •

edited

Loading

codecov-commenter commented Jan 6, 2026 •

edited

Loading