Extractors -> listeners, trigger listeners on saved feeds automatically #114

max-zilla · 2022-09-30T15:33:19Z

This renames extractors to listeners (and creates a legacy endpoint for old extractors) as well as includes basic implementation of job feeds (saved searches) and listeners on those feeds. Lots of decisions here we should discuss.

See test_feeds.py for implementation example, still in progress and very rough.

tcnichol

Ran this one and it still works with extractors. Looks good. Marking approved.

tcnichol · 2022-10-12T17:41:17Z

I think the failing test might be due to ListenerIn needing to be a superclass of ExtractorInfo instead of BaseModel, or perhaps the other way around.

lmarini

Can you add docstrings to all functions and models? We need to start requiring this on all PRs. Thanks!

lmarini · 2022-10-21T20:12:49Z

Will you be adding https://github.com/clowder-framework/clowder2/blob/workflow-documentation/backend/docs/source/listeners.md to this PR?

lmarini · 2022-10-21T20:14:35Z

Are you still in favor or renaming Listeners to EventListeners to be a bit more specific. When looking through the code and documentation I am still wondering if Listeners is still too generic, even for a developer.

max-zilla · 2022-10-25T16:14:22Z

Can you add docstrings to all functions and models? We need to start requiring this on all PRs. Thanks!

Added a bunch of these.

Will you be adding https://github.com/clowder-framework/clowder2/blob/workflow-documentation/backend/docs/source/listeners.md to this PR?

Added with minor updates as well.

Are you still in favor or renaming Listeners to EventListeners to be a bit more specific. When looking through the code and documentation I am still wondering if Listeners is still too generic, even for a developer.

I renamed to EventListeners across the app, although "listeners.py" is still the filename for simplicity.

lmarini

Thanks for updating the PR. Left a few more questions / comments as I continue to review this. There is a lot of digest. Thanks!

lmarini · 2022-10-25T21:11:42Z

backend/app/models/listeners.py

+    parameters: List[dict] = []
+
+
+class EventListenerBase(BaseModel):


Is this supposed to inherit from ExtractorInfo?

no, my intent was to allow listeners to be registered without requiring v1 extractor info. So submitting a v2 listener to /api/listeners doesn't need extractor_info contents, but /api/extractors does.

more broadly my hope was that we would rethink/redesign extractor_info because it has some limitations/bloat. however there is some tension between doing that and getting a working demo sooner with an existing extractor. if it is all too confusing, we can just remove LegacyListener concept and revert v2 models to match v1 and keep extractor/listener framework more or less the same? not sure what is easiest for short vs. long term.

Sounds good. We can try to keep both. Maybe add a few more comments explaining the difference? As long as we can submit/trigger with current extractors, we are good short term. I don't think this gets in the way of that?

lmarini · 2022-10-25T21:12:44Z

backend/app/models/listeners.py

+    """v1 Extractors can submit data formatted as a LegacyEventListener (i.e. v1 format) and it will be converted to a v2 EventListener."""
+
+    name: str
+    version: str = "1.0"


Why the difference in types between version for EventListenerBase and LegacyEventListener (str vs int)?

Clowder v1 lets users arbitrarily provide a version with no rules (e.g. "1", "1.0", "v1" "beta" all allowed). I was thinking (before v1 back-compatibility) that version could just be an integer for clarity and simplicity, but after seeing it is just a string field for v1 we should probably use that.

If we use string, "version" might better be called "tag" or something like Docker does? Don't necessarily see a point in enforcing something that can be cast as a float in that case...

Clowder v1 lets users arbitrarily provide a version with no rules (e.g. "1", "1.0", "v1" "beta" all allowed). I was thinking (before v1 back-compatibility) that version could just be an integer for clarity and simplicity, but after seeing it is just a string field for v1 we should probably use that.

If we use string, "version" might better be called "tag" or something like Docker does? Don't necessarily see a point in enforcing something that can be cast as a float in that case...

Good point. Tag sounds great. We could also enforce semantic versioning in the string and call it version. But that might be more work. Similar to what npm does.

lmarini · 2022-10-25T21:13:16Z

backend/app/tests/test_feeds.py

+    os.remove(dummy_file)
+    assert response.status_code == 200
+
+    # Verify the message


Is there a way to test that the listener was triggered?

not yet. if we had an event log we could check that but it doesn't exist yet. Otherwise we'd have to check RMQ somehow.

@tcnichol how easily could you add v1 events back into v2? I am looking at the heartbeat PR and will leave comments there soon. It would be a similar approach?

lmarini · 2022-10-25T21:21:19Z

backend/app/routers/feeds.py

+
+router = APIRouter()
+
+clowder_bucket = os.getenv("MINIO_BUCKET_NAME", "clowder")


I don't think this is being used.

lmarini · 2022-10-25T21:38:02Z

backend/app/routers/feeds.py

+                        listeners_found.append(listener.listener_id)
+
+    for targ_listener in listeners_found:
+        queue = ""  # TODO: Each extractor gets a queue - routing key same as name?


Can this default to the listener name so that legacy extractors can be triggered?

max-zilla added 3 commits September 27, 2022 13:38

Rename extractors to listeners

da1d413

Additional model work

f889857

More implementation

0de6b7c

max-zilla requested review from lmarini and longshuicy as code owners September 30, 2022 15:33

max-zilla marked this pull request as draft September 30, 2022 15:33

max-zilla added 7 commits September 30, 2022 10:49

auto-generation of feeds

5e8322f

listener assignment

e59cd95

standardizing search object and search criteria

99e828e

add delete endpoints

334c7ae

name cleanup

2503855

refactor how RMQ message built

c96945b

standardize some of the submission methods

9ee5bc0

max-zilla marked this pull request as ready for review October 12, 2022 13:59

max-zilla changed the title ~~[WIP] Trigger listener on saved search feed when file uploaded~~ Trigger listener on saved search feed when file uploaded Oct 12, 2022

max-zilla changed the title ~~Trigger listener on saved search feed when file uploaded~~ Extractors -> listeners, trigger listeners on saved feeds automatically Oct 12, 2022

max-zilla requested a review from tcnichol October 12, 2022 14:03

tcnichol approved these changes Oct 12, 2022

View reviewed changes

ddey2 self-requested a review October 12, 2022 15:29

max-zilla added 3 commits October 14, 2022 10:58

Merge branch 'main' into trigger-listeners-on-feed

261f64b

resolve merges, cleanup ds metadata

4684f51

Update metadata_datasets.py

e808044

lmarini requested changes Oct 20, 2022

View reviewed changes

max-zilla added 3 commits October 25, 2022 10:57

Listener -> Event Listener, add docstrings

9bd9605

fix remaining imports

99a188a

docstrings

06578c9

Merge branch 'main' into trigger-listeners-on-feed

90eb220

max-zilla requested a review from lmarini October 25, 2022 19:26

lmarini requested changes Oct 25, 2022

View reviewed changes

max-zilla added 3 commits October 27, 2022 09:47

get listener name & version as default

ee0103c

remove version

25e5074

Update feeds.py

2a90139

lmarini approved these changes Oct 27, 2022

View reviewed changes

lmarini merged commit e4b2562 into main Oct 27, 2022

lmarini deleted the trigger-listeners-on-feed branch October 27, 2022 16:10

max-zilla restored the trigger-listeners-on-feed branch October 31, 2022 14:30

max-zilla deleted the trigger-listeners-on-feed branch October 31, 2022 14:30

		parameters: List[dict] = []


		class EventListenerBase(BaseModel):


		router = APIRouter()

		clowder_bucket = os.getenv("MINIO_BUCKET_NAME", "clowder")

Extractors -> listeners, trigger listeners on saved feeds automatically #114

Extractors -> listeners, trigger listeners on saved feeds automatically #114

Uh oh!

Conversation

max-zilla commented Sep 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tcnichol left a comment

Choose a reason for hiding this comment

Uh oh!

tcnichol commented Oct 12, 2022

Uh oh!

lmarini left a comment

Choose a reason for hiding this comment

Uh oh!

lmarini commented Oct 21, 2022

Uh oh!

lmarini commented Oct 21, 2022

Uh oh!

max-zilla commented Oct 25, 2022

Uh oh!

lmarini left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

max-zilla commented Sep 30, 2022 •

edited

Loading