fix: timers wake immediately on actor shutdown in threads module by ElFantasma · Pull Request #142 · lambdaclass/spawned

ElFantasma · 2026-01-26T19:34:23Z

EDIT (28-01-2026):

Ran Devin review and fixed all found issues:
https://app.devin.ai/review/lambdaclass/spawned/pull/142

Summary

Add on_cancel() callback to CancellationToken for efficient waiting
Fix is_cancelled() bug that was clearing the flag
Change CancellationToken methods from &mut self to &self
Update send_after/send_interval to use channel-based wake pattern
Add test_send_after_actor_shutdown test

Closes #123

…test

…mer-actor-shutdown

ElFantasma · 2026-01-29T18:29:49Z

@claude @chatgpt @Kimi please review

ElFantasma · 2026-01-29T18:47:57Z

@Kimi please review

ElFantasma · 2026-01-30T17:45:18Z

/claude /kimi /codex

claude · 2026-01-30T17:46:14Z

PR Review - Findings

CRITICAL: Race Condition in CancellationToken

Severity: High
Location: rt/src/threads/mod.rs:91-103

The on_cancel() method has a race condition that can cause callbacks to be lost or fired multiple times:

pub fn on_cancel(&self, callback: CancelCallback) {
    let mut callbacks = self.callbacks.lock().unwrap_or_else(|e| e.into_inner());
    if self.is_cancelled() {  // ← Race window here\!
        drop(callbacks);
        callback();
    } else {
        callbacks.push(callback);
    }
}

The Problem:

Thread A: Acquires lock, sees is_cancelled() = false, about to push callback
Thread B: Calls cancel(), sets flag to true, acquires lock, drains callbacks (empty list)
Thread A: Pushes callback to the list
Result: Callback is never fired because it was added AFTER cancel() drained the list

The Fix:
Check is_cancelled() BEFORE releasing the lock in cancel():

pub fn cancel(&self) {
    let mut callbacks = self.callbacks.lock().unwrap_or_else(|e| e.into_inner());
    if self.is_cancelled.swap(true, Ordering::SeqCst) {
        return; // Already cancelled
    }
    let cbs: Vec<_> = callbacks.drain(..).collect();
    drop(callbacks); // Release lock before firing
    for cb in cbs {
        cb();
    }
}

pub fn on_cancel(&self, callback: CancelCallback) {
    let mut callbacks = self.callbacks.lock().unwrap_or_else(|e| e.into_inner());
    if self.is_cancelled() {
        drop(callbacks);
        callback();
    } else {
        callbacks.push(callback);
    }
}

This ensures the flag is set while holding the lock, preventing the race.

BUG: Incorrect Loop Logic in send_interval

Severity: Medium
Location: concurrency/src/threads/time.rs:92-98

The loop exits when cancellation is detected AFTER a timeout, but it should send the message BEFORE checking cancellation to avoid skipping the final message:

while let Err(RecvTimeoutError::Timeout) = wake_rx.recv_timeout(period) {
    if timer_token.is_cancelled() || actor_token.is_cancelled() {
        break;  // ← Checks BEFORE sending
    }
    let _ = handle.send(message.clone());
}

This means if cancellation happens just as the timeout expires, no message is sent. Consider checking after the send, or document this behavior clearly.

Minor Issues

Potential memory leak concern (rt/src/threads/mod.rs:75-83): If a callback panics during execution, subsequent callbacks won't fire. Consider using catch_unwind or document this behavior.
Missing validation (concurrency/src/threads/time.rs:18, 65): No validation for zero or very small Duration values which could cause unexpected behavior.

Positive Changes

Fixed the is_cancelled() bug that was clearing the flag ✓
Improved error logging from trace to error level ✓
Good test coverage for the shutdown scenario ✓
Correct use of &self instead of &mut self for thread-safe methods ✓

Automated review by Claude (Anthropic)

ElFantasma · 2026-01-30T17:54:34Z

/kimi

ElFantasma · 2026-01-30T17:54:38Z

/codex

github-actions · 2026-01-30T17:54:56Z

🤖 Kimi Code Review

Security & Bug Review

1. Critical Security Issue – `CancellationToken` race condition (rt/src/threads/mod.rs:71-75)

The on_cancel method has a data race between checking is_cancelled() and registering the callback. Between the check and the lock acquisition, another thread could call cancel(), causing the callback to never fire. This breaks the guarantee that callbacks fire exactly once.

Fix: Move the is_cancelled() check inside the locked section to ensure atomicity.

2. Medium Bug – `CancellationToken` callback ordering issue (rt/src/threads/mod.rs:71-75)

The current implementation can drop callbacks if cancel() is called between the is_cancelled() check and the lock acquisition. While the race window is small, this violates the "exactly once" guarantee.

Fix: Same as above - check cancellation while holding the lock.

3. Low Bug – `send_interval` logic flaw (concurrency/src/threads/time.rs:91-97)

The cancellation check happens after the sleep expires, which could allow a message to be sent even after cancellation if the cancellation occurs between the timeout and the check. While the new wake-up mechanism reduces this window, the check should happen before sending.

Fix: Swap the order of the cancellation check and message send:

if !timer_token.is_cancelled() && !actor_token.is_cancelled() {
    let _ = handle.send(message.clone());
} else {
    break;
}

4. Low Bug – `CancellationToken::is_cancelled()` behavior change (rt/src/threads/mod.rs:65-67)

The method changed from fetch_and(false) (which clears the flag) to load(), which is a breaking change in behavior. While this appears intentional for the new callback system, it could break existing code that relied on the flag-clearing behavior.

Note: This seems intentional for the new design, but should be documented as a breaking change.

5. Minor – Unnecessary `mut` removal (multiple files)

The removal of mut from cancellation tokens is correct as they're now used immutably with the new callback system. No issues here.

Summary

Critical: Fix the race condition in CancellationToken::on_cancel
Medium: Ensure callback registration is atomic
Low: Fix potential message send after cancellation in send_interval
Low: Document the breaking change in is_cancelled() behavior

Automated review by Kimi (Moonshot AI)

github-actions · 2026-01-30T17:56:15Z

🤖 Codex Code Review

Findings

Bug (Medium): init_tracing() panics if called more than once because set_global_default is unwrapped. Any consumer that calls rt::tasks::run() and rt::threads::run() (or calls run twice in tests) will crash. Consider using try_init/set_global_default with graceful fallback. rt/src/tracing/mod.rs:10-18, called from rt/src/tasks/mod.rs:25-29 and rt/src/threads/mod.rs:20-23.
Performance/Availability (Low): ctrl_c() stores every subscriber Sender forever and never removes dead ones. Repeated calls can grow the vector unboundedly and each Ctrl+C will iterate/attempt send on stale channels. Consider pruning on send failure or using Weak-like cleanup. rt/src/threads/mod.rs:122-145.

No other security vulnerabilities or significant bugs found in the PR diff.

Automated review by OpenAI Codex

ElFantasma · 2026-01-30T21:19:26Z

Analysis of AI Review Comments

1. Claude & Kimi: Race condition in CancellationToken - ❌ Invalid

Both reviewers claim there's a race condition in on_cancel(). However, the code is correct:

pub fn on_cancel(&self, callback: CancelCallback) {
    let mut callbacks = self.callbacks.lock()...;  // Lock acquired FIRST
    if self.is_cancelled() {                        // Check while holding lock
        drop(callbacks);
        callback();
    } else {
        callbacks.push(callback);
    }
}

The code comment explains the design:

"Hold the lock while checking is_cancelled to avoid a race with cancel(). cancel() sets the flag BEFORE acquiring the lock, so if we see is_cancelled=false while holding the lock, cancel() hasn't drained callbacks yet and will drain ours after we release the lock."

Claude's scenario ("Thread B acquires lock, drains callbacks") is impossible while Thread A holds the lock. The callback is guaranteed to fire exactly once.

2. Claude & Kimi: send_interval logic flaw - ❌ Invalid

Current code checks cancellation BEFORE sending:

if timer_token.is_cancelled() || actor_token.is_cancelled() {
    break;  // Don't send if cancelled
}
let _ = handle.send(message.clone());

This is correct behavior. If cancelled, we should NOT send. The reviewers suggest swapping the order, which would incorrectly send messages after cancellation.

3. Codex: init_tracing() panics if called twice - ⚠️ Pre-existing

Not introduced by this PR. Outside scope.

4. Codex: ctrl_c() memory leak - ✅ Fixed in PR #140

The retain() fix addresses this. Not applicable to PR #142.

5. Kimi: is_cancelled() breaking change - ⚠️ Intentional bug fix

The old behavior (fetch_and(false)) was a bug that cleared the flag on read. The new behavior (load()) is correct. Should be documented as a breaking change.

Summary

Issue	Reviewer	Verdict
Race condition in on_cancel	Claude, Kimi	❌ Invalid - lock prevents race
send_interval logic	Claude, Kimi	❌ Invalid - current order is correct
init_tracing panic	Codex	⚠️ Pre-existing, not in PR scope
ctrl_c memory leak	Codex	✅ Fixed in PR #140
is_cancelled() change	Kimi	⚠️ Intentional fix, should document

pablodeymo · 2026-02-02T18:43:14Z

concurrency/src/threads/stream.rs

 {
    let mut iter = stream.into_iter();
-    let mut cancelation_token = handle.cancellation_token();
+    let cancelation_token = handle.cancellation_token();


there is a type: cancelation_token has 2 l's

ElFantasma added 5 commits January 26, 2026 13:50

feat: add unified ctrl_c() signal handling to spawned-rt

1bd01e8

Formatted several files

93a546e

fix: use JoinHandle for Actor::join() instead of polling

6edc25c

fix: timers wake immediately on actor shutdown in threads module

4fa663c

Removed message about reverting state that no longer holds

c4c0eed

ElFantasma mentioned this pull request Jan 27, 2026

feat: add unified ctrl_c() signal handling #140

Merged

7 tasks

ElFantasma added 6 commits January 27, 2026 17:32

fix: allow multiple actors to subscribe to ctrl_c signal

d3ad93e

test: update signal_test to use two actors

fe66e8d

fix: use spawn_blocking for Thread backend join and fix flaky stream …

482be45

…test

fix: handle poisoned mutex in ctrl_c subscriber registration

740e776

Merge branch 'feature/ctrl-c-signal-handling' into feature/threads-ti…

31e908c

…mer-actor-shutdown

fix: race condition in CancellationToken::on_cancel

525759f

pablodeymo reviewed Feb 2, 2026

View reviewed changes

pablodeymo approved these changes Feb 2, 2026

View reviewed changes

Fixed typo

14c5cfa

Base automatically changed from feature/ctrl-c-signal-handling to main February 3, 2026 16:29

Merge branch 'main' into feature/threads-timer-actor-shutdown

a244c6a

ElFantasma merged commit 3a8e98c into main Feb 3, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: timers wake immediately on actor shutdown in threads module#142

fix: timers wake immediately on actor shutdown in threads module#142
ElFantasma merged 13 commits intomainfrom
feature/threads-timer-actor-shutdown

ElFantasma commented Jan 26, 2026 •

edited

Loading

Uh oh!

ElFantasma commented Jan 29, 2026

Uh oh!

ElFantasma commented Jan 29, 2026

Uh oh!

ElFantasma commented Jan 30, 2026

Uh oh!

claude bot commented Jan 30, 2026

Uh oh!

ElFantasma commented Jan 30, 2026

Uh oh!

ElFantasma commented Jan 30, 2026

Uh oh!

github-actions bot commented Jan 30, 2026

Uh oh!

github-actions bot commented Jan 30, 2026

Uh oh!

ElFantasma commented Jan 30, 2026

Uh oh!

pablodeymo Feb 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ElFantasma commented Jan 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

EDIT (28-01-2026):

Summary

Uh oh!

ElFantasma commented Jan 29, 2026

Uh oh!

ElFantasma commented Jan 29, 2026

Uh oh!

ElFantasma commented Jan 30, 2026

Uh oh!

claude bot commented Jan 30, 2026

PR Review - Findings

CRITICAL: Race Condition in CancellationToken

BUG: Incorrect Loop Logic in send_interval

Minor Issues

Positive Changes

Uh oh!

ElFantasma commented Jan 30, 2026

Uh oh!

ElFantasma commented Jan 30, 2026

Uh oh!

github-actions bot commented Jan 30, 2026

🤖 Kimi Code Review

Security & Bug Review

1. Critical Security Issue – CancellationToken race condition (rt/src/threads/mod.rs:71-75)

2. Medium Bug – CancellationToken callback ordering issue (rt/src/threads/mod.rs:71-75)

3. Low Bug – send_interval logic flaw (concurrency/src/threads/time.rs:91-97)

4. Low Bug – CancellationToken::is_cancelled() behavior change (rt/src/threads/mod.rs:65-67)

5. Minor – Unnecessary mut removal (multiple files)

Summary

Uh oh!

github-actions bot commented Jan 30, 2026

🤖 Codex Code Review

Uh oh!

ElFantasma commented Jan 30, 2026

Analysis of AI Review Comments

1. Claude & Kimi: Race condition in CancellationToken - ❌ Invalid

2. Claude & Kimi: send_interval logic flaw - ❌ Invalid

3. Codex: init_tracing() panics if called twice - ⚠️ Pre-existing

4. Codex: ctrl_c() memory leak - ✅ Fixed in PR #140

5. Kimi: is_cancelled() breaking change - ⚠️ Intentional bug fix

Summary

Uh oh!

pablodeymo Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ElFantasma commented Jan 26, 2026 •

edited

Loading

1. Critical Security Issue – `CancellationToken` race condition (rt/src/threads/mod.rs:71-75)

2. Medium Bug – `CancellationToken` callback ordering issue (rt/src/threads/mod.rs:71-75)

3. Low Bug – `send_interval` logic flaw (concurrency/src/threads/time.rs:91-97)

4. Low Bug – `CancellationToken::is_cancelled()` behavior change (rt/src/threads/mod.rs:65-67)

5. Minor – Unnecessary `mut` removal (multiple files)