Queue combo #121

AnkushMalaker merged 16 commits into SimpleOpenSoftware:main from thestumonkey:queue-combo
Conversation
Added version selection from the conversation page, version deletion, and the queue management page.

# Conflicts:
#	backends/advanced/src/advanced_omi_backend/main.py
commit 3ba5506
Merge: dbf37c7 8ffd97a
Author: Stuart Alexander <stu@theawesome.co.uk>
Date:   Wed Sep 24 20:03:29 2025 +0100

    Merge branch 'AnkushMalaker:main' into pytest

commit 8ffd97a
Author: Ankush Malaker <43288948+AnkushMalaker@users.noreply.github.com>
Date:   Wed Sep 24 09:25:11 2025 +0530

    fix faiss issue in speaker recognition cpu

commit dbf37c7
Author: stu alexander <thestumonkey@gmail.com>
Date:   Sat Sep 20 19:02:22 2025 +0100

    Fixed update user

commit 14845a6
Author: stu alexander <thestumonkey@gmail.com>
Date:   Sat Sep 20 19:02:03 2025 +0100

    Added robot test suite
Important: Review skipped. Auto reviews are disabled on this repository; check the settings in the CodeRabbit UI.

Walkthrough

The PR introduces RQ-based background processing and Beanie models for conversations and audio sessions, adds audio upload and queue monitoring APIs, refactors controllers and processors to enqueue jobs, enhances health checks with Redis/RQ, adds startup scripts, updates the Makefile and configs, expands the web UI for queue management and versioning, and adds extensive Robot tests.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    autonumber
    actor Admin as Admin (API)
    participant FE as Web UI
    participant API as FastAPI (/api/audio/upload)
    participant AC as audio_controller
    participant FS as Filesystem (CHUNK_DIR)
    participant RQ as RQ Enqueue
    participant W as RQ Worker
    participant DB as Beanie (AudioSession, Conversation)
    Admin->>FE: Upload WAV files
    FE->>API: POST /api/audio/upload (files, device_name)
    API->>AC: Delegate upload_and_process_audio_files
    AC->>AC: Validate files, WAV, sample rate
    AC->>FS: Save processed mono 16k WAVs
    AC->>RQ: enqueue_audio_processing(client_id, audio_meta, path)
    RQ-->>API: Return Job IDs
    API-->>FE: JSON {files[], enqueued_count, client_id, job_ids}
    Note over W,DB: Worker execution
    RQ->>W: process_audio_job
    W->>DB: Create AudioSession / link conversation (if any)
    W->>RQ: enqueue_initial_transcription(...)
```
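The upload flow above returns job IDs from the enqueue step. A minimal in-memory stand-in sketches that enqueue/response contract; `FakeQueue` and `upload_and_enqueue` are illustrative names, not the project's actual RQ wiring, though the response fields mirror the diagram's JSON payload:

```python
import uuid
from dataclasses import dataclass, field

@dataclass
class FakeQueue:
    """In-memory stand-in for an RQ queue, for illustration only."""
    name: str
    jobs: list = field(default_factory=list)

    def enqueue(self, func_name: str, *args) -> str:
        # Real RQ generates job IDs too; here we just prefix with the queue name.
        job_id = f"{self.name}_{uuid.uuid4().hex[:8]}"
        self.jobs.append((job_id, func_name, args))
        return job_id

def upload_and_enqueue(queue: FakeQueue, client_id: str, paths: list) -> dict:
    # Mirror the API response shape from the diagram: files, count, client, job ids.
    job_ids = [queue.enqueue("process_audio_job", client_id, p) for p in paths]
    return {
        "files": paths,
        "enqueued_count": len(job_ids),
        "client_id": client_id,
        "job_ids": job_ids,
    }

result = upload_and_enqueue(FakeQueue("transcription"), "client-1", ["a.wav", "b.wav"])
print(result["enqueued_count"])  # 2
```

The point of the shape is that the HTTP handler returns immediately with job IDs; the worker-side steps in the diagram happen asynchronously.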
```mermaid
sequenceDiagram
    autonumber
    actor User as User
    participant FE as Web UI (Conversations)
    participant API as FastAPI (/api/conversations/{id})
    participant CC as conversation_controller
    participant DB as Beanie (Conversation)
    participant RQ as RQ Enqueue
    participant W as RQ Worker
    User->>FE: Open conversation detail
    FE->>API: GET /api/conversations/{id}
    API->>CC: get_conversation
    CC->>DB: Load Conversation, active versions, segments
    DB-->>CC: Conversation + version_info
    CC-->>FE: Details + version_info
    User->>FE: Reprocess transcript
    FE->>API: POST /api/conversations/{id}/reprocess/transcript
    API->>RQ: enqueue_transcript_processing(conversation_id, new_version_id)
    RQ->>W: process_transcript_job
    W->>DB: Add TranscriptVersion, set active
    W-->>FE: Status via queue endpoints
```
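The reprocess flow above ends with a new TranscriptVersion being appended and marked active. A dict-based sketch of that version bookkeeping; the field names are assumptions for illustration (the real models are Beanie documents):

```python
import uuid
from datetime import datetime, timezone

def add_transcript_version(conversation: dict, segments: list) -> str:
    """Append a TranscriptVersion-like record and mark it active (illustrative shape)."""
    version_id = uuid.uuid4().hex
    conversation.setdefault("transcript_versions", []).append({
        "version_id": version_id,
        "segments": segments,
        "created_at": datetime.now(timezone.utc).isoformat(),
    })
    # Reprocessing never mutates old versions; it only switches the active pointer.
    conversation["active_transcript_version"] = version_id
    return version_id

conv = {"conversation_id": "c1"}
vid = add_transcript_version(conv, [{"text": "hello", "start": 0.0}])
assert conv["active_transcript_version"] == vid
```

Keeping old versions immutable and only moving an "active" pointer is what makes the version-selection dropdown in the web UI cheap to implement.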
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~75 minutes
Pre-merge checks and finishing touches

❌ Failed checks (1 inconclusive)
✅ Passed checks (2 passed)
@coderabbitai review

✅ Actions performed: Review triggered.
Actionable comments posted: 56
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (8)
backends/advanced/src/advanced_omi_backend/processors.py.backup (1)
906-1270: Remove the stray legacy block

Starting at line 906 there's a legacy `_memory_processor` fragment sitting at module scope with a leading indent. If Python ever tries to import this `.backup` module (e.g., `python -m compileall`), it fails immediately with `IndentationError: unexpected indent`, and the block references attributes (`memory_queue`, `track_processing_task`) that no longer exist. Please delete this trailing block or drop the `.backup` module entirely so the tree remains syntactically valid.

backends/advanced/src/advanced_omi_backend/memory/utils.py (1)
74-83: Regex now matches literal "\s" instead of whitespace; JSON extraction will fail.

In a raw string, `\\s` matches a backslash followed by `s`. Use `\s` to match whitespace. Same for the array colon segment. Fix both patterns:

```diff
- r'\{"memory"\\s*:\\s*\[.*?\]\\s*\}',
+ r'\{"memory"\s*:\s*\[.*?\]\s*\}',
- r'\{"facts"\\s*:\\s*\[.*?\]\\s*\}',
+ r'\{"facts"\s*:\s*\[.*?\]\s*\}',
```

tests/integration/mobile_client_tests.robot (1)
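The raw-string distinction flagged in the regex comment above is easy to verify; a small sketch using the two pattern variants from that diff against a made-up test string:

```python
import re

# In a raw string, "\\s" is a literal backslash followed by "s",
# while r"\s" is the whitespace character class.
text = '{"memory" : ["a", "b"] }'

broken = re.search(r'\{"memory"\\s*:\\s*\[.*?\]\\s*\}', text)
fixed = re.search(r'\{"memory"\s*:\s*\[.*?\]\s*\}', text)

print(broken)  # None - the broken pattern demands literal backslashes in the input
print(fixed.group())  # matches the whole JSON-like object
```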
1-51: Add consistent "todo" tags and consider excluding from CI.

This placeholder test suite will cause CI failures since all tests explicitly fail. Consider:

- Adding the "todo" tag to all tests (currently only the first test has it)
- Excluding the suite from CI until implementation begins

Apply this diff to add consistent "todo" tags:

```diff
 Login to server
     [Documentation]    Test logging in to the server from mobile client
+    [Tags]    debug    todo
     Log    Logging in to server    INFO
     Fail    Test not written yet - placeholder test

 Scan bluetooth devices
     [Documentation]    Scan for available bluetooth devices
+    [Tags]    debug    todo
     Log    Scanning bluetooth devices    INFO
     Fail    Test not written yet - placeholder test

 Filter devices by omi
     [Documentation]    Filter scanned devices by omi
+    [Tags]    debug    todo
     Log    Filtering devices by omi    INFO
     Fail    Test not written yet - placeholder test

 Connect to bluetooth device
     [Documentation]    Connect to a bluetooth device
+    [Tags]    debug    todo
     Log    Connecting to bluetooth device    INFO
     Fail    Test not written yet - placeholder test

 Get device codec
     [Documentation]    Get the codec information from the device
+    [Tags]    debug    todo
     Log    Getting device codec    INFO
     Fail    Test not written yet - placeholder test

 Get device battery level
     [Documentation]    Get the battery level from the device
+    [Tags]    debug    todo
     Log    Getting device battery level    INFO
     Fail    Test not written yet - placeholder test

 Start audio stream
     [Documentation]    Start streaming audio from the device
+    [Tags]    debug    todo
     Log    Starting audio stream    INFO
     Fail    Test not written yet - placeholder test
```

Then update your Robot test runner to exclude "todo" tests:

```shell
robot --exclude todo tests/
```

tests/integration/conversation_queue.robot (1)
100-132: Use evaluated variables when checking job status

Expressions like `Should Be True    '${job_details}[status]' in [...]` and `'${status}' in [...]` treat the left side as a literal string, so the checks always fail once the earlier syntax issues are fixed. Switch to the `$variable` syntax so Robot evaluates the actual status values. (robotframework.org)

```diff
-    Should Be True    '${job_details}[status]' in ['completed', 'finished']    Job status: ${job_details}[status], expected completed or finished
+    Should Be True    $job_details['status'] in ['completed', 'finished']    Job status: ${job_details}[status], expected completed or finished
 ...
-    Should Be True    '${status}' in ['completed', 'finished', 'failed']    Job status: ${status}
+    Should Be True    $status in ['completed', 'finished', 'failed']    Job status: ${status}
```

Makefile (2)
8-11: Fix env export filters: `\s` isn't valid in grep here; comments/blank lines aren't excluded.

On many systems `grep` (basic regex) treats `\s` as literal characters, so comment and blank lines slip through and `export` may try to export `#...` as a variable name. Use a single robust `sed` to extract only valid identifiers.

Apply this diff in both blocks:

```diff
-export $(shell sed 's/=.*//' .env | grep -v '^\s*$$' | grep -v '^\s*\#')
+# Export only VAR names that look like shell identifiers (skip blanks/comments)
+export $(shell sed -n 's/^[[:space:]]*\([A-Za-z_][A-Za-z0-9_]*\)[[:space:]]*=.*/\1/p' .env)
```

```diff
-export $(shell sed 's/=.*//' config.env | grep -v '^\s*$$' | grep -v '^\s*\#')
+# Export only valid identifiers from config.env
+export $(shell sed -n 's/^[[:space:]]*\([A-Za-z_][A-Za-z0-9_]*\)[[:space:]]*=.*/\1/p' config.env)
```

Also applies to: 14-17
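The proposed `sed` expression can be sanity-checked against a scratch file; the file path and contents below are made up for the demo:

```shell
# Hypothetical env file exercising the edge cases: comment, blank line,
# leading whitespace, spaces around '=', and an invalid identifier.
cat > /tmp/demo.env <<'EOF'
# a comment
FOO=1

  BAR = 2
3BAD=nope
EOF

# Prints only names matching [A-Za-z_][A-Za-z0-9_]* followed by '='.
sed -n 's/^[[:space:]]*\([A-Za-z_][A-Za-z0-9_]*\)[[:space:]]*=.*/\1/p' /tmp/demo.env
# FOO
# BAR
```

Note that `3BAD` is skipped because the anchored group requires the first character to be a letter or underscore, which is exactly what `export` needs.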
159-159: Use POSIX `.` instead of Bash-only `source`.

`make` runs `/bin/sh` by default; `source` may fail. Use the POSIX dot:

```diff
-	@set -a; source skaffold.env; set +a; skaffold run --profile=infrastructure --default-repo=$(CONTAINER_REGISTRY)
+	@set -a; . skaffold.env; set +a; skaffold run --profile=infrastructure --default-repo=$(CONTAINER_REGISTRY)
```

```diff
-	@set -a; source skaffold.env; set +a; skaffold run --profile=advanced-backend --default-repo=$(CONTAINER_REGISTRY)
+	@set -a; . skaffold.env; set +a; skaffold run --profile=advanced-backend --default-repo=$(CONTAINER_REGISTRY)
```

(and the same change in deploy-apps.)

Also applies to: 242-242, 255-255
backends/advanced/src/advanced_omi_backend/transcription.py (1)
886-904: Undefined variable `conversation` (use `conversation_model`).

This crashes the queueing path and breaks memory processing. Replace all `conversation[...]` uses with `conversation_model` attributes.

Based on learnings

```diff
-        logger.info(
-            f"💭 Queuing memory processing for conversation {conversation_id} (audio: {conversation['audio_uuid']})"
-        )
+        logger.info(
+            f"💭 Queuing memory processing for conversation {conversation_id} (audio: {conversation_model.audio_uuid})"
+        )
 @@
-        enqueue_memory_processing(
+        enqueue_memory_processing(
             client_id=self._client_id,
-            user_id=conversation["user_id"],
+            user_id=conversation_model.user_id,
             user_email=audio_session["user_email"],
             conversation_id=conversation_id
         )
```
115-131: Fix broken queue stats after RQ migration

`ProcessorManager` no longer exposes `memory_queue`, `cropping_queue`, or `processing_tasks`, so this call now raises an `AttributeError` every time `/system/processor-status` hits it. We need to pull queue sizes from the new RQ queues instead of the removed in-process queues, and count tracked jobs via `processing_jobs`.

```diff
 @@
-from advanced_omi_backend.processors import get_processor_manager
+from advanced_omi_backend.processors import get_processor_manager
+from advanced_omi_backend.rq_queue import DEFAULT_QUEUE, MEMORY_QUEUE, get_queue
 @@
-    status = {
+    memory_queue = get_queue(MEMORY_QUEUE)
+    default_queue = get_queue(DEFAULT_QUEUE)
+
+    status = {
         "queues": {
             "audio_queue": processor_manager.audio_queue.qsize(),
             "transcription_queue": processor_manager.transcription_queue.qsize(),
-            "memory_queue": processor_manager.memory_queue.qsize(),
-            "cropping_queue": processor_manager.cropping_queue.qsize(),
+            "memory_queue": len(memory_queue.jobs),
+            "cropping_queue": sum(
+                1
+                for job in default_queue.jobs
+                if getattr(job, "id", "") and job.id.startswith("cropping_")
+            ),
         },
         "processors": {
             "audio_processor": "running",
             "transcription_processor": "running",
             "memory_processor": "running",
             "cropping_processor": "running",
         },
         "active_clients": len(processor_manager.active_file_sinks),
         "active_audio_uuids": len(processor_manager.active_audio_uuids),
-        "processing_tasks": len(processor_manager.processing_tasks),
+        "processing_tasks": len(processor_manager.processing_jobs),
         "timestamp": int(time.time()),
     }
```
🧹 Nitpick comments (34)
backends/advanced/webui/src/pages/System.tsx (1)
279-284: Consider extending the service status interface for type safety.

The conditional rendering of Redis worker stats uses `(status as any)` to access properties (`worker_count`, `active_workers`, `idle_workers`) that aren't defined in the `HealthData` interface (lines 7-14). While the runtime check `!== undefined` provides safety, TypeScript loses compile-time guarantees.

Consider extending the interface to include these optional properties:

```diff
 interface HealthData {
   status: 'healthy' | 'partial' | 'unhealthy'
   services: Record<string, {
     healthy: boolean
     message?: string
+    status?: string
+    provider?: string
+    worker_count?: number
+    active_workers?: number
+    idle_workers?: number
   }>
   timestamp?: string
 }
```

This would enable autocomplete, prevent property-name typos, and provide better IDE support without changing runtime behavior.
backends/advanced/src/advanced_omi_backend/transcription_providers.py (1)
833-856: Unreachable code at lines 855-856.

The `else` clause at lines 855-856 is unreachable because:

- Lines 806-832 handle all non-`None` `provider_name` cases (returning a provider or raising an exception).
- Lines 835-854 handle the `provider_name is None` case and always return a value (either a provider instance or `None` at line 854).

Apply this diff to remove the unreachable code:

```diff
         logger.warning("No transcription provider configured (DEEPGRAM_API_KEY or PARAKEET_ASR_URL required)")
         return None
-    else:
-        return None
```

tests/resources/timing_keywords.robot (2)
20-39: Add error handling for missing start time.

Line 28 will raise an error if `Start Timer` was never called for the operation. Consider adding a default value and validation to provide a clearer error message.

Apply this diff:

```diff
-    ${start_time}=    Get From Dictionary    ${TIMING_DATA}    ${start_key}
+    ${start_time}=    Get From Dictionary    ${TIMING_DATA}    ${start_key}    default=None
+    IF    ${start_time} == None
+        Fail    Timer '${operation_name}' was never started
+    END
```

41-47: Add error handling for missing duration.

Line 46 will raise an error if the timer was never stopped. Consider adding a default value and validation.

Apply this diff:

```diff
-    ${duration}=    Get From Dictionary    ${TIMING_DATA}    ${duration_key}
+    ${duration}=    Get From Dictionary    ${TIMING_DATA}    ${duration_key}    default=None
+    IF    ${duration} == None
+        Fail    Timer '${operation_name}' was never stopped or doesn't exist
+    END
```

requirements.txt (1)
1-5: Recommend pinning dependency versions for reproducibility.

The Robot Framework and python-dotenv dependencies are unpinned, which can lead to non-reproducible builds and unexpected test failures when upstream packages release breaking changes.

For example, python-dotenv introduced breaking changes in v15+ around comment parsing and multiline handling (Based on learnings). Similarly, Robot Framework libraries may introduce incompatible changes.

Consider pinning versions to ensure consistent test behavior:

```diff
-robotframework
-robotframework-tidy
-robotframework-requests
-robotframework-browser
-python-dotenv
+robotframework==7.0
+robotframework-tidy==4.0.0
+robotframework-requests==0.9.6
+robotframework-browser==18.0.0
+python-dotenv==1.0.0
```

Alternatively, generate a lock file using `pip freeze > requirements-lock.txt` after installing to capture exact versions.

tests/browser/browser_auth.robot (1)
23-23: Consider making timeout configurable.

The hard-coded 10-second timeout might be insufficient in slow CI environments or with network latency. Consider extracting timeouts to variables or suite-level configuration for better flexibility.

Example refactor in the Variables section:

```robotframework
*** Variables ***
${LOGIN_TIMEOUT}      10s
${DEFAULT_TIMEOUT}    10s

*** Test Cases ***
Test Browser Can Access Login Page
    ...
    Wait For Elements State    id=email    visible    timeout=${LOGIN_TIMEOUT}
    ...
    Wait For Elements State    text=Friend-Lite Dashboard    visible    timeout=${DEFAULT_TIMEOUT}
```

backends/advanced/Dockerfile (1)
42-47: start.sh verified with basic error handling; consider adding `set -u` and `set -o pipefail`

- File `backends/advanced/start.sh` exists and includes `set -e`.
- The script correctly orchestrates both RQ workers and the FastAPI backend.
- For more robust failure detection, add `set -u` (treat unset variables as errors) and `set -o pipefail` (catch pipeline failures).

backends/advanced/src/advanced_omi_backend/models/job.py (1)
11-23: Update or remove the stale timeout docstring in the `JobPriority` enum

Timeout values are defined per-job in `rq_queue.py` via multiple `timeout_mapping` dicts (e.g., transcription uses 600/480/300/180 s, audio uses 120/90/60/30 s, cropping uses 300/240/180/120 s, etc.), but the enum's docstring only lists the transcription defaults (10 min, 8 min, 5 min, 3 min). This duplication is prone to drift; extract the mappings to a single source of truth or sync/delete the enum docstring.

backends/advanced/src/advanced_omi_backend/controllers/user_controller.py (2)
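The single-source-of-truth suggestion from the `job.py` comment above could look like a shared table that both enqueue code and documentation read from. The structure and names below are assumptions for illustration, not the project's actual `rq_queue.py`; only the numeric values come from the comment:

```python
from enum import Enum

class JobPriority(str, Enum):
    URGENT = "urgent"
    HIGH = "high"
    NORMAL = "normal"
    LOW = "low"

# One table, in seconds, keyed by (job type, priority); any docstring or
# enqueue call can reference this instead of repeating the numbers.
JOB_TIMEOUTS = {
    "transcription": {JobPriority.URGENT: 600, JobPriority.HIGH: 480,
                      JobPriority.NORMAL: 300, JobPriority.LOW: 180},
    "audio":         {JobPriority.URGENT: 120, JobPriority.HIGH: 90,
                      JobPriority.NORMAL: 60,  JobPriority.LOW: 30},
    "cropping":      {JobPriority.URGENT: 300, JobPriority.HIGH: 240,
                      JobPriority.NORMAL: 180, JobPriority.LOW: 120},
}

def timeout_for(job_type: str, priority: JobPriority) -> int:
    """Look up the job timeout from the shared table."""
    return JOB_TIMEOUTS[job_type][priority]

print(timeout_for("transcription", JobPriority.NORMAL))  # 300
```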
83-83: Drop print; use logger.debug.

Replace the debug print with structured logging:

```diff
-    print("DEBUG: New update_user function is being called!")
+    logger.debug("update_user called")
```

95-101: Prefer a manager-based existence check for consistency.

Use the user manager to fetch by id (and centralize constraints), instead of mixing raw `users_col` with the manager.

Example:

```python
existing_user = await user_manager.get(object_id)
if not existing_user:
    return JSONResponse(status_code=404, content={"message": f"User {user_id} not found"})
```

Also ensure `UserUpdate` cannot carry privileged fields (e.g., `is_superuser`) through this path. Please confirm the schema.

tests/all_api_tests.robot (1)
45-53: Ensure ${API_URL} is defined or guard it.

Without `Variables    tests/test_env.py` or a CLI `-V` option, `${API_URL}` will be undefined. Add a Variables import (above) or fall back to the environment:

```diff
 Master Suite Setup
     [Documentation]    Setup for the entire test suite
     Log    Starting Friend-Lite API Test Suite
-    Log    Testing against: ${API_URL}
+    ${api_url}=    Set Variable If    '${API_URL}'!='${API_URL}'    ${API_URL}    %{API_URL}
+    Log    Testing against: ${api_url}
```

Confirm your CI invokes Robot with `-V tests/test_env.py` or sets the `API_URL` env var.

tests/endpoints/auth_tests.robot (1)
90-90: Use Robot Framework's native type checking instead of Python's isinstance.

The `isinstance()` call may not evaluate correctly in Robot Framework's expression context. Consider using Robot Framework's built-in keywords for type validation.

Apply this diff to use Robot Framework's native approach:

```diff
-    Should Be True    isinstance($response.json(), list)
+    ${users}=    Set Variable    ${response.json()}
+    # Verify it's a list by checking length exists
+    ${length}=    Get Length    ${users}
+    Should Be True    ${length} >= 0
```

Or simply verify the list by iterating (which implicitly validates it's iterable):

```diff
-    Should Be True    isinstance($response.json(), list)
+    # Implicit list validation - Get Length will fail if not a list
+    ${users}=    Set Variable    ${response.json()}
+    ${length}=    Get Length    ${users}
```

backends/advanced/start.sh (3)
10-20: Improve shutdown reliability by killing tracked PIDs explicitly.

The `pkill -TERM -P $$` approach may not reliably kill background jobs in all scenarios. Since you're already capturing `WORKER_PID` and `BACKEND_PID`, use them explicitly in the shutdown handler.

Apply this diff to improve shutdown handling:

```diff
 shutdown() {
     echo "🛑 Shutting down services..."
-    pkill -TERM -P $$
+    kill -TERM $WORKER_PID $BACKEND_PID 2>/dev/null || true
     wait
     echo "✅ All services stopped"
     exit 0
 }
```
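The PID-tracked shutdown pattern suggested above can be exercised as a standalone sketch; this is a demo, not the project's `start.sh`, and it calls `shutdown` directly so it terminates on its own:

```shell
# Two placeholder background "services" standing in for the worker and backend.
sleep 30 & WORKER_PID=$!
sleep 30 & BACKEND_PID=$!

shutdown() {
    # Kill only the processes we started, then reap them.
    kill -TERM "$WORKER_PID" "$BACKEND_PID" 2>/dev/null || true
    wait
    echo "all services stopped"
}
trap shutdown TERM INT

shutdown   # invoked directly here so the sketch exits instead of sleeping 30s
```

Killing the tracked PIDs is more precise than `pkill -P $$`, which depends on the parent/child relationship still holding at shutdown time.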
22-28: Consider health checks instead of an arbitrary sleep for worker readiness.

The `sleep 2` assumes workers will be ready in 2 seconds, but there's no verification. Consider adding a Redis connection check or RQ worker ping to confirm readiness.

Example health-check approach:

```shell
# Start RQ workers in the background
echo "🔧 Starting RQ workers..."
uv run --extra deepgram rq worker transcription memory default --url "${REDIS_URL:-redis://localhost:6379/0}" &
WORKER_PID=$!

# Wait for Redis connection instead of an arbitrary sleep
echo "⏳ Waiting for workers to be ready..."
for i in {1..30}; do
    if redis-cli -u "${REDIS_URL:-redis://localhost:6379/0}" ping &>/dev/null; then
        echo "✅ Redis connection confirmed"
        break
    fi
    sleep 1
done
```
30-44: Consider log separation for easier debugging.

Both the RQ workers and the FastAPI backend write to the same stdout/stderr, which can make debugging difficult. Consider redirecting their output to separate files or using a log prefix.

Optional: redirect logs to separate files:

```diff
 # Start RQ workers in the background
 echo "🔧 Starting RQ workers..."
-uv run --extra deepgram rq worker transcription memory default --url "${REDIS_URL:-redis://localhost:6379/0}" &
+uv run --extra deepgram rq worker transcription memory default --url "${REDIS_URL:-redis://localhost:6379/0}" > /var/log/rq-worker.log 2>&1 &
 WORKER_PID=$!

 # Start the main FastAPI application
 echo "🌐 Starting FastAPI backend..."
-uv run --extra deepgram python3 src/advanced_omi_backend/main.py &
+uv run --extra deepgram python3 src/advanced_omi_backend/main.py > /var/log/backend.log 2>&1 &
 BACKEND_PID=$!
```

backends/advanced/start-k8s.sh (1)
80-89: Process exit handling may leave orphaned processes.

The script uses `wait -n` to wait for any process to exit, then kills both worker and backend PIDs. However, if one process has already exited, the `kill` command at line 85 may fail silently (with `2>/dev/null`), and the script won't distinguish between a clean shutdown and an unexpected crash.

Improve exit handling to report which process exited:

```diff
 # Wait for any process to exit
-wait -n
+wait -n -p EXITED_PID

 # If we get here, one process has exited - kill the others
-echo "⚠️ One service exited, stopping all services..."
+if [ "$EXITED_PID" = "$WORKER_PID" ]; then
+    echo "⚠️ RQ workers exited unexpectedly, stopping backend..."
+elif [ "$EXITED_PID" = "$BACKEND_PID" ]; then
+    echo "⚠️ FastAPI backend exited unexpectedly, stopping workers..."
+fi
 kill $WORKER_PID $BACKEND_PID 2>/dev/null || true
 wait
```

tests/endpoints/chat_tests.robot (1)
49-81: Improve session structure validation in the loop.

The test at lines 68-77 loops through sessions and validates structure inside the loop, but the validation comment at line 69 is misplaced outside the loop body. The dictionary validation at lines 70-73 should logically be inside the IF block or dedented to clarify scope.

Consider restructuring for clarity:

```diff
     FOR    ${session}    IN    @{sessions}
-    # Verify chat session structure
-    Dictionary Should Contain Key    ${session}    session_id
-    Dictionary Should Contain Key    ${session}    title
-    Dictionary Should Contain Key    ${session}    created_at
-    Dictionary Should Contain Key    ${session}    updated_at
+        # Verify chat session structure
+        Dictionary Should Contain Key    ${session}    session_id
+        Dictionary Should Contain Key    ${session}    title
+        Dictionary Should Contain Key    ${session}    created_at
+        Dictionary Should Contain Key    ${session}    updated_at
         IF    '${session}[session_id]' == '${test_session}[session_id]'
             ${found}=    Set Variable    ${True}
         END
     END
```

tests/endpoints/health_tests.robot (2)
42-48: Guard logs to avoid KeyErrors when optional services are absent

Direct indexing will fail if a service is not configured. Wrap the logs with IF checks:

```diff
-    Log To Console    Mongodb: ${services}[mongodb][status]
-    Log To Console    AudioAI: ${services}[audioai][status]
-    Log To Console    Memory Service: ${services}[memory_service][status]
-    Log To Console    Speech to Text: ${services}[speech_to_text][status]
-    Log To Console    Speaker recognition: ${services}[speaker_recognition][status]
+    IF    'mongodb' in ${services}
+        Log To Console    Mongodb: ${services}[mongodb][status]
+    END
+    IF    'audioai' in ${services}
+        Log To Console    AudioAI: ${services}[audioai][status]
+    END
+    IF    'memory_service' in ${services}
+        Log To Console    Memory Service: ${services}[memory_service][status]
+    END
+    IF    'speech_to_text' in ${services}
+        Log To Console    Speech to Text: ${services}[speech_to_text][status]
+    END
+    IF    'speaker_recognition' in ${services}
+        Log To Console    Speaker recognition: ${services}[speaker_recognition][status]
+    END
```
120-128: Nit: unused session

`Get Anonymous Session    anon_session` is unused in this test. Consider removing it to reduce noise.

tests/endpoints/rq_queue_tests.robot (3)
135-137: Strengthen persistence assertions (current checks are tautologies)

`>= 0` can never fail and doesn't prove persistence. Compare against the pre-restart count:

```diff
-    Should Be True    ${jobs_count_after} >= 0
+    Should Be True    ${jobs_count_after} >= ${jobs_count_before}
```

If jobs may process during restart, wrap with a brief retry:

```robotframework
Wait Until Keyword Succeeds    20s    2s    Validate Jobs Persist    ${jobs_count_before}
```

Add a helper:

```robotframework
*** Keywords ***
Validate Jobs Persist
    [Arguments]    ${min_count}
    ${after}=    Check Queue Jobs
    ${count}=    Get Length    ${after}[jobs]
    Should Be True    ${count} >= ${min_count}
```

Also applies to: 191-194
15-18: Avoid double initialization: `Test Setup` repeats `Suite Setup`

Running the same setup both per-suite and per-test adds time and flakiness. Keep it in only one place:

```diff
-Suite Setup       Suite Setup
-Suite Teardown    Delete All Sessions
-Test Setup        Suite Setup
+Suite Setup       Suite Setup
+Suite Teardown    Delete All Sessions
```
26-33: `FLUSHALL` wipes the entire Redis instance

Safe for an isolated test Redis, but risky if shared. Prefer a dedicated DB index or key-prefix deletion.
tests/endpoints/memory_tests.robot (2)
184-186: Endpoint path likely incorrect; align with the admin path used elsewhere

Elsewhere you use `/api/admin/memories`. This test hits `/api/memories/admin` and expects 403, which may return 404/405 instead.

```diff
-    ${response}=    GET On Session    user_session    /api/memories/admin    expected_status=403
+    ${response}=    GET On Session    user_session    /api/admin/memories    expected_status=403
```
20-25: Consider using a non-admin user for the "user memories" test

Using `admin_session` here doesn't validate user scoping. Add a variant with a regular user to ensure only that user's memories are returned.

backends/advanced/webui/src/components/ConversationVersionDropdown.tsx (2)
114-118: Optional: show accurate labels before history loads

Use `versionInfo.active_*_version` when `versionHistory` is null so labels reflect the real active versions immediately.
124-145: Accessibility: add ARIA and keyboard support

Add `aria-haspopup="listbox"` and `aria-expanded`, and allow keyboard navigation for options. This improves usability without changing behavior.

Also applies to: 186-207
tests/endpoints/system_admin_tests.robot (1)
206-207: Keyword signature inconsistency (Verify)

`Delete Test User` is called with a `session` arg here, but without one in other suites. Ensure the resource keyword supports both forms or standardize call sites.

tests/test_env.py (1)
7-8: Force test env vars to override host values.

Without `override=True`, any pre-set `BACKEND_URL`, `ADMIN_EMAIL`, etc. on the runner will bleed into tests, potentially pointing automation at real services. Make the load deterministic by overriding host values:

```diff
-load_dotenv(test_env_path)
+load_dotenv(test_env_path, override=True)
```

Makefile (2)
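The override semantics behind the `load_dotenv` comment above can be illustrated with a small stand-in loader; this mimics the behavior for the demo and is not the python-dotenv implementation:

```python
import os

def load_env(pairs: dict, override: bool = False) -> None:
    """Stand-in for dotenv loading: setdefault keeps host values, override replaces them."""
    for key, value in pairs.items():
        if override:
            os.environ[key] = value            # test values win
        else:
            os.environ.setdefault(key, value)  # host values win

os.environ["BACKEND_URL"] = "http://prod.example"   # pre-set on the runner
load_env({"BACKEND_URL": "http://localhost:8000"})
print(os.environ["BACKEND_URL"])  # http://prod.example - the host value leaked

load_env({"BACKEND_URL": "http://localhost:8000"}, override=True)
print(os.environ["BACKEND_URL"])  # http://localhost:8000
```

This is exactly the leak the review flags: without overriding, whatever is already exported on the CI runner silently wins over the test fixture values.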
22-22: Add conventional phony aliases 'all' and 'test'.

This satisfies static analysis and matches expectations (`make test` runs tests):

```diff
-.PHONY: help menu setup-k8s setup-infrastructure setup-rbac setup-storage-pvc config config-docker config-k8s config-all clean deploy deploy-docker deploy-k8s deploy-k8s-full deploy-infrastructure deploy-apps check-infrastructure check-apps build-backend up-backend down-backend k8s-status k8s-cleanup k8s-purge audio-manage test-robot test-robot-integration test-robot-unit test-robot-endpoints test-robot-specific test-robot-clean
+.PHONY: all test help menu setup-k8s setup-infrastructure setup-rbac setup-storage-pvc config config-docker config-k8s config-all clean deploy deploy-docker deploy-k8s deploy-k8s-full deploy-infrastructure deploy-apps check-infrastructure check-apps build-backend up-backend down-backend k8s-status k8s-cleanup k8s-purge audio-manage test-robot test-robot-integration test-robot-unit test-robot-endpoints test-robot-specific test-robot-clean
+
+all: menu
+test: test-robot
```
188-203: Make config-k8s resilient when the cluster is absent.

You already swallow apply errors; also consider guarding the `kubectl get ... | sed | kubectl apply` copy steps when the source resources don't exist, to avoid noisy failures in local dev.

tests/resources/memory_keywords.robot (1)
31-38: Consider URL-encoding and types for query params.

Ensure `score_threshold` is passed as a number and properly encoded by RequestsLibrary; the current dict is fine, but if thresholds can be `None`, add conditional inclusion like in Get User Memories.

backends/advanced/src/advanced_omi_backend/transcription.py (3)
467-479: Absolute timestamps should anchor to session start, not time.time().

`absolute_timestamp = time.time() + start` introduces drift; use the audio session/conversation `session_start` (ms → seconds).

```diff
-            "absolute_timestamp": time.time() + segment.get("start", 0.0),
+            # Anchor to session_start (UTC) if available; fall back to now
+            "absolute_timestamp": (
+                (conversation_model.session_start.timestamp() if 'conversation_model' in locals() and conversation_model and conversation_model.session_start else time.time())
+                + float(segment.get("start", 0.0))
+            ),
```

If you prefer not to touch it here, compute the anchor once above, after fetching `conversation_model`.
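The drift issue can be seen by comparing the two anchoring strategies directly; variable names here are illustrative:

```python
import time
from datetime import datetime, timezone

# Anchor segment times to the session start rather than "now", so replays
# and reprocessing produce stable absolute timestamps.
session_start = datetime(2025, 9, 24, 19, 0, tzinfo=timezone.utc)
segment = {"start": 12.5}

drifting = time.time() + segment["start"]              # depends on when the worker runs
stable = session_start.timestamp() + segment["start"]  # deterministic

print(datetime.fromtimestamp(stable, tz=timezone.utc).isoformat())
# 2025-09-24T19:00:12.500000+00:00
```

Anchoring to `time.time()` means re-running transcription on the same audio yields different absolute timestamps every time; anchoring to the session start does not.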
437-441: Narrow the overly broad except Exception, or at least log with context and re-raise selectively.

Broad catches hide programmer errors and complicate retries/alerts.

- Prefer specific exceptions (I/O, network, Beanie, JSON).
- Where you must catch `Exception`, include `exc_info=True` and consider re-raising or returning error states for upstream handling.

Also applies to: 570-577, 969-971
657-688: Session timestamp conversion: good, but guard zero/None.

`datetime.fromtimestamp(raw/1000)` will be the epoch when the value is missing; add a conditional to avoid misleading `1970-01-01Z` session starts:

```diff
-        "session_start": datetime.fromtimestamp(audio_session.get("timestamp", 0) / 1000, tz=UTC),
+        "session_start": (
+            datetime.fromtimestamp(audio_session["timestamp"] / 1000, tz=UTC)
+            if audio_session.get("timestamp") else datetime.now(UTC)
+        ),
```

Also applies to: 691-699
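The zero/None guard from the comment above is worth factoring into a helper; the millisecond field semantics and the helper name are assumptions for illustration:

```python
from datetime import datetime, timezone
from typing import Optional

def session_start_from_ms(ms: Optional[int]) -> datetime:
    """Convert a millisecond timestamp to UTC, avoiding a misleading 1970 epoch
    when the field is missing or zero."""
    if ms:
        return datetime.fromtimestamp(ms / 1000, tz=timezone.utc)
    return datetime.now(timezone.utc)

# A missing or zero timestamp falls back to "now" instead of 1970-01-01Z.
assert session_start_from_ms(0).year >= 2020
# 1_700_000_000_000 ms is a real 2023 instant.
assert session_start_from_ms(1_700_000_000_000).year == 2023
```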
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
⛔ Files ignored due to path filters (1)
extras/speaker-recognition/uv.lockis excluded by!**/*.lock
📒 Files selected for processing (80)
- .env.template (1 hunks)
- Makefile (4 hunks)
- backends/advanced/Dockerfile (1 hunks)
- backends/advanced/Dockerfile.k8s (1 hunks)
- backends/advanced/docker-compose-test.yml (3 hunks)
- backends/advanced/src/advanced_omi_backend/config.py (3 hunks)
- backends/advanced/src/advanced_omi_backend/controllers/audio_controller.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/controllers/conversation_controller.py (12 hunks)
- backends/advanced/src/advanced_omi_backend/controllers/system_controller.py (2 hunks)
- backends/advanced/src/advanced_omi_backend/controllers/system_controller.py.backup (1 hunks)
- backends/advanced/src/advanced_omi_backend/controllers/user_controller.py (3 hunks)
- backends/advanced/src/advanced_omi_backend/conversation_manager.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/database.py (3 hunks)
- backends/advanced/src/advanced_omi_backend/main.py (5 hunks)
- backends/advanced/src/advanced_omi_backend/memory/prompts.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/memory/utils.py (2 hunks)
- backends/advanced/src/advanced_omi_backend/models/__init__.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/models/audio_session.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/models/conversation.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/models/job.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/models/user.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/processors.py (12 hunks)
- backends/advanced/src/advanced_omi_backend/processors.py.backup (1 hunks)
- backends/advanced/src/advanced_omi_backend/routers/api_router.py (2 hunks)
- backends/advanced/src/advanced_omi_backend/routers/modules/__init__.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/routers/modules/audio_routes.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/routers/modules/conversation_routes.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/routers/modules/queue_routes.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/routers/modules/system_routes.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/routers/modules/user_routes.py (2 hunks)
- backends/advanced/src/advanced_omi_backend/rq_queue.py (1 hunks)
- backends/advanced/src/advanced_omi_backend/speaker_recognition_client.py (2 hunks)
- backends/advanced/src/advanced_omi_backend/transcription.py (10 hunks)
- backends/advanced/src/advanced_omi_backend/transcription_providers.py (3 hunks)
- backends/advanced/src/advanced_omi_backend/users.py (1 hunks)
- backends/advanced/start-k8s.sh (1 hunks)
- backends/advanced/start.sh (1 hunks)
- backends/advanced/tests/test_conversation_models.py (1 hunks)
- backends/advanced/webui/Dockerfile (1 hunks)
- backends/advanced/webui/src/App.tsx (2 hunks)
- backends/advanced/webui/src/components/ConversationVersionDropdown.tsx (1 hunks)
- backends/advanced/webui/src/components/ConversationVersionHeader.tsx (1 hunks)
- backends/advanced/webui/src/components/layout/Layout.tsx (2 hunks)
- backends/advanced/webui/src/pages/Conversations.tsx (10 hunks)
- backends/advanced/webui/src/pages/Queue.tsx (1 hunks)
- backends/advanced/webui/src/pages/System.tsx (2 hunks)
- backends/advanced/webui/src/services/api.ts (2 hunks)
- backends/charts/advanced-backend/values.yaml (1 hunks)
- extras/speaker-recognition/docker-compose-test.yml (1 hunks)
- extras/speaker-recognition/pyproject.toml (1 hunks)
- requirements.txt (1 hunks)
- skaffold.yaml (2 hunks)
- tests/README.md (1 hunks)
- tests/TESTING_GUIDELINES.md (1 hunks)
- tests/all_api_tests.robot (1 hunks)
- tests/browser/browser_auth.robot (1 hunks)
- tests/endpoints/*client_queue_tests.robot (1 hunks)
- tests/endpoints/auth_tests.robot (1 hunks)
- tests/endpoints/chat_tests.robot (1 hunks)
- tests/endpoints/conversation_tests.robot (1 hunks)
- tests/endpoints/health_tests.robot (1 hunks)
- tests/endpoints/memory_tests.robot (1 hunks)
- tests/endpoints/rq_queue_tests.robot (1 hunks)
- tests/endpoints/system_admin_tests.robot (1 hunks)
- tests/integration/conversation_queue.robot (1 hunks)
- tests/integration/integration_test.robot (1 hunks)
- tests/integration/mobile_client_tests.robot (1 hunks)
- tests/resources/audio_keywords.robot (1 hunks)
- tests/resources/chat_keywords.robot (1 hunks)
- tests/resources/conversation_keywords.robot (1 hunks)
- tests/resources/integration_keywords.robot (1 hunks)
- tests/resources/memory_keywords.robot (1 hunks)
- tests/resources/queue_keywords.robot (1 hunks)
- tests/resources/session_resources.robot (1 hunks)
- tests/resources/setup_resources.robot (1 hunks)
- tests/resources/timing_keywords.robot (1 hunks)
- tests/resources/transcript_verification.robot (1 hunks)
- tests/resources/user_resources.robot (1 hunks)
- tests/test_data.py (1 hunks)
- tests/test_env.py (1 hunks)
🧰 Additional context used
🧠 Learnings (1)
📚 Learning: 2025-09-18T01:42:08.562Z
Learnt from: CR
PR: AnkushMalaker/friend-lite#0
File: CLAUDE.md:0-0
Timestamp: 2025-09-18T01:42:08.562Z
Learning: Applies to backends/advanced/src/**/*.py : Use conversation_id (not audio_uuid) for all reprocessing operations and version management
Applied to files:
- backends/advanced/src/advanced_omi_backend/conversation_manager.py
- backends/advanced/src/advanced_omi_backend/controllers/conversation_controller.py
- backends/advanced/src/advanced_omi_backend/models/conversation.py
- backends/advanced/src/advanced_omi_backend/transcription.py
🪛 checkmake (0.2.2)
Makefile
[warning] 22-22: Missing required phony target "all"
(minphony)
[warning] 22-22: Missing required phony target "test"
(minphony)
🪛 dotenv-linter (3.3.0)
.env.template
[warning] 108-108: [UnorderedKey] The MONGODB_K8S_URI key should go before the MONGODB_URI key
(UnorderedKey)
🪛 markdownlint-cli2 (0.18.1)
tests/TESTING_GUIDELINES.md
13-13: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
20-20: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
27-27: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
33-33: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
41-41: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
48-48: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
54-54: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
55-55: Spaces inside emphasis markers
(MD037, no-space-in-emphasis)
55-55: Spaces inside emphasis markers
(MD037, no-space-in-emphasis)
🪛 Ruff (0.13.1)
backends/advanced/src/advanced_omi_backend/routers/modules/audio_routes.py
18-18: Do not perform function call Depends in argument defaults; instead, perform the call within the function, or read the default from a module-level singleton variable
(B008)
19-19: Do not perform function call File in argument defaults; instead, perform the call within the function, or read the default from a module-level singleton variable
(B008)
backends/advanced/src/advanced_omi_backend/speaker_recognition_client.py
60-60: f-string without any placeholders
Remove extraneous f prefix
(F541)
152-152: Do not catch blind exception: Exception
(BLE001)
153-153: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
155-155: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
backends/advanced/src/advanced_omi_backend/routers/modules/user_routes.py
33-33: Unused function argument: current_user
(ARG001)
33-33: Do not perform function call Depends in argument defaults; instead, perform the call within the function, or read the default from a module-level singleton variable
(B008)
backends/advanced/src/advanced_omi_backend/users.py
19-28: __all__ is not sorted
Apply an isort-style sorting to __all__
(RUF022)
backends/advanced/src/advanced_omi_backend/controllers/audio_controller.py
36-36: Unused function argument: auto_generate_client
(ARG001)
82-82: Do not catch blind exception: Exception
(BLE001)
86-86: Use explicit conversion flag
Replace with conversion flag
(RUF010)
156-156: Do not catch blind exception: Exception
(BLE001)
157-157: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
176-176: Do not catch blind exception: Exception
(BLE001)
177-177: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
179-179: Use explicit conversion flag
Replace with conversion flag
(RUF010)
backends/advanced/src/advanced_omi_backend/routers/modules/conversation_routes.py
46-46: Do not perform function call Depends in argument defaults; instead, perform the call within the function, or read the default from a module-level singleton variable
(B008)
backends/advanced/tests/test_conversation_models.py
133-133: Local variable version2 is assigned to but never used
Remove assignment to unused variable version2
(F841)
backends/advanced/src/advanced_omi_backend/routers/modules/queue_routes.py
25-25: Do not perform function call Depends in argument defaults; instead, perform the call within the function, or read the default from a module-level singleton variable
(B008)
43-43: Consider moving this statement to an else block
(TRY300)
45-45: Do not catch blind exception: Exception
(BLE001)
46-46: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
53-53: Do not perform function call Depends in argument defaults; instead, perform the call within the function, or read the default from a module-level singleton variable
(B008)
63-63: Abstract raise to an inner function
(TRY301)
92-92: Do not catch blind exception: Exception
(BLE001)
93-93: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
94-94: Within an except clause, raise exceptions with raise ... from err or raise ... from None to distinguish them from errors in exception handling
(B904)
99-99: Unused function argument: current_user
(ARG001)
99-99: Do not perform function call Depends in argument defaults; instead, perform the call within the function, or read the default from a module-level singleton variable
(B008)
104-104: Consider moving this statement to an else block
(TRY300)
106-106: Do not catch blind exception: Exception
(BLE001)
107-107: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
116-116: Consider moving this statement to an else block
(TRY300)
118-118: Do not catch blind exception: Exception
(BLE001)
119-119: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
122-122: Use explicit conversion flag
Replace with conversion flag
(RUF010)
backends/advanced/src/advanced_omi_backend/controllers/conversation_controller.py
143-143: Do not catch blind exception: Exception
(BLE001)
144-144: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
200-200: Consider moving this statement to an else block
(TRY300)
203-203: Redundant exception object included in logging.exception call
(TRY401)
677-677: Do not catch blind exception: Exception
(BLE001)
678-678: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
684-684: Unused function argument: audio_uuid
(ARG001)
687-687: Unused function argument: user_id
(ARG001)
710-710: Abstract raise to an inner function
(TRY301)
718-718: Abstract raise to an inner function
(TRY301)
718-718: Avoid specifying long messages outside the exception class
(TRY003)
800-800: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
818-818: Local variable audio_uuid is assigned to but never used
Remove assignment to unused variable audio_uuid
(F841)
backends/advanced/src/advanced_omi_backend/models/conversation.py
271-276: Mutable class attributes should be annotated with typing.ClassVar
(RUF012)
backends/advanced/src/advanced_omi_backend/models/user.py
95-95: Mutable class attributes should be annotated with typing.ClassVar
(RUF012)
108-108: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
backends/advanced/src/advanced_omi_backend/database.py
521-521: Redefinition of unused update_transcription_status from line 387
(F811)
522-522: PEP 484 prohibits implicit Optional
Convert to Optional[T]
(RUF013)
522-522: PEP 484 prohibits implicit Optional
Convert to Optional[T]
(RUF013)
backends/advanced/src/advanced_omi_backend/models/audio_session.py
56-61: Mutable class attributes should be annotated with typing.ClassVar
(RUF012)
backends/advanced/src/advanced_omi_backend/transcription.py
437-437: Do not catch blind exception: Exception
(BLE001)
899-899: Undefined name conversation
(F821)
tests/test_env.py
32-32: Possible hardcoded password assigned to: "TEST_USER_PASSWORD"
(S105)
backends/advanced/src/advanced_omi_backend/rq_queue.py
82-82: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
181-181: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
234-234: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
287-287: Avoid specifying long messages outside the exception class
(TRY003)
292-292: Avoid specifying long messages outside the exception class
(TRY003)
301-301: Avoid specifying long messages outside the exception class
(TRY003)
413-413: Avoid specifying long messages outside the exception class
(TRY003)
423-423: Avoid specifying long messages outside the exception class
(TRY003)
439-439: Local variable words is assigned to but never used
Remove assignment to unused variable words
(F841)
468-468: Avoid specifying long messages outside the exception class
(TRY003)
650-650: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
655-655: Unused function argument: client_id
(ARG001)
656-656: Unused function argument: user_id
(ARG001)
711-711: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
1008-1008: Do not catch blind exception: Exception
(BLE001)
1009-1009: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
1014-1014: PEP 484 prohibits implicit Optional
Convert to Optional[T]
(RUF013)
1083-1083: Do not catch blind exception: Exception
(BLE001)
1104-1104: Do not catch blind exception: Exception
(BLE001)
1105-1105: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
1135-1135: Do not catch blind exception: Exception
(BLE001)
1136-1136: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
1142-1142: Use explicit conversion flag
Replace with conversion flag
(RUF010)
backends/advanced/src/advanced_omi_backend/main.py
56-56: Redefinition of unused init_task_manager from line 52
Remove definition: init_task_manager
(F811)
336-336: Do not catch blind exception: Exception
(BLE001)
337-337: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
1075-1075: Do not catch blind exception: Exception
(BLE001)
1077-1077: Use explicit conversion flag
Replace with conversion flag
(RUF010)
backends/advanced/src/advanced_omi_backend/processors.py
144-144: Do not catch blind exception: Exception
(BLE001)
145-145: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
366-366: Unused method argument: metadata
(ARG002)
420-420: Do not catch blind exception: Exception
(BLE001)
425-425: Use explicit conversion flag
Replace with conversion flag
(RUF010)
🔇 Additional comments (46)
backends/advanced/webui/src/pages/System.tsx (1)
161-162: LGTM! Redis service display mapping added. The addition of the Redis service with the label 'REDIS & RQ WORKERS' clearly communicates both the Redis service and its associated RQ workers. This aligns well with the PR's move to RQ-based background processing.
tests/resources/timing_keywords.robot (7)
1-7: LGTM! Settings and variables properly configured. The library imports and global timing storage are appropriate for performance measurement utilities.

10-18: LGTM! Timer start logic is correct. The keyword properly captures epoch time, stores it with a namespaced key, and returns the start time.

49-69: LGTM! Timing summary logic is well-structured. The keyword correctly filters duration entries, accumulates totals, and provides formatted output for easy analysis.

71-75: LGTM! Reset logic is correct. The keyword properly clears the global timing storage.

77-85: LGTM! Convenient wrapper for timing operations. The keyword elegantly wraps timer logic around any keyword execution and returns both the result and duration.
116-155: Verify the sorting logic works correctly. Line 140 uses a complex lambda expression within `Evaluate` that mixes Robot Framework variables with Python syntax. The expression `sorted($operations, key=lambda x: $operation_times[x], reverse=True)` may not work as expected because the lambda is trying to access a Robot Framework dictionary from within the Python evaluation context.

Test this keyword with multiple operations to ensure the sorting works correctly. If it fails, consider this alternative approach:

    # Convert to Python dict and sort in pure Python
    ${sorted_ops}=    Evaluate    sorted(list(${operations}), key=lambda x: ${operation_times}[x], reverse=True)

Or use a more explicit approach:

    # Create list of tuples (operation, duration), sort, extract operations
    ${op_dur_pairs}=    Create List
    FOR    ${op}    IN    @{operations}
        ${dur}=    Get From Dictionary    ${operation_times}    ${op}
        ${pair}=    Create List    ${op}    ${dur}
        Append To List    ${op_dur_pairs}    ${pair}
    END
    ${sorted_pairs}=    Evaluate    sorted($op_dur_pairs, key=lambda x: x[1], reverse=True)
    ${sorted_ops}=    Evaluate    [x[0] for x in $sorted_pairs]
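The underlying sort this keyword is after is plain Python: order operation names by their recorded duration, descending. A minimal sketch of that logic outside Robot Framework (the operation names and timings here are made up for illustration):

```python
# Hypothetical timing data, as the keyword would have accumulated it
operation_times = {"upload": 2.4, "transcribe": 9.1, "summarize": 4.7}

# Iterating a dict yields its keys; dict.get supplies each operation's duration
# as the sort key, and reverse=True puts the slowest operation first.
sorted_ops = sorted(operation_times, key=operation_times.get, reverse=True)
print(sorted_ops)  # → ['transcribe', 'summarize', 'upload']
```

Inside `Evaluate`, the `$variable` form hands the Python object itself to the expression, which is why the `$operation_times[x]` lookup is the part worth verifying.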
157-190: LGTM! Benchmark logic is comprehensive. The keyword properly collects multiple measurements, computes meaningful statistics (average, min, max), and returns structured results for analysis.
skaffold.yaml (1)
152-155: Keep this URI aligned with the actual Service name. Once you restore the default service name (see prior comment), this value will work as-is. If you really need to rename the Service, update this host accordingly everywhere it's used.
backends/advanced/src/advanced_omi_backend/memory/prompts.py (1)
232-232: LGTM! Formatting fix improves readability. The removal of unnecessary backslash escapes corrects the prompt template formatting.
backends/advanced/docker-compose-test.yml (3)
13-13: LGTM! Source mount accelerates test iteration. Mounting the source directory allows live code updates without rebuilding the test container, improving the test development workflow.

42-42: LGTM! CORS configuration supports cross-origin test scenarios. The CORS_ORIGINS environment variable properly lists the test frontend and backend origins, enabling cross-origin requests during integration testing.

53-53: LGTM! Extended start period accommodates initialization. Increasing the healthcheck start period to 30s provides sufficient time for the test backend to initialize dependencies (MongoDB, Redis, Qdrant, RQ) before health checks begin, reducing false failures.
extras/speaker-recognition/docker-compose-test.yml (2)
22-23: LGTM! Environment file loading standardizes test configuration. Loading the .env file enables centralized configuration management for the speaker recognition test service, consistent with the PR's test infrastructure improvements.

31-31: LGTM! CORS configuration aligns with test environment. The CORS_ORIGINS setting matches the backend test configuration, ensuring the speaker recognition service properly handles cross-origin requests during integration testing.
.env.template (1)
108-108: LGTM! Database name aligns with project naming. The MongoDB Kubernetes URI now correctly references the `friend-lite` database, consistent with the project's infrastructure namespace and application naming conventions.

backends/advanced/src/advanced_omi_backend/models/__init__.py (1)

1-10: LGTM! Clear documentation for the models package. The docstring and import guidance comments provide helpful context for developers working with the models package. The structural stub approach is appropriate for a package initializer.
backends/charts/advanced-backend/values.yaml (1)
62-62: LGTM! Improved configuration documentation. The comment clearly documents that CORS_ORIGINS is managed by the ConfigMap (generated from the Makefile) and should not be overridden in the Helm values. This reduces configuration drift and centralizes CORS management.
backends/advanced/src/advanced_omi_backend/processors.py.backup (1)
633-740: Avoid double `task_done()` when consuming the sentinel. When the shutdown sentinel (`None`) is dequeued, the branch calls `self.audio_queue.task_done()` and then hits the `finally` block that calls `task_done()` again. Python's `Queue` raises `ValueError: task_done() called too many times`, so the processor crashes during shutdown. Let the `finally` handle the bookkeeping.

 if item is None:
     # Shutdown signal
     audio_logger.info("🛑 Audio processor received shutdown signal")
-    self.audio_queue.task_done()
     break

Likely an incorrect or invalid review comment.
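The `ValueError` behavior this comment relies on is easy to reproduce with the stdlib queue: every `task_done()` call must be balanced by a prior `get()`, and an extra call raises immediately.

```python
from queue import Queue

q = Queue()
q.put(None)        # shutdown sentinel
item = q.get()
q.task_done()      # balances the single get()

try:
    q.task_done()  # second call for the same item, as in the double-call path
except ValueError as e:
    print(e)       # → task_done() called too many times
```

This is why the fix moves all `task_done()` bookkeeping into one place (the `finally` block) instead of calling it on both the sentinel branch and the cleanup path.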
backends/advanced/webui/src/App.tsx (1)
72-76: Queue route registered correctly. Import and protected route wiring match the existing pages, so the new queue UI should mount with the same guardrails.
backends/advanced/src/advanced_omi_backend/routers/api_router.py (1)
12-38: Router wiring stays consistent. Including the new audio and queue routers here keeps all API modules exposed under `/api` without disrupting existing ordering.

backends/advanced/webui/Dockerfile (1)

20-22: LGTM! Build-time backend URL configuration added correctly. The addition of `VITE_BACKEND_URL` as a build argument and environment variable follows the same pattern as `VITE_ALLOWED_HOSTS` and aligns with Vite's convention for build-time environment variables.

backends/advanced/webui/src/services/api.ts (1)

6-6: LGTM! Stricter backend URL validation. The additional check for an empty string prevents the function from using an explicitly set but empty `VITE_BACKEND_URL`, ensuring proper fallback to auto-detection logic.

backends/advanced/src/advanced_omi_backend/routers/modules/__init__.py (1)

10-33: LGTM! New routers properly exported. The addition of `audio_router` and `queue_router` follows the existing pattern for module exports. The docstring is updated to document the new functional areas, and the `__all__` list maintains alphabetical ordering.

backends/advanced/webui/src/components/layout/Layout.tsx (2)

2-2: LGTM! Layers icon imported for queue navigation. The import of the `Layers` icon from lucide-react is appropriate for representing queue/job management in the UI.

19-19: LGTM! Queue management navigation added for admins. The new Queue Management navigation item is correctly placed within the `isAdmin` conditional, ensuring only administrators can access queue management features. The use of the `Layers` icon is semantically appropriate for queue visualization.
39-45: Approve: start-k8s.sh verified to exist and handle signals/failures gracefully.

backends/advanced/src/advanced_omi_backend/memory/utils.py (1)
110-119: Ignore the suggested escaping change for `array_pattern`. The existing f-string correctly escapes `\s` and `\[` (e.g. `f'"{key}"\\s*:\\s*(\\[.*?\\])'` yields the intended regex) and successfully extracts the arrays without modification.

Likely an incorrect or invalid review comment.
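The escaping claim checks out when run directly; a quick sketch with a made-up key and payload (only the f-string pattern itself comes from the reviewed code):

```python
import re

key = "entities"  # hypothetical key name, for illustration only
# Same escaping as the reviewed f-string: \\s in the source becomes \s in the
# pattern, and \\[ / \\] become the literal bracket escapes \[ and \].
array_pattern = f'"{key}"\\s*:\\s*(\\[.*?\\])'

text = '{"entities": [1, 2, 3], "other": []}'
match = re.search(array_pattern, text)
print(match.group(1))  # → [1, 2, 3]
```

The non-greedy `.*?` stops at the first closing bracket, which is what makes the capture pick up just the array for the named key.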
backends/advanced/src/advanced_omi_backend/routers/modules/user_routes.py (1)
13-13: LGTM! Correct use of UserUpdate for update operations. The change from `UserCreate` to `UserUpdate` correctly distinguishes between creation and update schemas, following REST API best practices.

Note: The static analysis warnings about unused `current_user` and `Depends` in defaults are false positives for FastAPI's dependency injection pattern: `current_user` enforces authentication, and `Depends` must be in the signature for FastAPI to process it.
tests/endpoints/auth_tests.robot (1)
1-147: Excellent test coverage and structure. The test suite provides comprehensive coverage of authentication and user management flows with proper separation of positive and negative test cases, good use of tags for organization, and appropriate cleanup of test data.
backends/advanced/src/advanced_omi_backend/conversation_manager.py (1)
72-96: Approve enqueue_final_transcription usage – the signature in rq_queue.py (conversation_id, audio_uuid, audio_path, user_id, client_id, user_email, priority) matches the call site; runtime imports and conditional enqueue logic are correct.

tests/test_data.py (2)
7-18: LGTM! Sample conversation data is well-structured with appropriate fields for testing conversation retrieval and display.

20-29: LGTM! Sample memory data provides reasonable test cases with importance scores.
backends/advanced/src/advanced_omi_backend/routers/modules/system_routes.py (2)
10-10: LGTM! Import reduction reflects the removal of audio processing endpoints from this module, correctly retaining only `APIRouter` and `Depends`.

14-14: User import paths are valid. The `User`, `UserCreate`, and `UserUpdate` classes are defined in `models/user.py`, and the legacy `users.py` re-exports them for backward compatibility. No changes required.

backends/advanced/src/advanced_omi_backend/routers/modules/conversation_routes.py (1)
44-49: LGTM! The rename from `get_conversation` to `get_conversation_detail` improves clarity, and the controller method change aligns with the refactored conversation controller that now uses Beanie-backed models.

tests/endpoints/chat_tests.robot (6)
1-12: LGTM! Test suite setup is well-structured with appropriate libraries and resources imported. The suite-level setup and teardown ensure consistent test context.

15-31: LGTM! Basic chat session creation test correctly verifies response structure and includes cleanup. The test validates all essential fields of a chat session.

199-210: LGTM! Unauthorized access test correctly verifies that authentication is required for chat endpoints, testing both session listing and statistics endpoints.

212-234: LGTM! Non-existent session test comprehensively covers all CRUD operations with appropriate 404 status code expectations.

236-254: LGTM! Invalid data test covers important validation scenarios (title length, empty title) with proper 422 status code expectations and cleanup.

256-283: LGTM! User isolation test is comprehensive, verifying that users cannot access other users' sessions and that session lists are properly scoped. This is critical for security.
tests/endpoints/health_tests.robot (1)
49-49: Fix Python-eval: remove quotes and use correct variable syntax. The current expression treats the left-hand side as a literal string, so it never matches. Use the dict value directly.
- Should Be True    '${health}[status]' in ['healthy', 'degraded', 'critical']
+ Should Be True    ${health}[status] in ['healthy', 'degraded', 'critical']

Likely an incorrect or invalid review comment.
backends/advanced/src/advanced_omi_backend/routers/modules/audio_routes.py (1)
16-35: Endpoint wiring looks solid. Auth gating, parameter declarations, and delegation to the controller are all consistent with the new RQ-based flow. Nicely done.
tests/endpoints/*client_queue_tests.robot (1)
31-114: Good coverage on queue endpoints. The pagination, stats, and health checks here map neatly onto the new queue APIs and should keep regressions in check.
backends/advanced/src/advanced_omi_backend/transcription.py (1)
503-517: Beanie versioning path looks good. Creating the initial transcript version and activating it via the model, then updating title/summary/speaker_names, aligns with the new Beanie model flow.
Based on learnings
Also applies to: 522-528
if sample_rate != 16000:
    audio_logger.warning(
        f"File {file.filename} has sample rate {sample_rate}Hz, expected 16kHz."
    )
    raise JSONResponse(status_code=400, content={"error": f"File {file.filename} has sample rate {sample_rate}Hz, expected 16kHz. I'll implement this at some point sorry"})
Don't raise JSONResponse objects
This path now raises `JSONResponse`, which will crash with `TypeError: exceptions must derive from BaseException`, converting the intended 400 into a 500 and leaving the upload flow in a bad state. Please raise an `HTTPException` (or return the response) so the client gets the correct error and cleanup still runs.
-from fastapi import BackgroundTasks, File, Query, UploadFile
+from fastapi import BackgroundTasks, File, Query, UploadFile, HTTPException
@@
- raise JSONResponse(status_code=400, content={"error": f"File {file.filename} has sample rate {sample_rate}Hz, expected 16kHz. I'll implement this at some point sorry"})
+ raise HTTPException(
+ status_code=400,
+ detail=f"File {file.filename} has sample rate {sample_rate}Hz, expected 16kHz. I'll implement this at some point sorry",
+ )

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
In backends/advanced/src/advanced_omi_backend/controllers/system_controller.py.backup:

from fastapi import BackgroundTasks, File, Query, UploadFile, HTTPException

if sample_rate != 16000:
    audio_logger.warning(
        f"File {file.filename} has sample rate {sample_rate}Hz, expected 16kHz."
    )
    raise HTTPException(
        status_code=400,
        detail=f"File {file.filename} has sample rate {sample_rate}Hz, expected 16kHz. I'll implement this at some point sorry",
    )
🤖 Prompt for AI Agents
In
backends/advanced/src/advanced_omi_backend/controllers/system_controller.py.backup
around lines 239-244, the code currently raises a JSONResponse (which is not an
Exception) and will produce a TypeError and a 500; change this to raise an
HTTPException with status_code=400 and the same error message (or alternatively
return the JSONResponse directly) so the client receives a 400 and cleanup still
runs; also ensure HTTPException is imported (e.g., from fastapi import
HTTPException) and keep the existing audio_logger.warning call.
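The `TypeError` described above holds for any object that is not a `BaseException`: Python refuses to raise it at all. A minimal stand-in demonstrates this without needing FastAPI installed (the class here is a hypothetical mock, not the real `JSONResponse`):

```python
class FakeJSONResponse:
    """Stand-in for a response object that is not an exception subclass."""
    def __init__(self, status_code, content):
        self.status_code = status_code
        self.content = content

try:
    # Raising a non-exception instance fails before any response is sent
    raise FakeJSONResponse(status_code=400, content={"error": "bad sample rate"})
except TypeError as e:
    print(e)  # → exceptions must derive from BaseException
```

Because the `TypeError` escapes the handler, the framework's generic 500 path takes over, which is exactly how the intended 400 turns into a 500.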
async def get_diarization_settings():
    """Get current diarization settings."""
    try:
        # Reload from file to get latest settings
        settings = load_diarization_settings_from_file()
        return {
            "settings": settings,
            "status": "success"
        }
    except Exception as e:
        logger.error(f"Error getting diarization settings: {e}")
        return JSONResponse(
            status_code=500, content={"error": f"Failed to get settings: {str(e)}"}
        )
Second get_diarization_settings overrides the first
The zero-arg definition here replaces the earlier get_diarization_settings(user, files, device_name, auto_generate_client) entirely. Any caller that still passes those arguments will now get TypeError: get_diarization_settings() takes 0 positional arguments but 4 were given, breaking the upload pipeline. Give these helpers distinct names (or consolidate them) so both behaviors remain callable.
-async def get_diarization_settings():
+async def get_diarization_settings_config():

and update the call sites that expect the read-only helper to use the new name.
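The override behavior described above is ordinary Python module semantics: a later `def` with the same name silently rebinds it, so the earlier signature becomes unreachable. A small sketch (names shortened and return values invented for illustration):

```python
def get_settings(user, files):
    """Original multi-arg helper."""
    return ("upload", user, files)

def get_settings():
    """Later zero-arg definition: silently replaces the one above."""
    return "read-only settings"

print(get_settings())  # → read-only settings

try:
    get_settings("alice", ["a.wav"])  # old call site, now broken
except TypeError as e:
    print(e)  # → get_settings() takes 0 positional arguments but 2 were given
```

No warning or error is emitted at definition time, which is why the collision only surfaces when a caller using the old signature runs.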
📝 Committable suggestion
async def get_diarization_settings_config():
    """Get current diarization settings."""
    try:
        # Reload from file to get latest settings
        settings = load_diarization_settings_from_file()
        return {
            "settings": settings,
            "status": "success"
        }
    except Exception as e:
        logger.error(f"Error getting diarization settings: {e}")
        return JSONResponse(
            status_code=500, content={"error": f"Failed to get settings: {str(e)}"}
        )
🤖 Prompt for AI Agents
In
backends/advanced/src/advanced_omi_backend/controllers/system_controller.py.backup
around lines 763 to 777, a zero-argument async def get_diarization_settings()
was added and overrides an earlier function signature
get_diarization_settings(user, files, device_name, auto_generate_client),
causing TypeError in callers that pass arguments; rename the new read-only
helper to a distinct name (e.g., get_diarization_settings_readonly or
load_diarization_settings) or merge logic into the original function, and then
update every call site that expects the read-only behavior to use the new name
so the original multi-arg upload pipeline function remains callable.
from advanced_omi_backend.task_manager import init_task_manager, get_task_manager
...
from advanced_omi_backend.rq_queue import redis_conn
...
from advanced_omi_backend.task_manager import init_task_manager
from advanced_omi_backend.transcript_coordinator import get_transcript_coordinator
Remove the duplicate init_task_manager import.
init_task_manager is imported twice (Lines 52 and 56), which Ruff flags with F811. Drop the second import to satisfy lint.
🧰 Tools
🪛 Ruff (0.13.1)
56-56: Redefinition of unused init_task_manager from line 52
Remove definition: init_task_manager
(F811)
🤖 Prompt for AI Agents
backends/advanced/src/advanced_omi_backend/main.py around lines 52 to 57: there
is a duplicate import of init_task_manager (imported on line 52 and again on
line 56) causing a lint F811; remove the second import statement (the redundant
"from advanced_omi_backend.task_manager import init_task_manager") so the module
imports are unique and, if needed, re-run the linter to confirm the error is
resolved.
try:
    from advanced_omi_backend.rq_queue import redis_conn
    from rq import Worker

    # Test Redis connection
    await asyncio.wait_for(asyncio.to_thread(redis_conn.ping), timeout=5.0)

    # Count active workers
    workers = Worker.all(connection=redis_conn)
    worker_count = len(workers)
    active_workers = len([w for w in workers if w.state == 'busy'])
    idle_workers = worker_count - active_workers

    health_status["services"]["redis"] = {
        "status": "✅ Connected",
        "healthy": True,
        "critical": True,
        "worker_count": worker_count,
        "active_workers": active_workers,
        "idle_workers": idle_workers
    }
except asyncio.TimeoutError:
Avoid blocking the event loop when enumerating RQ workers.
Worker.all() and each worker.state call hit Redis synchronously; running them inline inside an async endpoint will stall the loop under slow Redis. Push the whole tally into a thread:
- # Count active workers
- workers = Worker.all(connection=redis_conn)
- worker_count = len(workers)
- active_workers = len([w for w in workers if w.state == 'busy'])
- idle_workers = worker_count - active_workers
+ def summarize_workers():
+ workers = Worker.all(connection=redis_conn)
+ worker_count = len(workers)
+ active_workers = sum(1 for w in workers if w.state == "busy")
+ return worker_count, active_workers
+
+ worker_count, active_workers = await asyncio.wait_for(
+ asyncio.to_thread(summarize_workers),
+ timeout=5.0,
+ )
+ idle_workers = worker_count - active_workersThis keeps the health check responsive even if Redis is slow.
📝 Committable suggestion
```python
try:
    from advanced_omi_backend.rq_queue import redis_conn
    from rq import Worker

    # Test Redis connection
    await asyncio.wait_for(asyncio.to_thread(redis_conn.ping), timeout=5.0)

    # Count active workers without blocking the event loop
    def summarize_workers():
        workers = Worker.all(connection=redis_conn)
        worker_count = len(workers)
        active_workers = sum(1 for w in workers if w.state == "busy")
        return worker_count, active_workers

    worker_count, active_workers = await asyncio.wait_for(
        asyncio.to_thread(summarize_workers),
        timeout=5.0,
    )
    idle_workers = worker_count - active_workers

    health_status["services"]["redis"] = {
        "status": "✅ Connected",
        "healthy": True,
        "critical": True,
        "worker_count": worker_count,
        "active_workers": active_workers,
        "idle_workers": idle_workers
    }
except asyncio.TimeoutError:
```
🤖 Prompt for AI Agents
In backends/advanced/src/advanced_omi_backend/main.py around lines 1045 to 1066,
the code calls Worker.all() and inspects worker.state synchronously which can
block the event loop; move the entire Redis worker enumeration and state-tally
into a thread by wrapping the logic that calls Worker.all() and counts busy/idle
workers in asyncio.to_thread (or loop.run_in_executor), return
worker_count/active_workers/idle_workers from that threaded function, await it
in the async handler, and update health_status with the returned values while
keeping the Redis ping check and exception handling intact so the endpoint
remains non-blocking even when Redis is slow.
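As a side note, the non-blocking pattern is easy to demonstrate in isolation. A minimal standalone sketch (independent of this codebase; `summarize_workers` here is a stand-in for the synchronous `Worker.all()` and per-worker state reads):

```python
import asyncio
import time

def summarize_workers():
    # Stand-in for the synchronous Redis calls (Worker.all() + per-worker state reads).
    time.sleep(0.05)  # simulate network latency to Redis
    states = ["busy", "idle", "busy"]
    worker_count = len(states)
    active_workers = sum(1 for s in states if s == "busy")
    return worker_count, active_workers

async def health_check():
    # The blocking tally runs in a worker thread, so the event loop stays responsive.
    worker_count, active_workers = await asyncio.wait_for(
        asyncio.to_thread(summarize_workers), timeout=5.0
    )
    return {
        "worker_count": worker_count,
        "active_workers": active_workers,
        "idle_workers": worker_count - active_workers,
    }

result = asyncio.run(health_check())
print(result)
```

While `summarize_workers` sleeps, the loop is free to serve other requests; only the `await` in `health_check` waits on the thread's result.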
```python
item = await asyncio.wait_for(self.transcription_queue.get(), timeout=30.0)

if item is None:  # Shutdown signal
    self.transcription_queue.task_done()
    break

try:
    # Get or create transcription manager for client
    if item.client_id not in self.transcription_managers:
        # Import here to avoid circular imports
        audio_logger.info(
            f"🔌 Creating new transcription manager for client {item.client_id}"
        )
        manager = TranscriptionManager(
            chunk_repo=self.repository, processor_manager=self
        )
        try:
            await manager.connect(item.client_id)
            self.transcription_managers[item.client_id] = manager
            audio_logger.info(
                f"✅ Successfully created transcription manager for {item.client_id}"
            )
        except Exception as e:
            audio_logger.error(
                f"❌ Failed to create transcription manager for {item.client_id}: {e}"
            )
            # Mark transcription as failed when manager creation fails
            self.track_processing_stage(
                item.client_id, "transcription", "failed", {"error": str(e)}
            )
            self.transcription_queue.task_done()
            continue
    else:
        audio_logger.debug(
            f"♻️ Reusing existing transcription manager for client {item.client_id}"
        )

    manager = self.transcription_managers[item.client_id]

    # Process transcription chunk
    audio_logger.debug(
        f"🎵 Processing transcribe_chunk for client {item.client_id}, audio_uuid: {item.audio_uuid}"
    )

    try:
        # Add timeout for transcription processing (5 minutes)
        async with asyncio.timeout(300):  # 5 minute timeout
            await manager.transcribe_chunk(
                item.audio_uuid, item.audio_chunk, item.client_id
            )
        audio_logger.debug(
            f"✅ Completed transcribe_chunk for client {item.client_id}"
        )
    except asyncio.TimeoutError:
        audio_logger.error(
            f"❌ Transcription timeout for client {item.client_id} after 5 minutes"
        )
        # Mark transcription as failed on timeout
        self.track_processing_stage(
            item.client_id,
            "transcription",
            "failed",
            {"error": "Transcription timeout (5 minutes)"},
        )
    except Exception as e:
        audio_logger.error(
            f"❌ Error in transcribe_chunk for client {item.client_id}: {e}",
            exc_info=True,
        )
        # Mark transcription as failed when chunk processing fails
        self.track_processing_stage(
            item.client_id, "transcription", "failed", {"error": str(e)}
        )

    # Track transcription as started using direct state tracking - ONLY ONCE per audio session
    # Check if we haven't already marked this transcription as started for this audio UUID
    current_transcription_status = self.processing_state.get(
        item.client_id, {}
    ).get("transcription", {})
    current_audio_uuid = current_transcription_status.get("metadata", {}).get(
        "audio_uuid"
    )

    # Only mark as started if this is a new audio UUID or no transcription status exists
    if current_audio_uuid != item.audio_uuid:
        audio_logger.info(
            f"🎯 Starting transcription tracking for new audio UUID: {item.audio_uuid}"
        )
        self.track_processing_stage(
            item.client_id,
            "transcription",
            "started",
            {"audio_uuid": item.audio_uuid, "chunk_processing": True},
        )
    else:
        audio_logger.debug(
            f"⏩ Skipping transcription status update - already tracking audio UUID: {item.audio_uuid}"
        )

except Exception as e:
    audio_logger.error(
        f"Error processing transcription for client {item.client_id}: {e}",
        exc_info=True,
    )
finally:
    self.transcription_queue.task_done()
```
Fix transcription queue shutdown bookkeeping

The transcription loop has the same double-`task_done()` pattern on its sentinel: the branch calls `task_done()` and the `finally` block calls it again, so on shutdown the second call raises `ValueError` and leaves the task in an error state. Remove the extra call inside the branch and let the `finally` block perform the single `task_done()`:

```diff
             if item is None:  # Shutdown signal
-                self.transcription_queue.task_done()
                 break
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```python
item = await asyncio.wait_for(self.transcription_queue.get(), timeout=30.0)

if item is None:  # Shutdown signal
    break

try:
    # Get or create transcription manager for client
    if item.client_id not in self.transcription_managers:
        # Import here to avoid circular imports
        audio_logger.info(
            f"🔌 Creating new transcription manager for client {item.client_id}"
        )
        manager = TranscriptionManager(
            chunk_repo=self.repository, processor_manager=self
        )
        try:
            await manager.connect(item.client_id)
            self.transcription_managers[item.client_id] = manager
            audio_logger.info(
                f"✅ Successfully created transcription manager for {item.client_id}"
            )
        except Exception as e:
            audio_logger.error(
                f"❌ Failed to create transcription manager for {item.client_id}: {e}"
            )
            # Mark transcription as failed when manager creation fails
            self.track_processing_stage(
                item.client_id, "transcription", "failed", {"error": str(e)}
            )
            self.transcription_queue.task_done()
            continue
    else:
        audio_logger.debug(
            f"♻️ Reusing existing transcription manager for client {item.client_id}"
        )

    manager = self.transcription_managers[item.client_id]

    # Process transcription chunk
    audio_logger.debug(
        f"🎵 Processing transcribe_chunk for client {item.client_id}, audio_uuid: {item.audio_uuid}"
    )

    try:
        # Add timeout for transcription processing (5 minutes)
        async with asyncio.timeout(300):  # 5 minute timeout
            await manager.transcribe_chunk(
                item.audio_uuid, item.audio_chunk, item.client_id
            )
        audio_logger.debug(
            f"✅ Completed transcribe_chunk for client {item.client_id}"
        )
    except asyncio.TimeoutError:
        audio_logger.error(
            f"❌ Transcription timeout for client {item.client_id} after 5 minutes"
        )
        # Mark transcription as failed on timeout
        self.track_processing_stage(
            item.client_id,
            "transcription",
            "failed",
            {"error": "Transcription timeout (5 minutes)"},
        )
    except Exception as e:
        audio_logger.error(
            f"❌ Error in transcribe_chunk for client {item.client_id}: {e}",
            exc_info=True,
        )
        # Mark transcription as failed when chunk processing fails
        self.track_processing_stage(
            item.client_id, "transcription", "failed", {"error": str(e)}
        )

    # Track transcription as started using direct state tracking - ONLY ONCE per audio session
    # Check if we haven't already marked this transcription as started for this audio UUID
    current_transcription_status = self.processing_state.get(
        item.client_id, {}
    ).get("transcription", {})
    current_audio_uuid = current_transcription_status.get("metadata", {}).get(
        "audio_uuid"
    )

    # Only mark as started if this is a new audio UUID or no transcription status exists
    if current_audio_uuid != item.audio_uuid:
        audio_logger.info(
            f"🎯 Starting transcription tracking for new audio UUID: {item.audio_uuid}"
        )
        self.track_processing_stage(
            item.client_id,
            "transcription",
            "started",
            {"audio_uuid": item.audio_uuid, "chunk_processing": True},
        )
    else:
        audio_logger.debug(
            f"⏩ Skipping transcription status update - already tracking audio UUID: {item.audio_uuid}"
        )

except Exception as e:
    audio_logger.error(
        f"Error processing transcription for client {item.client_id}: {e}",
        exc_info=True,
    )
finally:
    self.transcription_queue.task_done()
```
🤖 Prompt for AI Agents
In backends/advanced/src/advanced_omi_backend/processors.py.backup around lines
765 to 872, the sentinel shutdown branch calls
self.transcription_queue.task_done() and then breaks, but the surrounding
finally block also calls task_done(), causing a double task_done() and a
ValueError on shutdown; remove the task_done() call inside the "if item is None:
# Shutdown signal" branch (leave the break so the finally block performs the
single task_done()), then run the shutdown path to verify no ValueError occurs.
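The failure mode is easy to reproduce outside the codebase. A minimal standalone repro (not from this PR) of an unbalanced `task_done()` on a shutdown sentinel:

```python
import asyncio

async def drain_with_double_task_done():
    q = asyncio.Queue()
    await q.put(None)  # shutdown sentinel
    item = await q.get()
    try:
        if item is None:
            q.task_done()  # first call balances the single get()
            return
    finally:
        q.task_done()  # second call: more task_done() than items -> ValueError

try:
    asyncio.run(drain_with_double_task_done())
    raised = False
except ValueError:
    raised = True
print(raised)
```

One `task_done()` per retrieved item is the invariant; the `finally` block alone satisfies it, which is why the sentinel branch's extra call must go.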
```robotframework
Create Session    temp_user_session    ${API_URL}    verify=True
&{headers}=    Create Dictionary    Authorization=Bearer ${token}
${response}=    GET On Session    temp_user_session    /users/me    headers=${headers}    expected_status=200
${user}=    Set Variable    ${response.json()}
Delete All Sessions
RETURN    ${user}
```
Keep other sessions intact in token helper

`Get Admin User Details With Token` ends with `Delete All Sessions`, which wipes every RequestsLibrary session, including the admin/api sessions created earlier in the suite. Any tests running after this keyword lose their authentication context. Delete only the temporary session you just created:

```diff
 ${response}=    GET On Session    temp_user_session    /users/me    headers=${headers}    expected_status=200
 ${user}=    Set Variable    ${response.json()}
-Delete All Sessions
+Delete Session    temp_user_session
 RETURN    ${user}
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```robotframework
Create Session    temp_user_session    ${API_URL}    verify=True
&{headers}=    Create Dictionary    Authorization=Bearer ${token}
${response}=    GET On Session    temp_user_session    /users/me    headers=${headers}    expected_status=200
${user}=    Set Variable    ${response.json()}
Delete Session    temp_user_session
RETURN    ${user}
```
🤖 Prompt for AI Agents
In tests/resources/user_resources.robot around lines 89 to 94, the keyword ends
with Delete All Sessions which clears every RequestsLibrary session and breaks
later tests; replace that call with a single-session deletion such as calling
Delete Session for the temporary session name (temp_user_session) so only the
temporary session is removed while leaving admin/api sessions intact.
```robotframework
Create Session    temp    ${API_URL}    verify=True
&{auth_data}=    Create Dictionary    username=${email}    password=${password}
&{headers}=    Create Dictionary    Content-Type=application/x-www-form-urlencoded

${response}=    POST On Session    temp    /auth/jwt/login    data=${auth_data}    expected_status=any
Delete All Sessions
RETURN    ${response}
```
Do not drop existing sessions during negative login test

The invalid-login helper also calls `Delete All Sessions`, clearing every active alias and breaking subsequent steps that depend on admin or API sessions. Only tear down the temporary session created for this attempt:

```diff
 ${response}=    POST On Session    temp    /auth/jwt/login    data=${auth_data}    expected_status=any
-Delete All Sessions
+Delete Session    temp
 RETURN    ${response}
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```robotframework
Create Session    temp    ${API_URL}    verify=True
&{auth_data}=    Create Dictionary    username=${email}    password=${password}
&{headers}=    Create Dictionary    Content-Type=application/x-www-form-urlencoded
${response}=    POST On Session    temp    /auth/jwt/login    data=${auth_data}    expected_status=any
Delete Session    temp
RETURN    ${response}
```
🤖 Prompt for AI Agents
In tests/resources/user_resources.robot around lines 129 to 135, the
negative-login helper currently calls "Delete All Sessions", which clears every
active alias and breaks later tests; change the teardown to remove only the
temporary session created for the test by replacing the global deletion with a
targeted call such as "Delete Session temp" (i.e., remove or replace "Delete
All Sessions" with a single-session delete for the alias created at the top of
this snippet).
```robotframework
TRY
    # Try to find and delete the user
    ${users}=    List All Users
    FOR    ${user}    IN    @{users}
        IF    "${user}[email]" == "${user_email}"
            Delete Test User    ${user}[user_id]
            Log    Deleted test user: ${user_email}    INFO
            RETURN
```
Pass an admin session when deleting users

Inside `Cleanup Test User`, the call `Delete Test User    ${user}[user_id]` omits the session argument, so Robot raises "expected 2 arguments, got 1" and the cleanup falls into the `EXCEPT` block every time. Fetch the admin session once and pass it to the delete helper:

```diff
 TRY
     # Try to find and delete the user
     ${users}=    List All Users
+    ${admin_session}=    Get Admin Session
     FOR    ${user}    IN    @{users}
         IF    "${user}[email]" == "${user_email}"
-            Delete Test User    ${user}[user_id]
+            Delete Test User    ${admin_session}    ${user}[user_id]
             Log    Deleted test user: ${user_email}    INFO
             RETURN
         END
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```robotframework
TRY
    # Try to find and delete the user
    ${users}=    List All Users
    ${admin_session}=    Get Admin Session
    FOR    ${user}    IN    @{users}
        IF    "${user}[email]" == "${user_email}"
            Delete Test User    ${admin_session}    ${user}[user_id]
            Log    Deleted test user: ${user_email}    INFO
            RETURN
```
🤖 Prompt for AI Agents
In tests/resources/user_resources.robot around lines 141 to 148, the call Delete
Test User ${user}[user_id] is missing the required admin session argument
causing a “expected 2 arguments, got 1” error; modify the TRY block to obtain
the admin session once (e.g., ${admin_session}= Get Admin Session or the
appropriate keyword) before the FOR loop and pass that session into the delete
call (Delete Test User ${user}[user_id] ${admin_session}) so the helper
receives both arguments.
tests/test_data.py
Outdated
```python
TEST_AUDIO_FILE = "tests/test_data/DIY_Experts_Glass_Blowing_16khz_mono_4min.wav"
TEST_DEVICE_NAME = "Robot-test-device"

# Expected content for transcript quality verification
EXPECTED_TRANSCRIPT = "glass blowing"
```
🧩 Analysis chain

Verify that the test audio file exists at the specified path.

The hardcoded path `tests/test_data/DIY_Experts_Glass_Blowing_16khz_mono_4min.wav` must exist for tests to succeed. Ensure this file is committed to the repository or generated during test setup.

Run this script to verify the file exists:

🏁 Script executed:

```shell
#!/bin/bash
# Check if test audio file exists
if [ -f "tests/test_data/DIY_Experts_Glass_Blowing_16khz_mono_4min.wav" ]; then
  echo "✅ Test audio file exists"
  ls -lh "tests/test_data/DIY_Experts_Glass_Blowing_16khz_mono_4min.wav"
else
  echo "❌ Test audio file not found at tests/test_data/DIY_Experts_Glass_Blowing_16khz_mono_4min.wav"
  exit 1
fi
```

Length of output: 92
Include or generate the missing test audio file

The file `tests/test_data/DIY_Experts_Glass_Blowing_16khz_mono_4min.wav` is not present, so tests will fail. Commit this asset to `tests/test_data/`, or update the test setup to generate or reference an existing audio fixture.
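If committing the asset is undesirable, the setup could synthesize a fixture instead. A minimal stdlib sketch (hypothetical filename; it writes a sine tone, not real speech, so it would not satisfy the transcript-content checks):

```python
import math
import struct
import wave

def write_tone_fixture(path: str, seconds: int = 1, rate: int = 16000) -> str:
    """Write a 16 kHz, mono, 16-bit PCM sine tone to use as a WAV fixture."""
    n = rate * seconds
    frames = b"".join(
        struct.pack("<h", int(10000 * math.sin(2 * math.pi * 440 * i / rate)))
        for i in range(n)
    )
    with wave.open(path, "wb") as w:
        w.setnchannels(1)   # mono
        w.setsampwidth(2)   # 16-bit samples
        w.setframerate(rate)
        w.writeframes(frames)
    return path

path = write_tone_fixture("tone_16khz_mono.wav")
with wave.open(path, "rb") as w:
    print(w.getnchannels(), w.getsampwidth(), w.getframerate())
```

A generated tone is enough to exercise upload validation and the processing pipeline's plumbing, though transcript-quality assertions still need real recorded speech.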
```markdown
### Test Files
- Use descriptive names that indicate the testing scope
- Example: `full_pipeline_test.robot`, `user_management_test.robot`
- Use `_test.robot` suffix for test files
```
Align the file-naming rule with the actual suite names.

The guidelines here say to use the `_test.robot` suffix, but every suite listed elsewhere in this PR, and in `tests/README.md`, uses `_tests.robot`. That contradiction will confuse contributors. Please settle on one convention and update both the guidance and the existing references to match.
🤖 Prompt for AI Agents
In tests/TESTING_GUIDELINES.md around lines 123 to 127, the file-naming guidance
currently requires a `_test.robot` suffix but other docs and suites use
`_tests.robot`; update this file to use the `_tests.robot` convention to match
the rest of the repository, and also search/update any remaining references in
tests/README.md and suite filenames to ensure consistency (or if you prefer
`_test.robot`, change all mentions and filenames across the repo to that suffix)
— pick one convention, update the wording and examples in TESTING_GUIDELINES.md
accordingly, and then rename or update references to test files so all docs and
file names use the chosen suffix.
```python
if channels == 2:
    if sample_width == 2:
        audio_array = np.frombuffer(audio_data, dtype=np.int16)
    else:
        audio_array = np.frombuffer(audio_data, dtype=np.int32)

    # Reshape to separate channels and average
    audio_array = audio_array.reshape(-1, 2)
    audio_data = np.mean(audio_array, axis=1).astype(audio_array.dtype).tobytes()
    channels = 1

# Check sample rate
if sample_rate != 16000:
    processed_files.append({
        "filename": file.filename,
        "status": "error",
        "error": f"Sample rate must be 16kHz, got {sample_rate}Hz",
    })
    continue

# Save audio file to disk
audio_uuid = str(uuid.uuid4())
audio_filename = f"{audio_uuid}.wav"
audio_path = CHUNK_DIR / audio_filename

# Write processed audio to file
with wave.open(str(audio_path), "wb") as wav_out:
    wav_out.setnchannels(channels)
    wav_out.setsampwidth(sample_width)
    wav_out.setframerate(sample_rate)
    wav_out.writeframes(audio_data)

audio_logger.info(
```
Fix stereo down-mix for non-16-bit uploads.

When a stereo WAV isn't 16-bit PCM, the code coerces the buffer to `int32` but still writes it back with the original (e.g., 1- or 3-byte) sample width. The result is a header/data mismatch that corrupts any 8-bit or 24-bit stereo upload. Please fail fast (or convert) unless `sample_width == 2`, and update the down-mix to emit 16-bit mono bytes so the persisted file matches its metadata:

```diff
-if channels == 2:
-    if sample_width == 2:
-        audio_array = np.frombuffer(audio_data, dtype=np.int16)
-    else:
-        audio_array = np.frombuffer(audio_data, dtype=np.int32)
-
-    # Reshape to separate channels and average
-    audio_array = audio_array.reshape(-1, 2)
-    audio_data = np.mean(audio_array, axis=1).astype(audio_array.dtype).tobytes()
-    channels = 1
+if channels == 2:
+    if sample_width != 2:
+        processed_files.append({
+            "filename": file.filename,
+            "status": "error",
+            "error": "Stereo files must use 16-bit PCM (2-byte sample width)",
+        })
+        continue
+
+    stereo = np.frombuffer(audio_data, dtype=np.int16).reshape(-1, 2)
+    mono = np.mean(stereo, axis=1).astype(np.int16)
+    audio_data = mono.tobytes()
+    sample_width = 2
+    channels = 1
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```python
if channels == 2:
    if sample_width != 2:
        processed_files.append({
            "filename": file.filename,
            "status": "error",
            "error": "Stereo files must use 16-bit PCM (2-byte sample width)",
        })
        continue

    stereo = np.frombuffer(audio_data, dtype=np.int16).reshape(-1, 2)
    mono = np.mean(stereo, axis=1).astype(np.int16)
    audio_data = mono.tobytes()
    sample_width = 2
    channels = 1

# Check sample rate
if sample_rate != 16000:
    processed_files.append({
        "filename": file.filename,
        "status": "error",
        "error": f"Sample rate must be 16kHz, got {sample_rate}Hz",
    })
    continue

# Save audio file to disk
audio_uuid = str(uuid.uuid4())
audio_filename = f"{audio_uuid}.wav"
audio_path = CHUNK_DIR / audio_filename

# Write processed audio to file
with wave.open(str(audio_path), "wb") as wav_out:
    wav_out.setnchannels(channels)
    wav_out.setsampwidth(sample_width)
    wav_out.setframerate(sample_rate)
    wav_out.writeframes(audio_data)

audio_logger.info(
```
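For completeness, the averaging step itself can be sanity-checked without NumPy. A stdlib-only sketch of 16-bit stereo-to-mono down-mixing (note: Python's floor division rounds negative averages down, whereas `np.mean(...).astype(np.int16)` truncates toward zero, at most a one-LSB difference):

```python
import struct

def downmix_stereo16_to_mono(audio_data: bytes) -> bytes:
    """Average interleaved 16-bit little-endian stereo samples into mono."""
    n = len(audio_data) // 2  # total number of 16-bit samples
    samples = struct.unpack("<%dh" % n, audio_data)
    # Pairs are interleaved (L, R, L, R, ...); average each frame's two channels.
    mono = [(samples[i] + samples[i + 1]) // 2 for i in range(0, n, 2)]
    return struct.pack("<%dh" % len(mono), *mono)

# Two stereo frames: (L=100, R=300) and (L=-200, R=0)
stereo = struct.pack("<4h", 100, 300, -200, 0)
mono = downmix_stereo16_to_mono(stereo)
print(struct.unpack("<2h", mono))  # -> (200, -100)
```

The key point the review makes survives here too: the output buffer is always 2-byte samples, so the WAV header's `sampwidth` and the frame data stay in agreement.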
| async def update_transcription_status( | ||
| self, audio_uuid: str, status: str, error_message: str = None, provider: str = None | ||
| ): | ||
| """Update transcript processing status for conversation.""" | ||
| update_doc = { | ||
| "transcript_processing_status": status, | ||
| "transcript_processing_updated_at": datetime.now(UTC), | ||
| "updated_at": datetime.now(UTC) | ||
| } | ||
| if provider: | ||
| update_doc["transcript_provider"] = provider | ||
| if error_message: | ||
| update_doc["transcript_processing_error"] = error_message | ||
|
|
||
| result = await self.col.update_one( | ||
| {"conversation_id": conversation_id}, | ||
| {"$set": update_doc} | ||
| ) | ||
| return result.modified_count > 0 | ||
|
|
||
|
|
||
| class ProcessingRunsRepository: | ||
| """Repository for processing run tracking (updated for conversation_id).""" | ||
|
|
||
| def __init__(self, collection): | ||
| self.col = collection | ||
|
|
||
| async def create_run( | ||
| self, | ||
| *, | ||
| conversation_id: str, | ||
| audio_uuid: str, # Keep for audio file access | ||
| run_type: str, # 'transcript' or 'memory' | ||
| user_id: str, | ||
| trigger: str, # 'manual_reprocess', 'initial_processing', etc. | ||
| config_hash: str = None | ||
| ) -> str: | ||
| """Create a new processing run for conversation.""" | ||
| run_id = str(uuid.uuid4()) | ||
| doc = { | ||
| "run_id": run_id, | ||
| "conversation_id": conversation_id, | ||
| "audio_uuid": audio_uuid, # Keep for file access | ||
| "run_type": run_type, | ||
| "user_id": user_id, | ||
| "trigger": trigger, | ||
| "config_hash": config_hash, | ||
| "status": "PENDING", | ||
| "started_at": datetime.now(UTC), | ||
| "completed_at": None, | ||
| "error_message": None, | ||
| "result_version_id": None | ||
| } | ||
| await self.col.insert_one(doc) | ||
| logger.info(f"Created processing run {run_id} for conversation {conversation_id}") | ||
| return run_id | ||
|
|
||
| async def update_run_status( | ||
| self, | ||
| run_id: str, | ||
| status: str, | ||
| error_message: str = None, | ||
| result_version_id: str = None | ||
| ) -> bool: | ||
| """Update processing run status.""" | ||
| """Update transcription status and completion timestamp. | ||
|
|
||
| Args: | ||
| audio_uuid: UUID of the audio chunk | ||
| status: New status ('PENDING', 'PROCESSING', 'COMPLETED', 'FAILED', 'EMPTY') | ||
| error_message: Optional error message if status is 'FAILED' | ||
| provider: Optional provider name for successful transcriptions | ||
| """ | ||
| update_doc = { | ||
| "status": status, | ||
| "updated_at": datetime.now(UTC) | ||
| "transcription_status": status, | ||
| "updated_at": datetime.now(UTC).isoformat() | ||
| } | ||
| if status in ["COMPLETED", "FAILED"]: | ||
| update_doc["completed_at"] = datetime.now(UTC) | ||
| if error_message: | ||
| update_doc["error_message"] = error_message | ||
| if result_version_id: | ||
| update_doc["result_version_id"] = result_version_id | ||
|
|
||
|
|
||
| if status == "COMPLETED": | ||
| update_doc["transcription_completed_at"] = datetime.now(UTC).isoformat() | ||
| if provider: | ||
| update_doc["transcription_provider"] = provider | ||
| elif status == "FAILED" and error_message: | ||
| update_doc["transcription_error"] = error_message | ||
| elif status == "EMPTY": | ||
| update_doc["transcription_completed_at"] = datetime.now(UTC).isoformat() | ||
| if provider: | ||
| update_doc["transcription_provider"] = provider | ||
| result = await self.col.update_one( | ||
| {"run_id": run_id}, | ||
| {"$set": update_doc} | ||
| {"audio_uuid": audio_uuid}, {"$set": update_doc} | ||
| ) | ||
| if result.modified_count > 0: | ||
| logger.info(f"Updated processing run {run_id} status to {status}") | ||
| return result.modified_count > 0 |
Remove the duplicate update_transcription_status.
This second definition overrides the richer version earlier in the class, so callers lose transcript-version updates and we silently drop provider / error metadata. It also triggers Ruff F811. Please delete this duplicate (or merge its logic into the versioned implementation) so the intended behavior survives.
- async def update_transcription_status(
- self, audio_uuid: str, status: str, error_message: str = None, provider: str = None
- ):
- """Update transcription status and completion timestamp.
-
- Args:
- audio_uuid: UUID of the audio chunk
- status: New status ('PENDING', 'PROCESSING', 'COMPLETED', 'FAILED', 'EMPTY')
- error_message: Optional error message if status is 'FAILED'
- provider: Optional provider name for successful transcriptions
- """
- update_doc = {
- "transcription_status": status,
- "updated_at": datetime.now(UTC).isoformat()
- }
-
- if status == "COMPLETED":
- update_doc["transcription_completed_at"] = datetime.now(UTC).isoformat()
- if provider:
- update_doc["transcription_provider"] = provider
- elif status == "FAILED" and error_message:
- update_doc["transcription_error"] = error_message
- elif status == "EMPTY":
- update_doc["transcription_completed_at"] = datetime.now(UTC).isoformat()
- if provider:
- update_doc["transcription_provider"] = provider
-
- result = await self.col.update_one(
- {"audio_uuid": audio_uuid}, {"$set": update_doc}
- )
- return result.modified_count > 0
📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
| async def update_transcription_status( | |
| self, audio_uuid: str, status: str, error_message: str = None, provider: str = None | |
| ): | |
| """Update transcript processing status for conversation.""" | |
| update_doc = { | |
| "transcript_processing_status": status, | |
| "transcript_processing_updated_at": datetime.now(UTC), | |
| "updated_at": datetime.now(UTC) | |
| } | |
| if provider: | |
| update_doc["transcript_provider"] = provider | |
| if error_message: | |
| update_doc["transcript_processing_error"] = error_message | |
| result = await self.col.update_one( | |
| {"conversation_id": conversation_id}, | |
| {"$set": update_doc} | |
| ) | |
| return result.modified_count > 0 | |
| class ProcessingRunsRepository: | |
| """Repository for processing run tracking (updated for conversation_id).""" | |
| def __init__(self, collection): | |
| self.col = collection | |
| async def create_run( | |
| self, | |
| *, | |
| conversation_id: str, | |
| audio_uuid: str, # Keep for audio file access | |
| run_type: str, # 'transcript' or 'memory' | |
| user_id: str, | |
| trigger: str, # 'manual_reprocess', 'initial_processing', etc. | |
| config_hash: str = None | |
| ) -> str: | |
| """Create a new processing run for conversation.""" | |
| run_id = str(uuid.uuid4()) | |
| doc = { | |
| "run_id": run_id, | |
| "conversation_id": conversation_id, | |
| "audio_uuid": audio_uuid, # Keep for file access | |
| "run_type": run_type, | |
| "user_id": user_id, | |
| "trigger": trigger, | |
| "config_hash": config_hash, | |
| "status": "PENDING", | |
| "started_at": datetime.now(UTC), | |
| "completed_at": None, | |
| "error_message": None, | |
| "result_version_id": None | |
| } | |
| await self.col.insert_one(doc) | |
| logger.info(f"Created processing run {run_id} for conversation {conversation_id}") | |
| return run_id | |
| async def update_run_status( | |
| self, | |
| run_id: str, | |
| status: str, | |
| error_message: str = None, | |
| result_version_id: str = None | |
| ) -> bool: | |
| """Update processing run status.""" | |
| """Update transcription status and completion timestamp. | |
| Args: | |
| audio_uuid: UUID of the audio chunk | |
| status: New status ('PENDING', 'PROCESSING', 'COMPLETED', 'FAILED', 'EMPTY') | |
| error_message: Optional error message if status is 'FAILED' | |
| provider: Optional provider name for successful transcriptions | |
| """ | |
| update_doc = { | |
| "status": status, | |
| "updated_at": datetime.now(UTC) | |
| "transcription_status": status, | |
| "updated_at": datetime.now(UTC).isoformat() | |
| } | |
| if status in ["COMPLETED", "FAILED"]: | |
| update_doc["completed_at"] = datetime.now(UTC) | |
| if error_message: | |
| update_doc["error_message"] = error_message | |
| if result_version_id: | |
| update_doc["result_version_id"] = result_version_id | |
| if status == "COMPLETED": | |
| update_doc["transcription_completed_at"] = datetime.now(UTC).isoformat() | |
| if provider: | |
| update_doc["transcription_provider"] = provider | |
| elif status == "FAILED" and error_message: | |
| update_doc["transcription_error"] = error_message | |
| elif status == "EMPTY": | |
| update_doc["transcription_completed_at"] = datetime.now(UTC).isoformat() | |
| if provider: | |
| update_doc["transcription_provider"] = provider | |
| result = await self.col.update_one( | |
| {"run_id": run_id}, | |
| {"$set": update_doc} | |
| {"audio_uuid": audio_uuid}, {"$set": update_doc} | |
| ) | |
| if result.modified_count > 0: | |
| logger.info(f"Updated processing run {run_id} status to {status}") | |
| return result.modified_count > 0 |
🧰 Tools
🪛 Ruff (0.13.1)
521-521: Redefinition of unused update_transcription_status from line 387
(F811)
522-522: PEP 484 prohibits implicit Optional
Convert to Optional[T]
(RUF013)
522-522: PEP 484 prohibits implicit Optional
Convert to Optional[T]
(RUF013)
🤖 Prompt for AI Agents
In backends/advanced/src/advanced_omi_backend/database.py around lines 521 to
551 there is a duplicate definition of update_transcription_status that
overrides the richer earlier implementation and causes loss of provider/error
metadata and a Ruff F811; remove this duplicate method (or merge its logic into
the earlier, versioned implementation) so only the intended, full-featured
update_transcription_status remains, ensuring the final method sets
transcription_status, updated_at, transcription_completed_at when appropriate,
and preserves transcription_provider and transcription_error fields, then run
tests/lint to confirm the duplicate is gone and F811 is resolved.
| async def get_user_by_client_id(client_id: str) -> Optional[User]: | ||
| """Find the user that owns a specific client_id.""" | ||
| return await User.find_one({"registered_clients.client_id": client_id}) | ||
| async def register_client_to_user( | ||
| user: User, client_id: str, device_name: Optional[str] = None | ||
| ) -> None: | ||
| """Register a client to a user and save to database.""" |
Fix client lookup query
registered_clients is stored as a dict keyed by client_id, so the path registered_clients.client_id never exists. As written, this query will always return None, breaking every caller that tries to resolve a user by client ID—most notably the audio/session flows that rely on this lookup. Please use the dynamic key when querying Mongo so the document can actually be found.
- return await User.find_one({"registered_clients.client_id": client_id})
+ return await User.find_one(
+ {f"registered_clients.{client_id}.client_id": client_id}
+ )
🤖 Prompt for AI Agents
In backends/advanced/src/advanced_omi_backend/models/user.py around lines 113 to
121, the query User.find_one({"registered_clients.client_id": client_id}) is
incorrect because registered_clients is a dict keyed by client_id; change the
query to use the dynamic key path for Mongo (e.g. build the field name as
"registered_clients.{client_id}" and query for its existence or value) so the
lookup actually matches documents; ensure you construct the query string with
the client_id inserted (and handle any necessary escaping) and use
{"registered_clients.<client_id>": {"$exists": True}} or equivalent to return
the owning user.
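Since `registered_clients` is keyed by client ID, the Mongo filter has to interpolate the ID into the field path. A sketch of the query construction (the helper name is illustrative):

```python
def client_lookup_filter(client_id: str) -> dict:
    """Return a Mongo filter matching a user document whose
    registered_clients dict has an entry under this client_id key."""
    return {f"registered_clients.{client_id}": {"$exists": True}}
```

`await User.find_one(client_lookup_filter(client_id))` would then actually match the document. Note that client IDs containing dots would need escaping, since Mongo treats `.` as a path separator.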
| # Filter jobs by user if not admin | ||
| if not current_user.is_superuser: | ||
| # Filter based on user_id in job kwargs (where RQ stores job parameters) | ||
| user_jobs = [] | ||
| for job in result["jobs"]: | ||
| job_kwargs = job.get("kwargs", {}) | ||
| if job_kwargs.get("user_id") == str(current_user.user_id): | ||
| user_jobs.append(job) | ||
| result["jobs"] = user_jobs | ||
| result["pagination"]["total"] = len(user_jobs) | ||
Filtered pagination hides the pager for regular users
Shrinking result["jobs"] after pagination is fine, but overwriting result["pagination"]["total"] with len(user_jobs) means the UI now sees total <= limit and removes the pager. Non-admins can no longer advance to later pages even if they have more jobs queued. Please filter by user before paginating (e.g., add a user_id filter inside get_jobs) or otherwise return the correct total/has_more values for that user so the client can continue paging.
🤖 Prompt for AI Agents
In backends/advanced/src/advanced_omi_backend/routers/modules/queue_routes.py
around lines 31 to 42, the code filters result["jobs"] after pagination and then
sets result["pagination"]["total"] = len(user_jobs), which hides the pager for
regular users; fix by applying the user filter before calling the
pagination/get_jobs routine (e.g., pass user_id into get_jobs or query to only
return jobs for that user), or if that is not possible compute the correct total
and has_more for the filtered set (calculate total matching jobs across all
pages or set pagination["has_more"] appropriately) so the returned pagination
reflects the true count/availability for the current user rather than the
post-filtered slice.
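If pushing the filter into `get_jobs` isn't practical, the route can at least compute pagination from the filtered set, so `total` and `has_more` describe what the user can actually page through. A sketch, assuming the job-dict shape shown above:

```python
def paginate_user_jobs(all_jobs: list[dict], user_id: str,
                       page: int, limit: int) -> dict:
    """Filter jobs to one user first, then paginate, so pagination
    metadata reflects the filtered set rather than the raw queue."""
    mine = [j for j in all_jobs
            if j.get("kwargs", {}).get("user_id") == user_id]
    start = (page - 1) * limit
    return {
        "jobs": mine[start:start + limit],
        "pagination": {
            "page": page,
            "limit": limit,
            "total": len(mine),
            "has_more": start + limit < len(mine),
        },
    }
```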
| # Check if audio file actually exists | ||
| import os | ||
| file_exists = os.path.exists(full_audio_path) | ||
| logger.info(f"🎤 DEBUG: Audio path from DB: {audio_path}") | ||
| logger.info(f"🎤 DEBUG: Full audio path: {full_audio_path}") | ||
| logger.info(f"🎤 DEBUG: Audio file exists: {file_exists}") | ||
| logger.info( | ||
| f"🎤 Getting speaker diarization with word matching for: {full_audio_path}" | ||
| ) |
Skip speaker call when file missing.
You log file_exists but still call the service. Bail out early to avoid pointless network calls and confusing errors.
- file_exists = os.path.exists(full_audio_path)
+ file_exists = os.path.exists(full_audio_path)
logger.info(f"🎤 DEBUG: Audio path from DB: {audio_path}")
logger.info(f"🎤 DEBUG: Full audio path: {full_audio_path}")
logger.info(f"🎤 DEBUG: Audio file exists: {file_exists}")
+ if not file_exists:
+ logger.warning(f"🎤 Skipping speaker recognition; file not found: {full_audio_path}")
+ speaker_result = None
+ # fall through to single-segment fallback
+ pass
+ else:
+ logger.info(
+ f"🎤 Getting speaker diarization with word matching for: {full_audio_path}"
+ )
📝 Committable suggestion
| # Check if audio file actually exists | |
| import os | |
| file_exists = os.path.exists(full_audio_path) | |
| logger.info(f"🎤 DEBUG: Audio path from DB: {audio_path}") | |
| logger.info(f"🎤 DEBUG: Full audio path: {full_audio_path}") | |
| logger.info(f"🎤 DEBUG: Audio file exists: {file_exists}") | |
| logger.info( | |
| f"🎤 Getting speaker diarization with word matching for: {full_audio_path}" | |
| ) | |
| # Check if audio file actually exists | |
| import os | |
| file_exists = os.path.exists(full_audio_path) | |
| logger.info(f"🎤 DEBUG: Audio path from DB: {audio_path}") | |
| logger.info(f"🎤 DEBUG: Full audio path: {full_audio_path}") | |
| logger.info(f"🎤 DEBUG: Audio file exists: {file_exists}") | |
| if not file_exists: | |
| logger.warning( | |
| f"🎤 Skipping speaker recognition; file not found: {full_audio_path}" | |
| ) | |
| speaker_result = None | |
| # fall through to single-segment fallback | |
| pass | |
| else: | |
| logger.info( | |
| f"🎤 Getting speaker diarization with word matching for: {full_audio_path}" | |
| ) |
🤖 Prompt for AI Agents
In backends/advanced/src/advanced_omi_backend/transcription.py around lines 400
to 409, you log whether the audio file exists but still proceed to call the
speaker-diarization service; change this to bail out early when the file is
missing: after computing file_exists, if not file_exists log a warning/error
including the full_audio_path and return a sensible default (e.g., None or an
empty diarization result consistent with the function's contract) so you avoid
the external network call and confusing downstream errors.
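The early-bail pattern the prompt describes is just a guard clause in front of the service call. A minimal sketch, where `diarize` stands in for the real speaker-service call:

```python
import os


def diarize_if_present(full_audio_path: str, diarize):
    """Skip the speaker service entirely when the audio file is missing,
    returning None so the caller can use its single-segment fallback."""
    if not os.path.exists(full_audio_path):
        # No file on disk: avoid a pointless network call.
        return None
    return diarize(full_audio_path)
```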
| IF '${user_id}' != '${None}' | ||
| Set To Dictionary ${params} user_id=${user_id} | ||
| END |
Simplify the None check; the current string comparison is brittle.
Use truthiness or an explicit None comparison; don't quote variables in the IF.
- IF '${user_id}' != '${None}'
- Set To Dictionary ${params} user_id=${user_id}
- END
+ IF $user_id
+ Set To Dictionary ${params} user_id=${user_id}
+ END
Alternatively: IF $user_id is not None.
Committable suggestion skipped: line range outside the PR's diff.
🤖 Prompt for AI Agents
In tests/resources/memory_keywords.robot around lines 15 to 17, the IF currently
compares the user_id to the string '${None}', which is brittle; change the
condition to use truthiness or an explicit None comparison and do not quote the
variable — for example use "IF ${user_id}" to run the block only when user_id
is truthy, or "IF ${user_id} is not None" for an explicit None check, leaving
the Set To Dictionary and END lines unchanged.
| ${response}= Get User Memories ${token} 1000 | ||
| Should Be Equal As Integers ${response.status_code} 200 | ||
| ${memories}= Set Variable ${response.json()} | ||
| ${count}= Get Length ${memories} | ||
| RETURN ${count} No newline at end of file |
Parse JSON via Evaluate/RequestsLibrary, not ${response.json()} literal.
${response.json()} won’t call the method in RF. Use Evaluate or To JSON.
- ${memories}= Set Variable ${response.json()}
+ ${memories}= Evaluate $response.json()
If avoiding Evaluate:
${memories}= To JSON ${response.content}🤖 Prompt for AI Agents
In tests/resources/memory_keywords.robot around lines 70 to 74, the test calls
${response.json()} which is a Python method and won't be executed by Robot
Framework; replace that usage by parsing the response body with Robot keywords
(either use Evaluate to call response.json() e.g. Evaluate response.json()
modules=requests or use To JSON ${response.content}) and then assign the
parsed JSON to ${memories} so Get Length operates on the actual list/dict.
| &{request_data}= Create Dictionary | ||
| ... model=${OPENAI_MODEL} | ||
| ... messages=${[{"role": "user", "content": "${prompt}"}]} | ||
| ... max_tokens=10 | ||
| ... temperature=0.1 | ||
Fix OpenAI payload serialization.
The messages field is being passed as a plain string, so the chat-completions call never receives a real list of role/content objects. The API will reject this payload at runtime. Please build the messages array as a proper list of dictionaries before adding it to request_data.
- &{request_data}= Create Dictionary
- ... model=${OPENAI_MODEL}
- ... messages=${[{"role": "user", "content": "${prompt}"}]}
- ... max_tokens=10
- ... temperature=0.1
+ &{user_message}= Create Dictionary role=user content=${prompt}
+ ${messages}= Create List ${user_message}
+ &{request_data}= Create Dictionary
+ ... model=${OPENAI_MODEL}
+ ... messages=${messages}
+ ... max_tokens=10
+ ... temperature=0.1
📝 Committable suggestion
| &{request_data}= Create Dictionary | |
| ... model=${OPENAI_MODEL} | |
| ... messages=${[{"role": "user", "content": "${prompt}"}]} | |
| ... max_tokens=10 | |
| ... temperature=0.1 | |
| &{user_message}= Create Dictionary role=user content=${prompt} | |
| ${messages}= Create List ${user_message} | |
| &{request_data}= Create Dictionary | |
| ... model=${OPENAI_MODEL} | |
| ... messages=${messages} | |
| ... max_tokens=10 | |
| ... temperature=0.1 |
🤖 Prompt for AI Agents
In tests/resources/transcript_verification.robot around lines 70–75, the
messages field is being added to request_data as a plain string so the OpenAI
chat-completions call never receives a real list of role/content objects;
replace that by constructing messages as an actual Robot Framework list of
dictionaries (use Create Dictionary for each message with keys role and content,
then Create List to aggregate them) and assign that list into request_data (or
use Set To Dictionary to set the messages key) so the payload contains a real
array of {role, content} objects rather than a serialized string.
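In Python terms, the payload the suggestion builds looks like this — a sketch of the expected chat-completions body (the helper name is illustrative):

```python
def build_chat_payload(model: str, prompt: str) -> dict:
    """messages must be a real list of role/content dicts, not a
    serialized string, or the API rejects the request."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 10,
        "temperature": 0.1,
    }
```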
| Delete All Sessions openai | ||
| RETURN ${similarity_score} | ||
| EXCEPT | ||
| Delete All Sessions openai | ||
| Fail Invalid similarity score from OpenAI: ${score_text} |
Use the correct RequestsLibrary keyword for cleanup.
Delete All Sessions takes no arguments; passing openai will raise a keyword argument error and abort the test. Switch to Delete Session openai (or call Delete All Sessions without arguments) so the session teardown executes reliably.
- Delete All Sessions openai
+ Delete Session openai
@@
- Delete All Sessions openai
+ Delete Session    openai
📝 Committable suggestion
| Delete All Sessions openai | |
| RETURN ${similarity_score} | |
| EXCEPT | |
| Delete All Sessions openai | |
| Fail Invalid similarity score from OpenAI: ${score_text} | |
| Delete Session openai | |
| RETURN ${similarity_score} | |
| EXCEPT | |
| Delete Session openai | |
| Fail Invalid similarity score from OpenAI: ${score_text} |
🤖 Prompt for AI Agents
In tests/resources/transcript_verification.robot around lines 87 to 91, the
teardown uses the RequestsLibrary keyword incorrectly by calling "Delete All
Sessions openai" (which accepts no arguments) causing a keyword argument
error; change that call to "Delete Session openai" (or alternatively call
"Delete All Sessions" with no arguments) in both the normal and EXCEPT branches
so the session cleanup runs without raising an argument error.
| ${error_patterns}= Create List [inaudible] [unclear] *** ERROR FAILED | ||
| FOR ${pattern} IN @{error_patterns} | ||
| Should Not Contain ${transcript_lower} ${pattern} Transcript contains error pattern: ${pattern} | ||
| END | ||
Normalize error patterns before comparison.
You lowercase the transcript but never lowercase the patterns, so values like ERROR/FAILED slip through undetected. Lowercase the pattern inside the loop before performing Should Not Contain to keep the heuristic working.
${error_patterns}= Create List [inaudible] [unclear] *** ERROR FAILED
FOR ${pattern} IN @{error_patterns}
- Should Not Contain ${transcript_lower} ${pattern} Transcript contains error pattern: ${pattern}
+ ${pattern_lower}= Convert To Lower Case ${pattern}
+ Should Not Contain ${transcript_lower} ${pattern_lower} Transcript contains error pattern: ${pattern}
END
📝 Committable suggestion
| ${error_patterns}= Create List [inaudible] [unclear] *** ERROR FAILED | |
| FOR ${pattern} IN @{error_patterns} | |
| Should Not Contain ${transcript_lower} ${pattern} Transcript contains error pattern: ${pattern} | |
| END | |
| ${error_patterns}= Create List [inaudible] [unclear] *** ERROR FAILED | |
| FOR ${pattern} IN @{error_patterns} | |
| ${pattern_lower}= Convert To Lower Case ${pattern} | |
| Should Not Contain ${transcript_lower} ${pattern_lower} Transcript contains error pattern: ${pattern} | |
| END |
🤖 Prompt for AI Agents
In tests/resources/transcript_verification.robot around lines 132 to 136, the
transcript is lowercased but the error patterns are not, allowing matches like
"ERROR"/"FAILED" to bypass the check; fix by normalizing each pattern to
lowercase inside the FOR loop (e.g., compute a lowercase version of ${pattern}
and use that in the Should Not Contain assertion) so comparisons are
case-insensitive.
refactored to use a service model for transcription, and got it working with RQ
Decluttered main, moved all processors to RQ. Streaming and batch mode enabled on audio recording
streaming segments should now be able to identify the speaker. Removed a bunch of dead code
Tidied up dead code, added a timeline to the job page so we can see time taken, got speaker recognition working on conversation trigger
created models for
Made them use the Beanie pattern to remove complex raw database saving in database.py.
Moved to using RQ for queued jobs, in order to get decent restart protection if the server dies, and to reduce the complexity of the code.
Did not move the audio chunking part or speech detection.
This is not a final PR; I need to check over the code and ensure we have tests and they work. There's still a bunch of managers and providers that I think are confusing, but I feel it's less complex for a lot of it, although the whole audio streaming / audio cropping / transcription / creation of conversation still feels complex.
Summary by CodeRabbit
New Features
Improvements
DevOps
Tests
Bug Fixes