Feature/workspace isolation by disillusioners · Pull Request #3011 · HKUDS/LightRAG

disillusioners · 2026-05-04T05:41:07Z

Description

Adds workspace-based data isolation across all 13 LightRAG storage backends, enabling safe multi-tenant deployments where each LightRAG instance operates in its own isolated data space.

Every LightRAG instance can now be assigned an immutable workspace identifier. All data — entities, relations, documents, indexes — is namespaced under that workspace, preventing cross-tenant data access or collision.

Supports all 13 storage backends:

| Category | Backends |
|---|---|
| Graph | NetworkX, Neo4j, Memgraph |
| Vector | FAISS, NanoVectorDB, Milvus, Qdrant |
| KV / Doc | PostgreSQL, MongoDB, Redis, OpenSearch, JSON KV, JSON DocStatus |

Isolation strategy varies by storage type:

- Shared storage backends (Neo4j, Redis, PostgreSQL, etc.): use a {workspace}:{namespace} prefix on entity/relation identifiers. Each workspace's data is isolated at the key/ID level within the same physical store.
- File-based backends (NetworkX, NanoVectorDB, FAISS, JSON KV, JSON DocStatus): use directory-based isolation via the self.workspace path. Each workspace gets its own data directory.
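
The two strategies above can be sketched as follows. This is a minimal illustration, not the PR's code: the helper names `namespaced_key` and `workspace_dir` are hypothetical, and both the way a key is appended to the prefix and the empty-workspace fallback are assumptions.

```python
from pathlib import Path


def namespaced_key(workspace: str, namespace: str, key: str) -> str:
    """Build a key of the form '{workspace}:{namespace}:{key}' for a shared store."""
    # Assumption: an empty workspace falls back to the bare namespace prefix.
    prefix = f"{workspace}:{namespace}" if workspace else namespace
    return f"{prefix}:{key}"


def workspace_dir(working_dir: str, workspace: str) -> Path:
    """Return the data directory for a workspace under working_dir."""
    base = Path(working_dir)
    # Assumption: an empty workspace uses working_dir itself.
    return base / workspace if workspace else base
```

With these helpers, two tenants writing the same entity ID to the same namespace end up with different keys, and two file-based instances sharing a `working_dir` write to different directories.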

Workspace lifecycle:

- Set at construction time: LightRAG(working_dir=..., workspace="tenant-a")
- Immutable after creation: cannot be changed on an existing instance
- Strong input sanitization: path traversal prevention, character whitelist (a-z, 0-9, -, _), length limits
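
A validation step along these lines could look like the sketch below. It is an illustration under stated assumptions, not the PR's implementation: `validate_workspace` is a hypothetical name, the 64-character limit is invented for the example (the PR does not state the value), and rejecting bad input rather than rewriting it is a design choice made here.

```python
import re

MAX_WORKSPACE_LENGTH = 64  # assumed limit; the PR does not state the actual value
_ALLOWED = re.compile(r"[a-z0-9_-]+")  # whitelist from the description: a-z, 0-9, -, _


def validate_workspace(name: str) -> str:
    """Return name unchanged if it is acceptable, otherwise raise ValueError."""
    if name == "":
        return name  # assumption: the empty string selects the default workspace
    if len(name) > MAX_WORKSPACE_LENGTH:
        raise ValueError("workspace name too long")
    # The whitelist excludes '/', '\\' and '.', so '../' style traversal
    # cannot pass this check.
    if not _ALLOWED.fullmatch(name):
        raise ValueError("workspace name contains disallowed characters")
    return name
```

Because the whitelist has no path separators or dots, traversal prevention falls out of the character check rather than needing a separate rule.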

Administrators of server-based backends can leverage existing environment variable controls (e.g., WORKSPACE_ISOLATION, {BACKEND}_WORKSPACE) alongside this feature.

Related Issues

Changes Made

- Added workspace parameter to LightRAG constructor for multi-tenant data isolation
- Implemented workspace-based key namespacing ({workspace}:{namespace}) for shared storage backends (Neo4j, Memgraph, PostgreSQL, MongoDB, Redis, OpenSearch)
- Implemented directory-based isolation for file-based backends (NetworkX, NanoVectorDB, FAISS, JSON KV, JSON DocStatus)
- Added workspace input sanitization (path traversal prevention, character whitelist, length limits)
- Ensured workspace feature works with existing server-backend environment variable controls (WORKSPACE_ISOLATION, {BACKEND}_WORKSPACE)
- Ensured full backward compatibility: existing code without workspace continues to work identically

Checklist

Changes tested locally
Code reviewed
Documentation updated (if necessary)
Unit tests added (if applicable)

Additional Notes

Test coverage: 3 test files, 1,653 lines total:

| File | Focus | Scenarios |
|---|---|---|
| `test_workspace_isolation.py` | End-to-end isolation across backends | 11 scenarios |
| `test_workspace_migration_isolation.py` | PostgreSQL migration under isolation | Migration-specific |
| `test_workspace_sanitization.py` | Cypher injection & input sanitization | Security edge cases |

Backward compatibility: Fully backward compatible. Internal separator differences between backends (: vs _) and empty-workspace normalization are preserved for compatibility.

Usage example:

```python
from lightrag import LightRAG

# Illustrative only: model and embedding configuration is omitted.
# Each tenant gets an isolated LightRAG instance
rag_tenant_a = LightRAG(working_dir="./data", workspace="tenant-a")
rag_tenant_b = LightRAG(working_dir="./data", workspace="tenant-b")

# Data is fully isolated: no cross-contamination
rag_tenant_a.insert("Alice works at Acme Corp")
rag_tenant_b.insert("Bob works at Tech Inc")

# Queries return only the tenant's own data
rag_tenant_a.query("Who works where?")  # → Alice at Acme
rag_tenant_b.query("Who works where?")  # → Bob at Tech
```

…isolation

Phase 1:
- Add WorkspaceManager class with LRU cache (max 10 instances), reference counting, per-workspace async locking, and safe eviction
- Add WorkspaceCapacityError for capacity overflow
- Add sanitize_workspace_name() utility in api/utils.py
- Add comprehensive unit tests (26 tests)

Phase 2:
- Create factory callable in lightrag_server.py capturing all 25 LightRAG constructor args
- Replace single rag instance with WorkspaceManager
- Add FastAPI lifespan handler for startup pre-warm and shutdown cleanup
- Update route factory signatures to accept workspace_mgr
- Update /health endpoint to use WorkspaceManager with try/finally release
- Reduce Neo4j default connection pool from 100 to 10
- Audit _default_workspace usage (verified safe, documented)
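
The combination of LRU caching, reference counting, and safe eviction described in Phase 1 can be sketched roughly as below. This is a simplified, synchronous illustration and not the PR's WorkspaceManager: the class name, the `acquire`/`release` methods, and the internal layout are all hypothetical, and the per-workspace async locking mentioned above is left out entirely.

```python
from collections import OrderedDict
from typing import Any, Callable


class WorkspaceCapacityError(RuntimeError):
    """Raised when the cache is full and every cached instance is still in use."""


class WorkspaceManagerSketch:
    def __init__(self, factory: Callable[[str], Any], max_instances: int = 10):
        self._factory = factory
        self._max = max_instances
        # Maps workspace -> [instance, ref_count]; order tracks recency of use.
        self._entries: "OrderedDict[str, list]" = OrderedDict()

    def acquire(self, workspace: str) -> Any:
        """Return the instance for workspace, creating it if needed, and take a reference."""
        entry = self._entries.get(workspace)
        if entry is None:
            if len(self._entries) >= self._max:
                self._evict_lru_idle()
            entry = [self._factory(workspace), 0]
            self._entries[workspace] = entry
        self._entries.move_to_end(workspace)  # mark as most recently used
        entry[1] += 1
        return entry[0]

    def release(self, workspace: str) -> None:
        """Drop one reference; the instance stays cached until it is evicted."""
        entry = self._entries[workspace]
        entry[1] = max(0, entry[1] - 1)

    def _evict_lru_idle(self) -> None:
        # Walk from least to most recently used and skip anything still referenced.
        for name, (_, ref_count) in self._entries.items():
            if ref_count == 0:
                del self._entries[name]
                return  # stop iterating immediately after mutating the dict
        raise WorkspaceCapacityError("all cached workspaces are in use")
```

A caller would pair every `acquire` with a `release` in a try/finally block, which mirrors the try/finally release that Phase 2 describes for the /health endpoint.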

…e reporting

- Wire sanitize_workspace_name() into get_workspace_from_request() (C2)
- Add defensive sanitization call in WorkspaceManager.get_or_create()
- Move _finalize_instance() outside global lock in _evict_one() and shutdown() (C3)
- Document pre-warm ref_count=1 design choice (W3)
- Fix /health endpoint to report actual queried workspace (W4)


Add comprehensive integration tests for workspace isolation at the HTTP API layer using httpx.AsyncClient with ASGITransport. Tests verify:
- Header-based workspace extraction from LIGHTRAG-WORKSPACE header
- Default workspace fallback (empty string) when no header present
- Workspace name validation (special chars, path traversal, length)
- Concurrent request isolation across different workspaces
- Background task pattern with proper ref count management
- Streaming response pattern with ref held during stream
- Capacity limit enforcement returning HTTP 503
- LRU eviction under concurrent load

Also update conftest.py to allow @pytest.mark.offline tests in tests/integration/ to run without --run-integration flag.
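
The first two behaviors in that list, header-based extraction and the empty-string fallback, might look roughly like this framework-free sketch. `workspace_from_headers` is a hypothetical name rather than the PR's function, and validation of the returned value is deliberately left out.

```python
from typing import Mapping

WORKSPACE_HEADER = "LIGHTRAG-WORKSPACE"


def workspace_from_headers(headers: Mapping[str, str]) -> str:
    """Return the requested workspace, or '' (the default workspace) if no header is set."""
    # HTTP header names are case-insensitive, so compare in lowercase.
    wanted = WORKSPACE_HEADER.lower()
    for name, value in headers.items():
        if name.lower() == wanted:
            return value.strip()
    return ""
```

In a real handler the returned name would still need to pass workspace name validation before it is used to look up an instance.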


…eError

- memgraph_impl.py: initialize memgraph_workspace and original_workspace to None before conditional block
- neo4j_impl.py: initialize original_workspace to None before conditional block (neo4j_workspace was already fixed)

These variables were only assigned inside conditional blocks but referenced unconditionally in logging statements, causing NameError when WORKSPACE_ISOLATION=true.
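
The failure mode and the fix can be reproduced in a few lines. The functions below are hypothetical stand-ins, not code from memgraph_impl.py or neo4j_impl.py; only the variable name `original_workspace` is taken from the commit message. For a local variable, Python raises UnboundLocalError, which is a subclass of NameError.

```python
def describe_buggy(isolation: bool) -> str:
    if not isolation:
        original_workspace = "from-env"
    # When isolation is True the assignment above never ran, so this
    # reference raises UnboundLocalError (a subclass of NameError).
    return f"original workspace: {original_workspace}"


def describe_fixed(isolation: bool) -> str:
    original_workspace = None  # initialized before the conditional block
    if not isolation:
        original_workspace = "from-env"
    return f"original workspace: {original_workspace}"
```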

- Add WorkspaceSelector dropdown component with auto-refresh
- Add currentWorkspace state to settings store with v20 migration
- Add Workspace type and getWorkspaces() API function
- Inject LIGHTRAG-WORKSPACE header conditionally in axios interceptor and streaming fetch
- Integrate selector into SiteHeader with proper separator handling
- Add workspace i18n keys to English locale


- Add workspace API tests (sanitizeHeader, getWorkspaces, header injection)
- Add WorkspaceSelector component logic tests (fetch, stale detection, change handling)
- Add settings store migration tests
- Export sanitizeHeader and axiosInstance for testability
- Add testing dependencies: @testing-library/react, @testing-library/jest-dom, @playwright/test, happy-dom, playwright


- Add useWorkspaceChange hook that monitors workspace changes and clears state
- Documents: clear and re-fetch document list on workspace change
- Knowledge Graph: reset graph state including isFetching flag
- Retrieval: clear query messages and history on workspace change
- Add workspaceRefreshTrigger signal in settings store
- API tab confirmed workspace-agnostic (no changes needed)

Files modified:
- src/stores/settings.ts (workspaceRefreshTrigger + triggerWorkspaceRefresh)
- src/stores/graph.ts (isFetching: false in reset)
- src/hooks/useWorkspaceChange.ts (new)
- src/App.tsx (useWorkspaceChange hook)
- src/features/DocumentManager.tsx (workspace refresh handling)
- src/features/RetrievalTesting.tsx (clear messages on workspace change)

- Add partialize to settings persist config to exclude trigger counters from localStorage, preventing stale refresh on page reload
- Move graphDataFetchAttempted/labelsFetchAttempted resets and incrementGraphDataVersion into graph.reset() for completeness
- Remove now-redundant manual calls from useWorkspaceChange hook

… functionality

Add comprehensive tests for workspace isolation features including:
- workspaceRefreshTrigger state and triggerWorkspaceRefresh() in settings store
- searchLabelDropdownRefreshTrigger state and triggerSearchLabelDropdownRefresh() in settings store
- useWorkspaceChange hook behavior
- graph store workspace isolation


The root cause was state.reset() being called inside the fetch completion handler (useLightragGraph.tsx line 377). reset() sets graphDataFetchAttempted to false, which re-triggers the fetch useEffect that checks that flag. The fix replaces state.reset() with targeted clears that preserve the fetch attempt flags (graphDataFetchAttempted, labelsFetchAttempted), preventing the fetch useEffect from re-triggering after a successful fetch. Fetch flags are only reset by the workspace change handler (useWorkspaceChange), which is the correct place for full state reset.

The previous fix for the infinite loop (commit 3cc3613) prevented state.reset() from being called in the fetch completion handler. But this broke workspace switching: after calling reset(), the fetch useEffect never re-fired because none of its React dependencies actually changed.

Root cause: two issues after workspace change:
1. graphDataVersion was not incremented, so the fetch useEffect's dependency array didn't change (isFetching was already false)
2. queryLabel stayed empty ('') because the previous fetch handler cleared it when graph data was empty. The emptyDataHandledRef guard then blocked re-fetching.

Fix: in useWorkspaceChange, after calling reset():
- Call incrementGraphDataVersion() to trigger the fetch useEffect
- Call setQueryLabel(defaultQueryLabel) to restore '*' so the fetch path is entered (avoids emptyDataHandledRef guard)

Verified with Playwright E2E:
- Initial load: 1 /graphs call
- Switch workspace: 1 /graphs call (was 0 before fix)
- Switch back: 1 /graphs call (was 0 before fix)
- No infinite loop: 0 calls during 15s watch periods
- All 86 unit tests pass

The workspace change useEffect was calling fetchPopularLabels() without await, causing bumpDropdownData() to trigger AsyncSelect remount BEFORE the popular labels were fetched and stored in SearchHistoryManager. This resulted in the combobox reading stale/empty data. Fixed by awaiting the fetchPopularLabels() call before triggering the dropdown refresh, ensuring SearchHistoryManager is populated before the component remounts and re-reads the data.


…CACHE_LIMIT env var

The LRU cache limit for workspace RAG instances was hardcoded to 10. It is now configurable via the LIGHTRAG_WORKSPACE_CACHE_LIMIT environment variable and defaults to 10. Invalid, non-numeric, or negative values fall back to 10.
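
The fallback rule can be sketched as follows. `read_cache_limit` is a hypothetical helper, not the PR's code; the `env` parameter exists only to make the sketch testable, and treating zero the same as a negative value is an assumption, since the commit message does not say how zero is handled.

```python
import os
from typing import Mapping, Optional

DEFAULT_CACHE_LIMIT = 10


def read_cache_limit(env: Optional[Mapping[str, str]] = None) -> int:
    """Read LIGHTRAG_WORKSPACE_CACHE_LIMIT, falling back to 10 on bad input."""
    source = os.environ if env is None else env
    raw = source.get("LIGHTRAG_WORKSPACE_CACHE_LIMIT", "")
    try:
        value = int(raw)
    except ValueError:
        return DEFAULT_CACHE_LIMIT  # unset, empty, or non-numeric
    # Assumption: zero is rejected along with negative values.
    return value if value > 0 else DEFAULT_CACHE_LIMIT
```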

disillusioners added 30 commits April 30, 2026 18:09

feat(api): migrate all 42 route handlers to workspace-aware resolution via WorkspaceManager

17183cb

fix(api): fix factory call mismatch, ollama streaming ref leak, dead code

8c43c00

refactor(api): consolidate workspace extraction, remove duplication

ed708b1

test(api): fix false-confidence tests, add coverage gaps for workspace isolation

a1d7ac2

fix: workspace isolation initialization + add E2E test

62b2c7f

chore: move secrets to .env-test, add .gitignore entries for .agents and secrets

6bcd95a

test: fix CI compatibility: offline markers, sync factory mocks, lint fixes

a015051

style: fix linting issues for CI

d745211

fix(api): raise HTTPException on invalid workspace instead of returning JSONResponse

af072e4

feat: add WORKSPACE_ISOLATION flag to override backend workspace env vars

4f1b115

fix: initialize neo4j_workspace before conditional block to prevent UnboundLocalError

20a4b2a

chore: remove dead code and inconsistent imports from workspace API

2a351f1

fix: harden workspace selector security and error handling

fb2b93c

- C1: Sanitize LIGHTRAG-WORKSPACE header to prevent CRLF injection
- C2: Add malformed response guard in getWorkspaces()
- W3: Reset stale workspace selection when workspace removed server-side

fix: improve workspace selector UI

41e2e34

- Remove folder icon from workspace selector dropdown
- Add tooltip on hover showing "Workspace" label
- Add spacing between LightRAG title and workspace selector

fix: scope workspace selector spacing to selector only

096527e

chore(env): add /workspaces endpoint to development API proxy configuration

a1925f3

Include /workspaces in the VITE_API_ENDPOINTS environment variable to ensure the development server correctly proxies workspace-related API requests.

fix: remove incrementGraphDataVersion from reset() to prevent infinite loop

1990c1a

fix(graph): reload labels API on workspace change

6a45d9c

disillusioners added 2 commits May 4, 2026 00:51

fix(graph): use async IIFE in workspace change useEffect for label reload

e0adfa7

disillusioners wants to merge 32 commits into HKUDS:main from disillusioners:feature/workspace-isolation
