mempalace

Author	SHA1	Message	Date
Sergey Kuznetsov	ae5196bc8d	Мempalace backend seam (#413 ) * refactor: add stage-1 backend abstraction seam Introduce the first upstreamable storage seam for MemPalace without bringing in the PostgreSQL spike or any benchmark artifacts. This change adds a small backend package with: - BaseCollection as the minimal collection contract - ChromaBackend/ChromaCollection as the default implementation It then routes the main runtime collection consumers through that seam: - palace.py - searcher.py - layers.py - palace_graph.py - mcp_server.py - miner.status() Behavioral constraints kept for stage 1: - ChromaDB remains the only backend and the default path - no config/env backend selection yet - no PostgreSQL code - no benchmark or research files - existing tests stay unchanged Important compatibility details: - read paths now call the seam with create=False so they still surface the existing 'no palace found' behavior instead of silently creating empty collections - write paths keep create=True semantics through palace.get_collection() - layers/searcher retain a chromadb module attribute so the existing mock-based tests can keep patching PersistentClient unchanged - ChromaBackend only creates palace directories on create=True, which preserves mocked read-path tests that use fake read-only paths Verification: - python3 -m py_compile mempalace/backends/__init__.py mempalace/backends/base.py mempalace/backends/chroma.py mempalace/palace.py mempalace/searcher.py mempalace/layers.py mempalace/palace_graph.py mempalace/mcp_server.py mempalace/miner.py - pytest -q # 529 passed, 106 deselected * refactor: clean up stage-1 seam compatibility shims Tighten the stage-1 backend abstraction branch after review. This follow-up does three small things: - keep the chromadb compatibility hook in searcher.py and layers.py, but express it through the backends.chroma module so it no longer reads like an accidental unused import - fix the palace_graph.py helper alias to avoid the local name collision flagged by ruff (imported helper vs local _get_collection wrapper) - preserve the existing mock-based test patch points unchanged while keeping the new backend seam intact Why this matters: - the direct form looked like a dead import in review, even though it was intentionally preserving the existing test seam ( and ) - palace_graph.py had a real lint issue ( redefinition) that was small but worth fixing before a public PR Verification: - /opt/homebrew/bin/ruff check mempalace/backends/__init__.py mempalace/backends/base.py mempalace/backends/chroma.py mempalace/palace.py mempalace/searcher.py mempalace/layers.py mempalace/palace_graph.py mempalace/mcp_server.py mempalace/miner.py - pytest -q tests/test_layers.py tests/test_searcher.py - pytest -q # 529 passed, 106 deselected * docs: explain backend shim imports in search paths Add short code comments in searcher.py and layers.py explaining why the module-level `chromadb` alias remains after the stage-1 backend seam refactor. The alias is intentional: it preserves the existing mock patch points used by the current test suite (`mempalace.searcher.chromadb.PersistentClient` and `mempalace.layers.chromadb.PersistentClient`) while the runtime logic now flows through the backend abstraction. This keeps the public PR easier to review because the apparent "unused import" now has an explicit reason next to it. Verification: - /opt/homebrew/bin/ruff check mempalace/searcher.py mempalace/layers.py - pytest -q tests/test_layers.py tests/test_searcher.py * refactor: reuse a default backend instance in palace helper Tighten the stage-1 backend seam by promoting the default Chroma backend adapter to a module-level singleton in `mempalace/palace.py`. This keeps the stage-1 scope unchanged — Chroma is still the only backend wired in this branch — but avoids constructing a fresh `ChromaBackend()` object on every `get_collection()` call. The backend is stateless today, so this is a readability/cleanup change rather than a behavioral one. Why this helps: - makes `palace.get_collection()` read like a real default factory instead of an inline constructor call - keeps the stage-1 branch a little cleaner before opening the public PR - does not widen the backend surface or change any config/runtime behavior Verification: - python3 -m py_compile mempalace/palace.py - pytest -q tests/test_miner.py tests/test_layers.py tests/test_searcher.py - pytest -q # 529 passed, 106 deselected * fix: harden read-only seam behavior and update seam tests Preserve the stage-1 backend abstraction while closing the real read-path regression surfaced in PR review. What changed: - make ChromaBackend.get_collection(create=False) fail fast when the palace directory does not exist instead of letting PersistentClient create it as a side effect - update miner.status() to call get_collection(..., create=False) so status keeps the historical 'No palace found' behavior - remove the temporary chromadb shim aliases from layers.py and searcher.py now that the tests patch the seam directly - add focused tests for the new backends package, including ChromaCollection delegation and ChromaBackend create=True/create=False behavior - retarget layer/searcher tests to patch the backend seam instead of patching chromadb.PersistentClient inside production modules - add a regression test that status() does not create an empty palace when the target path is missing Verification: - ruff check . - uv run pytest -q - uv run pytest -q tests/test_backends.py tests/test_cli.py tests/test_mcp_server.py tests/test_layers.py tests/test_searcher.py tests/test_miner.py Notes: - the separate benchmark/slow/stress layer was started as a soak but not used as the merge gate for this PR branch * refactor: drop duplicate mcp collection cache declaration Remove a redundant `_collection_cache = None` assignment in `mempalace/mcp_server.py` left over after the stage-1 backend seam refactor. This does not change behavior; it only trims review noise in the MCP server module after the read-path hardening pass. Verification: - ruff check mempalace/mcp_server.py - uv run pytest -q tests/test_mcp_server.py --------- Co-authored-by: Sergey Kuznetsov <sergey@iterudit.com>	2026-04-11 16:16:49 -07:00
grtninja	154e8a78ec	fix: implement MCP ping health checks (#600 )	2026-04-11 16:16:37 -07:00
Arnold Wender	89c0a58271	fix: align cmd_compress dict keys with compression_stats() return values (#569 ) * fix: align cmd_compress dict keys with compression_stats() return values * test: align compress test mocks with actual compression_stats() keys * fix: address review — add Total: assertion, move stats key test to test_dialect.py	2026-04-11 16:16:31 -07:00
Ahmad Othman Ammar Adi.	9c4b7302cc	fix: skip unreachable reparse points in detect_rooms_from_folders (#558 ) On Windows, projects containing git-submodule junctions or dev-drive reparse points cause iterdir() to list the entry successfully but Path.is_dir() to raise OSError when it calls stat() internally. Reproducer: any Windows project with a submodule checked out as a junction (e.g. skills/pr-perfect) crashes mempalace init with: OSError: [WinError 448] The path cannot be traversed because it contains an untrusted mount point Fix: wrap every is_dir() call in detect_rooms_from_folders with try/except OSError so the scanner skips inaccessible entries and continues rather than aborting. Covers both the top-level pass and the one-level-deep nested pass. Two new tests mock the OSError on specific paths and verify the function returns correct rooms from the remaining accessible entries.	2026-04-11 16:16:06 -07:00
Ben Sigman	ad806cf3f8	Merge branch 'main' into fix/query-sanitizer-prompt-contamination	2026-04-10 22:39:31 -07:00
MSL	e30c283fd8	style: ruff format Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 08:49:35 -07:00
MSL	15c5a528ed	test: add 33 tests for repair.py and dedup.py - 18 tests for repair (scan, prune, rebuild, edge cases) - 15 tests for dedup (grouping, dedup logic, wing filter, stats) - Fixes coverage drop from adding new modules Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 08:45:27 -07:00
Kevin Pulikkottil	2981433535	fix: add mcp command with setup guidance (#315 ) * fix: add mcp command with setup guidance * fix: include --palace guidance in mcp command output * fix: make mcp guidance commands copy-pastable --------- Co-authored-by: Milla J <millaj1217@gmail.com>	2026-04-09 11:21:18 -07:00
bensig	b1adc047e6	fix: address Octocode review — move size check, add tests for all 3 fixes - Move file size check before try block so IOError propagates cleanly (not caught by the except OSError handler below it) - Wrap os.path.getsize in its own try/except to preserve existing test_normalize_io_error behavior on missing files - Add test_normalize_rejects_large_file (mocked getsize) - Add test_null_arguments_does_not_hang (#394) - Add test_cmd_repair_trailing_slash_does_not_recurse (#395) 532 tests pass locally, 0 regressions.	2026-04-09 10:40:53 -07:00
bensig	58b8d5b198	fix: release ChromaDB handles before rmtree on Windows	2026-04-09 09:31:55 -07:00
bensig	1c48f4d2c3	fix: use os.utime in mtime test for Windows compatibility	2026-04-09 09:23:08 -07:00
Ben Sigman	e293e290d5	Merge branch 'main' into fix/mcp-protocol-version-negotiation	2026-04-09 09:15:06 -07:00
bensig	2448ac0026	test: add coverage for file_already_mined mtime check Covers the check_mtime=True path in palace.py to meet 85% coverage threshold.	2026-04-09 08:56:28 -07:00
Ben Sigman	725fa2b6f1	Merge branch 'main' into fix/query-sanitizer-prompt-contamination	2026-04-09 08:11:39 -07:00
Ben Sigman	70f2160bd6	Merge branch 'main' into fix/mcp-protocol-version-negotiation	2026-04-09 08:09:57 -07:00
matrix9neonebuchadnezzar2199-sketch	7509a72502	fix: mitigate system prompt contamination in search queries (#333 ) Addresses Issue #333: AI agents prepending system prompts to search queries causes embedding retrieval to collapse (89.8% → 1.0% R@10). Mitigation approach (減災): - New query_sanitizer.py with 4-stage pipeline: Step 1: passthrough for short queries (≤200 chars) Step 2: question extraction (finds ? sentences) → ~85-89% recovery Step 3: tail sentence extraction → ~80-89% recovery Step 4: tail truncation fallback → ~70-80% recovery Worst case without sanitizer: 1.0% (catastrophic) Worst case with sanitizer: ~70-80% (survivable) - mcp_server.py: tool_search applies sanitizer before ChromaDB query - MCP schema: query description warns agents not to include prompts - New 'context' parameter separates background info from search intent - Sanitizer metadata included in response when triggered 22 new tests covering all pipeline stages and real-world scenarios. Made-with: Cursor	2026-04-09 23:28:59 +09:00
Tal Muskal	da64016a94	fix: format test_layers_bench.py with ruff to pass CI lint Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-09 08:24:51 +03:00
virgil-at-biocompute	950d52baf2	fix: negotiate MCP protocol version instead of hardcoding The initialize handler hardcoded protocolVersion "2024-11-05", which causes newer MCP clients (e.g. Claude Code) to reject the connection when they negotiate "2025-11-25" or later. Echo the client's requested version if it is in the supported set, otherwise fall back to the latest supported version. This keeps backwards compatibility with older clients while allowing newer ones to connect. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 22:57:32 -04:00
Ben Sigman	d26606b2f9	Merge branch 'main' into main	2026-04-08 14:07:33 -07:00
Igor Lins e Silva	c4e52954fe	Merge upstream/main into bench/scale-test-suite to resolve conflicts Merged both the PR's benchmark suite additions (psutil dep, pytest markers, --ignore=tests/benchmarks) and upstream's coverage changes (pytest-cov, --cov-fail-under=30, coverage config) so both coexist. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2026-04-08 16:28:06 -03:00
Tal Muskal	28de031f25	fix: remove stale palace_path reference in test helper _patch_mcp_server had palace_path removed from its signature but the assertion body still referenced it, causing NameError at runtime and F821 from ruff. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 22:07:46 +03:00
Tal Muskal	dbf456b73b	Merge branch 'main' into main	2026-04-08 22:02:50 +03:00
Tal Muskal	abd52534bb	test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11 - Add tests for config, convo_miner, spellcheck, knowledge_graph - Fix Windows PermissionError in test cleanup (chromadb file locks) - Add UTF-8 encoding to split_mega_files, entity_registry, hooks_cli - Fix mcp_server parse_known_args logging for unknown args - Set coverage threshold to 85 in pyproject.toml and CI - Reset all version files to 3.0.11 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 21:38:12 +03:00
Igor Lins e Silva	a0bcd0c836	fix: ruff format test_hooks_cli.py and test_knowledge_graph.py	2026-04-08 15:12:12 -03:00
Igor Lins e Silva	af42a850f6	fix: split semicolon statements onto two lines for ruff E702	2026-04-08 15:11:55 -03:00
Igor Lins e Silva	bf88daa649	fix: address review — re-mine modified files, idempotent add_drawer, cleanup ChromaDB handles	2026-04-08 15:11:55 -03:00
Igor Lins e Silva	a4149ab248	fix: use upsert and deterministic IDs to prevent data stagnation MCP tool_add_drawer: - Make drawer_id content-based: hash full content instead of content[:100] + timestamp. Same content → same ID, eliminating TOCTOU race conditions - Switch from col.add() to col.upsert() so re-filing with updated content updates the existing drawer miner.add_drawer: - Switch from collection.add() to collection.upsert() so re-mining a modified file updates instead of silently failing - Remove the try/except catching 'already exists' — upsert handles this naturally Findings: #11 (HIGH — add ignores updates), #6 (MEDIUM — TOCTOU), #13 (MEDIUM — non-deterministic IDs) Includes test infrastructure from PR #131. 92 tests pass.	2026-04-08 15:11:55 -03:00
Tal Muskal	9ca70264f3	style: format test files with ruff Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 21:08:49 +03:00
Tal Muskal	e24d8ca733	test: expand coverage to 70%, fix mcp_server CI crash (threshold 60%) Add/expand tests for normalize (39%→97%), searcher (39%→100%), layers (28%→97%), split_mega_files (34%→72%). Fix mcp_server.py parse_args→parse_known_args to prevent SystemExit when imported during pytest (CI was crashing on all test jobs). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 21:07:03 +03:00
Tal Muskal	03e9b57108	test: add comprehensive test coverage (35% → 58%, threshold 50%) Add 180+ new tests across 10 test files covering previously untested modules: - instructions_cli (0% → 100%), hooks_cli (73% → 96%), spellcheck (28% → 84%) - palace_graph (9% → 91%), general_extractor (0% → 92%), entity_detector (0% → 69%) - entity_registry (0% → 70%), room_detector_local (0% → 55%), layers (0% → 28%) - onboarding (0% → 36%) Also fixes Windows encoding bug in onboarding.py (write_text without encoding="utf-8"). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 20:54:56 +03:00
Ben Sigman	59d011a23b	Merge pull request #270 from tmuskal/main Package MemPalace as standard Claude and Codex plugins with easy installation	2026-04-08 10:41:45 -07:00
Tal Muskal	9de302f881	feat: update README and CI configuration, add tests for hooks functionality	2026-04-08 20:40:03 +03:00
Igor Lins e Silva	ebc26f3960	fix: resolve formatting, regression logic, and pytest defaults - Run ruff format on all benchmark files (fixes CI lint job) - Fix check_regression() substring ambiguity: ordered keyword matching so "latency_improvement_pct" is correctly classified as higher-is-better - Update stale comments in conftest.py referencing wrong fixture - Add pytest addopts to skip benchmark/slow/stress markers by default	2026-04-08 10:56:39 -03:00
Igor Lins e Silva	7e4db33061	fix: resolve ruff lint errors in benchmark suite Remove unused imports (shutil, string, datetime, os, yaml, time, SCALE_CONFIGS) and unused variable assignments in timing-only calls.	2026-04-08 05:10:39 -03:00
Igor Lins e Silva	e8017ca2ec	bench: add per-room recall threshold test Concentrates all drawers into a single wing+room to isolate the embedding model's retrieval limit independent of palace filtering. Confirms recall degrades to ~0.4-0.5 at 5K drawers per room even with wing+room filters applied — the spatial structure helps by keeping buckets small, but can't fix the underlying embedding ceiling.	2026-04-08 05:06:31 -03:00
Igor Lins e Silva	7b89291334	bench: add scale benchmark suite (94 tests) Benchmark mempalace at configurable scale (1K–100K drawers) to find real-world performance limits. Tests cover MCP tool OOM thresholds, ChromaDB query degradation, search recall@k, mining throughput, knowledge graph concurrency, memory leak detection, palace boost quantification, and Layer1 unbounded fetch behavior. - tests/benchmarks/ with 8 test modules + data generator + report system - Deterministic data factory with planted needles for recall measurement - JSON report output with regression detection (--bench-report flag) - CI benchmark job on PRs at small scale - psutil added as dev dependency for RSS tracking	2026-04-08 05:06:31 -03:00
Igor Lins e Silva	47696bef8c	fix: address Copilot review — derive MCP version, improve test isolation and portability	2026-04-08 04:41:03 -03:00
Igor Lins e Silva	a67b00d7c7	perf: cache ChromaDB PersistentClient instead of re-instantiating per call The MCP server previously created a new PersistentClient on every tool call via _get_collection(). This incurs HNSW index loading overhead on each request. Cache the client and collection at module level. The cache resets naturally on process restart (MCP runs as a subprocess). Also adds a _reset_mcp_cache fixture to conftest.py for test isolation. Includes test infrastructure from PR #131. 92 tests pass.	2026-04-08 04:39:19 -03:00
Ben Sigman	a8de2911e5	Merge pull request #136 from igorls/fix/kg-hardening fix: enable SQLite WAL mode and add consistent LIMIT to KG timeline	2026-04-07 16:05:13 -07:00
Igor Lins e Silva	d3145e9a7b	fix: update dialect tests for PR #147 stats API and remove unused fixture param	2026-04-07 18:58:25 -03:00
Igor Lins e Silva	6fa985eac2	fix: update dialect tests for PR #147 stats API and remove unused fixture param	2026-04-07 18:58:20 -03:00
Igor Lins e Silva	b45bff9db1	test: add WAL mode and entity timeline limit assertions	2026-04-07 18:27:19 -03:00
Igor Lins e Silva	5ac4947d02	fix: preserve CLI exit codes, log tracebacks, sanitize search errors, validate fixture	2026-04-07 18:26:39 -03:00
Ben Sigman	27623a3b17	Merge pull request #131 from igorls/test/expand-coverage-and-uv-migration test: expand coverage from 20 to 92 tests, migrate to uv	2026-04-07 14:15:01 -07:00
Igor Lins e Silva	96de23cd97	fix: CI failures — update workflow for uv migration, fix lint and format - Switch CI install step from `pip install -r requirements.txt` to `pip install -e ".[dev]"` since requirements.txt was removed - Add noqa: E402 to intentionally-late imports in conftest.py (HOME must be isolated before mempalace imports) - Remove unused KnowledgeGraph import in test_knowledge_graph.py - Apply ruff formatting to test files	2026-04-07 17:59:21 -03:00
Ben Sigman	3068f75c2d	Merge pull request #22 from sheetsync/bugfix/split-known-names-loading refactor: consolidate split known-names config loading	2026-04-07 13:58:54 -07:00
Igor Lins e Silva	cd8b245fdc	fix: address Copilot review — remove unused imports, isolate HOME in tests, restore dev extra	2026-04-07 17:55:10 -03:00
Igor Lins e Silva	72c548b729	test: expand coverage from 20 to 92 tests, migrate to uv - Migrate from setuptools to hatchling build backend - Add dependency-groups (PEP 735) for dev tooling (pytest, ruff) - Remove redundant requirements.txt in favor of uv.lock - Fix __version__ mismatch (2.0.0 -> 3.0.0 to match pyproject.toml) New test files: - conftest.py: shared fixtures (isolated palace, KG, ChromaDB collection) - test_knowledge_graph.py: 17 tests (entity CRUD, temporal queries, timeline) - test_mcp_server.py: 25 tests (protocol dispatch, read/write/KG/diary tools) - test_searcher.py: 7 tests (search_memories API, filters, error handling) - test_dialect.py: 13 tests (AAAK compression, entity/emotion detection, zettel encoding) All 92 tests pass on Python 3.13 with chromadb 0.6.3.	2026-04-07 17:55:10 -03:00
Ben Sigman	e8f9b47e31	Merge pull request #16 from sheetsync/bugfix/version-consistency fix: unify package and MCP version reporting	2026-04-07 13:54:03 -07:00
ac-opensource	c8c220d789	fix: support nested .gitignore rules during mining	2026-04-08 00:02:21 +08:00

1 2 3 4 5

204 Commits