mempalace

Author	SHA1	Message	Date
google-labs-jules[bot]	21793cfb48	perf: optimize regex compilation in entity extraction. Move regular expression compilation to the module level in `dialect.py` to prevent repeated parsing during loop execution. Co-authored-by: igorls <4753812+igorls@users.noreply.github.com>	2026-04-14 17:43:26 +00:00
Tal Muskal	da64016a94	fix: format test_layers_bench.py with ruff to pass CI lint Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-09 08:24:51 +03:00
Igor Lins e Silva	ebc26f3960	fix: resolve formatting, regression logic, and pytest defaults. Run ruff format on all benchmark files (fixes CI lint job); fix check_regression() substring ambiguity with ordered keyword matching so "latency_improvement_pct" is correctly classified as higher-is-better; update stale comments in conftest.py referencing wrong fixture; add pytest addopts to skip benchmark/slow/stress markers by default.	2026-04-08 10:56:39 -03:00
Igor Lins e Silva	7e4db33061	fix: resolve ruff lint errors in benchmark suite Remove unused imports (shutil, string, datetime, os, yaml, time, SCALE_CONFIGS) and unused variable assignments in timing-only calls.	2026-04-08 05:10:39 -03:00
Igor Lins e Silva	e8017ca2ec	bench: add per-room recall threshold test Concentrates all drawers into a single wing+room to isolate the embedding model's retrieval limit independent of palace filtering. Confirms recall degrades to ~0.4-0.5 at 5K drawers per room even with wing+room filters applied — the spatial structure helps by keeping buckets small, but can't fix the underlying embedding ceiling.	2026-04-08 05:06:31 -03:00
Igor Lins e Silva	7b89291334	bench: add scale benchmark suite (94 tests). Benchmark mempalace at configurable scale (1K–100K drawers) to find real-world performance limits. Tests cover MCP tool OOM thresholds, ChromaDB query degradation, search recall@k, mining throughput, knowledge graph concurrency, memory leak detection, palace boost quantification, and Layer1 unbounded fetch behavior. Changes: tests/benchmarks/ with 8 test modules + data generator + report system; deterministic data factory with planted needles for recall measurement; JSON report output with regression detection (--bench-report flag); CI benchmark job on PRs at small scale; psutil added as dev dependency for RSS tracking.	2026-04-08 05:06:31 -03:00
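
Commit 7b89291334 mentions a deterministic data factory with planted needles for measuring recall@k. The repository's actual factory is not shown in this log, so the following is only a minimal sketch of the general idea. Every name in it (`make_corpus`, `recall_at_k`, `keyword_search`) is hypothetical, and a trivial keyword matcher stands in for the real embedding search.

```python
import random

# Hypothetical filler vocabulary; the real generator's data is not shown in the log.
VOCAB = ["alpha", "beta", "gamma", "delta", "omega", "sigma", "kappa", "theta"]


def make_corpus(n_docs, n_needles, seed=42):
    """Build a reproducible corpus and plant unique marker tokens in some docs."""
    rng = random.Random(seed)  # fixed seed makes every run identical
    docs = {f"doc-{i}": " ".join(rng.choices(VOCAB, k=12)) for i in range(n_docs)}
    needle_ids = rng.sample(sorted(docs), n_needles)
    queries = {}
    for j, doc_id in enumerate(needle_ids):
        marker = f"needle{j:04d}"  # token that appears in exactly one doc
        docs[doc_id] += " " + marker
        queries[marker] = doc_id  # query -> the one doc that should come back
    return docs, queries


def recall_at_k(search, queries, k=5):
    """Fraction of queries whose planted doc appears in the top-k results."""
    hits = sum(1 for q, expected in queries.items() if expected in search(q, k))
    return hits / len(queries)


def keyword_search(docs):
    """Stand-in retriever (assumption): exact token match instead of embeddings."""
    def search(query, k):
        matches = [doc_id for doc_id, text in docs.items() if query in text.split()]
        return matches[:k]
    return search


docs, queries = make_corpus(n_docs=200, n_needles=10)
score = recall_at_k(keyword_search(docs), queries, k=5)
```

Because each needle is known in advance, recall can be computed without human labels. Swapping `keyword_search` for an embedding-backed retriever is what would let the score fall below 1.0 as the corpus grows.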

6 Commits
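
Commit ebc26f3960 describes fixing a substring ambiguity in `check_regression()` through ordered keyword matching. The real implementation is not part of this log. The sketch below only illustrates the technique under assumed names: `DIRECTION_RULES`, `metric_direction`, `is_regression`, and the 10% tolerance are all hypothetical.

```python
# Hypothetical rule table. Order matters: more specific keywords come first,
# so "latency_improvement_pct" hits "improvement" before it can hit "latency".
DIRECTION_RULES = [
    ("improvement", "higher"),
    ("recall", "higher"),
    ("throughput", "higher"),
    ("latency", "lower"),
    ("rss", "lower"),
]


def metric_direction(name):
    """Return 'higher' or 'lower' (is better) from the first matching keyword."""
    lowered = name.lower()
    for keyword, direction in DIRECTION_RULES:
        if keyword in lowered:
            return direction
    return None  # unknown metric: caller decides what to do


def is_regression(name, baseline, current, tolerance=0.10):
    """Flag a change that moves in the bad direction by more than `tolerance`."""
    direction = metric_direction(name)
    if direction is None or baseline == 0:
        return False
    change = (current - baseline) / abs(baseline)
    if direction == "higher":
        return change < -tolerance  # dropped too far
    return change > tolerance  # grew too far
```

With an unordered lookup, a name containing both "latency" and "improvement" could be classified either way. Walking a fixed list top to bottom makes the result deterministic.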