I built MOS (MemoryOS) – a lightweight, self-hosted memory microservice for LLMs using Node.js, pgvector, and local embeddings.
A developer has created MemoryOS (MOS), a self-hosted microservice designed to manage long-term memory for large language models. The system utilizes Node.js for its backend, PostgreSQL with the pgvector extension for storing embeddings, and a separate Python service for local embedding generation. MOS incorporates a custom ranking algorithm that combines vector similarity with an importance score, includes memory expiration features, and offers basic prompt compression to reduce token usage. AI
IMPACT Provides a self-hosted solution for managing LLM context, potentially reducing reliance on external services and improving data privacy.