▎ Purpose: Reduces input token costs for AI agents that load persistent memory into LLM context. Operates as a stateless compression middleware — not a memory system. Compatible with any upstream memory store. ▎ Supported memory formats: claude-md (CLAUDE.md), openclaw-md, chatgpt, google-aom, generic markdown ▎ Endpoints: ▎ POST /v1/compress — Structural compression. Strips filler phrases,…

1 subscribers
10 endpoints
The in-depth APIMemo review for this API hasn't been published yet — the data below comes straight from the public marketplace listing.

MemQ Token Compression API endpoints

MethodEndpointDescription
v1
POST /v1/bulk-optimize
/v1/bulk-optimize
Full MEMORY.md optimization with TF-IDF cosine similarity semantic deduplication, paraphrase detection, section merging, and compression. Achieves 50-95% token reduction. Uses…
POST /v1/compress/stream
/v1/compress/stream
Compress a single new memory entry for real-time ingest.
GET /v1/stats
/v1/stats
Calculate projected token cost savings for a specific LLM model and usage pattern.
POST /v1/compress
/v1/compress
Compress raw AI memory content into compact LLM-parseable format. Reduces token count by 40-55% structurally. Uses real GPT tokenizer for accurate token counts.
POST /v1/compress/incremental
/v1/compress/incremental
Add new memory entries to an already-compressed block. Uses TF-IDF cosine similarity with synonym normalization to detect duplicates and paraphrases. Only genuinely new…
POST /v1/compress/chunked
/v1/compress/chunked
Compress very large memory files (200K+ tokens) by splitting into manageable chunks and reassembling. Supports up to 10MB payloads. Uses real GPT tokenizer for accurate token…
POST /v1/memory/wrap
/v1/memory/wrap
Embed a MemQ YAML frontmatter header into any memory file without compressing it. The header contains agent instructions to automatically call /v1/recall before loading and run…
POST /v1/recall
/v1/recall
Retrieve only the memory sections relevant to a given query. Scores all sections using TF-IDF cosine similarity and returns only those that match within a token budget. Supports…
POST /v1/memory/health
/v1/memory/health
Analyze memory content for quality issues. Detects redundancy (duplicate pairs via TF-IDF cosine similarity), bloat (verbose entries that could be more compact), and staleness…
POST /v1/decompress
/v1/decompress
Reverse compression back to human-readable Markdown.

MemQ Token Compression API pricing

PlanPriceRate limitQuotas
BASIC Free
  • Requests: 30 / daily
PRO $4.99 / month 5 / minute
  • Requests: 90 / daily
ULTRA $9.99 / month 5 / minute
  • Requests: 200 / daily
MEGA $14.99 / month 20 / minute
  • Requests: 1,000 / daily

More Artificial Intelligence/Machine Learning APIs

View all →
  • An almost free AI image generation API for cost-conscious developers. including text to image, object…

    Artificial Intelligence/Machine LearningFreemium56 subscribers
  • Harness the potential (100x affordable) of OPEN AI ( with internet access ), Claude 3 , GPT-4 (at…

    Artificial Intelligence/Machine LearningFreemium8.9k subscribers
  • Professional astrology API with natal charts, transits, synastry analysis. 23 house systems, fixed stars,…

    Artificial Intelligence/Machine LearningFreemium186 subscribers
  • Detects ChatGPT, GPT4 & Gemini Content: Simple Way & High Accuracy; OpenAI Detection API; AI Essay Detector…

    Artificial Intelligence/Machine LearningFreemium1.7k subscribers
  • 100x affordable than OpenAI same AI, with Chatgpt Vision, GPT4o vision , GPT 3.5. image processing ,Text to…

    Artificial Intelligence/Machine LearningFreemium1.8k subscribers
  • The ChatGPT 4 API from PR Labs is a multi-model AI gateway hosted on RapidAPI that bundles access to GPT-4o,…

    ReviewedArtificial Intelligence/Machine LearningFreemium21.2k subscribers