See how YAVIQ delivers measurable token savings across different application types. Real numbers from real tests.
Verified Savings
All numbers are from production test results
Document-heavy applications
Structured data optimization
Long conversations
Compact structured data
Plain text prompts
Note: Savings depend on input size and structure. Metrics shown are real test results. Your actual savings may vary.
Use Cases
Real-world scenarios with measurable savings
Reduce document payload size, faster responses, predictable costs
RAG compression reduces token usage while preserving semantic meaning. Perfect for document-heavy applications.
Token Savings
up to 78.6%
// Optimize RAG documents
const result = await client.optimizeRAG(documents, {
mode: "balanced",
rag_chunk_limit: 10
});
console.log(`Saved ${result.savings}% tokens`);Smaller memory footprint, controlled context growth
Compress chat history and agent memory without losing critical context. Perfect for long-running agent workflows.
Token Savings
up to 52.3%
// Compress chat history
const result = await client.optimizeChatHistory(messages, {
mode: "balanced"
});
console.log(`Compressed ${messages.length} messages`);
console.log(`Savings: ${result.savings}%`);Predictable LLM bills, metrics & dashboards
Optimize structured data (JSON, YAML, CSV) for internal tools and APIs. Real savings, real metrics.
Token Savings
up to 42.7%
// Optimize structured data
const result = await client.optimizeStructured(jsonData, {
format: "json",
mode: "balanced"
});
console.log(`Original: ${result.original_tokens} tokens`);
console.log(`Optimized: ${result.optimized_tokens} tokens`);
console.log(`Savings: ${result.reduction_percent}%`);Normalize agent-to-agent communication, reduce token bloat
Optimize agent context, memory, and inter-agent messages. Control token growth in complex multi-agent workflows.
Token Savings
up to 50%
// Optimize agent context
const result = await client.optimizeAgentContext({
messages: agentMessages,
maxMessages: 12
});
console.log(`Context compressed: ${result.reductionPercent}%`);Why YAVIQ
Get your API key and start optimizing LLM costs in minutes. Real metrics, real savings, production-ready.