Emcie Documentation

Emcie is an auto-optimizing inference platform for Parlant agents. It automatically optimizes your agent's inference through dynamic prompt tuning and SLM distillation, delivering LLM-level accuracy at significantly reduced costs.

Contents


Getting Started

Get up and running with Emcie in minutes. Set your API key, configure Parlant to use Emcie's NLP service, and deploy your agent.


Pricing

Emcie uses token-based pricing with significant savings as your agent transitions from Teacher to Student models.


Rate Limits

API rate limits protect the platform and ensure fair access for all users. Limits increase as your usage grows.


Model Tiers

Choose the right balance between cost and accuracy for your use case.


Model Roles

Understand how Emcie's Teacher/Student architecture automatically optimizes your inference costs.


Need Help?