Tag: engineering
2 articles
LLM Application Engineering: Key Practices from Demo to P...
Core lessons from moving LLM applications from prototype to production: context management, error handling, cost control, and observability. No basics, just real pitfalls.
Real-time Voice Interaction Pipeline Latency Optimization...
Documenting the process of building an ASR→LLM→TTS real-time voice pipeline: why latency is high, how pipeline concurrency reduces first-byte latency, VAD endpoint detection pitfalls, and practical co...