Smart & Fast

XiaoZhi AI Voice Robot

An open-source AI voice robot that puts intelligence at your fingertips!
Open-source ESP32 | Zero-code integration with LLM+ASR+TTS | Multilingual dialogue + IoT control
Intelligent voice assistant development based on ESP32, providing complete customization and R&D solutions

Main Features

· Offline voice wake-up
· Multilingual speech recognition (Chinese, English, Japanese, Korean, etc.)
· Real-time voice dialogue
· Large model integration (Qwen, DeepSeek, Doubao, etc.)
· Smart home device control
· Display screen and LED light feedback

Multi-Platform Support

Supports the ESP32 chip family, including the C3, the S3 and other models, and is compatible with more than 30 development boards

Technical Architecture

· Hardware abstraction layer: Unified interfaces with singleton pattern
· Audio processing pipeline: Collection→Resampling→Encoding
· Communication protocol: WebSocket or MQTT+UDP
· AI capabilities: Voice wake-up, recognition, large model integration
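
The Collection→Resampling→Encoding pipeline can be sketched roughly as below. This is a desktop-runnable illustration, not the project's firmware code: the function names, the sine-tone "microphone", the sample rates, and the linear-interpolation resampler are all assumptions, and the encoding stage only packs samples into little-endian bytes as a stand-in for whatever codec the real firmware uses.

```cpp
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Stage 1 (collection): hypothetical stand-in for a microphone driver.
// It synthesizes a sine tone so the sketch runs without hardware.
std::vector<int16_t> CollectFrame(int sample_rate, int frame_ms, double tone_hz) {
    const std::size_t n = static_cast<std::size_t>(sample_rate) * frame_ms / 1000;
    std::vector<int16_t> pcm(n);
    const double kTwoPi = 6.283185307179586;
    for (std::size_t i = 0; i < n; ++i) {
        pcm[i] = static_cast<int16_t>(
            10000.0 * std::sin(kTwoPi * tone_hz * static_cast<double>(i) / sample_rate));
    }
    return pcm;
}

// Stage 2 (resampling): linear interpolation between neighbouring samples.
// Assumption: a simple resampler is enough for illustration; it applies no
// anti-aliasing filter.
std::vector<int16_t> Resample(const std::vector<int16_t>& in, int in_rate, int out_rate) {
    if (in.empty() || in_rate <= 0 || out_rate <= 0) return {};
    const std::size_t out_len = in.size() * static_cast<std::size_t>(out_rate) / in_rate;
    std::vector<int16_t> out(out_len);
    for (std::size_t i = 0; i < out_len; ++i) {
        const double pos = static_cast<double>(i) * in_rate / out_rate;
        const std::size_t i0 = static_cast<std::size_t>(pos);
        const std::size_t i1 = (i0 + 1 < in.size()) ? i0 + 1 : i0;  // clamp at the end
        const double frac = pos - static_cast<double>(i0);
        out[i] = static_cast<int16_t>(std::lround(in[i0] * (1.0 - frac) + in[i1] * frac));
    }
    return out;
}

// Stage 3 (encoding): placeholder that serializes samples as little-endian
// bytes. A real pipeline would call an audio codec here instead.
std::vector<uint8_t> EncodeFrame(const std::vector<int16_t>& pcm) {
    std::vector<uint8_t> bytes;
    bytes.reserve(pcm.size() * 2);
    for (int16_t s : pcm) {
        const uint16_t u = static_cast<uint16_t>(s);
        bytes.push_back(static_cast<uint8_t>(u & 0xFF));
        bytes.push_back(static_cast<uint8_t>(u >> 8));
    }
    return bytes;
}

// Chains the three stages for one frame of audio.
std::vector<uint8_t> ProcessFrame(int in_rate, int out_rate, int frame_ms) {
    return EncodeFrame(Resample(CollectFrame(in_rate, frame_ms, 440.0), in_rate, out_rate));
}
```

Calling ProcessFrame(48000, 16000, 60) would capture 2880 samples, reduce them to 960, and return 1920 bytes ready to hand to a transport layer. Keeping each stage a pure function makes it easy to swap the resampler or the encoder without touching the others.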

Development & Deployment

· Choose a compatible ESP32-S3 development board
· Set up an ESP-IDF v5.3 or later environment
· Configure, compile and flash with idf.py (for example idf.py menuconfig, idf.py build and idf.py flash)
· Deploy the companion server-side program

Design Highlights

· High compatibility: Abstract classes support multiple development boards
· Low power consumption: Unused peripherals are turned off dynamically
· Multilingual: Resources organized by language, configurations generated automatically
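
The first two highlights can be illustrated with a small sketch: an abstract board class that application code talks to, a single shared instance, and a helper that powers down whatever is not needed at the moment. Every name here (Board, Peripheral, DemoS3Board, DemoC3Board, GetBoard, the DEMO_BOARD_C3 macro) is hypothetical and chosen for illustration; the project's actual classes may look quite different, and real power control would drive hardware rather than a map.

```cpp
#include <algorithm>
#include <array>
#include <initializer_list>
#include <map>
#include <string>

// Hypothetical peripheral identifiers, not taken from the project's source.
enum class Peripheral { kDisplay, kLed, kMicrophone, kSpeaker };

constexpr std::array<Peripheral, 4> kAllPeripherals = {
    Peripheral::kDisplay, Peripheral::kLed, Peripheral::kMicrophone, Peripheral::kSpeaker};

// Abstract board interface: application code depends only on this class.
class Board {
public:
    virtual ~Board() = default;
    virtual std::string Name() const = 0;
    virtual bool HasPeripheral(Peripheral p) const = 0;

    // Records the power state; boards without the peripheral ignore the request.
    void SetPower(Peripheral p, bool on) {
        if (HasPeripheral(p)) powered_[p] = on;
    }

    bool IsPowered(Peripheral p) const {
        const auto it = powered_.find(p);
        return it != powered_.end() && it->second;
    }

    // Powers on the peripherals listed in `needed` and turns off the rest.
    void PowerDownExcept(std::initializer_list<Peripheral> needed) {
        for (Peripheral p : kAllPeripherals) {
            const bool keep = std::find(needed.begin(), needed.end(), p) != needed.end();
            SetPower(p, keep);
        }
    }

private:
    std::map<Peripheral, bool> powered_;  // simulated power rails
};

// Two illustrative boards: one with every peripheral, one without a display.
class DemoS3Board : public Board {
public:
    std::string Name() const override { return "demo-esp32s3"; }
    bool HasPeripheral(Peripheral) const override { return true; }
};

class DemoC3Board : public Board {
public:
    std::string Name() const override { return "demo-esp32c3"; }
    bool HasPeripheral(Peripheral p) const override { return p != Peripheral::kDisplay; }
};

// Singleton accessor: the concrete board is picked at compile time
// (DEMO_BOARD_C3 is an assumed build flag) and created on first use.
Board& GetBoard() {
#ifdef DEMO_BOARD_C3
    static DemoC3Board instance;
#else
    static DemoS3Board instance;
#endif
    return instance;
}
```

With this shape, a call such as GetBoard().PowerDownExcept({Peripheral::kMicrophone}) while waiting for a wake word would leave only the microphone powered, and the same call compiles unchanged for either board.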

Application Scenarios

Smart home control · Educational robots · Industrial inspection · Retail terminals and other scenarios

Open Source Sharing

Released under the MIT open-source license; it can be used in commercial products and supports custom development

Extension Directions

· Integrate TensorFlow Lite for local AI inference
· Add ESP-NOW for device networking
· Deep low-power optimization
· Integrate ESP32-CAM for visual interaction