🌟 About the Project
In this project, we will control a simple robot by using voice commands using the XIAO nRF52840 Sense that has a built-in microphone. In this project, we use the micro_speech library, and the received commands are displayed on the old screen.
📷 Pictures

🎯 Features
- 4 Different Voice Commands
- Show all received commands on Oled Display
🧰 Getting Started
We will use XIAO nRF52840 Sense as a processor. and we will add an OLED display to show received commands. Also with the Help of the DRV8833 Dual Motor Driver Module, we were able to control our DC Motors as Voice Commands were received.
- XIAO nRF52840 Sense
- DRV8833 Dual Motor Driver
- Oled 0.96 SSD1306
🔌 Schematic
Make the connections according to the table and schematic below.
- XIAO nRF52840 and DRV8833.
We Should Connect STBY pin to VCC pin.
- Complete Schematic

⚙️ Installation
Before uploading the code you need to install the required library in Arduino IDE. Follow these steps:
- Follow this path Sketch> Include Library> Manage Libraries
- Search for Adafruit SSD1306
- Install the library
-Then search for the “GFX” and install it also.
👀 Usage
After completing all the steps, the robot moves by receiving voice messages and the received messages are displayed on the screen.
EXPANDED TECHNICAL DETAILS
Edge-AI Acoustic Control
This advanced project leverages the on-board PDM microphone and Neural Network capabilities of the XIAO nRF52840 Sense to create a robot that obeys spoken commands.
- TinyML Speech Recognition: Uses the Edge Impulse framework to run an optimized neural network on the nRF52840's ARM Cortex-M4F processor. The robot can recognize specific keywords like "Forward," "Stop," and "Left" without an internet connection.
- BLE Telemetry Pipeline: Simultaneously streams the recognition confidence and battery level to a smartphone app via Bluetooth-LE, providing a real-time "Thought Process" dashboard for the AI.
Performance
- Zero-Latency Response: Optimized to perform inference in less than 100ms, ensuring the robot feels responsive and intuitive to the user's voice.