If you don't have git installed on Pepper, you can clone this repo on your own machine and use `scp` to upload the files to your Pepper robot.

Open an SSH connection to the Pepper robot, then install the dependencies:

```
pip install -r requirements.txt
```
This project sends the recognized text, in OpenAI format, to the server you provide. Currently it is sent to a server built with another of our projects, SkywardAI pepper-voyager, running on an AWS EC2 instance.
Assuming we have the AWS EC2 server address, run

```
python start.py --base-route "http://<address-of-ec2-server>/v1"
```

to start the service on the Pepper robot. Do NOT forget to add `http` or `https` before the address. The service will automatically append `/chat/completions` to the end of the provided `base-route`.

Once it outputs `INF: Started, you can speak now`, just speak and wait for the response.
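The base-route handling can be pictured roughly like this (a sketch; the helper name is hypothetical and the actual code in `start.py` may differ):

```python
# Sketch of how the chat endpoint is derived from --base-route.
# build_chat_url is a hypothetical name, not the project's actual function.

def build_chat_url(base_route, chat_route="/chat/completions"):
    """Append the chat completions route to the provided base route."""
    return base_route.rstrip("/") + chat_route

url = build_chat_url("http://example-ec2-host/v1")
print(url)  # http://example-ec2-host/v1/chat/completions
```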
The terminal output is in the following format:

```
Speech Recognition Result:
================================
hi how are you
================================
AI Inference Result:
================================
Hello! As an AI assistant, I don't have feelings, but I'm operating properly and ready to help you with any questions or tasks you may have. How can I assist you today?
================================
```
You can also save the conversation to a CSV file; see Flags.
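The CSV saving could look something like the following sketch (not the project's actual implementation; the column layout is an assumption):

```python
import csv

def append_dialogue(path, user_text, ai_text):
    """Append one user/assistant exchange as a row in a CSV file.
    Hypothetical helper; the real project may structure dialogue.csv differently."""
    with open(path, "a", newline="") as f:
        writer = csv.writer(f)
        writer.writerow([user_text, ai_text])

append_dialogue("dialogue.csv", "hi how are you", "Hello! How can I assist you today?")
```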
You can add a `.env` file for more control. The available fields are listed below:
- `NAO_IP` - String: The IP address of the Pepper robot, default `localhost`
- `NAO_PORT` - Integer: The port of the Pepper robot, default `9559`
- `URL` - String: The URL of the services, required
- `SPEECH_RECOGINITION_URL` - String: The URL of the speech recognition service. Defaults to the same as `URL`.
- `CHAT_COMPLETION_ROUTE` - String: The route of AI chat completion based on the provided URL, default `/chat/completions`
- `SPEECH_RECOGINITION_ROUTE` - String: The route of AI speech recognition based on the provided URL, default `/speech/recognition`
- `MODEL_NAME` - String: The model name when integrating with OpenAI, for example `gpt-4o`
- `API_KEY` - String: The API key for all services, sent in the `Authorization` header. You can also specify an API key for each service individually.
- `SPEECH_API_KEY` - String: The API key of the speech recognition service, sent in the `Authorization` header
- `HOLD_TIME` - Float: Minimum recording time in seconds. Default `2.0`
- `RELEASE_TIME` - Float: Idle time after each recorded piece stops, in seconds. Default `1.0`
- `RECORD_DURATION` - Float: Maximum recording time in seconds. Default `7.0`
- `LOOK_AHEAD_DURATION` - Float: Number of seconds before the threshold trigger that will be included in the request. Default `0.5`
- `AUTO_DETECTION_THREADSHOLD` - Integer: Threshold of auto-detection. Default `5`
- `WEBVIEW` - String: Specify the URL of an HTML file to load with the built-in webview after all modules have started.
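Reading these fields with defaults might look like the following sketch (the project may load the `.env` file through a library such as python-dotenv; the `env` helper below is a hypothetical stand-in that reads from the process environment):

```python
import os

def env(name, default=None, cast=str):
    """Read a config field from the environment, falling back to a default.
    Hypothetical helper, not the project's actual loader."""
    value = os.environ.get(name)
    return cast(value) if value is not None else default

nao_ip = env("NAO_IP", "localhost")
nao_port = env("NAO_PORT", 9559, int)
hold_time = env("HOLD_TIME", 2.0, float)
```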
There are some flags you can set when running; the available flags are listed below:

- `--ip`: Specify the IP address of the Pepper robot, default `localhost`
- `--port`: Specify the port of the Pepper robot, default `9559`
- `--url`: Specify the DNS of the services. Set it either here or in the `.env` file. You can specify routes for different services in the `.env` file; see the `.env` section.
- `--chat-route`: Specify the route of AI chat completions based on the server, default `/chat/completions`
- `--speech-route`: Specify the route of AI speech recognition based on the server, default `/speech/recognition`
- `--api-key`: Specify the services' API key
- `--speech-api-key`: Specify the speech recognition API key. Defaults to the same as the `--api-key` option.
- `--model-name`: Specify the OpenAI model name
- `--save-csv`: Set to save the conversation to `dialogue.csv`
- `--prompt`: Specify the system prompt to use in AI chat completions.
- `--fprompt`: Load the system prompt from a file; if the `--prompt` option is specified, this will be ignored.
- `--webview`: Load an HTML file using the built-in webview when started.
For example:

```
python start.py --url "http://<ec2-instance-public-DNS>/v1" --save-csv
```
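A minimal argparse setup matching a few of these flags could be sketched like this (an illustration only; the actual parser in `start.py` may define the options differently):

```python
import argparse

def build_parser():
    """Sketch of a CLI parser for a subset of the flags above."""
    parser = argparse.ArgumentParser(description="Pepper voice assistant")
    parser.add_argument("--ip", default="localhost", help="IP address of the Pepper robot")
    parser.add_argument("--port", type=int, default=9559, help="Port of the Pepper robot")
    parser.add_argument("--url", help="DNS of the services")
    parser.add_argument("--chat-route", default="/chat/completions",
                        help="Route of AI chat completions on the server")
    parser.add_argument("--save-csv", action="store_true",
                        help="Save conversation to dialogue.csv")
    return parser

args = build_parser().parse_args(["--url", "http://example/v1", "--save-csv"])
```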
- start.py: Everything related to initialization
- module_speechrecognition.py: Everything related to speech recognition
- module_receiver.py: Receives text from speech recognition, sends the request to the LLM, and speaks the response out loud
`memory` in the main function of `start.py` is used for sending events between modules; search for `self.memory` in the module files for usage. `myBroker` is necessary to build the channel in the Python runtime; it is the basis of using `memory`.
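On the robot, this event passing goes through NAOqi's ALMemory via the broker. As a rough, dependency-free stand-in for the pattern (this is not the naoqi API), it behaves like a tiny publish/subscribe channel:

```python
class MemoryStandIn:
    """Toy publish/subscribe channel mimicking how modules exchange events.
    The real project uses NAOqi's ALMemory through myBroker; this class only
    illustrates the subscribe/raise pattern."""

    def __init__(self):
        self._subscribers = {}

    def subscribe(self, event, callback):
        """Register a callback for an event name."""
        self._subscribers.setdefault(event, []).append(callback)

    def raise_event(self, event, value):
        """Deliver a value to every callback subscribed to the event."""
        for callback in self._subscribers.get(event, []):
            callback(value)

memory = MemoryStandIn()
received = []
memory.subscribe("SpeechRecognized", received.append)
memory.raise_event("SpeechRecognized", "hi how are you")
print(received)  # ['hi how are you']
```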
Advanced functions may require different speech recognition models; a new file implementing other libraries might be required.

Try different LLMs to test performance. You can also fine-tune the models on the required information, or add documents to the conversation context so responses stay relevant to the project.
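Adding project documents to the conversation context could be sketched like this (the message format is standard OpenAI chat style; the helper and the document strings are hypothetical):

```python
def build_messages(system_prompt, documents, user_text):
    """Prepend the system prompt and reference documents to the user's message.
    Hypothetical helper illustrating context injection, not the project's code."""
    context = system_prompt + "\n\nReference documents:\n" + "\n".join(documents)
    return [
        {"role": "system", "content": context},
        {"role": "user", "content": user_text},
    ]

messages = build_messages(
    "You are Pepper, a helpful robot assistant.",
    ["Pepper runs NAOqi.", "The service speaks responses out loud."],
    "hi how are you",
)
```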