AlexaLikeWhisper

Implement of audio speech recognition "Whisper" released by OpenAI triggered on Wakeup word detection

Demo

After detected wakeup words, whisper recognizes audio speech like Alexa!
Using recognized words, you can control avatar robots or IoT...etc!

System

Users : Say wakeup words like "Hey, Siri" and some speech
PC : Input audio speech with a microphone and recognize it with whisper
IoT : Using recognized words, do tasks

PC Spec

OS : Ubuntu 20.04
GPU : Geforce RTX 2080Ti

Setup

PC

Install Whisper

Using API

# install openai
pip install openai

Install Whisper when not using API

install pytorch
Install Pytorch with matching GPU, CUDA and cuDNN versions.
Pytorch

# install transformers
pip install transformers

# install whisper
sudo apt update && sudo apt install ffmpeg
pip install git+https://github.com/openai/whisper.git

Install other packages

# install pyaudio
sudo apt-get install portaudio19-dev
pip install pyaudio

# install pvporcupine
pip install pvporcupine

To use pvporcupine, you need to register to PICOVOICE and get a API Key.
And download a model file(.ppn) and place it in AlexaLikeWhisper/model.

Usage

# get source of alexa like whisper and install alexa_like_whisper
git clone https://github.com/tech-life-hacking/AlexaLikeWhisper.git
cd AlexaLikeWhisper
pip install -e .

Place a model file(.ppn) in AlexaLikeWhisper/model.

import alexa_like_whisper

if __name__ == "__main__":
    # Modelsizes on whisper
    MODELSIZES = ['tiny', 'base', 'small', 'medium', 'large']

    # AccessKey obtained from Picovoice Console (https://console.picovoice.ai/)
    ACCESS_KEY = "YOUR_ACCESS_KEY"
    KEYWORD_PATH = ['PPN_FILE_PATH']

    # Recording Time(s)
    RECORDING_TIME = 3

    # if using API, set True
    WHISPER_API = True

    # if using API, set API Key or "export OPENAI_API_KEY='YOUR_API_KEY'"
    openai.api_key = "YOUR_API_KEY"

    alexa_like = alexa_like_whisper.AlexaLikeWhisper(ACCESS_KEY, KEYWORD_PATH, MODELSIZES[3], RECORDING_TIME, WHISPER_API)

    while True:
        result = alexa_like.run()
        print(result)

result shows

Waiting wakeup words : "Sleep"
After detected wakeup words and on recording : "On recording..."
When recognizing audio speech : the result

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
alexa_like_whisper		alexa_like_whisper
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AlexaLikeWhisper

Demo

System

PC Spec

Setup

PC

Install Whisper

Using API

Install Whisper when not using API

Install other packages

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

tech-life-hacking/AlexaLikeWhisper

Folders and files

Latest commit

History

Repository files navigation

AlexaLikeWhisper

Demo

System

PC Spec

Setup

PC

Install Whisper

Using API

Install Whisper when not using API

Install other packages

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages