Drunk AI
Recently I came across this awesome module, MMM-OpenAI; I posted about it earlier with a short demo and a YouTube link to the full demo. I decided to do something fun with it, and I was not expecting it to turn out this way. I have put detailed step-by-step setup instructions on GitHub. I am not from a technical background, so the code is not really polished and there may be issues, but I tried my best to be as thorough as possible in the instructions. I'm sure someone else will come up with a better way to implement this. Here is the link: Drunk AI repo
At the moment it’s quite slow, but it will get faster with more enhancements.
Detailed instructions:
Drunk-AI
This is an integration of OpenAI ChatGPT in MagicMirror.
Demo below.
Watch the full demo here: https://youtu.be/eDcc7zqAFwE
This is the initial version. I am not from a technical background, so if the code and scripts are a little off the charts, or if there is a way to enhance them, feel free to send a PR. I admit this is quite a messy and somewhat long process to set up, but in the end it's worth it.
Download the shortcut linked below and configure it with the API key from MMM-Remote-Control.
More info here: https://github.com/Jopyth/MMM-Remote-Control.git
iOS shortcut link: https://www.icloud.com/shortcuts/8a0e7600808d45eb9616dae8105653ef
Edit the downloaded shortcut: set your MagicMirror's IP address, and replace apikey in the text with your API key from MMM-Remote-Control.
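For reference, what the shortcut essentially does is call MMM-Remote-Control's REST API on the mirror. The Python sketch below only illustrates that kind of request and is not the shortcut itself: the notification name is a made-up placeholder, and the exact route can differ between MMM-Remote-Control versions, so check its API documentation.

# Illustrative sketch only -- not the actual shortcut.
# The notification name "START_LISTENING" is a placeholder; the real one is
# defined inside the downloaded shortcut / MMM-Chat.
import requests

MIRROR_IP = "192.168.1.50"            # your MagicMirror's IP address
API_KEY = "your-remote-control-key"   # apiKey from your MMM-Remote-Control config

# MMM-Remote-Control exposes notifications under /api/notification/<NAME>;
# verify the route against its API documentation for your version.
url = f"http://{MIRROR_IP}:8080/api/notification/START_LISTENING"
response = requests.get(url, params={"apiKey": API_KEY}, timeout=10)
print(response.status_code, response.text)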
You need to run this shortcut every time you want to interact with ChatGPT, or you can use the default Telegram commands, but that is without voice input. Hotword detection will be added soon.
When you run the shortcut, the mirror will update with the Jarvis animation and the text "say something". Speak into the microphone after you see "say something"; if the audio is captured successfully, the display will update with the spoken text. The listening duration is set to 4 seconds by default; you can adjust this by editing transcript.py in the MMM-Chat module mentioned below.
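I haven't reproduced the actual transcript.py here, but the idea behind it is roughly the following sketch: record the microphone for a fixed number of seconds, then send the audio to OpenAI Whisper for transcription. The libraries, file names and sample rate below are illustrative assumptions; the adjustable listening duration is the point.

# Sketch only -- not the actual transcript.py from MMM-Chat.
import sounddevice as sd
from scipy.io.wavfile import write
import openai

RECORD_SECONDS = 4        # listening duration -- the value you would change
SAMPLE_RATE = 16000

openai.api_key = "your-openai-api-key"

# Record from the default microphone for RECORD_SECONDS.
audio = sd.rec(int(RECORD_SECONDS * SAMPLE_RATE), samplerate=SAMPLE_RATE,
               channels=1, dtype="int16")
sd.wait()
write("capture.wav", SAMPLE_RATE, audio)

# Transcribe with Whisper (openai python package < 1.0 style).
with open("capture.wav", "rb") as f:
    result = openai.Audio.transcribe("whisper-1", f)
print(result["text"])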
Install and configure MMM-Chat from here: https://github.com/sdmydbr9/MMM-Chat
Install MMM-NotificationTrigger: https://github.com/MMRIZE/MMM-NotificationTrigger.git
Add the following lines to the config
{
    module: 'MMM-NotificationTrigger',
    config: {
        triggers: [
            {
                trigger: 'SHOW_ALERT',
                fires: [
                    {
                        fire: 'MY_COMMAND',
                        exec: (payload) => `python3 /home/pi/MagicMirror/modules/MMM-11-TTS/main.py "${payload.message}"`
                    }
                ]
            }
        ]
    }
},
Install MMM-OpenAI from here: https://github.com/MMRIZE/MMM-OpenAI.git
Add the following to your config. The postProcessing hook forwards each OpenAI response as a SHOW_ALERT notification, which MMM-NotificationTrigger (configured above) picks up and passes to the MMM-11-TTS script:
{ module: "MMM-OpenAI", position: 'top_right', config: { defaultChatInstruction: "Your name is Marvin and you are a paranoid android that reluctantly answers questions with sarcastic responses.", stealth: true, // <- This is needed to hide default module view. postProcessing: function (handler, responseObj) { if (responseObj.error) return // When the error happens, just do nothing. let method = responseObj.options.method let alertPayload = { title: responseObj.request.prompt, imageUrl: (method === 'IMAGE') ? responseObj.response.data[0].url : null, message: (method === 'TEXT') ? responseObj.response.choices[0].text : ((method === 'CHAT') ? responseObj.response.choices[0].message.content : null), timer: 2 * 1000 } handler.sendNotification('SHOW_ALERT', alertPayload) } } },
Clone the following repository into your modules folder and install it according to its instructions: https://github.com/sdmydbr9/MMM-11-TTS
Edit main.py and add your API key and voice ID. A voice ID is already set by default, but you can use any voice ID; refer to the ElevenLabs API docs for more details.
Disclaimer: Even though the quality of their voice output is far superior to Google TTS, the character limit is quite low: 10,000 characters per month per account. I hope they will offer more in the future. Alternatively, you can opt for a paid account and get around 30,000 characters as well as voice cloning: clone any voice you want, for example the voice of Jarvis, and transform your MagicMirror into Jarvis. Just add the voice ID in main.py and you're good to go.
Disclaimer 2: The above module works in my testing, but it is not very efficient, since the script first downloads the audio from the API request, converts it using ffmpeg, and then plays the audio as output.
It depends on ffmpeg. If you don't have it installed, install it with apt (sudo apt install ffmpeg).
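To make the flow in Disclaimer 2 concrete, here is a rough sketch of the download, convert and play steps. This is not the actual main.py from MMM-11-TTS: the ElevenLabs endpoint and xi-api-key header come from the ElevenLabs API docs, while the file names and the use of aplay are my own illustrative choices.

# Sketch only -- not the actual main.py from MMM-11-TTS.
import subprocess
import requests

ELEVENLABS_API_KEY = "your-elevenlabs-api-key"
VOICE_ID = "your-voice-id"           # any voice ID from your ElevenLabs account
TEXT = "Hello from the magic mirror"

# 1. Request the speech audio from ElevenLabs (returned as MP3 bytes).
url = f"https://api.elevenlabs.io/v1/text-to-speech/{VOICE_ID}"
headers = {"xi-api-key": ELEVENLABS_API_KEY, "Content-Type": "application/json"}
resp = requests.post(url, headers=headers, json={"text": TEXT}, timeout=60)
resp.raise_for_status()
with open("speech.mp3", "wb") as f:
    f.write(resp.content)

# 2. Convert to WAV with ffmpeg (must be installed; see the note above).
subprocess.run(["ffmpeg", "-y", "-i", "speech.mp3", "speech.wav"], check=True)

# 3. Play the converted audio (aplay ships with alsa-utils on Raspberry Pi OS).
subprocess.run(["aplay", "speech.wav"], check=True)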
It may be possible to implement all of this in a single module; I hope someone will try to make it into one module that is less messy.