Face It 👀, Save All Your Happy Moments Via Google Aiy

Made by xiaowen-bi / Artificial intelligence / Communication / Home Automation / Robotics / Voice

About the project

We could all use a reminder of the good times when we are down. For happy memory curation, we propose Face it 👀 - an automatic happy moment achiever by utilizing the Google AIY Voice kits, will automatically begin recording when you initialize a recording trigger word, and save the moment to play back later.

Project info

Difficulty: Moderate

Platforms: Android, Python, AIY

Estimated time: 1 week

License: GNU General Public License, version 3 or later (GPL3+)

Items used in this project

Hardware components

	Google AIY Voice Kit for Raspberry Pi Raspberry Pi Zero WH, speaker and microSD card are all included	x 1
	android phone	x 1

Software apps and online services

	Google assistant API
	visual studio
	github

Hand tools and fabrication machines

2mm flat screwdriver

x 1

Story

Introduction

The main goal of this project is to voice-trigger a recording of a moment and play it later at a specific time you want. Via using the original setup from Google AIY Voice Kit, we were able to record a 3s voice message and replayed it by manually inputting a command. However, we would need to program extra code to start the auto-recording based on the voice recognition ideas we want to achieve. Below is a video of our final product demo, in which you will see an example on how we could use Face it 👀 in a birthday surprise party.

Birthday Surprise Use Case

Instruction

Below are the steps you will need to do in order to reproduce what we have done in the video (there might be different ways of programming, but we are providing you one approach out of millions out there😊):

1. Build the Google AIY Voice Kit

We followed the official Google AIY Voice Kit guide webpage to build up our kit: https://aiyprojects.withgoogle.com/voice/#assembly-guide

The guide is easy to follow, and we took photos along the way assembling it:

2. Configure the Kit

After assembling the kit, you will need to set up SSH to get the terminal ready, connect to the Raspberry Pi and get credentials from the Google Cloud Platform in order to use the Google Assistant APIs. One thing we tried and fixed apart from the tutorial is an error in one of the example files when executing src/examples/voice/assistant_grpc_demp.py. You will need to change one function call in one of the python scripts then continue to follow the Google tutorial as normal. For more details on how to fix the issue, you can refer to: https://github.com/google/aiyprojects-r ... issues/658

3. Use cases

We have made three use cases of this project related to voice recognition via using trigger-word recording. Imagine there is a Paddington bear, who is friend with Teddy bear and Pinky bear. One day Teddy and Pinky bear bought a Face it 👀 which is using Google AIY Voice recognition technology. By giving different key words and functions in the code, Teddy and Pinky bear want to try different games with Paddington via using the kit. Below is what they planned:

1. Prosocial teasing with friends – invite him/er to eat chicken feet

2. Birthday surprise party gift

3. Google Google who is the fairest of us all?

In the first use case, Teddy and Pinky bear invite Paddington to try a Chinese traditional finger food (real fingers by all means) – chicken feet. They record Paddington’s first reaction towards the invitation (without Paddington knowing it, because the auto voice trigger word is “chicken feet”), and replay the recording after Paddington tried the food and changed his mind. It is a friendly teasing of Paddington, to encourage him never say no to new things that he didn’t know before. Just like a lot of parents who said they didn’t want a dog/cat, and when you finally get them one, they changed their mind and cherish the pet more than anything else. You can record such scenarios via Face it 👀 any time:

Use Case 1 Video Link

In the second use case, Teddy and Pinky bear organized a surprise party for Paddington’s birthday. And when Teddy bear says “Let’s sing a birthday song”, the “birthday song” key word will trigger the auto-recording without Paddington knowing it. What’s more, Teddy wants to give Paddington this precious voice recording memory as a birthday gift later, so he can either ask Google via using the trigger word “birthday gift” to replay the song sang by their voices, or save the recording as an audio file to send to Paddington.

Use Case 2 Video Link

Last but not the least, Teddy bear hard coded a fun question and answer in the kit: whenever someone asked the question “Google google, who is the fairest of us all?”, by detecting the key word “fairest of us all”, Google will reply “Teddy bear, is the fairest in the world”. The key is that, after this question Teddy will say “next question” and trigger to switch automatically from the “special question” mode back to a normal Google assistant Q&A mode without being noticed. So when Paddington started to ask the kit any other questions, such as “what day is it today”, Google will reply regularly based on the Google Assistant library.

Use case 3 Video Link

4. Set up your IDE to connect to the microSD card

To better test the result from our code writing, we connected our Visual Studio and push the below command to cover the code directly in the microSD card to test. This gives you the opportunity to see simultaneous outcome from what you write. We modified directly in the grpc.py file (~AIY-project-python/src/aiy/assistant/grpc.py) from one of the default examples of the Google Voice Kit and another demo file (~AIY-project-python/src/examples/voice/assistant_grpc_demo_ex.py), and pi@192.168.2.173 is our IP address:

5. Write and run the code

Here comes to the juicy part of this project - the real “go behind the scenes”.

1. Modify the code to start listening conversation once the kit is powered on, instead of triggering listening via pressing the button from the kit as default setup from the Google Voice Kit. We need to keep the listening always on-going in order to catch the trigger words later.

2. Once the kit is consistently listening, we call function conversation2 to listen trigger words (both start the conversation with Google or ask Google to turn off speaking / shut up herself):

The reason we need Google to “shut up” herself is that we want Google to “quietly listening” for the trigger word while we talk, instead of keep interrupting users’ conversation if she doesn’t understand. For all the conversations she doesn’t understand, she should keep silent until she captures any trigger word.

We call function Listner to listen trigger words (to play voice message from respective previous recordings):

3. For all the trigger words in three use cases, below is the set-up:

“hey google” – start the normal conversation with Google (Q&A mode)

“shut up” – ask Google to keep quiet while waiting for the trigger word (turn off the Q&A mode and change to “quiet listening” mode)

“everybody ready” – ask Google to keep quiet while waiting for the trigger word (in use case 2, turn off the Q&A mode and change to “quiet listening” mode)

“chicken feet” – start recording a voice message (in use case 1)

“birthday song” – start recording a voice message (in use case 2)

“play memory” – replay the recording from the voice message triggered by “chicken feet” (in use case 1)

“birthday gift” – replay the recording from the voice message triggered by “birthday song” (in use case 2)

“next question” – ask Google to keep quiet while waiting for the trigger word (in use case 3, turn off the Q&A mode and change to “quiet listening” mode)

“fairest of us all” – trigger to play the answer “teddy bear, is the fairest in the world” (in use case 3)

Function for recording and playing:

Final thoughts

Our team had a lot of fun doing this project while trying and playing with the Google kit. We think it is a very good tool to entertain your families and friends and explore the AI world by using mostly the basic tool setups. It is also a good way for Python beginners to dive in and learn coding via playing. In our original proposal, we would like to record the memory by combining both vision and voice recordings, however the camera from the Vision Kit we received was not working since the beginning. We have contacted the support-aiyprojects@google.com team for a replacement part and it takes time to ship to us, so our development will continue once the camera arrives. We are happy with our current Face it 👀 result especially the auto-voice listening and trigger word based re-playing functions. We also want to thank electromaker platform to give us the opportunity to test Google AIY kits, and all the help answering our questions in their website forum. We hope that you enjoy our team’s idea and solutions, as much as we enjoy this contest.

A sincere thank you from our team:

Sai Xu

Vadym Prokopets

Xiaowen Bi

Paddington bear

Pinky bear

Teddy bear

Code

Face it 👀 Repository

The file grpc.py and assistant_grpc_demo_ex.py are the files where we run the three use cases.

assistant_grpc_demo_ex.py file

#!/usr/bin/env python3
# Copyright 2017 Google Inc.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

"""A demo of the Google Assistant GRPC recognizer."""

import argparse
import locale
import logging
import signal
import sys
import math
import time
import threading

from aiy.assistant.grpc import AssistantServiceClientWithLed
from aiy.board import Board
from aiy.leds import (Leds, Pattern, PrivacyLed, RgbLeds, Color)
from aiy.voice.audio import AudioFormat, play_wav, record_file, Recorder

def volume(string):
    value = int(string)
    if value < 0 or value > 100:
        raise argparse.ArgumentTypeError('Volume must be in [0...100] range.')
    return value

def locale_language():
    #language, _ = locale.getdefaultlocale()
    language, _ = locale.getdefaultlocale()
    return language

def main():
    logging.basicConfig(level=logging.DEBUG)
    signal.signal(signal.SIGTERM, lambda signum, frame: sys.exit(0))

    parser = argparse.ArgumentParser(description='Assistant service example.')
    parser.add_argument('--language', default=locale_language())
    parser.add_argument('--volume', type=volume, default=100)
    #args = parser.parse_args()

   # parser = argparse.ArgumentParser()
    parser.add_argument('--filename', '-f', default='recording.wav')
    args = parser.parse_args()


    with Board() as board:
        assistant = AssistantServiceClientWithLed(board=board,
                                                  volume_percentage=args.volume,
                                                  language_code=args.language)
        #done=threading.Event()
        talk=True
        while True:
            logging.info('Conversation started!')
            #assistant.conversation2()
            if not assistant.conversation2():
                if assistant.Listner():
                    continue            

if __name__ == '__main__':
    main()

grpc.py file

# Copyright 2017 Google Inc.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

"""
Enables a conversation with the Google Assistant, using the `Google Assistant Service`_, which
connects to the Google Assistant using a streaming endpoint over gRPC.
This gRPC service is typically more complicated to set up, compared to the Google Assistant
Library, but this API takes care of all the complexity for you. So you simply create an instance
of :class:`AssistantServiceClient`, then start the Google Assistant by calling
:meth:`~AssistantServiceClient.conversation`.
This API provides only an interface to initiate a conversation with the Google Assistant. It
speaks and prints all responses for you—it does not allow you to handle the response events or
create custom commands.
For an example, see :github:`src/examples/voice/assistant_grpc_demo.py`.
If you want to integrate custom device commands with the Google Assistant using the gRPC interface,
instead use the `Google Assistant Service`_ directly. For an example, see `this gRPC sample
<https://github.com/googlesamples/assistant-sdk-python/blob/master/google-assistant-sdk/googlesamples/assistant/grpc/pushtotalk.py>`_.
Or instead of interacting with the Google Assistant, you can use :mod:`aiy.cloudspeech`
to convert your voice commands into text that triggers your actions.
"""

import array
import logging
import math
import os
import sys

os.environ['GRPC_POLL_STRATEGY'] = 'epoll1'
import google.auth.transport.grpc
import google.auth.transport.requests
import google.oauth2.credentials

from google.assistant.embedded.v1alpha2 import embedded_assistant_pb2
from google.assistant.embedded.v1alpha2 import embedded_assistant_pb2_grpc

from aiy.assistant import auth_helpers, device_helpers
from aiy.board import Led
from aiy.voice.audio import AudioFormat, Recorder, BytesPlayer, record_file, play_wav
import aiy.voice.tts as tts

import argparse
import time
import threading

from aiy.board import Board

logger = logging.getLogger(__name__)



ASSISTANT_API_ENDPOINT = 'embeddedassistant.googleapis.com'
END_OF_UTTERANCE = embedded_assistant_pb2.AssistResponse.END_OF_UTTERANCE
DIALOG_FOLLOW_ON = embedded_assistant_pb2.DialogStateOut.DIALOG_FOLLOW_ON
CLOSE_MICROPHONE = embedded_assistant_pb2.DialogStateOut.CLOSE_MICROPHONE
PLAYING = embedded_assistant_pb2.ScreenOutConfig.PLAYING
DEFAULT_GRPC_DEADLINE = 60 * 3 + 5
AUDIO_SAMPLE_RATE_HZ = 16000
AUDIO_FORMAT=AudioFormat(sample_rate_hz=AUDIO_SAMPLE_RATE_HZ,
                         num_channels=1,
                         bytes_per_sample=2)

def _normalize_audio_buffer(buf, volume_percentage, sample_width=2):
    assert sample_width == 2
    scale = math.pow(2, 1.0 * volume_percentage / 100) - 1
    arr = array.array('h', buf)
    for i in range(0, len(arr)):
        arr[i] = int(arr[i] * scale)
    return arr.tobytes()

# https://developers.google.com/assistant/sdk/reference/rpc/
class AssistantServiceClient:
    """
    Provides a simplified interface for the `EmbeddedAssistant
    <https://developers.google.com/assistant/sdk/reference/rpc/google.assistant.embedded.v1alpha2#google.assistant.embedded.v1alpha2.EmbeddedAssistant>`_.
    
    Args:
        language_code: Language expected from the user, in IETF BCP 47 syntax (default is "en-US").
            See the `list of supported languages
            <https://developers.google.com/assistant/sdk/reference/rpc/languages>`_.
        volume_percentage: Volume level of the audio output. Valid values are 1 to 100
            (corresponding to 1% to 100%).
    """
    def __init__(self, language_code='de-DE', volume_percentage=100):
        self._volume_percentage = volume_percentage  # Mutable state.
        self._conversation_state = None              # Mutable state.
        self._language_code = language_code

        ##
        credentials = auth_helpers.get_assistant_credentials()
        device_model_id, device_id = device_helpers.get_ids_for_service(credentials)

        logger.info('device_model_id: %s', device_model_id)
        logger.info('device_id: %s', device_id)

        http_request = google.auth.transport.requests.Request()
        try:
            credentials.refresh(http_request)
        except Exception as e:
            raise RuntimeError('Error loading credentials: %s', e)

        api_endpoint = ASSISTANT_API_ENDPOINT
        grpc_channel = google.auth.transport.grpc.secure_authorized_channel(
            credentials, http_request, api_endpoint)
        logger.info('Connecting to %s', api_endpoint)
        ##

        self._assistant = embedded_assistant_pb2_grpc.EmbeddedAssistantStub(grpc_channel)
        self._device_config = embedded_assistant_pb2.DeviceConfig(
            device_model_id=device_model_id,
            device_id=device_id)

    @property
    def volume_percentage(self):
        """
        Volume level of the audio output. Valid values are 1 to 100 (corresponding to 1% to 100%).
        """
        return self._volume_percentage

    def _recording_started(self):
        logger.info('Recording started.')

    def _recording_stopped(self):
        logger.info('Recording stopped.')

    def _playing_started(self):
        logger.info('Playing started.')

    def _playing_stopped(self):
        logger.info('Playing stopped.')

    def _requests(self, recorder):
        audio_in_config = embedded_assistant_pb2.AudioInConfig(
            encoding='LINEAR16',
            sample_rate_hertz=AUDIO_SAMPLE_RATE_HZ)

        audio_out_config = embedded_assistant_pb2.AudioOutConfig(
            encoding='LINEAR16',
            sample_rate_hertz=AUDIO_SAMPLE_RATE_HZ,
            volume_percentage=self._volume_percentage)

        dialog_state_in = embedded_assistant_pb2.DialogStateIn(
            conversation_state=self._conversation_state,
            language_code=self._language_code)

        config = embedded_assistant_pb2.AssistConfig(
            audio_in_config=audio_in_config,
            audio_out_config=audio_out_config,
            device_config=self._device_config,
            dialog_state_in=dialog_state_in)

        yield embedded_assistant_pb2.AssistRequest(config=config)

        for chunk in recorder.record(AUDIO_FORMAT,
                                     chunk_duration_sec=0.1,
                                     on_start=self._recording_started,
                                     on_stop=self._recording_stopped):
            yield embedded_assistant_pb2.AssistRequest(audio_in=chunk)


    def _assist(self, recorder, play, deadline):
        continue_conversation = False

        for response in self._assistant.Assist(self._requests(recorder), deadline):
            if response.event_type == END_OF_UTTERANCE:
                logger.info('End of audio request detected.')
                recorder.done()

            # Process 'speech_results'.
            if response.speech_results:
                logger.info('You said: "%s".',
                            ' '.join(r.transcript for r in response.speech_results))

            # Process 'audio_out'.
            if response.audio_out.audio_data:
                recorder.done()  # Just in case.
                play(_normalize_audio_buffer(response.audio_out.audio_data,
                                             self._volume_percentage))

            # Process 'dialog_state_out'.
            if response.dialog_state_out.conversation_state:
                conversation_state = response.dialog_state_out.conversation_state
                logger.debug('Updating conversation state.')
                self._conversation_state = conversation_state  # Mutable state change.

            volume_percentage = response.dialog_state_out.volume_percentage
            if volume_percentage:
                logger.info('Setting volume to %s%%', volume_percentage)

            supplemental_display_text = response.dialog_state_out.supplemental_display_text
            if supplemental_display_text:
                logger.info('Assistant said assist1: "%s"', supplemental_display_text)

            microphone_mode = response.dialog_state_out.microphone_mode
            if microphone_mode == DIALOG_FOLLOW_ON:
                continue_conversation = True
                logger.info('Expecting follow-on query from user.')
            elif microphone_mode == CLOSE_MICROPHONE:
                continue_conversation = False
                logger.info('Not expecting follow-on query from user.')

        return continue_conversation

    def conversation(self, deadline=DEFAULT_GRPC_DEADLINE):
        """
        Starts a conversation with the Google Assistant.
        The device begins listening for your query or command and will wait indefinitely.
        Once it completes a query/command, it returns to listening for another.
        Args:
            deadline: The amount of time (in milliseconds) to wait for each gRPC request to
                complete before terminating.
        """
        keep_talking = True
        while keep_talking:
            playing = False
            with Recorder() as recorder, BytesPlayer() as player:
                play = player.play(AUDIO_FORMAT)

                def wrapped_play(data):
                    nonlocal playing
                    if not playing:
                        self._playing_started()
                        playing = True
                    play(data)

                try:
                    keep_talking = self._assist(recorder, wrapped_play, deadline)
                finally:
                    play(None)       # Signal end of sound stream.
                    recorder.done()  # Signal stop recording.

            if playing:
                self._playing_stopped()

    def Listner(self,deadline=DEFAULT_GRPC_DEADLINE):
        """
        Starts a conversation with the Google Assistant.
        The device begins listening for your query or command and will wait indefinitely.
        Once it completes a query/command, it returns to listening for another.
        Args:
            deadline: The amount of time (in milliseconds) to wait for each gRPC request to
                complete before terminating.
        """
        keep_talking = True
        while keep_talking:
            playing = False           
            with Recorder() as recorder, BytesPlayer() as player:
                play = player.play(AUDIO_FORMAT)
                def wrapped_play(data):
                    nonlocal playing
                    if not playing:
                        self._playing_started()
                        playing = True
                    play(data)

                try:
                    logger.info("FINALLY")
                    keep_talking = self._listen(recorder, wrapped_play, deadline)
                finally:
                    play(None)       # Signal end of sound stream.
                    recorder.done()  # Signal stop recording.

            if playing:
                self._playing_stopped()
        return keep_talking

    def _listen(self, recorder,play, deadline):
        # mic is mute, but conversation continue
        continue_conversation = False
        
        for response in self._assistant.Assist(self._requests(recorder), deadline):
            if response.event_type == END_OF_UTTERANCE:
                logger.info('End of audio request detected.')
                recorder.done()
            
            # Process 'speech_results'.
            if response.speech_results:
                result = ' '.join(r.transcript for r in response.speech_results)
                logger.info('You said assist2: "%s".',
                            result)
                result = result.lower()
                if ('chicken feet' in result.lower()):
                    logger.info("WE can start recording")
                    self.record_candy()
                    return True
                if ('birthday song' in result.lower()):
                    logger.info("WE can start recording now")
                    self.record_birthday()
                    return True
                if ('birthday gift' in result.lower()):
                    self.play_birthday()
                    return True
                if ('fairest of us all' in result.lower()):
                    tts.say("teddy bear, is the fairest in the world")
                    return False
                if ('hey google' in result.lower()):
                    return False

            # Process 'audio_out'.
            if response.audio_out.audio_data:
                recorder.done()  # Just in case.
                #play(_normalize_audio_buffer(response.audio_out.audio_data,
                 #                            self._volume_percentage))
            # Process 'dialog_state_out'.
            if response.dialog_state_out.conversation_state:
                conversation_state = response.dialog_state_out.conversation_state
                logger.debug('Updating conversation state.')
                self._conversation_state = conversation_state  # Mutable state change.
            volume_percentage = response.dialog_state_out.volume_percentage

            if volume_percentage:
                logger.info('Setting volume to %s%%', volume_percentage)
                self._volume_percentage = volume_percentage  # Mutable state change.

            supplemental_display_text = response.dialog_state_out.supplemental_display_text
            if supplemental_display_text:
                logger.info('Assistant said: "%s"', supplemental_display_text)

            microphone_mode = response.dialog_state_out.microphone_mode
            if microphone_mode == DIALOG_FOLLOW_ON:
                continue_conversation = True
                logger.info('Expecting follow-on query from user.')
            elif microphone_mode == CLOSE_MICROPHONE:
                continue_conversation = False
                logger.info('Not expecting follow-on query from user.')

        return True


    def _assist_2(self,recorder, play, deadline):
        continue_conversation = False

        for response in self._assistant.Assist(self._requests(recorder), deadline):
            if response.event_type == END_OF_UTTERANCE:
                logger.info('End of audio request detected.')
                recorder.done()
            
            # Process 'speech_results'.
            if response.speech_results:
                result = ' '.join(r.transcript for r in response.speech_results)
                logger.info('You said assist2: "%s".',
                            result)
                result = result.lower()
                if ('chicken feet' in result.lower()):
                    logger.info("WE can start recording")
                    self.record_candy()
                    return True
                if ('birthday gift' in result.lower()):
                    self.play_birthday()
                    return True
                if ('everybody ready' in result.lower()):
                    logger.info("google will be shut up")
                    return False
                if('shut up' in result.lower()):
                    logger.info("shup up")
                    return False
                if('next question' in result.lower()):
                    logger.info("prepared")
                    play(_normalize_audio_buffer(response.audio_out.audio_data,
                                            self._volume_percentage))
                    return False
                if('play memory' in result.lower()):
                    logger.info("play record")
                    self.play_candy()
                    return True
                if ('fairest of us all' in result.lower()):
                    tts.say("teddy bear, is the fairest in the world")
                    return True                

            # Process 'audio_out'.
            if response.audio_out.audio_data:
                recorder.done()  # Just in case.
                play(_normalize_audio_buffer(response.audio_out.audio_data,
                                             self._volume_percentage))

            # Process 'dialog_state_out'.
            if response.dialog_state_out.conversation_state:
                conversation_state = response.dialog_state_out.conversation_state
                logger.debug('Updating conversation state.')
                self._conversation_state = conversation_state  # Mutable state change.

            volume_percentage = response.dialog_state_out.volume_percentage
            if volume_percentage:
                logger.info('Setting volume to %s%%', volume_percentage)
                self._volume_percentage = volume_percentage  # Mutable state change.

            supplemental_display_text = response.dialog_state_out.supplemental_display_text
            if supplemental_display_text:
                logger.info('Assistant said: "%s"', supplemental_display_text)

            microphone_mode = response.dialog_state_out.microphone_mode
            if microphone_mode == DIALOG_FOLLOW_ON:
                continue_conversation = True
                logger.info('Expecting follow-on query from user.')
            elif microphone_mode == CLOSE_MICROPHONE:
                continue_conversation = False
                logger.info('Not expecting follow-on query from user.')

        return True

    def record_candy(self):
        #record scene: refusing to eat durian
        parser = argparse.ArgumentParser()
        parser.add_argument('--filename', '-f', default='recording.wav')
        args = parser.parse_args()
        def wait():
            time.sleep(7)
        record_file(AudioFormat.CD, filename=args.filename, wait=wait, filetype='wav')

    def record_birthday(self):
        #start to record singing of birthday-song
        parser = argparse.ArgumentParser()
        parser.add_argument('--filename_2', '-f', default='birthday.wav')
        args = parser.parse_args()
        def wait():
            time.sleep(23)
        record_file(AudioFormat.CD, filename=args.filename_2, wait=wait, filetype='wav')

    def play_candy(self):
        # play .wav of recording scene: refusing to eat durian
        parser = argparse.ArgumentParser()
        parser.add_argument('--filename', '-f', default='recording.wav')
        args = parser.parse_args()
        print('Playing...')
        play_wav(args.filename)        
        print('Done.')


    def play_birthday(self):
        # play .wav of birthday song
        parser = argparse.ArgumentParser()
        parser.add_argument('--filename2', '-f', default='birthday.wav')
        args = parser.parse_args()
        print('Playing...')
        play_wav(args.filename2)        
        print('Done.')

    

    def conversation2(self,deadline=DEFAULT_GRPC_DEADLINE):
        """
        Starts a conversation with the Google Assistant.
        The device begins listening for your query or command and will wait indefinitely.
        Once it completes a query/command, it returns to listening for another.
        Args:
            deadline: The amount of time (in milliseconds) to wait for each gRPC request to
                complete before terminating.
        """
        keep_talking = True
        while keep_talking:
            playing = False           
            with Recorder() as recorder, BytesPlayer() as player:
                play = player.play(AUDIO_FORMAT)
                def wrapped_play(data):
                    nonlocal playing
                    if not playing:
                        self._playing_started()
                        playing = True
                    play(data)

                try:
                    logger.info("FINALLY")
                    keep_talking = self._assist_2(recorder, wrapped_play, deadline)
                finally:
                    play(None)       # Signal end of sound stream.
                    recorder.done()  # Signal stop recording.

            if playing:
                self._playing_stopped()




class AssistantServiceClientWithLed(AssistantServiceClient):
    """ 
    Same as :class:`AssistantServiceClient` but this also turns the
    Voice Kit's button LED on and off in response to the conversation.
    Args:
        board: An instance of :class:`~aiy.board.Board`.
        language_code: Language expected from the user, in IETF BCP 47 syntax (default is "en-US").
            See the `list of supported languages`_.
        volume_percentage: Volume level of the audio output. Valid values are 1 to 100
            (corresponding to 1% to 100%).
    """
    def _update_led(self, state, brightness):
        self._board.led.state = state
        self._board.led.brightness = brightness

    def __init__(self, board, language_code='en-US', volume_percentage=100):
        super().__init__(language_code, volume_percentage)

        self._board = board
        self._update_led(Led.ON, 0.1)

    def _recording_started(self):
        super()._recording_started()
        self._update_led(Led.ON, 1.0)

    def _recording_stopped(self):
        self._update_led(Led.ON, 0.1)
        super()._recording_stopped()

    def _playing_started(self):
        super()._playing_started()
        self._update_led(Led.PULSE_SLOW, 1.0)

    def _playing_stopped(self):
        self._update_led(Led.ON, 0.1)
        super()._playing_stopped()

My cart

Shop

Project Hub

Video

Blog

Face It 👀, Save All Your Happy Moments Via Google Aiy

About the project

Project info