Chirp Tutorial: How to use Google's Chirp speech-to-text AI model on Google Cloud console.

Wednesday, June 14, 2023 by abdibrokhim

Introduction

Chirp is Google Cloud's 2B-parameter speech model built via self-supervised training on millions of hours of audio and 28 billion sentences of text spanning 100+ languages. Chirp delivers 98% speech recognition accuracy in English and over 300% relative improvement in several languages with less than 10M speakers.

What we are going to do?

In this tutorial, we will learn how to effortlessly set up Google Cloud console to leverage the powerful Chirp speech-to-text AI model. This comprehensive tutorial provides step-by-step guide for an easy setup process and a quick start to use Chirp speech-to-text AI model. Sit back, relax, enjoy the tutorial and don't forget to make a cup of coffee ☕️.

Learning outcomes

How to use Google Cloud console.
How to set up Google's Chirp speech-to-text AI model on Google Cloud console.

We will cover the following steps:

Creating a Google Cloud account.
Creating a new project on the Google Cloud console.
Enabling the Speech API.
Creating an STT (Speech-to-Text) Recognizer using the Chirp model.
Creating a new Workspace for the project.
Performing a transcription on an audio file.
Viewing and downloading the transcription results.

Prerequisites

Nothing! Just a cup of coffee ☕️ and a laptop 💻.

Getting started

Step 1: Create a Google Cloud account

First, we need to create a Google Cloud account. If you already have one, you can skip this step and go here. If you don't have one, you can create a free account go here.

Google Cloud Console Create account — Google Cloud console Create account

Step 2: Create a new project

Next, we need to create a new project. On the top left corner click on the project dropdown menu > New Project.

Google Cloud console creating new project

Give your project a name and click Create.

Step 3: Enable API

Go to Speech in Google Cloud console. Click ENABLE API.

Google Cloud console enable API — Google Cloud console enableAPI

Step 4: Create an STT Recognizer

Go over left sidebar navigation and click Recognizers > CREATE RECOGNIZERS.

Give your recognizer a name chirp-recognizer. Select Chirp as the model. Select the language en-US in our case, leave the rest of the settings as default and click Save.

Step 5: Create a new Workspace

Now, we need to create a new Workspace for our project. Go to Workspace dropdown menu and click New Workspace. You should see a sidebar pops up on the right side of your screen.

From the Create a new workspace navigation sidebar, click Browse > Create a new bucket. Give your bucket a name chirp-bucket and click Continue.

Note: Rest of the settings are optional, you can leave them as default.

Click Create and you should see a new bucket is created as shown below.

Then, click Select > Continue > Create to finish creating your workspace for the speech-to-text UI.

Perfet! Now, we have a workspace for our speech-to-text UI. Let's move on to the next step.

Step 6: Create a new Transcription

Let's perform a transcription on our actual audio file.

Go over left sidebar navigation and click Transcription > New Transcription. From the New Transcription page, select your audio file through either uploading audio files Local upload or specifying an existing Cloud Storage file Cloud storage.

In this tutorial, we will use Local upload option and this audio file.

Note: The UI will assess your audio file parameters automatically, and you can also specify them manually.

Click Continue.

By default API version is set to V1, change it to V2, specify the language English (United States) - en-US in our case spoken in the audio file from the dropdown menu, then choose Chirp as transcription model and our newly created chirp-recognizer as the recognizer.

Google Cloud console transcription options

Click Submit and wait for a few seconds.

Step 7: View Transcription Results and Download

To see the transcription results, click the name of your transcription.

Google Cloud console transcription results

You may also download the transcription results in 4 different formats: JSON, TXT, SRT, and CSV. Let's download the transcription results in TXT format. Click Download > TXT > `Download.

Google Cloud console transcription download options

Wrapping Up

This comprehensive guide has led you through implementing Google Chirp's speech-to-text AI model on the Google Cloud console. With step-by-step directions, you can now tap into Chirp's capabilities for precise speech recognition.

This AI tutorial aimed to provide a user-friendly, detailed roadmap, complete with screenshots, ensuring a smooth setup experience for newcomers and experienced Google Cloud users alike.

Upon completing this tutorial, you should have the know-how required to apply Google Chirp's speech-to-text AI model effectively. Dive into Chirp's potential, incorporating it into projects and applications, and experiment with various languages and audio files.

Put your newfound knowledge to the test in our upcoming AI Hackathon!

Cheers to your AI journey!

If you have any questions, feel free to reach out to me on LinkedIn or Twitter. I'd love to hear from you!

made with 💜 by abdibrokhim for lablab.ai tutorials.