Whisper 1 | Pricing | Token Size | LMSys Score

Models

Github →

Home /

Whisper 1

OpenAI

Model ID:

whisper-1

Speech to text

Use This Model

Model Overview

FAQs

Related Links

Overview

Whisper 1 is a state-of-the-art speech recognition model designed for high accuracy in transcribing audio to text. It excels in various acoustic environments and supports multiple languages, making it an excellent choice for applications in transcription services and voice-activated systems.

Specializations

Robust Speech-to-Text Transcription: Accurately transcribes audio into text, even in noisy environments.
Multilingual Support: Capable of transcribing speech in multiple languages.
Real-Time Transcription: Enables real-time transcription of audio streams.

Integration Guide (Javascript)

To use this model through Portkey, follow these steps:

1. Install Portkey SDK:

npm install --save portkey-ai

2. Set up client with Portkey:

import fs from "fs";

import OpenAI from 'openai';

import { PORTKEY_GATEWAY_URL, createHeaders } from 'portkey-ai'

const openai = new OpenAI({

apiKey: 'OPENAI_API_KEY',

baseURL: PORTKEY_GATEWAY_URL,

defaultHeaders: createHeaders({

provider: "openai",

apiKey: "PORTKEY_API_KEY"

})

});

3. Make a request:

3. Transcription:

async function transcribe() {

const transcription = await openai.audio.transcriptions.create({

file: fs.createReadStream("/path/to/file.mp3"),

model: "whisper-1",

});

console.log(transcription.text);

}

transcribe();

4. Translation

// Translation

async function translate() {

const translation = await openai.audio.translations.create({

file: fs.createReadStream("/path/to/file.mp3"),

model: "whisper-1",

});

console.log(translation.text);

}

translate();

Model Specifications

Release Date:

21/9/2022

License:

Proprietary

Technical Report/Model Card:

Link