Transcriptions（语音转文字）

POST/v1/audio/transcriptions

将音频转录为输入语言。
转录API接受您想要转录的音频文件作为输入，以及您希望的音频转录输出文件格式。我们目前支持多种输入和输出文件格式。

价格：0.003PTC / 分钟

请求参数

Header 参数

string

必需

示例值:

application/json

Authorization

string

可选

示例值:

Bearer {{YOUR_API_KEY}}

Body 参数multipart/form-data

file

必需

要转录的音频文件，采用以下格式之一：mp3、mp4、mpeg、mpga、m4a、wav 或 webm。

model

string

必需

whisper-large-v3

示例值:

whisper-large-v3

prompt

string

可选

可选文本，用于指导模型的风格或继续之前的音频片段。提示应与音频语言相匹配。

response_format

string

可选

成绩单输出的格式，采用以下选项之一：json、text、verbose_json

示例值:

json

temperature

number

可选

采样温度，介于 0 和 1 之间。较高的值（如 0.8）将使输出更加随机，而较低的值（如 0.2）将使输出更加集中和确定。如果设置为 0，模型将使用对数概率自动升高温度，直到达到特定阈值。

示例值:

示例代码

返回响应

OK(200)

HTTP 状态码: 200

内容格式: JSONapplication/json

text

string

必需

{
  "text": "Imagine the wildest idea that you've ever had, and you're curious about how it might scale to something that's a 100, a 1,000 times bigger. This is a place where you can get to do that."
}

最后修改时间： 3 个月前