Skip to content

Gemini媒体识别

Gemini媒体识别

POST /v1beta/models/{model}:generateContent

Gemini图像、PDF、音频、视频识别请求

⚠️注意:仅支持通过 inlineData 以 base64 方式上传图像、PDF、音频、视频,不支持 fileData.fileUri 或 File API。

路径参数 (Path Params)

参数名类型必选描述示例
modelstring模型名称``

请求头 (Headers)

参数名类型必选描述示例
Authorizationstring鉴权令牌Bearer ********************

请求体 (Request Body)

Content-Type: application/json

字段类型必选描述
contentsarray of object
contents[].rolestring
contents[].partsarray of object
contents[].parts[].textstring
contents[].parts[].inlineDataobject
contents[].parts[].inlineData.mimeTypestring
contents[].parts[].inlineData.datastring
generationConfigobject
generationConfig.temperaturenumber
generationConfig.topPnumber
generationConfig.topKinteger
generationConfig.maxOutputTokensinteger
generationConfig.stopSequencesarray of string
safetySettingsarray of object
safetySettings[].categorystring
safetySettings[].thresholdstring
toolsarray of object
systemInstructionobject
systemInstruction.partsarray of object

请求体示例

json
{
  "contents": [
    {
      "role": "string",
      "parts": [
        {
          "text": "string",
          "inlineData": {
            "mimeType": "string",
            "data": "string"
          }
        }
      ]
    }
  ],
  "generationConfig": {
    "temperature": 0,
    "topP": 0,
    "topK": 0,
    "maxOutputTokens": 0,
    "stopSequences": [
      "string"
    ]
  },
  "safetySettings": [
    {
      "category": "string",
      "threshold": "string"
    }
  ],
  "tools": [
    {}
  ],
  "systemInstruction": {
    "parts": [
      {}
    ]
  }
}

返回响应 (Responses)

200 - 成功

字段类型描述
candidatesarray of object
candidates[].contentobject
candidates[].content.rolestring
candidates[].content.partsarray of object
candidates[].finishReasonstring
candidates[].safetyRatingsarray of object
usageMetadataobject
usageMetadata.promptTokenCountinteger
usageMetadata.candidatesTokenCountinteger
usageMetadata.totalTokenCountinteger

响应示例

json
{
  "candidates": [
    {
      "content": {
        "role": "string",
        "parts": [
          {}
        ]
      },
      "finishReason": "string",
      "safetyRatings": [
        {}
      ]
    }
  ],
  "usageMetadata": {
    "promptTokenCount": 0,
    "candidatesTokenCount": 0,
    "totalTokenCount": 0
  }
}

Released under the MIT License.