Aidoc
Source: https://ai.google.dev/gemini-api/docs/openai#javascript
Gemini models are accessible using the OpenAI libraries (Python and TypeScript / JavaScript) along with the REST API, by updating three lines of code and using your Gemini API key → https://aistudio.google.com/apikey. If you aren't already using the OpenAI libraries, we recommend that you call the Gemini API directly → https://ai.google.dev/gemini-api/docs/quickstart.
Python
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

response = client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain to me how AI works"}
    ]
)

print(response.choices[0].message)
JavaScript
import OpenAI from "openai";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/"
});

const response = await openai.chat.completions.create({
    model: "gemini-2.5-flash",
    messages: [
        { role: "system", content: "You are a helpful assistant." },
        { role: "user", content: "Explain to me how AI works" },
    ],
});

console.log(response.choices[0].message);
REST
curl "https://generativelanguage.googleapis.com/v1beta/openai/chat/completions" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer GEMINI_API_KEY" \
-d '{
    "model": "gemini-2.0-flash",
    "messages": [
        {"role": "user", "content": "Explain to me how AI works"}
    ]
}'
Thinking
Gemini 2.5 models are trained to think through complex problems, leading to significantly improved reasoning. The Gemini API comes with a "thinking budget" parameter → https://ai.google.dev/gemini-api/docs/thinking#set-budget which gives fine-grained control over how much the model will think.
Unlike the Gemini API's token-based thinking budget, the OpenAI compatibility layer offers three levels of thinking control: "low", "medium", and "high", which map to 1,024, 8,192, and 24,576 tokens, respectively.
If you want to disable thinking, you can set reasoning_effort to
"none" (note that reasoning cannot be turned off for 2.5 Pro models).
Python
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

response = client.chat.completions.create(
    model="gemini-2.5-flash",
    reasoning_effort="low",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain to me how AI works"}
    ]
)

print(response.choices[0].message)
JavaScript
import OpenAI from "openai";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/"
});

const response = await openai.chat.completions.create({
    model: "gemini-2.5-flash",
    reasoning_effort: "low",
    messages: [
        { role: "system", content: "You are a helpful assistant." },
        { role: "user", content: "Explain to me how AI works" },
    ],
});

console.log(response.choices[0].message);
REST
curl "https://generativelanguage.googleapis.com/v1beta/openai/chat/completions" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer GEMINI_API_KEY" \
-d '{
    "model": "gemini-2.5-flash",
    "reasoning_effort": "low",
    "messages": [
        {"role": "user", "content": "Explain to me how AI works"}
    ]
}'
Streaming
The Gemini API supports streaming responses → https://ai.google.dev/gemini-api/docs/text-generation?lang=python#generate-a-text-stream.
Python
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

response = client.chat.completions.create(
    model="gemini-2.0-flash",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"}
    ],
    stream=True
)

for chunk in response:
    print(chunk.choices[0].delta)
JavaScript
import OpenAI from "openai";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/"
});

async function main() {
    const completion = await openai.chat.completions.create({
        model: "gemini-2.0-flash",
        messages: [
            { role: "system", content: "You are a helpful assistant." },
            { role: "user", content: "Hello!" },
        ],
        stream: true,
    });

    for await (const chunk of completion) {
        console.log(chunk.choices[0].delta.content);
    }
}

main();
REST
curl "https://generativelanguage.googleapis.com/v1beta/openai/chat/completions" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer GEMINI_API_KEY" \
-d '{
    "model": "gemini-2.0-flash",
    "messages": [
        {"role": "user", "content": "Explain to me how AI works"}
    ],
    "stream": true
}'
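Each streamed chunk carries an incremental delta rather than the full reply. Below is a minimal sketch that replaces the final loop of the Python streaming example above and assembles the deltas into the complete text:

Python
# Assemble the streamed deltas into the full reply text.
full_text = ""
for chunk in response:
    if chunk.choices and chunk.choices[0].delta.content:
        full_text += chunk.choices[0].delta.content

print(full_text)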
Function calling
Function calling makes it easier for you to get structured data outputs from generative models and is supported in the Gemini API → https://ai.google.dev/gemini-api/docs/function-calling/tutorial.
Python
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the weather in a given location",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "The city and state, e.g. Chicago, IL",
                    },
                    "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
                },
                "required": ["location"],
            },
        }
    }
]

messages = [{"role": "user", "content": "What's the weather like in Chicago today?"}]

response = client.chat.completions.create(
    model="gemini-2.0-flash",
    messages=messages,
    tools=tools,
    tool_choice="auto"
)

print(response)
JavaScript
import OpenAI from "openai";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/"
});

async function main() {
    const messages = [{ role: "user", content: "What's the weather like in Chicago today?" }];
    const tools = [
        {
            type: "function",
            function: {
                name: "get_weather",
                description: "Get the weather in a given location",
                parameters: {
                    type: "object",
                    properties: {
                        location: {
                            type: "string",
                            description: "The city and state, e.g. Chicago, IL",
                        },
                        unit: { type: "string", enum: ["celsius", "fahrenheit"] },
                    },
                    required: ["location"],
                },
            },
        },
    ];

    const response = await openai.chat.completions.create({
        model: "gemini-2.0-flash",
        messages: messages,
        tools: tools,
        tool_choice: "auto",
    });

    console.log(response);
}

main();
REST
curl "https://generativelanguage.googleapis.com/v1beta/openai/chat/completions" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer GEMINI_API_KEY" \
-d '{
    "model": "gemini-2.0-flash",
    "messages": [
      {
        "role": "user",
        "content": "What'\''s the weather like in Chicago today?"
      }
    ],
    "tools": [
      {
        "type": "function",
        "function": {
          "name": "get_weather",
          "description": "Get the current weather in a given location",
          "parameters": {
            "type": "object",
            "properties": {
              "location": {
                "type": "string",
                "description": "The city and state, e.g. Chicago, IL"
              },
              "unit": {
                "type": "string",
                "enum": ["celsius", "fahrenheit"]
              }
            },
            "required": ["location"]
          }
        }
      }
    ],
    "tool_choice": "auto"
}'
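The response above contains the model's tool call rather than a final answer. Below is a minimal sketch that continues the Python example, with a hard-coded result standing in for a real get_weather implementation, showing how the call can be executed and its result sent back for a final reply:

Python
import json

# Continues the Python function calling example above.
message = response.choices[0].message

if message.tool_calls:
    call = message.tool_calls[0]
    args = json.loads(call.function.arguments)

    # Stand-in for a real weather lookup.
    result = {"location": args["location"], "temperature": "15", "unit": args.get("unit", "celsius")}

    follow_up = client.chat.completions.create(
        model="gemini-2.0-flash",
        messages=messages + [
            message,  # the assistant turn that requested the tool call
            {"role": "tool", "tool_call_id": call.id, "content": json.dumps(result)},
        ],
        tools=tools,
    )
    print(follow_up.choices[0].message.content)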
Image understanding
Gemini models are natively multimodal and provide best-in-class performance on many common vision tasks → https://ai.google.dev/gemini-api/docs/vision.
Python
import base64
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

# Function to encode the image
def encode_image(image_path):
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode('utf-8')

# Getting the base64 string
base64_image = encode_image("Path/to/agi/image.jpeg")

response = client.chat.completions.create(
    model="gemini-2.0-flash",
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "What is in this image?",
                },
                {
                    "type": "image_url",
                    "image_url": {
                        "url": f"data:image/jpeg;base64,{base64_image}"
                    },
                },
            ],
        }
    ],
)

print(response.choices[0])
JavaScript
import OpenAI from "openai";
import fs from 'fs/promises';

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/"
});

async function encodeImage(imagePath) {
    try {
        const imageBuffer = await fs.readFile(imagePath);
        return imageBuffer.toString('base64');
    } catch (error) {
        console.error("Error encoding image:", error);
        return null;
    }
}

async function main() {
    const imagePath = "Path/to/agi/image.jpeg";
    const base64Image = await encodeImage(imagePath);

    const messages = [
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "What is in this image?",
                },
                {
                    "type": "image_url",
                    "image_url": {
                        "url": `data:image/jpeg;base64,${base64Image}`
                    },
                },
            ],
        }
    ];

    try {
        const response = await openai.chat.completions.create({
            model: "gemini-2.0-flash",
            messages: messages,
        });

        console.log(response.choices[0]);
    } catch (error) {
        console.error("Error calling Gemini API:", error);
    }
}

main();
REST
bash -c '
  base64_image=$(base64 -i "Path/to/agi/image.jpeg");
  curl "https://generativelanguage.googleapis.com/v1beta/openai/chat/completions" \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer GEMINI_API_KEY" \
    -d "{
      \"model\": \"gemini-2.0-flash\",
      \"messages\": [
        {
          \"role\": \"user\",
          \"content\": [
            { \"type\": \"text\", \"text\": \"What is in this image?\" },
            {
              \"type\": \"image_url\",
              \"image_url\": { \"url\": \"data:image/jpeg;base64,${base64_image}\" }
            }
          ]
        }
      ]
    }"
'
Generate an image
Generate an image:
Python
import base64
from openai import OpenAI
from PIL import Image
from io import BytesIO

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/",
)

response = client.images.generate(
    model="imagen-3.0-generate-002",
    prompt="a portrait of a sheepadoodle wearing a cape",
    response_format='b64_json',
    n=1,
)

for image_data in response.data:
    image = Image.open(BytesIO(base64.b64decode(image_data.b64_json)))
    image.show()
JavaScript
import OpenAI from "openai";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/",
});

async function main() {
    const image = await openai.images.generate({
        model: "imagen-3.0-generate-002",
        prompt: "a portrait of a sheepadoodle wearing a cape",
        response_format: "b64_json",
        n: 1,
    });

    console.log(image.data);
}

main();
REST
curl "https://generativelanguage.googleapis.com/v1beta/openai/images/generations" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer GEMINI_API_KEY" \
-d '{
    "model": "imagen-3.0-generate-002",
    "prompt": "a portrait of a sheepadoodle wearing a cape",
    "response_format": "b64_json",
    "n": 1
}'
Audio understanding
Analyze audio input:
Python
import base64
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

with open("/path/to/your/audio/file.wav", "rb") as audio_file:
    base64_audio = base64.b64encode(audio_file.read()).decode('utf-8')

response = client.chat.completions.create(
    model="gemini-2.0-flash",
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "Transcribe this audio file.",
                },
                {
                    "type": "input_audio",
                    "input_audio": {
                        "data": base64_audio,
                        "format": "wav"
                    }
                }
            ],
        }
    ],
)

print(response.choices[0].message.content)
JavaScript
import OpenAI from "openai";
import fs from "fs/promises";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/"
});

async function main() {
    const base64Audio = (await fs.readFile("/path/to/your/audio/file.wav")).toString("base64");
    const response = await openai.chat.completions.create({
        model: "gemini-2.0-flash",
        messages: [{
            role: "user",
            content: [
                { type: "text", text: "Transcribe this audio file." },
                { type: "input_audio", input_audio: { data: base64Audio, format: "wav" } },
            ],
        }],
    });

    console.log(response.choices[0].message.content);
}

main();
REST
bash -c '
  base64_audio=$(base64 -i "/path/to/your/audio/file.wav");
  curl "https://generativelanguage.googleapis.com/v1beta/openai/chat/completions" \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer GEMINI_API_KEY" \
    -d "{
      \"model\": \"gemini-2.0-flash\",
      \"messages\": [
        {
          \"role\": \"user\",
          \"content\": [
            { \"type\": \"text\", \"text\": \"Transcribe this audio file.\" },
            {
              \"type\": \"input_audio\",
              \"input_audio\": {
                \"data\": \"${base64_audio}\",
                \"format\": \"wav\"
              }
            }
          ]
        }
      ]
    }"
'
Structured output
Gemini models can output JSON objects in any structure you define → https://ai.google.dev/gemini-api/docs/structured-output.
Python
from pydantic import BaseModel
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

class CalendarEvent(BaseModel):
    name: str
    date: str
    participants: list[str]

completion = client.beta.chat.completions.parse(
    model="gemini-2.0-flash",
    messages=[
        {"role": "system", "content": "Extract the event information."},
        {"role": "user", "content": "John and Susan are going to an AI conference on Friday."},
    ],
    response_format=CalendarEvent,
)

print(completion.choices[0].message.parsed)
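If the model followed the schema, the parsed attribute is a CalendarEvent instance (it is None if the model refused or its output could not be parsed), so its fields can be read directly. A small follow-up to the Python example above:

Python
# Continues the Python structured output example above.
event = completion.choices[0].message.parsed
if event is not None:
    print(event.name)
    print(event.date)
    print(event.participants)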
JavaScript
import OpenAI from "openai";
import { zodResponseFormat } from "openai/helpers/zod";
import { z } from "zod";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai"
});

const CalendarEvent = z.object({
    name: z.string(),
    date: z.string(),
    participants: z.array(z.string()),
});

const completion = await openai.beta.chat.completions.parse({
    model: "gemini-2.0-flash",
    messages: [
        { role: "system", content: "Extract the event information." },
        { role: "user", content: "John and Susan are going to an AI conference on Friday" },
    ],
    response_format: zodResponseFormat(CalendarEvent, "event"),
});

console.log(completion.choices[0].message.parsed);
Embeddings
Text embeddings measure the relatedness of text strings and can be generated using the Gemini API → https://ai.google.dev/gemini-api/docs/embeddings.
Python
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

response = client.embeddings.create(
    input="Your text string goes here",
    model="text-embedding-004"
)

print(response.data[0].embedding)
JavaScript
import OpenAI from "openai";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/"
});

async function main() {
    const embedding = await openai.embeddings.create({
        model: "text-embedding-004",
        input: "Your text string goes here",
    });

    console.log(embedding);
}

main();
REST
curl "https://generativelanguage.googleapis.com/v1beta/openai/embeddings" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer GEMINI_API_KEY" \
-d '{
    "input": "Your text string goes here",
    "model": "text-embedding-004"
}'
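Relatedness is typically judged by comparing the returned vectors, for example with cosine similarity. Below is a minimal sketch in plain Python; it is not part of the API and assumes the endpoint accepts a list input, as the OpenAI embeddings API does:

Python
import math

def cosine_similarity(a, b):
    # Dot product divided by the product of the vector norms.
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

# Reuses the client configured above; embeds two strings in one call.
response = client.embeddings.create(
    model="text-embedding-004",
    input=["Your text string goes here", "A second string to compare"],
)
vec_a, vec_b = response.data[0].embedding, response.data[1].embedding
print(cosine_similarity(vec_a, vec_b))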
extra_body
There are several features supported by Gemini that are not available in OpenAI models but can be enabled using the extra_body field.

extra_body features
safety_settings: Corresponds to Gemini's SafetySetting.
cached_content: Corresponds to Gemini's GenerateContentRequest.cached_content.
thinking_config: Corresponds to Gemini's ThinkingConfig.

cached_content
Here's an example of using extra_body to set cached_content:
Python
from openai import OpenAI

client = OpenAI(
    api_key=MY_API_KEY,
    base_url="https://generativelanguage.googleapis.com/v1beta/"
)

stream = client.chat.completions.create(
    model="gemini-2.5-pro",
    n=1,
    messages=[
        {
            "role": "user",
            "content": "Summarize the video"
        }
    ],
    stream=True,
    stream_options={'include_usage': True},
    extra_body={
        'extra_body':
        {
            'google': {
                'cached_content': "cachedContents/0000aaaa1111bbbb2222cccc3333dddd4444eeee"
            }
        }
    }
)

for chunk in stream:
    # usage is only populated once the stream completes
    if chunk.usage:
        print(chunk.usage.to_dict())
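thinking_config can be passed through extra_body in the same way. Below is a minimal sketch; the snake_case field names thinking_budget and include_thoughts are assumptions based on Gemini's ThinkingConfig and may need adjusting:

Python
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

# Sketch only: the field names inside thinking_config are assumed from
# Gemini's ThinkingConfig.
response = client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[{"role": "user", "content": "Explain to me how AI works"}],
    extra_body={
        "extra_body": {
            "google": {
                "thinking_config": {
                    "thinking_budget": 800,
                    "include_thoughts": True,
                }
            }
        }
    },
)

print(response.choices[0].message)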
List models
Get a list of available Gemini models:
Python
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

models = client.models.list()
for model in models:
    print(model.id)
JavaScript
import OpenAI from "openai";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/",
});

async function main() {
    const list = await openai.models.list();

    for await (const model of list) {
        console.log(model);
    }
}

main();
REST
curl https://generativelanguage.googleapis.com/v1beta/openai/models \
-H "Authorization: Bearer GEMINI_API_KEY"
Retrieve a model
Retrieve a Gemini model:
Python
from openai import OpenAI

client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

model = client.models.retrieve("gemini-2.0-flash")
print(model.id)
JavaScript
import OpenAI from "openai";

const openai = new OpenAI({
    apiKey: "GEMINI_API_KEY",
    baseURL: "https://generativelanguage.googleapis.com/v1beta/openai/",
});

async function main() {
    const model = await openai.models.retrieve("gemini-2.0-flash");
    console.log(model.id);
}

main();
REST
curl https://generativelanguage.googleapis.com/v1beta/openai/models/gemini-2.0-flash \
-H "Authorization: Bearer GEMINI_API_KEY"
What's next
Try our OpenAI Compatibility Colab → https://colab.sandbox.google.com/github/google-gemini/cookbook/blob/main/quickstarts/Get_started_OpenAI_Compatibility.ipynb to work through more detailed examples.
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License → https://creativecommons.org/licenses/by/4.0/, and code samples are licensed under the Apache 2.0 License → https://www.apache.org/licenses/LICENSE-2.0. For details, see the Google Developers Site Policies → https://developers.google.com/site-policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-06-18 UTC.