Images & PDFs

How to send images and PDFs to OpenRouter

OpenRouter supports sending images and PDFs via the API. This guide will show you how to work with both file types.

Both images and PDFs also work in the chat room.

You can send both PDFs and images in the same request.
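
For example, here is a sketch (not a complete request) of a single user message that carries both an image and a PDF in one content array; the image URL, filename, and pdf_data_url value are placeholders, and both content types are explained in the sections below:

# Sketch: one user message with both an image and a PDF attachment.
# "pdf_data_url" would be a base64 data URL (see PDF Support below);
# the image URL is a placeholder.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "Summarize the chart and the attached report."},
            {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
            {"type": "file", "file": {"filename": "report.pdf", "file_data": pdf_data_url}},
        ],
    }
]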

Image Inputs

Requests with images to multimodal models are available via the /api/v1/chat/completions API with a multi-part messages parameter. The image_url can either be a URL or a base64-encoded image. Note that multiple images can be sent in separate content array entries. The number of images you can send in a single request varies per provider and per model. Due to how the content is parsed, we recommend sending the text prompt first, then the images. If the images must come first, we recommend putting them in the system prompt.
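
For example, here is a sketch (not a complete request) of the message shape for two images, with the text prompt first and each image in its own content entry; the URLs are placeholders:

# Sketch: text prompt first, then one content entry per image.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "Compare these two images."},
            {"type": "image_url", "image_url": {"url": "https://example.com/first.png"}},
            {"type": "image_url", "image_url": {"url": "https://example.com/second.png"}},
        ],
    }
]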

Using Image URLs

Here’s how to send an image using a URL:

import requests
import json

url = "https://openrouter.ai/api/v1/chat/completions"
headers = {
    "Authorization": f"Bearer {API_KEY_REF}",
    "Content-Type": "application/json"
}

messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "What's in this image?"
            },
            {
                "type": "image_url",
                "image_url": {
                    "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
                }
            }
        ]
    }
]

payload = {
    "model": "{{MODEL}}",
    "messages": messages
}

response = requests.post(url, headers=headers, json=payload)
print(response.json())

Using Base64 Encoded Images

For locally stored images, you can send them using base64 encoding. Here’s how to do it:

import requests
import json
import base64
from pathlib import Path

def encode_image_to_base64(image_path):
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode('utf-8')

url = "https://openrouter.ai/api/v1/chat/completions"
headers = {
    "Authorization": f"Bearer {API_KEY_REF}",
    "Content-Type": "application/json"
}

# Read and encode the image
image_path = "path/to/your/image.jpg"
base64_image = encode_image_to_base64(image_path)
data_url = f"data:image/jpeg;base64,{base64_image}"

messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "What's in this image?"
            },
            {
                "type": "image_url",
                "image_url": {
                    "url": data_url
                }
            }
        ]
    }
]

payload = {
    "model": "{{MODEL}}",
    "messages": messages
}

response = requests.post(url, headers=headers, json=payload)
print(response.json())

Supported image content types are:

  • image/png
  • image/jpeg
  • image/webp
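
The base64 example above hard-codes the image/jpeg prefix in the data URL. If you work with several of these formats, one option (a minimal sketch, not part of the OpenRouter API) is to map the file extension to the matching content type before building the data URL:

import base64
from pathlib import Path

# Sketch: map a file extension to one of the supported content types and
# build the matching data URL. Extensions not listed here are rejected.
_EXTENSION_TO_MIME = {
    ".png": "image/png",
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".webp": "image/webp",
}

def image_to_data_url(image_path):
    suffix = Path(image_path).suffix.lower()
    mime_type = _EXTENSION_TO_MIME.get(suffix)
    if mime_type is None:
        raise ValueError(f"Unsupported image type: {suffix}")
    encoded = base64.b64encode(Path(image_path).read_bytes()).decode("utf-8")
    return f"data:{mime_type};base64,{encoded}"

# Example usage (placeholder path):
data_url = image_to_data_url("path/to/your/image.webp")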

PDF Support

OpenRouter supports PDF processing through the /api/v1/chat/completions API. PDFs can be sent as base64-encoded data URLs in the messages array, via the file content type. This feature works on any model on OpenRouter.

When a model supports file input natively, the PDF is passed directly to the model. When the model does not support file input natively, OpenRouter will parse the file and pass the parsed results to the requested model.

Note that multiple PDFs can be sent in separate content array entries. The number of PDFs you can send in a single request varies per provider and per model. Due to how the content is parsed, we recommend sending the text prompt first, then the PDF. If the PDF must come first, we recommend putting it in the system prompt.

Processing PDFs

Here’s how to send and process a PDF:

import requests
import json
import base64
from pathlib import Path

def encode_pdf_to_base64(pdf_path):
    with open(pdf_path, "rb") as pdf_file:
        return base64.b64encode(pdf_file.read()).decode('utf-8')

url = "https://openrouter.ai/api/v1/chat/completions"
headers = {
    "Authorization": f"Bearer {API_KEY_REF}",
    "Content-Type": "application/json"
}

# Read and encode the PDF
pdf_path = "path/to/your/document.pdf"
base64_pdf = encode_pdf_to_base64(pdf_path)
data_url = f"data:application/pdf;base64,{base64_pdf}"

messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "What are the main points in this document?"
            },
            {
                "type": "file",
                "file": {
                    "filename": "document.pdf",
                    "file_data": data_url
                }
            },
        ]
    }
]

# Optional: Configure PDF processing engine
# PDF parsing will still work even if the plugin is not explicitly set
plugins = [
    {
        "id": "file-parser",
        "pdf": {
            "engine": "{{ENGINE}}"  # defaults to "{{DEFAULT_PDF_ENGINE}}". See Pricing below
        }
    }
]

payload = {
    "model": "{{MODEL}}",
    "messages": messages,
    "plugins": plugins
}

response = requests.post(url, headers=headers, json=payload)
print(response.json())

Pricing

OpenRouter provides several PDF processing engines:

  1. "mistral-ocr": Best for scanned documents or PDFs with images ($2 per 1,000 pages).
  2. "pdf-text": Best for well-structured PDFs with clear text content (Free).
  3. "native": Only available for models that support file input natively (charged as input tokens).

If you don’t explicitly specify an engine, OpenRouter will default first to the model’s native file processing capabilities, and if that’s not available, we will use the "mistral-ocr" engine.
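
As a rough illustration of the mistral-ocr rate above (a sketch with a made-up page count, not an official billing calculation), parsing cost scales linearly with the number of pages:

# Sketch: estimate mistral-ocr parsing cost at $2 per 1,000 pages.
MISTRAL_OCR_USD_PER_1000_PAGES = 2.00

def estimate_mistral_ocr_cost(num_pages):
    return num_pages / 1000 * MISTRAL_OCR_USD_PER_1000_PAGES

# A 250-page scanned PDF (made-up example) would cost about $0.50 to parse.
print(estimate_mistral_ocr_cost(250))  # 0.5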

To select an engine, use the plugin configuration:

plugins = [
    {
        "id": "file-parser",
        "pdf": {
            "engine": "{{ENGINE}}"
        }
    }
]

Skip Parsing Costs

When you send a PDF to the API, the response may include file annotations in the assistant’s message. These annotations contain structured information about the PDF document that was parsed. By sending these annotations back in subsequent requests, you can avoid re-parsing the same PDF document multiple times, which saves both processing time and costs.

Here’s how to reuse file annotations:

import requests
import json
import base64
from pathlib import Path

# First, encode and send the PDF
def encode_pdf_to_base64(pdf_path):
    with open(pdf_path, "rb") as pdf_file:
        return base64.b64encode(pdf_file.read()).decode('utf-8')

url = "https://openrouter.ai/api/v1/chat/completions"
headers = {
    "Authorization": f"Bearer {API_KEY_REF}",
    "Content-Type": "application/json"
}

# Read and encode the PDF
pdf_path = "path/to/your/document.pdf"
base64_pdf = encode_pdf_to_base64(pdf_path)
data_url = f"data:application/pdf;base64,{base64_pdf}"

# Initial request with the PDF
messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "What are the main points in this document?"
            },
            {
                "type": "file",
                "file": {
                    "filename": "document.pdf",
                    "file_data": data_url
                }
            },
        ]
    }
]

payload = {
    "model": "{{MODEL}}",
    "messages": messages
}

response = requests.post(url, headers=headers, json=payload)
response_data = response.json()

# Store the annotations from the response
file_annotations = None
if response_data.get("choices") and len(response_data["choices"]) > 0:
    if "annotations" in response_data["choices"][0]["message"]:
        file_annotations = response_data["choices"][0]["message"]["annotations"]

# Follow-up request reusing the annotations (the PDF is included again,
# but the annotations let OpenRouter skip re-parsing it)
if file_annotations:
    follow_up_messages = [
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "What are the main points in this document?"
                },
                {
                    "type": "file",
                    "file": {
                        "filename": "document.pdf",
                        "file_data": data_url
                    }
                }
            ]
        },
        {
            "role": "assistant",
            "content": "The document contains information about...",
            "annotations": file_annotations
        },
        {
            "role": "user",
            "content": "Can you elaborate on the second point?"
        }
    ]

    follow_up_payload = {
        "model": "{{MODEL}}",
        "messages": follow_up_messages
    }

    follow_up_response = requests.post(url, headers=headers, json=follow_up_payload)
    print(follow_up_response.json())

When you include the file annotations from a previous response in your subsequent requests, OpenRouter will use this pre-parsed information instead of re-parsing the PDF, which saves processing time and costs. This is especially beneficial for large documents or when using the mistral-ocr engine, which incurs additional costs.

Response Format

The API will return a response in the following format:

{
  "id": "gen-1234567890",
  "provider": "DeepInfra",
  "model": "google/gemma-3-27b-it",
  "object": "chat.completion",
  "created": 1234567890,
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "The document discusses..."
      }
    }
  ],
  "usage": {
    "prompt_tokens": 1000,
    "completion_tokens": 100,
    "total_tokens": 1100
  }
}
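
Assuming a response with this shape, here is a minimal sketch for pulling out the assistant's reply and the token usage, reusing the response object from any of the examples above:

# Sketch: extract the assistant's reply and token usage from the response.
# "response" is the requests.Response returned by requests.post above.
response_data = response.json()

reply = response_data["choices"][0]["message"]["content"]
usage = response_data.get("usage", {})

print(reply)
print(f"prompt tokens: {usage.get('prompt_tokens')}, "
      f"completion tokens: {usage.get('completion_tokens')}")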