All OpenRouter endpoints with 8k (8,192 tokens) or less context
length will default to using context compression. To disable this, pass
plugins: [{"id": "context-compression", "enabled": false}] in the request body.image_url parts. Multimodal models that output both text and images (e.g., Gemini, gpt-image) still use compression since they have real text context windows.
The middle of the prompt is compressed because LLMs pay less attention to the middle of sequences.