Interface: ChatOptions

Defined in: providers/types.ts:193

Options for chat requests

Properties

enablePromptCaching?

optional enablePromptCaching?: boolean;

Defined in: providers/types.ts:230

Enable prompt caching for system prompt and tools (Claude-specific)

When enabled, the system prompt and tool definitions are cached server-side, reducing token costs by up to 90% on subsequent requests.

  • Cache write: 1.25x base input cost (first request)
  • Cache read: 0.1x base input cost (subsequent requests within 5 min)

Default

Provider-level setting (typically true)

maxTokens?

optional maxTokens?: number;

Defined in: providers/types.ts:195

model?

optional model?: string;

Defined in: providers/types.ts:194

signal?

optional signal?: AbortSignal;

Defined in: providers/types.ts:217

AbortSignal for cancelling the LLM request. When aborted, the provider should stop streaming and throw/return immediately.

stopSequences?

optional stopSequences?: string[];

Defined in: providers/types.ts:197

temperature?

optional temperature?: number;

Defined in: providers/types.ts:196

thinking?

optional thinking?: ThinkingConfig;

Defined in: providers/types.ts:211

Extended thinking configuration (Claude-specific)

When enabled, Claude will show its reasoning process before providing the final response. Requires budget_tokens >= 1024.

Example

thinking: { type: 'enabled', budgetTokens: 10000 }

tools?

optional tools?: ToolDefinition[];

Defined in: providers/types.ts:198