Fixed score and feedback emission to support live correlation context and unanchored annotations. (#14942)
Fixed a crash when using provider-defined tools (like openai.tools.webSearch()) with autoResumeSuspendedTools enabled. (#14940)
Fixed an AsyncLocalStorage runtime error when importing @mastra/core/observability in browser environments. (#14948)
Fixed assistant message prefill error crashing sessions. When a model does not support assistant message prefill, the harness now automatically retries with a user message instead of failing. (#14953)
Added error name and stack trace to SpanErrorInfo, allowing exporters to access the original error class name and stack trace for richer error reporting. (#14944)
Fixed workflow spans missing entityName, which caused the metrics dashboard to show 'unknown' for workflow trace volume (#14949)
Added a new trimMode option with a contiguous strategy that preserves a continuous suffix of messages by stopping at the first message that exceeds the token budget. Default behavior remains unchanged. (#14801)
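The stopping rule can be sketched as follows. This is an illustrative sketch, not Mastra's implementation: `trimContiguous` and the rough 4-characters-per-token estimate are stand-ins.

```typescript
type Msg = { role: string; content: string };

// Rough token estimate (~4 chars per token); illustrative heuristic only.
const estimateTokens = (m: Msg): number => Math.ceil(m.content.length / 4);

// Walk from the newest message backward, accumulating token cost.
// The first message that would exceed the budget ends the scan, so the
// result is always a contiguous suffix of the conversation.
function trimContiguous(messages: Msg[], budget: number): Msg[] {
  const kept: Msg[] = [];
  let used = 0;
  for (let i = messages.length - 1; i >= 0; i--) {
    const cost = estimateTokens(messages[i]);
    if (used + cost > budget) break; // stop at the first over-budget message
    used += cost;
    kept.unshift(messages[i]);
  }
  return kept;
}
```

Because the scan stops rather than skips, an oversized message in the middle of history also shields everything older than it, which keeps the retained context contiguous.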
Added component-scoped logging with custom filtering to ConsoleLogger (#14947)
new ConsoleLogger({
  level: 'debug',
  filter: ({ component }) => component === 'AGENT',
});
Added scorer tracing and exported scores through the observability bus. (#14920)
What changed
Scorer execution now emits SCORER_RUN and SCORER_STEP spans, and scores are exported via mastra.observability.addScore() when a target trace is available.
Why
This makes scorer execution easier to debug and starts moving scorer results onto the new observability-based score pipeline.
Example
await scorer.run({
  input,
  output,
  scoreSource: 'experiment',
  targetScope: 'span',
  targetTraceId: traceId,
  targetSpanId: spanId,
});
Added DualLogger that transparently forwards all infrastructure logger calls (debug, info, warn, error, trackException) to the observability system (loggerVNext). This means all internal Mastra logs now automatically appear in your observability storage (e.g. DuckDB) without any code changes. (#14899)
trackException now extracts structured error data (errorId, domain, category, details, cause) and forwards it as an error-level log to observability storage, so exceptions are queryable alongside regular logs.
Added logging config option to ObservabilityInstance for controlling which logs reach observability storage:
new Observability({
  instance: new MastraObservability({
    logging: {
      enabled: true, // set to false to disable log forwarding
      level: 'info', // minimum level: 'debug' | 'info' | 'warn' | 'error' | 'fatal'
    },
  }),
});
Added a registerExporter method to the observability stack and the Mastra class for runtime exporter registration. (#14730)
Fixed Anthropic API rejection of empty user text content blocks. (#14906)
User messages containing only empty text parts (e.g., { type: 'text', text: '' }) are now filtered out before being sent to the LLM. This prevents the "text content blocks must be non-empty" error that could occur when corrupted messages existed in the database.
Note: The root cause of how these empty user messages get persisted is still under investigation.
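The filtering idea can be sketched as follows; the types are simplified stand-ins, not Mastra's internal message shapes.

```typescript
type TextPart = { type: 'text'; text: string };
type Message = { role: 'user' | 'assistant'; parts: TextPart[] };

// Drop user messages whose text parts are all empty, since providers
// like Anthropic reject empty text content blocks.
function dropEmptyUserMessages(messages: Message[]): Message[] {
  return messages.filter(
    m =>
      m.role !== 'user' ||
      m.parts.some(p => p.type === 'text' && p.text.trim() !== ''),
  );
}
```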
Improved the pattern field description in the list_files workspace tool to prevent AI models from passing "*" when they intend to match all files. The description now clarifies that omitting pattern lists all files, that * only matches within a single directory level (standard glob), and that glob patterns only filter files while directories are always shown. (#14897)
Added a lastMessageOnly option to the LLM-backed moderation, language detection, prompt injection, PII, and system prompt scrubber processors so they can inspect only the newest message instead of re-checking the full conversation on every run. (#14903)
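Conceptually, lastMessageOnly just narrows the set of messages a processor inspects. A minimal sketch of that selection (names and types illustrative, not the processor API):

```typescript
type Message = { role: string; content: string };

// With lastMessageOnly, a processor checks just the newest message
// instead of re-checking the full conversation on every run.
function selectMessagesToCheck(
  messages: Message[],
  lastMessageOnly: boolean,
): Message[] {
  return lastMessageOnly ? messages.slice(-1) : messages;
}
```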
Fixed providerMetadata (e.g. Gemini's thoughtSignature) being stripped from tool-call events when using the non-streaming (generate) code path (#14900)
Standardized all logger calls across the codebase to use static string messages with structured data objects. Dynamic values are now passed as key-value pairs in the second argument instead of being interpolated into template literal strings. This improves log filterability and searchability in observability storage. (#14899)
Removed ~150 redundant or noisy log calls including duplicate error logging after trackException and verbose in-memory storage CRUD traces.
Fixed duplicate OpenAI item ID errors when using web search. When OpenAI streams responses with web search citations, it interleaves source chunks with text, causing multiple message parts to share the same item ID. This resulted in 'Duplicate item found' errors on subsequent requests. The fix prevents text flushing on source chunks and merges any existing duplicate parts. (#14908)
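The merge step amounts to collapsing parts that share an item ID into one part. A sketch of that idea (the Part shape is illustrative):

```typescript
type Part = { itemId: string; text: string };

// Merge parts that share the same item ID so each ID appears at most
// once, concatenating their text in arrival order.
function mergeDuplicateParts(parts: Part[]): Part[] {
  const byId = new Map<string, Part>();
  for (const p of parts) {
    const existing = byId.get(p.itemId);
    if (existing) existing.text += p.text; // fold duplicate into the first occurrence
    else byId.set(p.itemId, { ...p });
  }
  return [...byId.values()];
}
```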
feat(memory): add minMessages option to generateTitle config (#14778)
Delay automatic title generation until a minimum number of messages is reached, improving title quality and reducing unnecessary LLM calls.
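The gating logic is simple to sketch. The function name and default threshold below are illustrative, not the actual API:

```typescript
// Generate a title only once the thread has accumulated enough messages
// and no title exists yet; avoids a wasted LLM call on tiny threads.
function shouldGenerateTitle(
  messageCount: number,
  hasTitle: boolean,
  minMessages = 4, // illustrative default
): boolean {
  return !hasTitle && messageCount >= minMessages;
}
```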
Update provider registry and model documentation with latest models and providers (180aaaf)
Streaming traces now end correctly when a model call fails or a request is aborted, so they no longer remain stuck "in progress" in observability tools. (#14661)
Fixed getWorkflowRunById with withNestedWorkflows not returning nested steps for branch sub-workflows. (#14713)
Tools that return objects with circular references no longer crash the agent with "Converting circular structure to JSON". Circular parts are replaced with "[Circular]" and the conversation continues normally. (#14535)
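The guard described above is the standard cycle-detection pattern with a JSON.stringify replacer; a self-contained sketch (not Mastra's internal code):

```typescript
// Replace circular references with "[Circular]" so any tool result
// can be serialized without throwing.
function safeStringify(value: unknown): string {
  const seen = new WeakSet<object>();
  return JSON.stringify(value, (_key, v) => {
    if (typeof v === 'object' && v !== null) {
      if (seen.has(v)) return '[Circular]'; // already visited on this path
      seen.add(v);
    }
    return v;
  });
}
```

Note this marks any repeated object reference, not only true cycles, which is an acceptable trade-off for log-style serialization.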
Fixed crashes when using ModelRouterLanguageModel with AI SDK v6's generateObject() or generateText(). The model router now correctly preserves usage and metadata from underlying models. (#14283)
Agents using structured output no longer fail when workflow tools are present. Setting toolChoice to 'none' now correctly prevents tools from being sent to the provider, fixing errors from providers like Gemini that reject structured output requests when tools are included. (#14466)
Sub-agent tool calls no longer fail when LLMs use query, message, or input instead of prompt during repeated sub-agent calls via custom gateways. These common aliases are now automatically recognized and mapped to prompt when the schema expects it. (#14219)
Fixed an issue where supervisor agent messages were being saved to the sub-agent thread, causing duplicate tool call badges to appear in the chat history when sub-agents are invoked multiple times. (#13881)
Fixed workspace vector indexing silently swallowing embedder and search engine errors during auto-indexing. File-read errors (binary files, invalid UTF-8) are still skipped, but indexing failures are now logged as warnings instead of being silently ignored. (#14786)
Fixed incorrect type cast for sub-agent context messages. The context option for new API methods (generate, stream, resumeGenerate, resumeStream) now correctly casts to ModelMessage[] instead of CoreMessage[]. (#14895)
Added version-aware code-agent lookup and override version lifecycle support. (#14776)
Mastra.getAgent(name, version) and Mastra.getAgentById(id, version) can now resolve draft or specific stored override versions when the editor package is configured, and throw a clear error when versioned lookup is requested without the editor.
client.getAgent(id, version) now carries version selection through agent detail and voice metadata requests, and the Agent resource now supports override version management methods including listVersions, createVersion, getVersion, activateVersion, restoreVersion, deleteVersion, and compareVersions.
Agent.createVersion(...) is intentionally limited to code-agent overrideable fields plus version metadata, rather than the full stored-agent configuration surface.
Trajectory evaluation: Added trajectory types and trace-based extraction for evaluating agent and workflow execution paths. (#14697)
TrajectoryStep models each step in an execution as a typed object — tool calls, model generations, agent runs, workflow steps, and control flow nodes each have their own variant with relevant properties (e.g., toolArgs/toolResult for tool calls, modelId/promptTokens for model generations). Steps can be nested via children to represent hierarchical execution.
TrajectoryExpectation lets you define what a good trajectory looks like — expected steps, ordering, step/token/duration budgets, blacklisted tools, and retry thresholds. ExpectedStep provides a simple way to define expected steps by name and optional stepType, with support for nested expectations via children to set different evaluation rules at each level of the hierarchy.
Trace-based extraction: extractTrajectoryFromTrace() builds hierarchical trajectories from observability trace spans. The runEvals pipeline automatically uses this when storage is configured, capturing the full execution tree including nested agent runs and tool calls. Falls back to extractTrajectory (agents) or extractWorkflowTrajectory (workflows) when storage is unavailable.
Pipeline: expectedTrajectory flows from dataset items through runEvals to trajectory scorers. Added a trajectory key to both AgentScorerConfig and WorkflowScorerConfig.
Update provider registry and model documentation with latest models and providers (dc514a8)
Persist observational memory threshold settings across restarts and restore per-thread overrides. (#14788)
Fixed title generation blocking stream completion. The generateTitle LLM call now runs in the background instead of blocking the stream from closing, removing the 2-3 second post-response delay in the UI when memory is enabled. (#14757)
feat(memory): add recall-tool history retrieval for agents using observational memory (#14567)
Agents that use observational memory can now use the recall tool to retrieve history from past conversations, including raw messages, thread listings, and indexed observation-group memories.
Enable observational-memory retrieval when listing tools:
const tools = await memory.listTools({
  threadId: 'thread_123',
  resourceId: 'resource_abc',
  observationalMemory: {
    retrieval: { vector: true, scope: 'resource' }, // scope value illustrative
  },
});
With retrieval enabled, recall can browse the current thread, list threads for the current resource, and search indexed observation groups with source ranges.
Added public score and feedback analytics APIs to observability storage: (#14861)
getScoreAggregate / getFeedbackAggregate for counts, sums, averages, minimums, maximums, or latest values;
getScoreBreakdown / getFeedbackBreakdown for grouped results by dimension;
getScoreTimeSeries / getFeedbackTimeSeries for time-bucketed trends;
and getScorePercentiles / getFeedbackPercentiles for percentile series such as p50 and p95.
await observability.getScoreTimeSeries({
  scorerId: 'relevance',
  interval: '1h',
  aggregation: 'avg',
});
// returns time-bucketed average scores
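For reference, a p50/p95-style value like those returned by getScorePercentiles can be computed with a nearest-rank percentile; this is an illustrative sketch, not the storage implementation:

```typescript
// Nearest-rank percentile: sort ascending, take the value at
// rank = ceil(p/100 * n).
function percentile(values: number[], p: number): number {
  if (values.length === 0) throw new Error('empty series');
  const sorted = [...values].sort((a, b) => a - b);
  const rank = Math.ceil((p / 100) * sorted.length);
  return sorted[Math.max(0, rank - 1)];
}
```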
Added support for attaching scorers to datasets. Scorers attached to a dataset automatically run when an experiment is triggered, alongside any scorers specified at trigger time. New scorerIds field on DatasetRecord, CreateDatasetInput, and UpdateDatasetInput types. (#14783)
Added new observability entrypoint APIs for persisted traces. You can now call mastra.observability.getRecordedTrace({ traceId }) to load a recorded trace, and use optional top-level mastra.observability.addScore()/addFeedback() helpers to annotate a persisted trace by ID. (#14842)
Align observability signal contracts around first-class trace and span fields. (#14838)
Improved observability signal consistency
Logs, metrics, scores, and feedback now carry traceId and spanId directly on each signal. Shared correlation metadata stays in correlationContext.
Added clearer provenance fields
Score and feedback payloads now support scoreSource, feedbackSource, and executionSource for clearer source tracking.
Migration note
Deprecated fields (like source and feedback userId) are still accepted for compatibility.
Fixed agent run traces not appearing in Datadog and other observability backends when LLM calls fail. Previously, an API error during streaming would leave the root AGENT_RUN span open indefinitely, causing the entire trace tree to be silently dropped by exporters that wait for the root span to close. Failed agent runs now correctly end the span with error information, making failures visible in your observability dashboard. (#14850)
Fixed mcpOptions (including serverless: true) being silently ignored when using the Mastra deployer. The deployer now forwards mcpOptions from your server config to the underlying MastraServer, so MCP stateless mode works correctly in serverless environments like Cloudflare Workers, Vercel Edge, and AWS Lambda. (#14810) (#14812)
What changed:
Added mcpOptions to the ServerConfig type so it can be set in new Mastra({ server: { ... } }), and the deployer now passes server.mcpOptions through to MastraServer.
Example:
const mastra = new Mastra({
  server: {
    mcpOptions: {
      serverless: true,
    },
  },
});
Added an lsp_inspect tool for LSP-based code inspection with hover, definition, and implementation queries. (#14565)
Added disableBuiltinTools to HarnessConfig so you can disable specific built-in harness tools. (#14227)
Example:
new Harness({ disableBuiltinTools: ['submit_plan', 'subagent'] });
Added SkillSearchProcessor for on-demand skill discovery. Instead of injecting all skill metadata upfront, agents get search_skills and load_skill meta-tools to find and load skills on demand with thread-scoped state and TTL cleanup. (#14596)
Example
import { SkillSearchProcessor } from '@mastra/core/processors';

const skillSearch = new SkillSearchProcessor({
  workspace,
  search: { topK: 5 },
});

const agent = new Agent({
  workspace,
  inputProcessors: [skillSearch],
});
Added resolvedVersionId to agent run trace span attributes for tracking which agent version was used during execution. (#14847)
Limit dynamically injected AGENTS.md reminders to 1000 estimated tokens by default and tell mastracode observational memory to ignore those ephemeral reminder messages. (#14790)
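A cap like this can be approximated with a character-based token estimate; the ~4-characters-per-token heuristic and function name below are assumptions for illustration:

```typescript
// Truncate injected reminder text to an estimated token budget,
// using a rough 4-chars-per-token heuristic.
function capReminder(text: string, maxTokens = 1000): string {
  const maxChars = maxTokens * 4;
  return text.length <= maxChars ? text : text.slice(0, maxChars);
}
```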
Fixed missing TRequestContext type parameter on DynamicArgument fields in AgentConfig. Previously, only instructions and tools correctly propagated the requestContextSchema type to their dynamic function callbacks. Now all dynamic fields — model, workflows, workspace, agents, memory, scorers, defaultGenerateOptionsLegacy, defaultStreamOptionsLegacy, defaultOptions, defaultNetworkOptions, inputProcessors, and outputProcessors — properly type requestContext based on the agent's requestContextSchema. (#14582)
Before:
const agent = new Agent({
  requestContextSchema: z.object({ userId: z.string() }),
  workspace: ({ requestContext }) => {
    requestContext.get('userId'); // typed as `unknown`
  },
});
After:
const agent = new Agent({
  requestContextSchema: z.object({ userId: z.string() }),
  workspace: ({ requestContext }) => {
    requestContext.get('userId'); // typed as `string`
  },
});
Fixed resuming suspended tool calls with resumeStream or approveToolCall failing with a TripWire when input processors (e.g. TokenLimiterProcessor) are enabled on the agent. (#14561)
Fixed Harness.listThreads() so callers can request threads across all resources. (#14690)
Fixed streaming delegation to propagate output processor modifications to the supervisor. Previously, when a sub-agent had an output processor that modified text via processOutputResult, the supervisor received the raw LLM output instead of the processed text. The processed text was only saved to the sub-agent's memory. Now the supervisor correctly receives the output-processor-modified text from delegated sub-agents in the streaming path. (#14731)
Fixed Harness stateSchema typing to accept Zod schemas with .default(), .optional(), and .transform() modifiers. Previously, these modifiers caused TypeScript errors because the type system forced schema Input and Output types to be identical. Now stateSchema correctly accepts any schema regardless of input type divergence. (#14606)
Added getReviewSummary() to experiments storage for aggregating review status counts. (#14649)
Query experiment results grouped by experiment ID, returning counts of needs-review, reviewed, and complete items in a single query instead of fetching all results client-side.
const summary = await storage.experiments.getReviewSummary();
// [{ experimentId: 'exp-1', needsReview: 3, reviewed: 5, complete: 2, total: 10 }, ...]
Added isValidationError type guard for the ValidationError interface (#14853)
Fixed models.dev provider URLs to interpolate environment variable placeholders like ${ACCOUNT_ID} before creating the underlying provider client. (#14722)
Fixed tool input validation failures not producing observability spans. When input schema validation failed, no TOOL_CALL span was created because span creation happened inside the execution function that ran after validation. Moved span creation before input validation so validation errors are now captured in spans and visible in observability backends like Datadog. (#14677)
Fixed MODEL_GENERATION and AGENT_RUN spans not reflecting model, provider, parameters, and availableTools overrides from input processors. Traces in Langfuse and other exporters now show the correct model info when a processor dynamically switches models. (#14705)
Fixed MODEL_GENERATION observability span to include all system messages (tagged and untagged). Previously, working memory and semantic recall instructions were missing from trace inputs because only untagged system messages were captured. (#14800)
Fixed models.dev auth env selection to prefer auth credentials over URL path identifiers, so Cloudflare Workers AI no longer uses the account ID for authentication. (#14687)
Fixed processInputStep always receiving an empty steps array. Processors can now inspect previous step results (tool calls, LLM responses) when running inside the agentic loop. (#14821)
Configurable weights: Add weights option to createTrajectoryScorerCode for controlling how dimension scores are combined. Defaults to { accuracy: 0.4, efficiency: 0.3, toolFailures: 0.2, blacklist: 0.1 }. (#14740)
const scorer = createTrajectoryScorerCode({
  defaults: { steps: [{ name: 'search' }], maxSteps: 5 },
  weights: { accuracy: 0.6, efficiency: 0.2, toolFailures: 0.1, blacklist: 0.1 },
});
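The combination itself is a normalized weighted sum of the per-dimension scores; a sketch under the assumption that each dimension score is already in [0, 1] (not the scorer's actual internals):

```typescript
type Dimensions = {
  accuracy: number;
  efficiency: number;
  toolFailures: number;
  blacklist: number;
};

// Combine per-dimension scores into a single score using the configured
// weights, normalizing by the weight total so weights need not sum to 1.
function combineScores(scores: Dimensions, weights: Dimensions): number {
  const total =
    weights.accuracy + weights.efficiency + weights.toolFailures + weights.blacklist;
  return (
    (scores.accuracy * weights.accuracy +
      scores.efficiency * weights.efficiency +
      scores.toolFailures * weights.toolFailures +
      scores.blacklist * weights.blacklist) /
    total
  );
}
```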
ExpectedStep redesign: ExpectedStep is now a discriminated union mirroring TrajectoryStep. When you specify a stepType, you get autocomplete for that variant's fields (e.g., toolArgs for tool_call, modelId for model_generation). The old data: Record<string, unknown> field is replaced by direct variant fields.
// Before: { name: 'search', stepType: 'tool_call', data: { input: { query: 'weather' } } }
// After:
{ name: 'search', stepType: 'tool_call', toolArgs: { query: 'weather' } }
Remove compareStepData: The compareStepData option is removed from compareTrajectories, TrajectoryExpectation, and all scorers. Data fields are now auto-compared when present on expected steps — if you specify toolArgs on an ExpectedStep, it will be compared against the actual step. If you omit it, only name and stepType are matched.
Also fixes documentation inaccuracies in trajectory-accuracy.mdx and scorer-utils.mdx.
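The auto-compare rule described above amounts to a recursive subset match: fields present on the expected step must match the actual step, and omitted fields are ignored. An illustrative sketch:

```typescript
// Check that every field present on `expected` matches the corresponding
// field on `actual`, recursing into nested objects; fields absent from
// `expected` are not compared at all.
function subsetMatches(expected: unknown, actual: unknown): boolean {
  if (typeof expected !== 'object' || expected === null) {
    return expected === actual; // leaf values compare directly
  }
  if (typeof actual !== 'object' || actual === null) return false;
  return Object.entries(expected as Record<string, unknown>).every(([k, v]) =>
    subsetMatches(v, (actual as Record<string, unknown>)[k]),
  );
}
```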