Changelog

New features, improvements, and fixes in Agenta.

18 April 2025 v0.42.1

We are SOC 2 Type 2 Certified

Agenta is now SOC 2 Type 2 certified. An independent third party has audited our platform and attested that its security and compliance controls meet the SOC 2 standard.

15 April 2025 v0.42.0

Structured Output Support in the Playground

The playground now supports structured outputs. You can define the expected output format and validate the model's output against it.

With Agenta's playground, implementing structured outputs is straightforward:

Open any prompt
Switch the Response format dropdown from text to JSON mode or JSON Schema
Paste or write your schema (Agenta supports the full JSON Schema specification)
Run the prompt - the response panel will show the response as formatted JSON
Commit the changes - the schema will be saved with your prompt, so when your SDK fetches the prompt, it will include the schema information

Check out the blog post for more detail: https://agenta.ai/blog/structured-outputs-playground
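
As an illustration of the kind of schema you might paste in, here is a minimal sketch in Python. The schema, its field names, and the `check_response` helper are hypothetical and not part of Agenta. The helper covers only a small subset of JSON Schema (required fields, basic types, enums), so a full validator library would be the better choice in practice.

```python
import json

# Hypothetical schema for illustration; the field names are assumptions.
SENTIMENT_SCHEMA = {
    "type": "object",
    "properties": {
        "sentiment": {"type": "string", "enum": ["positive", "neutral", "negative"]},
        "confidence": {"type": "number"},
    },
    "required": ["sentiment", "confidence"],
    "additionalProperties": False,
}

# Maps JSON Schema type names to the Python types json.loads produces.
_JSON_TYPES = {
    "string": str,
    "number": (int, float),
    "boolean": bool,
    "object": dict,
    "array": list,
}


def check_response(raw: str, schema: dict) -> list[str]:
    """Return a list of problems; an empty list means this partial check passed."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top-level value is not an object"]

    problems = []
    props = schema.get("properties", {})
    for key in schema.get("required", []):
        if key not in data:
            problems.append(f"missing required field: {key}")
    for key, value in data.items():
        if key not in props:
            if schema.get("additionalProperties") is False:
                problems.append(f"unexpected field: {key}")
            continue
        spec = props[key]
        expected = _JSON_TYPES.get(spec.get("type"))
        # bool is a subclass of int in Python, so reject it explicitly for numbers.
        is_bool_as_number = spec.get("type") == "number" and isinstance(value, bool)
        if expected and (not isinstance(value, expected) or is_bool_as_number):
            problems.append(f"wrong type for {key}")
        elif "enum" in spec and value not in spec["enum"]:
            problems.append(f"value of {key} not in enum")
    return problems
```

Calling `check_response(model_output, SENTIMENT_SCHEMA)` on a response string returns an empty list when the output has the expected shape, which makes it easy to assert on in a test or log in an evaluation script.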

7 April 2025 v0.38.0

New Feature: Prompt and Deployment Registry

We've introduced the Prompt and Deployment Registry, giving you a centralized place to manage all variants and versions of your prompts and deployments.

Key capabilities:

View all variants and revisions in a single table
Access all commits made to a variant
Use older versions of variants directly in the playground

Learn more in our blog post.

Bug Fixes

Fixed minor UI issues with dots in sidebar menu
Fixed minor playground UI issues
Fixed the playground resetting the default model name
Fixed project_id issue on testset detail page
Fixed breaking issues with old variants encountered during QA
Fixed variant naming logic

19 March 2025 v0.36.0

Improvements to the Playground and Custom Workflows

We've made several improvements to the playground, including:

Improved scrolling behavior
Increased discoverability of variant creation and comparison
Implemented stop functionality in the playground

Custom workflows now work with sub-routes. This means you can define multiple routes in one file and create multiple custom workflows from that same file.

11 March 2025 v0.35.0

OpenTelemetry Compliance and Custom Workflows from API

We've introduced major improvements to Agenta, focusing on OpenTelemetry compliance and simplified custom workflow debugging.

OpenTelemetry (OTel) Support:

Agenta is now fully OpenTelemetry-compliant. This means you can seamlessly integrate Agenta with thousands of OTel-compatible services using existing SDKs. To integrate your application with Agenta, simply configure an OTel exporter pointing to your Agenta endpoint—no additional setup required.
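
One way to point an existing OTel SDK at an endpoint is through the standard `OTEL_EXPORTER_OTLP_*` environment variables defined by the OpenTelemetry specification. The sketch below builds those variables in Python. The endpoint URL, the `ApiKey` authorization scheme, and the `otlp_env` helper name are assumptions made for illustration; check the Agenta documentation for the actual endpoint and header format.

```python
from urllib.parse import quote


def otlp_env(endpoint: str, api_key: str) -> dict[str, str]:
    """Build standard OTLP exporter environment variables.

    The "ApiKey <key>" authorization scheme is an assumption for this sketch.
    """
    # OTEL_EXPORTER_OTLP_HEADERS is a comma-separated list of key=value pairs,
    # so the header value is percent-encoded to protect spaces and commas.
    auth = quote(f"ApiKey {api_key}", safe="")
    return {
        "OTEL_EXPORTER_OTLP_ENDPOINT": endpoint.rstrip("/"),
        "OTEL_EXPORTER_OTLP_HEADERS": f"Authorization={auth}",
        "OTEL_EXPORTER_OTLP_PROTOCOL": "http/protobuf",
    }


if __name__ == "__main__":
    # Hypothetical endpoint and key; replace with your own values.
    for name, value in otlp_env("https://agenta.example.com/otlp/", "abc123").items():
        print(f"{name}={value}")
```

Applying the result with `os.environ.update(...)` before the OTel SDK initializes lets the SDK's OTLP exporter pick up the settings without further exporter code.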

We've enhanced distributed tracing capabilities to better debug complex distributed agent systems. All HTTP interactions between agents—whether running within Agenta's SDK or externally—are automatically traced, making troubleshooting and monitoring easier.

Detailed instructions and examples are available in our distributed tracing documentation.

Improved Custom Workflows:

Based on your feedback, we've streamlined debugging and running custom workflows:

Run workflows from your environments: You no longer need the Agenta CLI to manage custom workflows. Setting up custom workflows now involves simply adding the Agenta SDK to your code, creating an endpoint, and connecting it to Agenta via the web UI. You can check how it's done in the quick start guide.
Custom Workflows in the new playground: Custom workflows are now fully compatible with the new playground. You can now nest configurations, run side-by-side comparisons, and debug your agents and complex workflows very easily.

4 February 2025 v0.33.0

New Playground

We've rebuilt our playground from scratch to make prompt engineering faster and more intuitive. The old playground took 20 seconds to create a prompt - now it's instant.

Key improvements:

Create prompts with multiple messages using our new template system
Format variables easily with curly bracket syntax and a built-in validator
Switch between chat and completion prompts in one interface
Load test sets directly in the playground to iterate faster
Save successful outputs as test cases with one click
Compare different prompts side-by-side
Deploy changes straight to production
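
To give a feel for curly bracket variables and validation, here is a minimal Python sketch of template rendering. It assumes a double-brace `{{name}}` form, which the list above does not specify, and the `find_variables` and `render` helpers are hypothetical illustrations and not Agenta's implementation.

```python
import re

# Matches {{name}} with optional whitespace inside the braces (assumed syntax).
_VAR = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def find_variables(template: str) -> list[str]:
    """Return the distinct variable names in order of first appearance."""
    seen = []
    for name in _VAR.findall(template):
        if name not in seen:
            seen.append(name)
    return seen


def render(template: str, values: dict[str, str]) -> str:
    """Substitute every variable, failing loudly if any value is missing."""
    missing = [name for name in find_variables(template) if name not in values]
    if missing:
        raise KeyError(f"missing variables: {', '.join(missing)}")
    return _VAR.sub(lambda match: str(values[match.group(1)]), template)
```

Validating before substituting means a test set row with a missing column fails with a clear message instead of sending a prompt that still contains a raw placeholder.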

Developers can now create prompts programmatically through our API.

You can explore these features in our updated playground documentation.

27 January 2025 v0.32.0

Quality of life improvements

Today's small release brings quality of life improvements while we prepare a major release for the coming days:

Added a collapsible side menu for better space management
Enhanced frontend performance and responsiveness
Implemented a confirmation modal when deleting test sets
Improved permission handling across the platform
Improved frontend test coverage

15 January 2025 v0.31.0

Agenta is SOC 2 Type 1 Certified

We've achieved SOC 2 Type 1 certification, validating our security controls for protecting sensitive LLM development data. This certification covers our entire platform, including prompt management, evaluation frameworks, and observability tools.

Key security features and improvements:

Data encryption in transit and at rest
Enhanced access control and authentication
Comprehensive security monitoring
Regular third-party security assessments
Backup and disaster recovery protocols

This certification represents a significant milestone for teams using Agenta in production environments. Whether you're using our open-source platform or cloud offering, you can now build LLM applications with enterprise-grade security confidence.

We've also updated our trust center with detailed information about our security practices and compliance standards. For teams interested in learning more about our security controls or requesting our SOC 2 report, please contact team@agenta.ai.

4 January 2025 v0.30.0

New Onboarding Flow

We've redesigned our platform's onboarding to make getting started simpler and more intuitive. Key improvements include:

Streamlined tracing setup process
Added a demo RAG playground project showcasing custom workflows
Enhanced frontend performance
Fixed scroll behavior in trace view

11 December 2024 v0.29.0

Add Spans to Test Sets

This release introduces the ability to add spans to test sets, making it easier to bootstrap your evaluation data from production. The new feature lets you:

Add individual or batch spans to test sets
Create custom mappings between spans and test sets
Preview test set changes before committing them
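
Conceptually, a mapping between spans and test sets pairs each test set column with a location inside the span. The Python sketch below shows that idea with dotted paths. The span layout (`data.inputs`, `data.outputs`), the column names, and both helper functions are hypothetical and may not match Agenta's actual span format.

```python
from typing import Any


def get_path(obj: dict, path: str) -> Any:
    """Follow a dotted path such as "data.inputs.question"; None if absent."""
    current: Any = obj
    for part in path.split("."):
        if not isinstance(current, dict) or part not in current:
            return None
        current = current[part]
    return current


def spans_to_rows(spans: list[dict], mapping: dict[str, str]) -> list[dict]:
    """Build one test set row per span, using {column name: span path}."""
    return [
        {column: get_path(span, path) for column, path in mapping.items()}
        for span in spans
    ]


if __name__ == "__main__":
    # Hypothetical span shape, for illustration only.
    spans = [{"data": {"inputs": {"question": "What is OTel?"}, "outputs": "A tracing standard."}}]
    mapping = {"question": "data.inputs.question", "correct_answer": "data.outputs"}
    print(spans_to_rows(spans, mapping))
```

Returning `None` for a missing path keeps a batch of mixed spans from failing midway, and the resulting gaps are easy to spot when previewing the rows.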

Additional improvements:

Fixed CSV test set upload issues
Prevented viewing of incomplete evaluations
Added mobile compatibility warning
Added support for custom ports in self-hosted installations