Opik Description

With a suite of observability tools, Opik lets you confidently evaluate, test, and ship LLM apps across your development and production lifecycle. Log traces and spans in both development and production. Define and compute evaluation metrics, score LLM outputs, and compare performance between app versions. Record, sort, find, and understand every step that your LLM app takes to generate a result. You can manually annotate and compare LLM results in a table. Run experiments using different prompts, and evaluate them against a test collection. You can choose and run preconfigured evaluation metrics, or create your own using the SDK. Use the built-in LLM judges for complex issues such as hallucination detection, factuality, and moderation. Opik LLM unit tests, built on PyTest, provide reliable performance baselines. Build comprehensive test suites for every deployment to evaluate your entire LLM pipeline.

Pricing

Pricing Starts At:
$39 per month
Free Version:
Yes
Free Trial:
Yes

Integrations

API:
Yes, Opik has an API

Reviews - 1 Verified Review


Company Details

Company:
Comet
Year Founded:
2017
Headquarters:
United States
Website:
www.comet.com/site/products/opik/

Product Details

Platforms:
Web-Based, Windows, Mac, Linux, On-Premises
Types of Training:
Training Docs, Live Training (Online), Webinars, In Person, Training Videos
Customer Support:
Business Hours, Live Rep (24/7), Online Support

Opik User Reviews

  • Name: Anonymous (Verified)
    Job Title: Principal Software Engineer
    Length of product use: Less than 6 months
    Used How Often?: Daily
    Role: User, Deployment
    Organization Size: 20,000 or More

    Excellent OSS Evaluation tool

    Date: Apr 03 2025

    Summary: Highly recommended. Great features, with support for all LLM providers, scalability to high trace loads, and a roadmap that's moving super fast.

    Positive: My team switched to Opik from Arize about 4 months ago. We evaluated Arize, Langfuse, Opik, and Langsmith. Overall, Opik was the best platform. Phoenix OSS doesn't have half the features, Langsmith is nice but super expensive and not OSS, and Langfuse is brittle and has tons of performance issues. We found one bug in Opik, opened a PR on the GH repo, and it was fixed and merged in less than 5 hours.

    Negative: Personally I think they can make the UI a bit prettier.
