Langfuse just got faster →Langfuse just got faster – read about Fast Preview (v4) →

Hiring in Europe and SFLooking for GOATS!

Docs Changelog Pricing

Overview LLM Observability Prompt Management Evaluation Metrics

Changelog Pricing

Get Demo AppSign Up

Docs Integrations Self Hosting Guides AI Engineering Library

All FAQs What are scores in Langfuse and when should I use them?

What Are Scores and When Should I Use Them?

Scores are covered in detail on the Evaluation Concepts page, including:

When to use scores — user feedback, production monitoring, guardrails, experiments
Score types — numeric, categorical, boolean, and text
Score configs — enforce schemas and validate values on ingestion
Scores vs tags — when to use which
Score comments — add context to any score

How to Create Scores

There are four ways to add scores:

LLM-as-a-Judge: Set up automated evaluators that score traces based on custom criteria.
Scores via UI: Team members manually score traces, observations, or sessions in the Langfuse UI.
Annotation queues: Set up structured review workflows where reviewers work through batches of traces.
Scores via API/SDK: Programmatically add scores from your application code — for user feedback, guardrail results, or custom evaluation pipelines.

Was this page helpful?

On this page

What Are Scores and When Should I Use Them?How to Create Scores

Question? Give us feedback →

Contributors

Founding Engineer

Lotte Verheyden

Developer Relations

Product

Observability
Prompt Management
Evaluation
Metrics
Playground
Pricing
Enterprise

Developers

Documentation
Self-Hosting
SDKs
Integrations
API Reference
Status
Talk to Us

Resources

Blog
Changelog
Roadmap
Interactive Demo
Users
AI Engineering Library
Guides & Cookbooks

Company

About Us
Careers
Press
Security
Support
Open Source

© 2022-2026 Langfuse GmbH / Finto Technologies Inc.

Terms Privacy Imprint Cookie Policy