From Idea to Millions of Classifications, Simplified
Sutro's purpose-built tools for scalable LLM workflows help you ship faster results without the need for complex infrastructure.
import sutro as so
from pydantic import BaseModel
class ReviewClassifier(BaseModel):
sentiment: str
user_reviews = '.
User_reviews.csv
User_reviews-1.csv
User_reviews-2.csv
User_reviews-3.csv
system_prompt = 'Classify the review as positive, neutral, or negative.'
results = so.infer(user_reviews, system_prompt, output_schema=ReviewClassifier)
Progress: 1% | 1/514,879 | Input tokens processed: 0.41m, Tokens generated: 591k
█░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
Prototype
Start small and iterate fast on your classification workflows. Accelerate experiments by testing on Sutro before committing to large jobs.
Scale
Scale your LLM workflows to process millions of items in hours, not days, with no infrastructure headaches or exploding costs.
Integrate
Seamlessly connect Sutro to your existing LLM workflows. The Python SDK is compatible with popular data orchestration tools, like Airflow and Dagster.

Scale Effortlessly
Confidently handle millions of requests and billions of tokens at a time. Scale your classification workflows so your team can do more in less time, without infrastructure headaches.
Get results faster and significantly reduce costs. Sutro parallelizes your LLM calls to make large-scale classification jobs affordable and efficient.

Rapidly Prototype
Shorten development cycles by getting feedback from large batch jobs in minutes. Accelerate experiments by testing on Sutro before committing to large jobs.