From messy documents to structured tags, simplified
Sutro's Python SDK simplifies the entire process of tagging large document sets. Connect to your data and let Sutro handle the complexity of scaling your LLM workflows.
import sutro as so
from pydantic import BaseModel
class ReviewClassifier(BaseModel):
sentiment: str
user_reviews = '.
User_reviews.csv
User_reviews-1.csv
User_reviews-2.csv
User_reviews-3.csv
system_prompt = 'Classify the review as positive, neutral, or negative.'
results = so.infer(user_reviews, system_prompt, output_schema=ReviewClassifier)
Progress: 1% | 1/514,879 | Input tokens processed: 0.41m, Tokens generated: 591k
█░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
Prototype
Start small and iterate fast on your document tagging workflows. Accelerate experiments by testing on Sutro before committing to large jobs.
Scale
Scale your tagging workflows so your team can do more in less time. Process billions of tokens in hours, not days, with no infrastructure headaches.
Integrate
Seamlessly connect Sutro to your existing LLM workflows. Sutro's Python SDK is compatible with popular data orchestration tools, like Airflow and Dagster.

Scale your organization effortlessly
Confidently handle millions of documents and billions of tokens at a time. Sutro removes the pain of managing infrastructure so you can focus on results.
Get results faster and reduce costs by parallelizing your LLM calls through Sutro. Process massive datasets without exploding costs.

Shorten development cycles
Get feedback from large batch jobs in as little as minutes before scaling up. Rapidly prototype and iterate on your document tagging workflows.