From Raw Data to Rich Features, Simplified
Sutro takes the pain away from testing and scaling LLM batch jobs, letting you focus on building better models.
import sutro as so
from pydantic import BaseModel
class ReviewClassifier(BaseModel):
sentiment: str
user_reviews = '.
User_reviews.csv
User_reviews-1.csv
User_reviews-2.csv
User_reviews-3.csv
system_prompt = 'Classify the review as positive, neutral, or negative.'
results = so.infer(user_reviews, system_prompt, output_schema=ReviewClassifier)
Progress: 1% | 1/514,879 | Input tokens processed: 0.41m, Tokens generated: 591k
█░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
Prototype
Start small and iterate fast on your feature engineering workflows. Accelerate experiments by testing on Sutro before committing to large jobs.
Scale
Scale your LLM workflows to process billions of tokens in hours, not days, with no infrastructure headaches or exploding costs.
Integrate
Seamlessly connect Sutro to your existing LLM workflows. Sutro's Python SDK is compatible with popular data orchestration tools, like Airflow and Dagster.

Scale your feature generation effortlessly
Confidently handle millions of requests and billions of tokens at a time. Process your entire corpus of unstructured data without the pain of managing infrastructure.
Get results faster and reduce costs by 10x or more. Sutro parallelizes your LLM calls to transform data more efficiently than running individual requests.

Shorten development cycles
Accelerate experiments by getting feedback from large batch jobs in minutes before scaling up. Rapidly prototype new features to improve your models faster.