A simple, scalable workflow for data extraction
Sutro's Python SDK and simple workflow lets you start small with your extraction tasks and scale to millions of requests effortlessly.
import sutro as so
from pydantic import BaseModel
class ReviewClassifier(BaseModel):
sentiment: str
user_reviews = '.
User_reviews.csv
User_reviews-1.csv
User_reviews-2.csv
User_reviews-3.csv
system_prompt = 'Classify the review as positive, neutral, or negative.'
results = so.infer(user_reviews, system_prompt, output_schema=ReviewClassifier)
Progress: 1% | 1/514,879 | Input tokens processed: 0.41m, Tokens generated: 591k
█░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
Rapidly Prototype
Start small and iterate fast on your LLM batch workflows. Accelerate experiments by testing your extraction logic on Sutro before committing to large jobs.
Scale Effortlessly
Scale your extraction workflows so your team can do more in less time. Process billions of tokens in hours, not days, with no infrastructure headaches.
Integrate Seamlessly
Connect Sutro to your existing LLM workflows. Sutro's Python SDK is compatible with popular data orchestration tools, like Airflow and Dagster.

Extract insights at massive scale
Confidently handle millions of requests, and billions of tokens at a time. Crawl millions of web pages or process large corpuses of free-form text without the pain of managing infrastructure.
Get results faster and reduce costs significantly by parallelizing your LLM calls. Process your unstructured ETL workflows without exploding costs.

Get analytics-ready data faster
Shorten development cycles and get feedback from large batch jobs in minutes. Unlock valuable product insights from thousands of reviews while you brew your morning coffee.