From Unstructured Text to Actionable Contacts
Sutro simplifies the entire process of extracting contact information at scale. Start small, test your extraction logic, and scale to millions of records with ease.
import sutro as so
from pydantic import BaseModel
class ReviewClassifier(BaseModel):
sentiment: str
user_reviews = '.
User_reviews.csv
User_reviews-1.csv
User_reviews-2.csv
User_reviews-3.csv
system_prompt = 'Classify the review as positive, neutral, or negative.'
results = so.infer(user_reviews, system_prompt, output_schema=ReviewClassifier)
Progress: 1% | 1/514,879 | Input tokens processed: 0.41m, Tokens generated: 591k
█░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
Prototype and Iterate Fast
Start small and iterate fast on your contact extraction workflows. Accelerate experiments by testing on Sutro before committing to large jobs.
Scale to Millions of Records
Scale your extraction workflows to process billions of tokens in hours, not days, with no infrastructure headaches or exploding costs.
Integrate with Your Existing Tools
Seamlessly connect Sutro to your existing LLM workflows. Sutro's Python SDK is compatible with popular data orchestration tools, like Airflow and Dagster.

Scale Effortlessly
Confidently handle millions of requests to extract contact info from documents or web pages. Process billions of tokens at a time without the pain of managing infrastructure.
Get results faster and significantly reduce costs. Sutro parallelizes your LLM calls to transform massive amounts of free-form text into analytics-ready datasets.

Get Results in Hours, Not Days
Sutro takes the pain away from testing and scaling LLM batch jobs. Shorten development cycles and process millions of records in hours, not days.