Case Study

How Sutro helped SynthLabs generate a 351 billion token synthetic dataset ->

Accelerated Batch Inference for AI Research

Accelerated Batch Inference for AI Research

A platform AI researchers trust for synthetic data generation, scaling RL environments, and evaluating models. Up to 20x faster, 10x cheaper, and zero infrastructure setup.

Approved researchers can access up to $1,000 in free credits

 A Faster Path to Discovery

A Faster Path to Discovery


 A Faster Path to Discovery

Run Experiments Faster

Up to 20x faster results. Purpose-built for massive AI workloads. Get outputs in hours, not weeks.

Run Experiments Faster

Up to 20x faster results. Purpose-built for massive AI workloads. Get outputs in hours, not weeks.

Run Experiments Faster

Up to 20x faster results. Purpose-built for massive AI workloads. Get outputs in hours, not weeks.

Reduce Compute Costs

Take your budget further with up to 90% reductions in inference costs. Our efficient resource allocation makes previously cost-prohibitive experiments feasible.

Reduce Compute Costs

Up to 90% cost reduction. Our efficient resource allocation makes previously cost-prohibitive experiments feasible.

Reduce Compute Costs

Up to 90% cost reduction. Our efficient resource allocation makes previously cost-prohibitive experiments feasible.

Simple SDK, No Infrastructure

Abstract away rate limits, backoffs, and parallelization. Spend your time on research, not infrastructure wrangling.

Simple SDK, No Infrastructure

Abstract away rate limits, backoffs, and parallelization. Replace brittle for-loops with a few lines of code.

Simple SDK, No Infrastructure

Abstract away rate limits, backoffs, and parallelization. Replace brittle for-loops with a few lines of code.

Scale Without Code Changes

From a small sample to millions of inputs with the same code. Built for the experimentation cycle.

Scale Without Code Changes

From a small sample to millions of inputs with the same code. Built for synthetic data generation and agentic simulations.

Scale Without Code Changes

From a small sample to millions of inputs with the same code. Built for synthetic data generation and agentic simulations.

For researchers at leading institutions.

Sutro’s batch inference was enormously helpful for some of my research. They had no problem scaling to my very large workload, and delivered the best service at the lowest price available.

Sutro’s batch inference was enormously helpful for some of my research. They had no problem scaling to my very large workload, and delivered the best service at the lowest price available.



Sutro’s batch inference was enormously helpful for some of my research. They had no problem scaling to my very large workload, and delivered the best service at the lowest price available.

Charlie Snell

Charlie Snell

Charlie Snell

UC Berkley Researcher

UC Berkley Researcher

UC Berkley Researcher

Sutro lets our researchers fire off batch inference—whether it’s a thousand samples or a few billion—through one API call. They don’t have to check cluster queues or negotiate priorities; the job runs immediately with a predictable, fast return-time.

Sutro lets our researchers fire off batch inference—whether it’s a thousand samples or a few billion—through one API call. They don’t have to check cluster queues or negotiate priorities; the job runs immediately with a predictable, fast return-time.



Sutro lets our researchers fire off batch inference—whether it’s a thousand samples or a few billion—through one API call. They don’t have to check cluster queues or negotiate priorities; the job runs immediately with a predictable, fast return-time.

Nathan Lile

Nathan Lile

Nathan Lile

CEO, Synthlabs

CEO, Synthlabs

CEO, Synthlabs

A Simple Workflow For Batch Jobs

Prototype

Test prompts and models on a small sample. Get feedback in minutes.

Scale

Scale

Scale

Run your workflow on millions of data points. Effortlessly process billions of tokens for data generation or evaluation.

Progress: 1% | 1/2.5M Rows

█░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░

Progress: 1% | 1/2.5M Rows

█░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░

Progress: 1% | 1/2.5M Rows

█░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░

Collaborate

Share results with collaborators and teammates, or easily export results to external tools.

Built for Any Research Workload

Synthetic Data Generation

Create high-quality instruction-tuning datasets at scale.

Scale RL Environments

Run high-speed, large-scale model rollouts to continuously improve task-specific model performance.

Large-Scale Model Evals

Rigorously test model performance across millions of data points.

Agentic Simulations

Simulate thousands of interacting agents to test emergent behaviors.

Population and Market Modeling

Run social simulations against massive populations of synthetic respondents and economic agents.

Scientific Modeling

Run large-scale simulations for genomics, climate science, and more.

Built for Any Research Workload

Synthetic Data Generation

Create high-quality instruction-tuning datasets at scale.

Scale RL Environments

Run high-speed, large-scale model rollouts to continuously improve task-specific model performance.

Large-Scale Model Evals

Rigorously test model performance across millions of data points.

Agentic Simulations

Simulate thousands of interacting agents to test emergent behaviors.

Population and Market Modeling

Run social simulations against massive populations of synthetic respondents and economic agents.

Scientific Modeling

Run large-scale simulations for genomics, climate science, and more.

Built for Any Research Workload

Synthetic Data Generation

Create high-quality instruction-tuning datasets at scale.

Scale RL Environments

Run high-speed, large-scale model rollouts to continuously improve task-specific model performance.

Large-Scale Model Evals

Rigorously test model performance across millions of data points.

Agentic Simulations

Simulate thousands of interacting agents to test emergent behaviors.

Population and Market Modeling

Run social simulations against massive populations of synthetic respondents and economic agents.

Scientific Modeling

Run large-scale simulations for genomics, climate science, and more.

FAQ

How is Sutro different from a cloud provider's batch service?

Can I bring my own custom models?

How is my research data secured?

What types of models do you support?

Do you support multimodal models?

How is Sutro different from a cloud provider's batch service?

Can I bring my own custom models?

How is my research data secured?

What types of models do you support?

Do you support multimodal models?

How is Sutro different from a cloud provider's batch service?

Can I bring my own custom models?

How is my research data secured?

What types of models do you support?

Do you support multimodal models?

Accelerate Your Research

A platform AI researchers trust for synthetic data generation, scaling RL environments, and evaluating models. Up to 20x faster, 10x cheaper, and zero infrastructure setup.

A platform AI researchers trust for synthetic data generation, scaling RL environments, and evaluating models. Up to 20x faster, 10x cheaper, and zero infrastructure setup.