B

Baseten

AI Infrastructure, Machine Learning Inference

Customer SupportAIML InferenceServerlessModel Deployment
Function:Product & Engineering
Subfunction:DevOps / Site Reliability (SRE)
Loading versions...
Founded
2019
Employees
137
Funding
~$2.085B total; Series F $1.5B at up to $13B valuation (Jun 2026)
Stage
Series D \- $2.15B valuation
Report version: Oct 23, 2025

1. Products/Services & Features

  • Main Offerings:

    • Baseten Model APIs - Deploy and serve open-source and custom ML models with OpenAI-compatible endpoints
    • Baseten Inference Stack - Serverless inference platform with autoscaling, multi-cloud support, and performance optimization
    • Baseten Training - Model training and customization capabilities for machine learning workflows
  • Feature Breakdown: Model deployment and versioning, autoscaling with scale-to-zero, multi-cloud infrastructure (AWS, GCP, Azure), GPU/CPU instance selection, real-time monitoring and logging, Chains SDK for multi-step workflows, Truss CLI for model packaging, OpenAI-compatible APIs, background task automation, 99.99% uptime SLA (Departments: Engineering, Product, Sales, Marketing, Customer Success, Operations)

  • Business Industry Gearing: Horizontal - serves enterprises and startups across healthcare, finance, software, media, and other sectors

2. Security & Compliance

  • Certifications: SOC 2 Type II Certified, GDPR Compliant, HIPAA Compliant

  • Vendors/Tools: PCI-DSS compliant payment processors, independent CPA audit firms (Sensiba San Filippo LLP)

  • Risk Profile:

    • Breaches: No known public security breaches reported
    • Features: Audit trails, access controls, privilege management, data encryption, container/workload isolation, network access controls, regular penetration testing, secure code review and deployment

3. User Feedback & Adoption

  • Aggregated Reviews: No verified user reviews available on G2 or Capterra as of October 2025

    • Pros: High performance inference (2x faster than competitors on some models), cost-efficient infrastructure, developer-friendly APIs, rapid deployment capabilities, strong security and compliance posture
    • Cons: Limited public user reviews available, relatively newer platform compared to established cloud providers, pricing transparency could be improved
  • Adoption Insights:

    • Adoption Ease: High - Low-code/no-code deployment, pre-packaged models, OpenAI-compatible APIs, minimal DevOps required
    • Adoption Cultural Fit: Best fit for AI-native companies, startups, and enterprises building production AI applications; strong appeal to developers and ML engineers
  • Metrics: No public NPS or churn data available; strong customer retention indicated by rapid growth and customer expansion

  • Barriers: Requires understanding of ML model deployment, potential learning curve for non-technical users, vendor lock-in considerations

4. Monetization & Business Model

  • Revenue Model: Usage-based (pay-as-you-go) SaaS with enterprise custom pricing options

  • Pricing: Startup (free tier with credits), Pro (volume discounts), Enterprise (custom pricing) (Sources: https://www.baseten.co/pricing/)

  • Market Context:

    • TAM: AI/ML inference infrastructure market estimated at multi-billion dollar TAM, growing rapidly with AI adoption
    • Growth Stage: High-growth stage - AI inference infrastructure is one of the fastest-growing sectors in enterprise software

5. Leadership & Recent Developments

Name Description LinkedIn X Account
Tuhin Srivastava CEO and Co-founder, leads company strategy and operations; prior experience at Google in scalable AI infrastructure https://www.linkedin.com/in/tuhin-srivastava-05a90413/
Amir Haghighat CTO and Co-founder, technical leader overseeing engineering and technology; Ph.D. from MIT, prior experience at AWS and Clover Health https://www.linkedin.com/in/amir-haghighat/
Philip Howes Co-founder focused on product development; prior experience at Shape and AI startups https://www.linkedin.com/in/philip-howes/
  • Key Metrics Update:

    • Funding: Series D: $150M at $2.15B valuation (September 2025), led by BOND with participation from CapitalG, Premji, IVP, Spark, Greylock
    • Employee Growth: Grew from ~40-50 employees to 137 on LinkedIn (2025); significant hiring across engineering, GTM, and support
  • News/Trends:

    • News Launch: Launched Baseten Model APIs and Baseten Training in 2025; expanded Inference Stack capabilities for multi-model workflows
    • News Partnerships: Partnerships with LangChain, NVIDIA (Dynamo integration), Oracle Cloud, integrations with major open-source models (Llama, DeepSeek)
    • News Funding: Series D $150M funding round (September 2025) at $2.15B valuation; total raised exceeds $285M
    • News Challenges: Scaling infrastructure to meet surging inference demand, maintaining cost-performance leadership, expanding developer experience

6. Target Audience & Use Cases

  • Target Market: Enterprises and AI-native startups building production AI applications; developers and ML engineers

  • Target Users & Personas: ML Platform Engineers, Data Scientists, Product Managers, Startup Founders, Enterprise IT/Compliance Leads, Application Developers

  • User Experience Level: Intermediate to Advanced - requires some ML/infrastructure knowledge; designed to reduce DevOps complexity

  • Key Use Cases:

    • Healthcare AI - OpenEvidence uses Baseten for high-volume, reliable AI-driven medical information delivery to healthcare providers
    • Content Generation - Writer uses Baseten for scalable inference to power AI writing and content tools
    • Research and Enrichment - Clay uses Baseten as foundational infrastructure for AI-powered research and enrichment agents

7. Impact & Recommendations

  • Measurable Outcomes:

    • Workflow Improvements: Reduces time-to-production for ML models, eliminates infrastructure management overhead, enables rapid iteration and scaling, provides cost optimization through autoscaling
    • ROI Examples: 225% better cost-efficiency in inference workloads compared to alternatives; 50% lower latency and 60%+ higher throughput with NVIDIA Dynamo integration; <10 second cold-start times
  • Fit Assessment: Excellent fit for technical support use case - Baseten Agents provides infrastructure for deploying AI agents that can handle technical support tasks with high reliability and performance

  • Custom Rec Flags:

    • Priority ICP: AI-native startups ($1M-$100M ARR), mid-market enterprises ($100M-$1B revenue) building AI products, healthcare and financial services companies with compliance requirements
    • Short Term Goals: Expand Model APIs adoption, grow enterprise customer base, enhance developer tooling, scale infrastructure reliability

8. Data Sourcing Notes

Need help evaluating and implementing AI tools?

ChiriBrain orchestrates your entire AI stack — connecting tools, teams, and workflows into one governed platform.