Baseten, an AI inference platform, is closing in on a $1.5 billion funding round that values the company at $13 billion, according to reports. The raise arrives just months after the startup's previous mega-round, underscoring investor appetite for infrastructure plays in the inference layer of AI stacks.
Baseten provides a serverless platform for running large language models and other AI models at scale. The company competes in a crowded space that includes players like Replicate, Modal, and Hugging Face's inference offerings. What sets Baseten apart is its focus on making model serving cheaper and faster for developers and enterprises.
The timing reveals where venture capital sees the next wave of AI value creation. While generative AI training and model development grabbed headlines and funding through 2023 and early 2024, attention has shifted to inference, the process of running trained models in production. That's where startups can build defensible products with recurring revenue.
Baseten raised its previous round in an overheated fundraising environment. That capital allowed the company to invest heavily in its platform, expand its team, and sign enterprise customers. A $1.5 billion raise at $13 billion valuation reflects investor confidence in the inference market's size and the company's execution.
The inference space remains far from solved. Cloud giants like AWS, Google Cloud, and Azure offer inference services, but they lack the specialized focus that pure-play startups provide. Baseten's ability to optimize for cost and latency attracts builders frustrated with general-purpose cloud pricing.
The funding round, if completed as reported, positions Baseten as one of the largest AI infrastructure companies by valuation. It joins a tier of well-funded companies competing to own critical layers of the AI stack. Other inference players have also raised heavily, though few at Baseten's reported valuation.
The raise
