South African AI Startup Cerebrium Raises $8.5M to Scale Serverless Infrastructure For Real-Time AI Applications


Cerebrium, a serverless AI infrastructure platform designed to simplify the development and scaling of multimodal AI applications, has raised $8.5 million in seed funding.
The round was led by Gradient, with participation from Y Combinator, Authentic Ventures and several strategic investors and former operators.
With the new capital, Cerebrium plans to expand its engineering team to keep pace with enterprise demand and accelerate product development. The company aims to introduce new features that improve its platform's capabilities, particularly for real-time AI applications such as digital avatars, which could have large-scale implications for industries such as gaming, entertainment and telehealth.
Founded by Michael Louis and Jonathan Irwin, Cerebrium is a serverless AI infrastructure platform built from the ground up to power the next generation of high-performance AI applications.
Cerebrium emerged from the founders' frustrations while building AI-powered products. “The tools were fragmented, there was a knowledge gap between theory and production, the unit economics made no sense and development cycles took months,” explained CEO Michael Louis. “We built Cerebrium so that engineers can focus on building AI products that users love, with real business impact, without needing a dedicated infrastructure team or incurring massive cloud costs.”
Cerebrium supports applications such as voice AI, real-time digital avatars and healthcare solutions, offering low-latency serverless GPU infrastructure with cold start times under 5 seconds and up to 40% savings compared to traditional cloud providers. From real-time voice bots to multimodal inference pipelines and large-scale batch jobs, the platform makes it radically easier for teams to deploy, scale and operate AI workloads without managing a single server.

The Cerebrium platform powers some of today's most cutting-edge startups, including Tavus, Deepgram and Vapi, among others. It is specifically optimized for high performance in real-time use cases such as voice agents, LLM fine-tuning, video model inference and large-scale data analysis.
Beyond its core serverless GPU infrastructure offering, Cerebrium supports batch jobs, multi-region deployments and large-scale data processing. This lets engineering teams run compute workloads seamlessly with minimal configuration and scale on demand while paying only for what they use. Importantly, the platform also meets enterprise-grade security and data residency requirements, reducing the compliance burden.
The performance of the AI infrastructure platform has won praise from key users. Roey Pazl, ML engineer at Tavus, noted, “We run a range of real-time audio and video models, and performance is everything. Cerebrium consistently delivers the speed and reliability we need without the overhead. Even as we scaled quickly and went viral, they met our compute demands with stability.”

Eylul Kayin, partner at Gradient, added, “What the Cerebrium team has accomplished with such a small group is impressive. They are enabling some of the most advanced voice and video applications to scale. As real-time AI becomes foundational to digital experiences, specialized elastic infrastructure like Cerebrium will be essential.”
Originally founded in Cape Town, South Africa, and now headquartered in New York, Cerebrium plans to use the new capital to build additional features and meet growing enterprise demand. The company's move to the United States reflects its ambition to compete globally, while its South African origins highlight the growing influence of African technology in AI innovation.