Home
Companies
Cerebrium

Cerebrium

Serverless GPU infrastructure for AI

Cerebrium is a platform to deploy machine learning models to serverless GPUs with sub-5 second cold-start times. Customers typically experience 40% in cost savings when compared to using traditional cloud providers and can scale models to more than 10K requests per minute with minimal engineering overhead. Simply write your code in Python and Cerebrium takes care of all infrastructure and scaling. Cerebrium is being used by companies and engineers from Twilio, Rudderstack, Matterport and many more.

Cerebrium
Founded:2021
Team Size:4
Location:New York

Active Founders

Michael Louis

Michael is a South African entrepreneur that has founded multiple businesses across ML & A.I, Blockchain, Retail, and consumer marketplaces. Previously he was the CTO of OneCart which sold to Walmart/Massmart for an undisclosed sum. Currently, he is a co-founder at Cerebrium, a machine learning platform that makes it easier for businesses to train, deploy and monitor ML models in production.

Michael Louis
Michael Louis
Cerebrium

Jonathan Irwin

Co-founder & CTO of Cerebrium 8+ years experience as a Javascript developer including a lot of time spent in a lead developer role. BComm undergrad and Finance Honours at the University of Cape Town. Data Science at Tilburg University.

Jonathan Irwin
Jonathan Irwin
Cerebrium

Company Launches

TL;DR

Cerebrium is a machine learning framework that makes it easy to train, deploy and monitor ML models. You can:

  • ⚙️Train LLM’s such as FlanT5, GPT-NEOx-20b etc with a few lines of code (beta)
  • 🚀Deploy LLM’s or even custom models using just one line of code to serverless CPU’s/GPU’s.
  • 📈Monitor your ML models with our built in integrations with Arize and Censius.

Hi Everyone! We’re Michael and Jonathan and we are so excited to introduce you to Cerebrium!

The Problem

Implementing machine learning based applications in a business remains challenging. You have multiple infrastructure concerns, such as expensive GPU’s, scaling issues, fragmented tooling while additionally also struggle to find machine learning talent and keep up with the latest research and techniques. These barriers to entry prohibit smaller companies from implementing machine learning into their business sooner.

🚀 The Solution

Cerebrium helps teams easily train, deploy and monitor machine learning models in production. Using our framework, you can train and deploy models (custom built or LLM’s) to serverless CPU’s/GPU’s using one line of code. You are only charged for inference time and we automatically handle the scaling and the versioning of models.

When it comes to training and/or inference, we use the latest research techniques to reduce training and inference times by 20-40%. You just focus on adding value to your use case and we will take care of the rest!

⚙️ How it works

  1. Create a free account here. You get $10 in free serverless GPU credits.
  2. Install our Python library: pip install cerebrium
  3. Write one line of code to deploy your custom model or deploy a pre-built template
  4. Get an endpoint that automatically scales based on traffic
  5. Check out our documentation to see a greater list of examples and features.

🤔 Why we Built Cerebrium

At our previous business, machine learning made a huge impact across various teams but it took months of building, testing and experimenting to get it there. We wanted to make the same sort of impact accessible to companies who don’t have the resources or know-how - and of course make it available in a fraction of the time

🙏 Our Asks

  • Try out our framework! It takes the average user less than 4 minutes to get a model up and running (yes even if its custom) and give us your feedback - its free.
  • Share our framework with people you know who are building ML based applications or are struggling to fine-tune/deploy their ML models.
  • Join our Slack and Discord communities
  • Upvote and share this post!