GPT Router - API gateway for LLMs

Automatically route between 30+ LLMs, Vision and Image Models based on latency and uptime.

TL;DR: GPTRouter is an open-source LLM API gateway that offers a universal API for 30+ LLMs, vision, and image models, with smart fallbacks based on uptime and latency, automatic retries, and streaming. Stay operational even when OpenAI is down. Try it out today.

We are open-sourcing GPTRouter, an LLMOps tool we have been using internally at Writesonic for handling millions of monthly requests for our users.

✅ Universal API for 30+ LLMs, Vision and Image Models

✅ Smart Fallbacks based on latency and uptime

✅ Automatic Retries

✅ Supports streaming

Why we built it

Since putting OpenAI's GPT-3 into production in 2020, we at Writesonic have served millions of users and run into the typical scaling pains of generative AI models:

1. Dependency on a single model risked total downtime.

2. Latency issues with models like GPT-4 affected user experience.

3. Integrating various models was tough due to different APIs and SDKs.

🌟 Early this year at Writesonic, we set out with a clear vision: to become model agnostic.

Faced with single-model limitations and diverse AI challenges, we began building GPTRouter - our bespoke solution to navigate and thrive in a multi-model AI world.

🔗 With GPTRouter's Universal API, you're the master of AI models.

Swap between OpenAI, Azure, Anthropic, Replicate, Cohere & more with just one line of code.

Because every provider sits behind the same API, adding or replacing a model no longer means learning another SDK.
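To make the "one line of code" idea concrete, here is a minimal sketch of a provider-agnostic client. It is not GPTRouter's actual API: `UniversalClient`, `generate`, and the `fake_*` adapters are hypothetical names, and the adapters are stubs that echo the prompt instead of calling a real provider.

```python
from dataclasses import dataclass
from typing import Callable, Dict


# Hypothetical adapters: each hides one provider behind the same signature.
# Real adapters would call the vendor SDK; these stubs only echo the prompt.
def fake_openai(prompt: str) -> str:
    return f"[openai] {prompt}"


def fake_anthropic(prompt: str) -> str:
    return f"[anthropic] {prompt}"


@dataclass
class UniversalClient:
    """Illustrative only: one call signature for every registered provider."""

    adapters: Dict[str, Callable[[str], str]]

    def generate(self, provider: str, prompt: str) -> str:
        if provider not in self.adapters:
            raise ValueError(f"unknown provider: {provider}")
        return self.adapters[provider](prompt)


client = UniversalClient({"openai": fake_openai, "anthropic": fake_anthropic})

PROVIDER = "openai"  # switching providers means changing only this line
reply = client.generate(PROVIDER, "Hello")
```

The calling code never touches a provider-specific SDK, so the choice of model becomes configuration rather than an integration project.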

🛡️ Downtime isn't an option.

GPTRouter's Smart Fallbacks mean your service is always on.

You can define a hierarchy of models for each use case. GPTRouter continuously monitors uptime, latency, and other factors, and automatically falls back to the next best model with zero interruption.
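The fallback idea can be sketched roughly as follows. This is an illustration under assumptions, not GPTRouter's implementation: `ModelStats`, `pick_and_call`, and the `max_latency_s` threshold are hypothetical, and the health and latency figures are set by hand here, whereas a real gateway would measure them continuously.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class ModelStats:
    """Hypothetical record of what a router knows about one model."""

    name: str
    call: Callable[[str], str]
    healthy: bool = True
    avg_latency_s: float = 0.0


def pick_and_call(
    hierarchy: List[ModelStats], prompt: str, max_latency_s: float = 5.0
) -> Tuple[str, str]:
    """Try models in priority order, skipping unhealthy or slow ones."""
    errors = []
    for model in hierarchy:
        if not model.healthy or model.avg_latency_s > max_latency_s:
            errors.append(f"{model.name}: skipped")
            continue
        try:
            return model.name, model.call(prompt)
        except Exception as exc:
            # Mark the model down so later requests skip it immediately.
            model.healthy = False
            errors.append(f"{model.name}: {exc}")
    raise RuntimeError("all models failed: " + "; ".join(errors))


def primary_down(prompt: str) -> str:
    raise ConnectionError("503 from provider")


def backup(prompt: str) -> str:
    return "ok: " + prompt


hierarchy = [
    ModelStats("primary", primary_down),
    ModelStats("slow", backup, avg_latency_s=9.0),
    ModelStats("backup", backup, avg_latency_s=0.8),
]
used, text = pick_and_call(hierarchy, "hi")
```

In this run the first model raises, the second is skipped for exceeding the latency threshold, and the third serves the request, so the caller still gets a response.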

♻️ Say goodbye to manual retries.

GPTRouter does the heavy lifting with Automatic Retries for failed requests, keeping your AI services sharp and consistent.
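As a rough sketch of what automatic retries involve, the helper below re-runs a failing call with exponential backoff. The post does not describe GPTRouter's retry policy, so the backoff scheme, the `with_retries` name, and its parameters are all assumptions made for illustration.

```python
import time
from typing import Callable, List, TypeVar

T = TypeVar("T")


def with_retries(
    fn: Callable[[], T],
    attempts: int = 3,
    base_delay_s: float = 0.5,
    sleep: Callable[[float], object] = time.sleep,
) -> T:
    """Call fn up to `attempts` times, doubling the wait after each failure."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return fn()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay_s * 2**attempt)
    raise AssertionError("unreachable")


# Stub request that fails twice with a transient error, then succeeds.
calls = {"n": 0}


def flaky() -> str:
    calls["n"] += 1
    if calls["n"] < 3:
        raise TimeoutError("transient")
    return "done"


# Record the delays instead of actually sleeping, to keep the demo instant.
delays: List[float] = []
result = with_retries(flaky, attempts=3, base_delay_s=0.5, sleep=delays.append)
```

A production version would normally retry only errors known to be transient (timeouts, rate limits) and add jitter to the delay; both are left out here for brevity.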

🌟 GPTRouter's Edge:

✅ Universal API for seamless model switching.

✅ Smart, automatic fallbacks for continuous service.

✅ Reduced latencies for quick interactions.

This is just the starting point. We are also working on integrations with LangChain and LlamaIndex.

We will also be open-sourcing our frontend LLMOps layer, which provides a playground to test multiple models in parallel, keep tabs on the latency of each model, and track tokens and costs per model and per user, all in one place.

📈 With GPTRouter, we're not just solving our problems at Writesonic; we're offering a solution to other startups and companies looking to make use of Generative AI in production.

Try it out here: https://github.com/Writesonic/GPTRouter

🙏 Asks

Do try it out and share your feedback at sam at writesonic.com or open an issue on GitHub.