Open-source RAG alternative for building AI apps over multimodal…
Morphik is an open-source multimodal end to end RAG service. Developers use Morphik to ingest any data, create knowledge graphs, and retrieve relevant information from private data.
Hey YC! 👋 We’re Adi and Arnav, co-founders of Morphik (https://morphik.ai).
TL;DR
Problem: Knowledge workers spend a lot of their time searching complex documentation with diagrams, tables, or spreadsheets, just to find relevant information. Text‑only RAG misses details, hallucinates, and provides half-baked responses.
Solution: Morphik embeds whole pages (image + text, no OCR), auto‑builds a knowledge graph, and serves it through a UI, SDK, or REST API. We have 90 % accuracy on arXiv QA (and we can fine-tune for your use case 🙂).
RAG breaks the moment documents stop being blobs of text.
Knowledge workers spend a lot of their time (people we spoke with said 50-70% time just in the research phase). eg. electrical engineers finding perfect PCB spec, pharma companies trying to search ELNs for similar experiments.
80% of enterprise knowledge lives in diagrams, tables, images, CAD, scanned pdfs, not .txt.
Teams glue together OCR, vector DBs, and regex‑heavy ETL that becomes a brittle monstrosity.
Solution
Directly embedding and search over page images
When you ingest in Morphik, in addition to OCR, we create direct embeddings for the image of a page (or frames in a video). When you ask “Colpali comparison to standard RAG”, the LLM sees the actual page image, so multimodal models can reason over arrows, labels, scales, and colours just like a human.
Rules Engine
When ingesting, you can define rules for data extraction (which would again consider tables, diagrams), or define natural language rules for PII redaction, or content transformation.
Deployment ready, scalable
Morphik comes built in with folder scoping, end user scoping, GPU accelerations, and fine grained permissions, so you can prototype and deploy with enterprise grade features. It’s end‑to‑end, so you don’t scramble to make duct tape fixes.
One more thing…Morphik Research Agent
Our research agent uses Morphik’s retrieval, graph, and extraction APIs as tools. It chains them automatically, jumping from doc to doc, node to node, seeing figures, till it finds the needle in the haystack, or has enough context to answer your questions. “Describe the third pin on the USBC21 diagram” will get the diagram, find the pin, jump to the table with pin descriptions, jump to page 13 to “find more details on page 13”.
Why we might be interesting to you
Building an internal ChatGPT over messy manuals? Morphik is the fastest way to ship something users trust.
Developing an AI product? Swap your vector‑DB‑plus‑duct‑tape stack for a single SDK call.