Skip to main content

Build your AI SaaS
this weekend.

The production-ready RAG starter kit for Next.js developers. Ingest PDFs & URLs in seconds, chat with your documents using GPT-4o, and deploy to Vercel in minutes — not weeks.

4.9/5 from 200+ developers
MIT License · Deploy anywhere
Lifetime updates included
pages/api/chat.js
ingest-url.jsvector-store.js
1import { ingestPDF, ingestURL } from '@/lib/rag';
2import { createChat } from '@/lib/chat';
3
4// Ingest any document in seconds
5const vectors = await ingestURL({
6 url: 'https://docs.example.com',
7 namespace: user.id,
8 chunkSize: 1000,
9});
10
11// Stream AI responses instantly
12const { stream } = await createChat({
13 query: userMessage,
14 namespace: user.id,
15 model: 'gpt-4o',
16});

Powered by modern infrastructure

Next.js 16
OpenAI
Pinecone
LangChain.js
Vercel AI SDK
React 19
Tailwind CSS
Puppeteer
Browserless
Next.js 16
OpenAI
Pinecone
LangChain.js
Vercel AI SDK
React 19
Tailwind CSS
Puppeteer
Browserless
Next.js 16
OpenAI
Pinecone
LangChain.js
Vercel AI SDK
React 19
Tailwind CSS
Puppeteer
Browserless

Everything included

Production-ready, out of the box.

Stop spending weeks configuring boilerplate. FastRAG handles the infrastructure so you can focus on what makes your product unique.

Scraping

Web Scraping Engine

Paste any URL — docs site, landing page, or web app. Our Puppeteer + Browserless integration renders JavaScript, scrapes, cleans, and vectorizes content in seconds.

⚡ SPAs & Next.js apps supported
Ingestion

Multi-File PDF Ingestion

Drag & drop up to 10 PDFs simultaneously. Automatic chunking with configurable overlap ensures nothing gets lost — from dense legal contracts to research papers.

📁 10 files at once, up to 10MB each
Vectors

Smart Vector Optimization

Forced 1024-dim embeddings reduce your Pinecone storage costs by 33% versus default settings. Every token counts when you're paying for vector DB at scale.

💰 33% cost reduction on storage
Streaming

Real-Time Streaming Responses

Vercel AI SDK integration means users see answers appearing token by token. No loading spinners, no waiting — just an experience that feels instant and alive.

⚡ Token-by-token streaming via SSE
Multi-Tenant

Namespace Isolation

Each user's documents live in a private Pinecone namespace. Context never bleeds between tenants — your SaaS is enterprise-safe from day one.

🔒 User-scoped vector buckets
Mobile

Mobile-Native Chat UI

The chat interface is built mobile-first and feels native on iOS and Android. Tested on real devices — not just resized desktop layouts.

📱 Responsive on all screen sizes
Developer

No Black Boxes

Full access to every API route under pages/api. Swap the vector DB, change the LLM, tweak prompts — everything is your code. No vendor lock-in.

🔓 100% customizable, MIT license
Architecture

Headless Browser Support

Uses Browserless.io so Puppeteer runs safely in serverless environments. Renders React, Vue, and SPA sites perfectly without timing issues or bot blocking.

🌐 Serverless-safe headless Chrome
Lifetime

Lifetime Updates

Buy once, get every future update. Private GitHub repo access means you pull improvements the day they ship. No new version to purchase.

♾️ Forever access, private repo

Simple setup

From zero to
AI-powered app in four steps.

No ML expertise required. No infrastructure headaches. Just clone, configure, and ship.

Ingest01

Upload your content

Drop in PDFs or paste any URL. Our ingestion engine automatically scrapes, parses, and chunks your content using LangChain's RecursiveCharacterTextSplitter — with sensible defaults you can override.

Next: Embed & store in Pinecone
Step 01
// pages/api/ingest-url.js
const result = await ingestURL({
url: 'https://docs.yourapp.com',
namespace: req.user.id,
chunkSize: 1000,
chunkOverlap: 200,
});
// → "Processed 42 chunks in 3.1s"
Vectorize02

Embed & store in Pinecone

Text chunks are converted to 1024-dimensional embeddings via OpenAI's API, then stored in your Pinecone serverless index. Each user gets their own namespace — clean multi-tenancy from day one.

Next: Answer questions with GPT-4o
Step 02
// lib/vector-store.js
await pinecone
.index('fastrag')
.namespace(userId)
.upsert(
chunks.map((chunk, i) => ({
id: `${docId}-${i}`,
values: embeddings[i],
metadata: { text: chunk },
}))
);
Chat03

Answer questions with GPT-4o

When a user asks a question, FastRAG retrieves the top-K most relevant chunks, constructs a context-rich prompt, and streams the GPT-4o response token by token — no full-page loads, no waiting.

Next: Ship to production in minutes
Step 03
// pages/api/chat.js
const matches = await vectorStore
.similaritySearch(query, {
topK: 5,
namespace: userId,
});
const { stream } = await streamChat({
model: 'gpt-4o',
context: matches,
query,
});
Deploy04

Ship to production in minutes

Push to Vercel with one command. Edge functions handle thousands of concurrent chats with zero cold starts. Add your custom domain, configure your env vars, and you're live.

Step 04
# .env.local
OPENAI_API_KEY=sk-...
PINECONE_API_KEY=pcsk_...
PINECONE_INDEX=fastrag
BROWSERLESS_URL=wss://chrome.browserless.io...
# One-command deploy
$ vercel --prod
# ✅ https://yourapp.vercel.app

Real-world applications

Ship anything.
Built for every industry.

FastRAG is a foundation, not a template. Here's what teams are shipping with it — and what you can build this weekend.

Knowledge Base

Internal Company Wiki

Let employees query your entire Notion, Confluence, or custom docs in plain English. No more hunting through folders — just ask and get an instant, accurate answer.

↓ 60%
Support tickets
↓ 40%
Onboarding time
Customer Support

AI Support Agent

Train on your help docs, FAQs, and product manuals. Answer 80% of tier-1 tickets automatically — 24/7, in any language, without hiring more staff.

↑ 3×
Resolution speed
↑ 22pts
CSAT score
EdTech

AI Course Tutor

Upload lecture notes, textbooks, and syllabi. Students get instant answers from course material instead of waiting for office hours or digging through PDFs.

↑ 55%
Study efficiency
↑ 38%
Comprehension
Legal

Contract & Policy Q&A

Upload NDAs, contracts, compliance policies, or regulation PDFs. Lawyers and operations teams query them conversationally instead of reading hundreds of pages manually.

↓ 70%
Review time
↓ 90%
Missed clauses
Developer Tools

Code Docs Search

Point FastRAG at your API reference, README files, and changelogs. Developers find what they need in seconds instead of searching GitHub or reading docs page by page.

↑ 45%
Dev velocity
↑ 89%
Docs helpfulness
E-commerce

Product Advisor Bot

Ingest your product catalog, specs, and customer reviews. Build a conversational shopping assistant that upsells intelligently and handles product questions at scale.

↑ 28%
Conversion rate
↑ 18%
AOV

Loved by builders

Real devs. Real ships.
Real results.

Don't take our word for it. Here's what the community is building with FastRAG right now.

MC
Marcus Chen
Indie Hacker · @marcusbuilds
Shipped my AI doc assistant in 2 days. The code quality is insane for the price — every file is clean, well-commented, and easy to extend. This isn't some hacky boilerplate.
Shipped in 2 days
PN
Priya Nambiar
CTO @ Veridian Tech · @priyanambiar
We evaluated five RAG frameworks before finding FastRAG. The namespace isolation alone saved us two weeks of architecture work. Had it in production before the weekend was over.
Production in 48hrs
DR
Diego Ramírez
Freelance Developer · @diegodev_
I charge clients £5K–£12K for AI chat implementations. FastRAG is my secret weapon. The web scraping engine handles SPA sites that every other solution completely chokes on.
£12K client projects
SO
Sarah O'Brien
Product Manager · @sarahobrienpm
Even though I'm not technical, I got the demo running using just the docs. The creator replied to my support email within an hour. Incredible support for a £19 product.
Non-technical user
JW
James Whitfield
AI Startup Founder · @jwhitfieldai
We raised a pre-seed round partly on the back of a prototype built with FastRAG. The streaming responses are buttery smooth — investors thought we had a full engineering team.
Pre-seed fundraise
LK
Lena Kovalska
Full-Stack Engineer · @lenakovalska
The LangChain integration is done properly — not the 'copy-paste from a tutorial' version. Smart chunking, configurable overlap, clean abstractions. This is how it should be done.
Expert developer
40+
Developers shipped
4.9/5
Average rating
48hrs
Avg. time to ship
$19
One-time price

Simple pricing

Pay once.
Own it forever.

No subscriptions. No seat fees. No surprise bills. You only pay for your own infrastructure (OpenAI, Pinecone, Vercel).

Starter

Everything you need to ship your first production RAG app this weekend.

$19$29

one-time · deploy to unlimited projects

Get Starter Kit
  • Full Next.js 16 source code (Pages Router)
  • Web scraping engine (Puppeteer + Browserless)
  • Multi-file PDF ingestion (up to 10 files)
  • OpenAI GPT-4o chat integration
  • Pinecone vector store with namespace isolation
  • Vercel AI SDK streaming responses
  • Mobile-responsive chat UI
  • Private GitHub repository access
  • Lifetime updates included
  • Commercial usage — MIT license

🔒 Secure checkout via Gumroad · Instant delivery

Most Popular
ProBest value

Advanced features for teams building multi-tenant SaaS products at scale.

$49$79

one-time · deploy to unlimited projects

Get Pro Kit
  • Everything in Starter, plus:
  • Multi-tenant dashboard (per-user analytics)
  • Admin panel with ingestion history
  • Rate limiting & token usage tracking
  • Slack / webhook notifications on ingest
  • Custom system prompt UI (per namespace)
  • Pinecone metadata filtering
  • Auth-ready (NextAuth.js integration)
  • Priority email support (24hr response)

🔒 Secure checkout via Gumroad · Instant delivery

Full comparison

FeatureStarterPro
Next.js 16 source code
PDF ingestion
Web scraping (Puppeteer)
GPT-4o chat streaming
Pinecone namespace isolation
MIT commercial license
Lifetime updates
Multi-tenant admin dashboard
Per-user analytics & usage
Auth integration (NextAuth.js)
Priority support

Questions about which plan is right for you? Talk to us →

Common questions

Everything you
need to know.

Still have a question after reading? Reach out — we reply within 24 hours.

You get an invite to a private GitHub repository (or a ZIP download) containing the complete Next.js source code — ingestion scripts, Pinecone configuration, chat API routes, the web scraping pipeline, and Vercel deployment templates. Everything is yours, forever.

Still have a question?

Contact support