Preparing your content
A search runtime for voice agents, copilots, and multimodal apps. Sub-10ms lookups, zero infrastructure. Built in Rust and WebAssembly.
Get real-time retrieval inside apps, browsers, and enterprise agents — with centralized management, analytics, and scale built in.
$Trusted by 500+ teams at
Three steps from your data to sub-10ms retrieval. No infrastructure to manage.
Push docs, knowledge bases, or chat history to Moss via our SDK or portal.
We build a compact index and deploy it wherever your agent runs — browser, edge, device, or cloud.
Your agent retrieves context locally. Zero network hops. Zero infrastructure to manage.
Get started with Moss in seconds
import { MossClient, DocumentInfo } from '@inferedge/moss'
const client = new MossClient(process.env.MOSS_PROJECT_ID!,
process.env.MOSS_PROJECT_KEY!)
const documents: DocumentInfo[] = [
{ id: 'doc1', text: 'How do I track my order? Log into your account.',
metadata: { category: 'shipping' } }
]Moss is for developers building conversational, voice, and multimodal AI - experiences where every millisecond shapes how human the interaction feels.
For real-time, offline-capable assistance.
Superfast search without sending data to others.
Tiny engine (<20kB) that fits anywhere.
Keeps code local, great for security audits.
Combine speed with optional analytics and rollouts.
Where teams are putting Moss to work today...
Recall user context instantly, even offline.
Fast, private search inside help centers.
Smart search in note apps or IDEs without sending data online.
Sub-10ms search on phones and AI-PCs — no lag even with bad network.
You bring your data. Moss powers the retrieval layer - indexing, packaging, and distributing it automatically, so semantic search runs close to where intelligence happens. It enables your application to:
Moss brings real-time semantic search to any environment - inside browsers, apps, or your own infrastructure - with sub-10 ms retrieval and no setup overhead.
Each user’s data can be embedded, searched, and updated locally, so experiences feel faster and more personal without sending data to the cloud.
A simple cloud dashboard manages analytics, policies, and updates - giving teams visibility and control without maintaining infrastructure.
Moss continuously improves search quality and syncs enhancements automatically. Built-in A/B testing for embeddings makes it easy to compare and tune indexes for the best retrieval results.