92 Integrations Across 12 Categories

Every integration is pluggable via the registry pattern. Import only what you need.
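
The registry pattern mentioned above can be sketched roughly as follows. This is a minimal illustration, not the project's actual API: the `Provider` interface, the `Factory` type, the `Register`, `New`, and `List` functions, and the `echo` integration are all hypothetical names chosen for the example. The idea is that each integration registers a factory under a string key, usually from an `init` function in its own package, so that only the packages you import end up in the binary.

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

// Provider is a hypothetical interface that every integration would satisfy.
type Provider interface {
	Name() string
}

// Factory builds a Provider from a free-form configuration map.
type Factory func(cfg map[string]string) (Provider, error)

var (
	mu        sync.RWMutex
	factories = map[string]Factory{}
)

// Register makes a factory available under name. Integration packages would
// typically call it from init(), so a blank import is enough to enable them.
func Register(name string, f Factory) {
	mu.Lock()
	defer mu.Unlock()
	if _, dup := factories[name]; dup {
		panic("registry: duplicate provider " + name)
	}
	factories[name] = f
}

// New looks up a registered factory and uses it to build a Provider.
func New(name string, cfg map[string]string) (Provider, error) {
	mu.RLock()
	f, ok := factories[name]
	mu.RUnlock()
	if !ok {
		return nil, fmt.Errorf("registry: unknown provider %q (forgotten import?)", name)
	}
	return f(cfg)
}

// List returns the names of all registered providers in sorted order.
func List() []string {
	mu.RLock()
	defer mu.RUnlock()
	names := make([]string, 0, len(factories))
	for n := range factories {
		names = append(names, n)
	}
	sort.Strings(names)
	return names
}

// echoProvider is a stand-in integration used only for this example.
type echoProvider struct{ model string }

func (p echoProvider) Name() string { return "echo/" + p.model }

// In a real layout this init would live in the integration's own package.
func init() {
	Register("echo", func(cfg map[string]string) (Provider, error) {
		return echoProvider{model: cfg["model"]}, nil
	})
}

func main() {
	p, err := New("echo", map[string]string{"model": "tiny"})
	if err != nil {
		panic(err)
	}
	fmt.Println(p.Name(), List())

	// Asking for an integration that was never imported yields an error.
	if _, err := New("missing", nil); err != nil {
		fmt.Println("error:", err)
	}
}
```

With this shape, enabling an integration is a blank import of its package (the same approach `database/sql` uses for drivers), and anything you do not import is never registered or linked in.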

OpenAI

Core

GPT-4o, GPT-4, GPT-3.5 with streaming and function calling

LLM

Anthropic

Core

Claude 4, Claude 3.5 with prompt caching

LLM

Google Gemini

Core

Gemini 2.0 and 1.5 Pro/Flash with multimodal input

LLM

AWS Bedrock

Core

Multi-model access via AWS infrastructure

LLM

Ollama

Core

Local model inference for development and edge

LLM

Groq

Core

Ultra-fast inference with LPU hardware

LLM

Mistral

Extended

Mistral Large, Medium, Small models

LLM

DeepSeek

Extended

DeepSeek-V3 and reasoning models

LLM

xAI Grok

Extended

Grok models with real-time knowledge

LLM

Cohere

Extended

Command R+ and other RAG-optimized models

LLM

Together AI

Extended

Open-source model hosting and inference

LLM

Fireworks AI

Extended

Fast inference for open models

LLM

Azure OpenAI

Extended

OpenAI models via Azure with enterprise compliance

LLM

Perplexity

Extended

Search-augmented language models

LLM

SambaNova

Extended

Enterprise AI inference platform

LLM

Cerebras

Extended

Wafer-scale inference engine

LLM

OpenRouter

Extended

Multi-provider routing and fallback

LLM

Hugging Face

Extended

Inference API for open models

LLM

Vertex AI

Extended

Google Cloud AI platform

LLM

AI21

Community

Jamba models for enterprise

LLM

OpenAI Embeddings

Core

text-embedding-3-small/large

Embeddings

Google Embeddings

Core

Gecko and text-embedding models

Embeddings

Ollama Embeddings

Core

Local embedding with any GGUF model

Embeddings

Cohere Embed

Extended

Embed v3 with compression

Embeddings

Voyage AI

Extended

Domain-specific embeddings

Embeddings

Jina Embeddings

Extended

Multilingual embeddings

Embeddings

Mistral Embed

Extended

Mistral embedding model

Embeddings

Sentence Transformers

Community

Hugging Face Sentence Transformers models

Embeddings

pgvector

Core

PostgreSQL vector extension

Vector Stores

Qdrant

Core

High-performance vector database

Vector Stores

Pinecone

Core

Managed vector database

Vector Stores

ChromaDB

Extended

Open-source embedding database

Vector Stores

Weaviate

Extended

Vector search with GraphQL

Vector Stores

Milvus

Extended

Scalable vector database

Vector Stores

Turbopuffer

Extended

Serverless vector database

Vector Stores

Redis Vector

Extended

Redis with vector search

Vector Stores

Elasticsearch

Extended

Vector search in Elasticsearch

Vector Stores

MongoDB Atlas

Extended

MongoDB with vector search

Vector Stores

SQLite-vec

Community

SQLite vector extension

Vector Stores

Vespa

Community

Hybrid search engine

Vector Stores

Deepgram

Core

Nova-3 real-time STT

Voice STT

ElevenLabs Scribe

Core

High-accuracy transcription

Voice STT

OpenAI Whisper

Core

Whisper and Transcribe API

Voice STT

AssemblyAI

Extended

Slam-1 universal STT

Voice STT

Groq STT

Extended

Fast Whisper inference

Voice STT

Gladia

Community

Real-time transcription

Voice STT

ElevenLabs TTS

Core

High-quality voice synthesis

Voice TTS

Cartesia Sonic

Core

Low-latency TTS

Voice TTS

PlayHT

Extended

AI voice generation

Voice TTS

Groq TTS

Extended

Fast text-to-speech

Voice TTS

Fish Audio

Extended

Open-source TTS

Voice TTS

LMNT

Extended

Ultra-fast voice synthesis

Voice TTS

Smallest.ai

Community

Efficient TTS models

Voice TTS

OpenAI Realtime

Core

Direct speech-to-speech

Voice S2S

Gemini Live

Core

Google multimodal live

Voice S2S

Ultravox

Extended

Open speech-language model

Voice S2S

In-Memory

Core

Fast in-process memory store

Memory

Redis Memory

Core

Distributed memory with persistence

Memory

PostgreSQL Memory

Core

Relational memory store

Memory

SQLite Memory

Extended

Embedded memory store

Memory

Neo4j

Extended

Graph-based memory

Memory

DragonflyDB

Community

Redis-compatible memory

Memory

Memgraph

Community

Graph memory store

Memory

Firecrawl

Core

Web scraping and crawling

Document Loaders

Unstructured.io

Core

Universal document parsing

Document Loaders

Docling

Extended

Document understanding

Document Loaders

Confluence

Extended

Atlassian wiki loader

Document Loaders

Notion Loader

Extended

Notion workspace loader

Document Loaders

GitHub Loader

Extended

Repository content loader

Document Loaders

Google Drive

Extended

Google Drive document loader

Document Loaders

S3/GCS

Extended

Cloud storage loader

Document Loaders

NeMo Guardrails

Core

NVIDIA safety rails

Guardrails

Guardrails AI

Extended

Validation framework

Guardrails

LLM Guard

Extended

Input/output scanning

Guardrails

Lakera

Extended

Prompt injection detection

Guardrails

Azure AI Safety

Extended

Content safety service

Guardrails

Langfuse

Core

LLM observability platform

Eval & Observability

Arize Phoenix

Core

ML observability

Eval & Observability

RAGAS

Extended

RAG evaluation framework

Eval & Observability

LangSmith

Extended

LangChain observability

Eval & Observability

Jaeger

Extended

Distributed tracing

Eval & Observability

Grafana

Extended

Metrics visualization

Eval & Observability

Datadog

Extended

Cloud monitoring

Eval & Observability

Built-in Engine

Core

Native durable execution

Workflows

Temporal

Extended

Workflow orchestration

Workflows

NATS

Extended

Message-based workflows

Workflows

Redis Streams

Community

Stream-based workflows

Workflows

Gin

Core

High-performance HTTP framework

HTTP / API

Fiber

Extended

Express-inspired Go framework

HTTP / API

Echo

Extended

Minimalist HTTP framework

HTTP / API

Chi

Extended

Lightweight composable router

HTTP / API

Connect-Go

Extended

gRPC-compatible HTTP API

HTTP / API